Shishir's photo

I am a Ph.D. student in Computer Science at UC Berkeley where I am advised by Joseph Gonzalez, Prabal Dutta, and Ion Stoica. I am broadly interested in ML-Systems, especially around LLM post-training, and agentic llm-systems. I am affiliated with the Sky Computing Lab (previously RISE), Lab11, and Berkeley AI Research (BAIR).

I created and lead the Gorilla, GoEx, RAFT, OpenFunctions, and Berkeley Function Calling Leaderboard (BFCL) projects. The Gorilla and BFCL projects kick-started and helped catalyze tool use in LLMs, and with millions of user requests, widespread enterprise adoption - including ALL leading LLM labs, and a thriving open-source community. The Gorilla project continues to shape the evolving field of tool-calling for agentic LLMs.

My dissertation [talk] on the evolution of Agentic LLMs from function calling to truly autonomous agents.


News:

    logo [Mar 2018] Presented our work to Bill Gates!
    [Jul 2025] Berkeley Function Calling Leaderboard (BFCL) published at ICML 2025
    [Jan 2025] Released Sky-T1 Reasoning Model
    [Dec 2024] Released Specifications: The missing link for LLM systems paper
    [Dec 2024] Gorilla published at NeurIPS 2024
    [Nov 2024] LLoCO published at EMNLP 2024
    [Oct 2024] Launched Agent Arena by 🦍 Gorilla X LMSYS Chatbot Arena. Check it out
    [Sep 2024] Released BFCL v3 with multi-turn. Release blog here
    [Aug 2024] Released BFCL v2 with community contributed APIs. Release blog
    [Apr 2024] Microsoft covers GoEx in their AI platform blog!
    [Apr 2024] Release GoEx: A runtime for Agentic LLMs. Read more in our release blog
    [Mar 2024] Meta and Microsoft cover our RAFT paper!
    [Mar 2024] Released RAFT: Retrieval-Augmented Fine-Tuning for LLMs.
    [Feb 2024] Released Berkeley Function Calling Leaderboard (BFCL). Read more in our release blog
    [Feb 2024] Released Gorilla OpenFunctions-v2. New SOTA in function calling.
    [Dec 2023] Released Gorilla OpenFunctions. Check out our release blog
    [Nov 2023] Presenting Gorilla at at UCL NLP, Meta, and DeepMind London!
    [Oct 2023] Attending the Dagstuhl seminar on EdgeAI!
    [May 2023] Released Gorilla LLM!
    [Apr 2023] Talk on POET at Harvard Systems + Theory group
    [Apr 2023] Skyplane published at NSDI 2023!
    [Dec 2022] Talk at Microsoft Research India, and Google Research, Bangalore
    [Sep 2022] IEEE Spectrum article on POET!
    [Sep 2022] Led the Skyplane tutorial for Skycamp 2022
    [Sep 2022] Skyplane accepted to NSDI 2023
    [Sep 2022] I will be talking about ML on Edge at Princeton and CMU
    [Aug 2022] Presented POET at Google Federated Learning Talks, and Google Language Seminar
    [Aug 2022] Presented Galaxy and On-device ML at the Conix workshop at UW, Seattle
    [Jul 2022] Presented POET as spotlight at ICML 2022! Camera ready on arXiv
    [May 2022] Presented POET and Skyplane posters at RISE Retreat [Tahoe, CA]
    [May 2022] POET accepted to ICML '22! Camera ready coming soon..
    [May 2022] I will be interning with the Brain and Cloud teams at Google this Summer
    [Apr 2022] Where the Sidewalk Ends: Privacy of Opportunistic Backhaul presented at EuroSec'22
    [May 2021] I will be interning with the Core-ML team at Amazon Science this Summer
    [Dec 2020] Embeddings for Indoor Navigation presented at NeurIPS'20 ORLR Workshop
    [Jan 2020] Attending The Quantum Wave in Computing Boot Camp at Simons Institute
    [Jan 2020] Presented poster at RISE Retreat [Monterey, CA]
    [Jan 2020] Gave a talk at VMare Retreat [Palo Alto, CA]
    [Jul 2019] GesturePod accepted to UIST 2019!
    [Dec 2018] ZD Net covers our work at (NeurIPS) NIPS 2018
    [Nov 2018] We will be presenting our work at (NeurIPS) NIPS 2018!
    [Nov 2018] Demonstrated programmable gesture recognition on Xbox controllers with EdgeML
    [Oct 2018] GesturePod implementation and simulation OSS
    [Dec 2017] Our work covered by Financial Express and Microsoft AI blog.

Here are some key projects I have co-created:
Star Gorilla : Large Language Model Connected with Massive APIs
Star   POET : Neural Network training on edge devices
Star EdgeML : Neural Network inference on edge devices
Star Gorilla-CLI : LLMs for your Command Line
Star Letta AI (aka MemGPT) : Memory for LLMs
Star   SkyRL (advisor) : A Modular Full-stack RL Library for LLMs


Talks

Teaching LLMs to use Tools at Scale
Princeton University Colloquium, Princeton — Apr 2025
AICamp Silicon Valley, Palo Alto — Aug 2024
Arize Observe Conference, San Francisco — Jul 2024
Stanford MLSys Seminar — May 2024
ASPLOS EMC2 Workshop, San Diego — Apr 2024
ML Agents and Agents Evaluation
University of Washington, Systems for ML Course — Nov 2024
Stanford University, Trustworthy Machine Learning — Oct 2024
Practicalities of Fine-Tuning Llama 2 with AI Studio
Microsoft Build Conference, Seattle — May 2024
The Age of Industrial AI: Agents Opportunities and Challenges
IEEE MIPR, San Jose — Aug 2024
RAG & Fine-Tuning for Domain Specific LLMs
GenAI Summit, San Francisco — May 2024
Open vs Closed — The Dichotomy of Open-Source Models in AI
GenAI Summit, Santa Clara — Sep 2023
Gorilla: LLMs for APIs
Kong API Summit, San Francisco — Nov 2023
Gorilla and Connecting Large Language Models to the Outside World
NeurIPS Conference, Vancouver — Dec 2024
Google Research, Bengaluru — Dec 2023
Microsoft Research India, Bengaluru — Dec 2023
DeepMind, London — Nov 2023
UCL NLP Seminar Series, London — Nov 2023
Meta, London — Nov 2023
Sky Camp, Berkeley — Oct 2023
Ray Summit, San Francisco — Sep 2023
Kong API Summit, San Francisco — Sep 2023
Simons Institute for Theory of Computing, Berkeley — Aug 2023
LlamaIndex Webinar — Aug 2023
Mosaic ML, San Francisco — Aug 2023
Intel, Santa Clara — Aug 2023
Microsoft Research, Redmond — Jun 2023
Berkeley LLM Hackathon — Jun 2023
Apple, Cupertino — May 2023
Sky Summer Retreat, Tahoe — May 2023
POET: Training Neural Networks for the Bleeding Edge!
Dagstuhl Seminar, Wadern, Germany — Oct 2023
Harvard Systems + Theory Group, Boston — Apr 2023
Sky Winter Retreat, Monterey — Jan 2023
Google Research, Bengaluru — Dec 2022
Microsoft Research, Bengaluru — Dec 2022
Carnegie Mellon University, Pittsburgh — Sep 2022
Princeton University — Sep 2022
Google Federated Learning Talks, Mountain View — Aug 2022
University of Washington, Seattle — Jul 2022
GesturePod and On-Device ML
UIST 2019, New Orleans — Oct 2019
Microsoft Research Techfest, Redmond — Mar 2019
VMware Research, Monterey — Jan 2019
RISE Retreat, Monterey — Jan 2019
Microsoft Research, Bengaluru — Jan 2019

Miscellaneous: I race go-karts all year-round, ski in winters, and sail in summers. If we are going sailing, please read this.