Publications [Google Scholar]

Gorilla: Large Language Model Connected with Massive APIs
Shishir G. Patil, Tianjun Zhang, Xin Wang, Joseph E. Gonzalez
Neural Information Processing Systems (NeurIPS) 2024
[Paper] [Code] [Slides] [Short Video] [Long Video] [bibtex]

The Berkeley Function Calling Leaderboard (BFCL): From Tool Use to Agentic Evaluation of Large Language Models
Shishir G. Patil, Huanzhi Mao, Fanjia Yan, Charlie Ji, Vishnu Suresh, Ion Stoica, Joseph E Gonzalez
International Conference of Machine Learning (ICML) 2025
[Paper] [Leaderboard] [Code] [bibtex]

GoEX: Perspectives and Designs Towards a Runtime for Autonomous LLM Applications
Shishir G. Patil, Tianjun Zhang, Vivian Fang, Noppapon C., Roy Huang, Aaron Hao, Martin Casado, Joseph E. Gonzalez, Raluca Ada Popa, Ion Stoica
arXiv preprint arXiv:2404.06921
[Paper] [Code] [bibtex]

The Llama 3 Herd of Models
Llama Team
[Paper] [Model Weights] [bibtex]

Efficient ML Model Updates for Deeply Embedded Microcontrollers
Shishir G. Patil, Sam Kumar, Prabal Dutta, Joseph E. Gonzalez
EuroSys 2026
[Paper] [Code] [bibtex]

LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters!
Dacheng Li, Shiyi Cao, Tyler Griggs, Shu Liu, Xiangxi Mo, Eric Tang, Sumanth Hegde, Kourosh Hakhamaneshi, Shishir G. Patil, Matei Zaharia, Joseph E. Gonzalez, Ion Stoica
Empirical Methods in Natural Language Processing (EMNLP) 2025
[Paper] [Code] [bibtex]

AdvancedIF: Rubric-Based Benchmarking and Reinforcement Learning for Advancing LLM Instruction Following
Llama Team
Association for Computational Linguistics (ACL 2026)
[Paper] [Model Weights] [bibtex]

LLoCO: Learning Long Contexts Offline
Sijun Tan, Xiuyu Li, Shishir G. Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph E. Gonzalez, Raluca Ada Popa
Empirical Methods in Natural Language Processing (EMNLP) 2024
[Paper] [Code] [bibtex]

RAFT: Adapting Language Model to Domain Specific RAG
Tianjun Zhang, Shishir G. Patil, Naman Jain, Sheng Shen, Matei Zaharia, Ion Stoica, Joseph E. Gonzalez
Conference on Language Modeling (COLM) 2024
[Paper] [Code] [bibtex]

Nebula: A Privacy-First Platform for Data Backhaul
Jean-Luc Watson, Tess Despres, Alvin Tan, Shishir G. Patil, Prabal Dutta, Raluca Ada Popa
IEEE Symposium of Security and Privacy - IEEE S&P (Oakland) 2024.
[Paper] [bibtex]

Revisiting Edge AI: Opportunities and Challenges
Tobias Meuser, Lauri Lovén, Monowar Bhuyan, Shishir G. Patil, Schahram Dustdar, Atakan Aral, Suzan Bayhan, Christian Becker, Eyal de Lara, Aaron Yi Ding, Janick Edinger, James Gross, Nitinder Mohan, Andy D. Pimentel, Etienne Rivière, Henning Schulzrinne, Pieter Simoens, Gürkan Solmaz
IEEE Internet Computing 2024
[Paper] [bibtex]

MemGPT: Towards LLMs as operating systems
Charles Packer, Vivian Fang, Shishir G. Patil, Kevin Lin, Sarah Wooders, Joseph E. Gonzalez
arXiv preprint arXiv:2310.08560
[Paper] [Code] [bibtex]

Skyplane: Optimizing Transfer Cost and Throughput Using Cloud-Aware Overlays
Paras Jain, Sam Kumar, Sarah Wooders, Shishir G. Patil, Joseph E. Gonzalez, Ion Stoica
USENIX Symposium on Networked Systems Design and Implementation (NSDI) 2023
[Paper] [Code] [bibtex]

POET: Training Neural Networks on Tiny Devices with Integrated Rematerialization and Paging
Shishir G. Patil, Paras Jain, Prabal Dutta, Ion Stoica, Joseph E. Gonzalez
[Spotlight] International Conference of Machine Learning (ICML) 2022
[Paper] [Code] [Poster] [Slides] [Video] [bibtex]

Where the Sidewalk Ends: Privacy of Opportunistic Backhaul
Tess Despres, Shishir Patil, Alvin Tan, Jean-Luc Watson, Prabal Dutta
Proceedings of the 15th European Workshop on Systems Security (EuroSec) 2022.
[Paper] [bibtex]

Learning Embeddings that Capture Spatial Semantics for Indoor Navigation
Vidhi Jain, Shishir G. Patil, Prakhar Agarwal, Katia Sycara
Neural Information Processing Systems (NeurIPS) 2020 (ORLR Workshop).
[Paper] [Code] [Poster] [bibtex]

GesturePod: Enabling On-device Gesture-based Interaction for White Cane Users
Shishir G. Patil, Don Dennis, Chirag Pabbaraju, Nadeem Shaheer, Harsha Vardhan Simhadri, Vivek Seshadri, Manik Varma, Prateek Jain
ACM User Interface Software and Technology Symposium (UIST) 2019.
Also available as Microsoft Research Technical Report, MSR-TR-2018-14, May 2018
[Video Preview] [Poster] [Paper] [Gesture Recognition Data set] [Simulation and Code] [bibtex]

Real-world Demonstration of ML-based Gesture Recognition
Shishir G. Patil, Don Dennis, Harsha Vardhan Simhadri, Prateek Jain
2nd Workshop on Machine Learning on the Phone and other Consumer Devices (MLPCD), NeurIPS 2018
[Poster] [bibtex]

Characterization and analysis of Transistor Outline TO-254 package for power device applications
Shishir G. Patil, B. Pavithra and M. M. Nayak
IEEE Electron Devices Society, 3rd International Conference on Emerging Electronics (ICEE) 2017
[Paper] [bibtex]