All Publications

All Agent-ready Systems Human-agent Collaboration Agent Intelligence Automation White & Position Papers
  1. Please Don't Kill My Vibe: Empowering Agents with Data Flow Control
    Charlie Summers, Haneen Mohammed, Eugene Wu
    CIDR 2026
  2. Set It and Forget It: Zero-Mod ML Magic for Linux Tuning
    Georgios Liargkovas, Prabhpreet Singh Sodhi, Kostis Kaffes
    PACMI Workshop at SOSP 2025
  3. Cortex: Workflow-Aware Resource Pooling and Scheduling for Agentic Serving
    Nikos Pagonas, Yeounoh Chung, Kostis Kaffes, Arvind Krishnamurthy
    SAA Workshop at SOSP 2025
  4. Toward Systems Foundations for Agentic Exploration
    Jiakai Xu, Tianle Zhou, Eugene Wu, Kostis Kaffes
    SAA Workshop at SOSP 2025
  5. Suna: Scalable Causal Confounder Discovery over Relational Data
    Jiaxiang Liu, Siyuan Xia, Daniel Alabi, Eugene Wu
    VLDB 2025
  6. Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice.
    Akshit Kumar, Tianyi Peng, Yuhang Wu, Assaf Zeevi
    Winter Simulation Conference 2025
  7. Prompt Editor: A Taxonomy-driven System for Guided LLM Prompt Development in Enterprise Settings
    Jeffery Cao, Lampros Flokas, Yujian Xu, Eugene Wu, Xu Chu, Cong Yu
    SIGMOD Demo 2025
  8. Towards a Framework for Optimizing Hierarchical Text Segmentation using LLMs
    Lampros Flokas, Jeffrey Cao, Yujian Xu, Eugene Wu, Xu Chu, Cong Yu
    DEEM Workshop at SIGMOD 2025
  9. Position Paper: A System-Centric Approach is Necessary for AI Agents
    Nikos Pagonas, Haonan Wang, Jiaxiang Liu, Tianle Zhou, Deepak Dastrala, Raman Jatkar, Anirudh Sivaraman, Zhou Yu, Kostis Kaffes, Eugene Wu
    ArXiv 2025
  10. Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions
    Olivier Toubia, George Z. Gui, Tianyi Peng, Daniel J. Merlau, Ang Li, and Haozhe Chen
    Marketing Science
  11. CrashFixer: A Crash Resolution Agent for the Linux Kernel
    Alex Mathai, Chenxi Huang, Suwei Ma, Jihwan Kim, Hailie Mitchell, Aleksandr Nogikh, Petros Maniatis, Franjo Ivančić, Junfeng Yang, Baishakhi Ray
    Arxiv 2025
  12. FeedQUAC: Quick Unobtrusive Agent-Generated Commentary
    Tao Long, Kendra Wannamaker, Jo Vermeulen, George Fitzmaurice, Justin Matejka
    arXiv 2025
  13. Steering Semantic Data Processing With DocWrangler
    Shreya Shankar, Bhavya Chopra, Mawil Hasan, Stephen Lee, Bjoern Hartmann, Joseph Hellerstein, Aditya Parameswaran, Eugene Wu
    UIST 2025
  14. Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents
    Yueying Li, Jim Dai, Tianyi Peng
    Arxiv 2025
  15. AgentDynEx: Nudging the Mechanics and Dynamics of Multi-Agent Simulations
    Jenny Ma, Riya Sahni, Karthik Sreedhar, Lydia B. Chilton
    Under Submission
  16. DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
    Shreya Shankar, Tristan Chambers, Tarak Shah, Aditya G. Parameswaran, Eugene Wu
    VLDB 2025
  17. LLM Generated Persona is a Promise with a Catch
    Ang Li, Haozhe Chen, Hongseok Namkoong, Tianyi Peng
    Arxiv 2025
  18. Program Synthesis Dialog Agents for Interactive Decision-Making
    Matthew Toles, Nikhil Balwani, Rattandeep Singh, Valentina Giulia Sartori Rodriguez, Zhou Yu
    ArXiv 2025
  19. How Well do LLMs Compress their Own Chain-of-Thought? A Token Complexity Approach
    Ayeong Lee, Ethan Che, Tianyi Peng
    ICML, Efficient Systems for Foundation Models Workshop 2025
  20. ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning
    Xiao Yu, Baolin Peng, Vineeth Vajipey, Hao Cheng, Michel Galley, Jianfeng Gao, Zhou Yu
    ICLR 2025
  21. AnimationAgents: A Multi-Modal Team of Agents for Generating, Debugging, and Human Editing of Animation Code
    Vivian Liu, Rubaiat Habib Kazi, Li-Yi Wei, Matthew Fisher, Timothy Langlois, Seth Walker, Lydia B. Chilton
    CHI 2025
  22. ACE: A LLM Agent-based Negotiation Coaching System
    Ryan Shea, Aymen Kallala, Xin Lucy Liu, Michael W. Morris, Zhou Yu
    EMNLP 2024
  23. Fast Userspace Networking for the Rest of Us
    Alireza Sanaee, Vahab Jabrayilov, Ilias Marinos, Anuj Kalia, Divyanshu Saxena, Prateesh Goyal, Kostis Kaffes, Gianni Antichi
    ArXiv 2025
  24. DynEx: Agentic Assistance to Bridge Design and Code
    Jenny Ma, Karthik Sreedhar, Vivian Liu, Pedro Alejandro Perez, Sitong Wang, Riya Sahni, Lydia B. Chilton
    CHI 2025
  25. Data Cleaning Using Large Language Models
    Shuo Zhang, Zezhou Huang, Eugene Wu
    DAIS Workshop at ICDE 2025
  26. Alexpaca: Learning Factual Clarification Question Generation Without Examples
    Matthew Toles, Yukun Huang, Zhou Yu, Luis Gravano
    GEM^2 Workshop at ACL 2025
  27. KGym: A Platform and Dataset to Benchmark Large Language Models on Linux Kernel Crash Resolution
    Alex Mathai, Chenxi Huang, Petros Maniatis, Aleksandr Nogikh, Franjo Ivančić, Junfeng Yang, Baishakhi Ray
    NeurIPS 2024
  28. Simulating Cooperative Prosocial Behavior with Multi-Agent LLMs
    Karthik Sreedhar, Alice Cai, Jenny Ma, Jeffrey V. Nickerson, Lydia B. Chilton
    IUI 2025