-
Panprediction: optimal predictions for any downstream task and loss
Sivaraman Balakrishnan, Nika Haghtalab, Daniel Hsu, Brian Lee, Eric Zhao
AISTATS 2026
-
Prior makes it possible: from sublinear graph algorithms to LLM test-time methods
Avrim Blum, Daniel Hsu, Cyrus Rashtchian, Donya Saless
AISTATS 2026
-
MIND: Empowering Mental Health Clinicians with Multimodal Data Insights through a Narrative Dashboard
Ruishi Zou, Shiyu Xu, Margaret E Morris, Jihan Ryu, Timothy D. Becker, Nicholas Allen, Anne Marie Albano, Randy Auerbach, Dan Adler, Varun Mishra, Lace Padilla, Dakuo Wang, Ryan Sultan, Xuhai "Orson" Xu
CHI 2026
-
More than Decision Support: Exploring Patients' Longitudinal Usage of Large Language Models in Real-World Healthcare-Seeking Journeys
Yancheng Cao, Yishu Ji, Chris Yue Fu, Sahiti Dharmavaram, Meghan Turchioe, Natalie C Benda, Lena Mamykina, Yuling Sun, Xuhai "Orson" Xu
CHI 2026
-
Agentic Data Environments
Elaine Ang, Chenxi Huang, Georgios Liargkovas, Jerry Liu, Jinhui Liu, Nikos Pagonas, Charlie Summers, Haonan Wang, Jiakai Xu, Tianle Zhou, Yusen Zhang, Zhou Yu, Zhuo Zhang, Tianyi Peng, Kostis Kaffes, Eugene Wu
IEEE Data Bulletin 2026
-
Human-Data Interaction, Exploration, and Visualization in the AI Era: Challenges and Opportunities
Jean-Daniel Fekete, Yifan Hu, Dominik Moritz, Arnab Nandi, Senjuti Basu Roy, Eugene Wu, Nikos Bikakis, George Papastefanatos, Panos K. Chrysanthis, Guoliang Li, Lingyun Yu
SIGMOD Record 2026
-
Group-realizable multi-group learning by minimizing empirical risk
Navid Ardeshir, Samuel Deng, Daniel Hsu, Jingwen Liu
ALT 2026
-
Please Don't Kill My Vibe: Empowering Agents with Data Flow Control
Charlie Summers, Haneen Mohammed, Eugene Wu
CIDR 2026
Slides
-
LLM Generated Persona is a Promise with a Catch
Ang Li, Haozhe Chen, Hongseok Namkoong, Tianyi Peng
NeurIPS 2025 Position Paper
-
Agents for Web Testing: A Case Study in the Wild
Naimeng Ye, Xiao Yu, Ruize Xu, Tianyi Peng, Zhou Yu
LAw Workshop at NeurIPS 2025
-
Data Mixture Optimization: A Multi-Fidelity Multi-Scale Bayesian Framework
Tzu-Ching Yen, Andrew Wei Tung Siah, Haozhe Chen, C. Daniel Guetta, Tianyi Peng, Hongseok Namkoong
NeurIPS 2025
-
Tail-Optimized Caching for LLM Inference
Wenxin Zhang, Yueying Li, Ciamac C. Moallemi, Tianyi Peng
NeurIPS 2025
-
Multi-Agent Markov Entanglement
Shuze Chen, Tianyi Peng
NeurIPS 2025 (Spotlight)
-
Touch in the Wild: Learning Fine-Grained Manipulation with a Portable Visuo-Tactile Gripper
Xinyue Zhu*, Binghao Huang*, Yunzhu Li
NeurIPS 2025
-
Q-learning with Posterior Sampling
Priyank Agrawal, Shipra Agrawal, Azmat Azati
NeurIPS 2025
-
RAISE: Reliable Agent Improvement via Simulated Experience
Sahar Omidi Shayegan, Joshua Meyer, Victor Shih, Sebastian Sosa, Tianyi Peng, Kostis Kaffes, Eugene Wu, Andi Partovi, Mehdi Jamei
NeurIPS 2025 (SEA Workshop)
-
LLM Agents for Always-On Operating System Tuning
Georgios Liargkovas, Vahab Jabrayilov, Hubertus Franke, Kostis Kaffes
NeurIPS 2025
-
Fast attention mechanisms: a tale of parallelism
Jingwen Liu, Hantao Yu, Clayton Sanford, Alexandr Andoni, Daniel Hsu
NeurIPS 2025
-
A Decade of Systems for Human Data Interaction
Eugene Wu, Yiru Chen, Haneen Mohammed, Zezhou Huang
ArXiV 2025
-
SAGE: A Top-Down Bottom-Up Knowledge-Grounded User Simulator for Multi-turn AGent Evaluation
Ryan Shea, Yunan Lu, Liang Qiu, Zhou Yu
EACL 2026
-
Set It and Forget It: Zero-Mod ML Magic for Linux Tuning
Georgios Liargkovas, Prabhpreet Singh Sodhi, Kostis Kaffes
PACMI Workshop at SOSP 2025
-
Cortex: Workflow-Aware Resource Pooling and Scheduling for Agentic Serving
Nikos Pagonas, Yeounoh Chung, Kostis Kaffes, Arvind Krishnamurthy
SAA Workshop at SOSP 2025
-
Toward Systems Foundations for Agentic Exploration
Jiakai Xu, Tianle Zhou, Eugene Wu, Kostis Kaffes
SAA Workshop at SOSP 2025
-
The Anatomy of a Personal Health Agent
A. Ali Heydari, Ken Gu, Vidya Srinivas, Hong Yu, Zhihan Zhang, Yuwei Zhang, Akshay Paruchuri, Qian He, Hamid Palangi, Nova Hammerquist, Ahmed A. Metwally, Brent Winslow, Yubin Kim, Kumar Ayush, Yuzhe Yang, Girish Narayanswamy, Maxwell A. Xu, Jake Garrison, Amy Armento Lee, Jenny Vafeiadou, Ben Graef, Isaac R. Galatzer-Levy, Erik Schenck, Andrew Barakat, Javier Perez, Jacqueline Shreibati, John Hernandez, Anthony Z. Faranesh, Javier L. Prieto, Connor Heneghan, Yun Liu, Jiening Zhan, Mark Malhotra, Shwetak Patel, Tim Althoff, Xin Liu, Daniel McDuff, Xuhai "Orson" Xu
ArXiv
-
Suna: Scalable Causal Confounder Discovery over Relational Data
Jiaxiang Liu, Siyuan Xia, Daniel Alabi, Eugene Wu
VLDB 2025
-
Performance of LLMs on Stochastic Modeling Operations Research Problems: From Theory to Practice.
Akshit Kumar, Tianyi Peng, Yuhang Wu, Assaf Zeevi
Winter Simulation Conference 2025
-
Prompt Editor: A Taxonomy-driven System for Guided LLM Prompt Development in Enterprise Settings
Jeffery Cao, Lampros Flokas, Yujian Xu, Eugene Wu, Xu Chu, Cong Yu
SIGMOD Demo 2025
-
Towards a Framework for Optimizing Hierarchical Text Segmentation using LLMs
Lampros Flokas, Jeffrey Cao, Yujian Xu, Eugene Wu, Xu Chu, Cong Yu
DEEM Workshop at SIGMOD 2025
-
Position Paper: A System-Centric Approach is Necessary for AI Agents
Nikos Pagonas, Haonan Wang, Jiaxiang Liu, Tianle Zhou, Deepak Dastrala, Raman Jatkar, Anirudh Sivaraman, Zhou Yu, Kostis Kaffes, Eugene Wu
ArXiv 2025
-
Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions
Olivier Toubia, George Z. Gui, Tianyi Peng, Daniel J. Merlau, Ang Li, and Haozhe Chen
Marketing Science
-
Diversity Helps Jailbreak Large Language Models
Weiliang Zhao, Daneil Ben-Levi, Wei Hao, Junfeng Yang, Chengzhi Mao
NAACL 2025
-
CrashFixer: A Crash Resolution Agent for the Linux Kernel
Alex Mathai, Chenxi Huang, Suwei Ma, Jihwan Kim, Hailie Mitchell, Aleksandr Nogikh, Petros Maniatis, Franjo Ivančić, Junfeng Yang, Baishakhi Ray
Arxiv 2025
-
FeedQUAC: Quick Unobtrusive Agent-Generated Commentary
Tao Long, Kendra Wannamaker, Jo Vermeulen, George Fitzmaurice, Justin Matejka
arXiv 2025
-
Steering Semantic Data Processing With DocWrangler
Shreya Shankar, Bhavya Chopra, Mawil Hasan, Stephen Lee, Bjoern Hartmann, Joseph Hellerstein, Aditya Parameswaran, Eugene Wu
UIST 2025
-
Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents
Yueying Li, Jim Dai, Tianyi Peng
Arxiv 2025
-
AgentDynEx: Nudging the Mechanics and Dynamics of Multi-Agent Simulations
Jenny Ma, Riya Sahni, Karthik Sreedhar, Lydia B. Chilton
Under Submission
-
DocETL: Agentic Query Rewriting and Evaluation for Complex Document Processing
Shreya Shankar, Tristan Chambers, Tarak Shah, Aditya G. Parameswaran, Eugene Wu
VLDB 2025
-
Program Synthesis Dialog Agents for Interactive Decision-Making
Matthew Toles, Nikhil Balwani, Rattandeep Singh, Valentina Giulia Sartori Rodriguez, Zhou Yu
ArXiv 2025
-
How Well do LLMs Compress their Own Chain-of-Thought? A Token Complexity Approach
Ayeong Lee, Ethan Che, Tianyi Peng
ICML, Efficient Systems for Foundation Models Workshop 2025
-
ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning
Xiao Yu, Baolin Peng, Vineeth Vajipey, Hao Cheng, Michel Galley, Jianfeng Gao, Zhou Yu
ICLR 2025
-
AnimationAgents: A Multi-Modal Team of Agents for Generating, Debugging, and Human Editing of Animation Code
Vivian Liu, Rubaiat Habib Kazi, Li-Yi Wei, Matthew Fisher, Timothy Langlois, Seth Walker, Lydia B. Chilton
CHI 2025
-
ACE: A LLM Agent-based Negotiation Coaching System
Ryan Shea, Aymen Kallala, Xin Lucy Liu, Michael W. Morris, Zhou Yu
EMNLP 2024
-
Fast Userspace Networking for the Rest of Us
Alireza Sanaee, Vahab Jabrayilov, Ilias Marinos, Anuj Kalia, Divyanshu Saxena, Prateesh Goyal, Kostis Kaffes, Gianni Antichi
ArXiv 2025
-
DynEx: Agentic Assistance to Bridge Design and Code
Jenny Ma, Karthik Sreedhar, Vivian Liu, Pedro Alejandro Perez, Sitong Wang, Riya Sahni, Lydia B. Chilton
CHI 2025
-
DietGlance: dietary monitoring and personalized analysis at a glance with knowledge-empowered AI assistant
Zhihan Jiang, Running Zhao, Lin Lin, Yue Yu, Handi Chen, Xinchen Zhang, Xuhai "Orson" Xu, Yifang Wang, Xiaojuan Ma, Edith CH Ngai
ACM HEALTH
-
Data Cleaning Using Large Language Models
Shuo Zhang, Zezhou Huang, Eugene Wu
DAIS Workshop at ICDE 2025
-
Alexpaca: Learning Factual Clarification Question Generation Without Examples
Matthew Toles, Yukun Huang, Zhou Yu, Luis Gravano
GEM^2 Workshop at ACL 2025
-
KGym: A Platform and Dataset to Benchmark Large Language Models on Linux Kernel Crash Resolution
Alex Mathai, Chenxi Huang, Petros Maniatis, Aleksandr Nogikh, Franjo Ivančić, Junfeng Yang, Baishakhi Ray
NeurIPS 2024
-
Simulating Cooperative Prosocial Behavior with Multi-Agent LLMs
Karthik Sreedhar, Alice Cai, Jenny Ma, Jeffrey V. Nickerson, Lydia B. Chilton
IUI 2025