Building the Foundations for Trustworthy Agent Automation
The Data, Agents, and Processes Lab (DAPLab) at Columbia University develops the systems, infrastructure, and interaction principles required for AI agents to safely and reliably automate real work.
We believe trustworthy agent automation cannot be solved at a single layer of the stack. DAPLab vertically integrates expertise across operating systems, data systems, AI, HCI, security, and enterprise workflows to build end-to-end agentic systems with real guarantees around reliability, observability, safety, and control.
For more information about the lab, please contact ewu@cs.columbia.edu
Why Vertical Integration Matters
AI agents fail across the entire stack: models hallucinate, retrieval misses critical context, execution environments lack isolation, workflows leak data, and human oversight breaks under scale. Fixing only one layer is not enough.
DAPLab brings together researchers across systems, databases, AI, HCI, security, and organizational workflows because trustworthy automation requires coordinated advances across the full agent stack — from infrastructure and state management to evaluation, safety, and human interaction.
Trustworthy Automation Requires Integration Across Layers
News & Education
New Blog Series: Agentic Data Environments
We’re publishing a series of posts on what it takes to build data environments for AI agents. The first post lays out the vision; the second digs into why today’s branchable databases aren’t ready for agentic workloads. More posts coming soon.
Trustworthy AI for Code, Industry Roundtable NYC
A curated, invite-only gathering of industry and academic leaders at the IBM Flagship Office in New York City (June 3, 2026) to discuss trustworthy AI for code. Co-organized by DAPLab (Eugene Wu), Baishakhi Ray (Columbia), Abhik Roychoudhury (NUS), and IBM Research.
A one-day workshop in New York City (May 8, 2026) bringing together ML, systems, and HCI researchers to discuss reliable, scalable, and debuggable AI agents. Talks, posters, and demos. See the website for details.
Spring 2026 AI Entrepreneurship Series
The DAPLab is organizing a series of events that bring together Columbia students, faculty, and industry partners to discuss the process of transitioning lessons from research and the classroom into products and value.
DAPLab Receives Microsoft Azure Credit Award
DAPLab has received a $250K Microsoft Azure credit award through the AARI program to support research on robust generalization in agentic AI. The funding enables work on environment scaling and diversification to improve the reliability of agentic systems in real-world deployments.
Spring 2026 DAPLab Research Seminar
The DAPlab’s Tuesday 12PM research seminar in CSB 453 (CS Conference Room) invites speakers that can share cutting-edge agent-systems research or can talk about processes in their organizations and how they are trying to automate them.
Student Honors & Fellowships
Celebrating recent student recognitions: IBM PhD Fellowship (Jerry Jiaxiang Liu), AI & Autonomous Fellowship (Alex Jiakai Xu), and CRA Outstanding Undergraduate Researcher Honorable Mention (Tianle Zhou).
Events
-
2026-06-03
Trustworthy AI for Code, Industry Roundtable NYC Baishakhi Ray, Eugene Wu, Abhik Roychoudhury, Maja VukovicA curated gathering of leaders from industry and academia for a day of invited talks, sharp di...
-
2026-05-08
2026 North East AI Agents DayThe 2026 North East AI Agents Day is a one-day workshop that brings together researchers and p...
-
2026-05-05
Agentic Risk Standard Wenyue HuaPrior work on trustworthy AI emphasizes model-internal properties such as bias mitigation, adv...
-
2026-04-30
Empowering Future Gen-AI Enterprise and Research Through AI-Native Cloud: Together AI's Perspective Leon Song, Together.AIWe are living in the era of GenAI, which has transformed not only the computing industry but a...
Publications
-
Jul 2026, ICML 2026
Outrunning LLM Cutoffs: A Live Kernel Crash Resolution Benchmark for All -
Jul 2026, ICML 2026
LAKEQA: An Exploratory QA Benchmark over a Million-Scale Data Lake -
Jul 2026, CAIS Workshop 2026
BranchBench: An Extensible Benchmark for Agentic Database Branching -
May 2026, AISTATS 2026
Panprediction: optimal predictions for any downstream task and loss -
May 2026, AISTATS 2026
Prior makes it possible: from sublinear graph algorithms to LLM test-time methods -
May 2026, IEEE S&P 2026
Your Compiler is Backdooring Your Model: Understanding and Exploiting Compilation Inconsistency Vulnerabilities in Deep Learning Compilers -
Apr 2026, CHI 2026
MIND: Empowering Mental Health Clinicians with Multimodal Data Insights through a Narrative Dashboard