Data for AI systems
Building more reliable, efficient, and generalizable AI systems through data-centric methods.

Bio
I am a Member of Technical Staff at Anthropic, where I work on AI systems, data, evaluation, and public impact.
I recently completed my PhD at MIT, advised by Sandy Pentland, and previously conducted AI research at Google Brain, Apple, and Stanford.
I founded the Data Provenance Initiative, a 50+ member research collective auditing AI datasets and ecosystems. My research has received five best or outstanding paper awards and broad coverage in outlets including NYT, WaPo, The Atlantic, and MIT Tech Review.
Throughline
My work moves between technical AI research, empirical audits like the Data Provenance Initiative, and public arguments about how AI should be built, measured, and governed.
Building more reliable, efficient, and generalizable AI systems through data-centric methods.
Auditing how AI reshapes the web, data commons, markets, scientific practice, and accountability.
Measuring how open models, benchmarks, licensing, and transparency shape technical and geopolitical power.
Featured work
Dashboards, audits, open letters, reports, and papers built for both technical scrutiny and public use.
Research collective · infrastructure · public data audit
A 50+ member research initiative auditing the licensing, attribution, consent, and transparency of the data that powers AI systems.
Live dashboards · open intelligence · concentration of power
Empirical work on open model economies, open-weight model diffusion, and the institutions shaping global AI capability access.
Flaw disclosure · safe harbor · accountability
Research and policy work arguing for robust independent AI evaluation, coordinated disclosure, and legal protections for public-interest auditing.
Recent
2026Joined Anthropic as Member of Technical Staff.
2026Open model ecosystem data featured in the Stanford AI Index Report.
2025ATLAS released, with practical scaling laws for multilingual transfer.
2025Leaderboard Illusion accepted to NeurIPS and covered by TechCrunch, Ars Technica, 404 Media, and others.
Selected papers
ICLR 2026
Practical scaling laws for multilingual transfer across pretraining, finetuning, and decoding.
NeurIPS 2025
A practical framework for designing rigorous benchmarks for agentic systems.
NeurIPS 2025
An 8TB public-domain and openly licensed text corpus for rights-conscious model training.