Dr. Aria Chen

Professor of Machine Learning & Language Models

“LLMs, prompt engineering, and the science of getting models to do what you mean.”

Dr. Aria Chen at a whiteboard covered in transformer architecture diagrams

“The most important thing a language model can tell you is not the answer — it's why it thinks the answer is correct. That's what we need to learn to read.”
— Dr. Aria Chen

Biography

Aria Chen grew up in Seattle, where her parents — both software engineers — gave her a computer at age seven and the unspoken expectation that she would figure out what to do with it. She did. By thirteen she had written her first chatbot; by sixteen she had read the original backpropagation papers. The only question was whether she would go into theory or systems. She chose both.

Her PhD at Stanford focused on how neural networks generalize to new combinations of concepts they have seen separately — a problem that turns out to be deeply connected to the reliability of large language models. Her advisor once remarked that her proofs were unusually readable, which she took as the highest possible compliment. 'A proof no one can read,' she says, 'is a proof no one will trust.'

At Google Brain and later OpenAI, she worked on the internals of what would become some of the most widely used language models in the world. She has a reputation for asking the uncomfortable questions in design reviews — not to obstruct, but because she genuinely wants to know what happens when things go wrong.

At CMU, she teaches a graduate seminar on language model internals that students describe as 'the most technically demanding and the most rewarding course in the program.' Outside of research, she is an avid amateur photographer and a competitive ultimate frisbee player.

Selected Publications

Compositional Generalization in Large Language Models: A Mechanistic Perspective
NeurIPS, 2022
What Do Language Models Know About Their Own Uncertainty?
ICML, 2023
Prompt Engineering as a Scientific Discipline
arXiv, 2024

Beyond the Lab

◆She has read every publicly available technical report from every major AI lab — twice.
◆Plays ultimate frisbee competitively and has competed at the national club championships.
◆Her office contains a whiteboard that has never been fully erased in three years.
◆She names her experiment runs after characters in classic novels.

Learn with Chen

Ask about large language models or any topic in llms, prompt engineering, and model internals.

Chat now Start AI 101

Education

BS Computer Science & Mathematics
Massachusetts Institute of Technology, 2008
PhD Machine Learning
Stanford University, 2014
Thesis: Compositional Generalization in Neural Sequence Models

Career

Research Scientist
Google Brain
2014–2018
Senior Research Scientist
OpenAI
2018–2021
Associate Professor of Machine Learning
Carnegie Mellon University
2021–present
Turing Institute Chair in Language Models

Awards & Honours

★Turing Institute Chair in Language Models, CMU (2023)
★NSF CAREER Award (2022)
★Best Paper Award, NeurIPS (2022)

Research Areas

Large language model internals and interpretabilityPrompt engineering and in-context learningCompositional generalizationEvaluation methodology for language modelsInstruction following and alignment

Best for

Large language modelsPrompt engineeringModel internalsTokenizationFine-tuning

Next guide

Prof. Liam Carter

→

Disclaimer: Dr. Aria Chen is a fictional AI persona created for educational purposes on Guided Agentic AI. The biography, career history, publications, and personal details described above are entirely invented and do not represent any real person, living or deceased. Any resemblance to actual individuals is coincidental. All AI responses are generated by a large language model and are provided for educational use only.