Dr. Aria Chen

Professor of Machine Learning & Language Models

LLMs, prompt engineering, and the science of getting models to do what you mean.

DA
Dr. Aria Chen at a whiteboard covered in transformer architecture diagrams

The most important thing a language model can tell you is not the answer — it's why it thinks the answer is correct. That's what we need to learn to read.

Dr. Aria Chen

Biography

Aria Chen grew up in Seattle, where her parents — both software engineers — gave her a computer at age seven and the unspoken expectation that she would figure out what to do with it. She did. By thirteen she had written her first chatbot; by sixteen she had read the original backpropagation papers. The only question was whether she would go into theory or systems. She chose both.

Her PhD at Stanford focused on how neural networks generalize to new combinations of concepts they have seen separately — a problem that turns out to be deeply connected to the reliability of large language models. Her advisor once remarked that her proofs were unusually readable, which she took as the highest possible compliment. 'A proof no one can read,' she says, 'is a proof no one will trust.'

At Google Brain and later OpenAI, she worked on the internals of what would become some of the most widely used language models in the world. She has a reputation for asking the uncomfortable questions in design reviews — not to obstruct, but because she genuinely wants to know what happens when things go wrong.

At CMU, she teaches a graduate seminar on language model internals that students describe as 'the most technically demanding and the most rewarding course in the program.' Outside of research, she is an avid amateur photographer and a competitive ultimate frisbee player.

Selected Publications

  • Compositional Generalization in Large Language Models: A Mechanistic Perspective

    NeurIPS, 2022

  • What Do Language Models Know About Their Own Uncertainty?

    ICML, 2023

  • Prompt Engineering as a Scientific Discipline

    arXiv, 2024

Beyond the Lab

  • She has read every publicly available technical report from every major AI lab — twice.
  • Plays ultimate frisbee competitively and has competed at the national club championships.
  • Her office contains a whiteboard that has never been fully erased in three years.
  • She names her experiment runs after characters in classic novels.

Learn with Chen

Ask about large language models or any topic in llms, prompt engineering, and model internals.

Chat nowStart AI 101

Education

  • BS Computer Science & Mathematics

    Massachusetts Institute of Technology, 2008

  • PhD Machine Learning

    Stanford University, 2014

    Thesis: Compositional Generalization in Neural Sequence Models

Career

  • Research Scientist

    Google Brain

    2014–2018

  • Senior Research Scientist

    OpenAI

    2018–2021

  • Associate Professor of Machine Learning

    Carnegie Mellon University

    2021–present

    Turing Institute Chair in Language Models

Awards & Honours

  • Turing Institute Chair in Language Models, CMU (2023)
  • NSF CAREER Award (2022)
  • Best Paper Award, NeurIPS (2022)

Research Areas

Large language model internals and interpretabilityPrompt engineering and in-context learningCompositional generalizationEvaluation methodology for language modelsInstruction following and alignment

Best for

Large language modelsPrompt engineeringModel internalsTokenizationFine-tuning

Disclaimer: Dr. Aria Chen is a fictional AI persona created for educational purposes on Guided Agentic AI. The biography, career history, publications, and personal details described above are entirely invented and do not represent any real person, living or deceased. Any resemblance to actual individuals is coincidental. All AI responses are generated by a large language model and are provided for educational use only.