Yifan Zhou

CS + Math @ UCLA · Incoming MTS Intern @ OpenAI

I am a student at University of California, Los Angeles studying Computer Science and Applied Mathematics. I am currently conducting independent research in mechanistic interpretability, developing methods to analyze how post-training reshapes model behavior — using first-divergence model diffing to pinpoint the earliest token where a pretrained model and its instruction-tuned descendant disagree, then decomposing how upstream state and late-stack computation jointly produce that disagreement.

My broader interests also include reinforcement learning, multi-agent systems, and efficient ML systems. I have worked on multi-agent component analysis, agent behavior monitoring & evaluation, and CPU-efficient inference systems (SimHash, int8 quantization, ONNX kernel fusion).

Previously, I interned as an Applied Scientist at Microsoft, working on SimHash and efficient ML systems. I have also worked at Celestra, Judgment Labs, and Tencent. I will be joining OpenAI (Integrity) as a Member of Technical Staff Intern in Summer 2026.


Selected Publications

* equal contribution

A complete list of publications can be found on my Google Scholar page.

  1. Investigating Component Contributions in Multi-Agent ML Systems
    Junsung Kim*, Ilia Mireskandari*, Seungwan Son*, Yifan Zhou*, Khizer Shahid*, and Dylan Yihan Dai*
    International Conference on Machine Learning (ICML) 2026
  2. Preprint
    pt-it-model-diff.png
    Instruction Tuning Changes How Upstream State Conditions Late Readout: A Cross-Patching Diagnostic
    Yifan Zhou
    Preprint 2026
  3. Preprint
    convergence-gap.png
    The Convergence Gap: Instruction-Tuned Language Models Stabilize Later in the Forward Pass
    Yifan Zhou
    Preprint 2026

Industry Experience

OpenAI
OpenAI
Member of Technical Staff Intern (Incoming)
Jun 2026 – Sep 2026
San Francisco, CA
Incoming Summer 2026; Integrity organization.
Microsoft
Microsoft
Applied Scientist Intern
Jan 2026 – Apr 2026
Redmond, WA
Worked on similarity-preserving hashing and CPU-side inference systems, including ONNX kernel-fusion optimizations and int8 quantization. Mentored by Jinyu Li and Wenbin Zhu.
Celestra
Celestra
Research Intern
Sep 2025 – Dec 2025
Remote
Co-authored work on agent scaffolding (4,000+ ablations; ICML 2026) and developed a modular multi-agent system achieving top performance on MLE-Bench.
Judgment Labs
Judgment Labs
Member of Technical Staff
Jan 2025 – Sep 2025
San Francisco, CA
Built semantic search over long-form agent traces for trace bucketing and retrieval, and worked on RL-tuned LLM-as-a-judge for agent performance evaluation. Helped maintain judgeval.
Tencent
Tencent
Student Researcher
Jun 2024 – Jul 2024
Shenzhen, China
Reproduced an IBM Max-Cut QAOA paper on Tencent's superconducting quantum backend using TensorCircuit, then trained a small neural network to predict and mitigate per-circuit QAOA noise. Mentored by Yi-Chong Zheng.

Education

UCLA
University of California, Los Angeles
B.S. in Computer Science and Applied Mathematics · GPA: 3.92/4.0
Sep 2024 – Jun 2027 (Expected)
Los Angeles, CA
Reinforcement Learning, NLP, Foundations of ML, Algorithms, Database Systems, Probability Theory, Complex Analysis, Linear Algebra.

Miscellaneous 🙃