Ryan Zheng - Physics student at UCLA, AI researcher, Scale AI intern

ryan zheng

hi! i'm ryan, a

education

University of California, Los Angeles Graduating Spring 2027

B.S. in Physics, Data Science Engineering Minor

  • Relevant Coursework: Data Structures & Algorithms, Computer Architecture, Multivariable Calculus, Linear Algebra, Differential Equations, Modern Physics

experience

Scale AI • Generative AI Intern January 2025 - present

  • Contribute to training and evals for SOTA reasoning and agentic models across 5+ clients and 100+ tasks, ranging from abstract visual reasoning, to deep research studies, to next-gen SWE agents solving real world GitHub issues.
  • Execute rigorous quality assurance reviews for critical datasets powering SWE agents in Java, C++, Python, Go, and Rust, evaluating intern deliverables against customer specs to ensure data integrity. Improve data quality rating by organizing and leading in-person project workshops for contributing interns.
  • Construct robust Docker testing environments and write comprehensive rubrics for industry standard AI benchmarks such as the Aider LLM Leaderboards.

ACM AI @ UCLA • Projects Officer January 2025 - present

  • Design and lead student projects exploring advanced AI topics.

skills

  • Languages: C++, Java, Python, Swift, SQL, HTML/CSS, R
  • Frameworks: PyTorch, Hugging Face, React, Jekyll
  • Developer Tools: Git, Cursor, VS Code, XCode, Jupyter

projects

Kaggle S&P500 Prediction • ACM AI October 2025 - Present

  • Design and iterate neural networks, such as decision trees, feed-forward networks, and Long Short-Term Memory (LSTMs) models to predict forward returns of S&P 500 for Kaggle competition.
  • Improve data preprocessing and model evaluation by implementing KNN imputation for dataset NaN values and k-fold cross validation. Competition scoring in progress.

R1 Reasoning • ACM AI March 2025 - June 2025

  • Implemented reinforcement learning from human feedback (RLHF) system using Group Relative Policy Optimization (GRPO) to fine-tune Qwen2.5-7B-Instruct model for mathematical reasoning tasks. Improved out-of-the-box model accuracy on test data by 17 percentage points.
  • Identified and debugged issues with repetition rewards, correctness metric, and dataset parameters. Implemented custom correctness checking via regex pattern matching and repetition detection using n-gram analysis.
  • Optimized memory usage through gradient checkpointing, 8-bit optimizers, and automatic GPU memory management for multi-GPU training.

publications

Chen, Y., Jiao, J., & Zheng, R. (2024). Exploring changes in trip generation and impacts of built environment between regular and essential trips: A study based on the contiguous United States. Proceedings of the CICTP 2024 (pp. 3317–3326). Presented at the CICTP 2024. https://doi.org/10.1061/9780784485484.314