yixinhuang48/README.md

Hi, I'm Yixin Huang

🌐 yixinhuang48.github.io (personal website) · 🔬 Hao AI Lab (research group) · 📚 Google Scholar (publications)

Research Assistant at UCSD Hao AI Lab · M.S. Computer Science Student · San Diego, CA

I work on LLM systems, evaluation, and GPU-accelerated ML infrastructure.

"A journey of a thousand miles begins with a single step." (Laozi, Tao Te Ching)


Research Interests

  • LLM evaluation & benchmarks (agents, games, scientific reasoning)
  • Large-scale training & inference systems (FSDP, vLLM, Ray, Slurm)
  • Reinforcement learning for agents (GRPO, NeMo-Gym)

Selected Projects

| Project | Description |
| --- | --- |
| GamingAgent | LLM/VLM gaming agents for model evaluation: long-horizon reasoning, memory, and perception (Doom, Sokoban, Tetris, Pokémon Red) |
| VideoScience | Benchmark for scientific correctness in text-to-video models: physics and chemistry concepts, VLM-as-Judge scoring |
| NVIDIA NeMo Gym | RL environments for LLM training: scalable RL, reward profiling, GRPO (integrating Sokoban and Tetris) |
| lmenv | LLM environment framework for interactive evaluation: standardized interfaces for game-based agent testing |

Tech Stack

Python · PyTorch · CUDA · vLLM · SGLang · NeMo RL · Ray · Docker · Slurm · FSDP · DeepSpeed · Linux · Git

Pinned

  1. hao-ai-lab/VideoScience (Public)

    Python 9 2

  2. lmgame-org/GamingAgent (Public)

    [ICLR 2026] LLM/VLM gaming agents and model evaluation through games.

    Python 913 99

  3. lmgame-org/lmenv (Public)

    Python 1

  4. NVIDIA-NeMo/Gym (Public)

    Build RL environments for LLM training

    Python 813 115

  5. yixinhuang48.github.io (Public)

    HTML 1