Yixin Huang yixinhuang48

🌐 yixinhuang48.github.io — personal website 🔬 Hao AI Lab — research group 📚 Google Scholar — publications

Research Assistant at UCSD Hao AI Lab · M.S. Computer Science Student · San Diego, CA

I work on LLM systems, evaluation, and GPU-accelerated ML infrastructure.

"A journey of a thousand miles begins with a single step." — Confucius

Project	Description
GamingAgent	LLM/VLM gaming agents for model evaluation — long-horizon reasoning, memory & perception (Doom, Sokoban, Tetris, Pokémon Red)
VideoScience	Benchmark for scientific correctness in text-to-video models — physics & chemistry concepts, VLM-as-Judge scoring
NVIDIA NeMo Gym	RL environments for LLM training — scalable RL, reward profiling, GRPO (Integrating Sokoban & Tetris)
lmenv	LLM environment framework for interactive evaluation — standardized interfaces for game-based agent testing

Python · PyTorch · CUDA · vLLM · SGLang · NeMo RL · Ray · Docker · Slurm · FSDP · DeepSpeed · Linux · Git

Provide feedback