Skip to content
View dashitongzhi's full-sized avatar

Highlights

  • Pro

Block or report dashitongzhi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
dashitongzhi/README.md

header

╔═══════════════════════════════════════════╗
β•‘                                           β•‘
β•‘   β–ˆβ–ˆβ•—  β–ˆβ–ˆβ•— β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ•—   β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ•— β–ˆβ–ˆβ•—          β•‘
β•‘   β–ˆβ–ˆβ•‘ β–ˆβ–ˆβ•”β• β–ˆβ–ˆβ•”β•β•β–ˆβ–ˆβ•— β–ˆβ–ˆβ•”β•β•β•β–ˆβ–ˆβ•— β–ˆβ–ˆβ•‘         β•‘
β•‘   β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ•”β•  β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ•”β• β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ•‘ β–ˆβ–ˆβ•‘         β•‘
β•‘   β–ˆβ–ˆβ•”β–ˆβ–ˆβ•—   β–ˆβ–ˆβ•”β•β•β–ˆβ–ˆβ•— β–ˆβ–ˆβ•”β•β•β•β–ˆβ–ˆβ•‘ β–ˆβ–ˆβ•‘         β•‘
β•‘   β–ˆβ–ˆβ•‘ β–ˆβ–ˆβ•—  β–ˆβ–ˆβ•‘  β–ˆβ–ˆβ•‘ β–ˆβ–ˆβ•‘   β–ˆβ–ˆβ•‘ β–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ–ˆβ•—   β•‘
β•‘   β•šβ•β•  β•šβ•β• β•šβ•β•  β•šβ•β• β•šβ•β•   β•šβ•β• β•šβ•β•β•β•β•β•β•β•   β•‘
β•‘                                           β•‘
β•šβ•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•β•

⚑ KRAL ⚑

Typing SVG

GitHub Followers GitHub Stars Email


Man Technologist About Me

Hi, I'm Cao Hanzhe (Kral) β€” a CS student, AI researcher, and open-source enthusiast.

  • πŸ”­ Building Multi-Agent Systems that reason, debate, and collaborate to solve complex real-world problems
  • 🧠 Advancing Reinforcement Learning β€” from RLHF reward modeling to agentic RL with multi-turn reasoning
  • πŸ€– Bridging the gap between simulation and real-world robotics β€” sim-to-real transfer, embodied AI
  • 🌐 Pushing the frontier of LLM Reasoning β€” test-time compute scaling, search-augmented generation, tool-use agents
  • πŸ—οΈ Creator of MingJian (ζ˜Žι‰΄) β€” an evidence-driven multi-agent simulation platform for strategic decision-making
  • πŸ“¬ Reach me at: [email protected]

Rocket What I Do

πŸ€– Multi-Agent Orchestration

Designing agent systems where multiple LLMs collaborate through debate protocols, evidence chains, and structured reasoning β€” not just simple tool-calling.

🧠 Reinforcement Learning

Implementing and fixing core RL algorithms β€” from PettingZoo parallel environments to LinUCB contextual bandits. Contributing fixes upstream to pytorch/rl and Pearl.

🦾 Robotics & Sim-to-Real

Working with robosuite and NVIDIA IsaacLab to build robust simulation pipelines that transfer to real robots. Fixing core physics engine bugs and resource management.

πŸ›‘οΈ AI Safety & Evaluation

Building automated safety checks β€” prompt injection detection, red-teaming frameworks, and LLM evaluation harnesses. Contributing to Giskard AI safety platform.


Rocket Featured Project

πŸ—οΈ MingJian (ζ˜Žι‰΄) β€” Multi-Agent Decision Platform

AI-powered multi-agent platform for evidence-driven scenario simulation and strategic decision-making

  • ⭐ 19 stars Β· Python Β· FastAPI
  • πŸ›οΈ Supports corporate and military strategic domains
  • 🎭 Multi-agent debate protocol with evidence chains
  • πŸ“Š Real-time scenario simulation engine
  • πŸ”— github.com/dashitongzhi/MingJian

Toolbox Tech Stack

Languages

Python Rust TypeScript Swift C++

AI / ML

PyTorch LangChain HuggingFace OpenAI

Infrastructure

Docker Linux FastAPI ROS


Trophy GitHub Stats

GitHub Stats

GitHub Streak

GitHub Trophies


Octopus Open Source Contributions

Active contributor to 30+ AI and agent projects across GitHub β€” fixing core bugs, adding safety features, and improving developer experience

Category Projects Highlights
πŸ€– Agent Frameworks rllm, notte, Composio/agent-orchestrator, stakpak/agent Core bug fixes, session management, async improvements
🧠 Reinforcement Learning pytorch/rl, facebookresearch/Pearl, alibaba/ROLL Fixing PettingZoo parallel env bugs, LinUCB tensor squeeze, agentic LR scheduler
🦾 Robotics robosuite, IsaacLab, SmolVM Resource leak fixes, docstring corrections, sim-to-real improvements
πŸ”§ AI Infrastructure Kokoro-FastAPI, any-llm, Art, burr Error message fixes, kwargs passthrough, install automation
πŸ›‘οΈ AI Safety Giskard-AI, Agent-R1 LLM-based prompt injection detection, red-teaming checks
πŸ“¦ Dev Tools visidata, cc-switch, hermecore, go-micro Shell command fixes, metadata parsing, CI improvements

Star Achievements

  • πŸ† Starstruck β€” Repository earned 16+ stars
  • 🦈 Pull Shark β€” Merged 30+ pull requests across major open-source projects
  • πŸ“Š 330+ contributions in the last year
  • 🌍 Contributed to projects from Meta, PyTorch, Alibaba, NVIDIA, Apache and more
  • πŸš€ Built and maintained MingJian β€” a production-grade multi-agent platform

Bar Chart Contribution Graph

Contribution Graph


Ask Me About Ask Me About

Multi-Agent Systems Reinforcement Learning Robotics LLM Reasoning AI Safety Sim-to-Real


Currently Working On Currently Working On

  • πŸ—οΈ MingJian v2 β€” Enhanced multi-agent debate protocol with evidence chain validation
  • 🧠 Agentic RL β€” Multi-turn reinforcement learning for LLM agents
  • 🦾 IsaacLab Contributions β€” Improving sim-to-real transfer pipelines
  • πŸ›‘οΈ Prompt Injection Detection β€” Building automated LLM safety evaluation tools

Quote Quote

"The question of whether machines can think is about as relevant as the question of whether submarines can swim." β€” Edsger W. Dijkstra


Handshake Let's Connect

I'm always open to collaboration on multi-agent systems, RL research, and robotics projects.

If you're building something in the AI agent space β€” let's talk! πŸš€

Email GitHub


Profile Views

footer

Popular repositories Loading

  1. MingJian MingJian Public

    AI-powered multi-agent platform for evidence-driven scenario simulation and strategic decision-making. Supports corporate \& military domains with debate protocol.

    Python 18 2

  2. NNovel NNovel Public

    ε†™ε°θ―΄ηš„θΎ…εŠ©ε·₯ε…·

    Python 3 1

  3. HoST HoST Public

    Forked from InternRobotics/HoST

    [RSS 2025 Best Systems Paper Finalist] πŸ’Official implementation of "Learning Humanoid Standing-up Control across Diverse Postures"

    Python 1

  4. TextOp TextOp Public

    Forked from TeleHuman/TextOp

    TextOp: Real-time Interactive Text-Driven Humanoid Robot Motion Generation and Control

    Python 1

  5. liquid-glass-react liquid-glass-react Public

    Forked from rdev/liquid-glass-react

    Apple's Liquid Glass effect for React

    TypeScript 1

  6. liquid-glass liquid-glass Public

    Forked from callstack/liquid-glass

    Liquid Glass in React Native

    TypeScript 1