Skip to content
View nhungnt7's full-sized avatar

Highlights

  • Pro

Block or report nhungnt7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nhungnt7/README.md

Nhung Thi Nguyen

πŸ“§ [email protected] | [email protected] | 🌐 Personal Website | πŸ’» GitHub

πŸŽ“ Education

Monash University
Starting May 2025

πŸ’Ό Experience

πŸ‘¨β€πŸ’Ό Team Leader

MISA JSC - misa.vn
January 2025 - May 2025

  • Built top-ranked Vietnamese LLMs (VMLU #1 as of March 25, 2025; government-organized national top 5)
  • Built automated data generation and evaluation tools, local LLMs and domain experts (Accounting, Finance Analysis)
  • Focus: LLM fine-tuning and alignment, safety alignment, RAG, agent and multi-agent chatbots.

🧠 AI Engineer

MISA JSC - misa.vn
April 2024 - December 2024

  • Developed Legal document Q&A chatbot (search accuracy >95%, answer accuracy >90% accuracy on 1000+ expert-curated real-world questions, 1000+ documents each 3-300 pages long)
  • Focus: Rasa, databases, chunking, RAG, text2sql, LLMs.

πŸ”¬ AI Resident

VinAI Research - vinai.io
March 2022 - April 2024

πŸ“ Publications

  • SharpSeq: Empowering Continual Event Detection through Sharpness-Aware Sequential-task Learning
    Thanh-Thien Le, Viet Dao*, Linh Văn Nguyen*, Thi-Nhung Nguyen, Linh Van Ngo, Thien Huu Nguyen*
    2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2024)

  • BKEE: Pioneering Event Extraction in the Vietnamese Language
    Thi-Nhung Nguyen, Bang Tran, Trong-Nghia Luu, Kiem-Hieu Nguyen and Thien Huu Nguyen
    The 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

  • A Self-enhancement Multitask for Unsupervised Aspect Category Detection
    Thi-Nhung Nguyen, Hoang Ngo, Kiem-Hieu Nguyen, Tuan-Dung Cao
    Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP) 2023

  • An Uncertainty-aware encoder for Aspect Detection
    Thi-Nhung Nguyen, Kiem-Hieu Nguyen, Young-In Song, Tuan-Dung Cao
    Findings of the Association for Computational Linguistics: EMNLP 2021

πŸš€Side Projects

πŸ€– PhoGPT: Generative Pre-training for Vietnamese

VinAI Research
March - August 2023

  • Supervisor: Dr. Dat Quoc Nguyen
  • Research Topic: LLM from scratch, Crawling, Ranking models, LLMs

πŸ› οΈ Technical Skills

  • Programming Languages: Python, SQL
  • Libraries & Frameworks: PyTorch, Rasa, FastAPI, LangGraph, ...
  • AI & ML Technologies: LLMs, RAG, Embedding Models, Text2SQL, LLMs Finetuning and Alignment, Reinforcement Learning
  • Databases & Search: MongoDB, SQLite, Elasticsearch, Qdrant, Hybrid Search
  • Cloud Technologies: AWS, Azure
  • DevOps & CI/CD: Docker, Jenkins
  • Developer Tools: Git, GitHub, GitLab, Jira, VS Code
  • Leadership & Management: Agile, Scrum, Team Mentorship, Project Management, Stakeholder Communication

Pinned Loading

  1. UCE UCE Public

    Uncertainty-Aware Encoder (Findings of EMNLP 2021)

    Jupyter Notebook 3

  2. ASEM ASEM Public

    A Self-enhancement Multitask framework (EMNLP 2023)

  3. BKEE BKEE Public

    BKEE: Pioneering Event Extraction in the Vietnamese Language

    Python 4

  4. OpenSynth OpenSynth Public

    Your efficient and high-quality domain-specific synthetic data generation pipeline!

    Python 5