git-disl
Repositories
- awesome_LLM-harmful-fine-tuning-papers
  A survey on harmful fine-tuning attacks for large language models.
- GTLLMZoo
GTLLMZoo: A comprehensive framework that aggregates LLM benchmark data from multiple sources with an interactive UI for efficient model comparison, filtering, and evaluation across performance, safety, and efficiency metrics.
- Booster
  Official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation" (ICLR 2025 Oral).
- Safety-Tax
  Official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".
- Virus
  Official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation".
- llm-topla
- PFT
- Chameleon
- Vaccine
  Official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS 2024).