The entry file is KOV.py
The NGCG algorithm optimizes adversarial suffixes by balancing two objectives:
- Negative log-likelihood (NLL) to induce harmful behavior.
- Log-perplexity to maintain natural language coherence.
The loss function is defined as:
| Name | Name | Last commit date | ||
|---|---|---|---|---|
The entry file is KOV.py
The NGCG algorithm optimizes adversarial suffixes by balancing two objectives:
The loss function is defined as: