Xin-Cheng-Wen/VulEval

VulEval


(Logo generated by DALL·E 3)

From Function to Repository: Towards Repository-Level Evaluation of Software Vulnerability Detection


📥 Load Data

Dataset is available at:

Download Data via Google Drive

1. Download all the data from Google Drive via the following link:
https://drive.google.com/file/d/1szQ9FnIC_onQRu_TjZ2uofkjv9z_s4pv/view?usp=drive_link

Implementation

⚖️ Function-Level Vulnerability Detection

The baseline implementation code is under the VulnerabilityDetection/ folder.

📅 Vulnerability-Related Dependency Prediction

The baseline implementation code is under the DependencyPrediction/ folder.

🔔 Repository-Level Vulnerability Detection

The baseline implementation code is under the VulnerabilityDetection/ folder.

Response

Table S1: Dependency overview of vulnerability samples for VulEval.

| Set | Same File | Different File | All | Same File Ratio |
|-------|------|-------|-------|--------|
| Train | 5711 | 8497  | 14208 | 40.20% |
| Eval  | 673  | 991   | 1664  | 40.44% |
| Test  | 673  | 851   | 1524  | 44.16% |
| All   | 7057 | 10339 | 17396 | 40.57% |
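The "Same File Ratio" column in Table S1 is simply the same-file count divided by the row total; the counts below are taken from the table, and the script reproduces the reported percentages:

```python
# Same-file / different-file dependency counts per split, from Table S1.
splits = {"Train": (5711, 8497), "Eval": (673, 991), "Test": (673, 851)}

for name, (same_file, diff_file) in splits.items():
    total = same_file + diff_file
    print(f"{name}: {total} samples, {same_file / total:.2%} same-file")
```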

Figure S1: The prompt template for VulEval.

Table S2: Distribution overview of CWE types in Test set.

| CWE-ID  | Ratio  |
|---------|--------|
| CWE-190 | 42.33% |
| CWE-787 | 16.93% |
| CWE-416 | 15.52% |
| CWE-125 | 6.88%  |
| CWE-400 | 5.47%  |
| CWE-476 | 3.17%  |
| CWE-200 | 2.12%  |
| CWE-22  | 2.12%  |
| CWE-119 | 1.59%  |
| CWE-863 | 1.59%  |

Table S3: T-Test result of CodeT5 and ChatGPT.

| Model   | Function | Repository | T-Test P-value |
|---------|----------|------------|----------------|
| CodeT5  | 41.80    | 43.29      | 1.56e-03       |
| ChatGPT | 10.44    | 13.69      | 1.78e-03       |
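The p-values in Table S3 presumably come from a paired t-test over matched per-run scores of the function-level and repository-level settings. A minimal pure-Python sketch of the paired t-statistic (the function name and the example scores are illustrative, not from the repo; turning the statistic into a p-value needs a t-distribution CDF, e.g. `scipy.stats.ttest_rel`):

```python
import math

def paired_t_statistic(scores_a, scores_b):
    """Paired t-statistic for two matched lists of scores
    (e.g. per-run F1 of function-level vs. repository-level detection)."""
    assert len(scores_a) == len(scores_b) > 1
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    mean = sum(diffs) / n
    # Sample variance of the differences (n - 1 in the denominator).
    var = sum((d - mean) ** 2 for d in diffs) / (n - 1)
    return mean / math.sqrt(var / n)
```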

Table S4: The number of related CVE entries, dependency tokens, and function tokens in each split, respectively. The numbers before "/" denote the random split, and those after denote the time split.

| Set   | Dependency Tokens | Function Tokens |
|-------|-------------------|-----------------|
| train | 866.58 / 881.88   | 321.44 / 320.94 |
| valid | 861.26 / 775.25   | 315.9 / 307.3   |
| test  | 824.87 / 834.36   | 317.7 / 330.3   |
| total | 862.05 / 862.05   | 320.51 / 320.51 |

Table S5: Statistics of the dataset.

| Set   | Non-Vul | Defective | Total  | Ratio   |
|-------|---------|-----------|--------|---------|
| Train | 180259  | 5532      | 185791 | 32.58:1 |
| Eval  | 22503   | 721       | 23224  | 31.21:1 |
| Test  | 22554   | 670       | 23224  | 33.66:1 |
| All   | 225316  | 6923      | 232239 | 32.55:1 |

Baselines

```
VulEval
├─ VulnerabilityDetection
│    ├─ Supervised
│    │    ├─ Devign
│    │    ├─ Reveal
│    ├─ Finetuning
│    │    ├─ CodeBERT
│    │    ├─ CodeLlama
│    │    ├─ CodeT5
│    │    ├─ UnixCoder
│    │    ├─ LineVul
│    │    ├─ EPVD
│    │    ├─ PILOT
│    │    ├─ PDBERT
│    ├─ Prompt
│    │    ├─ CodeLlama_LLaMA
│    │    ├─ ChatGPT_GPT-3.5-Instruct
│    │    ├─ GPT-4o
├─ DependencyPrediction
│    ├─ Random
│    │    ├─ Random
│    ├─ Lexical
│    │    ├─ ES
│    │    ├─ JS
│    │    ├─ BM25
│    │    ├─ BM25Plus
│    ├─ Semantic
│    │    ├─ CodeBERT
│    │    ├─ UnixCoder
├─ README.md
```
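The lexical baselines under DependencyPrediction rank candidate dependencies by surface-level token overlap with the target function. Assuming JS stands for Jaccard similarity, a minimal sketch of that baseline (function and variable names here are illustrative, not taken from the repo):

```python
def jaccard_similarity(tokens_a, tokens_b):
    """Jaccard similarity between two token sequences:
    |intersection| / |union| of their token sets."""
    set_a, set_b = set(tokens_a), set(tokens_b)
    union = set_a | set_b
    return len(set_a & set_b) / len(union) if union else 0.0

def rank_dependencies(function_code, candidates):
    """Rank candidate dependency snippets by lexical overlap
    with the target function, highest similarity first."""
    query = function_code.split()
    return sorted(candidates,
                  key=lambda cand: jaccard_similarity(query, cand.split()),
                  reverse=True)
```

The ES (edit similarity) and BM25 variants follow the same retrieve-and-rank shape, only swapping the scoring function.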
