Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset Release Timeline #1

Open
iNeil77 opened this issue Jan 29, 2025 · 4 comments
Open

Dataset Release Timeline #1

iNeil77 opened this issue Jan 29, 2025 · 4 comments

Comments

@iNeil77
Copy link

iNeil77 commented Jan 29, 2025

Hello to the authors!

I was reading the ProSec paper and was excited by the direct use of CWE descriptions to induce vulnerabilities in the LLM generations. I would like to ask about the authors' timelines for releasing the secure-vulnerable code pairs data and the synthesized instructions as mentioned in the paper.

Many Thanks

@XZ-X
Copy link
Member

XZ-X commented Jan 29, 2025

Thank you for your interest! We are currently working on an updated version of the dataset.

We plan to release the version by next week.

Please feel free to let us know if you have further questions!

@iNeil77
Copy link
Author

iNeil77 commented Jan 31, 2025

Thanks for getting back. I eagerly await the data release!

@XZ-X
Copy link
Member

XZ-X commented Feb 7, 2025

Thank you again for your interest!

We released the first version of our vulnerability-inducing instruction dataset at Hugging Face🤗.

We plan to release the model-specific code pairs and the aligned models in the following one or two weeks.

Please definitely let us know if you have questions!

@iNeil77
Copy link
Author

iNeil77 commented Feb 11, 2025

Thanks for getting back! I eagerly await the code pairs dataset.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants