[id2vec] Collect Role-Identifier dataset on full PGA #5
Comments
@zurk Please update the status.
Since jgit-spark-connector (formerly sourced-engine) is deprecated, we should first rewrite sourced-ml; after that, we can collect the dataset.
@vmarkovtsev I do not think this dataset adds any value for us now.
Actually no, this is still important and valuable. Once we copy PGAv2 in https://github.com/src-d/infrastructure/issues/788, we should extract the dataset. It is fine to write some ad-hoc code for that.
Romain, you will be able to do this on the UASTs collected from PGAv2.
Please remember to use
I currently have a small piece of this dataset for tests, as reported here: https://github.com/src-d/backlog/issues/1248#issuecomment-398319941
@vmarkovtsev wants to publish all the datasets we have right now (https://github.com/src-d/backlog/issues/1310), so this is a good opportunity to collect the full Role-Identifier dataset.