File an issue in src-d/enry about language misclassifications in Tensorflow #104

Open
vmarkovtsev opened this issue Oct 14, 2019 · 2 comments
@vmarkovtsev
Collaborator

As we saw on Friday, enry sometimes does a very poor job of classifying languages. We need to report this properly.

cc @lwsanty

@EgorBu

EgorBu commented Oct 14, 2019

Plan

  • Select a commit
  • Run enry and linguist on it
  • Create a mapping {path: (enry_lang, linguist_lang)}
  • Highlight cases where a path has different predictions and group them by language
  • Provide the script and statistics
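
The mapping and grouping steps of the plan could be sketched roughly as follows. This is only an illustration, not the script the issue asks for: it assumes the per-file predictions of each tool have already been collected into two plain `{path: language}` dicts (how to obtain them from enry and linguist is deliberately left out), and the function names `build_mapping`, `group_disagreements`, and `summarize` are hypothetical.

```python
from collections import defaultdict


def build_mapping(enry_langs, linguist_langs):
    """Combine two {path: language} dicts into {path: (enry_lang, linguist_lang)}.

    A path missing from one tool's output gets None for that tool.
    """
    paths = sorted(set(enry_langs) | set(linguist_langs))
    return {p: (enry_langs.get(p), linguist_langs.get(p)) for p in paths}


def group_disagreements(mapping):
    """Group paths whose predictions differ, keyed by (enry_lang, linguist_lang)."""
    groups = defaultdict(list)
    for path, (enry_lang, linguist_lang) in mapping.items():
        if enry_lang != linguist_lang:
            groups[(enry_lang, linguist_lang)].append(path)
    return dict(groups)


def summarize(groups):
    """Return ((enry_lang, linguist_lang), count) tuples, most frequent first."""
    return sorted(
        ((pair, len(paths)) for pair, paths in groups.items()),
        key=lambda item: -item[1],
    )


if __name__ == "__main__":
    # Made-up sample data, for illustration only.
    enry = {"a.h": "C", "b.py": "Python", "c.h": "C"}
    linguist = {"a.h": "C++", "b.py": "Python", "c.h": "C++"}
    for pair, count in summarize(group_disagreements(build_mapping(enry, linguist))):
        print(pair, count)
```

Keying the groups by the pair of predicted languages makes it easy to see which confusions dominate before deciding whether to report them to enry, to linguist, or to both.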

@smola

smola commented Oct 23, 2019

If misclassifications come from linguist, make sure you open an issue at github/linguist too.
