n-gram phrase matching for many phrases #2

Closed
andehr opened this issue Jun 8, 2016 · 0 comments
andehr commented Jun 8, 2016

Create a document processor that efficiently identifies custom n-grams of any length, even when the set of custom n-grams is large.
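
One possible way to match many phrases efficiently is a token-level trie, sketched below. This is only an illustration under assumptions, not the project's implementation: the names `build_trie` and `find_ngrams` are hypothetical, and the input is assumed to be already tokenised into a list of strings.

```python
from typing import Dict, Iterable, List, Sequence, Tuple

# Sentinel key marking the end of a complete phrase in the trie (hypothetical name).
_END = object()


def build_trie(phrases: Iterable[Sequence[str]]) -> Dict:
    """Build a token-level trie from an iterable of tokenised phrases."""
    root: Dict = {}
    for phrase in phrases:
        if not phrase:  # ignore empty phrases
            continue
        node = root
        for token in phrase:
            node = node.setdefault(token, {})
        node[_END] = True
    return root


def find_ngrams(tokens: Sequence[str], trie: Dict) -> List[Tuple[int, int]]:
    """Return every (start, end) span, end exclusive, whose tokens form a known phrase."""
    matches = []
    for start in range(len(tokens)):
        node = trie
        for end in range(start, len(tokens)):
            node = node.get(tokens[end])
            if node is None:  # no phrase continues with this token
                break
            if _END in node:  # a complete phrase ends here
                matches.append((start, end + 1))
    return matches
```

With this layout, the work done at each start position is bounded by the length of the longest phrase rather than by the number of phrases, which is what makes it attractive for large phrase sets.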

Create a corresponding feature inferrer that can pick out these custom n-grams and produce them as features.

The custom n-grams should be able to override the normal n-grams to avoid repeating features, while still respecting any other filtering that is applied.

There should be an option to include or remove overlapping matches.
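
The overlap option could look something like the sketch below. It assumes matches are represented as `(start, end)` token spans with `end` exclusive, and that when overlaps are removed the longest match wins, with ties going to the earliest start; the issue does not specify a policy, so that choice and the name `resolve_overlaps` are assumptions.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # (start, end) token offsets, end exclusive


def resolve_overlaps(spans: List[Span], keep_overlaps: bool = False) -> List[Span]:
    """Return spans in document order, optionally dropping overlapping matches."""
    if keep_overlaps:
        return sorted(spans)
    # Assumed policy: prefer longer spans, breaking ties by earliest start.
    ranked = sorted(spans, key=lambda s: (-(s[1] - s[0]), s[0]))
    chosen: List[Span] = []
    for start, end in ranked:
        # Keep the span only if it is disjoint from every span already chosen.
        if all(end <= c_start or start >= c_end for c_start, c_end in chosen):
            chosen.append((start, end))
    return sorted(chosen)
```

Other policies, such as leftmost-first, would be equally reasonable; the point is only that the choice can sit behind a single flag.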
