-
Notifications
You must be signed in to change notification settings - Fork 595
Error when trying to run the grammar example with Mistral #580
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Thanks for submitting an issue! I was able to reproduce, both in
Here is a related issue in Lark where excessive recursion results in the same error after they're 242 terminals deep: lark-parser/lark#550 The output of the following is unsurprising given the linked issue:
The way I see it we have 3 options here:
I'm leaning towards 2. The first option is a one-off and theoretically doesn't always solve the issue. @dborowiec10 my recommendation is to use Any thoughts as to why this is cropping up now? It seems to have trouble generating the As a short term fix, should we use |
@lapp0 Thanks for the suggestion, will test with v0.2 on my end this week. |
I tested with a quantized version on Mistral-7B-v0.1 and never encountered this issue. We can either change the model version, set the seed and/or a maximum number of tokens. |
The EOS Token probability is consistently low during generation, leading to a lack of termination. I've found this to be the case of any regex or grammar. This is likely because the model aims to continue generating beyond the constrained format, often attempting to provide further explanation. I'm going to experiment with stopping at newlines for single line regexp, and wrapping grammars in codeblocks which terminate at the end of the codeblock for CFG. |
I'm having issues with When generating with vLLM
|
Describe the issue as clearly as possible:
When trying to run the example from the README, it fails with a recursion depth exceeded error.
Running on a pretty straightforward
Nvidia A100 80GB
setup and environment as per thepyproject.toml
.Steps/code to reproduce the bug:
Expected result:
1+3-2-4+5-7+8-6+9-6+4-2+3+5-1+1 As per example in README
Error message:
Outlines/Python version information:
Context for the issue:
No response
The text was updated successfully, but these errors were encountered: