-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conflicting results between methods 'canCrawl', 'check' and 'parse' #77
Comments
Hi @sanderheilbron, thanks for your issue! Merging #76 will fix this. I'll merge that and release a new version. |
Hi @lucasfcosta, thanks for the update and your effort in fixing these issues! Yesterday I did some local tests with the fix for #76 and noticed this issue. Are you sure it will be fixed by merging #76? |
@sanderheilbron Yup! This was happening because whenever I'll be releasing a fix in a few minutes. |
@sanderheilbron Done! |
@lucasfcosta Thanks! |
Hi @lucasfcosta, just did some tests with v1.0.15, and unfortunately got the same results. |
Hi @sanderheilbron, thanks for getting in touch. EDIT: Actually I think this behavior is correct. |
Hi @lucasfcosta, you can test the behaviour of Googlebot with the robots.txt-tester inside Google Search Console (https://www.google.com/webmasters/tools/robots-testing-tool). Also you can use some other tools which follow how Googlebot handles robots.txt files: |
@sanderheilbron thank you very much! For now I will reopen it. Thanks for your help and sorry for not being able to solve it right now. However, I promise I'll work on this whenever I have some spare time. |
Thanks @lucasfcosta, I appreciate your time and effort. |
The output of both methods
canCrawl
andcheck
does not match with the result of parsing (parse
) the content of a robot.txt file.Example:
User-Agent: *
Disallow: /page-a
User-Agent: *
Disallow: /page-b
User-Agent: Googlebot
Crawl-Delay: 20
When checking if user-agent Googlebot is allowed to crawl /page-a Robotto provides the following results:
false
false
Results of parse:
Following to how Googlebot handles robots.txt files, both methods
isAllowed
andcheck
should result intrue
.The text was updated successfully, but these errors were encountered: