UNDER DEVELOPMENT
Node runner is a scrapy command designed to manage scrapy spiders via api.
This tool aims to:
- easily launch multiple scrapy spiders on the node
- expose api wich can be used by external scheduler
- support graceful shutdown processes to ensure data integrity and minimal disruption
- synchronizing its configuration with ZooKeeper, allowing seamless integration and accessibility by external scheduler
This command is supposed to used with scrapy-node-operator component which is under development now.
- start docker compose
- install deps with
poetry install - go into scrapy project with
cd example - start scrapy node with
scrapy node - send
{"id": "uniq-id-1", "spider": "quotes"}tohttp://localhost:8000/start
Note: This document is subject to further updates.