ArchitectureΒΆ
Similarly to Scrapy architecture
- Request (
base.BaseHttpRequest
) from queue (base.BaseQueue
) by engine (base.BaseEngine
) passed to middlewares (base.BaseMiddleware
) and then executed by downloader (base.BaseDownloader
). Downloader can process request in concurrent way and return deferred result (utils.Planned
). - Response (
base.BaseHttpResponse
) passed back to middlewares in reverse order and then to the crawler (base.BaseCrawler
). Crawler can process response in concurrent way and return deferred result like downloader. - Extracted data passed to item pipeline.
- Next requests if they exists will be putted to the queue.