It would be useful, especially for QA and page comparison, to have a 'page info' record that lists all the resources loaded from a particular page along with their status codes.
The record can be a resource WARC record that might look as follows (current iteration):
Generate records for each page, containing a list of resources and their
status codes, to aid in future diffing/comparison.
- Generates a `urn:pageinfo:<page url>` record for each page
- Adds POST / non-GET request canonicalization from warcio to handle non-GET requests
- Adds `writeSingleRecord` to WARCWriter
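
To make the diffing/comparison use case concrete, here is a minimal sketch of what a page info payload could contain and how two of them might be compared. The `PageInfo` and `ResourceInfo` shapes and the helper names (`pageInfoUri`, `buildPageInfo`, `diffStatuses`) are hypothetical and only for illustration; they are not the record format or the API from this change.

```typescript
// Hypothetical payload shape for illustration; field names are assumptions.
interface ResourceInfo {
  status: number;
  mime?: string;
}

interface PageInfo {
  pageUrl: string;
  urls: Record<string, ResourceInfo>;
}

// Build the record target URI in the `urn:pageinfo:<page url>` form.
function pageInfoUri(pageUrl: string): string {
  return `urn:pageinfo:${pageUrl}`;
}

// Collect the resources seen while loading one page into a single payload.
function buildPageInfo(
  pageUrl: string,
  resources: Array<{ url: string; status: number; mime?: string }>
): PageInfo {
  const urls: Record<string, ResourceInfo> = {};
  for (const { url, status, mime } of resources) {
    urls[url] = mime === undefined ? { status } : { status, mime };
  }
  return { pageUrl, urls };
}

// Return the resource URLs whose status differs between two payloads,
// including resources present in only one of them.
function diffStatuses(a: PageInfo, b: PageInfo): string[] {
  const onlyInB = Object.keys(b.urls).filter((u) => !(u in a.urls));
  const all = Object.keys(a.urls).concat(onlyInB);
  const changed: string[] = [];
  for (const url of all) {
    if (a.urls[url]?.status !== b.urls[url]?.status) {
      changed.push(url);
    }
  }
  return changed.sort();
}
```

Keying the map by resource URL keeps a comparison between two crawls of the same page down to a simple key-by-key lookup.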
Fixes #457
Non-GET requests can be canonicalized into URLs using the same canonicalization used for generating CDX.
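
As a rough illustration of the idea, the sketch below folds the request method and a form-encoded body into the query string, so that a POST gets a distinct, comparable URL. This is a simplified stand-in, not warcio's implementation: `canonicalizeRequestUrl` is a hypothetical name, the `__wb_method` parameter name is an assumption about the convention in use, and JSON or multipart bodies and URL fragments are not handled.

```typescript
// Simplified sketch (assumption: the body is already form-encoded, e.g. "a=1&b=2").
// GET requests are returned unchanged; any other method is appended as a
// `__wb_method` query parameter, followed by the body parameters if present.
function canonicalizeRequestUrl(
  url: string,
  method: string,
  body: string = ""
): string {
  const upper = method.toUpperCase();
  if (upper === "GET") {
    return url;
  }
  // Use "&" if the URL already has a query string, otherwise start one.
  const sep = url.indexOf("?") >= 0 ? "&" : "?";
  let result = `${url}${sep}__wb_method=${upper}`;
  if (body.length > 0) {
    result += `&${body}`;
  }
  return result;
}
```

With this approach, two POST requests to the same endpoint with different bodies map to different URLs, so each can appear as its own entry in a page's resource list.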