Why
Must retrieve the source document for each saved item. Requires robust network behavior and normalization
Description of Done
- Given an item identifier and a URL, the fetcher downloads the document with timeouts and redirects handled
- Compressed responses are supported. Character encoding is detected and normalized to UTF-8
- Robots and common anti-bot headers are respected where feasible
- Failures are mapped to returnable vs non-retryable categories
- Unit tests stub network calls and cover timeouts, redirects, bad certificates, and content encodings
Tasks
Why
Must retrieve the source document for each saved item. Requires robust network behavior and normalization
Description of Done
Tasks