Bug Description
The /wayback/available API endpoint returns inconsistent results depending on whether the URL includes the HTTPS protocol prefix.
Steps to Reproduce
Query the availability API with these three URLs that point to the same content:
http://archive.org/wayback/available?url=https://www.reddit.com/r/duckduckgo/
http://archive.org/wayback/available?url=www.reddit.com/r/duckduckgo/
http://archive.org/wayback/available?url=reddit.com/r/duckduckgo/
Expected Behavior
All three queries should return the same result since they reference the same web content.
Actual Behavior
- Query 1 (with https://): Returns
{"archived_snapshots": {}}
- Query 2 (with www): Returns archived snapshots correctly
- Query 3 (without www): Returns archived snapshots correctly
Impact
This inconsistency causes automated tools to incorrectly identify already-archived content as unarchived, leading to unnecessary re-archival attempts and potential rate limiting issues.
Environment
- API endpoint:
http://archive.org/wayback/available
- Tested on: 2025-09-23
- Affects: Programmatic archive checking tools
Suggested Fix
The API should normalize URLs internally to handle protocol variations consistently.
Bug Description
The
/wayback/availableAPI endpoint returns inconsistent results depending on whether the URL includes the HTTPS protocol prefix.Steps to Reproduce
Query the availability API with these three URLs that point to the same content:
http://archive.org/wayback/available?url=https://www.reddit.com/r/duckduckgo/http://archive.org/wayback/available?url=www.reddit.com/r/duckduckgo/http://archive.org/wayback/available?url=reddit.com/r/duckduckgo/Expected Behavior
All three queries should return the same result since they reference the same web content.
Actual Behavior
{"archived_snapshots": {}}Impact
This inconsistency causes automated tools to incorrectly identify already-archived content as unarchived, leading to unnecessary re-archival attempts and potential rate limiting issues.
Environment
http://archive.org/wayback/availableSuggested Fix
The API should normalize URLs internally to handle protocol variations consistently.