Internet Archive Wayback CDX API
Historical web capture index for domain and URL research.
- slug
- wayback-machine-cdx-api
- priority
- 98
- reviewed
- Apr 24, 2026
How this source is shaped
One of the highest-value sources for OSINT. It enables historical reconstruction of websites, page availability, old redirects, old robots exposure and deleted content context.
- Source type
- Archive
- Access model
- Free
- Pricing model
- Free Public API With Usage Considerations
- API available
- Yes
- Requires account
- No
- Risk level
- Low
- Sensitivity
- Normal
- Integration phase
- Phase 1
- Integration priority
- 98
Review dimensions
Each dimension is graded on a 0–10 scale. The overall score is a weighted aggregate.
Weighted aggregate across the eight review dimensions.
Where this source fits
What analysts use it for, and — just as important — where it does not belong.
- domain_history
- deleted_page_recovery
- redirect_history
- content_change_detection
- evidence_context
- journalists
- seo_analysts
- threat_researchers
- compliance_analysts
- real_time_monitoring
- private_content_recovery
Editorial take
Our qualitative read on the source — tone, framing and trust posture.
This should be one of the first real integrations. It is ethical, explainable, useful for SEO, journalism, compliance and cyber exposure reports.
Integration stance
Build, buy or defer. What shape the product integration would take, and why.
Build a Domain History module: capture timeline, first seen, last seen, status changes, content-type distribution and important archived URLs.
Ethics and compliance
What to handle carefully, and what must not ship without sign-off.
Do not imply archived content is current. Always label timestamps clearly and preserve context.
Respect Internet Archive access patterns and avoid abusive bulk requests.
Metadata
Catalog-side technical footer. Values as recorded in the source row.
- source owner
- Internet Archive
- report module
- domain_history
- integration candidate
- true