Archive.org Broken Link Mapper
Use cases
Recovering lost link equity Finding broken backlink targets Migration redirect mapping Historical URL recovery
Downloads historical URLs from the Wayback Machine, compares with your current crawl to find URLs that no longer exist, fetches H1s from archived versions, and uses PolyFuzz TF-IDF to suggest redirect targets.
Platform
Python script (requires Python 3.x)
Input
Screaming Frog crawl (internal_html.csv)
Output
CSV with archive URLs, H1s, similarity scores, and suggested redirect targets.
Features
- Wayback Machine integration
- H1 extraction from archives
- PolyFuzz redirect mapping
- Multi-threaded processing
How to use
- 1 Place internal_html.csv in the script folder
- 2 Configure threads and user agent
- 3 Run the script
- 4 Review redirect suggestions sorted by similarity
- 5 Export CSV with redirect mappings
Want me to run this for you?
I offer this as a managed service. You get the insights without touching the tool.
Related Tools
Let's work together
Monthly retainers or one-off projects. No lengthy reports that sit in a drawer.
Let's Talk