
Archive.org Broken Link Mapper

Use cases

  • Recovering lost link equity
  • Finding broken backlink targets
  • Migration redirect mapping
  • Historical URL recovery

The script downloads historical URLs from the Wayback Machine and compares them with your current crawl to find URLs that no longer exist. It then fetches the H1 from the archived version of each missing URL and uses PolyFuzz TF-IDF matching to suggest a redirect target among your live pages.
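The comparison step can be sketched roughly as below. This is a minimal illustration, not the script's actual code: the function names are hypothetical, the query parameters assume the public Wayback CDX endpoint, and the normalization rules (ignore scheme, port, "www." and trailing slash) are an assumption about what counts as "the same URL".

```python
from urllib.parse import urlencode, urlsplit

# Public Wayback Machine CDX endpoint (the parameter choice below is an assumption).
CDX_ENDPOINT = "https://web.archive.org/cdx/search/cdx"


def build_cdx_query(domain):
    """Build a CDX query URL listing archived URLs for a domain (hypothetical helper)."""
    params = {
        "url": f"{domain}/*",        # every path under the domain
        "output": "json",
        "fl": "original",            # return only the original URL field
        "collapse": "urlkey",        # one row per distinct URL
        "filter": "statuscode:200",  # skip captures of redirects and errors
    }
    return f"{CDX_ENDPOINT}?{urlencode(params)}"


def normalize(url):
    """Reduce a URL to host + path so http/https, ports and 'www.' do not matter."""
    parts = urlsplit(url.strip())
    host = parts.hostname or ""      # lowercased, without the port
    if host.startswith("www."):
        host = host[4:]
    path = parts.path.rstrip("/") or "/"
    return f"{host}{path}"


def find_missing(archived_urls, crawled_urls):
    """Return archived URLs whose normalized form is absent from the current crawl."""
    live = {normalize(u) for u in crawled_urls}
    missing = {}
    for url in archived_urls:
        key = normalize(url)
        if key not in live:
            missing.setdefault(key, url)  # keep the first spelling of each URL
    return sorted(missing.values())


archived = [
    "http://example.com:80/old-page/",
    "https://www.example.com/about",
    "http://example.com/old-page",
]
crawled = ["https://www.example.com/about/"]
print(find_missing(archived, crawled))
```

Normalizing before the set difference matters because archived captures often differ from crawled URLs only by scheme, port, or trailing slash, which would otherwise inflate the list of "missing" pages.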

Platform

Python script (requires Python 3.x)

Input

Screaming Frog crawl (internal_html.csv)
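Reading the crawl export might look like the sketch below. The column names "Address" and "H1-1" are assumptions about the Screaming Frog export layout, and load_crawl is a hypothetical helper; check the header row of your own file before relying on them.

```python
import csv
import io


def load_crawl(file_obj, url_column="Address", h1_column="H1-1"):
    """Map each crawled URL to its H1 (column names are assumed, not confirmed)."""
    pages = {}
    for row in csv.DictReader(file_obj):
        url = (row.get(url_column) or "").strip()
        if url:  # skip rows without a URL
            pages[url] = (row.get(h1_column) or "").strip()
    return pages


# Small in-memory stand-in for internal_html.csv.
sample = io.StringIO(
    "Address,H1-1\n"
    "https://example.com/,Home\n"
    "https://example.com/about,About Us\n"
)
print(load_crawl(sample))
```

With a real export you would pass an open file instead, for example open("internal_html.csv", newline="", encoding="utf-8-sig"), where the encoding is chosen to tolerate a byte-order mark.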

Output

CSV with archive URLs, H1s, similarity scores, and suggested redirect targets.


Features

  • Wayback Machine integration
  • H1 extraction from archives
  • PolyFuzz redirect mapping
  • Multi-threaded processing
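As an illustration of how H1 extraction and multi-threading could fit together, here is a standard-library sketch. It is not the script's implementation: H1Parser, extract_h1 and fetch_h1s are hypothetical names, and the fetch callable is supplied by the caller (in practice it would download the archived HTML for a URL).

```python
from concurrent.futures import ThreadPoolExecutor
from html.parser import HTMLParser


class H1Parser(HTMLParser):
    """Collect the text of the first <h1> element only."""

    def __init__(self):
        super().__init__()
        self._in_h1 = False
        self._done = False
        self._parts = []

    def handle_starttag(self, tag, attrs):
        if tag == "h1" and not self._done:
            self._in_h1 = True

    def handle_endtag(self, tag):
        if tag == "h1" and self._in_h1:
            self._in_h1 = False
            self._done = True  # ignore any later <h1> elements

    def handle_data(self, data):
        if self._in_h1:
            self._parts.append(data)

    @property
    def h1(self):
        # Join text fragments and collapse runs of whitespace.
        return " ".join("".join(self._parts).split())


def extract_h1(html):
    """Return the first H1's text, or an empty string if there is none."""
    parser = H1Parser()
    parser.feed(html)
    parser.close()
    return parser.h1


def fetch_h1s(urls, fetch, max_workers=8):
    """Fetch pages concurrently with the caller's fetch(url) -> html function."""
    urls = list(urls)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        pages = pool.map(fetch, urls)  # results come back in input order
        return {url: extract_h1(html) for url, html in zip(urls, pages)}


# Stand-in for network access: a dict lookup instead of an HTTP request.
fake_archive = {
    "/old-shoes": "<html><body><h1>Red <em>Running</em> Shoes</h1></body></html>",
    "/old-about": "<html><body><p>No heading here</p></body></html>",
}
print(fetch_h1s(fake_archive, fake_archive.get, max_workers=2))
```

Threads suit this step because the work is dominated by waiting on HTTP responses; keep the worker count modest so the archive is not hammered with requests.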

How to use

  1. Place internal_html.csv in the script folder
  2. Configure threads and user agent
  3. Run the script
  4. Review redirect suggestions sorted by similarity
  5. Export CSV with redirect mappings
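To give a feel for what "sorted by similarity" means in steps 4 and 5, the sketch below scores archived H1s against live H1s with character-trigram TF-IDF and cosine similarity. It is a dependency-free stand-in for the idea only; the tool itself uses PolyFuzz, whose tokenization and weighting may differ, and all function names here are hypothetical.

```python
import math
from collections import Counter


def ngrams(text, n=3):
    """Character n-grams of a lowercased, whitespace-normalized string."""
    cleaned = " ".join(text.lower().split())
    if not cleaned:
        return []
    padded = f" {cleaned} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def tfidf_vectors(texts, n=3):
    """L2-normalized TF-IDF vectors over character n-grams (smoothed IDF)."""
    counts = [Counter(ngrams(t, n)) for t in texts]
    doc_freq = Counter(gram for c in counts for gram in c)
    total = len(texts)
    vectors = []
    for c in counts:
        vec = {g: tf * (math.log((1 + total) / (1 + doc_freq[g])) + 1)
               for g, tf in c.items()}
        norm = math.sqrt(sum(w * w for w in vec.values())) or 1.0
        vectors.append({g: w / norm for g, w in vec.items()})
    return vectors


def cosine(a, b):
    """Dot product of two normalized sparse vectors."""
    if len(a) > len(b):
        a, b = b, a
    return sum(w * b.get(g, 0.0) for g, w in a.items())


def suggest_redirects(missing, live):
    """missing/live map URL -> H1. Returns (old_url, target_url, score) rows,
    highest similarity first; target_url is None when nothing overlaps."""
    missing_items, live_items = list(missing.items()), list(live.items())
    vectors = tfidf_vectors([h for _, h in missing_items] +
                            [h for _, h in live_items])
    missing_vecs = vectors[:len(missing_items)]
    live_vecs = vectors[len(missing_items):]
    rows = []
    for (old_url, _), old_vec in zip(missing_items, missing_vecs):
        best_url, best_score = None, 0.0
        for (live_url, _), live_vec in zip(live_items, live_vecs):
            score = cosine(old_vec, live_vec)
            if score > best_score:
                best_url, best_score = live_url, score
        rows.append((old_url, best_url, round(best_score, 3)))
    return sorted(rows, key=lambda row: row[2], reverse=True)


missing = {
    "https://example.com/old-red-shoes": "Red Running Shoes",
    "https://example.com/gone": "",
}
live = {
    "https://example.com/shoes/red": "Red Running Shoes for Men",
    "https://example.com/contact": "Contact Us",
}
for row in suggest_redirects(missing, live):
    print(row)
```

Rows with a low score, or with no target at all, are the ones worth checking by hand before they go into a redirect map.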
