Back to Tools

Breadcrumb Relevancy Checker

Use cases

Finding products in wrong categories Improving site taxonomy E-commerce category optimisation Internal linking improvement

Uses PolyFuzz TF-IDF fuzzy matching to compare product H1s/titles against breadcrumb category paths.

Filters rows with missing values and calculates similarity scores.

Configurable product URL pattern (default: /product/), category URL pattern (default: /category/), and similarity threshold (0.0-1.0, default 0.3) for flagging potential miscategorisations.

Streamlit App

Platform

Browser-based (no installation required)

Input

Crawl CSV with URLs, titles, breadcrumbs

Rows with missing values filtered automatically

Output

CSV: product URLs, existing breadcrumbs, best matching categories, similarity scores, breadcrumb depth differences, miscategorisation flags.

Launch App View Source

Features

  • PolyFuzz TF-IDF product-to-category matching
  • Similarity threshold slider (0.0-1.0, default 0.3)
  • Configurable product/category URL patterns
  • Breadcrumb depth difference calculation
  • Summary metrics: products analysed, miscategorisations, "all" assignments

How to use

  1. 1 Upload CSV export from Screaming Frog
  2. 2 Map Address, H1/Title, and Breadcrumb columns
  3. 3 Set product URL pattern (default: /product/)
  4. 4 Set category URL pattern (default: /category/)
  5. 5 Adjust similarity threshold for flagging
  6. 6 Review and download results

Let's work together

Monthly retainers or one-off projects. No lengthy reports that sit in a drawer.

Let's Talk