Back to Tools

Category Keyword Finder

Use cases

Finding new category opportunities Expanding e-commerce taxonomy Keyword gap analysis for categories

Uses NLTK ngrams utility to extract word sequences from cleaned product titles.

Text preprocessing: lowercase, number removal via regex, punctuation stripping, whitespace normalisation.

Groups products by URL path segment preceding product pattern.

Counts n-gram occurrences across corpus per category.

Configurable n-gram range (min 1-5, max 2-8), minimum product threshold (2-20).

Streamlit App

Platform

Browser-based (no installation required)

Input

Screaming Frog CSV with URL and H1/Title columns

Optional: keyword dataset CSV with search volumes

Output

Excel: all keywords, new opportunities, category summary. Metrics: total keywords, new opportunities count, categories analysed.

Launch App View Source

Features

  • NLTK n-gram extraction
  • Configurable n-gram length: min 1-5, max 2-8 words
  • Minimum product threshold (2-20, default 3)
  • Customisable product/category URL patterns
  • Optional search volume matching
  • UTF-8 and Latin-1 encoding support
  • Three-sheet Excel export via xlsxwriter

How to use

  1. 1 Crawl your site with Screaming Frog
  2. 2 Upload crawl file
  3. 3 Set product URL pattern (default: /product/)
  4. 4 Set category URL pattern (default: /category/)
  5. 5 Configure n-gram length range and minimum products
  6. 6 Optionally upload keyword volume data
  7. 7 Download three-sheet Excel with suggestions

Let's work together

Monthly retainers or one-off projects. No lengthy reports that sit in a drawer.

Let's Talk