Keyword Deduplication Tool
Use cases
Uses rapidfuzz with token_sort_ratio scorer and process.extractOne() to identify near-duplicate keywords at 99-point similarity threshold.
Automatically detects file encoding via chardet (first 100,000 bytes).
Keeps the highest search volume variant and exports both processed and dropped keywords for review.
Uses stqdm for progress tracking.
Platform
Browser-based (no installation required)
Input
CSV or Excel file with keywords and search volumes
Encoding auto-detected via chardet
Output
Excel with deduplicated and dropped keywords
Features
- Rapidfuzz token_sort_ratio with 99-point threshold
- Chardet encoding detection (100KB sample)
- Keeps highest volume variant automatically
- stqdm progress bar integration
- Two-sheet Excel output (xlsxwriter)
- Supports CSV and Excel (.xlsx) input
How to use
- 1 Upload CSV or Excel file with keyword data
- 2 Select keyword column and volume column from dropdowns
- 3 Click Dedupe to run rapidfuzz matching
- 4 Review progress via stqdm progress bar
- 5 Download Excel with two sheets: Processed Keywords and Dropped Keywords
Want me to run this for you?
I offer this as a managed service. You get the insights without touching the tool.
Related Tools
Bulk Keyword Tagger
Keyword ResearchTag keywords using substring matching against up to 7 classification columns.
eBay Related Searches
Keyword ResearchTwo-level eBay related search scraping with ECharts tree visualisations.
Keyword Difficulty Checker
Keyword ResearchAssess keyword difficulty using allintitle, phrase match, and SERP clustering.
Let's work together
Monthly retainers or one-off projects. No lengthy reports that sit in a drawer.
Let's Talk