Entity Extractor
Use cases
Extracts named entities using spaCy NLP with selectable models (sm/md/lg for speed vs accuracy tradeoff).
Recognises 11 entity types: PERSON, ORG, GPE, LOC, PRODUCT, EVENT, WORK_OF_ART, LAW, LANGUAGE, NORP, and FAC.
Supports text input, HTML (auto-strips scripts/styles/nav), and batch CSV/Excel processing.
Platform
Browser-based (no installation required)
Input
Text, HTML, or CSV/Excel for batch
Output
Entities by type with frequency counts (CSV)
Features
- spaCy NLP with 3 model sizes (en_core_web_sm/md/lg)
- 11 entity types recognised (PERSON, ORG, GPE, LOC, PRODUCT, etc.)
- HTML parsing with noise removal (scripts, styles, nav)
- Batch processing via CSV/Excel upload
- Text truncation limit (100k chars) for memory efficiency
How to use
- 1 Select spaCy model size based on speed vs accuracy needs
- 2 Paste text/HTML or upload CSV/Excel for batch processing
- 3 Select entity types to extract (filter by PERSON, ORG, etc.)
- 4 Run extraction and review entities grouped by type
- 5 Download full results CSV or aggregated entity counts
Want me to run this for you?
I offer this as a managed service. You get the insights without touching the tool.
Related Tools
Competitor Content Gap Finder
ContentDiscover which descriptive words competitors use in titles that you are missing.
Content Block Extractor
ContentExtract content blocks and XPath patterns using Claude Haiku for template analysis.
Content Consolidation Analyser
ContentFind cannibalising pages by clustering URLs that share SERP overlap.
Let's work together
Monthly retainers or one-off projects. No lengthy reports that sit in a drawer.
Let's Talk