Thin Content Detector
Identifies thin, low-quality, or duplicate content across pages. Flags pages below word count thresholds, with low text-to-HTML ratio, or excessive boilerplate.
Install
pip install -r requirements.txtRun
python thin_content_detector.py --urls https://site.com/p1 https://site.com/p2 --min-words 300python thin_content_detector.py --files *.html --output thin_report.xlsxExport
Add --output report.xlsx to save results as a spreadsheet.
| Flag | Description |
|---|---|
--urls | URLs to check. Multiple values allowed |
--files | HTML files. Multiple values allowed |
--min-words | Min words (integer) |
--output | Save as XLSX |
python thin_content_detector.py --helpAnalyze existing content to find what needs updating, merging, or removing. Export results and create a content maintenance plan.
Run before publishing new content to ensure it meets quality thresholds. Fix issues before they go live.
Compare your content against top-ranking competitors. Identify gaps and opportunities to improve.
Combine with other tools for a complete workflow:
Requires: beautifulsoup4, pandas, requests. All included in requirements.txt.
Get all 154 Python SEO tools — $49
One-time payment. Lifetime access. No monthly fees.
Learn 25 tools and get 25% back. Earn from client work and get 50% back.
AAIO Inc — aaioinc.com/tools/thin_content_detector/