Text Chunker
❖ NLP & Text Analysis

Semantic Text Chunker

v1.0 documentation

Split long content into meaningful semantic chunks — not arbitrary word counts. Uses sentence boundaries, heading breaks, and topic shifts.

URL inputFile inputXLSX export
text_chunker.py143 lines5 paramsPython 3.8+
Quick start
1

Install

terminal
pip install -r requirements.txt
2

Run

terminal
python text_chunker.py --url https://example.com/post --chunk-size 200 --output chunks.csv
terminal
python text_chunker.py --file article.txt --method semantic --output chunks.csv
3

Export

Add --output report.xlsx to save results as a spreadsheet.

Parameters
FlagDescription
--urlURL to chunk
--fileText/HTML file
--methodMethod. Options: heading, sentence, semantic
--chunk-sizeTarget words per chunk. Default: 200 (integer)
--outputSave as CSV
help
python text_chunker.py --help
Use cases
Content quality audit
Writer evaluation
Client deliverable

Run across all your blog posts to score quality. Sort by score in the XLSX export, then prioritize rewrites for the lowest-scoring pages.

Before publishing freelance content, run this tool to check quality signals. Use specific metrics as concrete feedback for writers.

Include the analysis in your SEO audit report. Clients appreciate data-backed recommendations over subjective opinions.

Dependencies

Requires: beautifulsoup4, pandas, requests. All included in requirements.txt.

Get all 154 Python SEO tools — $49

One-time payment. Lifetime access. No monthly fees.
Learn 25 tools and get 25% back. Earn from client work and get 50% back.

Get the full toolkit

AAIO Inc — aaioinc.com/tools/text_chunker/