Prerequisites
Import packages and set up client
Upload the document
Define the schema
Parse with Tensorlake
Review the Tensorlake output
Recursive Chunking using Chonkie
Unlike recursive chunking, which slices text by token limits, semantic chunking uses embeddings to detect boundaries where topics naturally shift. This produces higher-quality chunks that are easier to retrieve and align closely with the author’s intent.
Review Chonkie's Semantic Chunks
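When reviewing the chunks, it helps to keep the boundary-detection idea in mind. The following is a minimal sketch of that idea only, not Chonkie's implementation: the bag-of-words `embed` function is a toy stand-in for a real embedding model, and `semantic_split`, `toy_chunks`, and the `0.2` threshold are hypothetical names and values chosen for illustration.

```python
import math
import re
from collections import Counter


def embed(sentence: str) -> Counter:
    """Toy stand-in for an embedding model: a bag-of-words count vector."""
    return Counter(re.findall(r"[a-z]+", sentence.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def semantic_split(sentences: list[str], threshold: float = 0.2) -> list[str]:
    """Open a new chunk wherever adjacent sentences fall below `threshold`."""
    if not sentences:
        return []
    chunks = [[sentences[0]]]
    for prev, curr in zip(sentences, sentences[1:]):
        if cosine(embed(prev), embed(curr)) < threshold:
            chunks.append([curr])    # similarity dropped: treat as a topic shift
        else:
            chunks[-1].append(curr)  # still similar: extend the current chunk
    return [" ".join(chunk) for chunk in chunks]


sentences = [
    "The invoice total is due within thirty days.",
    "Late payment of the invoice total incurs a fee.",
    "Our office dog enjoys long walks.",
    "The dog also enjoys naps in the office.",
]
toy_chunks = semantic_split(sentences)
for chunk in toy_chunks:
    print(chunk)
```

In this toy input the two invoice sentences share enough vocabulary to stay together, while the jump to the office dog shares none and starts a new chunk. A real embedding model would capture the same shift even when the wording differs.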
Looking at semantic_chunks[7].text, we would get: