Skip to content

LLM prep panel

Every conversion comes with a compact panel that gets the text ready for a model — entirely locally, with zero AI calls.

  • Token count with tiktoken (o200k_base, baked into the image — works offline).
  • Tokens & cost saved by anonymization, so you can see what stripping PII buys you.
  • Live per-model cost estimate — pricing and context windows pulled from OpenRouter (hundreds of models, cached) so the numbers are never stale.
  • Context-window fit — at a glance, which models the document fits into.
  • One-click RAG chunking — split into overlapping, token-bounded chunks (semchunk), downloadable as .jsonl.
  • Prompt-injection detector — flags text that tries to hijack a downstream LLM.

The panel starts with no models selected — pick the ones you care about and Escriba estimates the cost of sending this exact document to each, using live pricing. It’s the fastest way to answer “which model should I use, and what will it cost?” before you spend a cent.

One click splits the Markdown into token-bounded, overlapping chunks suitable for a retrieval pipeline, downloadable as .jsonl. Useful when the document is larger than your target model’s context window.

All of this runs on your server. No API keys, no external calls — just local math over the text you converted.