Transcribe takes images of handwritten documents — letters, court records, journals — and produces a rough first-pass transcription. You verify and finish each page in a side-by-side editor, then export. Pages run in parallel, so a project of hundreds finishes in one batch instead of one at a time.
Free to sign up · pay per page transcribed
Loading...
Transcribe runs three preparation steps on each page: detect where the writing is, decide whether it's one page or two, and divide each page into distinct tiles. The transcription model then reads one tile at a time.
Transcribe sends the image to a computer-vision service, which returns a list of quadrilaterals — one per detected line of writing. These boxes mark where the handwriting is on the page; they are not the final transcription.
For each vertical slice of the image, Transcribe counts how many detected lines pass through it. A single page produces a roughly flat profile. A two-page spread produces a deep valley at the gutter. Six checks must all pass before the image is split; otherwise it is treated as a single page.
Every pixel of the page is assigned to a tile. The number of tiles is chosen so each one contains roughly 150 words, with a maximum of three tiles per page. This ensures full coverage — even ink the line detector missed still reaches the transcription model.
Each tile is sent to the transcription model as a separate image, with metadata about which page and region it came from. The results are stitched back together in order. This method multiplies the tokens the LLM processes per page.
To the Honble the Circuit Court of the District of Columbia
for the County of Alexandria —
Humbly complaining your Orator Alexander Henderson Esq.
sheweth unto your Honours — That a certain John Forbes and
Bennett Forbes, whom your Orator prays may be made
defendants to this Bill of Complaint, are indebted to your
Orator in the sum of Three Thousand dollars or thereabouts
as will appear by your Orators account
Upload a folder of pages and process them as a batch. Pages run in parallel and progress is shown live. Batch transcription is priced lower per page than one-off uploads.
Each page opens in a split-pane editor. Pan and zoom the image on one side, edit the text on the other. Save changes, mark the page verified, or copy the text out.
Image quality is the biggest predictor of how much editing each page will need. A clean 300+ dpi scan typically needs light corrections.
Choose a format, choose a page range, and optionally include the source images. PDF exports include a cover page and one section per page.