How to read old & handwritten Indian land records (when OCR fails)
A practical guide for genealogists, property lawyers, and archivists.
If you have ever tried to digitize a 1950s khasra/khatauni, a faded mutation entry, a hand-written 7/12 extract, or a temple register in old Devanagari, you already know the problem: you photograph the page, run it through an OCR app, and get back a wall of nonsense. Generic OCR was built for crisp, printed, modern type. It collapses on exactly the documents that matter most to Indian families and property offices — the old, faded, damaged, hand-written ones.
Why generic OCR fails on these pages
Tools like Tesseract and phone-scanner OCR work character-by-character. They assume even lighting, straight lines, standard fonts, and clean contrast. Old records violate every assumption: ink has faded unevenly, paper is stained or torn, the script is cursive Devanagari or a regional hand (Modi, old Bengali, Tamil), and the page is photographed at an angle on a phone. The OCR engine has no way to reason about what a smudged character probably is from the surrounding words.
What works instead: reconstruction, not recognition
Modern multimodal AI reads a page the way a skilled archivist does — holistically. It uses context to infer the most likely reading of a damaged letter, recognizes that a column of numbers is an area measurement, and knows that “ग्राम” is followed by a village name. Instead of recognizing isolated glyphs, it reconstructs the text. That is the difference between garbage and a usable transcription.
A simple workflow
1. Photograph well enough. Flatten the page, get even light, fill the frame. It does not need to be perfect. 2. Upload to a tool built for old documents — one that transcribes in the original script and flags what is truly illegible rather than inventing text. 3. Get both the original script and a translation so you can verify against the source. 4. Always cross-check legally significant fields (names, survey numbers, areas, dates) against the original image.
Preserve, then search
Once a page is clean text, it becomes searchable, copy-able, and durable. A box of crumbling records becomes a database you can query. For a genealogist that means finally placing an ancestor in a village; for a property office it means resolving a title chain in minutes instead of days.
Try it on one page free. ScrollReader is tuned specifically for old, faded, and handwritten Indian records and Indic-script manuscripts.
Transcribe a page →