Surya OCR toolkit

The Surya OCR toolkit is a suite of models for OCR in 90+ languages, line detection, layout analysis, table recognition, and reading order detection. Surya has 13k Github Stars, is used by hundreds of organizations, and benchmarks well against cloud services.

The Surya models

Here's what Surya can do.

OCR

Surya handles OCR in over 90 languages. See benchmarks.

Line Detection

Surya does state of the art line detection in any language. See benchmarks.

Layout Analysis

Surya does state of the art layout analysis on a range of documents. See benchmarks.

Reading Order

Surya does state of the art reading order detection on a range of documents. See benchmarks.

Table Recognition

Along with tabled, surya does state of the art table detection and recognition on a range of documents. See benchmarks.