Joaquim Rocha



OCRFeeder is a document layout analysis and optical character recognition system.

Given a set of images, it automatically outlines their contents, distinguishes graphics from text, and performs OCR on the latter. It can generate multiple output formats, the main one being ODT.

It features a complete GTK graphical user interface that allows users to correct any unrecognized characters, define or correct bounding boxes, set paragraph styles, clean the input images, import PDFs, save and load projects, export everything to multiple formats, etc.

OCRFeeder is the most complete Free Software OCR application available today. It is written entirely in Python and was first developed in 2008 as my Master’s thesis project for my Computer Science degree.