The first official alpha version of the Google-sponsored OCRopus scanning software for Linux was released yesterday. OCRopus is built on top of the venerable open-source Tesseract optical character recognition (OCR) engine, originally developed at HP, and is distributed under the Apache License 2.0.
OCRopus uses Tesseract for character recognition but has its own layout analysis system that is optimized for accuracy.
OCRopus sounds like an interesting tool, but it probably doesn’t come close to FineReader, which is the best OCR package I’ve ever used!