Ocropus Windows Installer

5/9/2019by admin

I've used SimpleOCR, which has a nice GUI for correcting mistakes. Unfortunately it makes a lot of mistakes! (and suffers other bugs and limitations)

On the other hand Tesseract is more accurate but has no GUI at all.

Google releases OCRopus an 'open-source scanning software' The first official alpha version of Google's OCRopus scanning software for Linux was released.

My question is, is there a free OCR program for Windows which has a nice GUI and a low error rate? I want it to highlight suspect words (by OCR uncertainty, not just spell checking) and show the original (bitmap) word while I'm editing the OCRed word similar to what SimpleOCR does.

Open-source would be best, followed by freeware, then trial / demo / crippleware a long way behind.

Hugh Allen
Hugh AllenHugh Allen

3 Answers

Have you tried gimagereader , a gui front for Tesseract ?

ukanthukanth

OCRopus:

The software is partly based on Tesseract, the best open source OCR engine available for now. While the project is expected to be released at the end of next year and will be used for Google's book scanning project, the team has some interesting applications in mind:

  • a web service interface
  • PDF, camera, and screen OCR
  • integration with desktop search tools: Beagle, Spotlight, Google Desktop

OCRopus(tm) is a state-of-the-art document analysis and OCR system, featuring pluggable layout analysis, pluggable character recognition, statistical natural language modeling, and multi-lingual capabilities.

The OCRopus engine is based on two research projects: a high-performance handwriting recognizer developed in the mid-90's and deployed by the US Census bureau, and novel high-performance layout analysis methods.

OCRopus is development is sponsored by Google and is initially intended for high-throughput, high-volume document conversion efforts. We expect that it will also be an excellent OCR system for many other applications. Links:

GOCR is an OCR (Optical Character Recognition) program, developed under the GNU Public License. It converts scanned images of text back to text files. Joerg Schulenburg started the program, and now leads a team of developers. GOCR can be used with different front-ends, which makes it very easy to port to different OSes and architectures. It can open many different image formats, and its quality have been improving in a daily basis.

Links:

Krazy_KaosKrazy_Kaos

Windows Installer Package

There is also TOPOCR (a.k.a SnapReader), containing post-processing Spell Checker for 11 Languages:

SnapReader can be used to make your own searchable notes from almost any document image. Or you can use it as an an authoring tool and create your own editable content using your scanner or camera and save the results as HTML or PDF. SnapReader can also transform text into very high quality audio using Audrey. So not only can you use your scanner or camera to capture documents, you can now also use your portable music player or smartphone to 'read' them.

Ocropus Windows Installer
harrymcharrymc

Not the answer you're looking for? Browse other questions tagged windowsguiocr or ask your own question.

I am trying to create an answer paper marking (multiple choice question) python application. The answer sheet will be scanned into image file (gif,png,jpg,whichever format is needed).

My App has access to the database where all the answers are stored.

So,all it need is some kind of data from scanned image so that it can compare the answer and calculate the marks.

The answer sheet has fixed dimensions with the table format like this ( Answers will be marked by 'X' by the candidate to indicate their answers):

After searching through the internet, i found that there are a few OCR APIs available.

First one is Pytesser . It is very easy to use and the results are quite okay. But it only work for the images with just pure texts. So, i think it is not suitable.

The second one i found is Ocropus. It seems powerful but in it's documentation

Windows

OCRopus relies a lot on POSIX path names and file systems. You may be able to install OCRopus on Windows using . An easier way is to install VirtualBox and run OCRopus in Ubuntu under VirtualBox.

So i think it is mostly for linux. I could not find a detail installation guide for window platform. ( I am a beginner, so i could be wrong)

The third one i found is python-tesseract , a wrapper for Tesseract OCR. In their page, the installation guide was provided. Basically, i need,

  1. python-tesseract-win32.deb
  2. python-opencv
  3. numpy

but i have no clue on how to install .deb files on window. I have the opencv and nampy already installed.

So the following are my questions:

(1) In which way can i convert the table image into processable data(is it even possible?)?

(2) Is there any other useful OCR APIs that i have not mentioned here that could be helpful?

(3) Finally, (my silly idea) Is it possible to split the image into small chucks(based on the size of the table cells - since the table dimensions are known) using PIL and then use pytesser to convert each small images into text, thereafter process the data accordingly?

FYI: I only need it for Windows Platform, possibly for windows xp 32 bits. I am using python 2.7.5.

pythonpython-2.7python-imaging-libraryocrtesseract
Chris AungChris Aung

1 Answer

Answers correspond to your numbers

1) OCR is in general very hard, but (good news for you) for test score processing, I think it is nearly a solved problem. In this vein there are tried and true solutions for such problems. School systems have been doing this to automate grading 'scantron' tests for years, so if you have access to such resources going that route might be your best bet. At least you should check how they do it

2) I am sure there are others, but those are the main free ones I know of

3)a I think if you are trying to do this on a budget and time is less an issue, your 'silly' idea is actually not silly at all. It might be the best way to do it, and it is likely that the scantron test graders use a similar method. You know the exact dimensions of the test form. You can know the direct pixel mapping of where to look. You could use pytesser very easily. Keep in mind that pytesser sometimes needs you to resize the image (sometimes up, sometimes down) to get the best accuracy.

3)b You might want to consider rolling your own solution. You could use the concept of morphological operations (numpy and other image libraries can do this nearly out of the box). You might not even need these operators and simply do a binary threshold of the table rows (assuming you have already cut the image into table rows) and simply look for blobs and mark the score as coming from the column with the most blob values.

PaulPaul

Not the answer you're looking for? Browse other questions tagged pythonpython-2.7python-imaging-libraryocrtesseract or ask your own question.