Best Open / Closed Source tool to do OCR

Question

0.00/5 (No votes)

See more:

Hi,

I have been assigned a task to read text from (.jpg) image file.

After searching I came to know [Tessnet2] will help me in converting .jpg to .txt, but sample on [Tessnet2] has not helped me a lot.

Kindly help me by showing the ~~write~~ right path in achieving OCR with best accuracy.

Thank you.

Posted 28-May-12 19:15pm

Muthu Nadar

Updated 13-Jun-12 23:44pm

bbirajdar

v2

Add a Solution

5 solutions

Add a Solution

Add your solution here

Treat my content as plain text, not as HTML

Preview 0

…

Existing Members

Sign in to your account

...or Join us

Download, Vote, Comment, Publish.

Your Email
Password
Forgot your password?

Your Email
This email is in use. Do you need your password?
Optional Password

I have read and agree to the Terms of Service and Privacy Policy
Please subscribe me to the CodeProject newsletters

When answering a question please:

Read the question carefully.
Understand that English isn't everyone's first language so be lenient of bad spelling and grammar.
If a question is poorly phrased then either ask for clarification, ignore it, or edit the question and fix the problem. Insults are not welcome.
Don't tell someone to read the manual. Chances are they have and don't get it. Provide an answer or move on to the next question.

Let's work to help developers, not make them feel stupid.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)

Sandeep Mewara · Answer 1 · 2012-05-28T19:24:00

Solution 1

Look here:
OCR with the Tesseract interface[^]
Tessnet2 a .NET 2.0 Open Source OCR assembly using Tesseract engine[^]
Tesseract OCR Library – Successfully compiled in Window [^]

Alternative using Office: How To: Use Office 2007 OCR Using C#[^]

Posted 28-May-12 19:24pm

Sandeep Mewara

SOSSR · Answer 2 · 2012-06-14T09:32:00

Solution 5

If you are running Win 7 or Server 2008 R2, the optional Windows component TIFF iFilter has OCR capabilities. I wrote my OCR option around that and it works really well. Best thing is, it is free if you are running one of these. You may want to investigate that as a possibility.

Posted 14-Jun-12 9:32am

SOSSR

Jαved · Answer 3 · 2012-05-28T20:39:00

Solution 2

Hi Muthukumar,
This question is solved here -
C# OCR (How to Read a single character from image)[^]

Posted 28-May-12 20:39pm

Jαved

Basem.Othman · Answer 4 · 2012-06-13T22:30:00

Although Tesseract is one of the more accurate free OCR engines, the last time I tried it a couple of years ago it was rather inaccurate. After trying some other open source libraries, we faced similar problems with the other free OCR engines and winded up using leadtools that provided faster and more accurate results.

You can see an example in the following article:
Minimum OCR demo

Joshi, Rushikesh · Answer 5 · 2012-06-13T22:43:00

There is no any solution which gives guarantee to read all JPG files as most of OCRs readers are based on preloaded font formats and at time of reading those JPG files must containing text in those supported font-families.

Otherwise all suggested solution provided by others should be used.

I have used few of them long back, and one problem you might faced is concatenation of text from Image where text are printed in various positions.

Thanks
Rushikesh Joshi