Click here to Skip to main content
15,881,600 members
Please Sign up or sign in to vote.
1.00/5 (1 vote)
See more:
HI Friends
I have code that returns the text in the image by Tesseract OCR.

But the accuracy of the resulting text is not acceptable at all!
I will attach the my photo. return text is :'EPVWUAG2LOSCBPZAANIZZ10 1'
Please guide me how to get the text more accurately

MY IMAGE t5 — Freeimage.host[^]

What I have tried:

VB
Dim api = OcrApi.Create()
 Using api
 api.Init(Patagames.Ocr.Enums.Languages.English) ' Languages.English)
 Dim plainText As String = api.GetTextFromImage(PictureBox1.Image) '(bmp)
 Dim lines As String() = plainText.Split(New String() {vbLf}, StringSplitOptions.None)
 TextBox1.Text = lines(0)
 End Using
Posted
Updated 2-Dec-22 17:29pm
v2
Comments
PIEBALDconsult 2-Dec-22 19:38pm    
Hire an MBA, they need the work.

1 solution

More accurately? Well, the source image you posted is pretty bad, so I doubt you're going to get acceptable accuracy with ANY OCR package.

Your only options are to try various OCR packages to see if they can do a better job, or write your own OCR library and train it with a fairly large set of images and known return values.

Do I have examples of doing that? No.
 
Share this answer
 
Comments
moshaveran 3-Dec-22 2:18am    
thanks from answer. If you have time, visit my new post and helping me
https://www.codeproject.com/Questions/5348693/Extract-text-from-a-rectangle-area-of-a-PDF

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900