Click here to Skip to main content
15,890,399 members
Everything / OCR

OCR

OCR

Great Reads

by Carey Payette
The problem I am looking to solve while evaluating the LEADTOOLS SDK is to use a photo of a hard copy page of text taken with my phone, have it processed and displayed in the document viewer, then have it converted to Word format so that it can be easily edited.
by Chris_Riley
Warewolf is a promising offering to help nimble front-end applications perform heavy backend tasks. This write up is on my experience with the product.
by Dynamsoft
In this article, I’ll share with you how to utilize Dynamic .NET TWAIN in a WPF application to process your documents.
by meconer
Developing a display reader for raspberry pi

Latest Articles

by ToughDev
How to restore text from PDF files Encoded using custom CID fonts
by gbeene
Capture protected document content as text without the need for breaking DRM
by Chris Maunder
Working through the combinations to get PaddlePaddle and PaddleOCR installed and working everywhere
by Michael Haephrati
How to use an OCR SDK using C++ and libCurl

All Articles

Sort by Updated

OCR 

30 Jan 2024 by Member 16192432
I have been working on extracting text from images, specifically focusing on seven-segment fonts, using .NET. Unfortunately, my attempts with popular libraries like Tesseract, IronOcr and many more have been unsuccessful, as they seem to excel...
15 Dec 2023 by Punit Kanet
Hello community, I have been working on extracting text from images, specifically focusing on seven-segment fonts, using .NET. Unfortunately, my attempts with popular libraries like Tesseract and IronOcr have been unsuccessful, as they seem to...
7 May 2023 by ToughDev
How to restore text from PDF files Encoded using custom CID fonts
21 Mar 2023 by gbeene
Capture protected document content as text without the need for breaking DRM
29 Jan 2023 by Nauman Electronics
My input image is this: https://www.linkpicture.com/q/degrecheck.jpg MY CODE Input: MY INPUT CODE MY CODE Output: My CODE Output the upr code is converting input image to text i want to extract only student name portion from this output using...
29 Jan 2023 by Richard MacCutchan
The problem is the same as your previoius question on this subject: How can I extract only name of person and enrolment number of certificate make string and convert into hash. I have write a code but it is extracting full certificate[^].
23 Jan 2023 by Top 10 awesome (Top 10 real facts)
I am looking for a way to extract images from within another image. For example: A picture taken with a phone of a paper that has text at the top, an image of a camera(the image changes), and an image of a qr code, and other random images. Is...
21 Nov 2022 by Chris Maunder
Working through the combinations to get PaddlePaddle and PaddleOCR installed and working everywhere
11 Nov 2022 by Richard MacCutchan
Take a look at CodeProject.AI Server: AI the easy way.[^].
6 Nov 2022 by Nauman Electronics
here is my code: from turtle import width import pytesseract #pip install tesseract import os from PIL import Image pytesseract.pytesseract.tesseract_cmd = r"C:\Program Files\tesseract-ocr\tesseract.exe" #Path to the tesseract img =...
29 Oct 2022 by Mike Lud
I am working on making License Plate Module for the below project and I am getting FileNotFoundError: [WinError 2] The system cannot find the file specified when running the project. It works fine using an interactive window Project Link: GitHub...
23 Sep 2022 by M@153
Input image I need to extract CR No.from the sample image above. Using Easyocr, I got the output in complex nested list form. How to update the code to filter out all the detected text/numbers and get only CR No. I tried to fetch the text "CR...
23 Sep 2022 by Richard MacCutchan
Something like: mainlist = [([[212, 26], [314, 26], [314, 50], [212, 50]], 'SCB MEDICAL', 0.998906268787747), ([[36, 56], [84, 56], [84, 80], [36, 80]], '6ein8', 0.19502338570146513), ([[303, 99], [335, 99], [335, 119], [303, 119]], 'OPD',...
1 Aug 2022 by NandaNurdin
I try to call this private void FindLicensePlate( VectorOfVectorOfPoint contours, int[,] hierachy, int idx, IInputArray gray, IInputArray canny, List licensePlateImagesList, List...
1 Aug 2022 by Member 15724888
In my case, I had to removed and add again the reference to the project I was trying to import, in the target project. I think I added the faulty class from an existing file and I am not sure VisualStudio properly added it to the project initially.
1 Jun 2022 by MohammedZr
i'm working on this project the idea of it is that you give the program an image and using OCR in javascript the program well detect or (recognize) a string or a word for example ('رقم العداد') and copies the the number or the integer after the...
11 Aug 2021 by Hossein_Hadi
I wanna design an OCR program. I'll thanks if you offer any way to design this program(Any algorithm)or if have a strong open source library to do this. Thanks for suggestion.
30 Jan 2021 by Michael Haephrati
How to use an OCR SDK using C++ and libCurl
4 Jan 2021 by boredpanda
Hey everyone, i'm beginner at c++ and i want to build an ocr on c++ program and i don't how to start it.. so i need ur help guys(: What I have tried: i opened DLL project and used class c++ item but i don't know how to write the code..
4 Jan 2021 by CPallini
Have a look, for instance, at Tesseract library[^].
4 Jan 2021 by OriginalGriff
I would agree with Richard that this really isn't a job for a beginner, or even close! But ... start reading here: OpenCV: cv::text::OCRTesseract Class Reference[^] but do not expect any of this to be simple or obvious; you are unlikely to find...
4 Jan 2021 by Richard MacCutchan
An OCR program is a major undertaking and requires a detailed knowledge of C++ and image processing. And your comment, "but i don't know how to write the code", makes it clear that you do not yet have sufficient experience of either subject. You...
22 Aug 2020 by Sandeep Mewara
Common causes of it: 1. you might not have added the required using directive 2. you might have forgotten to reference (-reference) the assembly that contains the type 3. there might be an issue with the assembly you are trying to reference Look...
22 Aug 2020 by OriginalGriff
The problem is that it can't find the class - it's not in the same namespace as your current code. That may be it's in a different project in the current solution, in a external DLL, or just in a different namespace in another file in the current...
3 Aug 2020 by gstolarov
A class library and command line utility to add OCR information to image-only PDF files
23 Jun 2020 by Member 10838538
I am having govt pan card and i want to process that to get text details using tesseract and emgucv. My problem is how i will detect angle of text so that i can correct and pass to ocr. Please find code below however it is less accurate. Can...
6 Feb 2020 by Member 14390147
"this type initialize for Emgu.CV.OCR.Tesseract threw an exception". and also i saw this message at the detail " See the end of this message for details on invoking just-in-time (JIT) debugging instead of this dialog box. ************** Exception Text **************...
6 Feb 2020 by Mike Diack
Yes the clue is in: System.BadImageFormatException (aka Windows error code: 0x8007000B ) That is almost always a result of trying to mix 32 and 64 bit code. Processes need to either be all 32 bit or all 64 bit (executable and dlls etc) i.e. am seconding what Richard said.
7 Sep 2018 by Member 13976944
Hello! I am a researcher and I have a database of about 20,000 pages across 700+ pdfs. The pdfs are searchable on a rudimentary level, but I'd need a coding tool (crawler?) that could quickly search through them. Additionally, it'd be great if the software could filter out noise, visualize...
7 Sep 2018 by Gerry Schmitz
You're talking "ETL" (extract; translate; load). You're still only at the "extract" phase; the rest (filter, aggregate, visualize) only comes "after". You need to be more specific about the "content". A "simple" "text" scanner can take a few minutes to develop; and even less to run. (pdfs...
31 Jul 2018 by Nicebrilliant
Find a baseline from arabic text image. What I have tried: I have tried using aforgenet and use pixel logic to find all the black point. Also I tried with histogram and believing if I can find smooth repeated values then i will treat them as baseline but no luck so far.
7 Mar 2018 by Steve Job
I am using google OCR image to text api. Because i want to store food menu in my database. But TextRecognizer can only detect TextBlocks. i want to scan column wise so i can get food dish name and its price same time so it will be easy for me to store it into database. OcrCaptureActivity //...
12 Dec 2017 by Hassan(Aych Jay)
I am trying to perform OCR on the attached sample images. Google OCR + Microsoft OCR have been tried. They work really work on text that is not hand written. However I have to read text from handwritten documents. Sample -1 [^] Sample-2 [^] Technology is not an issue. What I have tried: ...
12 Dec 2017 by CPallini
We have many articles on this topic[^].
24 Nov 2017 by Member 13540153
[^]Kindly I need your highly appreciated support as I've been assigned a task by a co-worker , where I need to remove or bypass the simple captcha in the below screenshot, the website is accessed locally only in Egypt, it requires a user name & password which are: User name: Mahmoud Abdelnaby...
24 Nov 2017 by Patrice T
Quote: where I need to remove or bypass the simple captcha in the below screenshot Captcha is used to ban bots from accessing websites. We won't help you to bypass website security features. If your usage is legitimate, it would be easier to ask owner for an access without Captcha.
13 Feb 2017 by Akarshan Majarjan
If you are using office 2003 and already enabled MODI features then just update office 2003 service pack 3. Here is the linkhttps://www.microsoft.com/en-us/download/details.aspx?id=8It will work fine....
21 Dec 2016 by Member 10062432
for google cloud based OCR check below link:https://developers.google.com/api-client-library/dotnet/apis/vision/v1 Also check this link, I donno if this is also cloud based or offline version.GitHub - A9T9/Google-OCR-Vision-API-CSharp: Test app for the Google OCR features of the Google...
6 Oct 2016 by Member 12759051
I have tried Tesseract OCR with typed text images and it works fine. Below is my code snippet.I want to read handwritten images too. Can someone, who might have achieved the same help me out with it? Or a reference to any other libraries with which I can do it will also...
30 Aug 2016 by Accusoft, Corey Morehead
This whitepaper shows how I used OCR Xpress for Java to OCR a scanned newspaper, redact the key words, and accomplish my goal.
2 Aug 2016 by Chris_Riley
Warewolf is a promising offering to help nimble front-end applications perform heavy backend tasks. This write up is on my experience with the product.
10 Jun 2016 by Derek_Gerber
29 May 2016 by meconer
Developing a display reader for raspberry pi
20 May 2016 by Utkarsh Sharma
i want to make an app in android.. using optical character recognition so i can convert camera click image into text and save it into txt file and upload it into database. But i dont know where i have to start because i never used optical character recognition.. So anyone can guide me or provide...
1 Apr 2016 by Carey Payette
The problem I am looking to solve while evaluating the LEADTOOLS SDK is to use a photo of a hard copy page of text taken with my phone, have it processed and displayed in the document viewer, then have it converted to Word format so that it can be easily edited.
29 Jan 2016 by Chris_Riley
In this post I will explain how quality imaging is the most critical element for an application I’m working on to support both OCR and NLP.And how I’m using the MobileImage SDK to do it.
2 Aug 2015 by Kornfeld Eliyahu Peter
There is no need (or place) for client/server separation...These libraries are standalone, the list of supported operating systems has 3 entries for every type of operating systems the library supports. Minimum for desktop, minimum for server and minimum for phone...What you should consider...
2 Aug 2015 by Member 11765988
I need to use Microsoft OCR in my standalone Winforms desktop application. Will it work with following software?Operating system= 8.1Winforms in VS 2013Their specifications states following requirements:Client = Windows 8.1Server = Windows Server 2012 R2Phone = Windows Phone...
27 Jul 2015 by Member 11765988
Please suggest a tool to read text/words inside an image. For example, I have an image.jpg file having words in separate lines as below: 1) Line 1 has word "Blind Spot"2) Line 2 has word "fix match"3) Line 3 has word "message"I want coordinates of the words on different lines....
13 Jul 2015 by Dharmesh .S. Patil
Step to solve error for Windows Vista or Windows 7:1. Click Start, type program and features in the Search box, and then press ENTER.2. Click to select the Office 2007 version that you have installed.3. Click Change.4. Click Add or Remove Features, and then click Continue.5. Expand...
11 Jul 2015 by salam_dust_man
hi everyone;i'm looking for source code or sample that get printer data before print (capture printer output) put it in variable and change some values then print it in form.any idea how to get data and put it into variable for special purpose.
10 Jul 2015 by Dave Kreskowiak
Well, you're going to have to write your own Notepad then."Intercepting" the print job and editing it is an extraordinarily complex way to do what amounts to Mail Merge.The application describes what the page image looks like to the print driver. The driver takes that "image" and...
23 Jun 2015 by Lakhan Anjana
you need to find some OCR API for That.this is the only one way.
23 Jun 2015 by srinivas vanka
scanned pdf as input and want to get normal searchable pdf using asp.net.Any suggestions .please..
19 May 2015 by Richard MacCutchan
See https://www.google.com/search?q=itextsharp[^].
18 May 2015 by Member 11564708
Hi,in my application we need to implement one page which should take input as pdf that pdf values should fill in our pageFor example In pdf we are having a table with some key value pairs likeName:Kishor NaikThis name should come and fill in a text boxHow we can achieve...
1 May 2015 by DaveAuld
I had a quick look at Tesseract search results and it appears that the OCR engine supports AMD GPUs.Therefore in theory it may be possible to speed up processing by;a) investing in a higher specification GPUcoupled with;b) higher performance CPU.Do some research, ask some questions...
1 May 2015 by Mehdi Gholam
1) Contact the library authors for performance tweaks.2) Run on faster hardware.
1 May 2015 by LEADTOOLS Support
LEADTOOLS includes a robust Check Scanning and Processing SDK within its document family of imaging toolkits, and is already being used by a number of applications across banking, insurance and retail industries.
18 Apr 2015 by shivmymail
During processing of tiff files, which are having 600 - 700 pages from Tesseract OCR engine with hocr option, we monitored that files are taking around 40 - 50 minutes.We monitored that it is so much time for processing large files.Do we have any way to speed up the process?Following...
31 Mar 2015 by BacchusBeale
Have a look at this OCR project:https://code.google.com/p/tesseract-ocr/[^]I used it once and it worked well.
31 Mar 2015 by MRTgang
Wich is the best and fastest way to have an OCR for recognizing some handwriting characters ?can i just modify an existant OCR project ?
2 Mar 2015 by LEADTOOLS Support
In this whitepaper, we will explore the inner-workings of this innovative framework and highlight how developers can write just a few lines of code to extend and customize it for their application.
24 Feb 2015 by dsanthosh
Did you get Solution for your problem for your issue ?
7 Feb 2015 by Jim Meadors
This project PDF Viewer Control Without Acrobat Reader Installed[^] on CP includes tesseract and code to use it. The article doesn't go into it in much detail, you'll have to dig into the code but not that difficult.
7 Feb 2015 by Member 11435748
ou please provide me steps to recignize english charaters from a image using tesserect. And find the recognition rate.
3 Feb 2015 by Xiao Ling
Open source library? I'll choose Tesseract OCR Check the .NET wrapper for tesseract-ocr:https://github.com/charlesw/tesseract[^]You can also read the article How to Build Tesseract OCR Library on Windows[^] to get familiar with the library.
2 Feb 2015 by Liju Sankar
Hi Hossein_Hadi,Please check the following links A C# Project in Optical Character Recognition (OCR) Using Chain Code[^]Open Source OCR SDK 1 : tesseract-ocr (code.google.com)[^]Open Source OCR SDK 2 : GOCR (sourceforge.net)[^]Similar thread in Code Project : Best Open -...
1 Feb 2015 by LEADTOOLS Support
In this white paper we will show how to set up a master form template then extract the data from a filled invoice.
29 Dec 2014 by Xiao Ling
Try this .Net wrapper[^] which recommended by Tesseract-OCR[^]. You can find an excellent OCR web demo[^] inside.
29 Dec 2014 by Xiao Ling
You can download this Tesseract .Net wrapper: https://github.com/charlesw/tesseract[^], and run the OCR sample inside. More Tesseract wrapper links could be found on Tesseract-OCR addons[^]
27 Nov 2014 by Kandiya
How to read text from the selected Image area using MODI OCR?I am able to extract the entire text from the image. But I want to read the text from the selected image area using MODI OCR.My code is as follows: MODI.Document md = new MODI.Document(); ...
26 Nov 2014 by Kandiya
Why I am getting OCR Running error on implementing MODI OCR on a text written in mspaint ?I have saved the image as TIFF. Otherwise if its simply extracting from a scanned image its working fine. I am using MODI OCR tool.
26 Nov 2014 by Kandiya
How to get the region of an image and extract the text in the same using MODI OCR?I am able to extract the entire text from a scanned image using MODI OCR. But I want to get the region of an image and extract the text in the same. My code is as follows:private string...
25 Nov 2014 by Kandiya
Why I am getting the error "Retrieving the COM class factory for component with CLSID {} failed due to the following error" on using MODI OCR?I am extracting an image on a scanned document using Microsoft Office Document Imaging. I am implementing using VS 2008 and C#3.5.I added the dll...
25 Nov 2014 by Kandiya
Why Tesseract OCR is not working in Asp.net with C#3.5? I am able to extract the text from the scanned image in c# application. But while using in web application the same is not working. Here OCR.Init method its not working in Asp.net but in windows application there are no such issues.I have...
24 Nov 2014 by Kandiya
How to extract the text of the selected region in a scanned jpg image in C# using OCR?
24 Nov 2014 by Kandiya
Why I am not able to recognize text in image using Tesseract? On debuggin the code after Init case the application isn't working. Required dll is also added. The exe and testdata folder compatible is set in the bin/debug folder. I am using C#3.5 too. Did so following below...
23 Nov 2014 by Kandiya
How to do the OCR using Abby Fine Reader? I want the source code related to the same. I am using Asp.net and C# 3.5. I want to do the OCR in images having Typed Texts and handwritten texts
20 Nov 2014 by Afzaal Ahmad Zeeshan
There ain't any list, since there are many libraries and tools already developed and available for you to use, you won't be able to differentiate between any of the library.What you should do is, check out this MSDN's OCR reader[^] by Microsoft. It is available in C#, C++ and JavaScript....
20 Nov 2014 by Mathew Soji
Every tool will have its on plus and minus .Research a bit on each and see which suits your requirement . Then post question here so that some here can guide. My suggestion is to check the below link for a better start .https://www.google.co.in/?gws_rd=ssl#q=OCR+tools+%2BAsp.Net+%2B+C%23.[^]
20 Nov 2014 by Kandiya
what are the list of fine reader tools available in OCR Technlogy? I am using OCR,Asp.Net and C#.
14 Nov 2014 by Sergey Alexandrovich Kryukov
I cannot give you a final solution: good OCR is hard to find. Please see my past answers and follow my links, especially to the CodeProject articles: i need an ocr for a camera image capture[^].The solution depends on the difficulty of your problem. It's possible that even a simple solution...
14 Nov 2014 by Member 11120869
Hello I am using visual studio 2010 and developing c# application in which I want to read the characters or say text and speak it through speech synthesis. I tried searching a lot for OCR but not getting proper solution.Please help. I also...
14 Nov 2014 by Afzaal Ahmad Zeeshan
This is the same question[^] you've had asked yesterday (or day before) and I have already posted an answer for reading the characters from an image. You need to use an OCR (Optical Character Recognition; recognizer) to read the characters from an image. That is a sample project[^] I have...
15 Oct 2014 by jase.y.
OCR? Sounds like you need ITextSharp. Check out their SourceFourge page and do some reading up on how to use it. Here's a simple snippet to get you started with extracting some text from a PDF file:itextsharp read pdf file[^]public string ParsePdf(string fileName){ if...
15 Oct 2014 by Member 11156776
I am trying to create the user interface for an educational video library. The videos are housed somewhere else and I want to create a site that will be user friendly and have an extensive search engine, but only for the content covered in the videos. At the moment I am manually tagging each...
23 Sep 2014 by LEADTOOLS Support
Why implementing Optical Character Recognition with Leadtools ODR SDKs is the faster, more accurate choice.
9 Sep 2014 by Gihan Liyanage
See this sample OCR applicationCreating Optical Character Recognition (OCR) applications using Neural Networks[^]
9 Sep 2014 by Member 11069130
how can I take a text from image I have code in ocr but it's not run it's broken in the linemodiDocument.OCR(MiLANGUAGES.miLANG_ENGLISH);it's all my codeprivate void button1_Click(object sender, EventArgs e){ //string filePath = Server.MapPath("~/Uploads/" +...
5 Sep 2014 by Dave Kreskowiak
Uhhh....no.CAPTCHA's are there to prevent you from doing what you're asking about ---- FOR A REASON.
5 Sep 2014 by Anderson Rissardi
I need to create an OCR to solve captchas. can anyone direct me to a free library or free component that I can use?
12 Aug 2014 by HanMoLingfeng
In my project,we need use OCR ,so I download some traineddata files from googole'code.But the result which they callback for me almost wrong.So I think I need do my traineddata.Why this traineddata which was created by myself does not work on android?Thx your help.Add: Thx to Richard...
15 Jul 2014 by Richard MacCutchan
Start your journey at https://www.google.com/search?q=ocr%20ios[^]; the second link includes source code.
15 Jul 2014 by jagds
Hello,I had searched on google last two days, and few info relates to achieve OCR implementation. Neural networks algorithm, text recognition will help to achieve OCR. But confusion going on that how to implement Neural Network in Objective C. Also read some where that need to apply...
30 Jun 2014 by Xiao Ling
change Me.dynamicDotNetTwain1.OCRLanguage = "English" to Me.dynamicDotNetTwain1.OCRLanguage = "eng"
30 Jun 2014 by Bernhard Hiller
Not sure, but that looks promissing:dynamicDotNetTwain1.CurrentImageIndexInBuffer in the top lines vs. dynamicDotNetTwain1.CurrentSelectedImageIndicesInBuffer in the last line.
30 Jun 2014 by Jocelyne El Khoury
i have the below code: ddlResultFormat.Items.Add("Text") dynamicDotNetTwain1.LoadImage("C:\Users\elj\Desktop\3b072e9b-76b4-4776-bd3c-5984074d8ecd.png") Dim r = dynamicDotNetTwain1.GetImage(dynamicDotNetTwain1.CurrentImageIndexInBuffer).Width.ToString() Dim l =...