Click here to Skip to main content
15,890,512 members
Everything / OCR

OCR

OCR

Great Reads

by Carey Payette
The problem I am looking to solve while evaluating the LEADTOOLS SDK is to use a photo of a hard copy page of text taken with my phone, have it processed and displayed in the document viewer, then have it converted to Word format so that it can be easily edited.
by Chris_Riley
Warewolf is a promising offering to help nimble front-end applications perform heavy backend tasks. This write up is on my experience with the product.
by Dynamsoft
In this article, I’ll share with you how to utilize Dynamic .NET TWAIN in a WPF application to process your documents.
by meconer
Developing a display reader for raspberry pi

Latest Articles

by ToughDev
How to restore text from PDF files Encoded using custom CID fonts
by gbeene
Capture protected document content as text without the need for breaking DRM
by Chris Maunder
Working through the combinations to get PaddlePaddle and PaddleOCR installed and working everywhere
by Michael Haephrati
How to use an OCR SDK using C++ and libCurl

All Articles

Sort by Title

OCR 

29 Jan 2016 by Chris_Riley
In this post I will explain how quality imaging is the most critical element for an application I’m working on to support both OCR and NLP.And how I’m using the MobileImage SDK to do it.
31 Jul 2013 by visnumca123
Hi Friends,while debugging i got the error like this Attempted to read or write protected memory. This is often an indication that other memory is corrupt.This below error code line works for capture screen and do ocr. But dynamically(Means Multiple time capturing window with certain...
31 Jul 2013 by Tadit Dash (ତଡିତ୍ କୁମାର ଦାଶ)
Refer - Attempt to read or write Protected Memory This is often an indicating that other memory is corrupt[^].The answer holds some explanation about the issue. You can also suppress this Exception.Take a look, it might help you.
31 Jul 2013 by Bernhard Hiller
From your desciption, I'd rather suspect that the library is not multithreading-proof. Hence you have to wait for the OCR processing to have finished before you start the next processing. That's a limitation of that library, and your code has to obey to those limitation.
2 Jul 2013 by LEADTOOLS Support
Automatically Classify Scanned Documents using LEADTOOLS Forms Recognition
5 Nov 2012 by Hendypodz
Hello all,I' m Hendy. I'm newbie in neural networks I was trying to develop some ocr project and implement backpropagation method. I have been already sets the train function, but I still confusing about using the output result. How can I use the output to recognize some character? Do I need...
13 Jun 2012 by Muthu Nadar
Hi,I have been assigned a task to read text from (.jpg) image file.After searching I came to know [Tessnet2] will help me in converting .jpg to .txt, but sample on [Tessnet2] has not helped me a lot.Kindly help me by showing the write right path in achieving OCR with best...
28 May 2012 by Sandeep Mewara
Look here:OCR with the Tesseract interface[^]Tessnet2 a .NET 2.0 Open Source OCR assembly using Tesseract engine[^]Tesseract OCR Library – Successfully compiled in Window [^]Alternative using Office: How To: Use Office 2007 OCR Using C#[^]
28 May 2012 by Jαved
Hi Muthukumar,This question is solved here -C# OCR (How to Read a single character from image)[^]
13 Jun 2012 by Basem.Othman
Although Tesseract is one of the more accurate free OCR engines, the last time I tried it a couple of years ago it was rather inaccurate. After trying some other open source libraries, we faced similar problems with the other free OCR engines and winded up using leadtools that provided faster...
13 Jun 2012 by Joshi, Rushikesh
There is no any solution which gives guarantee to read all JPG files as most of OCRs readers are based on preloaded font formats and at time of reading those JPG files must containing text in those supported font-families. Otherwise all suggested solution provided by others should be...
14 Jun 2012 by SOSSR
If you are running Win 7 or Server 2008 R2, the optional Windows component TIFF iFilter has OCR capabilities. I wrote my OCR option around that and it works really well. Best thing is, it is free if you are running one of these. You may want to investigate that as a possibility.
11 Aug 2021 by Hossein_Hadi
I wanna design an OCR program. I'll thanks if you offer any way to design this program(Any algorithm)or if have a strong open source library to do this. Thanks for suggestion.
23 Feb 2012 by Chandrakantt
You can use Tesseract OCR present in 1. http://sourceforge.net/projects/tesseract-ocr/[^]2. http://code.google.com/p/tesseract-ocr/[^]
2 Feb 2015 by Liju Sankar
Hi Hossein_Hadi,Please check the following links A C# Project in Optical Character Recognition (OCR) Using Chain Code[^]Open Source OCR SDK 1 : tesseract-ocr (code.google.com)[^]Open Source OCR SDK 2 : GOCR (sourceforge.net)[^]Similar thread in Code Project : Best Open -...
3 Feb 2015 by Xiao Ling
Open source library? I'll choose Tesseract OCR Check the .NET wrapper for tesseract-ocr:https://github.com/charlesw/tesseract[^]You can also read the article How to Build Tesseract OCR Library on Windows[^] to get familiar with the library.
1 Aug 2022 by NandaNurdin
I try to call this private void FindLicensePlate( VectorOfVectorOfPoint contours, int[,] hierachy, int idx, IInputArray gray, IInputArray canny, List licensePlateImagesList, List...
22 Aug 2020 by OriginalGriff
The problem is that it can't find the class - it's not in the same namespace as your current code. That may be it's in a different project in the current solution, in a external DLL, or just in a different namespace in another file in the current...
22 Aug 2020 by Sandeep Mewara
Common causes of it: 1. you might not have added the required using directive 2. you might have forgotten to reference (-reference) the assembly that contains the type 3. there might be an issue with the assembly you are trying to reference Look...
1 Aug 2022 by Member 15724888
In my case, I had to removed and add again the reference to the project I was trying to import, in the target project. I think I added the faulty class from an existing file and I am not sure VisualStudio properly added it to the project initially.
21 Feb 2011 by Member 7692532
What I'm trying to do is to recognize words from a BMP or preferably directly on screen. I honestly have no idea where to start with this. The best thing I can come up with is to have a preset image and compare it to where it should be on the screen, but that would require a lot of CPU and quite...
21 Feb 2011 by Manfred Rudolf Bihy
Try this CP article for starters: OCR With MODI in Visual C++[^] and here you'll find an open source .NET C++ library: http://www.pixel-technology.com/freeware/tessnet2/[^]Modification:MODI has been deprecated in MS Office 2010. The replacement is OneNote and here is a useful link to...
28 Nov 2011 by dartfrog
Hi all,I am now trying to develop OCR application using MODI. so i want to use together with AxMODI.AxMiDocView controller. how should I do ? I added MODI 12.0 as reference. and I also chose "Microsoft Office Document Imaging Viewer Control 12.0" in toolbox but I cannot see the viewer....
24 Feb 2015 by dsanthosh
Did you get Solution for your problem for your issue ?
1 May 2011 by incredible me
Hi All, I am working on my final year project. It is about CAPTCHA breaking. i.e. To show that captchas are easily broken.For that following steps need to be followed:1. Trace the position of characters in the input image.2. Extract each character. We can also say it as character...
1 May 2011 by Sergey Alexandrovich Kryukov
I don't think CAPTCHA can be easily breakable. At least, it looks like it is completely unbreakable for you, at least at this moment.I think this is because you're trying to apply training-based approach not to regular OCR problem but to CAPTCHA. What training can you apply if CAPTCHA can...
1 Jul 2011 by llovelysoso
i am having the same problem as you mensioned before i determined where is the character in the image and i specified its hight and width all the rest is about how to extract this charcters and recognize it if you found solution pleaze share it with me
1 Mar 2012 by adarsha nayak
helloI am searching for a matlab code to extract the characters from visiting card and save the characters in text file.So that the information of visiting card can be shared and maintained easily. can I get the code to extract features from visiting card?.
18 Aug 2012 by butterflyzx4
I'm doing project where I have to detect text in natural images, after the initial edge map generation stage, the connected component labeling must be done to identify text components. Are there any examples of connected component labeling done for text detection in C#? Please no rude...
18 Aug 2012 by Christian Graus
I assume natural means 'real world'. This is tough and I don't know of anyone doing this without using a prebuilt component to do it. Which usually means a commercial one.
2 Sep 2012 by Ammar_Khaled
There is a toolkit named leadtools that could help you doing your requirements. You can use it to do some processing on the image (i.e. binrize it, remove the unwanted dots from the image, convert it to 1-bit black & white, etc), and then pass it to their OCR engine.You can check their website...
8 Jun 2014 by Pavel Torgashov
The theory of contour analysis and its practical application to image recognition and OCR
10 Mar 2011 by Sri192
Hi,I need to perform OCR on an image. I am using Google's tessnet2 OCR engine for this. I had created a demo test for the google ocr in the past(which was and is working perfectly fine). I used the test demo as a reference to create one of my app's OCR capable, which I successfully was able...
2 Aug 2012 by Dynamsoft
In this article, I’ll show you how to embed document processing, such as image scanning, editing, and uploading, to your .NET application.
26 Apr 2013 by X2040
Hello.I want to enable English Handwriting functionality in my Android app.Does any body know a high accuracy English Handwriting SDK ?
26 Apr 2013 by Sudhakar Shinde
This [SDK ]will be useful for you which offers developers as well as commercial licence.
26 Mar 2012 by Darsh_Basha
i using this code to make (text) image segmentation#include #include #include #include const char *WIN1="Segmented"; // This will be the window nameIplImage* imagen;IplImage* imagen_color;IplImage* smooth;IplImage* threshold;IplImage*...
25 Apr 2014 by mahla.r
from PIL import Imageimg = Image.open('1.gif') # Your image here!img = img.convert("RGBA")width, height = img.sizepixdata = img.load()# Make the letters bolder for easier recognitionfor y in xrange(img.size[1]): for x in xrange(img.size[0]): if pixdata[x, y][0]
8 Oct 2013 by Ron Beyer
I see two problems, first:if (DetectNumbers2(pictureBox6.Image, GetReadColor(int.Parse(label1.Text))) != null) pictureBox11.Image = DetectNumbers2(pictureBox6.Image, GetReadColor(int.Parse(label1.Text)));OCR is expensive, and what you are doing (using GetPixel) is VERY VERY expensive,...
8 Oct 2013 by Member 10323594
Hello, i want to make an image to a text. And i want to find the text from the image (oCr). But it gives an error. "insufficient memory"My code:private Image DetectNumbers2(Image img, Color c) { try { Bitmap bimg = new Bitmap(img); for...
18 Feb 2013 by Sandeep Mewara
It does not work like this here.Here is what is expected of enquirers:1. TRY first what you want to do! You may find that it's not that hard.2. Formulate what was done by you that looks like an issue/not working. Try them and tell if you face issues.Members will be more than happy...
31 Jul 2018 by Nicebrilliant
Find a baseline from arabic text image. What I have tried: I have tried using aforgenet and use pixel logic to find all the black point. Also I tried with histogram and believing if I can find smooth repeated values then i will treat them as baseline but no luck so far.
21 Mar 2023 by gbeene
Capture protected document content as text without the need for breaking DRM
13 May 2014 by OriginalGriff
Good.I'm glad to hear it.Captchas are there to stop people like you from automating operations that the website owner doesn't want automated. Like posting messages for example, to prevent spam bots from flooding his site. Now, I'm not saying that that is what you intend to do, but anything...
21 Mar 2013 by lascuola_marimo
HelloI'd like to know how get text and image from external chat like this:http://imageshack.us/photo/my-images/42/tavoloocr.jpg/Can i use ocr or i must integrate autoit in vb net?Thanks for help and sorry for english
21 Mar 2013 by Dave Kreskowiak
YOu'd have to use OCR to get that. There's a few good articles here on CP that cover it, here[^].
21 Nov 2022 by Chris Maunder
Working through the combinations to get PaddlePaddle and PaddleOCR installed and working everywhere
13 Sep 2012 by chandni0422
Is there any way to determine gujarati characters using Microsoft Office Document Imaging DLL(Dynamic Link library) Please help me even if there is some other way with english to gujarati characters.I m coding in c# Asp.net
13 Sep 2012 by Sergey Alexandrovich Kryukov
I have found that the Gujarati Microsoft Office Language Pack is available here:http://office.microsoft.com/en-us/language-packs/[^].(It was relatively difficult for me to identify it as I never learned to recognize the spelling of the word "Gujarātī" in Gujarati: ગુજરાતી. So, I had to...
13 May 2014 by OriginalGriff
As you were told last time: We do not condone, support or assist in the production of malicious code in any way, form or manner. This is a professional site for professional developers. If you want to know how to create such things, you need to visit a hacking site: but be sure to disable...
4 Jan 2021 by boredpanda
Hey everyone, i'm beginner at c++ and i want to build an ocr on c++ program and i don't how to start it.. so i need ur help guys(: What I have tried: i opened DLL project and used class c++ item but i don't know how to write the code..
4 Jan 2021 by Richard MacCutchan
An OCR program is a major undertaking and requires a detailed knowledge of C++ and image processing. And your comment, "but i don't know how to write the code", makes it clear that you do not yet have sufficient experience of either subject. You...
4 Jan 2021 by OriginalGriff
I would agree with Richard that this really isn't a job for a beginner, or even close! But ... start reading here: OpenCV: cv::text::OCRTesseract Class Reference[^] but do not expect any of this to be simple or obvious; you are unlikely to find...
4 Jan 2021 by CPallini
Have a look, for instance, at Tesseract library[^].
6 Nov 2022 by Nauman Electronics
here is my code: from turtle import width import pytesseract #pip install tesseract import os from PIL import Image pytesseract.pytesseract.tesseract_cmd = r"C:\Program Files\tesseract-ocr\tesseract.exe" #Path to the tesseract img =...
11 Jul 2015 by salam_dust_man
hi everyone;i'm looking for source code or sample that get printer data before print (capture printer output) put it in variable and change some values then print it in form.any idea how to get data and put it into variable for special purpose.
10 Jul 2015 by Dave Kreskowiak
Well, you're going to have to write your own Notepad then."Intercepting" the print job and editing it is an extraordinarily complex way to do what amounts to Mail Merge.The application describes what the page image looks like to the print driver. The driver takes that "image" and...
14 Jan 2014 by ledtech3
A problem I see with that is if it dosen't correctly reconize the input then your data will be wrong. I recently tried to scan data to notepad and it got confused on the font for "k" and other letters and symbols.You could start with something like this to give you an idea on the scale...
14 Jan 2014 by Member 10524244
I perform a lot of data entry at work, which is time consuming and tedius.I want to make OCR software that can recognize recurring forms that I routinely see, for example a tax return. I want the software to recognize numbers in fields/areas I specify and return the numbers to an excel form...
12 Dec 2017 by Hassan(Aych Jay)
I am trying to perform OCR on the attached sample images. Google OCR + Microsoft OCR have been tried. They work really work on text that is not hand written. However I have to read text from handwritten documents. Sample -1 [^] Sample-2 [^] Technology is not an issue. What I have tried: ...
12 Dec 2017 by CPallini
We have many articles on this topic[^].
15 Dec 2023 by Punit Kanet
Hello community, I have been working on extracting text from images, specifically focusing on seven-segment fonts, using .NET. Unfortunately, my attempts with popular libraries like Tesseract and IronOcr have been unsuccessful, as they seem to...
9 Sep 2014 by Member 11069130
how can I take a text from image I have code in ocr but it's not run it's broken in the linemodiDocument.OCR(MiLANGUAGES.miLANG_ENGLISH);it's all my codeprivate void button1_Click(object sender, EventArgs e){ //string filePath = Server.MapPath("~/Uploads/" +...
9 Sep 2014 by Gihan Liyanage
See this sample OCR applicationCreating Optical Character Recognition (OCR) applications using Neural Networks[^]
5 Sep 2014 by Anderson Rissardi
I need to create an OCR to solve captchas. can anyone direct me to a free library or free component that I can use?
5 Sep 2014 by Dave Kreskowiak
Uhhh....no.CAPTCHA's are there to prevent you from doing what you're asking about ---- FOR A REASON.
31 Mar 2015 by MRTgang
Wich is the best and fastest way to have an OCR for recognizing some handwriting characters ?can i just modify an existant OCR project ?
31 Mar 2015 by BacchusBeale
Have a look at this OCR project:https://code.google.com/p/tesseract-ocr/[^]I used it once and it worked well.
23 Nov 2014 by Kandiya
How to do the OCR using Abby Fine Reader? I want the source code related to the same. I am using Asp.net and C# 3.5. I want to do the OCR in images having Typed Texts and handwritten texts
7 Mar 2018 by Steve Job
I am using google OCR image to text api. Because i want to store food menu in my database. But TextRecognizer can only detect TextBlocks. i want to scan column wise so i can get food dish name and its price same time so it will be easy for me to store it into database. OcrCaptureActivity //...
23 Jun 2020 by Member 10838538
I am having govt pan card and i want to process that to get text details using tesseract and emgucv. My problem is how i will detect angle of text so that i can correct and pass to ocr. Please find code below however it is less accurate. Can...
23 Sep 2022 by M@153
Input image I need to extract CR No.from the sample image above. Using Easyocr, I got the output in complex nested list form. How to update the code to filter out all the detected text/numbers and get only CR No. I tried to fetch the text "CR...
23 Sep 2022 by Richard MacCutchan
Something like: mainlist = [([[212, 26], [314, 26], [314, 50], [212, 50]], 'SCB MEDICAL', 0.998906268787747), ([[36, 56], [84, 56], [84, 80], [36, 80]], '6ein8', 0.19502338570146513), ([[303, 99], [335, 99], [335, 119], [303, 119]], 'OPD',...
29 Jan 2023 by Nauman Electronics
My input image is this: https://www.linkpicture.com/q/degrecheck.jpg MY CODE Input: MY INPUT CODE MY CODE Output: My CODE Output the upr code is converting input image to text i want to extract only student name portion from this output using...
29 Jan 2023 by Richard MacCutchan
The problem is the same as your previoius question on this subject: How can I extract only name of person and enrolment number of certificate make string and convert into hash. I have write a code but it is extracting full certificate[^].
24 Nov 2014 by Kandiya
How to extract the text of the selected region in a scanned jpg image in C# using OCR?
26 Nov 2014 by Kandiya
How to get the region of an image and extract the text in the same using MODI OCR?I am able to extract the entire text from a scanned image using MODI OCR. But I want to get the region of an image and extract the text in the same. My code is as follows:private string...
12 Jul 2013 by chady1
How can I develop an app which has Optical Character Recognition and can display the text from an Image? And how can I train it to understand handwritten text from a picture?Thanksps:I know java,android development, c++, c...
16 Aug 2012 by Dynamsoft
In this article, I’ll show you how to convert scanned images to searchable text and PDF files through a web browser.
26 Sep 2013 by Member 10235228
I am trying to implement the OCR Software using the C#; I'm new in this field;Can any body tell me how to make use of Tesseract OCR Engine in C#?
8 Jan 2014 by Dipanki99
https://gist.github.com/yatt/915443[^]
23 Mar 2014 by Xiao Ling
This post introduces an easy way to write OCR program in C#.http://www.codepool.biz/ocr-barcode-twain/twain-sdk/have-fun-with-the-wpf-demo-of-net-twain-5-0.html[^]dynamicDotNetTwainThum.IfShowCancelDialogWhenBarcodeOrOCR = true;dynamicDotNetTwainThum.OCRResultFormat =...
26 Jul 2012 by shah dipen
Hello everyone, I'm currently working on OCR and i'm facing some trouble. IS there any way to merger traindata files for all fonts?
1 Jun 2013 by David Jhones
hi, i am involved with handwritten OCR application. I use MNIST digit database for training process here. I use following code for read pixels from the database and re-create the image. programs doesnt give any error but it gives meaningless image(totally black and blur pixel patters) as output....
1 Jun 2013 by nv3
You are reading the image pixel by pixel into temp and throwing temp away after each single read. Then you call create_image with just the last pixel read. That can't work.You should instead create a buffer of size n_rows * n_cols and read an entire image into that buffer in a single read...
14 Nov 2014 by Member 11120869
Hello I am using visual studio 2010 and developing c# application in which I want to read the characters or say text and speak it through speech synthesis. I tried searching a lot for OCR but not getting proper solution.Please help. I also...
14 Nov 2014 by Afzaal Ahmad Zeeshan
This is the same question[^] you've had asked yesterday (or day before) and I have already posted an answer for reading the characters from an image. You need to use an OCR (Optical Character Recognition; recognizer) to read the characters from an image. That is a sample project[^] I have...
14 Nov 2014 by Sergey Alexandrovich Kryukov
I cannot give you a final solution: good OCR is hard to find. Please see my past answers and follow my links, especially to the CodeProject articles: i need an ocr for a camera image capture[^].The solution depends on the difficulty of your problem. It's possible that even a simple solution...
27 Nov 2014 by Kandiya
How to read text from the selected Image area using MODI OCR?I am able to extract the entire text from the image. But I want to read the text from the selected image area using MODI OCR.My code is as follows: MODI.Document md = new MODI.Document(); ...
30 Jan 2024 by Member 16192432
I have been working on extracting text from images, specifically focusing on seven-segment fonts, using .NET. Unfortunately, my attempts with popular libraries like Tesseract, IronOcr and many more have been unsuccessful, as they seem to excel...
11 Jun 2013 by David Jhones
hi,i am developing English Handwriting OCR. I use Zone based approach for feature extraction. Here I use 64 X 64 images.So i have 64 features for one sample image. My SVM will be Multi-class SVM because i have 52 classes for both capital and simple letters.Here is the format of feature...
14 Jun 2013 by David Jhones
hi all,i found scaling data means , keep all training and testing data within a range such as 0 - 1 or -1 - +1. i will proceed with that. does anyone has any idea or suggestion, please comment it here. thank you
7 Sep 2012 by Dynamsoft
In this article, I’ll share with you how to utilize Dynamic .NET TWAIN in a WPF application to process your documents.
13 Jul 2015 by Dharmesh .S. Patil
Step to solve error for Windows Vista or Windows 7:1. Click Start, type program and features in the Search box, and then press ENTER.2. Click to select the Office 2007 version that you have installed.3. Click Change.4. Click Add or Remove Features, and then click Continue.5. Expand...
13 Feb 2017 by Akarshan Majarjan
If you are using office 2003 and already enabled MODI features then just update office 2003 service pack 3. Here is the linkhttps://www.microsoft.com/en-us/download/details.aspx?id=8It will work fine....
9 Jun 2014 by Sergey Alexandrovich Kryukov
In your question, you did not have an idea who are you talking to and what source code do you mean. Unfortunately, your question is about to be automatically removed, due to some members' abuse reports (unclear or incomplete question).Please as the question again, this time with more detail,...
24 Nov 2017 by Member 13540153
[^]Kindly I need your highly appreciated support as I've been assigned a task by a co-worker , where I need to remove or bypass the simple captcha in the below screenshot, the website is accessed locally only in Egypt, it requires a user name & password which are: User name: Mahmoud Abdelnaby...
24 Nov 2017 by Patrice T
Quote: where I need to remove or bypass the simple captcha in the below screenshot Captcha is used to ban bots from accessing websites. We won't help you to bypass website security features. If your usage is legitimate, it would be easier to ask owner for an access without Captcha.
4 Nov 2013 by Erdinc Ay
Hi,I am new to C++, and need a step-by-step listing of how I can bring the open-source project Tesseract to be included in my linux c++ application (console application).https://code.google.com/p/tesseract-ocr/[^]kind regards,e.a.
4 Nov 2013 by Sergey Alexandrovich Kryukov
If you look at Tesseract "compiling" page, you will find that the step-by-step instructions are started from Linux and are written primarily for this OS:https://code.google.com/p/tesseract-ocr/wiki/Compiling[^].—SA