Click here to Skip to main content
15,881,715 members
Everything / OCR

OCR

OCR

Great Reads

by Carey Payette
The problem I am looking to solve while evaluating the LEADTOOLS SDK is to use a photo of a hard copy page of text taken with my phone, have it processed and displayed in the document viewer, then have it converted to Word format so that it can be easily edited.
by Chris_Riley
Warewolf is a promising offering to help nimble front-end applications perform heavy backend tasks. This write up is on my experience with the product.
by Dynamsoft
In this article, I’ll share with you how to utilize Dynamic .NET TWAIN in a WPF application to process your documents.
by meconer
Developing a display reader for raspberry pi

Latest Articles

by ToughDev
How to restore text from PDF files Encoded using custom CID fonts
by gbeene
Capture protected document content as text without the need for breaking DRM
by Chris Maunder
Working through the combinations to get PaddlePaddle and PaddleOCR installed and working everywhere
by Michael Haephrati
How to use an OCR SDK using C++ and libCurl

All Articles

Sort by Score

OCR 

1 Apr 2016 by Carey Payette
The problem I am looking to solve while evaluating the LEADTOOLS SDK is to use a photo of a hard copy page of text taken with my phone, have it processed and displayed in the document viewer, then have it converted to Word format so that it can be easily edited.
21 Feb 2011 by Manfred Rudolf Bihy
Try this CP article for starters: OCR With MODI in Visual C++[^] and here you'll find an open source .NET C++ library: http://www.pixel-technology.com/freeware/tessnet2/[^]Modification:MODI has been deprecated in MS Office 2010. The replacement is OneNote and here is a useful link to...
2 Aug 2016 by Chris_Riley
Warewolf is a promising offering to help nimble front-end applications perform heavy backend tasks. This write up is on my experience with the product.
13 Aug 2012 by Sergey Alexandrovich Kryukov
I found that SWT is available in CCV, an open-source Computer Vision Library:http://libccv.org/doc/doc-swt/[^].You can download the source code and learn how it works, to write it in C# or make a translation.Alternatively, you can compile the code into a native DLL and use it in your...
7 Sep 2012 by Dynamsoft
In this article, I’ll share with you how to utilize Dynamic .NET TWAIN in a WPF application to process your documents.
29 May 2016 by meconer
Developing a display reader for raspberry pi
21 Mar 2023 by gbeene
Capture protected document content as text without the need for breaking DRM
16 Aug 2012 by Dynamsoft
In this article, I’ll show you how to convert scanned images to searchable text and PDF files through a web browser.
26 Feb 2013 by chaau
There is a GOCR[^] library that is released under GNU licence. I have used it and it is great. It is a pure C library, and works in Windows as well. I recommend it.
27 Apr 2013 by Zoltán Zörgő
You could start here: https://code.google.com/p/number-regnization/[^]
8 Oct 2013 by Ron Beyer
I see two problems, first:if (DetectNumbers2(pictureBox6.Image, GetReadColor(int.Parse(label1.Text))) != null) pictureBox11.Image = DetectNumbers2(pictureBox6.Image, GetReadColor(int.Parse(label1.Text)));OCR is expensive, and what you are doing (using GetPixel) is VERY VERY expensive,...
3 Aug 2020 by gstolarov
A class library and command line utility to add OCR information to image-only PDF files
28 May 2012 by Sandeep Mewara
Look here:OCR with the Tesseract interface[^]Tessnet2 a .NET 2.0 Open Source OCR assembly using Tesseract engine[^]Tesseract OCR Library – Successfully compiled in Window [^]Alternative using Office: How To: Use Office 2007 OCR Using C#[^]
14 Jun 2012 by SOSSR
If you are running Win 7 or Server 2008 R2, the optional Windows component TIFF iFilter has OCR capabilities. I wrote my OCR option around that and it works really well. Best thing is, it is free if you are running one of these. You may want to investigate that as a possibility.
2 Aug 2012 by Dynamsoft
In this article, I’ll show you how to embed document processing, such as image scanning, editing, and uploading, to your .NET application.
13 Aug 2012 by butterflyzx4
Hi, I am doing a project where I have to detect text in an image "not OCR", just to detect if text is their and localize the text lines.I know that at first the edge map has to generated using an edge detection algorithm but I want to know if there are any examples of implementation of the...
13 Sep 2012 by Sergey Alexandrovich Kryukov
I have found that the Gujarati Microsoft Office Language Pack is available here:http://office.microsoft.com/en-us/language-packs/[^].(It was relatively difficult for me to identify it as I never learned to recognize the spelling of the word "Gujarātī" in Gujarati: ગુજરાતી. So, I had to...
23 Feb 2013 by Sandeep Mewara
Have a look at these CodeProject articles:Creating Optical Character Recognition (OCR) applications using Neural Networks[^]Unicode Optical Character Recognition[^]
26 Apr 2013 by Sudhakar Shinde
This [SDK ]will be useful for you which offers developers as well as commercial licence.
1 Jun 2013 by nv3
You are reading the image pixel by pixel into temp and throwing temp away after each single read. Then you call create_image with just the last pixel read. That can't work.You should instead create a buffer of size n_rows * n_cols and read an entire image into that buffer in a single read...
10 Nov 2013 by Orcun Iyigun
Yes there is. If you had searched before posting this question, you would have seen this awesome answer from Sergey here on CP:Stroke Width Transform (SWT) in C#[^]And google would return 74.000 results: Stroke width transform in C#[^]Good luck,OI
21 Apr 2014 by thatraja
What's wrong with those 3 engines? Anyway check this for other OCRsComparison of optical character recognition software[^]Check all OCR websites for more details like Documentation, tutorials, forum, etc.,
13 May 2014 by OriginalGriff
Good.I'm glad to hear it.Captchas are there to stop people like you from automating operations that the website owner doesn't want automated. Like posting messages for example, to prevent spam bots from flooding his site. Now, I'm not saying that that is what you intend to do, but anything...
21 May 2014 by Kornfeld Eliyahu Peter
This is probably the best open source library for OCR - https://code.google.com/p/tesseract-ocr/[^]Also Microsoft has an Inkanalyzer class (belongs to WPF) that can help you - http://msdn.microsoft.com/en-us/library/system.windows.ink.inkanalyzer(v=vs.90).aspx[^]
30 Jun 2014 by Xiao Ling
change Me.dynamicDotNetTwain1.OCRLanguage = "English" to Me.dynamicDotNetTwain1.OCRLanguage = "eng"
29 Jan 2016 by Chris_Riley
In this post I will explain how quality imaging is the most critical element for an application I’m working on to support both OCR and NLP.And how I’m using the MobileImage SDK to do it.
30 Aug 2016 by Accusoft, Corey Morehead
This whitepaper shows how I used OCR Xpress for Java to OCR a scanned newspaper, redact the key words, and accomplish my goal.
4 Jan 2021 by Richard MacCutchan
An OCR program is a major undertaking and requires a detailed knowledge of C++ and image processing. And your comment, "but i don't know how to write the code", makes it clear that you do not yet have sufficient experience of either subject. You...
21 Feb 2011 by Member 7692532
What I'm trying to do is to recognize words from a BMP or preferably directly on screen. I honestly have no idea where to start with this. The best thing I can come up with is to have a preset image and compare it to where it should be on the screen, but that would require a lot of CPU and quite...
1 May 2011 by incredible me
Hi All, I am working on my final year project. It is about CAPTCHA breaking. i.e. To show that captchas are easily broken.For that following steps need to be followed:1. Trace the position of characters in the input image.2. Extract each character. We can also say it as character...
1 May 2011 by Sergey Alexandrovich Kryukov
I don't think CAPTCHA can be easily breakable. At least, it looks like it is completely unbreakable for you, at least at this moment.I think this is because you're trying to apply training-based approach not to regular OCR problem but to CAPTCHA. What training can you apply if CAPTCHA can...
1 Jul 2011 by llovelysoso
i am having the same problem as you mensioned before i determined where is the character in the image and i specified its hight and width all the rest is about how to extract this charcters and recognize it if you found solution pleaze share it with me
23 Sep 2014 by LEADTOOLS Support
Why implementing Optical Character Recognition with Leadtools ODR SDKs is the faster, more accurate choice.
1 Mar 2012 by adarsha nayak
helloI am searching for a matlab code to extract the characters from visiting card and save the characters in text file.So that the information of visiting card can be shared and maintained easily. can I get the code to extract features from visiting card?.
26 Mar 2012 by Darsh_Basha
i using this code to make (text) image segmentation#include #include #include #include const char *WIN1="Segmented"; // This will be the window nameIplImage* imagen;IplImage* imagen_color;IplImage* smooth;IplImage* threshold;IplImage*...
13 Jun 2012 by Muthu Nadar
Hi,I have been assigned a task to read text from (.jpg) image file.After searching I came to know [Tessnet2] will help me in converting .jpg to .txt, but sample on [Tessnet2] has not helped me a lot.Kindly help me by showing the write right path in achieving OCR with best...
28 May 2012 by Jαved
Hi Muthukumar,This question is solved here -C# OCR (How to Read a single character from image)[^]
13 Jun 2012 by Basem.Othman
Although Tesseract is one of the more accurate free OCR engines, the last time I tried it a couple of years ago it was rather inaccurate. After trying some other open source libraries, we faced similar problems with the other free OCR engines and winded up using leadtools that provided faster...
14 Jun 2012 by elidrissi.amine1
Is there any open source OCR of .NET that can extract text from scanned pdf even if the text is in different fonts and it gives the ability to render it in html( or xml or text)format.
26 Jul 2012 by shah dipen
Hello everyone, I'm currently working on OCR and i'm facing some trouble. IS there any way to merger traindata files for all fonts?
18 Aug 2012 by Christian Graus
I assume natural means 'real world'. This is tough and I don't know of anyone doing this without using a prebuilt component to do it. Which usually means a commercial one.
28 Aug 2012 by Sandeep Mewara
I want the code for developing an OCR image to text in Asp.net.Sorry but it does not work like this here.Here is what is expected of enquirers:1. TRY first what you want to do! You may find that it's not that hard.2. Formulate what was done by you that looks like an issue/not...
28 Aug 2012 by lewax00
An OCR program is not trivial, it's a rather complicated subject, so no one here is going to write one from scratch to give to you. The best you'll get is someone doing a Google search and giving you links from that, so save yourself some time and just do the search yourself.That being said,...
2 Sep 2012 by Ammar_Khaled
There is a toolkit named leadtools that could help you doing your requirements. You can use it to do some processing on the image (i.e. binrize it, remove the unwanted dots from the image, convert it to 1-bit black & white, etc), and then pass it to their OCR engine.You can check their website...
13 Sep 2012 by chandni0422
Is there any way to determine gujarati characters using Microsoft Office Document Imaging DLL(Dynamic Link library) Please help me even if there is some other way with english to gujarati characters.I m coding in c# Asp.net
13 Sep 2012 by chandni0422
Can we use Microsoft Document Imaging MODI with non english characters, such as Gujarati_MODIDocument.OCR(_MODIParameters .Language , _MODIParameters.WithAutoRotation, _MODIParameters.WithStraightenImage);I want to scan the image with other language can I do that?Or Can I convert...
11 Jan 2013 by LEADTOOLS Support
iOS OCR and Barcode with LEADTOOLS 18.
21 Jan 2013 by Dynamsoft
1 Feb 2013 by LEADTOOLS Support
Multi-Platform OCR with LEADTOOLS 18.
18 Feb 2013 by Sandeep Mewara
It does not work like this here.Here is what is expected of enquirers:1. TRY first what you want to do! You may find that it's not that hard.2. Formulate what was done by you that looks like an issue/not working. Try them and tell if you face issues.Members will be more than happy...
23 Feb 2013 by Azad R Chouhan
Hello everyone I wanna to create a OCR application in .net using winforms.Please tell me how i can create a ocr using c sharp and winform Regard Azad R Chouhan
21 Mar 2013 by Dave Kreskowiak
YOu'd have to use OCR to get that. There's a few good articles here on CP that cover it, here[^].
26 Apr 2013 by X2040
Hello.I want to enable English Handwriting functionality in my Android app.Does any body know a high accuracy English Handwriting SDK ?
27 Apr 2013 by amir tarek
i have an image jpg has 12 numbers 0 1 2 3 4 5 6 7 8 9 and writing using normal shape with no change on its shape i want to make a program read all the image get a part of it that has the 12 numbers (right down)then make ocr program read the numbers very accuracy.this program i...
1 Jun 2013 by LEADTOOLS Support
Using LEADTOOLS OCR to Enhance Google Drive Search
1 Jun 2013 by David Jhones
hi, i am involved with handwritten OCR application. I use MNIST digit database for training process here. I use following code for read pixels from the database and re-create the image. programs doesnt give any error but it gives meaningless image(totally black and blur pixel patters) as output....
11 Jun 2013 by David Jhones
hi,i am developing English Handwriting OCR. I use Zone based approach for feature extraction. Here I use 64 X 64 images.So i have 64 features for one sample image. My SVM will be Multi-class SVM because i have 52 classes for both capital and simple letters.Here is the format of feature...
14 Jun 2013 by David Jhones
hi all,i found scaling data means , keep all training and testing data within a range such as 0 - 1 or -1 - +1. i will proceed with that. does anyone has any idea or suggestion, please comment it here. thank you
2 Jul 2013 by LEADTOOLS Support
Automatically Classify Scanned Documents using LEADTOOLS Forms Recognition
31 Jul 2013 by visnumca123
Hi Friends,while debugging i got the error like this Attempted to read or write protected memory. This is often an indication that other memory is corrupt.This below error code line works for capture screen and do ocr. But dynamically(Means Multiple time capturing window with certain...
31 Jul 2013 by Tadit Dash (ତଡିତ୍ କୁମାର ଦାଶ)
Refer - Attempt to read or write Protected Memory This is often an indicating that other memory is corrupt[^].The answer holds some explanation about the issue. You can also suppress this Exception.Take a look, it might help you.
31 Jul 2013 by Bernhard Hiller
From your desciption, I'd rather suspect that the library is not multithreading-proof. Hence you have to wait for the OCR processing to have finished before you start the next processing. That's a limitation of that library, and your code has to obey to those limitation.
1 Oct 2013 by LEADTOOLS Support
In the white paper that follows, we will show how to read and write the PDF keywords metadata, update the file on Google Drive, interface with your local Google Drive database, and do all of this within a single right-click context menu in Windows Explorer.
5 Oct 2013 by Richard MacCutchan
https://www.google.com/search?q=ocr[^].https://www.google.com/search?q=sql+server[^].
1 Nov 2013 by LEADTOOLS Support
Improving Forms Recognition Results with Automated Alignment
5 Nov 2013 by expert-programmer
whynot take a look of open source projects? https://code.google.com/p/tesseract-ocr/wiki/3rdParty
10 Nov 2013 by jules237
Hi, I am new to C# and image processing. I am interested in implementing of SWT for image detection and localization. Is there are any examples of implementation of the stroke width transform in C#?Thank you
14 Nov 2013 by imran kanjoo
Can some one please guide me step by step to integrate tessnet2 with visual studio for ocr. Or if you know any site for this please share. I have seen many sites but still I am unable to compile the project.some time I get some file missing. but i checked the files were present.sometimes i...
11 Dec 2013 by Member 10460822
Is there a way to remote-control a device like smartphones or tablets to perform OCR (text recognition) on photos stored in the device or being taken by it (or another camera) real time, without touching the device itself? If yes, which device(s) can do this? Is programming required?
8 Jan 2014 by Dipanki99
https://gist.github.com/yatt/915443[^]
14 Jan 2014 by ledtech3
A problem I see with that is if it dosen't correctly reconize the input then your data will be wrong. I recently tried to scan data to notepad and it got confused on the font for "k" and other letters and symbols.You could start with something like this to give you an idea on the scale...
23 Mar 2014 by Xiao Ling
This post introduces an easy way to write OCR program in C#.http://www.codepool.biz/ocr-barcode-twain/twain-sdk/have-fun-with-the-wpf-demo-of-net-twain-5-0.html[^]dynamicDotNetTwainThum.IfShowCancelDialogWhenBarcodeOrOCR = true;dynamicDotNetTwainThum.OCRResultFormat =...
21 Apr 2014 by 204.sharma
Hi, i am searching for tamil language ocr engine or DLL.I searched a lot on google but it only shows 3 Names (tesseract, AbbyFineReader and gtamilOCR).besides this if someone have information about other ocr engines, dll or sdk.please help me and share your information.Thanks...
25 Apr 2014 by mahla.r
from PIL import Imageimg = Image.open('1.gif') # Your image here!img = img.convert("RGBA")width, height = img.sizepixdata = img.load()# Make the letters bolder for easier recognitionfor y in xrange(img.size[1]): for x in xrange(img.size[0]): if pixdata[x, y][0]
13 May 2014 by OriginalGriff
As you were told last time: We do not condone, support or assist in the production of malicious code in any way, form or manner. This is a professional site for professional developers. If you want to know how to create such things, you need to visit a hacking site: but be sure to disable...
21 May 2014 by thatraja
Check this listComparison of optical character recognition software[^]Check all OCR websites for more details like Documentation, tutorials, forum, etc.,
25 May 2014 by vinodh107
How can we read Tamil languages using tesseract OCR with at least 70-80% accuracy.I will provide sample for your reference. Kindly provide me step by step process to know more knowledge about tesseract OCR with Tamil fonts.Note: i have tried Tamil fonts with tesseract OCR, But accuracy...
26 May 2014 by Richard MacCutchan
See https://code.google.com/p/tesseract-ocr/downloads/detail?name=tesseract-ocr-3.02.tam.tar.gz&can=2&q=[^].
9 Jun 2014 by Sergey Alexandrovich Kryukov
In your question, you did not have an idea who are you talking to and what source code do you mean. Unfortunately, your question is about to be automatically removed, due to some members' abuse reports (unclear or incomplete question).Please as the question again, this time with more detail,...
30 Jun 2014 by Jocelyne El Khoury
i have the below code: ddlResultFormat.Items.Add("Text") dynamicDotNetTwain1.LoadImage("C:\Users\elj\Desktop\3b072e9b-76b4-4776-bd3c-5984074d8ecd.png") Dim r = dynamicDotNetTwain1.GetImage(dynamicDotNetTwain1.CurrentImageIndexInBuffer).Width.ToString() Dim l =...
30 Jun 2014 by Bernhard Hiller
Not sure, but that looks promissing:dynamicDotNetTwain1.CurrentImageIndexInBuffer in the top lines vs. dynamicDotNetTwain1.CurrentSelectedImageIndicesInBuffer in the last line.
15 Jul 2014 by jagds
Hello,I had searched on google last two days, and few info relates to achieve OCR implementation. Neural networks algorithm, text recognition will help to achieve OCR. But confusion going on that how to implement Neural Network in Objective C. Also read some where that need to apply...
9 Sep 2014 by Member 11069130
how can I take a text from image I have code in ocr but it's not run it's broken in the linemodiDocument.OCR(MiLANGUAGES.miLANG_ENGLISH);it's all my codeprivate void button1_Click(object sender, EventArgs e){ //string filePath = Server.MapPath("~/Uploads/" +...
9 Sep 2014 by Gihan Liyanage
See this sample OCR applicationCreating Optical Character Recognition (OCR) applications using Neural Networks[^]
15 Oct 2014 by jase.y.
OCR? Sounds like you need ITextSharp. Check out their SourceFourge page and do some reading up on how to use it. Here's a simple snippet to get you started with extracting some text from a PDF file:itextsharp read pdf file[^]public string ParsePdf(string fileName){ if...
14 Nov 2014 by Sergey Alexandrovich Kryukov
I cannot give you a final solution: good OCR is hard to find. Please see my past answers and follow my links, especially to the CodeProject articles: i need an ocr for a camera image capture[^].The solution depends on the difficulty of your problem. It's possible that even a simple solution...
20 Nov 2014 by Mathew Soji
Every tool will have its on plus and minus .Research a bit on each and see which suits your requirement . Then post question here so that some here can guide. My suggestion is to check the below link for a better start .https://www.google.co.in/?gws_rd=ssl#q=OCR+tools+%2BAsp.Net+%2B+C%23.[^]
20 Nov 2014 by Afzaal Ahmad Zeeshan
There ain't any list, since there are many libraries and tools already developed and available for you to use, you won't be able to differentiate between any of the library.What you should do is, check out this MSDN's OCR reader[^] by Microsoft. It is available in C#, C++ and JavaScript....
24 Nov 2014 by Kandiya
How to extract the text of the selected region in a scanned jpg image in C# using OCR?
25 Nov 2014 by Kandiya
Why I am getting the error "Retrieving the COM class factory for component with CLSID {} failed due to the following error" on using MODI OCR?I am extracting an image on a scanned document using Microsoft Office Document Imaging. I am implementing using VS 2008 and C#3.5.I added the dll...
26 Nov 2014 by Kandiya
How to get the region of an image and extract the text in the same using MODI OCR?I am able to extract the entire text from a scanned image using MODI OCR. But I want to get the region of an image and extract the text in the same. My code is as follows:private string...
26 Nov 2014 by Kandiya
Why I am getting OCR Running error on implementing MODI OCR on a text written in mspaint ?I have saved the image as TIFF. Otherwise if its simply extracting from a scanned image its working fine. I am using MODI OCR tool.
29 Dec 2014 by Xiao Ling
You can download this Tesseract .Net wrapper: https://github.com/charlesw/tesseract[^], and run the OCR sample inside. More Tesseract wrapper links could be found on Tesseract-OCR addons[^]
29 Dec 2014 by Xiao Ling
Try this .Net wrapper[^] which recommended by Tesseract-OCR[^]. You can find an excellent OCR web demo[^] inside.
1 Feb 2015 by LEADTOOLS Support
In this white paper we will show how to set up a master form template then extract the data from a filled invoice.
7 Feb 2015 by Jim Meadors
This project PDF Viewer Control Without Acrobat Reader Installed[^] on CP includes tesseract and code to use it. The article doesn't go into it in much detail, you'll have to dig into the code but not that difficult.
2 Mar 2015 by LEADTOOLS Support
In this whitepaper, we will explore the inner-workings of this innovative framework and highlight how developers can write just a few lines of code to extend and customize it for their application.
24 Feb 2015 by dsanthosh
Did you get Solution for your problem for your issue ?