Click here to Skip to main content
15,890,512 members
Everything / OCR

OCR

OCR

Great Reads

by Carey Payette
The problem I am looking to solve while evaluating the LEADTOOLS SDK is to use a photo of a hard copy page of text taken with my phone, have it processed and displayed in the document viewer, then have it converted to Word format so that it can be easily edited.
by Chris_Riley
Warewolf is a promising offering to help nimble front-end applications perform heavy backend tasks. This write up is on my experience with the product.
by Dynamsoft
In this article, I’ll share with you how to utilize Dynamic .NET TWAIN in a WPF application to process your documents.
by meconer
Developing a display reader for raspberry pi

Latest Articles

by ToughDev
How to restore text from PDF files Encoded using custom CID fonts
by gbeene
Capture protected document content as text without the need for breaking DRM
by Chris Maunder
Working through the combinations to get PaddlePaddle and PaddleOCR installed and working everywhere
by Michael Haephrati
How to use an OCR SDK using C++ and libCurl

All Articles

Sort by Updated

OCR 

21 Apr 2014 by 204.sharma
Hi, i am searching for tamil language ocr engine or DLL.I searched a lot on google but it only shows 3 Names (tesseract, AbbyFineReader and gtamilOCR).besides this if someone have information about other ocr engines, dll or sdk.please help me and share your information.Thanks...
30 Aug 2016 by Accusoft, Corey Morehead
This whitepaper shows how I used OCR Xpress for Java to OCR a scanned newspaper, redact the key words, and accomplish my goal.
1 Mar 2012 by adarsha nayak
helloI am searching for a matlab code to extract the characters from visiting card and save the characters in text file.So that the information of visiting card can be shared and maintained easily. can I get the code to extract features from visiting card?.
14 Nov 2014 by Afzaal Ahmad Zeeshan
This is the same question[^] you've had asked yesterday (or day before) and I have already posted an answer for reading the characters from an image. You need to use an OCR (Optical Character Recognition; recognizer) to read the characters from an image. That is a sample project[^] I have...
20 Nov 2014 by Afzaal Ahmad Zeeshan
There ain't any list, since there are many libraries and tools already developed and available for you to use, you won't be able to differentiate between any of the library.What you should do is, check out this MSDN's OCR reader[^] by Microsoft. It is available in C#, C++ and JavaScript....
13 Feb 2017 by Akarshan Majarjan
If you are using office 2003 and already enabled MODI features then just update office 2003 service pack 3. Here is the linkhttps://www.microsoft.com/en-us/download/details.aspx?id=8It will work fine....
27 Apr 2013 by amir tarek
i have an image jpg has 12 numbers 0 1 2 3 4 5 6 7 8 9 and writing using normal shape with no change on its shape i want to make a program read all the image get a part of it that has the 12 numbers (right down)then make ocr program read the numbers very accuracy.this program i...
2 Sep 2012 by Ammar_Khaled
There is a toolkit named leadtools that could help you doing your requirements. You can use it to do some processing on the image (i.e. binrize it, remove the unwanted dots from the image, convert it to 1-bit black & white, etc), and then pass it to their OCR engine.You can check their website...
5 Sep 2014 by Anderson Rissardi
I need to create an OCR to solve captchas. can anyone direct me to a free library or free component that I can use?
23 Feb 2013 by Azad R Chouhan
Hello everyone I wanna to create a OCR application in .net using winforms.Please tell me how i can create a ocr using c sharp and winform Regard Azad R Chouhan
31 Mar 2015 by BacchusBeale
Have a look at this OCR project:https://code.google.com/p/tesseract-ocr/[^]I used it once and it worked well.
13 Jun 2012 by Basem.Othman
Although Tesseract is one of the more accurate free OCR engines, the last time I tried it a couple of years ago it was rather inaccurate. After trying some other open source libraries, we faced similar problems with the other free OCR engines and winded up using leadtools that provided faster...
31 Jul 2013 by Bernhard Hiller
From your desciption, I'd rather suspect that the library is not multithreading-proof. Hence you have to wait for the OCR processing to have finished before you start the next processing. That's a limitation of that library, and your code has to obey to those limitation.
30 Jun 2014 by Bernhard Hiller
Not sure, but that looks promissing:dynamicDotNetTwain1.CurrentImageIndexInBuffer in the top lines vs. dynamicDotNetTwain1.CurrentSelectedImageIndicesInBuffer in the last line.
4 Jan 2021 by boredpanda
Hey everyone, i'm beginner at c++ and i want to build an ocr on c++ program and i don't how to start it.. so i need ur help guys(: What I have tried: i opened DLL project and used class c++ item but i don't know how to write the code..
13 Aug 2012 by butterflyzx4
Hi, I am doing a project where I have to detect text in an image "not OCR", just to detect if text is their and localize the text lines.I know that at first the edge map has to generated using an edge detection algorithm but I want to know if there are any examples of implementation of the...
18 Aug 2012 by butterflyzx4
I'm doing project where I have to detect text in natural images, after the initial edge map generation stage, the connected component labeling must be done to identify text components. Are there any examples of connected component labeling done for text detection in C#? Please no rude...
1 Apr 2016 by Carey Payette
The problem I am looking to solve while evaluating the LEADTOOLS SDK is to use a photo of a hard copy page of text taken with my phone, have it processed and displayed in the document viewer, then have it converted to Word format so that it can be easily edited.
26 Feb 2013 by chaau
There is a GOCR[^] library that is released under GNU licence. I have used it and it is great. It is a pure C library, and works in Windows as well. I recommend it.
12 Jul 2013 by chady1
How can I develop an app which has Optical Character Recognition and can display the text from an Image? And how can I train it to understand handwritten text from a picture?Thanksps:I know java,android development, c++, c...
28 Aug 2012 by chandni0422
I want the code for developing an OCR image to text in Asp.net.Actually I have to develop a gujarati charater OCR reader and need help for that..
13 Sep 2012 by chandni0422
Is there any way to determine gujarati characters using Microsoft Office Document Imaging DLL(Dynamic Link library) Please help me even if there is some other way with english to gujarati characters.I m coding in c# Asp.net
13 Sep 2012 by chandni0422
Can we use Microsoft Document Imaging MODI with non english characters, such as Gujarati_MODIDocument.OCR(_MODIParameters .Language , _MODIParameters.WithAutoRotation, _MODIParameters.WithStraightenImage);I want to scan the image with other language can I do that?Or Can I convert...
23 Feb 2012 by Chandrakantt
You can use Tesseract OCR present in 1. http://sourceforge.net/projects/tesseract-ocr/[^]2. http://code.google.com/p/tesseract-ocr/[^]
21 Nov 2022 by Chris Maunder
Working through the combinations to get PaddlePaddle and PaddleOCR installed and working everywhere
29 Jan 2016 by Chris_Riley
In this post I will explain how quality imaging is the most critical element for an application I’m working on to support both OCR and NLP.And how I’m using the MobileImage SDK to do it.
2 Aug 2016 by Chris_Riley
Warewolf is a promising offering to help nimble front-end applications perform heavy backend tasks. This write up is on my experience with the product.
18 Aug 2012 by Christian Graus
I assume natural means 'real world'. This is tough and I don't know of anyone doing this without using a prebuilt component to do it. Which usually means a commercial one.
12 Dec 2017 by CPallini
We have many articles on this topic[^].
4 Jan 2021 by CPallini
Have a look, for instance, at Tesseract library[^].
26 Mar 2012 by Darsh_Basha
i using this code to make (text) image segmentation#include #include #include #include const char *WIN1="Segmented"; // This will be the window nameIplImage* imagen;IplImage* imagen_color;IplImage* smooth;IplImage* threshold;IplImage*...
28 Nov 2011 by dartfrog
Hi all,I am now trying to develop OCR application using MODI. so i want to use together with AxMODI.AxMiDocView controller. how should I do ? I added MODI 12.0 as reference. and I also chose "Microsoft Office Document Imaging Viewer Control 12.0" in toolbox but I cannot see the viewer....
21 Mar 2013 by Dave Kreskowiak
YOu'd have to use OCR to get that. There's a few good articles here on CP that cover it, here[^].
5 Sep 2014 by Dave Kreskowiak
Uhhh....no.CAPTCHA's are there to prevent you from doing what you're asking about ---- FOR A REASON.
10 Jul 2015 by Dave Kreskowiak
Well, you're going to have to write your own Notepad then."Intercepting" the print job and editing it is an extraordinarily complex way to do what amounts to Mail Merge.The application describes what the page image looks like to the print driver. The driver takes that "image" and...
1 May 2015 by DaveAuld
I had a quick look at Tesseract search results and it appears that the OCR engine supports AMD GPUs.Therefore in theory it may be possible to speed up processing by;a) investing in a higher specification GPUcoupled with;b) higher performance CPU.Do some research, ask some questions...
1 Jun 2013 by David Jhones
hi, i am involved with handwritten OCR application. I use MNIST digit database for training process here. I use following code for read pixels from the database and re-create the image. programs doesnt give any error but it gives meaningless image(totally black and blur pixel patters) as output....
11 Jun 2013 by David Jhones
hi,i am developing English Handwriting OCR. I use Zone based approach for feature extraction. Here I use 64 X 64 images.So i have 64 features for one sample image. My SVM will be Multi-class SVM because i have 52 classes for both capital and simple letters.Here is the format of feature...
14 Jun 2013 by David Jhones
hi all,i found scaling data means , keep all training and testing data within a range such as 0 - 1 or -1 - +1. i will proceed with that. does anyone has any idea or suggestion, please comment it here. thank you
10 Jun 2016 by Derek_Gerber
13 Jul 2015 by Dharmesh .S. Patil
Step to solve error for Windows Vista or Windows 7:1. Click Start, type program and features in the Search box, and then press ENTER.2. Click to select the Office 2007 version that you have installed.3. Click Change.4. Click Add or Remove Features, and then click Continue.5. Expand...
8 Jan 2014 by Dipanki99
https://gist.github.com/yatt/915443[^]
24 Feb 2015 by dsanthosh
Did you get Solution for your problem for your issue ?
2 Aug 2012 by Dynamsoft
In this article, I’ll show you how to embed document processing, such as image scanning, editing, and uploading, to your .NET application.
16 Aug 2012 by Dynamsoft
In this article, I’ll show you how to convert scanned images to searchable text and PDF files through a web browser.
7 Sep 2012 by Dynamsoft
In this article, I’ll share with you how to utilize Dynamic .NET TWAIN in a WPF application to process your documents.
21 Jan 2013 by Dynamsoft
14 Jun 2012 by elidrissi.amine1
Is there any open source OCR of .NET that can extract text from scanned pdf even if the text is in different fonts and it gives the ability to render it in html( or xml or text)format.
4 Nov 2013 by Erdinc Ay
Hi,I am new to C++, and need a step-by-step listing of how I can bring the open-source project Tesseract to be included in my linux c++ application (console application).https://code.google.com/p/tesseract-ocr/[^]kind regards,e.a.
5 Nov 2013 by expert-programmer
whynot take a look of open source projects? https://code.google.com/p/tesseract-ocr/wiki/3rdParty
21 Mar 2023 by gbeene
Capture protected document content as text without the need for breaking DRM
7 Sep 2018 by Gerry Schmitz
You're talking "ETL" (extract; translate; load). You're still only at the "extract" phase; the rest (filter, aggregate, visualize) only comes "after". You need to be more specific about the "content". A "simple" "text" scanner can take a few minutes to develop; and even less to run. (pdfs...
9 Sep 2014 by Gihan Liyanage
See this sample OCR applicationCreating Optical Character Recognition (OCR) applications using Neural Networks[^]
3 Aug 2020 by gstolarov
A class library and command line utility to add OCR information to image-only PDF files
12 Aug 2014 by HanMoLingfeng
In my project,we need use OCR ,so I download some traineddata files from googole'code.But the result which they callback for me almost wrong.So I think I need do my traineddata.Why this traineddata which was created by myself does not work on android?Thx your help.Add: Thx to Richard...
12 Dec 2017 by Hassan(Aych Jay)
I am trying to perform OCR on the attached sample images. Google OCR + Microsoft OCR have been tried. They work really work on text that is not hand written. However I have to read text from handwritten documents. Sample -1 [^] Sample-2 [^] Technology is not an issue. What I have tried: ...
5 Nov 2012 by Hendypodz
Hello all,I' m Hendy. I'm newbie in neural networks I was trying to develop some ocr project and implement backpropagation method. I have been already sets the train function, but I still confusing about using the output result. How can I use the output to recognize some character? Do I need...
11 Aug 2021 by Hossein_Hadi
I wanna design an OCR program. I'll thanks if you offer any way to design this program(Any algorithm)or if have a strong open source library to do this. Thanks for suggestion.
14 Nov 2013 by imran kanjoo
Can some one please guide me step by step to integrate tessnet2 with visual studio for ocr. Or if you know any site for this please share. I have seen many sites but still I am unable to compile the project.some time I get some file missing. but i checked the files were present.sometimes i...
1 May 2011 by incredible me
Hi All, I am working on my final year project. It is about CAPTCHA breaking. i.e. To show that captchas are easily broken.For that following steps need to be followed:1. Trace the position of characters in the input image.2. Extract each character. We can also say it as character...
15 Jul 2014 by jagds
Hello,I had searched on google last two days, and few info relates to achieve OCR implementation. Neural networks algorithm, text recognition will help to achieve OCR. But confusion going on that how to implement Neural Network in Objective C. Also read some where that need to apply...
15 Oct 2014 by jase.y.
OCR? Sounds like you need ITextSharp. Check out their SourceFourge page and do some reading up on how to use it. Here's a simple snippet to get you started with extracting some text from a PDF file:itextsharp read pdf file[^]public string ParsePdf(string fileName){ if...
7 Feb 2015 by Jim Meadors
This project PDF Viewer Control Without Acrobat Reader Installed[^] on CP includes tesseract and code to use it. The article doesn't go into it in much detail, you'll have to dig into the code but not that difficult.
21 May 2014 by jim rays
Hi there,We are building a solution where handwritten questionnaire forms with objective and detailed response questions are required to be interpreted.Could you please suggest any opensource tool for the same ? We already tried QueXF and it's suite but it does not bring satisfactory...
30 Jun 2014 by Jocelyne El Khoury
i have the below code: ddlResultFormat.Items.Add("Text") dynamicDotNetTwain1.LoadImage("C:\Users\elj\Desktop\3b072e9b-76b4-4776-bd3c-5984074d8ecd.png") Dim r = dynamicDotNetTwain1.GetImage(dynamicDotNetTwain1.CurrentImageIndexInBuffer).Width.ToString() Dim l =...
13 Jun 2012 by Joshi, Rushikesh
There is no any solution which gives guarantee to read all JPG files as most of OCRs readers are based on preloaded font formats and at time of reading those JPG files must containing text in those supported font-families. Otherwise all suggested solution provided by others should be...
10 Nov 2013 by jules237
Hi, I am new to C# and image processing. I am interested in implementing of SWT for image detection and localization. Is there are any examples of implementation of the stroke width transform in C#?Thank you
28 May 2012 by Jαved
Hi Muthukumar,This question is solved here -C# OCR (How to Read a single character from image)[^]
20 Nov 2014 by Kandiya
what are the list of fine reader tools available in OCR Technlogy? I am using OCR,Asp.Net and C#.
23 Nov 2014 by Kandiya
How to do the OCR using Abby Fine Reader? I want the source code related to the same. I am using Asp.net and C# 3.5. I want to do the OCR in images having Typed Texts and handwritten texts
24 Nov 2014 by Kandiya
Why I am not able to recognize text in image using Tesseract? On debuggin the code after Init case the application isn't working. Required dll is also added. The exe and testdata folder compatible is set in the bin/debug folder. I am using C#3.5 too. Did so following below...
24 Nov 2014 by Kandiya
How to extract the text of the selected region in a scanned jpg image in C# using OCR?
25 Nov 2014 by Kandiya
Why Tesseract OCR is not working in Asp.net with C#3.5? I am able to extract the text from the scanned image in c# application. But while using in web application the same is not working. Here OCR.Init method its not working in Asp.net but in windows application there are no such issues.I have...
25 Nov 2014 by Kandiya
Why I am getting the error "Retrieving the COM class factory for component with CLSID {} failed due to the following error" on using MODI OCR?I am extracting an image on a scanned document using Microsoft Office Document Imaging. I am implementing using VS 2008 and C#3.5.I added the dll...
26 Nov 2014 by Kandiya
How to get the region of an image and extract the text in the same using MODI OCR?I am able to extract the entire text from a scanned image using MODI OCR. But I want to get the region of an image and extract the text in the same. My code is as follows:private string...
26 Nov 2014 by Kandiya
Why I am getting OCR Running error on implementing MODI OCR on a text written in mspaint ?I have saved the image as TIFF. Otherwise if its simply extracting from a scanned image its working fine. I am using MODI OCR tool.
27 Nov 2014 by Kandiya
How to read text from the selected Image area using MODI OCR?I am able to extract the entire text from the image. But I want to read the text from the selected image area using MODI OCR.My code is as follows: MODI.Document md = new MODI.Document(); ...
21 May 2014 by Kornfeld Eliyahu Peter
This is probably the best open source library for OCR - https://code.google.com/p/tesseract-ocr/[^]Also Microsoft has an Inkanalyzer class (belongs to WPF) that can help you - http://msdn.microsoft.com/en-us/library/system.windows.ink.inkanalyzer(v=vs.90).aspx[^]
2 Aug 2015 by Kornfeld Eliyahu Peter
There is no need (or place) for client/server separation...These libraries are standalone, the list of supported operating systems has 3 entries for every type of operating systems the library supports. Minimum for desktop, minimum for server and minimum for phone...What you should consider...
23 Jun 2015 by Lakhan Anjana
you need to find some OCR API for That.this is the only one way.
21 Mar 2013 by lascuola_marimo
HelloI'd like to know how get text and image from external chat like this:http://imageshack.us/photo/my-images/42/tavoloocr.jpg/Can i use ocr or i must integrate autoit in vb net?Thanks for help and sorry for english
23 Sep 2014 by LEADTOOLS Support
Why implementing Optical Character Recognition with Leadtools ODR SDKs is the faster, more accurate choice.
11 Jan 2013 by LEADTOOLS Support
iOS OCR and Barcode with LEADTOOLS 18.
1 Feb 2013 by LEADTOOLS Support
Multi-Platform OCR with LEADTOOLS 18.
1 Jun 2013 by LEADTOOLS Support
Using LEADTOOLS OCR to Enhance Google Drive Search
2 Jul 2013 by LEADTOOLS Support
Automatically Classify Scanned Documents using LEADTOOLS Forms Recognition
1 Oct 2013 by LEADTOOLS Support
In the white paper that follows, we will show how to read and write the PDF keywords metadata, update the file on Google Drive, interface with your local Google Drive database, and do all of this within a single right-click context menu in Windows Explorer.
1 Nov 2013 by LEADTOOLS Support
Improving Forms Recognition Results with Automated Alignment
1 Feb 2015 by LEADTOOLS Support
In this white paper we will show how to set up a master form template then extract the data from a filled invoice.
2 Mar 2015 by LEADTOOLS Support
In this whitepaper, we will explore the inner-workings of this innovative framework and highlight how developers can write just a few lines of code to extend and customize it for their application.
1 May 2015 by LEADTOOLS Support
LEADTOOLS includes a robust Check Scanning and Processing SDK within its document family of imaging toolkits, and is already being used by a number of applications across banking, insurance and retail industries.
14 Jan 2014 by ledtech3
A problem I see with that is if it dosen't correctly reconize the input then your data will be wrong. I recently tried to scan data to notepad and it got confused on the font for "k" and other letters and symbols.You could start with something like this to give you an idea on the scale...
28 Aug 2012 by lewax00
An OCR program is not trivial, it's a rather complicated subject, so no one here is going to write one from scratch to give to you. The best you'll get is someone doing a Google search and giving you links from that, so save yourself some time and just do the search yourself.That being said,...
2 Feb 2015 by Liju Sankar
Hi Hossein_Hadi,Please check the following links A C# Project in Optical Character Recognition (OCR) Using Chain Code[^]Open Source OCR SDK 1 : tesseract-ocr (code.google.com)[^]Open Source OCR SDK 2 : GOCR (sourceforge.net)[^]Similar thread in Code Project : Best Open -...
1 Jul 2011 by llovelysoso
i am having the same problem as you mensioned before i determined where is the character in the image and i specified its hight and width all the rest is about how to extract this charcters and recognize it if you found solution pleaze share it with me
23 Sep 2022 by M@153
Input image I need to extract CR No.from the sample image above. Using Easyocr, I got the output in complex nested list form. How to update the code to filter out all the detected text/numbers and get only CR No. I tried to fetch the text "CR...
14 Jun 2012 by Maciej Los
Use this links:OCR[^]OCR Source code[^]
25 Apr 2014 by mahla.r
from PIL import Imageimg = Image.open('1.gif') # Your image here!img = img.convert("RGBA")width, height = img.sizepixdata = img.load()# Make the letters bolder for easier recognitionfor y in xrange(img.size[1]): for x in xrange(img.size[0]): if pixdata[x, y][0]
21 Feb 2011 by Manfred Rudolf Bihy
Try this CP article for starters: OCR With MODI in Visual C++[^] and here you'll find an open source .NET C++ library: http://www.pixel-technology.com/freeware/tessnet2/[^]Modification:MODI has been deprecated in MS Office 2010. The replacement is OneNote and here is a useful link to...