Dear friends,

I want to extract text from a PDF while identifying bold and italic text. For example, bold text should be extracted like this: <b>TEST</b>, and italic text should be enclosed like this: <i>test</i>.
Currently I am using texttopdf.exe to extract the text. The accuracy is good, but it cannot identify bold or italic text.

Does anyone have another idea, or know whether the same tool has this feature?

Thanks in Advance
Updated 6-May-14 22:04pm
CHill60 7-May-14 4:07am    
A link to the tool might be useful - the only texttopdf I know of converts text to PDF, not the other way around.
And I would remove the VB6 tag from your question - it doesn't seem relevant.
jai_mca 7-May-14 6:17am    
Thanks, Chill, but I don't understand what you mean.
It is relevant to VB6.
Please refer to the link.
CHill60 8-May-14 19:31pm    
Spotted this comment by accident - if you want to respond to a comment use the "Reply" link so that the poster is notified.
Your link refers to "pdftotext.exe", not "texttopdf.exe" ... I will accept that "pdftotext.exe" will actually convert a pdf to text (!)
As that is open source, is it worth trying to contact the author, or adapting the code yourself?
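
If an extractor can report each run of text together with the name of the font it was drawn in, the tagging step the question asks for could look roughly like the sketch below. It is written in Python only for illustration. The input format (pairs of text and font name) and the function names are assumptions made for this example, not something pdftotext.exe is known to provide, and detecting style from substrings such as "Bold", "Italic" or "Oblique" in the font name is a common heuristic that will miss fonts named differently.

```python
from html import escape
from typing import Iterable, Tuple


def is_bold(font_name: str) -> bool:
    # Heuristic (assumption): bold fonts usually carry "Bold" in their name.
    return "bold" in font_name.lower()


def is_italic(font_name: str) -> bool:
    # Heuristic (assumption): italic fonts are usually named "Italic" or "Oblique".
    name = font_name.lower()
    return "italic" in name or "oblique" in name


def tag_spans(spans: Iterable[Tuple[str, str]]) -> str:
    """Join (text, font_name) pairs, wrapping bold runs in <b> and italic runs in <i>.

    The (text, font_name) input shape is hypothetical; adapt it to whatever
    your extractor actually returns.
    """
    parts = []
    for text, font_name in spans:
        # Escape the raw text so a literal "<" in the PDF is not read as a tag.
        piece = escape(text)
        if is_italic(font_name):
            piece = f"<i>{piece}</i>"
        if is_bold(font_name):
            piece = f"<b>{piece}</b>"
        parts.append(piece)
    return "".join(parts)


if __name__ == "__main__":
    sample = [
        ("TEST", "Arial-BoldMT"),
        (" and ", "ArialMT"),
        ("test", "Arial-ItalicMT"),
    ]
    print(tag_spans(sample))
```

A run in a bold italic font gets both tags, with <b> on the outside. Any tool or modified extractor that can supply the font name per run could feed this function.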

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)
