I'd like to use IFilters to extract text from many file types.
This works in general, with one exception:

I cannot call the default PDF IFilter in Windows 8 and Windows Server 2012:

glcndFilter.dll

Loading it from my application fails with an error.
Filtdump.exe fails on it as well:
Failed to CoCreate ILoadFilter instance, hr == 0x80040154
FILTDUMP failed, hr == 0x80040154
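
For reference, 0x80040154 is the standard COM HRESULT REGDB_E_CLASSNOTREG ("class not registered"). The sketch below only decodes that value into its fields and prints the bitness of the current process. The `HResultInfo` class and its members are hypothetical names made up for this illustration. A 32-bit/64-bit mismatch between the caller and the registered filter is one common cause of this HRESULT in general, but it is an assumption here, not a confirmed diagnosis for glcndFilter.dll.

```csharp
using System;

// Hypothetical helper for illustration; not part of IFilterTextReader or Windows.
public static class HResultInfo
{
    // REGDB_E_CLASSNOTREG: the COM "class not registered" error.
    public const uint ClassNotRegistered = 0x80040154u;

    // Bit 31 is the severity bit; 1 means failure.
    public static bool IsFailure(uint hr) { return (hr & 0x80000000u) != 0; }

    // Facility field, using the same shift and mask as the HRESULT_FACILITY macro.
    public static uint Facility(uint hr) { return (hr >> 16) & 0x1FFFu; }

    // The low 16 bits hold the facility-specific error code.
    public static uint Code(uint hr) { return hr & 0xFFFFu; }

    public static string Describe(uint hr)
    {
        // Only the one HRESULT from this question is named; anything else is "unknown".
        string name = hr == ClassNotRegistered ? "REGDB_E_CLASSNOTREG" : "unknown";
        return string.Format(
            "0x{0:X8}: failure={1}, facility={2}, code=0x{3:X4}, name={4}",
            hr, IsFailure(hr), Facility(hr), Code(hr), name);
    }
}

public static class Program
{
    public static void Main()
    {
        // Decode the HRESULT reported by Filtdump.exe.
        Console.WriteLine(HResultInfo.Describe(0x80040154u));

        // Assumption: bitness may matter, so show what this process runs as.
        Console.WriteLine("64-bit process: " + Environment.Is64BitProcess);
    }
}
```

Facility 4 is FACILITY_ITF, meaning the code is defined by the COM interface layer rather than by the filter itself. The error says the class could not be found for the calling process; it does not by itself say the filter DLL is missing.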

At the same time, my application can call every other filter.

How can I call this one?

I want to open PDF, DOC, TXT and other files in my C# application and extract plain text from them using IFilter.
This already works for every other file type; the only problem is PDF.
The reason: Microsoft ships its own PDF IFilter, glcndFilter.dll, with Windows 8 and Windows Server 2012, and I cannot call this filter (see above).
However, SQL Server uses this same filter on the same machine.

I used this very useful code:
https://github.com/Sicos1977/IFilterTextReader
Posted
Updated 30-Jul-15 0:50am
v3
Comments
stibee 30-Jul-15 6:36am    
Your question is a bit too short. It is not clear to me what you are doing.

1 solution

 
Comments
Win32nipuh 31-Jul-15 5:34am    
Thank you, it works with the PDF filter glcndFilter.dll.
The PDF filter that Microsoft provides in Windows 8 and Windows Server 2012 has bugs.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)


