Click here to Skip to main content
15,887,485 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
I want to search a words or string from millions of excel files. And I want rapid searching!

So how is that possible and using which tools?

Search will be global.
I have no clue about this type of Indexing and Full text searching.

Please advise me for Best options available to add this functionality.

Thanks in advance.

I don't want to use SQL Server.

Technology/Language used .NET/C#

What I have tried:

I have tried surfing the web for my problem and I have found some tools like Apache Solar, Lucene, Tika and Toxy, But It seems very confusing to me. I'm not getting correct way to solve this problem.
Posted
Updated 5-May-16 9:17am
Comments
CHill60 5-May-16 7:17am    
Millions? That's going to take years to run.
ZurdoDev 5-May-16 7:31am    
Import into SQL first.

This is not an easy problem to solve. You mention that you don't want to use SQL, so we can assume you want to look at the actual contents of the Excel file. If you are talking about physically searching the contents of millions of Excel files then you are going to to have to wait a long time. There is no rapid way to do this.

There are document management systems that allow this to be done, but most are complex, expensive, and require a lot of hardware to run on.

You mention Solr which is a great tool for indexing and searching information. And using Solr Cell you are able to index the contents of file like Word, Excel, and PDf.

If you are confused the best advice to give is pick a tool like Solr and start learning. There are several great resources for this tool in particular. It took me a couple of days to first learn the basics and get a server setup, but once that was done I was able to move through the topics pretty quickly. We now use the tool to handle indexing of data for about 95% of our applications and in turn use that for searching.
 
Share this answer
 
Comments
Kishan Sojitra 7-May-16 2:32am    
Thanks for your help! :)

Started working with solr, Can You share some resources for it.
Like how to Use Solr with .NET WinForms Application?
I have tried google but results are confusing! :(
Thanks again for Help.
AnvilRanger 8-May-16 20:46pm    
Here is a very easy to user Solr Client
https://github.com/mausch/SolrNet/blob/master/Documentation/README.md
You've stated that you don't want to use SQL Server but a database would certainly be a good way forward. PostgreSQL might be your best option - see PostgreSQL: The world's most advanced open source database[^]

Alternatively you could use a file text search - there are some suggestions on this Tools to search for strings inside files without indexing - Stack Overflow[^]
 
Share this answer
 

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900