Click here to Skip to main content
15,887,135 members
Please Sign up or sign in to vote.
1.00/5 (1 vote)
I used indexing throw Lucene.net, but the max time that lucene.net can give you the result is 6 minute.
My website based on the search performance, i need another strategy or methodology to using them.
If someone know another way else (lucene.net) to search about text in million of records.

Thank you
Posted

1 solution

If the text records are in a database using the database manager's build in search functionality will probably be a lot faster. After all, that is what is build for.

If you read the text from files on the hard disk it might be your implementation. I am not familiar with Lucene and how it works, but if it looks through those files each query there is a lot of overhead in opening and closing the files each time.

I personally once wrote a search system that indexed txt files, but since the number of them was quite small, as was the amount of content, I just read the file as one long string and dumped it into SQLServer. Performance was very well, but I don't think this will be the most optimal strategy if you have 1M PDFs to index.

if you could you go into some more detail of the system: where does the text come from, where is it stored, how much text is in each record, that sort of thing, we might be able to provide some better, more directly usable, advice.
 
Share this answer
 
Comments
Christiaan Rakowski 22-Sep-12 9:30am    
I'm not sure why my answer got downvoted, but I would like to know so I can improve it. It's true it's not a solid answer, but there is no solid question to answer either.

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900