Click here to Skip to main content
15,888,351 members
Please Sign up or sign in to vote.
0.00/5 (No votes)
See more:
Hi all,

I need to create a document store with search capabilities. Sounds simple...
That means that I have documents which I need to store in database. I thought about CouchDB, and about few other document-oriented databases, but I'm still not sure what would be the best solution.

On the other side, I thought about integrating Solr in some kind of web application which I'm going to use for uploading, indexing, search, update, delete documents.
And, of course, the main problem is that most of these documents are written using Cyrillic characters.

Maybe I'm trying to combine things that do not match together.
Could someone give me an advice what would be the best way to implement solution like this.

Best,
Srecko
Posted
Comments
Sergey Alexandrovich Kryukov 26-Jan-12 12:40pm    
Advice? Even though you did not tag platform(s), language, etc.?

What do you mean "Cyrillic characters". Don't tell me you are not going to use Unicode. (However, this is technically possible if you store charset information with the document.)
--SA
skyforever 26-Jan-12 12:51pm    
It is not important, Windows or Linux. It should be Java programming language, and I'm going to use Unicode, but when I tried to index documents using Lucene, there are certain differences when it comes to Cyrillic characterset.
In the meantime, I have decided that I’m going to use Solr because it has API which I could use from my web application. In that part, I need to know how to configure Solr for indexing and searching Cyrillic characterset.
On the other hand, I plan to use NoSQL document-oriented database. So, I need advice which one would be the best choice.
Thank you!

This content, along with any associated source code and files, is licensed under The Code Project Open License (CPOL)



CodeProject, 20 Bay Street, 11th Floor Toronto, Ontario, Canada M5J 2N8 +1 (416) 849-8900