Hi, Thank you very much for the suggestion. Indeed lucene transform seems to be something that could meet the requirements for the project.
Thank you all for your help and fast answers. Rafaela On 4 July 2013 08:52, VIGNESH S <vigneshkln...@gmail.com> wrote: > Hi Rafaela, > > Look at Lucene Transform.It might help to encrypt lucene documents. > > https://code.google.com/p/lucenetransform/ > > > On Wed, Jun 26, 2013 at 2:36 PM, Rafaela Voiculescu < > rafaela.voicule...@gmail.com> wrote: > > > Hello, > > > > Thank you all for your help and the suggestions. They are very useful. I > > wanted to clarify more aspects of the project, since I overlooked them in > > my previous mails. > > > > To explain the use case exactly, the application should work like this: > > > > - The application works with patient data and that's why we want to > keep > > things confidential. We are downloading patient data that can go to > the > > mobile device (it should even work on desktop in a similar way really) > > - We have to keep the data in the device due to internet connection > > limitation. The device will get, if lucky, internet connection once or > > twice per week, hence us needing to keep the patient data locally > > - The thing I forgot to mention is that the structure of the patient > > data is kept in json format > > - Currently, there is no plan for using database because the structure > > of the patient stored locally might need to change (so we want to > store > > the > > json as document in Lucene). > > - And we need to achieve the part with not having someone who, for > > instance steals the device, able to access the data unless they have > the > > encryption key and mechanism and not having someone who's not supposed > > to > > access the data do that. > > > > This is why we're trying to find a way to encrypt somehow the json > > documents and still use Lucene or try not to have the index stored as > > plaintext, if it would be possible. > > > > Thank you again for all your help and in case this mail has given more > > useful details and there are other suggestions or comments, I would be > very > > happy to read them. > > > > Have a nice day, > > Rafaela > > > > > > On 25 June 2013 20:59, SUJIT PAL <sujit....@comcast.net> wrote: > > > > > Hi Rafaela, > > > > > > I built something along these lines as a proof of concept. All data in > > the > > > index was unstored and only fields which were searchable (tokenized and > > > indexed) were kept in the index. The full record was encrypted and > stored > > > in a MongoDB database. A custom Solr component did the search against > the > > > index, gathered up unique ids of the results, then pulled out the > > encrypted > > > data from MongoDB, decrypted it on the fly and rendered the results. > > > > > > You can find the (Scala) code here: > > > https://github.com/sujitpal/solr4-extras > > > (under the src/main/scala/com/mycompany/solr4extras/secure folder). > > > > > > More information (more or less the same as what I wrote but probably a > > bit > > > more readable with inlined code): > > > > > > > > > http://sujitpal.blogspot.com/2012/12/searching-encrypted-document-collection.html > > > > > > There are some obvious data sync concerns with this sort of setup, but > as > > > Adrian points out, you can't index encrypted data. > > > > > > HTH > > > Sujit > > > > > > On Jun 25, 2013, at 4:17 AM, Adrien Grand wrote: > > > > > > > On Tue, Jun 25, 2013 at 1:03 PM, Rafaela Voiculescu > > > > <rafaela.voicule...@gmail.com> wrote: > > > >> Hello, > > > > > > > > Hi, > > > > > > > >> I am sorry I was not a bit more explicit. I am trying to find an > > > acceptable > > > >> way to encrypt the data to prevent any access of it in any way > unless > > > the > > > >> person who is trying to access it knows how to decrypt it. As I > > > mentioned, > > > >> I looked a bit through the patch, but I am not sure of its status. > > > > > > > > You can encrypt stored fields, but there is no way to do it correctly > > > > with fields that have positions indexed: attackers could infer the > > > > actual terms based on the order of terms (the encrypted version must > > > > sort the same way as the original terms), frequencies and positions. > > > > > > > > -- > > > > Adrien > > > > > > > > --------------------------------------------------------------------- > > > > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > > > > For additional commands, e-mail: java-user-h...@lucene.apache.org > > > > > > > > > > > > > --------------------------------------------------------------------- > > > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > > > For additional commands, e-mail: java-user-h...@lucene.apache.org > > > > > > > > > > > > -- > Thanks and Regards > Vignesh Srinivasan > 9739135640 >