Hi, I have a huge LDIF file, on the order of gigabytes, containing several million user records. I am running the example "Grep" job on that file. The search results have not really been up to expectations, because the job is a basic, brute-force, per-line scan.
I was thinking of building some indexes inside HDFS for that file so that the search results could improve. What could I try in order to achieve this? Secura
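
To make the idea concrete, here is a rough sketch of the kind of index this could mean: a map from an attribute value to the byte offsets of the records that contain it, so that a lookup can seek straight to a record instead of scanning every line. It is plain local Python rather than a Hadoop job, and it assumes records are separated by blank lines and that values are plain `attr: value` lines; folded and base64 (`attr:: ...`) lines are simply skipped. The names `build_index` and `read_record` and the sample data are hypothetical.

```python
import io
from collections import defaultdict


def build_index(stream, attributes):
    """Map (attribute, value) -> byte offsets of the records containing it."""
    index = defaultdict(list)
    offset = 0            # byte offset of the line currently being read
    record_start = None   # offset of the current record, None between records
    for raw in stream:
        line = raw.rstrip(b"\r\n")
        if not line:
            record_start = None          # a blank line ends the record
        elif line.startswith((b"#", b" ")):
            pass                         # comments and folded lines: skipped here
        else:
            if record_start is None:
                record_start = offset
            name, sep, value = line.partition(b":")
            key = name.decode().lower()
            # Skip base64 values ("attr:: ...") in this simplified sketch.
            if sep and key in attributes and not value.startswith(b":"):
                index[(key, value.strip().decode())].append(record_start)
        offset += len(raw)
    return index


def read_record(stream, start):
    """Seek to a record's byte offset and return its lines."""
    stream.seek(start)
    lines = []
    for raw in stream:
        if not raw.strip():
            break
        lines.append(raw.decode().rstrip("\r\n"))
    return lines


# Hypothetical two-record sample standing in for the real LDIF file.
sample = (
    b"dn: uid=alice,ou=people,dc=example,dc=com\n"
    b"uid: alice\n"
    b"mail: alice@example.com\n"
    b"\n"
    b"dn: uid=bob,ou=people,dc=example,dc=com\n"
    b"uid: bob\n"
    b"mail: bob@example.com\n"
)

index = build_index(io.BytesIO(sample), {"uid"})
bob_offsets = index[("uid", "bob")]
bob_record = read_record(io.BytesIO(sample), bob_offsets[0])
```

On HDFS, the index building would presumably be done as a MapReduce job of its own, with the index stored alongside the data file, but that part is not shown here.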
