Re: Corpus interpretation

2007-10-24 Thread Steven Rowe
Hi Liaqat, Liaqat Ali wrote: > I want to index the Urdu language corpus (200 documents in CES XML DTD > format). Is net necessary to break the XML file into 200 different files > or it can be indexed in the original form using Lucene. Kindly guide in > this regard. A Lucene document is composed o

Corpus interpretation

2007-10-24 Thread Liaqat Ali
I want to index the Urdu language corpus (200 documents in CES XML DTD format). Is net necessary to break the XML file into 200 different files or it can be indexed in the original form using Lucene. Kindly guide in this regard. ---