Re: Problem using Lucene on Ubuntu

2008-02-18 Thread Grant Ingersoll
Good point Jan! On Feb 18, 2008, at 9:00 AM, Jan Peter Stotz wrote: Grant Ingersoll wrote: Note: ENCODING is whatever encoding the file is in, as in "UTF-8", if that is what your files are in. I think there is a misunderstanding, the WordExtractor extracts text from MS Word (.doc) files.

Re: Problem using Lucene on Ubuntu

2008-02-18 Thread Jan Peter Stotz
Grant Ingersoll wrote: Note: ENCODING is whatever encoding the file is in, as in "UTF-8", if that is what your files are in. I think there is a misunderstanding, the WordExtractor extracts text from MS Word (.doc) files. Those files are binary and therefore does not have an encoding. I wou

Re: Problem using Lucene on Ubuntu

2008-02-18 Thread Grant Ingersoll
tor takes an inputstream as an arguement. Should i determine the encoding of the inputstream and how? -- View this message in context: http://www.nabble.com/Problem-using-Lucene-on-Ubuntu-tp15543843p15545082.html Sent from the Lucene - Java

Re: Problem using Lucene on Ubuntu

2008-02-18 Thread kratoras
ctor; The wordextractor takes an inputstream as an arguement. Should i determine the encoding of the inputstream and how? -- View this message in context: http://www.nabble.com/Problem-using-Lucene-on-Ubuntu-tp15543843p15545082.html Sent from the Lucene - Java Users mailing list archive at

Re: Problem using Lucene on Ubuntu

2008-02-18 Thread Grant Ingersoll
ng getBytes before i write it to the index?? -- View this message in context: http://www.nabble.com/Problem-using-Lucene-on-Ubuntu-tp15543843p15544612.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. ---

Re: Problem using Lucene on Ubuntu

2008-02-18 Thread kratoras
s message in context: http://www.nabble.com/Problem-using-Lucene-on-Ubuntu-tp15543843p15544612.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additio

Re: Problem using Lucene on Ubuntu

2008-02-18 Thread Grant Ingersoll
ext: http://www.nabble.com/Problem-using-Lucene-on-Ubuntu-tp15543843p15543843.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-ma

Problem using Lucene on Ubuntu

2008-02-18 Thread kratoras
greek word but it continues to search for english words. Could it be a problem of fonts in my ubuntu or what?Greek fonts are installed though. Thanks for any answers in advance -- View this message in context: http://www.nabble.com/Problem-using-Lucene-on-Ubuntu-tp15543843p15543843.html Sent