http://opennlp.sf.net/
- Fredrik
2006/10/20, z shalev <[EMAIL PROTECTED]>:
hello all,
i would like to retrieve during query time, the part of speech of each word
in a query,
does anyone know of an implementation of a java part of speech api?
thanks in a
ndex from scratch, so I don't think it's a corrupt index.
And finally, the start of the 'tii' file is as follows:
$ hexdump /tmp/test.tii | head
000 feff f3cc 8000
010 1000 0014 3107 3030
020 3130 3831
#x27;s a corrupt index.
And finally, the start of the 'tii' file is as follows:
$ hexdump /tmp/test.tii | head
000 feff f3cc 8000
010 1000 0014 3107 3030
020 3130 3831 0104 0
ad dump indicates that it is lucene which is the culprit, trying
to allocate the 500+Mb slab. What is interesting is that the number of
bytes it tries to allocate is exactly the same every time (532676624 bytes)
Can anyone please help me out here? Have anyone experienced simila