On Tue, 10 Oct 2000, Craig Small wrote:
> I've been chasing up the reason why udmsearch does not index
> non-English text too well, and after having a chat with the developer it all
> comes down to charsets.
>
> English basically has a word charset of [A-Za-z0-9], easy stuff and
> all 7-bit. But other languages have other charsets.
>
> Charsets I already have are:
> Cyrillic: cp1251, koi8r, cp866, iso88595, maccyr
> Western: iso-8859-1
> Central Europe: iso-8859-2, cp1250
> Arabic: cp1256
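
To illustrate the [A-Za-z0-9] point, here is a minimal Python sketch. It is not udmsearch code; the function names and the sample text are made up for illustration. It only shows that an ASCII-only word pattern finds nothing in Cyrillic text, and that the same bytes read under the wrong charset give different text.

```python
import re

# Hypothetical helpers for illustration only; not part of udmsearch.
ASCII_WORD = re.compile(r"[A-Za-z0-9]+")   # the 7-bit "English" word charset
UNICODE_WORD = re.compile(r"\w+")          # Unicode-aware for str patterns (also matches "_")


def ascii_words(text):
    """Return the words found using only the ASCII word charset."""
    return ASCII_WORD.findall(text)


def unicode_words(text):
    """Return the words found using Unicode letters and digits."""
    return UNICODE_WORD.findall(text)


sample = "привет мир"                # sample Cyrillic text ("hello world")
raw = sample.encode("koi8_r")        # bytes as they might be stored in a KOI8-R page

decoded = raw.decode("koi8_r")       # decoding with the matching charset
misread = raw.decode("cp1251")       # same bytes read under the wrong charset

print(ascii_words(decoded))          # no ASCII words in Cyrillic text
print(unicode_words(decoded))        # both words are found
print(misread == sample)             # the wrong charset yields different text
```

So the indexer needs to know both which charset a page is in and which characters count as word characters in it.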

I prefer to use UTF-8, but I'm not sure you can get that one to work with
udmsearch... (it is Perl, I assume?)

Egon

-- 
Hi! I'm a .signature virus! Copy me into your ~/.signature to help me spread.
If you don't know what ~/.signature means, don't get your panties in a knot,
you already have a PoLDeRoDel virus!