On Mon, Feb 28, 2011 at 2:16 AM, Peter Ruwoldt <ruwol...@gmail.com> wrote: > Hi > I am wanting to develop a Pitjantjatjara spell checker for > LibreOffice. Pitjantjatjara is one of a number of indigenous > languages from Central Australia. I'm hoping that what we learn from > doing this will enable us to develop spell check dictionaries for > other Australian indigenous languages. > I have a list of words in a unicode text file where each word is on a > new line. There are about 2300 lines/words. > I have no idea what to do next and I would appreciate clues for the next > steps. > Thanks > Peter >
Dear Peter, I've helped create spell checking dictionaries for a number of minority and endangered languages and would be happy to help you with this. As others have already indicated, you will probably want to combine your word list with an affix file that encodes any common prefixes and suffixes in the language. I can help write that but I'll need lots of input from you to do so. I also have scripts that can package your word list up so that it is usable as an extension in either Firefox or OpenOffice/LibreOffice. 2300 words is a great start - in Irish we say "Tús maith leath na hoibre" - a good start is half the work. Kevin -- Unsubscribe instructions: E-mail to l10n+h...@libreoffice.org List archive: http://listarchives.libreoffice.org/www/l10n/ *** All posts to this list are publicly archived for eternity ***