On 1 March 2011 00:23, Christian Lohmaier <lohmaier+ooofut...@googlemail.com> wrote:
> Hi Peter,
>
> On Mon, Feb 28, 2011 at 9:16 AM, Peter Ruwoldt <ruwol...@gmail.com> wrote:
>>
>> I am wanting to develop a Pitjantjatjara spell checker for
>> LibreOffice.
>
> Well, I know nothing about that language, and I'm not sure whether you
> really intended to write what you did.
> I don't think it is necessary to develop a new spell-checker; instead,
> it is enough to create a corresponding dictionary for the
> existing spellchecker, namely hunspell.
>
>> [...] I have a list of words in a unicode text file where each word is on a
>> new line. There are about 2300 lines/words.
>
> This is a rather short list, very likely not enough for automatic
> affix creation. I guess that this list doesn't include inflected forms
> of the words anyway (i.e. past and future forms, genitives, etc.)
>
>> I have no idea what to do next and I would appreciate clues for the next
>> steps.
>
> You need to have a good understanding of how words are formed in the language.
>
> For example, if the plural of a word is (almost) always formed by appending
> an "s" to the word, then you should create an affix rule for that, etc.
>
> But without knowing the language specifics, it is hard to give a
> concrete path. But then again, 2300 is really short. With that list,
> you can just save that list as a dictionary without any
> affix/transformation rules.
>
> But to develop a dictionary, you need not only a list of correct words
> but also a (possibly automatically generated) list of misspelled
> words, to do quality checks on your modifications.
>
> ciao
> Christian
>
> --
> Unsubscribe instructions: E-mail to l10n+h...@libreoffice.org
> List archive: http://listarchives.libreoffice.org/www/l10n/
> *** All posts to this list are publicly archived for eternity ***

Thanks Christian,

I've checked out the Hunspell project and I do not want to create a new checker, just the dictionary.
I figured that a plain word list with no affix (.aff) rules would let us make a start. I have a linguist working with me, and from here we can evolve the project over time.

Peter
--
Free and Open education for all
Peter Ruwoldt
Grant High School
Hosking Avenue
MOUNT GAMBIER SA 5290
P. 08 87263107
F. 08 87250173
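
For anyone following the thread, the "plain word list, no affix rules" starting point could be sketched roughly as below. This is only an illustration, not something either poster supplied: it assumes the word list is a UTF-8 text file with one word per line, and the function names and the dictionary name "pjt_AU" are placeholders made up for the example. It relies on two facts about the Hunspell format: the .dic file starts with a line giving the (approximate) number of entries, and the .aff file can declare the encoding with SET UTF-8.

```python
from pathlib import Path

# Minimal .aff content: it only declares the encoding and has no affix rules yet.
MINIMAL_AFF = "SET UTF-8\n"


def load_words(lines):
    """Return the unique, non-empty words from an iterable of lines, sorted."""
    words = {line.strip() for line in lines}
    words.discard("")
    return sorted(words)


def build_dic(words):
    """Build Hunspell .dic text: a word count on line 1, then one word per line."""
    return "\n".join([str(len(words)), *words]) + "\n"


def write_dictionary(wordlist_path, out_dir, name="pjt_AU"):
    """Write <name>.dic and <name>.aff into out_dir (the name is a placeholder).

    Assumes wordlist_path is a UTF-8 file with one word per line;
    "utf-8-sig" also tolerates a leading byte-order mark.
    """
    text = Path(wordlist_path).read_text(encoding="utf-8-sig")
    words = load_words(text.splitlines())
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    (out / f"{name}.dic").write_text(build_dic(words), encoding="utf-8")
    (out / f"{name}.aff").write_text(MINIMAL_AFF, encoding="utf-8")
    return len(words)
```

Calling write_dictionary("words.txt", "dict") would then produce the pair of files that Hunspell expects, and affix rules can be added to the .aff file later as the linguistic analysis progresses.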