Hi Peter,

On Mon, Feb 28, 2011 at 9:16 AM, Peter Ruwoldt <ruwol...@gmail.com> wrote:
>
> I am wanting to develop a Pitjantjatjara spell checker for
> LibreOffice.

Well, I know nothing about that language, and I'm not sure you really
meant what you wrote: I don't think it is necessary to develop a new
spell checker. It should be enough to create a dictionary for the
existing spell checker, namely hunspell.

> [...] I have a list of words in a unicode text file where each word is on a
> new line.  There are about 2300 lines/words.

This is a rather short list, very likely not enough for automatic
affix creation. I guess that this list doesn't include inflected forms
of the words anyway (i.e. past and future forms, genitives, etc.).

> I have no idea what to do next and I would appreciate clues for the next 
> steps.

You need to have a good understanding of how words are formed in the language.

For example, if the plural of a word is (almost) always formed by
appending an "s" to the word, then you should create an affix rule for
that, and so on for the other regular patterns.
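
To make that concrete, here is a rough Python sketch (mine, not part of
hunspell) of what such a rule looks like in a hunspell .aff file and how
it expands a flagged .dic entry. The flag letter "S", the English example
word and the helper names aff_lines/expand are made up for illustration;
the real rules depend entirely on the language.

```python
# Illustrative only: one unconditional suffix rule, hunspell-style.

def aff_lines(flag, suffix):
    """Return the .aff lines for a single 'append suffix' rule."""
    return [
        f"SFX {flag} Y 1",           # header: flag, cross-product allowed, 1 rule
        f"SFX {flag} 0 {suffix} .",  # strip nothing, append suffix, any stem
    ]


def expand(entry, rules):
    """Expand a .dic entry such as 'dog/S' into the forms it would accept.

    `rules` maps a one-letter flag to the suffix it appends (an assumed,
    simplified model: no stripping, no conditions).
    """
    stem, _, flags = entry.partition("/")
    forms = [stem]
    for flag in flags:
        if flag in rules:
            forms.append(stem + rules[flag])
    return forms


if __name__ == "__main__":
    print("\n".join(aff_lines("S", "s")))
    print(expand("dog/S", {"S": "s"}))
```

With a rule like this, one dictionary line covers both the base form and
the suffixed form, which is why a short word list can still go a long way
once the regular patterns are known.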

Without knowing the language specifics, it is hard to suggest a
concrete path. Then again, 2300 words is a really short list. With a
list that size, you can simply save it as a dictionary without any
affix/transformation rules.
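
A minimal sketch of that step, assuming the usual hunspell layout: a
.dic file whose first line is the (approximate) number of entries,
followed by one word per line, plus an .aff file that only declares the
encoding. The function name build_dic is a placeholder of mine.

```python
# Illustrative only: turn a plain word list into hunspell .dic content.

MINIMAL_AFF = "SET UTF-8\n"  # .aff with no affix rules, just the encoding


def build_dic(words):
    """Return .dic file content for an iterable of words.

    Blank lines are dropped and duplicates removed; the first line of
    the result is the entry count, as hunspell expects.
    """
    entries = sorted({w.strip() for w in words if w.strip()})
    return "\n".join([str(len(entries))] + entries) + "\n"


if __name__ == "__main__":
    print(build_dic(["b", "a", "", "a "]), end="")
```

You would pass it the lines of your UTF-8 text file and write the result
and MINIMAL_AFF to a pair of files with the same base name (whatever
name you choose for the language).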

To develop a dictionary properly, though, you need not only a list of
correct words but also a (possibly automatically generated) list of
misspelled words, so that you can run quality checks on your
modifications.
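
One possible way to generate such a list automatically, as a sketch
only: drop or swap letters in each correct word and keep the results
that are not themselves correct words. The helper names and the choice
of edits are my assumptions, and accepts() stands in for whatever
checker you actually test (here it is just set membership).

```python
# Illustrative only: generate misspellings and run a simple quality check.

def misspellings(word, known):
    """Return single-edit variants of `word` that are not in `known`."""
    variants = set()
    for i in range(len(word)):
        variants.add(word[:i] + word[i + 1:])  # delete one letter
        if i + 1 < len(word):
            # swap two neighbouring letters
            variants.add(word[:i] + word[i + 1] + word[i] + word[i + 2:])
    variants.discard("")
    return sorted(v for v in variants if v != word and v not in known)


def quality_check(accepts, good, bad):
    """Return (good words rejected, bad words accepted) for a checker."""
    rejected_good = [w for w in good if not accepts(w)]
    accepted_bad = [w for w in bad if accepts(w)]
    return rejected_good, accepted_bad


if __name__ == "__main__":
    known = {"cat", "act"}
    bad = misspellings("cat", known)
    print(bad)
    print(quality_check(known.__contains__, sorted(known), bad))
```

Both lists returned by quality_check should stay empty after every
change to the dictionary or the affix rules; anything that shows up
there points at a rule that is too strict or too permissive.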

ciao
Christian

-- 
Unsubscribe instructions: E-mail to l10n+h...@libreoffice.org
List archive: http://listarchives.libreoffice.org/www/l10n/
*** All posts to this list are publicly archived for eternity ***
