Marco A.G.Pinto wrote:
I came up with the idea of creating a GUI to edit the thesaurus of AOO.
JZA told me the files were in TXT format and gave me a URL with several
information but I gave a quick look and didn't find anything about the
data dictionary of the thesaurus.

My FOSDEM presentation may have a little bit more, at least in terms of examples and links (make sure you check the existing web interfaces available, like the one mentioned by RGB: read through the whole presentation).
https://fosdem.org/2013/schedule/event/apache_openoffice_dictionaries/

If you need more, ask here and I will be able to provide some guidance.

The tool will be called "Proofing Tool GUI" and will be coded in
PureBasic. Is this a good name? PureBasic allows to compile in
Windows/Linux/Mac/Amiga.

We may disagree about the language choice (we have already a dozen languages in use in the OpenOffice sources!) but for the moment let's discuss the overall functionality. Then you may code the tool as part of OpenOffice (much harder; but it isn't totally unreasonable to have built-in dictionary editing capabilities!) or as an independent tool, or whatever.

I unziped the Portuguese .OXT and grabbed the files:
- th_pt_PT.idx
- th_pt_PT.dat

As far as your tool is concerned, you may disregard the .idx file. It is automatically generated from the .dat file. (To do so, you use th_gen_idx.pl from Kevin B. Hendricks' MyThes; if you need further information just ask).

in the *.dat* one I had:
UTF-8
1|3
-|anuviado
-|aperitivo
-|sigla
ababelado|1
-|atrapalhado|baralhado|atarantado|desnorteado
ababelar|1
-|baralhar|atrapalhar
abaçanado|1
... etc...

Slide 20 in my presentation has a more understandable example. Anyway, your file means:
- The word (!) "1" has "3" possible meanings
- The word "ababelado" has "1" possible meaning:
  - This meaning has no description ("-")
- Synonyms, in this meaning, are: atrapalhado, baralhado, atarantado, desnorteado
- And so on.

Is there a URL which explains every detail of those files?

You should already have quite a few pointers by now... If you collect information, feel free to create a wiki page on http://wiki.openoffice.org/ to consolidate knowledge there!

Regards,
  Andrea.

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@openoffice.apache.org
For additional commands, e-mail: dev-h...@openoffice.apache.org

Reply via email to