Marco A.G.Pinto wrote:
I came up with the idea of creating a GUI to edit the thesaurus of AOO.
JZA told me the files were in TXT format and gave me a URL with several
information but I gave a quick look and didn't find anything about the
data dictionary of the thesaurus.
My FOSDEM presentation may have a little bit more, at least in terms of
examples and links (make sure you check the existing web interfaces
available, like the one mentioned by RGB: read through the whole
presentation).
https://fosdem.org/2013/schedule/event/apache_openoffice_dictionaries/
If you need more, ask here and I will be able to provide some guidance.
The tool will be called "Proofing Tool GUI" and will be coded in
PureBasic. Is this a good name? PureBasic allows to compile in
Windows/Linux/Mac/Amiga.
We may disagree about the language choice (we have already a dozen
languages in use in the OpenOffice sources!) but for the moment let's
discuss the overall functionality. Then you may code the tool as part of
OpenOffice (much harder; but it isn't totally unreasonable to have
built-in dictionary editing capabilities!) or as an independent tool, or
whatever.
I unziped the Portuguese .OXT and grabbed the files:
- th_pt_PT.idx
- th_pt_PT.dat
As far as your tool is concerned, you may disregard the .idx file. It is
automatically generated from the .dat file. (To do so, you use
th_gen_idx.pl from Kevin B. Hendricks' MyThes; if you need further
information just ask).
in the *.dat* one I had:
UTF-8
1|3
-|anuviado
-|aperitivo
-|sigla
ababelado|1
-|atrapalhado|baralhado|atarantado|desnorteado
ababelar|1
-|baralhar|atrapalhar
abaçanado|1
... etc...
Slide 20 in my presentation has a more understandable example. Anyway,
your file means:
- The word (!) "1" has "3" possible meanings
- The word "ababelado" has "1" possible meaning:
- This meaning has no description ("-")
- Synonyms, in this meaning, are: atrapalhado, baralhado, atarantado,
desnorteado
- And so on.
Is there a URL which explains every detail of those files?
You should already have quite a few pointers by now... If you collect
information, feel free to create a wiki page on
http://wiki.openoffice.org/ to consolidate knowledge there!
Regards,
Andrea.
---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@openoffice.apache.org
For additional commands, e-mail: dev-h...@openoffice.apache.org