2009/10/17 Lars Aronsson <[email protected]> > Kelly Jones wrote: > > > How can I extract just a word list w/ definitions from wiktionary? > > A very simple Perl script for extracting information from the > Wikimedia XML dumps is found on > http://meta.wikimedia.org/wiki/User:LA2/Extraktor > > If you know Perl, you can modify this script to filter out the > articles and sections you want, and output them separately. > > There are some Perl tools I've made on the Toolserver subversion repository but since I'm the only one using them so far they be tricky for others to use: https://fisheye.toolserver.org/browse/enwikt/wiktdump/ wiktsplitnames.pl <https://fisheye.toolserver.org/browse/enwikt/wiktdump/wiktsplitnames.pl>will split an English Wiktionary dumpfile into word lists or mini dump files for each language at the most simplistic level
I've also created a feature request on bugzilla: "Regularly publish updated word lists and definition lists" https://bugzilla.wikimedia.org/show_bug.cgi?id=21164 Andrew Dunbar (hippietrail) > -- > Lars Aronsson ([email protected]) > Aronsson Datateknik - http://aronsson.se > > _______________________________________________ > Wiktionary-l mailing list > [email protected] > https://lists.wikimedia.org/mailman/listinfo/wiktionary-l > -- http://wiktionarydev.leuksman.com http://linguaphile.sf.net _______________________________________________ Wiktionary-l mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wiktionary-l
