The following module was proposed for inclusion in the Module List:

  modid:       Lingua::RU::OpenCorpora::DumpFile
  DSLIP:       cdpOp
  description: Iterator for Opencorpora's XML dump
  userid:      PERLINO (Michael Ivanchenko)
  chapterid:   11 (String_Lang_Text_Proc)
  communities:
    https://github.com/perlino/opencorpora
  similar:
    Lingua::RU::OpenCorpora::Tokenizer XML::TreePuller

  rationale:
    Guys at Opencorpora decided we need an iterator that will go through
    the long MediaWiki xml dump, and I tried to make the use of this
    iterator comfortable.

  enteredby:   PERLINO (Michael Ivanchenko)
  enteredon:   Fri Jun 13 07:27:20 2014 UTC

The resulting entry would be:

Lingua::RU::OpenCorpora::
::DumpFile        cdpOp Iterator for Opencorpora's XML dump          PERLINO

Thanks for registering,
--
The PAUSE

PS: The following links are only valid for module list maintainers:

Registration form with editing capabilities:
  https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=ff010000_4e0d9ec666895222&SUBMIT_pause99_add_mod_preview=1
Immediate (one click) registration:
  https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=ff010000_4e0d9ec666895222&SUBMIT_pause99_add_mod_insertit=1
Peek at the current permissions:
  https://pause.perl.org/pause/authenquery?pause99_peek_perms_by=me&pause99_peek_perms_query=Lingua%3A%3ARU%3A%3AOpenCorpora%3A%3ADumpFile