The following module was proposed for inclusion in the Module List:

  modid:       Lingua::ZH::MMSEG
  DSLIP:       adpOl
  description: Mandarin Chinese text segmentation
  userid:      DRYMAN (陳仁乾)
  chapterid:   9 (Language_Interfaces)
  communities: github
  similar:     Lingua::ZH::TaBE, Lingua::ZH::WordSegment

  rationale:

    I found that Lingua::ZH::Segment is already registered, so I
    changed my namespace to Lingua::ZH::MMSEG.

    A problem in computational analysis of Chinese text is that there
    are no word boundaries in conventionally printed text. Since the
    word is such a fundamental linguistic unit, it is necessary to
    identify words in Chinese text so that higher-level analyses can
    be performed. This module provides phrase segmentation using the
    Maximum Matching Algorithm. It was found that the system
    successfully identified 98.41% of words in a sample consisting of
    1013 words.

  enteredby:   DRYMAN (陳仁乾)
  enteredon:   Tue Dec 27 12:40:14 2011 GMT

The resulting entry would be:

Lingua::ZH::
::MMSEG           adpOl Mandarin Chinese text segmentation           DRYMAN

Thanks for registering,
--
The PAUSE

PS: The following links are only valid for module list maintainers:

Registration form with editing capabilities:
  https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=08900000_7e269787df8188bc&SUBMIT_pause99_add_mod_preview=1
Immediate (one click) registration:
  https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=08900000_7e269787df8188bc&SUBMIT_pause99_add_mod_insertit=1
Peek at the current permissions:
  https://pause.perl.org/pause/authenquery?pause99_peek_perms_by=me&pause99_peek_perms_query=Lingua%3A%3AZH%3A%3AMMSEG
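
For readers unfamiliar with the maximum matching idea mentioned in the rationale, here is a minimal sketch of its simplest form, greedy forward maximum matching. It is written in Python purely for illustration and is not the code of Lingua::ZH::MMSEG, which may well use a more elaborate variant. The function name forward_max_match, the max_len parameter, and the tiny lexicon are all hypothetical.

```python
def forward_max_match(text, lexicon, max_len=None):
    """Greedy forward maximum matching (illustrative sketch only).

    At each position, take the longest lexicon word that starts there;
    if nothing matches, emit the single character as a word.
    """
    if max_len is None:
        # Longest word in the lexicon bounds the candidate window.
        max_len = max((len(w) for w in lexicon), default=1)
    words = []
    i = 0
    while i < len(text):
        # Try the longest candidate first, shrinking until something matches.
        for size in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + size]
            if size == 1 or candidate in lexicon:
                words.append(candidate)
                i += size
                break
    return words


# Hypothetical toy lexicon, not taken from the module.
lexicon = {"研究", "研究生", "生命", "起源"}
print(forward_max_match("研究生命起源", lexicon))
# → ['研究生', '命', '起源']
```

The toy input shows why purely greedy matching is not enough on its own: the longest first match 研究生 blocks the more natural reading 研究 / 生命 / 起源, which is the kind of ambiguity that refined maximum matching schemes add extra rules to resolve.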