The following module was proposed for inclusion in the Module List: modid: Lingua::EN::Sentence::Offsets DSLIP: adpfp description: Report sentence boundaries character offsets userid: ANDREFS (André Fernandes dos Santos) chapterid: 11 (String_Lang_Text_Proc) communities: http://github.com/andrefs/Lingua-EN-Sentence-Offsets/issues
similar: Lingua::EN::Sentence rationale: Sentence splitter for English language with a twist: instead of returning some kind of array with the sentences, returns a list of pairs of start-end offsets for each sentence. This allows to know where each sentence starts and ends without the need of actually splitting the text. enteredby: ANDREFS (André Fernandes dos Santos) enteredon: Sat May 12 15:29:55 2012 GMT The resulting entry would be: Lingua::EN::Sentence:: ::Offsets adpfp Report sentence boundaries character offsets ANDREFS Thanks for registering, -- The PAUSE PS: The following links are only valid for module list maintainers: Registration form with editing capabilities: https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=e8a00000_823b8a36e9ca173f&SUBMIT_pause99_add_mod_preview=1 Immediate (one click) registration: https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=e8a00000_823b8a36e9ca173f&SUBMIT_pause99_add_mod_insertit=1 Peek at the current permissions: https://pause.perl.org/pause/authenquery?pause99_peek_perms_by=me&pause99_peek_perms_query=Lingua%3A%3AEN%3A%3ASentence%3A%3AOffsets