The following module was proposed for inclusion in the Module List:

  modid:       HTML::Strip
  DSLIP:       RdcOp
  description: Efficiently removes HTML markup from text
  userid:      KILINRAX (Alex Bowley)
  chapterid:   15 (World_Wide_Web_HTML_HTTP_CGI)
  communities:

  similar:

  rationale:

    Whilst the module quite happily strips SGML/XML-like markup from
    text as well as HTML; I believe it should exist in the HTML
    namespace simple because I can envisage no circumstances under which
    someone would want to blanketly remove SGML/XML markup - whereas
    stripping extraneous HTML markup is occasionally very desirable. A
    common application is preparing HTML snippets for indexing by a
    search engine. As this module is written bare-minimum C, it tends to
    be about 7 times faster than using regular expressions to do the
    same thing.

  enteredby:   KILINRAX (Alex Bowley)
  enteredon:   Wed Aug 13 14:56:59 2003 GMT

The resulting entry would be:

HTML::
::Strip           RdcOp Efficiently removes HTML markup from text    KILINRAX


Thanks for registering,
-- 
The PAUSE

PS: The following links are only valid for module list maintainers:

Registration form with editing capabilities:
  
https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=be300000_9f6897b4844009e5&SUBMIT_pause99_add_mod_preview=1
Immediate (one click) registration:
  
https://pause.perl.org/pause/authenquery?ACTION=add_mod&USERID=be300000_9f6897b4844009e5&SUBMIT_pause99_add_mod_insertit=1

Reply via email to