Cool; I tend to do a lot of that from Ruby, using Hpricot. On Thu, May 7, 2009 at 2:28 PM, Martin Strand <do.not.eat.yellow.s...@gmail.com> wrote: > If you need to parse html input, from a rich text editor, a remote website, > uploaded documents, etc, I would recommend nekohtml: > http://nekohtml.sourceforge.net/ > > It cleans up broken html and you can easily add a filter to only allow > certain tags: > http://nekohtml.sourceforge.net/filters.html > > Martin > > On Thu, 07 May 2009 22:58:56 +0200, Howard Lewis Ship <hls...@gmail.com> > wrote: > >> I'd tend to do this on the other end, if possible; parse user input >> (or RSS feed, or whatever) into XML and transform out the content you >> don't like, then store that in DB or render it raw. >> >> On Thu, May 7, 2009 at 1:26 PM, Christian Edward Gruber >> <christianedwardgru...@gmail.com> wrote: >>> Hi, >>> >>> I was considering how to write a minimal markup component - that is a >>> component which can handle encoding raw text into safe text much the same >>> way MarkupWriterImpl passes to a Text node which encodes html entities, >>> etc., but in this case, letting a few of them pass. >>> >>> One simple way is just to create an <t:output/> like component which >>> pre-encodes before passing on the text to the MarkupWriter, but I think >>> that's sort of a cheap hack. Can I contribute an alternate MarkupWriter >>> implementation with a marker annotation that I can obtain for beginRender()? >>> >>> The use case I'm trying to solve (without doing a lot of extra >>> code/storage, is to allow extremely minimal markup through - specifically >>> <p>,<strong>,<em>, <ul>, <dl>, <ol>, <li>, <table><tr><th><td>. It would >>> disallow all other markup, and strip out any style, class, or id attributes. >>> It's to allow a bit of (safe) data entry that can include some rendering >>> hints. >>> >>> Probably my first go will be to create a component that pre-encodes and >>> does a MarkupWriter.writewRaw() with the results, but it feels like a bad >>> hack. >>> >>> cheers, >>> Christian. >>> >>> Christian Edward Gruber >>> e-mail: christianedwardgru...@gmail.com >>> weblog: http://www.geekinasuit.com/ >>> > > --------------------------------------------------------------------- > To unsubscribe, e-mail: users-unsubscr...@tapestry.apache.org > For additional commands, e-mail: users-h...@tapestry.apache.org > >
-- Howard M. Lewis Ship Creator of Apache Tapestry Director of Open Source Technology at Formos --------------------------------------------------------------------- To unsubscribe, e-mail: users-unsubscr...@tapestry.apache.org For additional commands, e-mail: users-h...@tapestry.apache.org