If you need to parse html input, from a rich text editor, a remote website, 
uploaded documents, etc, I would recommend nekohtml:
http://nekohtml.sourceforge.net/

It cleans up broken html and you can easily add a filter to only allow certain 
tags:
http://nekohtml.sourceforge.net/filters.html

Martin

On Thu, 07 May 2009 22:58:56 +0200, Howard Lewis Ship <hls...@gmail.com> wrote:

> I'd tend to do this on the other end, if possible; parse user input
> (or RSS feed, or whatever) into XML and transform out the content you
> don't like, then store that in DB or render it raw.
>
> On Thu, May 7, 2009 at 1:26 PM, Christian Edward Gruber
> <christianedwardgru...@gmail.com> wrote:
>> Hi,
>>
>>    I was considering how to write a minimal markup component - that is a
>> component which can handle encoding raw text into safe text much the same
>> way MarkupWriterImpl passes to a Text node which encodes html entities,
>> etc., but in this case, letting a few of them pass.
>>
>>    One simple way is just to create an <t:output/> like component which
>> pre-encodes before passing on the text to the MarkupWriter, but I think
>> that's sort of a cheap hack.  Can I contribute an alternate MarkupWriter
>> implementation with a marker annotation that I can obtain for beginRender()?
>>
>>    The use case I'm trying to solve (without doing a lot of extra
>> code/storage, is to allow extremely minimal markup through - specifically
>> <p>,<strong>,<em>, <ul>, <dl>, <ol>, <li>, <table><tr><th><td>.  It would
>> disallow all other markup, and strip out any style, class, or id attributes.
>>  It's to allow a bit of (safe) data entry that can include some rendering
>> hints.
>>
>>    Probably my first go will be to create a component that pre-encodes and
>> does a MarkupWriter.writewRaw() with the results, but it feels like a bad
>> hack.
>>
>> cheers,
>> Christian.
>>
>> Christian Edward Gruber
>> e-mail: christianedwardgru...@gmail.com
>> weblog: http://www.geekinasuit.com/
>>

---------------------------------------------------------------------
To unsubscribe, e-mail: users-unsubscr...@tapestry.apache.org
For additional commands, e-mail: users-h...@tapestry.apache.org

Reply via email to