Cool; I tend to do a lot of that from Ruby, using Hpricot.

On Thu, May 7, 2009 at 2:28 PM, Martin Strand
<do.not.eat.yellow.s...@gmail.com> wrote:
> If you need to parse HTML input from a rich text editor, a remote website,
> uploaded documents, etc., I would recommend NekoHTML:
> http://nekohtml.sourceforge.net/
>
> It cleans up broken HTML, and you can easily add a filter to only allow
> certain tags:
> http://nekohtml.sourceforge.net/filters.html
>
> Martin
>
> On Thu, 07 May 2009 22:58:56 +0200, Howard Lewis Ship <hls...@gmail.com> 
> wrote:
>
>> I'd tend to do this on the other end, if possible; parse user input
>> (or RSS feed, or whatever) into XML and transform out the content you
>> don't like, then store that in the DB or render it raw.
>>
>> On Thu, May 7, 2009 at 1:26 PM, Christian Edward Gruber
>> <christianedwardgru...@gmail.com> wrote:
>>> Hi,
>>>
>>>    I was considering how to write a minimal markup component - that is, a
>>> component which encodes raw text into safe text much the same way
>>> MarkupWriterImpl passes text to a Text node, which encodes HTML entities,
>>> etc., but in this case lets a few of them pass.
>>>
>>>    One simple way is just to create an <t:output/>-like component which
>>> pre-encodes before passing on the text to the MarkupWriter, but I think
>>> that's sort of a cheap hack.  Can I contribute an alternate MarkupWriter
>>> implementation with a marker annotation that I can obtain for beginRender()?
>>>
>>>    The use case I'm trying to solve (without a lot of extra
>>> code/storage) is to allow extremely minimal markup through - specifically
>>> <p>, <strong>, <em>, <ul>, <dl>, <ol>, <li>, <table>, <tr>, <th>, <td>.
>>> It would disallow all other markup, and strip out any style, class, or
>>> id attributes. It's to allow a bit of (safe) data entry that can include
>>> some rendering hints.
>>>
>>>    Probably my first go will be to create a component that pre-encodes and
>>> does a MarkupWriter.writeRaw() with the results, but it feels like a bad
>>> hack.
>>>
>>> cheers,
>>> Christian.
>>>
>>> Christian Edward Gruber
>>> e-mail: christianedwardgru...@gmail.com
>>> weblog: http://www.geekinasuit.com/
>>>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: users-unsubscr...@tapestry.apache.org
> For additional commands, e-mail: users-h...@tapestry.apache.org
>
>
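
For what it's worth, the "pre-encode, then write raw" idea Christian describes above could be sketched roughly like this in plain Java. This is only an illustration under some assumptions: MinimalMarkup and sanitize() are hypothetical names (nothing here is Tapestry or NekoHTML API), the tag list is copied from his message, and a regex is standing in for a real HTML parser, so it does not repair unbalanced or badly nested tags.

```java
import java.util.regex.Pattern;

/**
 * Hypothetical helper (not part of Tapestry): escape everything first,
 * then restore a small whitelist of tags, dropping any attributes.
 */
final class MinimalMarkup {

    // Matches an already-escaped opening or closing tag from the whitelist,
    // optionally followed by whitespace and attribute text up to the first "&gt;".
    private static final Pattern ALLOWED = Pattern.compile(
        "&lt;(/?)(p|strong|em|ul|ol|dl|li|table|tr|th|td)(?:\\s(?:(?!&gt;).)*)?&gt;",
        Pattern.CASE_INSENSITIVE | Pattern.DOTALL);

    private MinimalMarkup() {
    }

    static String sanitize(String raw) {
        // Step 1: encode every special character, '&' first so that the
        // entities produced by the later replacements are not double-escaped.
        String escaped = raw
            .replace("&", "&amp;")
            .replace("<", "&lt;")
            .replace(">", "&gt;")
            .replace("\"", "&quot;");

        // Step 2: turn whitelisted tags back into real markup. Only the
        // slash and the tag name are kept, so style/class/id (and every
        // other attribute) are discarded.
        return ALLOWED.matcher(escaped).replaceAll("<$1$2>");
    }
}
```

With this sketch, MinimalMarkup.sanitize("<p class=\"x\">a & b</p>") yields "<p>a &amp; b</p>", while "<script>alert(1)</script>" comes back fully escaped and so renders as visible text. The result is what a component would hand to MarkupWriter.writeRaw(). Escaping first and whitelisting second means anything the pattern does not recognise stays encoded, which seems like the safer default; cleaning up broken markup is still a job for a real parser such as the NekoHTML filters Martin mentions.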



-- 
Howard M. Lewis Ship

Creator of Apache Tapestry
Director of Open Source Technology at Formos
