[Announce] CyberNeko HTML Parser 1.9.6 Release

2007-12-14 Thread Andy Clark
spamming a bunch of the Apache lists but it's been a lng time since the last release and wanted to reach as many people as possible. -- Andy Clark - [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For addit

Future of NekoHTML

2007-04-19 Thread Andy Clark
project. If you feel that it's not the right place for the code, that's fine. In that case, I'll start a project at SourceForge so that NekoHTML has a permanent home for future development. Thoughts? [1] http://people.apache.org/~andyc/nek

Re: Parsing HTML

2005-10-06 Thread Andy Clark
o a file but it also supports a DOM result. There are a number of other HTML parsers available but I have less experience with them and, from what I've seen, most have custom programming interfaces. So evaluate a few of the available options and choose the one that works best for you and

Re: Query reg. NekoDTD and DOMParser

2005-09-07 Thread Andy Clark
t on the Xerces Native Interface (XNI). If you have any specific questions about the tool, you can write to me directly. -- Andy Clark * [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

Re: going crazy with this: org.xml.sax.SAXParseException: Content is not allowed in prolog

2005-07-31 Thread Andy Clark
on) is valid in that encoding. All except for the UTF-8 byte order mark which ends up looking like "content [that] is not allowed in [the] prolog". Even constructing an input stream reader with the encoding set to "UTF-8" doesn't help because that will use the Java UTF-8 read

Re: going crazy with this: org.xml.sax.SAXParseException: Content is not allowed in prolog

2005-07-27 Thread Andy Clark
attach the first few lines of the file to a followup message? (Attach, not paste.) -- Andy Clark * [EMAIL PROTECTED] - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]

[Announce] CyberNeko Tools for XNI 2005.06.18 Available

2005-06-18 Thread Andy Clark
n return start/ end line and column information but many users have requested character offset information based on the beginning of the file. So I'll be working on that feature and fixing more bugs. As always, the code is available at the following URL: http://www.apache.org/~andyc/neko/doc/in