On Tue, Jul 10, 2012 at 12:06 -0700, Alan Malloy wrote:
>    Just create a Reader over the file, and do something like (take-while
>    identity (repeatedly #(read-one-wellformed-xml-tag the-reader))). It needs
>    some fleshing out for boundary conditions, but I hope you get the general
>    idea.

I am not quite sure that I follow. Are you suggesting that the reader read
the file in chunks (i.e. all lines between and including a <DOC> tag and its
closing </DOC> tag) and that each chunk be parsed individually?
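
If that reading is right, I imagine it would look roughly like the sketch
below. This is only my guess at the idea: read-doc-chunk, parse-chunk and
doc-seq are names I made up, and the sketch assumes that every </DOC> sits on
a line of its own and that the input is UTF-8.

```clojure
(require '[clojure.xml :as xml]
         '[clojure.string :as str])
(import '(java.io BufferedReader ByteArrayInputStream))

(defn read-doc-chunk
  "Hypothetical helper: reads lines from rdr up to and including the next
  line that contains </DOC> and returns them as one string. Returns nil
  once the reader is exhausted and nothing was collected."
  [^BufferedReader rdr]
  (loop [acc []]
    (let [line (.readLine rdr)]
      (cond
        ;; end of input: hand back whatever is left over, or nil
        (nil? line) (when (seq acc) (str/join "\n" acc))
        ;; closing tag found: this chunk is complete
        (.contains ^String line "</DOC>") (str/join "\n" (conj acc line))
        :else (recur (conj acc line))))))

(defn parse-chunk
  "Parses one chunk with clojure.xml/parse (assumes UTF-8)."
  [^String chunk]
  (xml/parse (ByteArrayInputStream. (.getBytes chunk "UTF-8"))))

(defn doc-seq
  "Lazy seq of parsed <DOC> elements. The reader has to stay open while
  the seq is being consumed."
  [^BufferedReader rdr]
  (->> (repeatedly #(read-doc-chunk rdr))
       (take-while some?)
       (map parse-chunk)))
```

Because the seq is lazy, only one <DOC> would need to be held in memory at a
time, provided the head of the seq is not retained.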

Wouldn't it be easier to somehow wrap the inputstream in, for example, <xml>
and </xml> ?
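
What I have in mind is something like the following sketch. It chains three
streams with java.io.SequenceInputStream so that the parser sees a single
root element; wrap-in-root and bytes-stream are hypothetical names, and it
assumes that the file is UTF-8 and does not start with an <?xml ...?>
declaration (which would no longer be at the very beginning once wrapped).

```clojure
(require '[clojure.xml :as xml])
(import '(java.io ByteArrayInputStream InputStream SequenceInputStream)
        '(java.util Collections))

(defn bytes-stream
  "Hypothetical helper: an InputStream over the UTF-8 bytes of s."
  [^String s]
  (ByteArrayInputStream. (.getBytes s "UTF-8")))

(defn wrap-in-root
  "Returns an InputStream that yields <xml>, then the contents of in,
  then </xml>, without copying the underlying data."
  [^InputStream in]
  (SequenceInputStream.
    (Collections/enumeration
      [(bytes-stream "<xml>") in (bytes-stream "</xml>")])))

;; Usage: the <DOC> elements end up as children of the artificial root.
(def parsed
  (xml/parse
    (wrap-in-root
      (bytes-stream "<DOC><a>1</a></DOC><DOC><a>2</a></DOC>"))))

(map :tag (:content parsed))
```

One caveat: clojure.xml/parse builds the whole tree in memory, so for a very
large file this would presumably need a streaming parser instead.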

I have the feeling that either I misunderstood you, or you think that my
problem is dealing with broken tags (e.g. <foo), which is not the case.

Thanks a lot for your help!
-- 
Wolodja <babi...@gmail.com>

4096R/CAF14EFC
081C B7CD FF04 2BA9 94EA  36B2 8B7F 7D30 CAF1 4EFC
