On Tue, Jul 10, 2012 at 12:06 -0700, Alan Malloy wrote:
> Just create a Reader over the file, and do something like (take-while
> identity (repeatedly #(read-one-wellformed-xml-tag the-reader))). It needs
> some fleshing out for boundary conditions, but I hope you get the general
> idea.

I am not quite sure that I follow. Are you suggesting that the reader read the file in chunks (i.e. all lines between and including a <DOC> tag and its </DOC> tag) and parse those chunks individually?

Wouldn't it be easier to somehow wrap the input stream in, for example, <xml> and </xml>?

I have the feeling that I misunderstood you, or that you think my problem is that I have to deal with broken tags (e.g. <foo), which is not the case.

Thanks a lot for your help!
--
Wolodja <babi...@gmail.com>

4096R/CAF14EFC
081C B7CD FF04 2BA9 94EA 36B2 8B7F 7D30 CAF1 4EFC
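
To make the wrapping idea from the question above concrete, here is a minimal sketch. It assumes the input is UTF-8, carries no XML declaration or DOCTYPE at the top (either would end up inside the synthetic root and break parsing), and is small enough for clojure.xml/parse to hold the whole tree in memory. The names string-stream, wrap-in-root and sample are hypothetical and exist only for illustration; this is not the chunked Reader approach from the quoted reply.

```clojure
(ns wrap-root-sketch
  (:require [clojure.xml :as xml])
  (:import (java.io ByteArrayInputStream SequenceInputStream)
           (java.util Collections)))

;; Hypothetical helper: turn a string into an InputStream of its UTF-8 bytes.
(defn string-stream [s]
  (ByteArrayInputStream. (.getBytes ^String s "UTF-8")))

;; Hypothetical helper: concatenate "<xml>", the original stream and "</xml>"
;; so that the parser sees a single root element around all <DOC> elements.
(defn wrap-in-root [in]
  (SequenceInputStream.
    (Collections/enumeration
      [(string-stream "<xml>") in (string-stream "</xml>")])))

;; Stand-in for the real file: several top-level <DOC> elements, no root.
(def sample "<DOC><a>1</a></DOC>\n<DOC><a>2</a></DOC>\n")

;; Parse the wrapped stream; the result is a nested map with :tag :xml on top.
(def parsed (xml/parse (wrap-in-root (string-stream sample))))

;; Keep only the <DOC> children of the synthetic root.
(def docs (filter #(= :DOC (:tag %)) (:content parsed)))
```

For a real file, (wrap-in-root (java.io.FileInputStream. path)) would take the place of the in-memory sample, where path is whatever file name applies. If the file is too large to parse in one go, the chunk-by-chunk reading suggested in the quoted reply may be the better fit.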