Any ideas appreciated.
I need to parse some really large (> 25MB) sgml files. The files are
just database dumps essentially, with each record looking something
like this:
<reportid>
<primarystuff>STUFF</primarystuff>
<secondarystuff>
stuff
</secondarystuff>
</primarystuff>
</reportid>
The records are not overly complicated, but I've never tried XML or
SGML parsing before and am at a loss on how to approach it. The
files are way too big just to slurp in and play around with. Can I
set he record separator to '</reportid>'? Or is that a stupid way to
approach this? Suggestion/modules would be great.
Thanks,
J
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>