Any ideas appreciated.

I need to parse some really large (> 25MB) sgml files. The files are
just database dumps essentially, with each record looking something
like this:
<reportid>
    <primarystuff>STUFF</primarystuff>
        <secondarystuff>
            stuff
        </secondarystuff>
    </primarystuff>
</reportid>

The records are not overly complicated, but I've never tried XML or
SGML parsing before and am at a loss on how to approach it. The
files are way too big just to slurp in and play around with. Can I
set he record separator to '</reportid>'? Or is that a stupid way to
approach this?  Suggestion/modules would be great.

Thanks,
J



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>


Reply via email to