Gary Hawkins wrote:
>
> I'm reading a 25 meg file that contains the content of web pages and their
> urls. The entire content of the file is placed in an array and then
> worked-over from there. (I know there are faster ways but ran into a loop snag
> with $/ redefine).
>
> The problem is that when I run the script the array only contains 590 elements,
> when it should have 902. On page 590 are some vertical black bars if I look at
> it in Notepad. Tried the usual trick of opening and saving it in Word, but
> Word doesn't even display those, not even with a placeholder, and a save
> doesn't solve it.
>
> So my theory is that those characters are causing only part of the file to be
> read into the array.

Have you used binmode() on the file handle?

perldoc -f binmode

> Is there some way to find out what their ascii or ansi value is?

Get an editor like UltraEdit which can display the hex values of all the
characters.

> If identifiable, how can they be replaced with carriage returns in a script?

For characters 0 through 12

    tr/\0-\014/\n/;
    # or
    s/[\0-\014]/\n/g;  # slower

John
-- 
use Perl;
program
fulfillment
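
The suggestions in the reply can be combined into one small script. What follows is only a sketch under some assumptions, not a tested fix for this particular file. The helper names (slurp_binary, control_char_counts, replace_control_chars) are made up for illustration. It also assumes you want to leave tab, line feed and carriage return alone, so its character range differs from the \0-\014 range in the reply: it covers the other control characters up to 31. That includes character 26 (Ctrl-Z), which perldoc -f binmode says Microsoft-family systems treat as end of file unless binmode is used. Whether that is what the black bars are is a guess until you look at the counts.

```perl
use strict;
use warnings;

# Hypothetical helper: read a whole file with binmode on the handle, so
# there is no CRLF translation and no special end-of-file character.
sub slurp_binary {
    my ($path) = @_;
    open my $fh, '<', $path or die "Cannot open $path: $!";
    binmode $fh;
    local $/;                 # slurp mode; $/ is restored when the sub returns
    my $data = <$fh>;
    close $fh;
    return $data;
}

# Hypothetical helper: count control characters by ordinal value,
# skipping tab (9), line feed (10) and carriage return (13).
sub control_char_counts {
    my ($data) = @_;
    my %count;
    $count{ ord($1) }++ while $data =~ /([\x00-\x08\x0B\x0C\x0E-\x1F])/g;
    return \%count;
}

# Hypothetical helper: return a copy with those same characters
# replaced by newlines, using tr as in the reply.
sub replace_control_chars {
    my ($data) = @_;
    $data =~ tr/\x00-\x08\x0B\x0C\x0E-\x1F/\n/;
    return $data;
}

# Runs only when a file name is given on the command line.
if (@ARGV) {
    my $data   = slurp_binary($ARGV[0]);
    my $counts = control_char_counts($data);
    for my $code (sort { $a <=> $b } keys %$counts) {
        printf "char %3d (0x%02X): %d occurrence(s)\n",
            $code, $code, $counts->{$code};
    }
    my @lines = split /\n/, replace_control_chars($data);
    printf "%d lines after replacement\n", scalar @lines;
}
```

Run it with the data file as the only argument. It prints the decimal and hex value of each control character it finds, which answers the "what is their value" question without a hex editor, and then reports how many lines result once those characters become newlines.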