Gary Hawkins wrote:
> 
> I'm reading a 25 meg file that contains the content of web pages and their
> urls.  The entire content of the file is placed in an array and then
> worked-over from there.  (I know there are faster ways but ran into a loop snag
> with $/ redefine).
> 
> The problem is that when I run the script the array only contains 590 elements,
> when it should have 902.  On page 590 are some vertical black bars if I look at
> it in Notepad.  Tried the usual trick of opening and saving it in Word, but
> Word doesn't even display those, not even with a placeholder, and a save
> doesn't solve it.
> 
> So my theory is that those characters are causing only part of the file to be
> read into the array.

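Have you used binmode() on the file handle?  On Windows, a file read in
text mode is treated as ending at the first Ctrl-Z (character 26, hex 1A),
which could explain an array that stops short.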

perldoc -f binmode
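
A minimal sketch of reading the file with binmode(), assuming the whole
file really is meant to go into an array.  The sub name read_lines_binary
is made up for illustration and is not from the original script.

```perl
use strict;
use warnings;

# Hypothetical helper: read every line of a file in binary mode.
sub read_lines_binary {
    my ($path) = @_;
    open my $fh, '<', $path or die "Cannot open $path: $!";
    binmode $fh;        # no text-mode translation, no Ctrl-Z end-of-file
    my @lines = <$fh>;  # split on $/ (newline by default)
    close $fh;
    return @lines;
}
```

Call it as my @pages = read_lines_binary($file);  With binmode() in
effect on Windows, each line keeps its trailing "\r\n", so strip it with
something like s/\r?\n\z// rather than relying on chomp alone.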


> Is there some way to find out what their ASCII or ANSI value is?

Get an editor like UltraEdit which can display the hex values of all the
characters.
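
If no hex editor is handy, Perl can report the values itself.  This is
only a sketch: the sub name find_control_chars and the sample string are
invented for illustration, and it assumes the odd characters are control
characters (below 32) other than tab, line feed and carriage return.

```perl
use strict;
use warnings;

# Hypothetical helper: return [offset, ordinal] pairs for each control
# character in $data, skipping tab (9), line feed (10) and CR (13).
sub find_control_chars {
    my ($data) = @_;
    my @found;
    while ($data =~ /([\x00-\x08\x0B\x0C\x0E-\x1F])/g) {
        # pos() is the offset just past the match, so step back one
        push @found, [ pos($data) - 1, ord $1 ];
    }
    return @found;
}

# Made-up sample data standing in for the real file contents
for my $hit (find_control_chars("page one\x1Apage two\x00")) {
    printf "offset %d: decimal %d, hex 0x%02X\n",
        $hit->[0], $hit->[1], $hit->[1];
}
```

The file has to be read with binmode() first, otherwise the data handed
to the sub may already have been cut short.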


> If identifiable, how can they be replaced with carriage returns in a script?

For characters 0 through 12 (octal \0 through \014; note that this range
also covers tab and line feed):

tr/\0-\014/\n/;
# or
s/[\0-\014]/\n/g;  # slower
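
The tr/// form also returns how many characters it matched, which can
help confirm that it found anything.  A small sketch, with the sub name
newline_low_chars made up for illustration:

```perl
use strict;
use warnings;

# Hypothetical helper: turn characters 0 through 12 into newlines and
# return the cleaned text plus the number of characters tr/// matched.
sub newline_low_chars {
    my ($text) = @_;
    # The count includes any newlines already present, since "\n" (10)
    # falls inside the range and is "replaced" by itself.
    my $count = ($text =~ tr/\0-\014/\n/);
    return ($text, $count);
}
```

Use it as my ($clean, $n) = newline_low_chars($content);  If $n comes
back as 0, the troublesome characters lie outside the 0 through 12 range
and the range in the tr/// needs adjusting.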




John
-- 
use Perl;
program
fulfillment
