>> -----Original Message-----
>> From: Michael Higgins [mailto:[EMAIL PROTECTED] 
>> 
>> Hello, List-ers --
>> 
>> I've come across a problem, unsure where to ask, so 
>> subscribed here. I upload a file through a browser. It's a 
>> '.txt' file and it comes as text/html. 
>> 
>> However, I've found some hyphen and single-quote like 
>> characters that are in this text file are from a higher 
>> codepoint... or something. What _seems_ to happen is the 
>> browser is stripping them and my script isn't getting all the 
>> info to dump into my database.

[8<]

>> 
> -----Original Message-----
> From: Scott Statland [mailto:[EMAIL PROTECTED] 
> 
> The characters that you are describing, may need to be 
> escaped or have their codes entered. 
> It sounds like that they may have special meanings in either 
> the scripting language or in the html output.

Hmm.

I guess my question wasn't clear. The issue is a file upload that is tagged
as text/html but has wide characters in it. The file doesn't make it out of
the browser right AFAICT. (If this is obviously incorrect, please post the
correction!)

A little more pain and research let me to find this:

open F, '<', $ARGV[0] or die $!;
for (<F>){
s/([^\x00-\x7f])/sprintf('&#%d;', ord($1))/ge;
print
} 

... helpful code snippet, which applied to my files before they are uploaded
gives me a new text file with lines like: "Regarding the box &#150; the
driver wouldn&#146;t".

The cool part is that it is uploaded fully and when viewed in a browser the
characters are displayed correctly. Duh.

Now, if I could only get the browser to fix it up like this when sending...
rather than what it was doing. Since it's going to a *nix box, I don't care
about the text/binary thing, right? I guess I could test from a 'nix Firefox
and see if the behaviour is different.

Anyone have a thought on what is happening that the browser upload fails to
accommodate text with wide chars? I don't know how it determines ... maybe
if the first char was wide, it'd go up as a different mimetype?

Cheers,

 
Michael Higgins



-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/


Reply via email to