On Oct 14, 2008, at 3:23 AM, Dotan Cohen wrote:
2008/10/14 <[EMAIL PROTECTED]>:
Dotan> Can Python go through a directory of files and replace each
Dotan> instance of "newline-space" with nothing?
Sure. Something like (*completely* untested, so caveat emptor):
import glob
import os
for f in glob.glob('*.vcf'):
# corrupt data
uncooked = open(f, 'rb').read()
# fix it
cooked = uncooked.replace('\n ', '')
# backup original file for safety
os.rename(f, '%s.orig' % f)
# and save it
open(f, 'wb').write(cooked)
Thanks, that's easier than I thought! I am sure with some googling I
will discover how to loop through all the files in a directory. One
question, though, is that code unicode-safe in the event that there
are unicode characters in there?
Sure! all the bytes in multibyte UTF-8 characters are above 128 in
value.
Thus, they won't be replaced.
--
Dotan Cohen
http://what-is-what.com
http://gibberish.co.il
א-ב-ג-ד-ה-ו-ז-ח-ט-י-ך-כ-ל-ם-מ-ן-נ-ס-ע-ף-פ-ץ-צ-
ק-ר-ש-ת
ä-ö-ü-ß-Ä-Ö-Ü
--
http://mail.python.org/mailman/listinfo/python-list
----------------------------------
Skinheads are so tired of immigration, that they are going to move to
a country that don't accept immigrants!
Tommy Nordgren
[EMAIL PROTECTED]
--
http://mail.python.org/mailman/listinfo/python-list