On Oct 14, 2008, at 3:23 AM, Dotan Cohen wrote:

2008/10/14  <[EMAIL PROTECTED]>:

  Dotan> Can Python go through a directory of files and replace each
  Dotan> instance of "newline-space" with nothing?

Sure.  Something like (*completely* untested, so caveat emptor):

  import glob
  import os

  for f in glob.glob('*.vcf'):
      # corrupt data
      uncooked = open(f, 'rb').read()
      # fix it
      cooked = uncooked.replace('\n ', '')
      # backup original file for safety
      os.rename(f, '%s.orig' % f)
      # and save it
      open(f, 'wb').write(cooked)


Thanks, that's easier than I thought! I am sure with some googling I
will discover how to loop through all the files in a directory. One
question, though, is that code unicode-safe in the event that there
are unicode characters in there?


Sure! all the bytes in multibyte UTF-8 characters are above 128 in value.
Thus, they won't be replaced.
--
Dotan Cohen

http://what-is-what.com
http://gibberish.co.il
א-ב-ג-ד-ה-ו-ז-ח-ט-י-ך-כ-ל-ם-מ-ן-נ-ס-ע-ף-פ-ץ-צ- ק-ר-ש-ת

ä-ö-ü-ß-Ä-Ö-Ü
--
http://mail.python.org/mailman/listinfo/python-list

----------------------------------
Skinheads are so tired of immigration, that they are going to move to a country that don't accept immigrants!
Tommy Nordgren
[EMAIL PROTECTED]



--
http://mail.python.org/mailman/listinfo/python-list

Reply via email to