On Tue, Feb 15, 2011 at 12:21:18PM -0500, Robert M. Flight wrote:
> Say I have a tab-delimited table I want to read into R. What should I
> expect to happen if some of the entries contain the character " ' "? I
> thought it would read the file fine, but that is not what happens.
> Instead, all the values in between two " ' "s get read into one field,
> and things are just seriously messed up. Is this a bug, and besides
> removing the offending characters, is there a fix?
> 
> Example Input file:
> 
> testFile.txt:
> 3499  9031    424823  COP'B2  118094989       XP_422637.2
> 3499  7955    114454  copb2   50080158        NP_001001940.1
> 3499  7227    45757   betaCop 24584107        NP_524836.2
> ...
> 
> testDat <- read.table('testFile.txt',sep='\t')
> testDat

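This is not a bug.  By default, read.table treats both " and ' as
quoting characters, so the apostrophe in COP'B2 opens a quoted string
that runs until the next apostrophe in the file.  I believe you want
to disable quoting: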

testDat <- read.table('testFile.txt',sep='\t',quote="")

Ref.:

   quote: the set of quoting characters. To disable quoting altogether,
          use 'quote = ""'.  See 'scan' for the behaviour on quotes
          embedded in quotes.  Quoting is only considered for columns
          read as character, which is all of them unless 'colClasses'
          is specified.
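
For what it's worth, here is a minimal sketch that reproduces the case
with the three sample rows from your message.  The temporary file and
the stringsAsFactors = FALSE argument are my own additions for
illustration (so the fourth column comes back as plain character
strings); they are not part of your original call.

```r
# Write the three sample rows to a temporary file (a hypothetical
# stand-in for testFile.txt), with real tab separators.
tmp <- tempfile(fileext = ".txt")
writeLines(c(
  "3499\t9031\t424823\tCOP'B2\t118094989\tXP_422637.2",
  "3499\t7955\t114454\tcopb2\t50080158\tNP_001001940.1",
  "3499\t7227\t45757\tbetaCop\t24584107\tNP_524836.2"
), tmp)

# quote = "" disables quoting, so the apostrophe is read as an
# ordinary character instead of opening a quoted string.
testDat <- read.table(tmp, sep = "\t", quote = "",
                      stringsAsFactors = FALSE)

dim(testDat)     # 3 rows, 6 columns
testDat[1, 4]    # "COP'B2"
```

If the apostrophe had been treated as a quote, the first row would not
end where the line ends, so checking dim() against the expected number
of rows and columns is a quick way to confirm the file was parsed as
intended.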

>...

Peace,
david
-- 
David H. Wolfskill                              da...@catwhisker.org
Depriving a girl or boy of an opportunity for education is evil.

See http://www.catwhisker.org/~david/publickey.gpg for my public key.


______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
