Read Gabor's advice more closely: Especially unlucky "#" (hint, hint) 13.

On Sep 21, 2009, at 11:08 PM, Peng Yu wrote:

Here are the outputs.

strsplit(scanned_file[5205],'\t')[[1]]
[1] "6836237"
[2] "8.146431"
[3] "8.197432"
[4] "8.156005"
[5] "7.98905"
[6] "8.327593"
[7] "7.673796"
[8] "8.119687"
[9] "8.077252"
[10] "Asap1 "
[11] "NM_010026 "
[12] "RefSeq "
[13] "Mus musculus ArfGAP with SH# domain, ankyrin repeat and PH
domain1 (Asap1), mRNA. "
[14] "FALSE"
[15] "GO:0032312 "
[16] "regulation of ARF GTPase activity "
[17] "GO:0005737  // GO:0016020 "
[18] "cytoplasm  // membrane "
[19] "GO:0005096  // GO:0005515  // GO:0008060  // GO:0008270  //
GO:0046872 "
[20] "GTPase activator activity  // protein binding  // ARF GTPase
activator activity  // zinc ion binding  // metal ion binding "
[21] "---"
[22] "---"
scanned_file[5205]
[1] "6836237 \t8.146431 \t8.197432 \t8.156005\t7.98905\t8.327593\t7.673796\t8.119687\t8.077252\tAsap1
\tNM_010026 \tRefSeq \tMus musculus ArfGAP with SH# domain, ankyrin
repeat and PH domain1 (Asap1), mRNA. \tFALSE\tGO:0032312 \tregulation
of ARF GTPase activity \tGO:0005737  // GO:0016020 \tcytoplasm  //
membrane \tGO:0005096  // GO:0005515  // GO:0008060  // GO:0008270  //
GO:0046872 \tGTPase activator activity  // protein binding  // ARF
GTPase activator activity  // zinc ion binding  // metal ion binding
\t---\t---"


On Mon, Sep 21, 2009 at 9:34 PM, Gabor Grothendieck
<ggrothendi...@gmail.com> wrote:
Its highly unusual to use xls as the extension for a text file.
Use something more suggestive.

print out the line in question.  For example, note that scan
and read.table have different defaults for the comment character,
namely, none and #.

On Mon, Sep 21, 2009 at 10:23 PM, Peng Yu <pengyu...@gmail.com> wrote:
On Mon, Sep 21, 2009 at 9:12 PM, Peng Yu <pengyu...@gmail.com> wrote:
Hi,

I have the following commands. It says line 5205 does not have 22
elements. But I use my 'vim' checked that line in the file. It has 22
fields. Can somebody let me know how to further debug this case?

Regards,
Peng

annotation = read.table("../EC_results/ Juan_15wks_gene_core.xls", header=T, sep='\t',quote='')
Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings, :
 line 5204 did not have 22 elements
annotation = count.fields("../EC_results/ Juan_15wks_gene_core.xls", sep='\t',quote='')
which(annotation!=22)
[1] 5205



I also run the following command to test, which confirms that line
5205 has 22 elements. Is it a bug in read.table?

scanned_file = scan("../EC_results/Juan_15wks_gene_core.xls", what=character(),sep='\n',quote='')
Read 23333 items
length(strsplit(scanned_file[5205],'\t')[[1]])
[1] 22

Regards,
Peng

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.



______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

David Winsemius, MD
Heritage Laboratories
West Hartford, CT

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to