dear Boris,

Thanks for your answer!

Yes it seems to be a fixed-width format. I didn't remember this type of datasets since I am not used to analyze and process them.

Thanks anyway, it seems to fix the problem (I just need to reflect a bit more on the width of each feature)!


Cheers

Jean-Philippe

On 05/10/2017 16:19, Boris Steipe wrote:
Is this a fixed width format?
If so, read.fwf() in base, or read_fwf() in the readr package will solve the 
problem. You may need to trim trailing spaces though.


B.




On Oct 5, 2017, at 10:12 AM, jean-philippe <jeanphilippe.fonta...@gssi.infn.it> 
wrote:

dear R-users,


I am facing a quite regular and basic problem when it comes to dealing with 
datasets, but I cannot find any satisfying answer so far.
I have a messy dataset of galaxies like that :

And XVIII          000214.5+450520  0.69 17   9 0.00  -8.7 26.8 6.44  6.78 < 
6.65  -44  0.5 MESSIER031               0.6  1.54
PAndAS-03          000356.4+405319  0.10 17     0.00  -3.6 27.8 4.38            
        2.8 MESSIER031               2.8  1.75
PAndAS-04          000442.9+472142  0.05 22     0.00  -6.6 23.1 5.59            
  -108  2.5 MESSIER031               2.5  1.75
PAndAS-05          000524.1+435535  0.06 31     0.00  -4.5 25.6 4.75            
   103  2.8 MESSIER031               2.8  1.75
ESO409-015         000531.8-280553  3.00 78  23 0.00 -14.6 24.1 8.10  8.25   
8.10  769 -2.0 NGC0024                 -1.5 -2.05
AGC748778          000634.4+153039  0.61 70   3 0.00 -10.4 24.9 6.39  5.70   
6.64  486 -1.9 NGC0253                 -1.5 -2.72
And XX             000730.7+350756  0.20 33   5 0.00  -5.8 27.1 5.26  5.70      
  -182  2.4 MESSIER031               2.4  1.75

What I would like to do is to read this dataset, but I would like that the 
space between And and XVIII is not interpreted as 2 different columns but as 
the name of the galaxy in one column.
How is it possible to do so?

For instance I did this data1<-read.table("lvg_table2.txt",skip=70,fill=T) 
where I used fill=T because the rows don't have the same number of features since R splits 
the name of the galaxies into 2 columns because of the space.


Best Regards, thanks in advance


Jean-Philippe Fontaine

--
Jean-Philippe Fontaine
PhD Student in Astroparticle Physics,
Gran Sasso Science Institute (GSSI),
Viale Francesco Crispi 7,
67100 L'Aquila, Italy
Mobile: +393487128593, +33615653774

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

--
Jean-Philippe Fontaine
PhD Student in Astroparticle Physics,
Gran Sasso Science Institute (GSSI),
Viale Francesco Crispi 7,
67100 L'Aquila, Italy
Mobile: +393487128593, +33615653774

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to