On Oct 13, 2010, at 11:52 PM, Santosh Srinivas wrote:
I have a question about the output given below after running a few
lines of code. Surely a 101 query!
MF_Data <- read.csv("MF_Data_F.txt", header = FALSE, sep = "|")
temp <- head(MF_Data)  # get a sample of the data
# V1, V4 and V6 are the column names holding the relevant data
temp1 <- subset(temp, select = c(V1, V4, V6))
names(temp1) <- c('Ticker', 'Price', 'Date')  # adjusted column names
Now as expected, I get:
temp1
Ticker Price Date
1 106270 10.3287 01-Apr-2008
2 106269 10.3287 01-Apr-2008
3 102767 12.6832 01-Apr-2008
4 102766 10.5396 01-Apr-2008
5 102855 9.7833 01-Apr-2008
6 102856 12.1485 01-Apr-2008
BUT, for the below:
temp1$Price
[1] 10.3287 10.3287 12.6832 10.5396 9.7833 12.1485
439500 Levels: -101.2358 -102.622 -2171.1276 -6796.4926 -969.5193 ... Repurchase Price
What is this line? "439500 Levels: -101.2358 -102.622 -2171.1276
-6796.4926 -969.5193 ... Repurchase Price"??
It tells you that the Price column was constructed as a factor. At
least one item in the input data could not be coerced to numeric (the
levels listing shows a "Repurchase Price" entry), so the column looked
like a character variable, and the default stringsAsFactors setting of
TRUE resulted in classifying that column as a factor rather than as
numeric (or character). Your Date column is surely a factor variable
as well.
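
As a rough illustration of the point above, here is a minimal sketch of
how a single non-numeric entry affects a column and how the offending
values might be located. The vector price_raw is a hypothetical stand-in
for the real Price column, not the actual data.

```r
# Hypothetical stand-in for the Price column: numeric text plus one stray label.
price_raw <- c("10.3287", "12.6832", "Repurchase Price", "9.7833")
price <- factor(price_raw)

# as.numeric() applied directly to a factor returns the internal level
# codes, not the printed values, so it must not be used for conversion.
codes <- as.numeric(price)

# Converting via character recovers the numbers; entries that cannot be
# parsed become NA (the coercion warning is suppressed deliberately here).
price_num <- suppressWarnings(as.numeric(as.character(price)))

# The entries that failed to parse are the ones that forced the factor.
bad <- unique(as.character(price)[is.na(price_num)])
print(bad)
```

Inspecting the entries collected in bad should show what needs to be
cleaned or treated as missing before the column can be used as numeric.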
You may want to look at colClasses in the read.table help page.
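
One possible way to use colClasses is sketched below. The sample lines
are invented for illustration (the real layout of MF_Data_F.txt is an
assumption), and reading every column as character is just one choice
among several. Note that from R 4.0.0 onward stringsAsFactors defaults
to FALSE, so the factor conversion no longer happens by default.

```r
# Hypothetical pipe-delimited sample; the real file layout is assumed.
sample_text <- paste(
  "106270|a|b|10.3287|c|01-Apr-2008",
  "102767|a|b|12.6832|c|01-Apr-2008",
  "102855|a|b|Repurchase Price|c|01-Apr-2008",
  sep = "\n"
)

# Read every column as character so nothing is turned into a factor.
MF_Data <- read.csv(text = sample_text, header = FALSE, sep = "|",
                    colClasses = "character")

temp1 <- MF_Data[, c("V1", "V4", "V6")]
names(temp1) <- c("Ticker", "Price", "Date")

# Convert explicitly; unparseable prices become NA instead of changing
# the type of the whole column.
temp1$Price <- suppressWarnings(as.numeric(temp1$Price))

# %b matches abbreviated month names in the current locale, so this
# assumes an English locale for dates such as "01-Apr-2008".
temp1$Date <- as.Date(temp1$Date, format = "%d-%b-%Y")

str(temp1)
```

With the real file, the text argument would be replaced by the file
name, as in the original read.csv call.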
The read.zoo function in the zoo package may have better behavior for
this sort of data input task.
Many thanks for the help.
Santosh
--
David.
______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help