Hi All,
This is my first post in the community.
I am currently working on finding some inferences from my sample data and
the code I have used is:
inference(y = nc$weight, x = nc$habit, est = "mean", type = "ht", null = 0,
method = "theoretical"). While researching more on the code as I have just
Hi All,
Thanks for extending help on this one. I am able to understand how what it
refers to.
I am using R studio as I think it comes as an inbuilt capability.
--
View this message in context:
http://r.789695.n4.nabble.com/Inference-Syntax-tp4706637p4706677.html
Sent from the R help mailing li
Hi All,
I have data based on truck load for various states.
The data points range from Oct'14 To Mar'15. Now I need to know what was the
difference in load in Nov as compared to Oct in both real numbers as well as
in %. Similarly for all the month in comparison to the previous month.
I am able to
Thanks John for the tip. I will use it and see what is the output. Also I
will share my analysis on R & then you can advice accordingly.
--
View this message in context:
http://r.789695.n4.nabble.com/MOnth-over-Month-Variance-in-tp4706873p4706923.html
Sent from the R help mailing list archive
HI All,
I Am creating a residual plot for my linear model.
the code I created is : plot(eval$bty_avg,residuals,ylab="residuals",
xlab="Score", main = "Residual Analysis")Here data set is eval. eval$bty_avg
is my response variable and residual is the var I have created using resid
function to stor
Hello experts,
I have recently (1month) started using R. Earlier I was using SAS to work on
analytic assignments.
In SAS there is an option - forward selection, backward selection, step wise
selection where in it removes the least impacting predictor variable from
the set of variables based on a
Hi Varun,
Courses offered from Coursera & EDX are very informative and carry details
in depth.
However I agree with your point that these courses are very fast paced &
sometimes very technical in nature. (I found the same when I went for Linear
regression course)
I have also recently started lear
HI All,
I am trying to load an CSV file into the R project. the code for the same
is:
mydata<- read.csv("Jan-May Data.csv", header=TRUE)
however with this I am getting the below error message:
/*Error in file(file, "rt") : cannot open the connection
In addition: Warning message:
In file(file, "rt
Hi Pat,
Thanks for the suggestion. It worked for me.
Actually I had not saved the file in the WD accidentally and with the help
of get files syntax I got to know what was the issue.
Thanks a ton.
Shivi
--
View this message in context:
http://r.789695.n4.nabble.com/Issues-with-loading-csv-fil
Hello All,
I need help on creating a histogram for one of my data. The data is as below
(sample):
MFST_WT Hours PROCESS Month Weekday Day of the Month
6,828 13 INBOUND Mar Fri13
2,504 16 INBOUND Mar Fri27
20
HI Jim,
Thanks for the help however R throws an error when i create a var
tot_mon_wt-
tot_mon_wt<-by(mwlc$MFST_WT,mwlc$Month,sum). It gives me an error =
Error in Summary.factor(c(1L, 1L), na.rm = FALSE) :
‘sum’ not meaningful for factors
Not sure what this error refers to. Thank you, Shivi
HI Team,
A quick question.
When I used the print option in R to see the output of my syntax I do not
see the headers or column names. Is there a way to see the headers in the
print.
Also as most of the datasets we work today have huge number of observations
but when I print it only shows a portion
HI David,
So if I understand from your post below, when we import a file in R- we need
to make sure that the variable names do not have any space nor they should
be in special characters or not in comma format.
Please correct me I am wrong.
Now I have changed the file to a new file as RData.csv f
Hello All,
This is an easy fix but I am not able to find the root cause of the error. I
am trying to upload a csv file but it is throwing an error.
Have done a lot of research on google and some tutorial but cant find a
solution hence please advice:-
Syntax is :- aaa<-read.csv(file ="VehicleData.
This ate my head like for 2 hours. God thanks for the help.
--
View this message in context:
http://r.789695.n4.nabble.com/Error-in-CSV-file-tp4707879p4707882.html
Sent from the R help mailing list archive at Nabble.com.
__
R-help@r-project.org mail
Hello Experts,
I have couple of questions on the analysis I am creating.
1) How does R adopt to changes. The case I have here is that the excel I
have started initially had to be modified because the data I had was on
hourly basis ranging from 0 to 23 hours. After Changes 0 was modified to 24
in h
Hello Experts,
I have couple of questions on the analysis I am creating.
1) How does R adopt to changes. The case I have here is that the excel I
have started initially had to be modified because the data I had was on
hourly basis ranging from 0 to 23 hours. After Changes 0 was modified to 24
in h
Thanks Sarah. This is magical.
Thanks for explaining in such a length.
--
View this message in context:
http://r.789695.n4.nabble.com/Help-on-R-Functionality-Histogram-tp4707886p4707891.html
Sent from the R help mailing list archive at Nabble.com.
Thanks you Sarah. This was very impressive and really helped me out.
--
View this message in context:
http://r.789695.n4.nabble.com/Help-on-R-Functionality-Histogram-tp4707886p4707949.html
Sent from the R help mailing list archive at Nabble.com.
__
Hi All,
I have a data set with 11000 rows & 19 columns.
I have 2 columns on which I need to summarize the data:- Date & Weight.
Snapshot is :
Date
13/03/2015
31/03/2015
15/03/2015
17/03/2015
17/03/2015
11/3/2015
11/3/2015
19/03/2015
CHG_WT
0
0
0
770
3,730
70
10
500
N
Hi Petr,
Thanks for the explanation below.
I tried the code you supplied however it seems as my date is a factor hence
it is not working.
The error I got from the code was :
Error: unexpected symbol in:
"final<-aggregate(test$CHG_WT,list(format(test$CR_DT,"%d"),sum)
final"
str(test$CR_DT)- gives
Hi All,
Kindly see the below code I have used:
maxorder<-ddply(test, ~ ORIGIN,summarize,Weight=sum(CHG_WT))
Here I have written the code to summarize values based on origin and total
weight however I am getting below error:
Error: ‘sum’ not meaningful for factors
Please advice. I need CHG_WT tota
Hi Petr
I researched a lot over the net and R manual as well based on which I
revamped my code and came to the code as:
test$CR_DT <- as.Date(test$CR_DT, '%d-%b-%y')
iii<- aggregate(test$CHG_WT,list(format(test$CR_DT,"%m")),FUN=sum)
However it still gives me the error as below:
Error in Summary
HI All,
I am able to get the desired result. Thanks for extending help.
while reading the csv file I made some changes as :
Test<-read.csv("Testdata.csv", head=TRUE, stringsAsFactors = FALSE,
strip.white = TRUE)
with this character var were not changed to factors.
Then aggregation was simple:
Thank you John for spending time on this query and helping out.
It really helped me and finally i am able to achieve the desired results.
Thanks a ton to all others as well to spending time and furbishing solution.
Regards, Shivi
--
View this message in context:
http://r.789695.n4.nabble.com/
HI All,
I need help on 2 issues as highlighted below"
A)I have 2 variables:- Sch_Time & Origin Name.
Now there are multiple instances where Scheduled time i.e. Sch_Time is
missing from each location hence i need to count how many instances do i
have split on location.
the code i have is :
table(
Hi Petr,
Probably i did not explain my scenario clearly.
table(test$ORIGIN_NAME,is.na(test$SCH_TIME)) is the syntax with which i am
trying to find per destination wise how many instances are there where
system failed to enter the scheduled delivery time & there are multiple
cases of these. I am e
Hi Petr, Please see the output from dget as follows.
ORIGIN ORIGIN_NAME DESTINATION DESTINATION_NM RPS_NO
VENDOR_NAME CR_DT SCHD_MRKT VHL_NO vhl_cap
1 DLI11DELHI-11 NDA50NOIDA-50 1350760
HI Don,
This is the exact result i need. However in my case i am not getting any
value under TRUE whereas FALSE captures total observations in each variable.
Please find the syntax and output from the code:
table(test$ORIGIN_NAME,is.na(test$SCH_TIME))
Output
FALSE
HI Petr,There is no reason for holding back the data from dput format. The
reason for not supplying is that i tried multiple times but it the output
what comes is not really user friendly is what i think.Not sure if i am
missing a trick somewhere as i tried both the dput and dget options. Though
as
Good Morning All,
I have working on a data set where I am finding mean and median for weight
variable on a daily basis.
The code:
aggr<-aggregate(retail$weight,list(retail$ship.date),mean)
This is giving me an accurate result however with 4 decimal places for the
mean weight. In order to restric
HI All,
I have situation where i am aggregating weight on monthly and quarterly
level.
I need to summarize weight on variable ship date i.e. shipping date . As
this date is in a character format so used the conversion as:
Shipdate<-as.Date("retail$ship.date", format="%m-%d-%Y"). But when i see th
Hi Petr,
The solution you shared worked though it does not show any decimal values.
The output is
Group.1 x
11/1/2015309450
2 1/10/2015 332780
Instead of mean i used sum & i think that should be fine.
aggr<-aggregate(retail$weight,list(retail$ship.date),function(x)
round
Hi All,
I am working on a data where the total row count is 25+ and have approx.
20 variables. One of the var on which i need to summarize the data is
Consignor i.e. seller name.
Now the issue here is after deleting all the duplicate names i still have
55000 unique customer name and i am not
Hi All, I am working on a dataset baseball where i am grouping based on one
var income in descending order.
Now i need to find the top 25% of the observations from the income group for
which i used top_n (0.25) but it is not finding the desired.
Can you please suggest.
Baseball%>%
group_by(in
Hello All,
As i am a newbie in R so most of you would have seen this question zillion
times. I searched for the answer on this forum as well on other various
forums however could not find the answer i am looking for.
I am dplyr package and used a very basic code:
select(june,city,state,mod)
HI Boris,
The reason i want to see or show 3 million rows in console is that i need to
present it to a business user.
So here my end objective is to present the final output to the business
user. So lets say when i write a code:
select(june,waybill:type,contains("sfxcode")) so here there could b
Hi Loris,
I have already tried options(max.print=99) but does not show the desired
result.
As posted above it want to share the outcome with the business owner where
there could be multiple entries.
--
View this message in context:
http://r.789695.n4.nabble.com/Output-In-R-tp4711227p47
Hi Jim,
Please see the sample code:
ak<-read.csv("June.csv", header = TRUE)
ak%>%select(sfxcode,mod,chargedweight)%>%filter(mod=='AIR')
what i am trying to find is selecting the required var and then selecting
only AIR as a mode of transportation from mod.
I am getting the output but the total ro
Thanks Jeff, this is helpful.
The reason i am curious to know this is because I have worked for a long
duration in SAS where in it gives us the flexibility to create a data set of
our analysis and then we can easily detail out the same to the end user.
In R seems like View or Sweave or Shiny are
Hi All,
I am using dplyr package and need to find total bills booked grouped on a
date level however my date is integer.
In the code below i was trying to change date format from integer. However
it is throwing an error:
"no applicable method for 'group_by_' applied to an object of class
"c('int
Hi John,
Sorry but if this sounds really as a newbie question.
I looked at the data as you suggested using glimpse(name of the dataset)
and then View(dput(head(ak,20))) to capture it as a table. Is there an
option where i can save this view table & share along.
I used dput but that would not w
42 matches
Mail list logo