[R] Mailinglist
Dear Mr/Mrs, This is my first time working in R studio. I have a database of 36 participants but it has 150600 entries. Column - Column - Column- Column Participant Activityprobe - Activity Level - High/low/none Participant Screenprobe - screenon/off - Participant SMSprobe etc Participant CallLogProbe etc. I need a code that helps me count the activity level of all the participants High activity level. No activity level and Low activity level. And to help me find out for every participant what the percentages are of all their high/no/low activity. For screenprobe I need to count how many times the participant turned their screen on and how many times they turned it off and the percentage of screen on/off. For callLog I need to count how many times each participant got called and the percentage. For SMS I need to count the number of SMS for each participant and their percentage. I also need to categorize the probes. So that my database shows all the activity levels first, organized by none/high/low and then all the screenprobes, organized by on and off etc... I hope that my description is clear and that you can maybe help me. Best, Rachel [[alternative HTML version deleted]] __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Mailinglist
Hi Jim, Thank you for your email and information It is a CVS file which I imported in Rstudio. I will look into what you told me and see if I am able to figure it out. Best, Rachel On Sun, Jan 6, 2019 at 4:12 AM Jim Lemon wrote: > Hi Rachel, > It looks to me as though the first thing you want to do is to get your > data, which you attach as images, into a data frame. If these are flat > files like CSV or TAB, you should be able to read them in with some > variant of the read.table function. If Excel, look at the various > Excel import packages. Then you can operate on the data frame by doing > things like tabulating Participant ID against the code for SMS or call > (which I assume are those 3000+ numbers). You can take the differences > in what look like POSIX time values between successive TRUE and FALSE > screen values to get the duration of screen activity and it looks like > participant activity is recorded at regular intervals. As Jeff > suggested, this is really just boring work figuring out how to extract > the events: > > call_indices<-which(Probetype == xxCallLogProbe & ValueSpecified > == _id & Valuedetailed ==3271) > > using suitable logical statements and then tabulating them by > ParticipantID. If you know how to do that in SPSS, it won't be too > hard to translate the logical statements into R syntax as above. I may > have misunderstood the variable names, but I think the logic is clear. > > Jim > > On Sun, Jan 6, 2019 at 4:07 PM Rachel Thompson > wrote: > > > > Hi Jim, > > > > Thank you for the clarification. Since I only work in SPSS and I am from > Amsterdam I have had problems with specifying what I am trying to do in > this specific program and also in clear English language. > > > > I think I want to indeed aggregate these events for each subject over > the observation. But in this case several observations. > > 1. I want to have a summary of how many times a specific subject got > called (CallLogProbe) > > 2. I want to have a summary of how many times a specific subject got a > text message (SMS probe) > > 3. I want to have a summary of how many times a specific subject > > - Turned their screen on - True (ScreenProbe) > > - Or did not turn their screen on - False (ScreenProbe) > > 4. I want to have a summary of the activity level of a specific subject > > - Activity level - none (ActivityProbe) > > - Activity level- low (ActivityProbe) > > - Activity level - High (ActivityProbe) > > > > I want to do this for all the 36 subjects(Participants). > > > > In the end, I have to define percentages, so I am able to say...Subject > 36 has low social interactions ( because they only got called and texted > 500 times in total, while the average of all the participants is 1 or > something). I have to come up with the percentages myself and define cutoff > points of what is considered low-medium-high, based on what the results of > all the subjects are. > > > > I hope that I am as clear as possible . > > > > > > I feel as if I am on my way of understanding it, but since I do not > clearly know, I am trying out a lot of different codes etc. and I do not > know if I am doing the right thing. I indeed made a new data frame etc, but > I still feel a bit lost. Do I need to make one per subject or per Probe > etc.. > > > > > > Thanks for your help. I hope that you can help me resolve this issue. > > > > > > Best, > > > > > > Rachel > > > > > > > > > > > > > > On Sat, Jan 5, 2019 at 9:03 PM Jim Lemon wrote: > >> > >> Hi Rachel, > >> I'll take a guess and assume that you are monitoring the mobile phones > >> of 36 people, adding an observation every time some specified change > >> of state is sensed on each device. I'll also assume that you are only > >> recording four types of measurement. It seems that you want to > >> aggregate these events for each subject over the interval or > >> observation (or over each day or something). I think you are going to > >> create a new data frame of these summaries from the one you have of > >> individual observations. Creating each summary doesn't look too hard, > >> but you will have to define more precisely what you want those > >> summaries to be. For instance, "I want the mean activity level for > >> each subject during the overall time that their mobile phone is > >> switched on", One you have clearly defined your goals, it probably > >> won't be too hard to get to them. > >> > >> Jim >
Re: [R] Mailinglist
Hi Jeff, Thanks for your email. I am an intern from Amsterdam and I have to do an analysis in R. I spoke to my professor in Amsterdam and my supervisor's here in Boston. But they are to busy to help. I informed them from the start that I am not familiar with R(Rstudio) and they told me that I would receive guidance. So since they can not help me, I decided to share my problem online. (It is a CVS file imported into R) Please understand that I am new to this. I will unsubscribe to the mailing list if my question does not belong here. Thanks, Rachel On Sun, Jan 6, 2019 at 11:01 AM Jeff Newmiller wrote: > I would not want to leave the impression that I think the task at hand is > merely tedious... my point is that there are numerous steps involved and > each step depends on information that has not been communicated to the > list, and there is a learning curve even in knowing what to include in an > email question. What I do think is that knowing enough basic R syntax to > express small bits of the problem in R will be a vast improvement over > attempting to use only English descriptions, and Rachel has to bridge that > initial gap. > > For example, some images of data were apparently sent to Jim only, yet he > still does not know in what format the data file is stored, so that > technique was not very effective. One way for the question to become more > focused is for Rachel to study up on her own how to import data and provide > us with a "dput" (see the StackOverflow discussion I referenced before) of > a small sample of data. Another is for Rachel to use basic R syntax to > create an anonymous data set from scratch (also outlined in the SO > discussion). These approaches allow us to keep the focus of our mailing > list discussion on manipulating the data into summaries. Another approach > is to re-focus the question on importing data by supplying a download link > to the data so we can make suggestions as to what R commands will handle > this data in its raw form. In any case, we cannot leapfrog over the data to > the analysis as the question stands. > > Given the above, I have to wonder why Rachel hasn't simply used the tool > she is familiar with... SPSS... to do this? If it is because this is an > academic assignment to learn R then she should be talking to her > institutional support (instructor/teaching assistant/tutoring staff) anyway > since there is a no-homework policy on this list (and that avenue would > have the benefit of being conducted orally and most likely in her native > language). > > > On January 6, 2019 1:12:46 AM PST, Jim Lemon wrote: > >Hi Rachel, > >It looks to me as though the first thing you want to do is to get your > >data, which you attach as images, into a data frame. If these are flat > >files like CSV or TAB, you should be able to read them in with some > >variant of the read.table function. If Excel, look at the various > >Excel import packages. Then you can operate on the data frame by doing > >things like tabulating Participant ID against the code for SMS or call > >(which I assume are those 3000+ numbers). You can take the differences > >in what look like POSIX time values between successive TRUE and FALSE > >screen values to get the duration of screen activity and it looks like > >participant activity is recorded at regular intervals. As Jeff > >suggested, this is really just boring work figuring out how to extract > >the events: > > > >call_indices<-which(Probetype == xxCallLogProbe & ValueSpecified > >== _id & Valuedetailed ==3271) > > > >using suitable logical statements and then tabulating them by > >ParticipantID. If you know how to do that in SPSS, it won't be too > >hard to translate the logical statements into R syntax as above. I may > >have misunderstood the variable names, but I think the logic is clear. > > > >Jim > > > >On Sun, Jan 6, 2019 at 4:07 PM Rachel Thompson > > wrote: > >> > >> Hi Jim, > >> > >> Thank you for the clarification. Since I only work in SPSS and I am > >from Amsterdam I have had problems with specifying what I am trying to > >do in this specific program and also in clear English language. > >> > >> I think I want to indeed aggregate these events for each subject over > >the observation. But in this case several observations. > >> 1. I want to have a summary of how many times a specific subject got > >called (CallLogProbe) > >> 2. I want to have a summary of how many times a specific subject got > >a text message (SMS probe) > >> 3. I want to have a summary of how many times a specific subject > >> - Turned their screen
Re: [R] Mailinglist
Hi Michael Thanks, I'll check it out. Best, Rachel On Sun, Jan 6, 2019 at 11:45 AM Michael Dewey wrote: > Dear Rachel > > Not sure if this is going to help but if it is a csv file then > read.csv() is your friend. Read the help first in case you need to > specify what is being used for the decimal point and the separator as if > it is from the Netherlands they may not be the default settings. > > michael > > On 06/01/2019 16:37, Rachel Thompson wrote: > > Hi Jeff, > > > > Thanks for your email. > > I am an intern from Amsterdam and I have to do an analysis in R. I spoke > to > > my professor in Amsterdam and my supervisor's here in Boston. But they > are > > to busy to help. I informed them from the start that I am not familiar > with > > R(Rstudio) and they told me that I would receive guidance. So since they > > can not help me, I decided to share my problem online. > > (It is a CVS file imported into R) > > > > Please understand that I am new to this. I will unsubscribe to the > mailing > > list if my question does not belong here. > > > > Thanks, > > > > Rachel > > > > On Sun, Jan 6, 2019 at 11:01 AM Jeff Newmiller > > > wrote: > > > >> I would not want to leave the impression that I think the task at hand > is > >> merely tedious... my point is that there are numerous steps involved and > >> each step depends on information that has not been communicated to the > >> list, and there is a learning curve even in knowing what to include in > an > >> email question. What I do think is that knowing enough basic R syntax to > >> express small bits of the problem in R will be a vast improvement over > >> attempting to use only English descriptions, and Rachel has to bridge > that > >> initial gap. > >> > >> For example, some images of data were apparently sent to Jim only, yet > he > >> still does not know in what format the data file is stored, so that > >> technique was not very effective. One way for the question to become > more > >> focused is for Rachel to study up on her own how to import data and > provide > >> us with a "dput" (see the StackOverflow discussion I referenced before) > of > >> a small sample of data. Another is for Rachel to use basic R syntax to > >> create an anonymous data set from scratch (also outlined in the SO > >> discussion). These approaches allow us to keep the focus of our mailing > >> list discussion on manipulating the data into summaries. Another > approach > >> is to re-focus the question on importing data by supplying a download > link > >> to the data so we can make suggestions as to what R commands will handle > >> this data in its raw form. In any case, we cannot leapfrog over the > data to > >> the analysis as the question stands. > >> > >> Given the above, I have to wonder why Rachel hasn't simply used the tool > >> she is familiar with... SPSS... to do this? If it is because this is an > >> academic assignment to learn R then she should be talking to her > >> institutional support (instructor/teaching assistant/tutoring staff) > anyway > >> since there is a no-homework policy on this list (and that avenue would > >> have the benefit of being conducted orally and most likely in her native > >> language). > >> > >> > >> On January 6, 2019 1:12:46 AM PST, Jim Lemon > wrote: > >>> Hi Rachel, > >>> It looks to me as though the first thing you want to do is to get your > >>> data, which you attach as images, into a data frame. If these are flat > >>> files like CSV or TAB, you should be able to read them in with some > >>> variant of the read.table function. If Excel, look at the various > >>> Excel import packages. Then you can operate on the data frame by doing > >>> things like tabulating Participant ID against the code for SMS or call > >>> (which I assume are those 3000+ numbers). You can take the differences > >>> in what look like POSIX time values between successive TRUE and FALSE > >>> screen values to get the duration of screen activity and it looks like > >>> participant activity is recorded at regular intervals. As Jeff > >>> suggested, this is really just boring work figuring out how to extract > >>> the events: > >>> > >>> call_indices<-which(Probetype == xxCallLogProbe & ValueSpecified > >>> == _id & Valuedetailed ==3271
Re: [R] Mailinglist
Hi Rui, Thank you, I willl look into it. Best, Rachel On Sun, Jan 6, 2019 at 12:27 PM Rui Barradas wrote: > Hello, > > In many continental European countries, such as mine, the function to > use is > > read.csv2 > > It defaults to > > sep = ";", dec = "," > > Note that these functions are in fact calls to read.table with special > default arguments. Another default that changes is header = TRUE. > You might also want to set stringsAsFactors = FALSE since the default > value TRUE is a common source for errors. > > Hope this helps, > > Rui Barradas > > Às 16:45 de 06/01/2019, Michael Dewey escreveu: > > Dear Rachel > > > > Not sure if this is going to help but if it is a csv file then > > read.csv() is your friend. Read the help first in case you need to > > specify what is being used for the decimal point and the separator as if > > it is from the Netherlands they may not be the default settings. > > > > michael > > > > On 06/01/2019 16:37, Rachel Thompson wrote: > >> Hi Jeff, > >> > >> Thanks for your email. > >> I am an intern from Amsterdam and I have to do an analysis in R. I > >> spoke to > >> my professor in Amsterdam and my supervisor's here in Boston. But they > >> are > >> to busy to help. I informed them from the start that I am not familiar > >> with > >> R(Rstudio) and they told me that I would receive guidance. So since they > >> can not help me, I decided to share my problem online. > >> (It is a CVS file imported into R) > >> > >> Please understand that I am new to this. I will unsubscribe to the > >> mailing > >> list if my question does not belong here. > >> > >> Thanks, > >> > >> Rachel > >> > >> On Sun, Jan 6, 2019 at 11:01 AM Jeff Newmiller < > jdnew...@dcn.davis.ca.us> > >> wrote: > >> > >>> I would not want to leave the impression that I think the task at > >>> hand is > >>> merely tedious... my point is that there are numerous steps involved > and > >>> each step depends on information that has not been communicated to the > >>> list, and there is a learning curve even in knowing what to include > >>> in an > >>> email question. What I do think is that knowing enough basic R syntax > to > >>> express small bits of the problem in R will be a vast improvement over > >>> attempting to use only English descriptions, and Rachel has to bridge > >>> that > >>> initial gap. > >>> > >>> For example, some images of data were apparently sent to Jim only, > >>> yet he > >>> still does not know in what format the data file is stored, so that > >>> technique was not very effective. One way for the question to become > >>> more > >>> focused is for Rachel to study up on her own how to import data and > >>> provide > >>> us with a "dput" (see the StackOverflow discussion I referenced > >>> before) of > >>> a small sample of data. Another is for Rachel to use basic R syntax to > >>> create an anonymous data set from scratch (also outlined in the SO > >>> discussion). These approaches allow us to keep the focus of our mailing > >>> list discussion on manipulating the data into summaries. Another > >>> approach > >>> is to re-focus the question on importing data by supplying a download > >>> link > >>> to the data so we can make suggestions as to what R commands will > handle > >>> this data in its raw form. In any case, we cannot leapfrog over the > >>> data to > >>> the analysis as the question stands. > >>> > >>> Given the above, I have to wonder why Rachel hasn't simply used the > tool > >>> she is familiar with... SPSS... to do this? If it is because this is an > >>> academic assignment to learn R then she should be talking to her > >>> institutional support (instructor/teaching assistant/tutoring staff) > >>> anyway > >>> since there is a no-homework policy on this list (and that avenue would > >>> have the benefit of being conducted orally and most likely in her > native > >>> language). > >>> > >>> > >>> On January 6, 2019 1:12:46 AM PST, Jim Lemon > >>> wrote: > >>>> Hi Rachel, > >>>> It looks to me as though the first thi
Re: [R] Mailinglist
Hi Rich, I really feel lost at this point. I need a code that helps me count the phone activity level(high/low/none), the screen activity (on/off) and the amount calls and SMS of each subject. 1. I want to have a summary of how many times a specific subject got called (CallLogProbe) 2. I want to have a summary of how many times a specific subject got a text message (SMS probe) 3. I want to have a summary of how many times a specific subject - Turned their screen on - True (ScreenProbe) - Or did not turn their screen on - False (ScreenProbe) 4. I want to have a summary of the activity level of a specific subject - Activity level - none (ActivityProbe) - Activity level- low (ActivityProbe) - Activity level - High (ActivityProbe) I want to do this for all the 36 subjects(Participants). In the end, I have to define the percentages and cutoff points of what is considered low-medium-high, based on what the results of all the subjects are. So I am able to see if a specific subject has low social interaction etc. I have tried a lot, with the help of youtube etc. But I feel as if I am trying a lot of things but without clearly knowing if it is the right step. I have a csv file, but I need to look into what Jeff said about the guides. So I am able to share it. Best. On Sun, Jan 6, 2019 at 11:51 AM Rich Shepard wrote: > On Sun, 6 Jan 2019, Rachel Thompson wrote: > > > I am an intern from Amsterdam and I have to do an analysis in R. I spoke > > to my professor in Amsterdam and my supervisor's here in Boston. But they > > are to busy to help. I informed them from the start that I am not > familiar > > with R(Rstudio) and they told me that I would receive guidance. So since > > they can not help me, I decided to share my problem online. (It is a CVS > > file imported into R) > > Rachel, > >I find it interesting that you're put in such a difficult position. I've > not followed this thread from the start so my comments might be redundant > or > inappropriate. > >If you can, describe the problem. That is, what are you being asked to > find and what are the available data? This information helps us to guide > you > to learning the mechanics for accomplishing your task with R. > > Regards, > > Rich > > __ > R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see > https://stat.ethz.ch/mailman/listinfo/r-help > PLEASE do read the posting guide > http://www.R-project.org/posting-guide.html > and provide commented, minimal, self-contained, reproducible code. > [[alternative HTML version deleted]] __ R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see https://stat.ethz.ch/mailman/listinfo/r-help PLEASE do read the posting guide http://www.R-project.org/posting-guide.html and provide commented, minimal, self-contained, reproducible code.
Re: [R] Mailinglist
Hi Thank you for your help and suggestions! I have tried a few things and ask help from lots of people online! My problem is that I am not able to share the database! I tried to recreate one but I wasn't successful. So I found a way to analyze each subject individually, but I do not know how to perform the same steps for all of the subjects at once. But I just wanted to share what I did, since you tried to help me! This is what I did. I stored all the column names in a vector named "Names" names=c("participants","id","participantid","key","probetype","time","timespecific","value","valuespecified","valuedetailed","period","periodspecified") colnames(gmoji_passivedata)=names I used this code to find the number of participants in the dataset length(unique(gmoji_passivedata$participants)) The number of participants is 44 I used this code to find the unique ID for every participant library(plyr) > count(gmoji_passivedata,”participants") >From the dataset, I selected one participant ""U_..." I used subset data participant1=subset(gmoji_passivedata,participants=="U_0139cf62_e615_41f7_a4cc_878c0490c510") With the table code table(p1$probetype) I found the counts of all the different values of the probe type column edu.mit.media.funf.probe.builtin.ActivityProbe 16167 edu.mit.media.funf.probe.builtin.BluetoothProbe 405 edu.mit.media.funf.probe.builtin.CallLogProbe 1427 edu.mit.media.funf.probe.builtin.ScreenProbe 1791 edu.mit.media.funf.probe.builtin.WifiProbe 5386 The count for the call log probe for the selected participant is 1427 There was only one participant with sms probe for the rest of the participant the count of sms probe is 0 For the screen probe and activity probe I found the total count (1791 and 16167) For screen probe, I used a subset code and set the value detailed column to false and true screenon_false=subset(p1,valuedetailed=="False") (this participant 875) screenon_true=subset(p1,valuedetailed=="True") (this participant 916) and for activity probe to none, low and high to find the required values activity_none=subset(p1,valuedetailed=="none") (this participant 12900) activity_low=subset(p1,valuedetailed=="low") (this participant 1050) activity_high=subset(p1,valuedetailed=="high") (this participant 2217) I did this for each participant Best, Rachel On Sun, Jan 6, 2019 at 2:48 PM Richard M. Heiberger wrote: > Questions like this > 1. I want to have a summary of how many times a specific subject got called > (CallLogProbe) > > suggest that you should look at the table function. See > ?table > and run the examples. > They show how to get one-way frequency tables and two-way contingency > tables. > > If you have followup questions for the list, you can use the examples in > ?table as your starting point. > That way you don't need to worry about sharing your own data. > > > On Sun, Jan 6, 2019 at 1:59 PM Rachel Thompson < > rachel.thomp...@student.uva.nl> wrote: > >> Hi Rich, >> >> I really feel lost at this point. >> I need a code that helps me count the phone activity level(high/low/none), >> the screen activity (on/off) and the amount calls and SMS of each subject. >> >> 1. I want to have a summary of how many times a specific subject got >> called >> (CallLogProbe) >> 2. I want to have a summary of how many times a specific subject got a >> text >> message (SMS probe) >> 3. I want to have a summary of how many times a specific subject >> - Turned their screen on - True (ScreenProbe) >> - Or did not turn their screen on - False (ScreenProbe) >> 4. I want to have a summary of the activity level of a specific subject >> - Activity level - none (ActivityProbe) >> - Activity level- low (ActivityProbe) >> - Activity level - High (ActivityProbe) >> >> I want to do this for all the 36 subjects(Participants). >> In the end, I have to define the percentages and cutoff points of what is >> considered low-medium-high, based on what the results of all the subjects >> are. So I am able to see if a specific subject has low social interaction >> etc. >> >> I have tried a lot, with the help of youtube etc. But I fe
Re: [R] Mailinglist
Hi Thank you for your help and suggestions! I have tried a few things and ask help from lots of people online! My problem is that I am not able to share the database! I tried to recreate one but I wasn't successful. So I found a way to analyze each subject individually, but I do not know how to perform the same steps for all of the subjects at once. But I just wanted to share what I did, since you tried to help me! This is what I did. I stored all the column names in a vector named "Names" names=c("participants","id","participantid","key","probetype","time","timespecific","value","valuespecified","valuedetailed","period","periodspecified") colnames(gmoji_passivedata)=names I used this code to find the number of participants in the dataset length(unique(gmoji_passivedata$participants)) The number of participants is 44 I used this code to find the unique ID for every participant library(plyr) > count(gmoji_passivedata,”participants") >From the dataset, I selected one participant ""U_..." I used subset data participant1=subset(gmoji_passivedata,participants=="U_0139cf62_e615_41f7_a4cc_878c0490c510") With the table code table(p1$probetype) I found the counts of all the different values of the probe type column edu.mit.media.funf.probe.builtin.ActivityProbe 16167 edu.mit.media.funf.probe.builtin.BluetoothProbe 405 edu.mit.media.funf.probe.builtin.CallLogProbe 1427 edu.mit.media.funf.probe.builtin.ScreenProbe 1791 edu.mit.media.funf.probe.builtin.WifiProbe 5386 The count for the call log probe for the selected participant is 1427 There was only one participant with sms probe for the rest of the participant the count of sms probe is 0 For the screen probe and activity probe I found the total count (1791 and 16167) For screen probe, I used a subset code and set the value detailed column to false and true screenon_false=subset(p1,valuedetailed=="False") (this participant 875) screenon_true=subset(p1,valuedetailed=="True") (this participant 916) and for activity probe to none, low and high to find the required values activity_none=subset(p1,valuedetailed=="none") (this participant 12900) activity_low=subset(p1,valuedetailed=="low") (this participant 1050) activity_high=subset(p1,valuedetailed=="high") (this participant 2217) I did this for each participant Best, Rachel On Mon, Jan 7, 2019 at 1:28 AM K. Elo wrote: > Hi! > > Not having a data chunk prevents me from testing abit, but maybe you > should take a look on: > > ?table > ?xtabs > > to start with. > > But as already suggested by other users, a small data set would be of > great help :) > > HTH, > Kimmo > > su, 2019-01-06 kello 13:49 -0500, Rachel Thompson kirjoitti: > > Hi Rich, > > > > I really feel lost at this point. > > I need a code that helps me count the phone activity > > level(high/low/none), > > the screen activity (on/off) and the amount calls and SMS of each > > subject. > > > > 1. I want to have a summary of how many times a specific subject got > > called > > (CallLogProbe) > > 2. I want to have a summary of how many times a specific subject got > > a text > > message (SMS probe) > > 3. I want to have a summary of how many times a specific subject > > - Turned their screen on - True (ScreenProbe) > > - Or did not turn their screen on - False (ScreenProbe) > > 4. I want to have a summary of the activity level of a specific > > subject > > - Activity level - none (ActivityProbe) > > - Activity level- low (ActivityProbe) > > - Activity level - High (ActivityProbe) > > > > I want to do this for all the 36 subjects(Participants). > > In the end, I have to define the percentages and cutoff points of > > what is > > considered low-medium-high, based on what the results of all the > > subjects > > are. So I am able to see if a specific subject has low social > > interaction > > etc. > > > > I have tried a lot, with the help of youtube etc. But I feel as if I > > am > > trying a lot of things but without clearly knowing if it is the right > > step. > > I have a csv file, but I need to look into what Jeff said abou
Re: [R] Mailinglist
Hi Thank you for your help and suggestions! I have tried a few things and ask help from lots of people online! My problem is that I am not able to share the database! I tried to recreate one but I wasn't successful. So I found a way to analyze each subject individually, but I do not know how to perform the same steps for all of the subjects at once. But I just wanted to share what I did, since you tried to help me! This is what I did. I stored all the column names in a vector named "Names" names=c("participants","id","participantid","key","probetype","time","timespecific","value","valuespecified","valuedetailed","period","periodspecified") colnames(gmoji_passivedata)=names I used this code to find the number of participants in the dataset length(unique(gmoji_passivedata$participants)) The number of participants is 44 I used this code to find the unique ID for every participant library(plyr) > count(gmoji_passivedata,”participants") >From the dataset, I selected one participant ""U_..." I used subset data participant1=subset(gmoji_passivedata,participants=="U_0139cf62_e615_41f7_a4cc_878c0490c510") With the table code table(p1$probetype) I found the counts of all the different values of the probe type column edu.mit.media.funf.probe.builtin.ActivityProbe 16167 edu.mit.media.funf.probe.builtin.BluetoothProbe 405 edu.mit.media.funf.probe.builtin.CallLogProbe 1427 edu.mit.media.funf.probe.builtin.ScreenProbe 1791 edu.mit.media.funf.probe.builtin.WifiProbe 5386 The count for the call log probe for the selected participant is 1427 There was only one participant with sms probe for the rest of the participant the count of sms probe is 0 For the screen probe and activity probe I found the total count (1791 and 16167) For screen probe, I used a subset code and set the value detailed column to false and true screenon_false=subset(p1,valuedetailed=="False") (this participant 875) screenon_true=subset(p1,valuedetailed=="True") (this participant 916) and for activity probe to none, low and high to find the required values activity_none=subset(p1,valuedetailed=="none") (this participant 12900) activity_low=subset(p1,valuedetailed=="low") (this participant 1050) activity_high=subset(p1,valuedetailed=="high") (this participant 2217) I did this for each participant Best, Rachel On Mon, Jan 7, 2019 at 3:56 AM PIKAL Petr wrote: > Hi Rachel. > > You already have got several suggestions, but results depend on structure > of your data. The best way from your side would be just copy a part of your > data directly to email and preferable way is to use "dput". > > Assuming your data already transfered to R are called "mydata". > > You can just copy otput of > > dput(mydata[1:30,]) > > to your next mail. > > Cheers > Petr > > > > -Original Message- > > From: R-help On Behalf Of Rachel Thompson > > Sent: Sunday, January 6, 2019 7:49 PM > > To: Rich Shepard > > Cc: r-help mailing list > > Subject: Re: [R] Mailinglist > > > > Hi Rich, > > > > I really feel lost at this point. > > I need a code that helps me count the phone activity > level(high/low/none), > > the screen activity (on/off) and the amount calls and SMS of each > subject. > > > > 1. I want to have a summary of how many times a specific subject got > called > > (CallLogProbe) > > 2. I want to have a summary of how many times a specific subject got a > text > > message (SMS probe) > > 3. I want to have a summary of how many times a specific subject > > - Turned their screen on - True (ScreenProbe) > > - Or did not turn their screen on - False (ScreenProbe) > > 4. I want to have a summary of the activity level of a specific subject > > - Activity level - none (ActivityProbe) > > - Activity level- low (ActivityProbe) > > - Activity level - High (ActivityProbe) > > > > I want to do this for all the 36 subjects(Participants). > > In the end, I have to define the percentages and cutoff points of what is > > considered low-medium-high, based on what the results of all the subjects > > are. So I am able to see if a specific subject has low social
Re: [R] Mailinglist
Hi Thank you for your help and suggestions! I have tried a few things and ask help from lots of people online! My problem is that I am not able to share the database! I tried to recreate one but I wasn't successful. So I found a way to analyze each subject individually, but I do not know how to perform the same steps for all of the subjects at once. But I just wanted to share what I did, since you tried to help me! This is what I did. I stored all the column names in a vector named "Names" names=c("participants","id","participantid","key","probetype","time","timespecific","value","valuespecified","valuedetailed","period","periodspecified") colnames(gmoji_passivedata)=names I used this code to find the number of participants in the dataset length(unique(gmoji_passivedata$participants)) The number of participants is 44 I used this code to find the unique ID for every participant library(plyr) > count(gmoji_passivedata,”participants") >From the dataset, I selected one participant ""U_..." I used subset data participant1=subset(gmoji_passivedata,participants=="U_0139cf62_e615_41f7_a4cc_878c0490c510") With the table code table(p1$probetype) I found the counts of all the different values of the probe type column edu.mit.media.funf.probe.builtin.ActivityProbe 16167 edu.mit.media.funf.probe.builtin.BluetoothProbe 405 edu.mit.media.funf.probe.builtin.CallLogProbe 1427 edu.mit.media.funf.probe.builtin.ScreenProbe 1791 edu.mit.media.funf.probe.builtin.WifiProbe 5386 The count for the call log probe for the selected participant is 1427 There was only one participant with sms probe for the rest of the participant the count of sms probe is 0 For the screen probe and activity probe I found the total count (1791 and 16167) For screen probe, I used a subset code and set the value detailed column to false and true screenon_false=subset(p1,valuedetailed=="False") (this participant 875) screenon_true=subset(p1,valuedetailed=="True") (this participant 916) and for activity probe to none, low and high to find the required values activity_none=subset(p1,valuedetailed=="none") (this participant 12900) activity_low=subset(p1,valuedetailed=="low") (this participant 1050) activity_high=subset(p1,valuedetailed=="high") (this participant 2217) I did this for each participant Best, Rachel On Mon, Jan 7, 2019 at 4:07 AM Hasan Diwan wrote: > dput(sample(mydata, n=25)) is probably going to be more representative. -- > H > > On Mon, 7 Jan 2019 at 00:56, PIKAL Petr wrote: > > > Hi Rachel. > > > > You already have got several suggestions, but results depend on structure > > of your data. The best way from your side would be just copy a part of > your > > data directly to email and preferable way is to use "dput". > > > > Assuming your data already transfered to R are called "mydata". > > > > You can just copy otput of > > > > dput(mydata[1:30,]) > > > > to your next mail. > > > > Cheers > > Petr > > > > > > > -Original Message- > > > From: R-help On Behalf Of Rachel > Thompson > > > Sent: Sunday, January 6, 2019 7:49 PM > > > To: Rich Shepard > > > Cc: r-help mailing list > > > Subject: Re: [R] Mailinglist > > > > > > Hi Rich, > > > > > > I really feel lost at this point. > > > I need a code that helps me count the phone activity > > level(high/low/none), > > > the screen activity (on/off) and the amount calls and SMS of each > > subject. > > > > > > 1. I want to have a summary of how many times a specific subject got > > called > > > (CallLogProbe) > > > 2. I want to have a summary of how many times a specific subject got a > > text > > > message (SMS probe) > > > 3. I want to have a summary of how many times a specific subject > > > - Turned their screen on - True (ScreenProbe) > > > - Or did not turn their screen on - False (ScreenProbe) > > > 4. I want to have a summary of the activity level of a specific > subject > > > - Activity level - none (ActivityProbe) > > > - Activity level- l