Hi! How about trying this:

    data[ data$col1!=data$col2 & !is.na(data$col3), ]

      col1 col2  col3
    2    a    1 ST001
    3    b    2 ST002

HTH,
Kimmo

28.05.2014 15:35, jeff6868 wrote:
> Hi everybody,
>
> I have a little problem in my R code which seems to be easy to solve, but I
> haven't been able to find the solution by myself so far.
>
> Here's an example of the form of my data:
>
> data <-
> data.frame(col1=c("a","a","b","b"),col2=c(1,1,2,2),col3=c(NA,"ST001","ST002",NA))
>
> I would like to remove duplicated data based on the first two columns
> (col1,col2), but in both cases here, I would like to remove the duplicated
> row which is equal to NA in col3.
>
> Here's the data.frame I would like to obtain:
>
> data2 <- data.frame(col1=c("a","b"),col2=c(1,2),col3=c("ST001","ST002"))
>
> I've been trying to mix duplicated() with is.na() but it doesn't work yet.
>
> Can someone tell me the best and easiest way to do this?
>
> Thanks a lot!
>
> --
> View this message in context:
> http://r.789695.n4.nabble.com/remove-duplicated-row-according-to-NA-condition-tp4691362.html
> Sent from the R help mailing list archive at Nabble.com.
>
> ______________________________________________
> R-help@r-project.org mailing list
> https://stat.ethz.ch/mailman/listinfo/r-help
> PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
> and provide commented, minimal, self-contained, reproducible code.
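
On the example data the `data$col1!=data$col2` comparison is TRUE for every row, so the one-liner above effectively keeps every row where col3 is not NA. That gives the requested result here, but it would also drop a row with NA in col3 whose (col1, col2) pair is not duplicated at all. If that case matters, a minimal sketch along the lines the question hints at (combining duplicated() with is.na()) might look like the following. The names `pair` and `is_dup` are only illustrative, and `stringsAsFactors = FALSE` is added so that col3 stays a character column.

```r
# Example data from the original question
data <- data.frame(col1 = c("a", "a", "b", "b"),
                   col2 = c(1, 1, 2, 2),
                   col3 = c(NA, "ST001", "ST002", NA),
                   stringsAsFactors = FALSE)

# The two columns that define what counts as a duplicate
pair <- data[, c("col1", "col2")]

# TRUE for every row whose (col1, col2) pair occurs more than once;
# scanning from both ends flags the first occurrence as well as the later ones
is_dup <- duplicated(pair) | duplicated(pair, fromLast = TRUE)

# Drop a row only if it belongs to a duplicated pair AND col3 is NA
data2 <- data[!(is_dup & is.na(data$col3)), ]
data2
```

Note that if every row of a duplicated pair has NA in col3, this sketch removes all of them; whether one of those rows should be kept depends on what the data are supposed to look like.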