On Aug 13, 2012, at 4:07 AM, Sri krishna Devarayalu Balanagu wrote:

Thank you for the quick response.
But I want those duplicated with Ids in a separate vector like Duplicated.ids in the below example? Duplication should be checked for Publication and Reference combination, not on a single variable.

First case:
If you wanted just the ones that came _after_ the initial intances then this would apply:

Duplicated.ids<- df.key[duplicated(key), c("Id")]

The vector that comes back from duplicated will be the same length as the number of rows of df.key or of df for that matter. You could also have been able to skip the creation of key and just done this:

Duplicated.ids<- df[ duplicated( df[ , c("Publication", "Reference")] ), c("Id") ]

------------------
Second case:
If you wanted both the later instances _and_ the first instances, you could use this method offered by Bill Dunlap in these pages within the last week if memory serves.

Duplicated.ids<- df.key[ duplicated(key) | duplicated(key, fromLast=TRUE), c("Id")]

The second condition with an OR connector will also bring in the first instances.

?duplicated for further detail and worked examples
--
David.


Regards
Rayalu
-----Original Message-----
From: Jim Lemon [mailto:j...@bitwrit.com.au]
Sent: Monday, August 13, 2012 3:37 PM
To: Sri krishna Devarayalu Balanagu
Cc: r-help@r-project.org
Subject: Re: [R] How can I get the Ids with Duplicated key and corresponding Ids with original key?

On 08/13/2012 07:17 PM, Sri krishna Devarayalu Balanagu wrote:

In this following example Id 4 is duplicated with Id 1.
Like this I want both Ids (Duplicated and Duplicated with). Can anyone help?

df<- data.frame(
    "Publication" = c(1, 2, 3, 1, 4, 5, 2, 3),
    "Reference"   = c("a", "b", "c", "a", "d", "e", "b", "c"),
    "Id"= c(1, 2, 3, 4, 5, 6, 7, 8)
                 )

key<- paste(df$Publication, df$Reference, sep="_")
df.key<- cbind(key, df)
Duplicated.ids<- df.key[duplicated(df.key$key), c("Id")]

Hi Sri krishna Devarayalu Balanagu,
Does this do it?

cat("Id Publication(s)\n")
for(pub in unique(df$Publication))
 cat(pub,"-",df$Id[which(df$Publication==pub)],"\n")

Jim
________________________________
Notice: The information contained in this electronic mail message is intended only for the use of the designated recipient. This message is privileged and confidential. and the property of GVK BIO or its affiliates and subsidiaries. If the reader of this message is not the intended recipient or an agent responsible for delivering it to the intended recipient, you are hereby notified that you have received this message in error and that any review, dissemination, distribution, or copying of this message is strictly prohibited. If you have received this communication in error, please notify us immediately by telephone +91-40-66929999<tel:%2B91-40-66929999> and destroy any and all copies of this message in your possession (whether hard copies or electronically stored copies).

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

David Winsemius, MD
Alameda, CA, USA

______________________________________________
R-help@r-project.org mailing list
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to