Thanks, Peter, Eivind and Lui

Sorry, I could not explain it properly in the first go. Trying to simplify it 
here with an example - Say I have two dataframes as below that are not 
equally-sized data frames:

Table_A:
Email             Name                   Phone
a...@gmail.com<mailto:a...@gmail.com>   John Chan         0909
b...@yahoo.com<mailto:b...@yahoo.com>   Tim Ma                    89089
......

Table_B:
Email              Name                 Sex        Phone
a...@gmail.com<mailto:a...@gmail.com>    John Chan        M                 0909
k...@hotmail.com<mailto:k...@hotmail.com>           Rosy  M               F     
              7779
.....

Now, I have used -
merge (Table_A, Table_B, by="Email", all = FALSE))

- to find only the rows that match from these data frames.

Further, I am also interested (using "Email" as the common key) which rows from 
Table_A did not match with Table_B.
I am not sure how to do here.

 Thanks.


On 1 May 2018, at 9:35 pm, Chintanu 
<chint...@gmail.com<mailto:chint...@gmail.com>> wrote:


---------- Forwarded message ----------
From: peter dalgaard <pda...@gmail.com<mailto:pda...@gmail.com>>
Date: Tue, May 1, 2018 ar-h...@r-project.org<mailto:r-help@r-project.org>t 9:05 
PM
Subject: Re: [R] Merging dataframes
To: Rui Barradas <ruipbarra...@sapo.pt<mailto:ruipbarra...@sapo.pt>>
Cc: Chintanu <chint...@gmail.com<mailto:chint...@gmail.com>>, R help 
<r-help@r-project.org<mailto:r-help@r-project.org>>


I'd expect more like

setdiff(A$key, B$key)

and vice versa. Or, if you want the actual rows

A[!(A$key %in% B$key),]

or for the row numbers

which(!(A$key %in% B$key))


-pd




> On 1 May 2018, at 12:48 , Rui Barradas 
> <ruipbarra...@sapo.pt<mailto:ruipbarra...@sapo.pt>> wrote:
>
> Hello,
>
> Is it something like this that you want?
>
> x <- data.frame(a = c(1:3, 5, 5:10), b = c(1:7, 7, 9:10))
> y <- data.frame(a = 1:10, b = 1:10)
>
> which(x != y, arr.ind = TRUE)
>
>
> Hope this helps,
>
> Rui Barradas
>
> On 5/1/2018 11:35 AM, Chintanu wrote:
>> Hi,
>> May I please ask how I do the following in R. Sorry - this may be trivial,
>> but I am struggling here for this.
>> For two dataframes (A and B), I wish to identify (based on a primary
>> key-column present in both A & B) -
>> 1. Which records (rows) of A did not match with B, and
>> 2. Which records of B did not match with A ?
>> I came across a setdt function while browsing, but when I tried it, it says
>> - Could not find function "setdt".
>> Overall, if there is any way of doing it (preferably in some simplified
>> way), please advise.
>> Many thanks in advance.
>> regards,
>> Tito
>>      [[alternative HTML version deleted]]
>> ______________________________________________
>> R-help@r-project.org<mailto:R-help@r-project.org> mailing list -- To 
>> UNSUBSCRIBE and more, see
>> https://stat.ethz.ch/mailman/listinfo/r-help<https://urldefense.proofpoint.com/v2/url?u=https-3A__stat.ethz.ch_mailman_listinfo_r-2Dhelp&d=DwMFaQ&c=8acizdFhFtEZX1sSgfHPrQ&r=BxjHz6j-Vy7scfJA1zHGhv3tYe2rG8R184kyAJS45dM&m=jMuWrLr4CoKyPhQHA8AN6zWVm7gVs8LF6UCNstNRRAQ&s=lfIRPP8CRcCepiCqApPDf7wZsVTrG9O2Lt8rByESWFI&e=>
>> PLEASE do read the posting guide 
>> http://www.R-project.org/posting-guide.html<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.R-2Dproject.org_posting-2Dguide.html&d=DwMFaQ&c=8acizdFhFtEZX1sSgfHPrQ&r=BxjHz6j-Vy7scfJA1zHGhv3tYe2rG8R184kyAJS45dM&m=jMuWrLr4CoKyPhQHA8AN6zWVm7gVs8LF6UCNstNRRAQ&s=rW2b2LomxW9-0O0Tb34jnePsC_tX-3CpadlJWt9ikQc&e=>
>> and provide commented, minimal, self-contained, reproducible code.
>>
>
> ______________________________________________
> R-help@r-project.org<mailto:R-help@r-project.org> mailing list -- To 
> UNSUBSCRIBE and more, see
> https://stat.ethz.ch/mailman/listinfo/r-help<https://urldefense.proofpoint.com/v2/url?u=https-3A__stat.ethz.ch_mailman_listinfo_r-2Dhelp&d=DwMFaQ&c=8acizdFhFtEZX1sSgfHPrQ&r=BxjHz6j-Vy7scfJA1zHGhv3tYe2rG8R184kyAJS45dM&m=jMuWrLr4CoKyPhQHA8AN6zWVm7gVs8LF6UCNstNRRAQ&s=lfIRPP8CRcCepiCqApPDf7wZsVTrG9O2Lt8rByESWFI&e=>
> PLEASE do read the posting guide 
> http://www.R-project.org/posting-guide.html<https://urldefense.proofpoint.com/v2/url?u=http-3A__www.R-2Dproject.org_posting-2Dguide.html&d=DwMFaQ&c=8acizdFhFtEZX1sSgfHPrQ&r=BxjHz6j-Vy7scfJA1zHGhv3tYe2rG8R184kyAJS45dM&m=jMuWrLr4CoKyPhQHA8AN6zWVm7gVs8LF6UCNstNRRAQ&s=rW2b2LomxW9-0O0Tb34jnePsC_tX-3CpadlJWt9ikQc&e=>
> and provide commented, minimal, self-contained, reproducible code.

--
Peter Dalgaard, Professor,
Center for Statistics, Copenhagen Business School
Solbjerg Plads 3, 2000 Frederiksberg, Denmark
Phone: (+45)38153501
Office: A 4.23
Email: pd....@cbs.dk<mailto:pd....@cbs.dk>  Priv: 
pda...@gmail.com<mailto:pda...@gmail.com>










IMPORTANT NOTICE: The information in this email (and any attachments) is 
confidential. If you are not the intended recipient, you must not use or 
disseminate the information. If you have received this email in error, please 
immediately notify me by "Reply" command and permanently delete the original 
and any copies or printouts thereof. Although this email and any attachments 
are believed to be free of any virus or other defect that might affect any 
computer system into which it is received and opened, it is the responsibility 
of the recipient to ensure that it is virus free and no responsibility is 
accepted by AIA Group Limited or its subsidiaries or affiliates either jointly 
or severally, for any loss or damage arising in any way from its use.

        [[alternative HTML version deleted]]

______________________________________________
R-help@r-project.org mailing list -- To UNSUBSCRIBE and more, see
https://stat.ethz.ch/mailman/listinfo/r-help
PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
and provide commented, minimal, self-contained, reproducible code.

Reply via email to