ing?
Thanks,
From: Sean Owen [mailto:sro...@gmail.com]
Sent: Monday, November 9, 2020 9:58 AM
To: Du, Yi
Cc: user@spark.apache.org
Subject: Re: Ask about Pyspark ML interaction
CAUTION: External email.
I think you have this flipped around - you want to one-hot encode, then compute
interactions.
I think you have this flipped around - you want to one-hot encode, then
compute interactions. As it is you are treating the product of {0,1,2,3,4}
x {0,1,2,3,4} as if it's a categorical index. That doesn't have nearly 25
possible values and probably is not what you intend.
On Mon, Nov 9, 2020 at 7
Hi,
How are you doing?
Please first introduce myself to you. I am Yi Du, working in a mortgage
insurance company called ‘Arch Capital Group’ based in Washington DC office in
US. I find your profile under the repo Spark of Github and would like to ask
you one particular coding issue under Spark