It does not currently handle surrogate splits. You will need to preprocess your data to remove or fill in missing values. I'd recommend using the DataFrame API for that since it comes with a number of na methods. Joseph
On Thu, Mar 17, 2016 at 9:51 PM, Abir Chakraborty <abi...@247-inc.com> wrote: > Hello, > > > > Can MLLIB Decision Tree (DT) handle missing values by having surrogate > split (as it is currently being done in “rpart” library in R)? > > > > Thanks, > > Abir > ------------------------------ > > *Principal Data Scientist, Data Science Group, Innovation Labs* > > *[24]**7 **Inc. - *The Intuitive Consumer Experience Company™ *|* *We > make life simple for consumers to connect with companies to get things done* > > Mobile: +91-9880755850 *|* e-mail: abi...@247-inc.com > Prestige Tech Platina, Kadubeesanahalli, Marathahalli Outer Ring Road > *|* Bangalore 560087 *|* India *|* www.247-inc.com > > >