Got it. Thanks!
-- Ruslan Dautkhanov On Thu, Jun 18, 2015 at 1:02 PM, Xiangrui Meng <men...@gmail.com> wrote: > ChiSqSelector calls an RDD of labeled points, where the label is the > target. See > https://github.com/apache/spark/blob/master/mllib/src/main/scala/org/apache/spark/mllib/feature/ChiSqSelector.scala#L120 > > On Wed, Jun 17, 2015 at 10:22 PM, Ruslan Dautkhanov > <dautkha...@gmail.com> wrote: > > Thank you Xiangrui. > > > > Oracle's attribute importance mining function have a target variable. > > "Attribute importance is a supervised function that ranks attributes > > according to their significance in predicting a target." > > MLlib's ChiSqSelector does not have a target variable. > > > > > > > > > > -- > > Ruslan Dautkhanov > > > > On Wed, Jun 17, 2015 at 5:50 PM, Xiangrui Meng <men...@gmail.com> wrote: > >> > >> We don't have it in MLlib. The closest would be the ChiSqSelector, > >> which works for categorical data. -Xiangrui > >> > >> On Thu, Jun 11, 2015 at 4:33 PM, Ruslan Dautkhanov < > dautkha...@gmail.com> > >> wrote: > >> > What would be closest equivalent in MLLib to Oracle Data Miner's > >> > Attribute > >> > Importance mining function? > >> > > >> > > >> > > http://docs.oracle.com/cd/B28359_01/datamine.111/b28129/feature_extr.htm#i1005920 > >> > > >> > Attribute importance is a supervised function that ranks attributes > >> > according to their significance in predicting a target. > >> > > >> > > >> > Best regards, > >> > Ruslan Dautkhanov > > > > >