Hi everyone. org.apache.spark.ml.feature.VectorAssembler currently cannot
handle null values. This presents a problem for us, as we wish to run a
decision tree classifier on data that is sometimes sparse. Is there a particular
reason VectorAssembler is implemented in this way, and can anyone recommend
the best path for enabling VectorAssembler to build vectors for data that
will contain empty values?

Thanks!

-Andres
