Fan Hong created FLINK-30734:
--------------------------------

             Summary: KBinsDiscretizer handles Double.NaN incorrectly
                 Key: FLINK-30734
                 URL: https://issues.apache.org/jira/browse/FLINK-30734
             Project: Flink
          Issue Type: Bug
          Components: Library / Machine Learning
            Reporter: Fan Hong


When the training data contains Double.NaN values and the strategy is set to 
"quantile", the generated model data has Double.NaN as the right edge of the 
largest bin.

My expected behavior is to ignore Double.NaN values when training, and to 
support skip/error/keep strategy when transforming with generated 
KBinsDiscretizerModel.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to