As far as I know the null support was removed from the Table API because its support was consistently supported with all operations. See https://issues.apache.org/jira/browse/FLINK-2236
On Fri, Oct 23, 2015 at 7:18 PM, Shiti Saxena <ssaxena....@gmail.com> wrote: > For a similar problem where we wanted to preserve and track null entries, > we load the CSV as a DataSet[Array[Object]] and then transform it into > DataSet[Row] using a custom RowSerializer( > https://gist.github.com/Shiti/d0572c089cc08654019c) which handles null. > > The Table API(which supports null) can then be used on the resulting > DataSet[Row]. > > > On Fri, Oct 23, 2015 at 7:38 PM, Maximilian Michels <m...@apache.org> > wrote: > >> Hi Philip, >> >> How about making the empty field of type String? Then you can read the >> CSV into a DataSet and treat the empty string as a null value. Not very >> nice but a workaround. As of now, Flink deliberately doesn't support null >> values. >> >> Regards, >> Max >> >> >> On Thu, Oct 22, 2015 at 4:30 PM, Philip Lee <philjj...@gmail.com> wrote: >> >>> Hi, >>> >>> I am trying to load the dataset with the part of null value by using >>> readCsvFile(). >>> >>> // e.g _date|_click|_sales|_item|_web_page|_user >>> >>> case class WebClick(_click_date: Long, _click_time: Long, _sales: Int, >>> _item: Int,_page: Int, _user: Int) >>> >>> private def getWebClickDataSet(env: ExecutionEnvironment): >>> DataSet[WebClick] = { >>> >>> env.readCsvFile[WebClick]( >>> webClickPath, >>> fieldDelimiter = "|", >>> includedFields = Array(0, 1, 2, 3, 4, 5), >>> // lenient = true >>> ) >>> } >>> >>> >>> Well, I know there is an option to ignore malformed value, but I have to >>> read the dataset even though it has null value. >>> >>> as it follows, dataset (third column is null) looks like >>> 37794|24669||16705|23|54810 >>> but I have to read null value as well because I have to use filter or >>> where function ( _sales == null ) >>> >>> Is there any detail suggestion to do it? >>> >>> Thanks, >>> Philip >>> >>> >>> >>> >>> >>> >>> >>> -- >>> >>> ========================================================== >>> >>> *Hae Joon Lee* >>> >>> >>> Now, in Germany, >>> >>> M.S. Candidate, Interested in Distributed System, Iterative Processing >>> >>> Dept. of Computer Science, Informatik in German, TUB >>> >>> Technical University of Berlin >>> >>> >>> In Korea, >>> >>> M.S. Candidate, Computer Architecture Laboratory >>> >>> Dept. of Computer Science, KAIST >>> >>> >>> Rm# 4414 CS Dept. KAIST >>> >>> 373-1 Guseong-dong, Yuseong-gu, Daejon, South Korea (305-701) >>> >>> >>> Mobile) 49) 015-251-448-278 in Germany, no cellular in Korea >>> >>> ========================================================== >>> >> >> >