Re: Data Quality Library in Flink

2020-06-09 Thread aj
Thanks, Andrey, I will check it out.

On Mon, Jun 8, 2020 at 8:10 PM Andrey Zagrebin wrote:
> Hi Anuj,
>
> I am not familiar with data quality measurement methods and deequ in depth.
> What you describe looks like monitoring some data metrics.
> Maybe, there a

Re: Data Quality Library in Flink

2020-06-08 Thread Andrey Zagrebin
Hi Anuj, I am not familiar in depth with data quality measurement methods or deequ. What you describe looks like monitoring some data metrics. Maybe other community users are aware of a better solution. Meanwhile, I would recommend implementing the checks a

Data Quality Library in Flink

2020-06-05 Thread aj
Hello All, I want to do some data quality analysis on stream data, for example:
1. Fill rate in a particular column
2. How many events are going to the error queue because favor schema validation failed?
3. Different statistical measures of a column
4. Alert if a particular threshold is breached (like if f
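
For illustration only, the fill-rate and threshold-alert items in the list above could be sketched roughly as follows. This is plain Python rather than Flink code, and it is not taken from the thread: `FillRateTracker`, `check_window`, the `user_id` column, and the rule that `None` and the empty string count as "not filled" are all assumptions made for the example. In a real job the same counting logic would presumably live inside whatever windowed or keyed operator the pipeline already uses.

```python
from dataclasses import dataclass
from typing import Any, Iterable, Mapping, Optional


@dataclass
class FillRateTracker:
    """Running fill rate for one column (hypothetical helper, not a Flink API)."""

    column: str
    threshold: float  # alert when the fill rate drops below this value
    total: int = 0
    filled: int = 0

    def update(self, event: Mapping[str, Any]) -> None:
        # Count every event; count it as filled only when the column is
        # present and is neither None nor an empty string (an assumption).
        self.total += 1
        value = event.get(self.column)
        if value is not None and value != "":
            self.filled += 1

    def fill_rate(self) -> Optional[float]:
        # Undefined until at least one event has been seen.
        return self.filled / self.total if self.total else None

    def breached(self) -> bool:
        # True when a fill rate exists and is strictly below the threshold.
        rate = self.fill_rate()
        return rate is not None and rate < self.threshold


def check_window(
    events: Iterable[Mapping[str, Any]], column: str, threshold: float
) -> FillRateTracker:
    """Feed one batch (for example, one window) of events through a tracker."""
    tracker = FillRateTracker(column=column, threshold=threshold)
    for event in events:
        tracker.update(event)
    return tracker
```

Calling `check_window(events, "user_id", 0.9)` on one window's events and then checking `breached()` on the result gives a yes/no signal that an alerting step could act on. An empty window yields no fill rate and never alerts, which is a design choice of this sketch.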