Hi,
I have a couple of questions about the OnlineKmeans algorithm. I am running
OnlineKmeans on a small dataset (36k records) with k=2 and testing varying
decay rates. The features are consumed in Flink from a Kafka topic.
Sample feature: [0., 5.0, 1.0, 0.0, 0.0, 0.0, 0.0]
Is the implementation
Hi, I have another question. Is the implementation of k-means in flink-ml
the same as Spark's StreamingKMeans?
Should the accuracy/results from the same dataset be comparable between the
two?
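
For context, here is a minimal sketch of the kind of decay-weighted centroid
update the question is about. It is plain Python, not the flink-ml or Spark
code. The function name, the argument names, and the choice to discount the
stored cluster weight by the decay factor are all assumptions made for
illustration. Whether either library does exactly this is the open question.

```python
def update_centroids(centroids, weights, batch, decay):
    """Apply one decayed mini-batch update (illustrative sketch only).

    centroids: list of k points (lists of floats)
    weights:   list of k floats, the accumulated weight of each cluster
    batch:     list of points in the current mini-batch
    decay:     1.0 keeps all history, 0.0 forgets everything before this batch
    """
    k = len(centroids)
    dim = len(centroids[0])
    sums = [[0.0] * dim for _ in range(k)]
    counts = [0] * k

    for x in batch:
        # Assign the point to the nearest centroid (squared Euclidean distance).
        nearest = min(
            range(k),
            key=lambda i: sum((c - v) ** 2 for c, v in zip(centroids[i], x)),
        )
        counts[nearest] += 1
        for d in range(dim):
            sums[nearest][d] += x[d]

    new_centroids, new_weights = [], []
    for i in range(k):
        old_weight = weights[i] * decay  # assumption: history is discounted
        total = old_weight + counts[i]
        if total == 0:
            # No history left and no new points: keep the centroid where it is.
            new_centroids.append(list(centroids[i]))
            new_weights.append(0.0)
            continue
        new_centroids.append(
            [(centroids[i][d] * old_weight + sums[i][d]) / total for d in range(dim)]
        )
        new_weights.append(total)
    return new_centroids, new_weights
```

In a scheme like this, the result depends on the initial centroids, the decay
rate, and the order and batching of the points. Two implementations that
differ in any of these can give different assignments on the same dataset.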
On Sun, Jun 5, 2022 at 8:14 PM Natia Chachkhiani <
natia.chachkhia...@gmail.com> wrote:
>
dataset and got inconsistent results too?
>>
>> Best regards,
>> Jing
>>
>> On Fri, Jun 3, 2022 at 4:29 AM Natia Chachkhiani <
>> natia.chachkhia...@gmail.com> wrote:
>>
>>> Hi,
>>>
>>> I am running OnlineKmeans from flink-ml re
Hi,
I am running OnlineKmeans from the flink-ml repo on a small dataset. I've
noticed that the cluster assignments are not consistent across different
runs. I have set both parallelism and globalBatchSize to 1.
I am doing a simple fit and transform on each data point ingested. Is the
order