Hi there, It’s my first voyage into stream processing - I’ve tried a few things but I think I’m struggling to think in the streams way. I wondered if I could be cheeky and ask if someone could give me some clues as to the correct design for my first task to get me started?
I have application events coming in like: <timestamp>,type:start,<user_id> <timestamp>,type:action,<user_id> <timestamp>,type:action,<user_id> <timestamp>,type:action,<user_id> <timestamp>,type:end,<user_id> each one represents a single user session. I need to output: <timestamp of start event>,<duration between start and end event>,<user_id>,<count_of_action_events> I’m working with event time (specified by the application) and I can’t trust the application to close sessions/notify gracefully (I’m happy for those to be thrown out, but cool ideas for alternatives are very welcome!). Any advice would be much appreciated. Chris Madge