Nice paper to read and cool usage of Kafka. Thanks for sharing Afonso :)
Guozhang On Mon, Mar 12, 2018 at 1:13 PM, Afonso Mukai <afonso.mu...@esss.se> wrote: > Hi Hannes, > > We will use Kafka here at the European Spallation Source (the facility is > currently under construction) to stream data from neutron detectors and > other experimental station equipment to consumers (the EPICS forwarding > software mentioned by Eric covers the sources other than detectors). From > Kafka the data follow two different paths: > > - For online reduction and visualisation, data will be consumed directly > from Kafka topics by the applications performing these tasks. > - A separate consumer (https://github.com/ess-dmsc/kafka-to-nexus) > subscribes to these data and writes them to HDF5 files, for subsequent > offline reduction and analysis. > > An overview of the system architecture is given in > http://accelconf.web.cern.ch/AccelConf/icalepcs2017/papers/tupha177.pdf > > Best regards, > Afonso > > > On 12/03/2018, 19:49, "Berryman, Eric" <berry...@frib.msu.edu> wrote: > > > The European Spallation Source [1] seems to be using it for this case > [2]. > > > I am also using this code [2], but only for visualization in another > "data center". > > > > [1] https://europeanspallationsource.se/ > > > [2] https://github.com/ess-dmsc/forward-epics-to-kafka > > > Thank you! > > Eric > > > > > ________________________________ > From: Hannes Petri <hannespe...@gmail.com> > Sent: Monday, March 12, 2018 1:37:29 PM > To: users@kafka.apache.org > Subject: Using Kafka to build a streaming platform for a research > facility? > > Hi, > > I work at a research facility where numerous hi-res detectors produce > thousands of GB of data every day. We want to build a highly flexible and > performant streaming platform for storing, transmitting and routing the > data. For example, detector output needs to end up: > > 1. In permanent storage systems > 2. In realtime or semi-realtime visualization software > 3. In post-processing and analysis software > 4. In metrics software > > ...and possibly more. Now I'm exploring Kafka as an option to back > such a platform. > > Would Kafka be a good fit? The reason I'm asking is because among the > use cases, I've mostly seen Kafka being used with more lightweight data, > geared towards business events and high frequency streams of text and > scalars. In other words, *more* but *smaller* messages. > > In my situation, we'd be looking at low frequency but huge files > (typically these detectors produce one large file at a time). In order not > to flood the storage, raw data topics would need to have a very short > retention time (hours to days). > > Does anybody have experience in using Kafka in a similar scenario? > What are your thoughts about the situation I describe? Would we benefit > from using Kafka? > > I highly appreciate any input on this. Many thanks in advance. > > Best regards > Hannes > > Sent from my iPhone > > > -- -- Guozhang