Nick Allen created METRON-1533:
----------------------------------

             Summary: Create KAFKA_FIND Stellar Function
                 Key: METRON-1533
                 URL: https://issues.apache.org/jira/browse/METRON-1533
             Project: Metron
          Issue Type: Improvement
            Reporter: Nick Allen
            Assignee: Nick Allen


When creating enrichments, I often find that I want to validate that the 
enrichment I just created was successful on the live, incoming stream of 
telemetry. My workflow looks something like this.

1. Create and test the enrichment that I want to create.
{code:java}
[Stellar]>>> ip_src_addr := "72.34.49.86"
72.34.49.86
[Stellar]>>> geo := GEO_GET(ip_src_addr)
{country=US, dmaCode=803, city=Los Angeles, postalCode=90014, latitude=34.0438, 
location_point=34.0438,-118.2512, locID=5368361, longitude=-118.2512}
{code}
2. That looks good to me. Now let's add that to my Bro telemetry.
{code:java}
[Stellar]>>> conf := SHELL_EDIT(conf)
{
  "enrichment" : {
    "fieldMap": {
      "stellar": {
        "config": [
           "geo := GEO_GET(ip_src_addr)"
        ]
      }
    }
  },
  "threatIntel": {
  }
}
[Stellar]>>> CONFIG_PUT("ENRICHMENTS", e, "bro")
{code}
 
3.  It looks like that worked, but did that really work?

At this point, I would run KAFKA_GET as many times as it takes to retrieve a 
Bro message. You would just have to get lucky and hope that the enrichment 
worked and secondly that you would pull down a Bro message (as opposed to a 
different sensor).

 

4. I would rather have a function that lets me only pull back the messages that 
I care about. In this case I could either retrieve only Bro messages.
{code:java}
KAFKA_FIND('indexing', m -> MAP_GET('source.type', m) == 'bro')
{code}
Or I could look for messages that contain geolocation data.
{code:java}
KAFKA_FIND('topic', m -> MAP_EXISTS('geo', m))
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to