Mark Payne created NIFI-13796:
---------------------------------

             Summary: Improve efficiency of FlowFile content slab allocation
                 Key: NIFI-13796
                 URL: https://issues.apache.org/jira/browse/NIFI-13796
             Project: Apache NiFi
          Issue Type: Improvement
            Reporter: Mark Payne
            Assignee: Mark Payne
             Fix For: 2.0.0-M5


When we write to the content repository, we write until we hit the 
configurable max appendable claim size. Then we move on to another resource 
claim in the content repository. However, this only happens between sessions, 
as the session holds open an OutputStream to a Content Claim for efficiency 
purposes.

This, however, has the downside of splitting FlowFiles across content claims 
less efficiently. As a result, we can write far beyond the 50 KB max when 
writing many FlowFiles in a single session (think UnpackContent, Split*, 
etc.).

We can improve this by checking the size of the content claim + offset when we 
finish writing to a content claim, and adding the claim back to our active 
claim queue only if that size is less than the max appendable size. This 
should improve NiFi's ability to clean up content more aggressively.
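
A minimal sketch of the proposed check, for illustration only. The class, 
field, and method names below (ClaimRequeueSketch, Claim, writableClaims, 
finishWrite) are hypothetical and are not NiFi's actual classes or API; the 
50 KB limit is taken from the description above and is configurable in 
practice.

```java
import java.util.ArrayDeque;
import java.util.Queue;

// Hypothetical illustration of the proposed re-queue check; not NiFi code.
class ClaimRequeueSketch {
    // Assumed limit, matching the 50 KB max mentioned in the description.
    static final long MAX_APPENDABLE_SIZE = 50L * 1024;

    // Hypothetical stand-in for a resource claim in the content repository.
    static final class Claim {
        final String id;
        long bytesWritten;

        Claim(String id) {
            this.id = id;
        }
    }

    // Claims that are still eligible to have more content appended.
    final Queue<Claim> writableClaims = new ArrayDeque<>();

    // Called when a writer finishes with a claim. The claim goes back on the
    // queue only if offset + length is still below the max appendable size.
    // Returns true if the claim was re-queued.
    boolean finishWrite(Claim claim, long offset, long length) {
        claim.bytesWritten = offset + length;
        if (claim.bytesWritten < MAX_APPENDABLE_SIZE) {
            writableClaims.add(claim);
            return true;
        }
        // At or over the limit: leave it off the queue so later writes start
        // a new claim and this one can be cleaned up once it is unreferenced.
        return false;
    }
}
```

Under these assumptions, a claim that ends at 10 KB is offered for reuse, 
while a claim that ends at or beyond 50 KB is retired from the queue.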



--
This message was sent by Atlassian Jira
(v8.20.10#820010)