[ 
https://issues.apache.org/jira/browse/NIFI-6710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pierre Villard resolved NIFI-6710.
----------------------------------
    Resolution: Feedback Received

Apache NiFi 1.x is no longer maintained and no new release is planned on the 
1.x release line. Marking as resolved as part of a cleanup operation. Please 
open a new one with an updated description if this is still relevant for NiFi 
2.x.

> Large queues get stuck with loss of data
> ----------------------------------------
>
>                 Key: NIFI-6710
>                 URL: https://issues.apache.org/jira/browse/NIFI-6710
>             Project: Apache NiFi
>          Issue Type: Bug
>    Affects Versions: 1.9.2
>         Environment: Windows 10, Windows Server, Single nifi node
>            Reporter: Evert Bevernage
>            Priority: Major
>         Attachments: 5 mins after processor stops taking in files.PNG, after 
> emptyqueue attempt 1.PNG, after emptyqueue attempt 2.png, 
> after-restart-nifi.PNG, failing-queue.xml, nifi-app.log, putfile-destination 
> with missing files only 20k of 50kplus.PNG, when processor stops taking in 
> files.PNG
>
>
> Hi,
> We've observed the following behavior repeatedly:
> When a queue is filled with a large number of smaller flowfiles (more than 
> 10000) the processor taking files from the queue will stop taking in new 
> files at a certain point (not a fixed number) and ignore the remaining files 
> that are in the queue. There is no way to recover the stuck files in nifi.
> Right now we are avoiding a buildup of large queue's but they do happen from 
> time to time 
> (especially if a information source recovers from connection issues)
> As i can now consistently reproduce the issue, please advise if additional 
> testing or information is required.
> To reproduce (on a windows system): 
>  # Use the attached template (failing-queue.xml) 
>  # Start the Putfile processor
>  # Start the GenerateFile processor to rapidly fill the queue to the maximum
>  # Stop the GenerateFile processor  when the queue is applying back-pressure
>  # PutFile will start writing files to disk and stop after a variable number 
> of files.
>  # Observe that only a fraction of the generated files is written to disk
>  # Observe that emptying the queue has not the expected effect
>  # Observe that a reboot will clear the queue.
> Observations:
>  * Waiting for recovery doesn't work. The queue remains filled with files
>  * Emptying the queue generally doesn't work. Most of the time I receive the 
> message that the queue is empty or sometimes only a fraction of the files is 
> removed
>  * Rebooting nifi will consistently clear the queue. The flowfiles will 
> however be lost. (Also when not emptying the queue).
>  * I have had this issue with files that were processed by processors not 
> directly reading or writing from disk (e.g. i don't think it is putfile  or 
> list-file/fetchfile or getfile related)
>  * I'm not 100% sure, but I think that in all observations the problem occurs 
> when there are 10000+ files in the queue
> I've attched the templatefile and screenshots of the operation in nifi and 
> the app.log ,
> For the interpretation of app.log
>  * GenerateFlowFile id: 016d1000-1a9e-12b8-4bb4-cb34ad6229fe
>  * Queue id: 016d1000-202e-12b9-9fa6-4b38cadf8463
>  * PutFile id: 016d1000-408d-12b8-15d9-fea8289e0017
> What i've found in the log
>  * I've started the experiment with generate flowfile 25/9/2019 at 9:24:34 
> (line 1073)
>  * stopped the generate flowfile on line 1079
>  * dropped 5000 flowfiles on line 1115&1116
>  * tried to drop the remaining files on line 1121 & 1123
>  * relaunched nifi on line 1138
>  * line 2152 and 2153 unknown files in the content_repository
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to