[
https://issues.apache.org/jira/browse/NIFI-4971?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Koji Kawamura updated NIFI-4971:
--------------------------------
Description:
For the simplest example, with GetFlowFile (GFF) -> PutFlowFile (PFF), where
GFF gets files and PFF saves those files into a different directory, the
following provenance events will be generated:
# GFF RECEIVE file1
# PFF SEND file2
From the above provenance events, the following entities and lineages should be
created in Atlas (labels in brackets are Atlas type names):
{code}
file1 (fs_path) -> GFF, PFF (nifi_flow_path) -> file2 (fs_path)
{code}
The entities shown in the above graph are created. However, the 'nifi_flow_path'
entity does not have inputs/outputs referencing 'fs_path', so the lineage cannot
be seen in the Atlas UI.
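The expected result can be sketched in a few lines of Python. This is only an illustration of the mapping described above, not the ReportLineageToAtlas implementation: the `Entity` class and the `build_flow_path` and `has_visible_lineage` helpers are hypothetical names, and the rule "RECEIVE becomes an input, SEND becomes an output" is an assumption drawn from the example in this description.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    """Hypothetical stand-in for an Atlas entity (not the Atlas client API)."""
    type_name: str
    qualified_name: str
    inputs: list = field(default_factory=list)
    outputs: list = field(default_factory=list)


def build_flow_path(name, events):
    """Build a 'nifi_flow_path' entity from (event_type, file_name) tuples.

    Assumption: RECEIVE events contribute inputs and SEND events contribute
    outputs, each referencing an 'fs_path' entity.
    """
    flow_path = Entity("nifi_flow_path", name)
    for event_type, file_name in events:
        ref = Entity("fs_path", file_name)
        if event_type == "RECEIVE":
            flow_path.inputs.append(ref)
        elif event_type == "SEND":
            flow_path.outputs.append(ref)
    return flow_path


def has_visible_lineage(flow_path):
    """A path with no inputs or no outputs has nothing to draw as lineage."""
    return bool(flow_path.inputs) and bool(flow_path.outputs)


events = [("RECEIVE", "file1"), ("SEND", "file2")]

# Expected state: file1 (fs_path) -> GFF, PFF (nifi_flow_path) -> file2 (fs_path)
expected = build_flow_path("GFF, PFF", events)

# State described in this issue: the entity exists but references nothing.
reported = Entity("nifi_flow_path", "GFF, PFF")
```

Here `has_visible_lineage(expected)` holds while `has_visible_lineage(reported)` does not, which corresponds to the symptom reported in this issue: the entities exist, but the flow path has no references to connect them.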
This issue was discovered by [~nayakmahesh616]
was:
For the simplest example, with GenerateFlowFile (GFF) -> PutFlowFile (PFF),
where GFF generates FlowFiles with unique ids and PFF creates local files, if
GFF runs 2 times, the following provenance events will be generated:
# GFF CREATE FF1
# PFF SEND file1
# PFF DROP FF1
# GFF CREATE FF2
# PFF SEND file2
# PFF DROP FF2
From the above provenance events, the following entities and lineages should be
created in Atlas (labels in brackets are Atlas type names):
{code}
GenerateFlowFile (nifi_data)
-> GFF, PFF (nifi_flow_path) -> file1 (fs_path)
-> GFF, PFF (nifi_flow_path) -> file2 (fs_path)
{code}
The entities shown in the above graph are created. However, those 'nifi_flow_path'
entities do not have inputs/outputs referencing 'nifi_data' or 'fs_path', and
the lineage cannot be seen in the Atlas UI.
This issue was discovered by [~nayakmahesh616]
> ReportLineageToAtlas 'complete path' strategy can miss one-time lineages
> ------------------------------------------------------------------------
>
> Key: NIFI-4971
> URL: https://issues.apache.org/jira/browse/NIFI-4971
> Project: Apache NiFi
> Issue Type: Bug
> Components: Extensions
> Affects Versions: 1.5.0
> Reporter: Koji Kawamura
> Assignee: Koji Kawamura
> Priority: Major