xinyuiscool commented on a change in pull request #950: SAMZA-2126: Bug fixes 
for batch-mode generated stream specs
URL: https://github.com/apache/samza/pull/950#discussion_r264769050
 
 

 ##########
 File path: samza-core/src/main/java/org/apache/samza/execution/StreamEdge.java
 ##########
 @@ -75,7 +75,9 @@ StreamSpec getStreamSpec() {
     StreamSpec spec = (partitions == PARTITIONS_UNKNOWN) ?
         streamSpec : streamSpec.copyWithPartitionCount(partitions);
 
-    if (isIntermediate) {
+    // Append unique id to the batch intermediate streams
+    // Check the physical stream name is already generated first
+    if (isIntermediate && spec.getId().equals(spec.getPhysicalName())) {
 
 Review comment:
   Good question. Usually the id and physical are the same for intermediate 
streams since they are generated. If the user overrides it for some reason, 
then we won't append the unique id, for either stream or batch cases. 
   
   This check is not very obvious given I couldn't find a way to tell that the 
physical name has been generated already due to the double planning problem in 
Samza. If we only invoke the planner once during submission, then we don't need 
this logic anymore :(.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to