[ https://issues.apache.org/jira/browse/KAFKA-3795?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15953759#comment-15953759 ]
Roger Hoover commented on KAFKA-3795: ------------------------------------- Happened again: http://confluent-systest.s3-website-us-west-2.amazonaws.com/confluent-kafka-system-test-results/?prefix=2017-04-03--001.1491220440--apache--trunk--bdf4cba/ {noformat} test_id: kafkatest.tests.core.upgrade_test.TestUpgrade.test_upgrade.from_kafka_version=0.9.0.1.to_message_format_version=None.security_protocol=SASL_SSL.compression_types=.none status: FAIL run time: 4 minutes 4.673 seconds 199680 acked message did not make it to the Consumer. They are: 538129, 538132, 538135, 538138, 538140, 538141, 538143, 538144, 538146, 538147, 538149, 538150, 538152, 538153, 538155, 538156, 538158, 538159, 538161, 538162...plus 199660 more. Total Acked: 331954, Total Consumed: 138002. The first 1000 missing messages were validated to ensure they are in Kafka's data files. 1000 were missing. This suggests data loss. Here are some of the messages not found in the data files: [538624, 538625, 538626, 538627, 538628, 538629, 538630, 538631, 538632, 538633] Traceback (most recent call last): File "/var/lib/jenkins/workspace/system-test-kafka/kafka/venv/local/lib/python2.7/site-packages/ducktape-0.6.0-py2.7.egg/ducktape/tests/runner_client.py", line 123, in run data = self.run_test() File "/var/lib/jenkins/workspace/system-test-kafka/kafka/venv/local/lib/python2.7/site-packages/ducktape-0.6.0-py2.7.egg/ducktape/tests/runner_client.py", line 176, in run_test return self.test_context.function(self.test) File "/var/lib/jenkins/workspace/system-test-kafka/kafka/venv/local/lib/python2.7/site-packages/ducktape-0.6.0-py2.7.egg/ducktape/mark/_mark.py", line 321, in wrapper return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs) File "/var/lib/jenkins/workspace/system-test-kafka/kafka/tests/kafkatest/tests/core/upgrade_test.py", line 125, in test_upgrade self.run_produce_consume_validate(core_test_action=lambda: self.perform_upgrade(from_kafka_version, File "/var/lib/jenkins/workspace/system-test-kafka/kafka/tests/kafkatest/tests/produce_consume_validate.py", line 118, in run_produce_consume_validate self.validate() File "/var/lib/jenkins/workspace/system-test-kafka/kafka/tests/kafkatest/tests/produce_consume_validate.py", line 188, in validate assert success, msg AssertionError: 199680 acked message did not make it to the Consumer. They are: 538129, 538132, 538135, 538138, 538140, 538141, 538143, 538144, 538146, 538147, 538149, 538150, 538152, 538153, 538155, 538156, 538158, 538159, 538161, 538162...plus 199660 more. Total Acked: 331954, Total Consumed: 138002. The first 1000 missing messages were validated to ensure they are in Kafka's data files. 1000 were missing. This suggests data loss. Here are some of the messages not found in the data files: [538624, 538625, 538626, 538627, 538628, 538629, 538630, 538631, 538632, 538633] {noformat} > Transient system test failure upgrade_test.TestUpgrade > ------------------------------------------------------ > > Key: KAFKA-3795 > URL: https://issues.apache.org/jira/browse/KAFKA-3795 > Project: Kafka > Issue Type: Bug > Components: system tests > Reporter: Jason Gustafson > Labels: reliability > > From a recent build running on the 0.10.0 branch: > {code} > test_id: > 2016-06-06--001.kafkatest.tests.core.upgrade_test.TestUpgrade.test_upgrade.from_kafka_version=0.9.0.1.to_message_format_version=0.9.0.1.compression_types=.snappy.new_consumer=True > status: FAIL > run time: 3 minutes 29.166 seconds > 3522 acked message did not make it to the Consumer. They are: 476524, > 476525, 476527, 476528, 476530, 476531, 476533, 476534, 476536, 476537, > 476539, 476540, 476542, 476543, 476545, 476546, 476548, 476549, 476551, > 476552, ...plus 3482 more. Total Acked: 110437, Total Consumed: 127470. The > first 1000 missing messages were validated to ensure they are in Kafka's data > files. 1000 were missing. This suggests data loss. Here are some of the > messages not found in the data files: [477184, 477185, 477187, 477188, > 477190, 477191, 477193, 477194, 477196, 477197] > Traceback (most recent call last): > File > "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/venv/local/lib/python2.7/site-packages/ducktape-0.5.1-py2.7.egg/ducktape/tests/runner.py", > line 106, in run_all_tests > data = self.run_single_test() > File > "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/venv/local/lib/python2.7/site-packages/ducktape-0.5.1-py2.7.egg/ducktape/tests/runner.py", > line 162, in run_single_test > return self.current_test_context.function(self.current_test) > File > "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/venv/local/lib/python2.7/site-packages/ducktape-0.5.1-py2.7.egg/ducktape/mark/_mark.py", > line 331, in wrapper > return functools.partial(f, *args, **kwargs)(*w_args, **w_kwargs) > File > "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/tests/kafkatest/tests/core/upgrade_test.py", > line 113, in test_upgrade > self.run_produce_consume_validate(core_test_action=lambda: > self.perform_upgrade(from_kafka_version, > File > "/var/lib/jenkins/workspace/system-test-kafka-0.10.0/kafka/tests/kafkatest/tests/produce_consume_validate.py", > line 79, in run_produce_consume_validate > raise e > AssertionError: 3522 acked message did not make it to the Consumer. They are: > 476524, 476525, 476527, 476528, 476530, 476531, 476533, 476534, 476536, > 476537, 476539, 476540, 476542, 476543, 476545, 476546, 476548, 476549, > 476551, 476552, ...plus 3482 more. Total Acked: 110437, Total Consumed: > 127470. The first 1000 missing messages were validated to ensure they are in > Kafka's data files. 1000 were missing. This suggests data loss. Here are some > of the messages not found in the data files: [477184, 477185, 477187, 477188, > 477190, 477191, 477193, 477194, 477196, 477197] > {code} > Here's a link to the test data: > http://testing.confluent.io/confluent-kafka-0-10-0-system-test-results/?prefix=2016-06-06--001.1465234069--apache--0.10.0--6500b53/ -- This message was sent by Atlassian JIRA (v6.3.15#6346)