==================== Test output for //tensorflow/python/ops/ragged:ragged_cross_op_test: Running tests under Python 3.11.3: /usr/local/bin/python3 [ RUN ] RaggedCrossOpTest.testRaggedCrossBatchSizeZero INFO:tensorflow:Running testRaggedCrossBatchSizeZero in GRAPH mode. I0423 21:42:08.517747 281473654158208 test_util.py:1492] Running testRaggedCrossBatchSizeZero in GRAPH mode. WARNING:tensorflow:From /usr/lib/python3.11/contextlib.py:105: TensorFlowTestCase.test_session (from tensorflow.python.framework.test_util) is deprecated and will be removed in a future version. Instructions for updating: Use `self.session()` or `self.cached_session()` instead. W0423 21:42:08.518204 281473654158208 deprecation.py:364] From /usr/lib/python3.11/contextlib.py:105: TensorFlowTestCase.test_session (from tensorflow.python.framework.test_util) is deprecated and will be removed in a future version. Instructions for updating: Use `self.session()` or `self.cached_session()` instead. 2023-04-23 21:42:08.579294: I tensorflow/compiler/mlir/mlir_graph_optimization_pass.cc:375] MLIR V1 optimization pass is not enabled INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossBatchSizeZero): 0.55s I0423 21:42:09.065886 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossBatchSizeZero): 0.55s INFO:tensorflow:Running testRaggedCrossBatchSizeZero in EAGER mode. I0423 21:42:09.069971 281473654158208 test_util.py:1501] Running testRaggedCrossBatchSizeZero in EAGER mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossBatchSizeZero): 0.1s I0423 21:42:09.174100 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossBatchSizeZero): 0.1s [ OK ] RaggedCrossOpTest.testRaggedCrossBatchSizeZero [ RUN ] RaggedCrossOpTest.testRaggedCrossFiveInputs INFO:tensorflow:Running testRaggedCrossFiveInputs in GRAPH mode. I0423 21:42:09.177110 281473654158208 test_util.py:1492] Running testRaggedCrossFiveInputs in GRAPH mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossFiveInputs): 0.03s I0423 21:42:09.211381 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossFiveInputs): 0.03s INFO:tensorflow:Running testRaggedCrossFiveInputs in EAGER mode. I0423 21:42:09.212517 281473654158208 test_util.py:1501] Running testRaggedCrossFiveInputs in EAGER mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossFiveInputs): 0.02s I0423 21:42:09.229216 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossFiveInputs): 0.02s [ OK ] RaggedCrossOpTest.testRaggedCrossFiveInputs [ RUN ] RaggedCrossOpTest.testRaggedCrossHashed100BucketsCustomKey INFO:tensorflow:Running testRaggedCrossHashed100BucketsCustomKey in GRAPH mode. I0423 21:42:09.230297 281473654158208 test_util.py:1492] Running testRaggedCrossHashed100BucketsCustomKey in GRAPH mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashed100BucketsCustomKey): 0.32s I0423 21:42:09.549310 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashed100BucketsCustomKey): 0.32s INFO:tensorflow:Running testRaggedCrossHashed100BucketsCustomKey in EAGER mode. I0423 21:42:09.552784 281473654158208 test_util.py:1501] Running testRaggedCrossHashed100BucketsCustomKey in EAGER mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashed100BucketsCustomKey): 0.1s I0423 21:42:09.654085 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashed100BucketsCustomKey): 0.1s [ OK ] RaggedCrossOpTest.testRaggedCrossHashed100BucketsCustomKey [ RUN ] RaggedCrossOpTest.testRaggedCrossHashed100BucketsDefaultKey INFO:tensorflow:Running testRaggedCrossHashed100BucketsDefaultKey in GRAPH mode. I0423 21:42:09.656953 281473654158208 test_util.py:1492] Running testRaggedCrossHashed100BucketsDefaultKey in GRAPH mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashed100BucketsDefaultKey): 0.49s I0423 21:42:10.147485 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashed100BucketsDefaultKey): 0.49s INFO:tensorflow:Running testRaggedCrossHashed100BucketsDefaultKey in EAGER mode. I0423 21:42:10.151045 281473654158208 test_util.py:1501] Running testRaggedCrossHashed100BucketsDefaultKey in EAGER mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashed100BucketsDefaultKey): 0.13s I0423 21:42:10.282491 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashed100BucketsDefaultKey): 0.13s [ OK ] RaggedCrossOpTest.testRaggedCrossHashed100BucketsDefaultKey [ RUN ] RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsCustomKey INFO:tensorflow:Running testRaggedCrossHashedZeroBucketsCustomKey in GRAPH mode. I0423 21:42:10.285415 281473654158208 test_util.py:1492] Running testRaggedCrossHashedZeroBucketsCustomKey in GRAPH mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsCustomKey): 0.71s I0423 21:42:11.000110 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsCustomKey): 0.71s INFO:tensorflow:Running testRaggedCrossHashedZeroBucketsCustomKey in EAGER mode. I0423 21:42:11.003663 281473654158208 test_util.py:1501] Running testRaggedCrossHashedZeroBucketsCustomKey in EAGER mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsCustomKey): 0.09s I0423 21:42:11.096224 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsCustomKey): 0.09s [ OK ] RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsCustomKey [ RUN ] RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsDefaultKey INFO:tensorflow:Running testRaggedCrossHashedZeroBucketsDefaultKey in GRAPH mode. I0423 21:42:11.099225 281473654158208 test_util.py:1492] Running testRaggedCrossHashedZeroBucketsDefaultKey in GRAPH mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsDefaultKey): 0.35s I0423 21:42:11.445467 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsDefaultKey): 0.35s INFO:tensorflow:Running testRaggedCrossHashedZeroBucketsDefaultKey in EAGER mode. I0423 21:42:11.449254 281473654158208 test_util.py:1501] Running testRaggedCrossHashedZeroBucketsDefaultKey in EAGER mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsDefaultKey): 0.11s I0423 21:42:11.558201 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsDefaultKey): 0.11s [ OK ] RaggedCrossOpTest.testRaggedCrossHashedZeroBucketsDefaultKey [ RUN ] RaggedCrossOpTest.testRaggedCrossHashedZeroKey INFO:tensorflow:Running testRaggedCrossHashedZeroKey in GRAPH mode. I0423 21:42:11.561391 281473654158208 test_util.py:1492] Running testRaggedCrossHashedZeroKey in GRAPH mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroKey): 0.02s I0423 21:42:11.583793 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroKey): 0.02s INFO:tensorflow:Running testRaggedCrossHashedZeroKey in EAGER mode. I0423 21:42:11.584641 281473654158208 test_util.py:1501] Running testRaggedCrossHashedZeroKey in EAGER mode. INFO:tensorflow:time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroKey): 0.01s I0423 21:42:11.595339 281473654158208 test_util.py:2462] time(__main__.RaggedCrossOpTest.testRaggedCrossHashedZeroKey): 0.01s [ OK ] RaggedCrossOpTest.testRaggedCrossHashedZeroKey [ RUN ] RaggedCrossOpTest.testRaggedCrossInvalidValue INFO:tensorflow:Running testRaggedCrossInvalidValue in GRAPH mode. I0423 21:42:11.596214 281473654158208 test_util.py:1492] Running testRaggedCrossInvalidValue in GRAPH mode. Fatal Python error: Segmentation fault Thread 0x0000ffffb12b7380 (most recent call first): File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/client/session.py", line 1455 in _call_tf_sessionrun File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/client/session.py", line 1362 in _run_fn File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/client/session.py", line 1379 in _do_call File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/client/session.py", line 1372 in _do_run File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/client/session.py", line 1192 in _run File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/client/session.py", line 969 in run File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/framework/test_util.py", line 2059 in run File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/framework/test_util.py", line 2691 in evaluate File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/ops/ragged/ragged_cross_op_test.py", line 478 in testRaggedCrossInvalidValue File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/framework/test_util.py", line 1496 in decorated File "/usr/lib/python3.11/unittest/case.py", line 579 in _callTestMethod File "/usr/lib/python3.11/unittest/case.py", line 623 in run File "/usr/lib/python3.11/unittest/case.py", line 678 in __call__ File "/usr/lib/python3.11/unittest/suite.py", line 122 in run File "/usr/lib/python3.11/unittest/suite.py", line 84 in __call__ File "/usr/lib/python3.11/unittest/suite.py", line 122 in run File "/usr/lib/python3.11/unittest/suite.py", line 84 in __call__ File "/usr/lib/python3.11/unittest/runner.py", line 217 in run File "/usr/lib/python3.11/unittest/main.py", line 274 in runTests File "/usr/lib/python3.11/unittest/main.py", line 102 in __init__ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/absl_py/absl/testing/absltest.py", line 2537 in _run_and_get_tests_result File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/absl_py/absl/testing/absltest.py", line 2568 in run_tests File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/absl_py/absl/testing/absltest.py", line 2156 in _run_in_app File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/absl_py/absl/testing/absltest.py", line 2049 in main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/platform/googletest.py", line 51 in g_main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/absl_py/absl/app.py", line 258 in _run_main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/absl_py/absl/app.py", line 312 in run File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/platform/googletest.py", line 60 in main_wrapper File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/platform/benchmark.py", line 489 in benchmarks_main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/platform/googletest.py", line 62 in main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/ops/ragged/ragged_cross_op_test.runfiles/org_tensorflow/tensorflow/python/ops/ragged/ragged_cross_op_test.py", line 497 in Extension modules: numpy.core._multiarray_umath, numpy.core._multiarray_tests, numpy.linalg._umath_linalg, numpy.fft._pocketfft_internal, numpy.random._common, numpy.random.bit_generator, numpy.random._bounded_integers, numpy.random._mt19937, numpy.random.mtrand, numpy.random._philox, numpy.random._pcg64, numpy.random._sfc64, numpy.random._generator, google._upb._message, tensorflow.python.framework.fast_tensor_util (total: 15) ================================================================================ ==================== Test output for //tensorflow/python/distribute/failure_handling:gce_failure_handler_test (shard 7 of 8): Running tests under Python 3.11.3: /usr/local/bin/python3 [ RUN ] GceFailureHandlingTest.test_basic_run_test_inputarg_manager_strategyoption_MWMSmultiworker INFO:tensorflow:Using local port 18383 I0423 21:41:47.202462 281473453224832 test_util.py:3794] Using local port 18383 INFO:tensorflow:Using local port 16009 I0423 21:41:47.203077 281473453224832 test_util.py:3794] Using local port 16009 INFO:tensorflow:Using local port 20537 I0423 21:41:47.203456 281473453224832 test_util.py:3794] Using local port 20537 INFO:tensorflow:Using local port 19329 I0423 21:41:47.203819 281473453224832 test_util.py:3794] Using local port 19329 INFO:tensorflow:Cluster starting. I0423 21:41:50.409955 281473453224832 gce_failure_handler_test.py:317] Cluster starting. [worker-0]: I0423 21:41:50.505328 281473829860224 multi_process_runner.py:840] Subprocess with PID 2258449 (worker, 0) is now being started. [worker-0]: I0423 21:41:50.506008 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:18383", "localhost:16009", "localhost:20537", "localhost:19329"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-1]: I0423 21:41:50.550377 281473829860224 multi_process_runner.py:840] Subprocess with PID 2258452 (worker, 1) is now being started. [worker-2]: I0423 21:41:50.568581 281473829860224 multi_process_runner.py:840] Subprocess with PID 2258459 (worker, 2) is now being started. [worker-1]: I0423 21:41:50.551054 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:18383", "localhost:16009", "localhost:20537", "localhost:19329"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:41:50.575812 281473829860224 multi_process_runner.py:840] Subprocess with PID 2258470 (worker, 3) is now being started. [worker-2]: I0423 21:41:50.569259 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:18383", "localhost:16009", "localhost:20537", "localhost:19329"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:41:50.576561 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:18383", "localhost:16009", "localhost:20537", "localhost:19329"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-1]: 2023-04-23 21:41:50.665093: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16009 [worker-0]: 2023-04-23 21:41:50.669422: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:18383 [worker-2]: 2023-04-23 21:41:50.687350: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:20537 [worker-0]: 2023-04-23 21:41:50.712677: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 3606541172777685901 [worker-0]: 2023-04-23 21:41:50.712973: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:41:50.722627: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 15452963622433181017 [worker-2]: 2023-04-23 21:41:50.722880: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:41:50.723773: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 4705457417492514962 [worker-1]: 2023-04-23 21:41:50.723937: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: 2023-04-23 21:41:50.866303: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:19329 [worker-0]: 2023-04-23 21:41:50.906330: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 17808445111674360431 [worker-3]: 2023-04-23 21:41:50.906972: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-2]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: I0423 21:41:50.924908 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-3]: I0423 21:41:50.927525 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-0]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-0]: I0423 21:41:50.946079 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: I0423 21:41:50.936941 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: I0423 21:41:50.995895 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: INFO:tensorflow:Check health not enabled. [worker-1]: I0423 21:41:50.996488 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-1]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-1]: I0423 21:41:50.996708 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-2]: I0423 21:41:51.032025 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-2]: INFO:tensorflow:Check health not enabled. [worker-2]: I0423 21:41:51.033242 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-2]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: I0423 21:41:51.033902 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: I0423 21:41:51.086248 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: INFO:tensorflow:Check health not enabled. [worker-3]: I0423 21:41:51.086851 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:41:51.087076 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-0]: I0423 21:41:51.090733 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-0]: INFO:tensorflow:Check health not enabled. [worker-0]: I0423 21:41:51.091292 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-0]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:41:51.091516 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: I0423 21:41:51.200896 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: INFO:tensorflow:Start polling for termination signal. [worker-2]: I0423 21:41:51.201824 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-2]: I0423 21:41:51.202271 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-2]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: W0423 21:41:51.202542 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: INFO:tensorflow:Start training at 0 [worker-2]: I0423 21:41:51.202701 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: I0423 21:41:51.216752 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-0]: INFO:tensorflow:Start polling for termination signal. [worker-0]: I0423 21:41:51.217573 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-0]: I0423 21:41:51.218009 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Instructions for updating: [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: W0423 21:41:51.218300 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Instructions for updating: [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: INFO:tensorflow:Start training at 0 [worker-0]: I0423 21:41:51.218458 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: INFO:tensorflow:Start watcher for peer's signal. [worker-1]: I0423 21:41:51.246678 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-3]: INFO:tensorflow:Start watcher for peer's signal. [worker-3]: I0423 21:41:51.247128 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-3]: INFO:tensorflow:Start polling for termination signal. [worker-3]: I0423 21:41:51.259576 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-3]: I0423 21:41:51.276526 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-3]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: W0423 21:41:51.276978 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: INFO:tensorflow:Start training at 0 [worker-3]: I0423 21:41:51.277143 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: INFO:tensorflow:Start polling for termination signal. [worker-1]: I0423 21:41:51.296584 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: I0423 21:41:51.336229 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: W0423 21:41:51.336695 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: INFO:tensorflow:Start training at 0 [worker-1]: I0423 21:41:51.336864 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:51.365371 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:51.494427 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:51.499981 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:51.528941 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:51.616909 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:51.630912 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:51.630517 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:51.659452 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:51.769271 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:51.801933 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:51.787174 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:51.812986 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:51.930386 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:51.954666 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:51.968477 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:51.971022 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:52.082738 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:52.085180 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:52.120962 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:52.140901 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830ab60> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb8312a20> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:41:52.255419 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb8312a20> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb8306ca0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: W0423 21:41:52.256446 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb8306ca0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:41:52.256752 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:41:52.247529 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830ab60> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:52.261067 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:52.281108 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:52.263879 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:52.300752 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Termination notice available. [worker-2]: I0423 21:41:52.327220 281449642848736 gce_failure_handler_test.py:142] Termination notice available. [worker-2]: INFO:tensorflow:Member 2 has received termination notice. [worker-2]: I0423 21:41:52.336813 281449642848736 failure_handling.py:710] Member 2 has received termination notice. [worker-3]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830b4c0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:41:52.356436 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830b4c0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:epoch 0 finished [worker-3]: I0423 21:41:52.356848 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:41:52.357972 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb83134c0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb83077e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:epoch 0 finished [worker-0]: W0423 21:41:52.359565 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb83077e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: INFO:tensorflow:epoch 0 finished [worker-0]: I0423 21:41:52.359947 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: I0423 21:41:52.358357 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-1]: W0423 21:41:52.365301 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb83134c0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:Termination caught in main thread on preempted worker [worker-1]: INFO:tensorflow:epoch 0 finished [worker-1]: I0423 21:41:52.365797 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler: RECEIVED_SIGNAL_RUN_TO_CHECKPOINT_1 set, preemption awareness acknowledged [worker-2]: I0423 21:41:52.358600 281473829860224 failure_handling.py:1159] Termination caught in main thread on preempted worker [worker-2]: INFO:tensorflow:RUN_TO_CHECKPOINT set to 7 [worker-2]: I0423 21:41:52.366512 281473829860224 failure_handling.py:1168] RUN_TO_CHECKPOINT set to 7 [worker-1]: I0423 21:41:52.369234 281447579382240 failure_handling.py:1242] PreemptionCheckpointHandler: RECEIVED_SIGNAL_RUN_TO_CHECKPOINT_1 set, preemption awareness acknowledged [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler: RECEIVED_SIGNAL_RUN_TO_CHECKPOINT_2 set, preemption awareness acknowledged [worker-2]: I0423 21:41:52.369306 281449651302880 failure_handling.py:1242] PreemptionCheckpointHandler: RECEIVED_SIGNAL_RUN_TO_CHECKPOINT_2 set, preemption awareness acknowledged [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:52.374366 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:52.375317 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:52.366478 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler: RECEIVED_SIGNAL_RUN_TO_CHECKPOINT_0 set, preemption awareness acknowledged [worker-0]: I0423 21:41:52.384230 281447898018272 failure_handling.py:1242] PreemptionCheckpointHandler: RECEIVED_SIGNAL_RUN_TO_CHECKPOINT_0 set, preemption awareness acknowledged [worker-2]: INFO:tensorflow:Sigterm acknowledgement from replica 0 received [worker-2]: I0423 21:41:52.386278 281473829860224 failure_handling.py:1177] Sigterm acknowledgement from replica 0 received [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler: RECEIVED_SIGNAL_RUN_TO_CHECKPOINT_3 set, preemption awareness acknowledged [worker-2]: INFO:tensorflow:Sigterm acknowledgement from replica 1 received [worker-2]: I0423 21:41:52.387658 281473829860224 failure_handling.py:1177] Sigterm acknowledgement from replica 1 received [worker-2]: INFO:tensorflow:Sigterm acknowledgement from replica 2 received [worker-2]: I0423 21:41:52.388234 281473829860224 failure_handling.py:1177] Sigterm acknowledgement from replica 2 received [worker-3]: I0423 21:41:52.386729 281455171138016 failure_handling.py:1242] PreemptionCheckpointHandler: RECEIVED_SIGNAL_RUN_TO_CHECKPOINT_3 set, preemption awareness acknowledged [worker-2]: INFO:tensorflow:Sigterm acknowledgement from replica 3 received [worker-2]: I0423 21:41:52.391030 281473829860224 failure_handling.py:1177] Sigterm acknowledgement from replica 3 received [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:52.408600 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler: Starting saving a checkpoint. [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler: Starting saving a checkpoint. [worker-3]: I0423 21:41:52.484887 281473829860224 failure_handling.py:1063] PreemptionCheckpointHandler: Starting saving a checkpoint. [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler: Starting saving a checkpoint. [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler: Starting saving a checkpoint. [worker-1]: I0423 21:41:52.486667 281473829860224 failure_handling.py:1063] PreemptionCheckpointHandler: Starting saving a checkpoint. [worker-2]: I0423 21:41:52.486675 281473829860224 failure_handling.py:1063] PreemptionCheckpointHandler: Starting saving a checkpoint. [worker-0]: I0423 21:41:52.485162 281473829860224 failure_handling.py:1063] PreemptionCheckpointHandler: Starting saving a checkpoint. [worker-1]: INFO:tensorflow:Checkpoint finished at path /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/_tmp/284377733e18a9a7ee6d6d7363a8b7056tteablh/tmphha0m0cq/fh_ckpt/workertemp_1/ [worker-3]: INFO:tensorflow:Checkpoint finished at path /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/_tmp/284377733e18a9a7ee6d6d7363a8b7056tteablh/tmphha0m0cq/fh_ckpt/workertemp_3/ [worker-3]: I0423 21:41:52.553120 281473829860224 failure_handling.py:1078] Checkpoint finished at path /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/_tmp/284377733e18a9a7ee6d6d7363a8b7056tteablh/tmphha0m0cq/fh_ckpt/workertemp_3/ [worker-0]: INFO:tensorflow:Checkpoint finished at path /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/_tmp/284377733e18a9a7ee6d6d7363a8b7056tteablh/tmphha0m0cq/fh_ckpt/ [worker-0]: I0423 21:41:52.557822 281473829860224 failure_handling.py:1078] Checkpoint finished at path /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/_tmp/284377733e18a9a7ee6d6d7363a8b7056tteablh/tmphha0m0cq/fh_ckpt/ [worker-2]: INFO:tensorflow:Checkpoint finished at path /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/_tmp/284377733e18a9a7ee6d6d7363a8b7056tteablh/tmphha0m0cq/fh_ckpt/workertemp_2/ [worker-2]: I0423 21:41:52.563436 281473829860224 failure_handling.py:1078] Checkpoint finished at path /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/_tmp/284377733e18a9a7ee6d6d7363a8b7056tteablh/tmphha0m0cq/fh_ckpt/workertemp_2/ [worker-2]: INFO:tensorflow:Shut down watcher for one's own termination signal [worker-2]: I0423 21:41:52.563788 281473829860224 failure_handling.py:737] Shut down watcher for one's own termination signal [worker-1]: I0423 21:41:52.553092 281473829860224 failure_handling.py:1078] Checkpoint finished at path /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/_tmp/284377733e18a9a7ee6d6d7363a8b7056tteablh/tmphha0m0cq/fh_ckpt/workertemp_1/ [worker-2]: INFO:tensorflow:Shut down watcher for peer's termination signal. [worker-2]: I0423 21:41:52.588692 281473829860224 failure_handling.py:771] Shut down watcher for peer's termination signal. [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler: checkpoint saved. Exiting. [worker-2]: I0423 21:41:52.588972 281473829860224 failure_handling.py:1128] PreemptionCheckpointHandler: checkpoint saved. Exiting. [worker-0]: INFO:tensorflow:Shut down watcher for one's own termination signal [worker-0]: I0423 21:41:53.223206 281473829860224 failure_handling.py:737] Shut down watcher for one's own termination signal [worker-0]: INFO:tensorflow:Shut down watcher for peer's termination signal. [worker-0]: I0423 21:41:53.254078 281473829860224 failure_handling.py:771] Shut down watcher for peer's termination signal. [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler: checkpoint saved. Exiting. [worker-0]: I0423 21:41:53.254365 281473829860224 failure_handling.py:1128] PreemptionCheckpointHandler: checkpoint saved. Exiting. [worker-3]: INFO:tensorflow:Shut down watcher for one's own termination signal [worker-3]: I0423 21:41:53.312774 281473829860224 failure_handling.py:737] Shut down watcher for one's own termination signal [worker-3]: INFO:tensorflow:Shut down watcher for peer's termination signal. [worker-3]: I0423 21:41:53.325432 281473829860224 failure_handling.py:771] Shut down watcher for peer's termination signal. [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler: checkpoint saved. Exiting. [worker-1]: INFO:tensorflow:Shut down watcher for one's own termination signal [worker-3]: I0423 21:41:53.325721 281473829860224 failure_handling.py:1128] PreemptionCheckpointHandler: checkpoint saved. Exiting. [worker-1]: I0423 21:41:53.326227 281473829860224 failure_handling.py:737] Shut down watcher for one's own termination signal [worker-1]: INFO:tensorflow:Shut down watcher for peer's termination signal. [worker-1]: I0423 21:41:53.327784 281473829860224 failure_handling.py:771] Shut down watcher for peer's termination signal. [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler: checkpoint saved. Exiting. [worker-1]: I0423 21:41:53.327951 281473829860224 failure_handling.py:1128] PreemptionCheckpointHandler: checkpoint saved. Exiting. INFO:tensorflow:restarting workers I0423 21:41:54.517236 281473453224832 gce_failure_handler_test.py:323] restarting workers INFO:tensorflow:workers restarted I0423 21:41:54.635860 281473453224832 gce_failure_handler_test.py:327] workers restarted [worker-1]: I0423 21:41:54.863326 281473829860224 multi_process_runner.py:840] Subprocess with PID 2266308 (worker, 1) is now being started. [worker-0]: I0423 21:41:54.866039 281473829860224 multi_process_runner.py:840] Subprocess with PID 2266246 (worker, 0) is now being started. [worker-0]: I0423 21:41:54.866743 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:18383", "localhost:16009", "localhost:20537", "localhost:19329"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-1]: I0423 21:41:54.863995 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:18383", "localhost:16009", "localhost:20537", "localhost:19329"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-2]: I0423 21:41:54.890973 281473829860224 multi_process_runner.py:840] Subprocess with PID 2266314 (worker, 2) is now being started. [worker-3]: I0423 21:41:54.908181 281473829860224 multi_process_runner.py:840] Subprocess with PID 2266317 (worker, 3) is now being started. [worker-3]: I0423 21:41:54.908869 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:18383", "localhost:16009", "localhost:20537", "localhost:19329"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-2]: I0423 21:41:54.891657 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:18383", "localhost:16009", "localhost:20537", "localhost:19329"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-0]: 2023-04-23 21:41:55.066748: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:18383 [worker-0]: 2023-04-23 21:41:55.097655: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 10809336007733949302 [worker-0]: 2023-04-23 21:41:55.097937: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-2]: 2023-04-23 21:41:55.183991: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:20537 [worker-0]: 2023-04-23 21:41:55.196824: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 1579447817085998295 [worker-2]: 2023-04-23 21:41:55.198514: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-1]: 2023-04-23 21:41:55.206927: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16009 [worker-0]: 2023-04-23 21:41:55.216822: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 15273115696155317798 [worker-1]: 2023-04-23 21:41:55.217426: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: 2023-04-23 21:41:55.506685: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:19329 [worker-3]: 2023-04-23 21:41:55.536170: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:41:55.533567: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 10361797061461032394 [worker-0]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-0]: I0423 21:41:55.549522 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: I0423 21:41:55.566913 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: I0423 21:41:55.567446 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-3]: I0423 21:41:55.612850 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-0]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-0]: I0423 21:41:55.670876 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-0]: INFO:tensorflow:Check health not enabled. [worker-0]: I0423 21:41:55.672095 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-0]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:41:55.672322 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-1]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: I0423 21:41:55.688392 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: INFO:tensorflow:Check health not enabled. [worker-1]: I0423 21:41:55.689596 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-1]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-1]: I0423 21:41:55.689825 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-2]: I0423 21:41:55.695177 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-2]: INFO:tensorflow:Check health not enabled. [worker-2]: I0423 21:41:55.695741 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-2]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: I0423 21:41:55.695962 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: I0423 21:41:55.741871 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: INFO:tensorflow:Check health not enabled. [worker-3]: I0423 21:41:55.743147 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:41:55.743818 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:18383', 'localhost:16009', 'localhost:20537', 'localhost:19329']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: I0423 21:41:55.922708 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start watcher for peer's signal. [worker-1]: I0423 21:41:55.932936 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: INFO:tensorflow:Start polling for termination signal. [worker-0]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: I0423 21:41:55.937546 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-3]: INFO:tensorflow:Start watcher for peer's signal. [worker-3]: I0423 21:41:55.939058 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start polling for termination signal. [worker-1]: I0423 21:41:55.945320 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: I0423 21:41:55.936796 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-3]: INFO:tensorflow:Start polling for termination signal. [worker-0]: INFO:tensorflow:Start polling for termination signal. [worker-3]: I0423 21:41:55.956870 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: I0423 21:41:55.956851 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-0]: I0423 21:41:55.976376 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Instructions for updating: [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: W0423 21:41:55.976727 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Instructions for updating: [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: INFO:tensorflow:Start training at 7 [worker-0]: I0423 21:41:55.976892 281473829860224 gce_failure_handler_test.py:194] Start training at 7 [worker-2]: I0423 21:41:55.966223 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: I0423 21:41:55.986494 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: W0423 21:41:55.986978 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: INFO:tensorflow:Start training at 7 [worker-1]: I0423 21:41:55.987153 281473829860224 gce_failure_handler_test.py:194] Start training at 7 [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-3]: I0423 21:41:55.996282 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-3]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: W0423 21:41:55.996775 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: INFO:tensorflow:Start training at 7 [worker-3]: I0423 21:41:55.996947 281473829860224 gce_failure_handler_test.py:194] Start training at 7 [worker-2]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: W0423 21:41:55.966687 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: INFO:tensorflow:Start training at 7 [worker-2]: I0423 21:41:55.966856 281473829860224 gce_failure_handler_test.py:194] Start training at 7 [worker-0]: INFO:tensorflow:['workertemp_2', 'workertemp_1', 'workertemp_3', 'ckpt-1.data-00000-of-00001', 'ckpt-1.index', 'checkpoint'] [worker-0]: I0423 21:41:56.037718 281473829860224 gce_failure_handler_test.py:203] ['workertemp_2', 'workertemp_1', 'workertemp_3', 'ckpt-1.data-00000-of-00001', 'ckpt-1.index', 'checkpoint'] [worker-2]: INFO:tensorflow:['workertemp_2', 'workertemp_1', 'workertemp_3', 'ckpt-1.data-00000-of-00001', 'ckpt-1.index', 'checkpoint'] [worker-2]: I0423 21:41:56.057931 281473829860224 gce_failure_handler_test.py:203] ['workertemp_2', 'workertemp_1', 'workertemp_3', 'ckpt-1.data-00000-of-00001', 'ckpt-1.index', 'checkpoint'] [worker-1]: INFO:tensorflow:['workertemp_2', 'workertemp_1', 'workertemp_3', 'ckpt-1.data-00000-of-00001', 'ckpt-1.index', 'checkpoint'] [worker-1]: I0423 21:41:56.107701 281473829860224 gce_failure_handler_test.py:203] ['workertemp_2', 'workertemp_1', 'workertemp_3', 'ckpt-1.data-00000-of-00001', 'ckpt-1.index', 'checkpoint'] [worker-3]: INFO:tensorflow:['workertemp_2', 'workertemp_1', 'workertemp_3', 'ckpt-1.data-00000-of-00001', 'ckpt-1.index', 'checkpoint'] [worker-3]: I0423 21:41:56.127549 281473829860224 gce_failure_handler_test.py:203] ['workertemp_2', 'workertemp_1', 'workertemp_3', 'ckpt-1.data-00000-of-00001', 'ckpt-1.index', 'checkpoint'] [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:56.159291 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:56.252472 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:56.264240 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:56.238014 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:56.375468 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:56.399625 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:56.390150 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:56.410295 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:56.547216 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:56.556866 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:56.560748 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:56.586631 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:56.644948 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:56.655960 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:56.664626 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:56.690015 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:56.745855 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:56.745955 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:56.765482 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:56.770876 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb831a8e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:41:56.898396 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb831a8e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: INFO:tensorflow:epoch 1 finished [worker-3]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb8319300> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:41:56.906717 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb8319300> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:epoch 1 finished [worker-3]: I0423 21:41:56.907113 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-2]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb831a7a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:41:56.903980 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb831a7a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:epoch 1 finished [worker-2]: I0423 21:41:56.904275 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-0]: I0423 21:41:56.898695 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb831aa20> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:56.916046 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: W0423 21:41:56.906334 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb831aa20> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:epoch 1 finished [worker-1]: I0423 21:41:56.906631 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-2]: I0423 21:41:56.912567 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:56.924989 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:56.929938 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb831b4c0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb831b6a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:41:57.000109 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb831b4c0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:41:57.005185 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb831b6a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:57.007353 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb8319e40> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:41:57.017224 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb8319e40> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:57.013670 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb831aca0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:41:57.026372 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb831aca0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:57.036672 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:57.054714 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:57.134650 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:57.139705 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:57.144048 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:57.179045 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:57.232002 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:57.255739 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:57.259943 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:57.259840 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:57.351595 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:57.354073 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:57.351567 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:57.372053 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:57.450938 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:57.474242 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:57.463718 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:57.477556 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:epoch 2 finished [worker-3]: INFO:tensorflow:epoch 2 finished [worker-0]: INFO:tensorflow:epoch 2 finished [worker-1]: I0423 21:41:57.608946 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-0]: I0423 21:41:57.616804 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: INFO:tensorflow:epoch 2 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:57.626697 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-3]: I0423 21:41:57.616609 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:57.630797 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:57.618698 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:57.631769 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:57.637173 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:57.714521 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:57.743937 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:57.746724 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:57.757820 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:57.827198 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:57.834659 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:57.832878 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:57.841882 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:58.071772 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:58.079606 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:58.105802 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:58.225421 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:58.272449 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:58.273751 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:58.283071 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:58.310616 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:58.383687 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:58.398309 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:58.412833 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:58.419578 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:epoch 3 finished [worker-0]: I0423 21:41:58.523079 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-3]: INFO:tensorflow:epoch 3 finished [worker-3]: I0423 21:41:58.522670 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-1]: INFO:tensorflow:epoch 3 finished [worker-1]: I0423 21:41:58.523451 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: INFO:tensorflow:epoch 3 finished [worker-2]: I0423 21:41:58.528293 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:58.531105 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:58.535586 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:58.547630 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:58.569603 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:58.716039 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:58.747035 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:58.739839 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:58.772216 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:58.860793 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:58.864661 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:58.849877 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:58.870085 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:58.965357 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:58.988495 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:58.989465 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:59.019576 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:59.137166 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:59.129984 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:59.129997 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:59.150996 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:41:59.240061 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:41:59.242225 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:41:59.249783 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:41:59.257225 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 4 finished [worker-3]: I0423 21:41:59.345557 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-0]: INFO:tensorflow:epoch 4 finished [worker-0]: I0423 21:41:59.346083 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-0]: INFO:tensorflow:Training finished. [worker-0]: I0423 21:41:59.346363 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-1]: INFO:tensorflow:epoch 4 finished [worker-1]: I0423 21:41:59.347653 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-1]: INFO:tensorflow:Training finished. [worker-1]: I0423 21:41:59.347901 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: INFO:tensorflow:epoch 4 finished [worker-2]: I0423 21:41:59.352185 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-3]: INFO:tensorflow:Training finished. [worker-3]: I0423 21:41:59.345850 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: INFO:tensorflow:Training finished. [worker-2]: I0423 21:41:59.352457 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-3]: INFO:tensorflow:Shut down watcher for peer's termination signal. [worker-3]: I0423 21:41:59.353937 281473829860224 failure_handling.py:771] Shut down watcher for peer's termination signal. [worker-1]: INFO:tensorflow:Shut down watcher for peer's termination signal. [worker-1]: I0423 21:41:59.386258 281473829860224 failure_handling.py:771] Shut down watcher for peer's termination signal. [worker-3]: INFO:tensorflow:Shut down watcher for one's own termination signal [worker-3]: I0423 21:41:59.998330 281473829860224 failure_handling.py:737] Shut down watcher for one's own termination signal [worker-1]: INFO:tensorflow:Shut down watcher for one's own termination signal [worker-1]: I0423 21:42:00.056576 281473829860224 failure_handling.py:737] Shut down watcher for one's own termination signal [worker-1]: 2023-04-23 21:42:00.233216: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: UNAVAILABLE: failed to connect to all addresses [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-1]: :{"created":"@1682286120.233040616","description":"Failed to pick subchannel","file":"external/com_github_grpc_grpc/src/core/ext/filters/client_channel/client_channel.cc","file_line":3940,"referenced_errors":[{"created":"@1682286120.227705468","description":"failed to connect to all addresses","file":"external/com_github_grpc_grpc/src/core/ext/filters/client_channel/lb_policy/pick_first/pick_first.cc","file_line":392,"grpc_status":14}]} [worker-1]: 2023-04-23 21:42:00.233305: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort UNAVAILABLE: failed to connect to all addresses [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-1]: :{"created":"@1682286120.233040616","description":"Failed to pick subchannel","file":"external/com_github_grpc_grpc/src/core/ext/filters/client_channel/client_channel.cc","file_line":3940,"referenced_errors":[{"created":"@1682286120.227705468","description":"failed to connect to all addresses","file":"external/com_github_grpc_grpc/src/core/ext/filters/client_channel/lb_policy/pick_first/pick_first.cc","file_line":392,"grpc_status":14}]} [worker-3]: 2023-04-23 21:42:00.562653: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: UNAVAILABLE: failed to connect to all addresses [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286120.562510299","description":"Failed to pick subchannel","file":"external/com_github_grpc_grpc/src/core/ext/filters/client_channel/client_channel.cc","file_line":3940,"referenced_errors":[{"created":"@1682286120.557720843","description":"failed to connect to all addresses","file":"external/com_github_grpc_grpc/src/core/ext/filters/client_channel/lb_policy/pick_first/pick_first.cc","file_line":392,"grpc_status":14}]} [worker-3]: 2023-04-23 21:42:00.562722: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort UNAVAILABLE: failed to connect to all addresses [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286120.562510299","description":"Failed to pick subchannel","file":"external/com_github_grpc_grpc/src/core/ext/filters/client_channel/client_channel.cc","file_line":3940,"referenced_errors":[{"created":"@1682286120.557720843","description":"failed to connect to all addresses","file":"external/com_github_grpc_grpc/src/core/ext/filters/client_channel/lb_policy/pick_first/pick_first.cc","file_line":392,"grpc_status":14}]} I0423 21:42:02.617754 281473453224832 multi_process_runner.py:646] worker-0 exit code: 0 I0423 21:42:02.618062 281473453224832 multi_process_runner.py:646] worker-1 exit code: 0 I0423 21:42:02.618186 281473453224832 multi_process_runner.py:646] worker-2 exit code: 0 I0423 21:42:02.618295 281473453224832 multi_process_runner.py:646] worker-3 exit code: 0 I0423 21:42:02.620106 281473453224832 multi_process_runner.py:662] Joining log reading threads. I0423 21:42:02.620332 281473453224832 multi_process_runner.py:665] Joined log reading threads. INFO:tensorflow:time(__main__.GceFailureHandlingTest.test_basic_run_test_inputarg_manager_strategyoption_MWMSmultiworker): 15.61s I0423 21:42:02.711047 281473453224832 test_util.py:2462] time(__main__.GceFailureHandlingTest.test_basic_run_test_inputarg_manager_strategyoption_MWMSmultiworker): 15.61s [ OK ] GceFailureHandlingTest.test_basic_run_test_inputarg_manager_strategyoption_MWMSmultiworker [ RUN ] GceFailureHandlingTest.test_grace_period_continue_training_test_apiwrappingtrain_False_inputarg_manager_strategyoption_MWMSmultiworker INFO:tensorflow:Using MirroredStrategy with devices ('/device:CPU:0',) I0423 21:42:02.774879 281473453224832 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/device:CPU:0',) INFO:tensorflow:Single-worker MultiWorkerMirroredStrategy with local_devices = ('/device:CPU:0',), communication = CommunicationImplementation.AUTO I0423 21:42:02.775305 281473453224832 collective_all_reduce_strategy.py:446] Single-worker MultiWorkerMirroredStrategy with local_devices = ('/device:CPU:0',), communication = CommunicationImplementation.AUTO INFO:tensorflow:Start polling for termination signal. I0423 21:42:02.800457 281473453224832 failure_handling.py:683] Start polling for termination signal. INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. I0423 21:42:02.802485 281473453224832 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. Instructions for updating: Track steps using a tf.Variable saved in checkpoint instead. W0423 21:42:02.802863 281473453224832 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. Instructions for updating: Track steps using a tf.Variable saved in checkpoint instead. INFO:tensorflow:Start training at 0 I0423 21:42:02.803045 281473453224832 gce_failure_handler_test.py:194] Start training at 0 WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xfffefa704fe0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. W0423 21:42:03.123129 281473453224832 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xfffefa704fe0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xfffefa706160> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. W0423 21:42:03.147369 281473453224832 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xfffefa706160> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. INFO:tensorflow:epoch 0 finished I0423 21:42:03.147777 281473453224832 gce_failure_handler_test.py:192] epoch 0 finished INFO:tensorflow:epoch 1 finished I0423 21:42:03.298123 281473453224832 gce_failure_handler_test.py:192] epoch 1 finished INFO:tensorflow:epoch 2 finished I0423 21:42:03.470829 281473453224832 gce_failure_handler_test.py:192] epoch 2 finished INFO:tensorflow:epoch 3 finished I0423 21:42:03.774605 281473453224832 gce_failure_handler_test.py:192] epoch 3 finished INFO:tensorflow:epoch 4 finished I0423 21:42:04.026662 281473453224832 gce_failure_handler_test.py:192] epoch 4 finished INFO:tensorflow:Training finished. I0423 21:42:04.026944 281473453224832 gce_failure_handler_test.py:244] Training finished. INFO:tensorflow:time(__main__.GceFailureHandlingTest.test_grace_period_continue_training_test_apiwrappingtrain_False_inputarg_manager_strategyoption_MWMSmultiworker): 1.32s I0423 21:42:04.031368 281473453224832 test_util.py:2462] time(__main__.GceFailureHandlingTest.test_grace_period_continue_training_test_apiwrappingtrain_False_inputarg_manager_strategyoption_MWMSmultiworker): 1.32s [ OK ] GceFailureHandlingTest.test_grace_period_continue_training_test_apiwrappingtrain_False_inputarg_manager_strategyoption_MWMSmultiworker [ RUN ] GceFailureHandlingTest.test_grace_period_continue_training_test_apiwrappingtrain_True_inputarg_manager_strategyoption_MWMSmultiworker INFO:tensorflow:Using MirroredStrategy with devices ('/device:CPU:0',) I0423 21:42:04.045993 281473453224832 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/device:CPU:0',) INFO:tensorflow:Single-worker MultiWorkerMirroredStrategy with local_devices = ('/device:CPU:0',), communication = CommunicationImplementation.AUTO I0423 21:42:04.046424 281473453224832 collective_all_reduce_strategy.py:446] Single-worker MultiWorkerMirroredStrategy with local_devices = ('/device:CPU:0',), communication = CommunicationImplementation.AUTO INFO:tensorflow:Start polling for termination signal. I0423 21:42:04.062061 281473453224832 failure_handling.py:683] Start polling for termination signal. INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. I0423 21:42:04.062654 281473453224832 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. INFO:tensorflow:Start training at 0 I0423 21:42:04.062871 281473453224832 gce_failure_handler_test.py:194] Start training at 0 INFO:tensorflow:epoch 0 finished I0423 21:42:04.232458 281473453224832 gce_failure_handler_test.py:192] epoch 0 finished INFO:tensorflow:epoch 1 finished I0423 21:42:04.398182 281473453224832 gce_failure_handler_test.py:192] epoch 1 finished INFO:tensorflow:epoch 2 finished I0423 21:42:04.542752 281473453224832 gce_failure_handler_test.py:192] epoch 2 finished INFO:tensorflow:epoch 3 finished I0423 21:42:04.679828 281473453224832 gce_failure_handler_test.py:192] epoch 3 finished INFO:tensorflow:epoch 4 finished I0423 21:42:04.835602 281473453224832 gce_failure_handler_test.py:192] epoch 4 finished INFO:tensorflow:Training finished. I0423 21:42:04.835953 281473453224832 gce_failure_handler_test.py:244] Training finished. INFO:tensorflow:time(__main__.GceFailureHandlingTest.test_grace_period_continue_training_test_apiwrappingtrain_True_inputarg_manager_strategyoption_MWMSmultiworker): 0.81s I0423 21:42:04.840879 281473453224832 test_util.py:2462] time(__main__.GceFailureHandlingTest.test_grace_period_continue_training_test_apiwrappingtrain_True_inputarg_manager_strategyoption_MWMSmultiworker): 0.81s [ OK ] GceFailureHandlingTest.test_grace_period_continue_training_test_apiwrappingtrain_True_inputarg_manager_strategyoption_MWMSmultiworker [ RUN ] GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_False_graceperiod_0_inputarg_manager_strategyoption_MWMSmultiworker INFO:tensorflow:Using local port 16636 I0423 21:42:04.846392 281473453224832 test_util.py:3794] Using local port 16636 INFO:tensorflow:Using local port 21658 I0423 21:42:04.847250 281473453224832 test_util.py:3794] Using local port 21658 INFO:tensorflow:Using local port 20797 I0423 21:42:04.847620 281473453224832 test_util.py:3794] Using local port 20797 INFO:tensorflow:Using local port 23677 I0423 21:42:04.847975 281473453224832 test_util.py:3794] Using local port 23677 INFO:tensorflow:Cluster starting. I0423 21:42:04.872336 281473453224832 gce_failure_handler_test.py:405] Cluster starting. [worker-0]: I0423 21:42:04.915449 281473829860224 multi_process_runner.py:840] Subprocess with PID 2280954 (worker, 0) is now being started. [worker-0]: I0423 21:42:04.916145 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16636", "localhost:21658", "localhost:20797", "localhost:23677"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-1]: I0423 21:42:04.917848 281473829860224 multi_process_runner.py:840] Subprocess with PID 2280958 (worker, 1) is now being started. [worker-1]: I0423 21:42:04.918509 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16636", "localhost:21658", "localhost:20797", "localhost:23677"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-1]: 2023-04-23 21:42:04.997962: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:21658 [worker-2]: I0423 21:42:05.008888 281473829860224 multi_process_runner.py:840] Subprocess with PID 2280980 (worker, 2) is now being started. [worker-2]: I0423 21:42:05.009500 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16636", "localhost:21658", "localhost:20797", "localhost:23677"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:42:05.035077 281473829860224 multi_process_runner.py:840] Subprocess with PID 2281068 (worker, 3) is now being started. [worker-3]: I0423 21:42:05.035737 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16636", "localhost:21658", "localhost:20797", "localhost:23677"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-2]: 2023-04-23 21:42:05.051679: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:20797 [worker-0]: 2023-04-23 21:42:05.057057: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16636 [worker-0]: 2023-04-23 21:42:05.059559: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 11331323576060035491 [worker-1]: 2023-04-23 21:42:05.061659: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:05.071174: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 4105219231101325414 [worker-2]: 2023-04-23 21:42:05.071612: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:05.074434: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 7241607243790412832 [worker-0]: 2023-04-23 21:42:05.074676: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: 2023-04-23 21:42:05.094682: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:23677 [worker-0]: 2023-04-23 21:42:05.116290: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 2133882424824221462 [worker-3]: 2023-04-23 21:42:05.118731: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-1]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: I0423 21:42:05.130121 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: I0423 21:42:05.137401 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-0]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-0]: I0423 21:42:05.143507 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-3]: I0423 21:42:05.147556 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-1]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: I0423 21:42:05.186510 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: INFO:tensorflow:Check health not enabled. [worker-1]: I0423 21:42:05.187009 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-1]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-1]: I0423 21:42:05.187218 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-2]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-3]: I0423 21:42:05.207762 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-2]: I0423 21:42:05.207960 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-3]: INFO:tensorflow:Check health not enabled. [worker-2]: INFO:tensorflow:Check health not enabled. [worker-3]: I0423 21:42:05.208222 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-2]: I0423 21:42:05.208531 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:42:05.208432 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: I0423 21:42:05.208752 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-0]: I0423 21:42:05.211599 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-0]: INFO:tensorflow:Check health not enabled. [worker-0]: I0423 21:42:05.212266 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-0]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:05.212491 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: I0423 21:42:05.257405 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-0]: INFO:tensorflow:Start polling for termination signal. [worker-0]: I0423 21:42:05.258479 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: Exception in thread WorkerTerminationSignalWatcher-0: [worker-0]: Traceback (most recent call last): [worker-0]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-0]: self.run() [worker-0]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-0]: self._target(*self._args, **self._kwargs) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-0]: if self._termination_watcher_fn(): [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-0]: elif frequent_send and not maintenance_event.is_set(): [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: AttributeError: 'str' object has no attribute 'is_set' [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-0]: I0423 21:42:05.262306 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Instructions for updating: [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: W0423 21:42:05.262659 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Instructions for updating: [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: INFO:tensorflow:Start training at 0 [worker-0]: I0423 21:42:05.262825 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-2]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: I0423 21:42:05.277548 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start watcher for peer's signal. [worker-1]: I0423 21:42:05.279336 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start polling for termination signal. [worker-1]: I0423 21:42:05.280677 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: INFO:tensorflow:Start polling for termination signal. [worker-2]: I0423 21:42:05.286342 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: Exception in thread WorkerTerminationSignalWatcher-2: [worker-2]: Traceback (most recent call last): [worker-2]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: self.run() [worker-2]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-2]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-2]: if self._termination_watcher_fn(): [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-2]: elif frequent_send and not maintenance_event.is_set(): [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: AttributeError: 'str' object has no attribute 'is_set' [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: Exception in thread WorkerTerminationSignalWatcher-1: [worker-2]: I0423 21:42:05.288932 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: Traceback (most recent call last): [worker-2]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: W0423 21:42:05.289314 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: self.run() [worker-1]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-2]: Instructions for updating: [worker-1]: self._target(*self._args, **self._kwargs) [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-2]: INFO:tensorflow:Start training at 0 [worker-1]: if self._termination_watcher_fn(): [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: I0423 21:42:05.289479 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-1]: elif frequent_send and not maintenance_event.is_set(): [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: AttributeError: 'str' object has no attribute 'is_set' [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: I0423 21:42:05.296197 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: W0423 21:42:05.296545 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: INFO:tensorflow:Start training at 0 [worker-1]: I0423 21:42:05.296706 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-3]: INFO:tensorflow:Start watcher for peer's signal. [worker-3]: I0423 21:42:05.298372 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-3]: INFO:tensorflow:Start polling for termination signal. [worker-3]: I0423 21:42:05.317069 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-3]: Exception in thread WorkerTerminationSignalWatcher-3: [worker-3]: Traceback (most recent call last): [worker-3]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-3]: self.run() [worker-3]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-3]: self._target(*self._args, **self._kwargs) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: if self._termination_watcher_fn(): [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-3]: elif frequent_send and not maintenance_event.is_set(): [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: AttributeError: 'str' object has no attribute 'is_set' [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-3]: I0423 21:42:05.366276 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-3]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: W0423 21:42:05.366631 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: INFO:tensorflow:Start training at 0 [worker-3]: I0423 21:42:05.366793 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:05.378041 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:05.446771 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:05.451866 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:05.652570 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:05.776481 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:05.807558 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:05.830066 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:05.829709 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:05.907305 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:05.907525 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:05.914428 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:05.927601 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.002876 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.003890 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.020133 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.020488 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.090041 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.093204 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.096801 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.099929 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830a980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830eac0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:06.157116 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830a980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:06.157324 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830eac0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb8309c60> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:06.158063 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb8309c60> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.164910 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830dc60> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:06.162637 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830dc60> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.165860 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.166007 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.172512 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830ad40> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830ee80> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:06.217466 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830ad40> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:06.217582 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830ee80> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:epoch 0 finished [worker-0]: INFO:tensorflow:epoch 0 finished [worker-3]: I0423 21:42:06.217864 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-0]: I0423 21:42:06.217974 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830ad40> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: W0423 21:42:06.225443 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830ad40> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: I0423 21:42:06.225550 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:epoch 0 finished [worker-3]: I0423 21:42:06.225551 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.225832 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-1]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f2e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:06.227636 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f2e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:epoch 0 finished [worker-1]: I0423 21:42:06.227960 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.236868 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.234348 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.296754 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.298677 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.297255 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.299338 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.354090 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.355027 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.353611 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.370506 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.428250 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.429227 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.429610 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.439158 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.489920 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.504616 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.504436 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.516743 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.613241 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.612737 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.609098 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.629607 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 1 finished [worker-0]: INFO:tensorflow:epoch 1 finished [worker-0]: I0423 21:42:06.722648 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-3]: I0423 21:42:06.717542 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: INFO:tensorflow:epoch 1 finished [worker-1]: I0423 21:42:06.736656 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-2]: INFO:tensorflow:epoch 1 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.739934 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: I0423 21:42:06.743847 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.746768 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.739630 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.764166 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.813943 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.831666 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.838762 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.853007 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.904957 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.905745 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.905755 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.913420 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:06.972335 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:06.973906 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:06.975431 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:06.972283 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.031006 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.031051 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.031015 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.031021 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.082286 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.082775 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.084426 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.082023 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 2 finished [worker-0]: INFO:tensorflow:epoch 2 finished [worker-1]: INFO:tensorflow:epoch 2 finished [worker-3]: I0423 21:42:07.143320 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-1]: I0423 21:42:07.143846 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-0]: I0423 21:42:07.143565 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: INFO:tensorflow:epoch 2 finished [worker-2]: I0423 21:42:07.147366 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.151284 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.151353 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.151861 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.154816 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.203239 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.204218 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.204194 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.207193 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.257837 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.259598 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.262526 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.270298 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.339681 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.340249 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.344702 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.345018 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.424867 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.430478 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.489784 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.479671 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.564266 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.566241 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.566996 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.568595 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 3 finished [worker-0]: INFO:tensorflow:epoch 3 finished [worker-0]: I0423 21:42:07.609932 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-3]: I0423 21:42:07.609679 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-1]: INFO:tensorflow:epoch 3 finished [worker-2]: INFO:tensorflow:epoch 3 finished [worker-1]: I0423 21:42:07.614061 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: I0423 21:42:07.614156 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.616528 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.618220 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.622215 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.624691 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.696103 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.695276 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.727982 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.730537 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.820887 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.830658 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.851344 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.861401 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:07.917085 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:07.917068 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:07.949651 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:07.940628 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:08.014903 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:08.027374 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:08.023535 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:08.034797 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:08.086670 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:08.087336 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:08.087980 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:08.094376 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:epoch 4 finished [worker-0]: I0423 21:42:08.134929 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-0]: INFO:tensorflow:Training finished. [worker-0]: I0423 21:42:08.135190 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-3]: INFO:tensorflow:epoch 4 finished [worker-3]: I0423 21:42:08.135613 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-3]: INFO:tensorflow:Training finished. [worker-3]: I0423 21:42:08.135930 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-1]: INFO:tensorflow:epoch 4 finished [worker-1]: I0423 21:42:08.137545 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-1]: INFO:tensorflow:Training finished. [worker-1]: I0423 21:42:08.137838 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: INFO:tensorflow:epoch 4 finished [worker-2]: I0423 21:42:08.154430 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-2]: INFO:tensorflow:Training finished. [worker-2]: I0423 21:42:08.154743 281473829860224 gce_failure_handler_test.py:244] Training finished. INFO:tensorflow:restarting workers I0423 21:42:09.977180 281473453224832 gce_failure_handler_test.py:411] restarting workers INFO:tensorflow:workers restarted I0423 21:42:10.106225 281473453224832 gce_failure_handler_test.py:415] workers restarted [worker-0]: I0423 21:42:10.119084 281473829860224 multi_process_runner.py:840] Subprocess with PID 2289518 (worker, 0) is now being started. [worker-0]: I0423 21:42:10.119617 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16636", "localhost:21658", "localhost:20797", "localhost:23677"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-0]: 2023-04-23 21:42:10.180775: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16636 [worker-0]: 2023-04-23 21:42:10.246310: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 14373122875219373055 [worker-0]: 2023-04-23 21:42:10.247195: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-2]: I0423 21:42:10.297386 281473829860224 multi_process_runner.py:840] Subprocess with PID 2289526 (worker, 2) is now being started. [worker-1]: I0423 21:42:10.327663 281473829860224 multi_process_runner.py:840] Subprocess with PID 2289523 (worker, 1) is now being started. [worker-3]: I0423 21:42:10.355470 281473829860224 multi_process_runner.py:840] Subprocess with PID 2289531 (worker, 3) is now being started. [worker-2]: I0423 21:42:10.298005 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16636", "localhost:21658", "localhost:20797", "localhost:23677"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-1]: I0423 21:42:10.328285 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16636", "localhost:21658", "localhost:20797", "localhost:23677"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:42:10.356093 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16636", "localhost:21658", "localhost:20797", "localhost:23677"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-2]: 2023-04-23 21:42:10.582814: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:20797 [worker-1]: 2023-04-23 21:42:10.585948: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:21658 [worker-0]: 2023-04-23 21:42:10.593648: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 5882631765840851560 [worker-1]: 2023-04-23 21:42:10.593852: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:10.600781: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 10049170844653268742 [worker-2]: 2023-04-23 21:42:10.601020: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: 2023-04-23 21:42:10.617924: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:23677 [worker-0]: 2023-04-23 21:42:10.626959: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 1589573924155272977 [worker-3]: 2023-04-23 21:42:10.627612: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-1]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-0]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: I0423 21:42:10.639173 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: I0423 21:42:10.639188 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-0]: I0423 21:42:10.639499 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-3]: I0423 21:42:10.651327 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-0]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-1]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-0]: I0423 21:42:10.706048 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-1]: I0423 21:42:10.700485 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: INFO:tensorflow:Check health not enabled. [worker-1]: I0423 21:42:10.700983 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-0]: INFO:tensorflow:Check health not enabled. [worker-1]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:10.707149 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-1]: I0423 21:42:10.701196 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:10.707367 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-2]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-2]: I0423 21:42:10.708044 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-2]: INFO:tensorflow:Check health not enabled. [worker-2]: I0423 21:42:10.708632 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-2]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: I0423 21:42:10.708838 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:42:10.711424 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: INFO:tensorflow:Check health not enabled. [worker-3]: I0423 21:42:10.711906 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:42:10.712109 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16636', 'localhost:21658', 'localhost:20797', 'localhost:23677']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: INFO:tensorflow:Start watcher for peer's signal. [worker-3]: I0423 21:42:10.878881 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-3]: INFO:tensorflow:Start polling for termination signal. [worker-3]: I0423 21:42:10.879935 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-3]: Exception in thread WorkerTerminationSignalWatcher-3: [worker-3]: Traceback (most recent call last): [worker-3]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-3]: self.run() [worker-3]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-3]: self._target(*self._args, **self._kwargs) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: if self._termination_watcher_fn(): [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-3]: elif frequent_send and not maintenance_event.is_set(): [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: AttributeError: 'str' object has no attribute 'is_set' [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-3]: I0423 21:42:10.885682 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-3]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: W0423 21:42:10.886049 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: INFO:tensorflow:Start training at 0 [worker-3]: I0423 21:42:10.886239 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-2]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: I0423 21:42:10.890374 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: INFO:tensorflow:Start polling for termination signal. [worker-2]: I0423 21:42:10.891065 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: Exception in thread WorkerTerminationSignalWatcher-2: [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-2]: Traceback (most recent call last): [worker-2]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: I0423 21:42:10.891499 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-2]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: W0423 21:42:10.891877 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: INFO:tensorflow:Start training at 0 [worker-2]: I0423 21:42:10.892093 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-2]: self.run() [worker-2]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-2]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-2]: if self._termination_watcher_fn(): [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-2]: elif frequent_send and not maintenance_event.is_set(): [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: AttributeError: 'str' object has no attribute 'is_set' [worker-1]: INFO:tensorflow:Start watcher for peer's signal. [worker-1]: I0423 21:42:10.896380 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-0]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: I0423 21:42:10.899590 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start polling for termination signal. [worker-0]: INFO:tensorflow:Start polling for termination signal. [worker-1]: I0423 21:42:10.897157 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: I0423 21:42:10.900267 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: Exception in thread WorkerTerminationSignalWatcher-0: [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: Exception in thread WorkerTerminationSignalWatcher-1: [worker-0]: Traceback (most recent call last): [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-0]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-0]: I0423 21:42:10.900738 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Traceback (most recent call last): [worker-0]: Instructions for updating: [worker-1]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: I0423 21:42:10.897634 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: W0423 21:42:10.901155 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Instructions for updating: [worker-1]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: Instructions for updating: [worker-0]: INFO:tensorflow:Start training at 0 [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: I0423 21:42:10.901369 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: W0423 21:42:10.898022 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: self.run() [worker-1]: Instructions for updating: [worker-0]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: self._target(*self._args, **self._kwargs) [worker-1]: INFO:tensorflow:Start training at 0 [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-1]: I0423 21:42:10.898244 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: if self._termination_watcher_fn(): [worker-1]: self.run() [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-1]: self._target(*self._args, **self._kwargs) [worker-0]: elif frequent_send and not maintenance_event.is_set(): [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: if self._termination_watcher_fn(): [worker-0]: AttributeError: 'str' object has no attribute 'is_set' [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-1]: elif frequent_send and not maintenance_event.is_set(): [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: AttributeError: 'str' object has no attribute 'is_set' [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.045022 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.053594 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.061791 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.062293 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.127339 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.127454 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.133305 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.127832 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.187771 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.205186 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.223319 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.310156 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.371610 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.374507 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.374507 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.375651 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.422438 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.426921 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.444995 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.431733 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830dda0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:11.487048 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830dda0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.494216 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb8309da0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:11.507413 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb8309da0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830a980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:11.514970 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830a980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.515724 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb8311da0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:11.516454 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb8311da0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.540292 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.540125 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830c860> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:11.602223 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830c860> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:epoch 0 finished [worker-3]: I0423 21:42:11.602568 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.609912 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830b420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:11.627069 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830b420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:epoch 0 finished [worker-1]: I0423 21:42:11.627445 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830b560> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:11.629222 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830b560> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:epoch 0 finished [worker-2]: I0423 21:42:11.629551 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.634943 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb83132e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:11.636363 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb83132e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:epoch 0 finished [worker-0]: I0423 21:42:11.636720 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: I0423 21:42:11.636644 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.644207 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.693719 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.694802 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.693975 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.700037 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.750847 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.750886 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.751038 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.751007 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.799786 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.800105 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.819789 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.820117 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.891586 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.892019 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.896427 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.896603 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:11.953154 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:11.955531 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:11.956063 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:11.962155 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 1 finished [worker-1]: INFO:tensorflow:epoch 1 finished [worker-2]: INFO:tensorflow:epoch 1 finished [worker-0]: INFO:tensorflow:epoch 1 finished [worker-2]: I0423 21:42:12.002458 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: I0423 21:42:12.002422 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-3]: I0423 21:42:12.002116 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-0]: I0423 21:42:12.002321 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.009377 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.009877 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.010097 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.009918 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.077037 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.077142 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.079274 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.079902 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.128668 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.128804 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.129569 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.129944 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.180323 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.180319 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.180636 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.180438 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.234195 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.234727 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.234807 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.234893 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.288384 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.289225 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.292239 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.303650 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 2 finished [worker-0]: INFO:tensorflow:epoch 2 finished [worker-1]: INFO:tensorflow:epoch 2 finished [worker-2]: INFO:tensorflow:epoch 2 finished [worker-3]: I0423 21:42:12.344641 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-0]: I0423 21:42:12.344838 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-1]: I0423 21:42:12.345016 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: I0423 21:42:12.345071 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.351260 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.351607 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.351557 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.351753 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.403131 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.404036 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.403688 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.409884 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.457860 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.457987 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.457944 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.458182 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.512478 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.512120 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.513505 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.512869 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.568698 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.568756 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.568834 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.584433 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.631321 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.631438 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.631465 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.632090 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 3 finished [worker-0]: INFO:tensorflow:epoch 3 finished [worker-1]: INFO:tensorflow:epoch 3 finished [worker-2]: INFO:tensorflow:epoch 3 finished [worker-3]: I0423 21:42:12.675249 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-0]: I0423 21:42:12.675410 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: I0423 21:42:12.675605 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-1]: I0423 21:42:12.675575 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.683335 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.683355 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.683055 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.683435 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.731586 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.731953 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.732032 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.735694 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.787726 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.788951 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.787978 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.801373 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.856791 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.858802 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.859228 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.860742 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:12.911526 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:12.911928 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:12.912264 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:12.912361 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:13.023862 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:13.014884 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:13.019956 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:13.029842 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:epoch 4 finished [worker-3]: INFO:tensorflow:epoch 4 finished [worker-3]: I0423 21:42:13.072941 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-0]: I0423 21:42:13.073242 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-3]: INFO:tensorflow:Training finished. [worker-1]: INFO:tensorflow:epoch 4 finished [worker-0]: INFO:tensorflow:Training finished. [worker-1]: I0423 21:42:13.074389 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-3]: I0423 21:42:13.073202 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-0]: I0423 21:42:13.073508 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: INFO:tensorflow:epoch 4 finished [worker-1]: INFO:tensorflow:Training finished. [worker-2]: I0423 21:42:13.075087 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-1]: I0423 21:42:13.074645 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: INFO:tensorflow:Training finished. [worker-2]: I0423 21:42:13.075359 281473829860224 gce_failure_handler_test.py:244] Training finished. INFO:tensorflow:Termination notice available. I0423 21:42:13.836859 281462729077216 gce_failure_handler_test.py:142] Termination notice available. --- Logging error --- Traceback (most recent call last): File "/usr/lib/python3.11/logging/__init__.py", line 1110, in emit msg = self.format(record) ^^^^^^^^^^^^^^^^^^^ File "/usr/lib/python3.11/logging/__init__.py", line 953, in format return fmt.format(record) ^^^^^^^^^^^^^^^^^^ File "/usr/lib/python3.11/logging/__init__.py", line 687, in format record.message = record.getMessage() ^^^^^^^^^^^^^^^^^^^ File "/usr/lib/python3.11/logging/__init__.py", line 377, in getMessage msg = msg % self.args ~~~~^~~~~~~~~~~ TypeError: not all arguments converted during string formatting Call stack: File "/usr/lib/python3.11/threading.py", line 995, in _bootstrap self._bootstrap_inner() File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner self.run() File "/usr/lib/python3.11/threading.py", line 975, in run self._target(*self._args, **self._kwargs) File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 696, in _poll_termination_signal self._maybe_set_received_own_sigterm() File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 701, in _maybe_set_received_own_sigterm logging.info('Received termination notice.', File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/platform/tf_logging.py", line 204, in info get_logger().info(msg, *args, **kwargs) Message: 'Received termination notice.' Arguments: ('single_worker',) --- Logging error --- Traceback (most recent call last): File "/usr/lib/python3.11/logging/__init__.py", line 1110, in emit msg = self.format(record) ^^^^^^^^^^^^^^^^^^^ File "/usr/lib/python3.11/logging/__init__.py", line 953, in format return fmt.format(record) ^^^^^^^^^^^^^^^^^^ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/logging/__init__.py", line 1025, in format return prefix + super(PythonFormatter, self).format(record) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/usr/lib/python3.11/logging/__init__.py", line 687, in format record.message = record.getMessage() ^^^^^^^^^^^^^^^^^^^ File "/usr/lib/python3.11/logging/__init__.py", line 377, in getMessage msg = msg % self.args ~~~~^~~~~~~~~~~ TypeError: not all arguments converted during string formatting Call stack: File "/usr/lib/python3.11/threading.py", line 995, in _bootstrap self._bootstrap_inner() File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner self.run() File "/usr/lib/python3.11/threading.py", line 975, in run self._target(*self._args, **self._kwargs) File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 696, in _poll_termination_signal self._maybe_set_received_own_sigterm() File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 701, in _maybe_set_received_own_sigterm logging.info('Received termination notice.', File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/platform/tf_logging.py", line 204, in info get_logger().info(msg, *args, **kwargs) File "/usr/lib/python3.11/logging/__init__.py", line 1489, in info self._log(INFO, msg, args, **kwargs) File "/usr/lib/python3.11/logging/__init__.py", line 1634, in _log self.handle(record) File "/usr/lib/python3.11/logging/__init__.py", line 1644, in handle self.callHandlers(record) File "/usr/lib/python3.11/logging/__init__.py", line 1706, in callHandlers hdlr.handle(record) File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/logging/__init__.py", line 988, in handle return self._current_handler.handle(record) File "/usr/lib/python3.11/logging/__init__.py", line 978, in handle self.emit(record) File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/logging/__init__.py", line 925, in emit super(PythonHandler, self).emit(record) Message: 'Received termination notice.' Arguments: ('single_worker',) Exception ignored in: Traceback (most recent call last): File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 775, in __del__ self._stop_poll_termination_signal_thread() File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 734, in _stop_poll_termination_signal_thread self._poll_termination_signal_thread.join() File "/usr/lib/python3.11/threading.py", line 1109, in join raise RuntimeError("cannot join current thread") RuntimeError: cannot join current thread I0423 21:42:14.017671 281473453224832 multi_process_runner.py:646] worker-0 exit code: 0 I0423 21:42:14.017951 281473453224832 multi_process_runner.py:646] worker-1 exit code: 0 I0423 21:42:14.018070 281473453224832 multi_process_runner.py:646] worker-2 exit code: 0 I0423 21:42:14.018173 281473453224832 multi_process_runner.py:646] worker-3 exit code: 0 I0423 21:42:14.020777 281473453224832 multi_process_runner.py:662] Joining log reading threads. I0423 21:42:14.020989 281473453224832 multi_process_runner.py:665] Joined log reading threads. INFO:tensorflow:time(__main__.GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_False_graceperiod_0_inputarg_manager_strategyoption_MWMSmultiworker): 9.32s I0423 21:42:14.164374 281473453224832 test_util.py:2462] time(__main__.GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_False_graceperiod_0_inputarg_manager_strategyoption_MWMSmultiworker): 9.32s [ OK ] GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_False_graceperiod_0_inputarg_manager_strategyoption_MWMSmultiworker [ RUN ] GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_False_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker INFO:tensorflow:Using local port 20398 I0423 21:42:14.171209 281473453224832 test_util.py:3794] Using local port 20398 INFO:tensorflow:Using local port 22688 I0423 21:42:14.171675 281473453224832 test_util.py:3794] Using local port 22688 INFO:tensorflow:Using local port 22942 I0423 21:42:14.172021 281473453224832 test_util.py:3794] Using local port 22942 INFO:tensorflow:Using local port 16311 I0423 21:42:14.172354 281473453224832 test_util.py:3794] Using local port 16311 INFO:tensorflow:Cluster starting. I0423 21:42:14.219702 281473453224832 gce_failure_handler_test.py:405] Cluster starting. [worker-1]: I0423 21:42:14.273583 281473829860224 multi_process_runner.py:840] Subprocess with PID 2298468 (worker, 1) is now being started. [worker-1]: I0423 21:42:14.274083 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:20398", "localhost:22688", "localhost:22942", "localhost:16311"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-2]: I0423 21:42:14.281525 281473829860224 multi_process_runner.py:840] Subprocess with PID 2298470 (worker, 2) is now being started. [worker-0]: I0423 21:42:14.282351 281473829860224 multi_process_runner.py:840] Subprocess with PID 2298465 (worker, 0) is now being started. [worker-2]: I0423 21:42:14.282019 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:20398", "localhost:22688", "localhost:22942", "localhost:16311"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-0]: I0423 21:42:14.282798 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:20398", "localhost:22688", "localhost:22942", "localhost:16311"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:42:14.289071 281473829860224 multi_process_runner.py:840] Subprocess with PID 2298473 (worker, 3) is now being started. [worker-3]: I0423 21:42:14.289566 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:20398", "localhost:22688", "localhost:22942", "localhost:16311"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-1]: 2023-04-23 21:42:14.315977: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:22688 [worker-2]: 2023-04-23 21:42:14.315969: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:22942 [worker-0]: 2023-04-23 21:42:14.316447: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:20398 [worker-3]: 2023-04-23 21:42:14.324356: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16311 [worker-0]: 2023-04-23 21:42:14.327979: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 13052616424752085840 [worker-2]: 2023-04-23 21:42:14.328610: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:14.329831: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 6761235770174467133 [worker-0]: 2023-04-23 21:42:14.329981: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:14.330982: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 13777813839443883001 [worker-1]: 2023-04-23 21:42:14.331125: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:14.334670: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 5433979606884966184 [worker-3]: 2023-04-23 21:42:14.334908: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-0]: I0423 21:42:14.337042 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: I0423 21:42:14.337152 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: I0423 21:42:14.345703 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-3]: I0423 21:42:14.345586 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-0]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-2]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-0]: I0423 21:42:14.393652 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-2]: I0423 21:42:14.393652 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-0]: INFO:tensorflow:Check health not enabled. [worker-2]: INFO:tensorflow:Check health not enabled. [worker-0]: I0423 21:42:14.394140 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-2]: I0423 21:42:14.394141 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-0]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:14.394348 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: I0423 21:42:14.394347 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-1]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: I0423 21:42:14.402729 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: INFO:tensorflow:Check health not enabled. [worker-1]: I0423 21:42:14.403164 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-1]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-1]: I0423 21:42:14.403377 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: I0423 21:42:14.404040 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: INFO:tensorflow:Check health not enabled. [worker-3]: I0423 21:42:14.404595 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:42:14.404814 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: I0423 21:42:14.445962 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: INFO:tensorflow:Start polling for termination signal. [worker-1]: I0423 21:42:14.447044 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-0]: I0423 21:42:14.447201 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-1]: INFO:tensorflow:Start polling for termination signal. [worker-0]: Exception in thread WorkerTerminationSignalWatcher-0: [worker-1]: I0423 21:42:14.447968 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: Exception in thread WorkerTerminationSignalWatcher-1: [worker-0]: Traceback (most recent call last): [worker-1]: Traceback (most recent call last): [worker-1]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: INFO:tensorflow:Start watcher for peer's signal. [worker-1]: self.run() [worker-2]: I0423 21:42:14.449536 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-2]: INFO:tensorflow:Start polling for termination signal. [worker-1]: self._target(*self._args, **self._kwargs) [worker-3]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: I0423 21:42:14.450530 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: I0423 21:42:14.452210 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: Exception in thread WorkerTerminationSignalWatcher-2: [worker-0]: I0423 21:42:14.448274 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-3]: INFO:tensorflow:Start polling for termination signal. [worker-1]: if self._termination_watcher_fn(): [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-0]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: I0423 21:42:14.452939 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: Instructions for updating: [worker-2]: Traceback (most recent call last): [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: Exception in thread WorkerTerminationSignalWatcher-3: [worker-2]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: elif frequent_send and not maintenance_event.is_set(): [worker-0]: W0423 21:42:14.448930 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: I0423 21:42:14.451051 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: Instructions for updating: [worker-2]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Traceback (most recent call last): [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: Instructions for updating: [worker-3]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-1]: AttributeError: 'str' object has no attribute 'is_set' [worker-3]: I0423 21:42:14.453401 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: INFO:tensorflow:Start training at 0 [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: I0423 21:42:14.449329 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-2]: W0423 21:42:14.451462 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-1]: I0423 21:42:14.450185 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: self.run() [worker-1]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: Instructions for updating: [worker-0]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-3]: W0423 21:42:14.453830 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: self._target(*self._args, **self._kwargs) [worker-3]: Instructions for updating: [worker-2]: INFO:tensorflow:Start training at 0 [worker-1]: Instructions for updating: [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: I0423 21:42:14.451632 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: self.run() [worker-0]: if self._termination_watcher_fn(): [worker-3]: INFO:tensorflow:Start training at 0 [worker-2]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-1]: W0423 21:42:14.450580 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: self._target(*self._args, **self._kwargs) [worker-1]: Instructions for updating: [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: I0423 21:42:14.454011 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: elif frequent_send and not maintenance_event.is_set(): [worker-3]: self.run() [worker-1]: INFO:tensorflow:Start training at 0 [worker-2]: if self._termination_watcher_fn(): [worker-3]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: self._target(*self._args, **self._kwargs) [worker-0]: AttributeError: 'str' object has no attribute 'is_set' [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-2]: elif frequent_send and not maintenance_event.is_set(): [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-1]: I0423 21:42:14.450747 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: if self._termination_watcher_fn(): [worker-2]: AttributeError: 'str' object has no attribute 'is_set' [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-3]: elif frequent_send and not maintenance_event.is_set(): [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: AttributeError: 'str' object has no attribute 'is_set' [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:14.681414 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:14.685637 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:14.677657 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:14.695403 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:14.771609 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:14.771674 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:14.771694 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:14.773678 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:14.839555 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:14.854195 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:14.854537 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:14.856173 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:14.964517 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:14.959634 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:14.979809 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:14.989505 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.063558 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.064771 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.070411 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.060226 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e160> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830a160> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:15.117199 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:15.117534 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e160> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:15.117541 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:15.117582 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830a160> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.125172 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.125217 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.126038 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.124403 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:15.165735 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:15.165880 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830b420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:15.166131 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:epoch 0 finished [worker-0]: INFO:tensorflow:epoch 0 finished [worker-2]: W0423 21:42:15.166386 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830b420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:epoch 0 finished [worker-0]: I0423 21:42:15.166195 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-3]: I0423 21:42:15.166056 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: INFO:tensorflow:epoch 0 finished [worker-1]: I0423 21:42:15.166430 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: I0423 21:42:15.166677 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.173137 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.173129 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.173712 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.174023 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.243221 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.244473 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.244723 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.250297 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.301537 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.301531 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.301545 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.302522 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.355551 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.355433 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.356003 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.355810 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.420161 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.421578 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.421663 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.421607 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.475168 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.475292 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.477417 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.477569 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 1 finished [worker-3]: I0423 21:42:15.526772 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-0]: INFO:tensorflow:epoch 1 finished [worker-1]: INFO:tensorflow:epoch 1 finished [worker-0]: I0423 21:42:15.529470 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: I0423 21:42:15.529671 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-2]: INFO:tensorflow:epoch 1 finished [worker-2]: I0423 21:42:15.531983 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.535364 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.535961 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.536200 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.540530 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.596422 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.605049 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.605134 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.608724 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.659358 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.659903 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.660167 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.660168 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.709447 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.721032 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.721929 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.722585 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.772285 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.772289 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.773553 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.774260 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.833688 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.835944 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.837067 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.837502 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 2 finished [worker-1]: INFO:tensorflow:epoch 2 finished [worker-0]: INFO:tensorflow:epoch 2 finished [worker-3]: I0423 21:42:15.880282 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: INFO:tensorflow:epoch 2 finished [worker-1]: I0423 21:42:15.880587 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-0]: I0423 21:42:15.880550 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: I0423 21:42:15.880741 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.887143 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.887660 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.888525 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.888820 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:15.953242 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:15.953610 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:15.954275 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:15.957183 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.003387 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.003413 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.004295 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.004366 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.055610 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.056454 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.057386 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.061735 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.107841 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.108535 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.108400 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.108965 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.158926 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.158960 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.159865 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.160397 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 3 finished [worker-0]: INFO:tensorflow:epoch 3 finished [worker-1]: INFO:tensorflow:epoch 3 finished [worker-3]: I0423 21:42:16.209383 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: INFO:tensorflow:epoch 3 finished [worker-0]: I0423 21:42:16.209648 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-1]: I0423 21:42:16.209767 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: I0423 21:42:16.209860 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.216159 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.216804 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.218359 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.228453 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.277951 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.281095 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.280180 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.281635 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.341494 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.341801 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.341733 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.341315 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.409263 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.411981 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.412365 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.412871 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.461728 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.461814 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.463232 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.467717 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:16.513868 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:16.514753 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:16.517223 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:16.539549 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 4 finished [worker-3]: I0423 21:42:16.653257 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-3]: INFO:tensorflow:Training finished. [worker-3]: I0423 21:42:16.653492 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-1]: INFO:tensorflow:epoch 4 finished [worker-0]: INFO:tensorflow:epoch 4 finished [worker-1]: I0423 21:42:16.656834 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-2]: INFO:tensorflow:epoch 4 finished [worker-0]: I0423 21:42:16.657524 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-0]: INFO:tensorflow:Training finished. [worker-0]: I0423 21:42:16.657750 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: I0423 21:42:16.657741 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-1]: INFO:tensorflow:Training finished. [worker-1]: I0423 21:42:16.657058 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: INFO:tensorflow:Training finished. [worker-2]: I0423 21:42:16.657952 281473829860224 gce_failure_handler_test.py:244] Training finished. INFO:tensorflow:restarting workers I0423 21:42:18.260449 281473453224832 gce_failure_handler_test.py:411] restarting workers INFO:tensorflow:workers restarted I0423 21:42:18.293382 281473453224832 gce_failure_handler_test.py:415] workers restarted [worker-0]: I0423 21:42:18.298250 281473829860224 multi_process_runner.py:840] Subprocess with PID 2307268 (worker, 0) is now being started. [worker-0]: I0423 21:42:18.298825 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:20398", "localhost:22688", "localhost:22942", "localhost:16311"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-1]: I0423 21:42:18.305569 281473829860224 multi_process_runner.py:840] Subprocess with PID 2307276 (worker, 1) is now being started. [worker-1]: I0423 21:42:18.306154 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:20398", "localhost:22688", "localhost:22942", "localhost:16311"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-2]: I0423 21:42:18.315971 281473829860224 multi_process_runner.py:840] Subprocess with PID 2307286 (worker, 2) is now being started. [worker-2]: I0423 21:42:18.316520 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:20398", "localhost:22688", "localhost:22942", "localhost:16311"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:42:18.331595 281473829860224 multi_process_runner.py:840] Subprocess with PID 2307358 (worker, 3) is now being started. [worker-3]: I0423 21:42:18.332223 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:20398", "localhost:22688", "localhost:22942", "localhost:16311"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-0]: 2023-04-23 21:42:18.334171: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:20398 [worker-0]: 2023-04-23 21:42:18.338631: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 10812750873541054630 [worker-1]: 2023-04-23 21:42:18.340982: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:22688 [worker-0]: 2023-04-23 21:42:18.347037: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:18.351668: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 17923444337519011285 [worker-1]: 2023-04-23 21:42:18.351902: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: 2023-04-23 21:42:18.377211: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16311 [worker-2]: 2023-04-23 21:42:18.378158: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:22942 [worker-0]: 2023-04-23 21:42:18.386945: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 1021590881332887019 [worker-3]: 2023-04-23 21:42:18.387141: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:18.403238: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 7031367907648216338 [worker-2]: 2023-04-23 21:42:18.403416: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-1]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-0]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: I0423 21:42:18.405132 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: I0423 21:42:18.405177 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: I0423 21:42:18.405229 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-0]: I0423 21:42:18.405275 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-2]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-3]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-1]: I0423 21:42:18.462946 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-2]: I0423 21:42:18.463017 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-3]: I0423 21:42:18.463069 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-1]: INFO:tensorflow:Check health not enabled. [worker-2]: INFO:tensorflow:Check health not enabled. [worker-3]: INFO:tensorflow:Check health not enabled. [worker-1]: I0423 21:42:18.463389 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-2]: I0423 21:42:18.463464 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: I0423 21:42:18.463587 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-1]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-3]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: I0423 21:42:18.463671 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-1]: I0423 21:42:18.463598 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:18.464646 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-3]: I0423 21:42:18.463797 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Check health not enabled. [worker-0]: I0423 21:42:18.465160 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-0]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:18.465363 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:20398', 'localhost:22688', 'localhost:22942', 'localhost:16311']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: I0423 21:42:18.493606 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: INFO:tensorflow:Start watcher for peer's signal. [worker-3]: INFO:tensorflow:Start watcher for peer's signal. [worker-1]: I0423 21:42:18.494256 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-0]: INFO:tensorflow:Start polling for termination signal. [worker-2]: I0423 21:42:18.494301 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-3]: I0423 21:42:18.494369 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: INFO:tensorflow:Start polling for termination signal. [worker-3]: INFO:tensorflow:Start polling for termination signal. [worker-3]: I0423 21:42:18.495079 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: I0423 21:42:18.495022 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-3]: Exception in thread WorkerTerminationSignalWatcher-3: [worker-2]: Exception in thread WorkerTerminationSignalWatcher-2: [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-2]: Traceback (most recent call last): [worker-1]: INFO:tensorflow:Start polling for termination signal. [worker-3]: Traceback (most recent call last): [worker-2]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-0]: I0423 21:42:18.494380 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-1]: I0423 21:42:18.494934 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-3]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-0]: Exception in thread WorkerTerminationSignalWatcher-0: [worker-1]: Exception in thread WorkerTerminationSignalWatcher-1: [worker-3]: I0423 21:42:18.495515 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-2]: I0423 21:42:18.495563 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: Traceback (most recent call last): [worker-1]: Traceback (most recent call last): [worker-3]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-1]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-2]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: I0423 21:42:18.494955 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: I0423 21:42:18.495512 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: W0423 21:42:18.495949 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: W0423 21:42:18.495861 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: Instructions for updating: [worker-0]: Instructions for updating: [worker-1]: Instructions for updating: [worker-2]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: self.run() [worker-0]: W0423 21:42:18.495379 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: INFO:tensorflow:Start training at 0 [worker-1]: W0423 21:42:18.495918 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: INFO:tensorflow:Start training at 0 [worker-2]: I0423 21:42:18.496033 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: Instructions for updating: [worker-3]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-2]: self.run() [worker-0]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-3]: I0423 21:42:18.496598 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: INFO:tensorflow:Start training at 0 [worker-2]: self._target(*self._args, **self._kwargs) [worker-3]: self._target(*self._args, **self._kwargs) [worker-0]: INFO:tensorflow:Start training at 0 [worker-1]: I0423 21:42:18.496087 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-0]: I0423 21:42:18.495559 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: self.run() [worker-2]: if self._termination_watcher_fn(): [worker-3]: if self._termination_watcher_fn(): [worker-0]: self.run() [worker-1]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-0]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-0]: self._target(*self._args, **self._kwargs) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: elif frequent_send and not maintenance_event.is_set(): [worker-2]: elif frequent_send and not maintenance_event.is_set(): [worker-1]: if self._termination_watcher_fn(): [worker-0]: if self._termination_watcher_fn(): [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-0]: elif frequent_send and not maintenance_event.is_set(): [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-3]: AttributeError: 'str' object has no attribute 'is_set' [worker-2]: AttributeError: 'str' object has no attribute 'is_set' [worker-1]: elif frequent_send and not maintenance_event.is_set(): [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: AttributeError: 'str' object has no attribute 'is_set' [worker-1]: AttributeError: 'str' object has no attribute 'is_set' [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:18.598567 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:18.599686 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:18.622418 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:18.627553 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:18.716345 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:18.716987 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:18.717377 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:18.717581 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:18.782263 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:18.785246 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:18.793404 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:18.794921 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:18.851584 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:18.851681 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:18.852878 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:18.863564 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:18.923493 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:18.924631 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:18.925861 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:18.942304 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:19.026651 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:19.024169 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:19.034398 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e160> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:19.046531 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e160> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:19.042514 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:19.054177 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:19.050189 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:19.050417 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f060> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830ee80> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:19.186586 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f060> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:19.186972 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830ee80> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:epoch 0 finished [worker-0]: INFO:tensorflow:epoch 0 finished [worker-3]: I0423 21:42:19.186929 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-0]: I0423 21:42:19.187351 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:19.194740 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f060> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:19.198187 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f060> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:epoch 0 finished [worker-2]: I0423 21:42:19.198545 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:19.206024 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f060> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:19.206498 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f060> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:epoch 0 finished [worker-1]: I0423 21:42:19.206887 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:19.214651 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:19.218411 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:19.290121 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:19.309976 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:19.310166 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:19.339025 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:19.424922 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:19.449869 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:19.446696 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:19.657260 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:19.704984 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:19.705700 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:19.705892 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:19.705890 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:19.774113 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:19.774591 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:19.774973 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:19.782944 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:19.851119 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:19.851790 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:19.851893 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:19.852360 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:epoch 1 finished [worker-1]: I0423 21:42:19.894683 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-2]: INFO:tensorflow:epoch 1 finished [worker-2]: I0423 21:42:19.895159 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-0]: INFO:tensorflow:epoch 1 finished [worker-0]: I0423 21:42:19.894289 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:19.902410 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:19.905224 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 1 finished [worker-3]: I0423 21:42:19.894112 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:19.902062 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:19.911464 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:19.984995 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:19.978885 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:19.978839 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:19.978954 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.034757 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.035176 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.034668 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.035473 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.084667 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.103446 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.104424 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.099213 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.164403 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.173852 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.184264 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.190370 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.244126 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.244260 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.244700 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.245109 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 2 finished [worker-0]: INFO:tensorflow:epoch 2 finished [worker-1]: INFO:tensorflow:epoch 2 finished [worker-2]: INFO:tensorflow:epoch 2 finished [worker-3]: I0423 21:42:20.284614 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-0]: I0423 21:42:20.284786 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-1]: I0423 21:42:20.284953 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: I0423 21:42:20.284970 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.291966 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.292145 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.292197 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.292414 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.352324 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.353512 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.353526 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.353464 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.403614 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.403663 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.409998 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.408854 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.519836 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.519777 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.530730 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.539939 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.657219 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.660474 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.660084 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.669813 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.746572 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.747332 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.748271 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.763739 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 3 finished [worker-3]: I0423 21:42:20.810787 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-0]: INFO:tensorflow:epoch 3 finished [worker-0]: I0423 21:42:20.811527 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.819172 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.818757 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:epoch 3 finished [worker-2]: I0423 21:42:20.827677 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-1]: INFO:tensorflow:epoch 3 finished [worker-1]: I0423 21:42:20.828654 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.834848 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.836869 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.897348 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.900957 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.902236 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.907306 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:20.965028 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:20.965028 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:20.965651 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:20.965689 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:21.035242 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:21.035986 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:21.040633 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:21.034685 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:21.087214 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:21.087809 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:21.089078 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:21.088670 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:21.157446 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:21.160945 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:21.157696 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:21.158300 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 4 finished [worker-0]: INFO:tensorflow:epoch 4 finished [worker-0]: I0423 21:42:21.289047 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-3]: I0423 21:42:21.287215 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-0]: INFO:tensorflow:Training finished. [worker-3]: INFO:tensorflow:Training finished. [worker-0]: I0423 21:42:21.289285 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-3]: I0423 21:42:21.287525 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-1]: INFO:tensorflow:epoch 4 finished [worker-1]: I0423 21:42:21.296596 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-1]: INFO:tensorflow:Training finished. [worker-1]: I0423 21:42:21.296824 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: INFO:tensorflow:epoch 4 finished [worker-2]: I0423 21:42:21.300766 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-2]: INFO:tensorflow:Training finished. [worker-2]: I0423 21:42:21.300997 281473829860224 gce_failure_handler_test.py:244] Training finished. I0423 21:42:22.272095 281473453224832 multi_process_runner.py:646] worker-0 exit code: 0 I0423 21:42:22.272355 281473453224832 multi_process_runner.py:646] worker-1 exit code: 0 I0423 21:42:22.272471 281473453224832 multi_process_runner.py:646] worker-2 exit code: 0 I0423 21:42:22.272574 281473453224832 multi_process_runner.py:646] worker-3 exit code: 0 I0423 21:42:22.275339 281473453224832 multi_process_runner.py:662] Joining log reading threads. I0423 21:42:22.275547 281473453224832 multi_process_runner.py:665] Joined log reading threads. INFO:tensorflow:time(__main__.GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_False_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker): 8.27s I0423 21:42:22.433294 281473453224832 test_util.py:2462] time(__main__.GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_False_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker): 8.27s [ OK ] GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_False_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker [ RUN ] GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_0_inputarg_manager_strategyoption_MWMSmultiworker INFO:tensorflow:Using local port 15507 I0423 21:42:22.435082 281473453224832 test_util.py:3794] Using local port 15507 INFO:tensorflow:Using local port 16395 I0423 21:42:22.435507 281473453224832 test_util.py:3794] Using local port 16395 INFO:tensorflow:Using local port 16639 I0423 21:42:22.435852 281473453224832 test_util.py:3794] Using local port 16639 INFO:tensorflow:Using local port 18544 I0423 21:42:22.436264 281473453224832 test_util.py:3794] Using local port 18544 INFO:tensorflow:Cluster starting. I0423 21:42:22.556252 281473453224832 gce_failure_handler_test.py:405] Cluster starting. [worker-0]: I0423 21:42:22.620365 281473829860224 multi_process_runner.py:840] Subprocess with PID 2316106 (worker, 0) is now being started. [worker-2]: I0423 21:42:22.626967 281473829860224 multi_process_runner.py:840] Subprocess with PID 2316118 (worker, 2) is now being started. [worker-0]: I0423 21:42:22.620945 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:15507", "localhost:16395", "localhost:16639", "localhost:18544"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-2]: I0423 21:42:22.627526 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:15507", "localhost:16395", "localhost:16639", "localhost:18544"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-1]: I0423 21:42:22.630552 281473829860224 multi_process_runner.py:840] Subprocess with PID 2316112 (worker, 1) is now being started. [worker-1]: I0423 21:42:22.631104 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:15507", "localhost:16395", "localhost:16639", "localhost:18544"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:42:22.632991 281473829860224 multi_process_runner.py:840] Subprocess with PID 2316125 (worker, 3) is now being started. [worker-3]: I0423 21:42:22.633576 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:15507", "localhost:16395", "localhost:16639", "localhost:18544"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-2]: 2023-04-23 21:42:22.664885: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16639 [worker-0]: 2023-04-23 21:42:22.659289: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:15507 [worker-1]: 2023-04-23 21:42:22.682251: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16395 [worker-3]: 2023-04-23 21:42:22.686693: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:18544 [worker-0]: 2023-04-23 21:42:22.702553: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 13923407403693256237 [worker-2]: 2023-04-23 21:42:22.702945: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: 2023-04-23 21:42:22.702935: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:22.702631: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 8802669822409980452 [worker-0]: 2023-04-23 21:42:22.702677: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 2927583516118684597 [worker-0]: 2023-04-23 21:42:22.702870: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:22.714574: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 1145236043898294279 [worker-1]: 2023-04-23 21:42:22.714916: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-0]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: I0423 21:42:22.716784 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-1]: I0423 21:42:22.716936 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-0]: I0423 21:42:22.716851 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: I0423 21:42:22.716982 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-0]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-2]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-1]: I0423 21:42:22.773169 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-0]: I0423 21:42:22.773154 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-1]: INFO:tensorflow:Check health not enabled. [worker-2]: I0423 21:42:22.773185 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-1]: I0423 21:42:22.773727 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-0]: INFO:tensorflow:Check health not enabled. [worker-2]: INFO:tensorflow:Check health not enabled. [worker-1]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:22.773727 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-2]: I0423 21:42:22.773759 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-1]: I0423 21:42:22.773938 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:42:22.774369 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-2]: I0423 21:42:22.773978 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:22.773938 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: INFO:tensorflow:Check health not enabled. [worker-3]: I0423 21:42:22.774830 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:42:22.775051 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: I0423 21:42:22.813717 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: INFO:tensorflow:Start polling for termination signal. [worker-2]: I0423 21:42:22.814453 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: Exception in thread WorkerTerminationSignalWatcher-2: [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-2]: Traceback (most recent call last): [worker-2]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: I0423 21:42:22.814891 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-2]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: W0423 21:42:22.815318 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: INFO:tensorflow:Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: INFO:tensorflow:Start training at 0 [worker-3]: I0423 21:42:22.816953 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: I0423 21:42:22.816956 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: I0423 21:42:22.815623 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: self.run() [worker-1]: INFO:tensorflow:Start polling for termination signal. [worker-2]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-2]: self._target(*self._args, **self._kwargs) [worker-1]: I0423 21:42:22.818351 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: I0423 21:42:22.817905 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-0]: INFO:tensorflow:Start polling for termination signal. [worker-1]: Exception in thread WorkerTerminationSignalWatcher-1: [worker-3]: INFO:tensorflow:Start polling for termination signal. [worker-0]: I0423 21:42:22.818727 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: if self._termination_watcher_fn(): [worker-3]: I0423 21:42:22.819212 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: Exception in thread WorkerTerminationSignalWatcher-0: [worker-1]: Traceback (most recent call last): [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: Exception in thread WorkerTerminationSignalWatcher-3: [worker-1]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-3]: Traceback (most recent call last): [worker-1]: self.run() [worker-0]: Traceback (most recent call last): [worker-2]: elif frequent_send and not maintenance_event.is_set(): [worker-1]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-0]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-1]: self._target(*self._args, **self._kwargs) [worker-3]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: I0423 21:42:22.819204 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-2]: AttributeError: 'str' object has no attribute 'is_set' [worker-0]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: I0423 21:42:22.819969 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: Instructions for updating: [worker-1]: if self._termination_watcher_fn(): [worker-3]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: Instructions for updating: [worker-0]: W0423 21:42:22.819641 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-0]: Instructions for updating: [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: elif frequent_send and not maintenance_event.is_set(): [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: W0423 21:42:22.820379 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: INFO:tensorflow:Start training at 0 [worker-3]: Instructions for updating: [worker-1]: AttributeError: 'str' object has no attribute 'is_set' [worker-0]: I0423 21:42:22.819881 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-0]: self.run() [worker-3]: INFO:tensorflow:Start training at 0 [worker-1]: I0423 21:42:22.820500 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-3]: I0423 21:42:22.820565 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-3]: self.run() [worker-1]: Instructions for updating: [worker-3]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-0]: self._target(*self._args, **self._kwargs) [worker-3]: self._target(*self._args, **self._kwargs) [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-1]: W0423 21:42:22.820819 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-0]: if self._termination_watcher_fn(): [worker-1]: Instructions for updating: [worker-3]: if self._termination_watcher_fn(): [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-1]: INFO:tensorflow:Start training at 0 [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-0]: elif frequent_send and not maintenance_event.is_set(): [worker-3]: elif frequent_send and not maintenance_event.is_set(): [worker-1]: I0423 21:42:22.820983 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: AttributeError: 'str' object has no attribute 'is_set' [worker-3]: AttributeError: 'str' object has no attribute 'is_set' [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.008035 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.041275 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.038101 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.044907 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.162687 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.162852 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.162875 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.162897 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.215838 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.234192 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.249927 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.263863 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.344611 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.356796 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.349879 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.380450 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.430837 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.431489 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.433834 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.434332 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830a020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830a020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb83060c0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb8306020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:23.473984 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830a020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:23.474058 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830a020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:23.474330 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb83060c0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:23.474225 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb8306020> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.481862 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.482229 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.482402 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.482414 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb8307420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:23.541451 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb8307420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:epoch 0 finished [worker-1]: I0423 21:42:23.541840 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb8306980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:23.546599 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb8306980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:epoch 0 finished [worker-2]: I0423 21:42:23.546911 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-0]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830b420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:23.546352 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830b420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: INFO:tensorflow:epoch 0 finished [worker-0]: I0423 21:42:23.546707 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.554132 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830b420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:23.559111 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830b420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:epoch 0 finished [worker-3]: I0423 21:42:23.559480 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.597440 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.599785 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.590347 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.723350 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.723830 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.727022 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.718862 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.777082 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.777582 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.781681 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.778933 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:23.944484 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:23.944855 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:23.950408 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:23.954481 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.005471 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.005504 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.005793 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.009559 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.097989 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.130630 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.130917 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.140191 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 1 finished [worker-3]: I0423 21:42:24.231133 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.238716 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:epoch 1 finished [worker-1]: INFO:tensorflow:epoch 1 finished [worker-0]: I0423 21:42:24.247003 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: I0423 21:42:24.247134 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-2]: INFO:tensorflow:epoch 1 finished [worker-2]: I0423 21:42:24.247247 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.255253 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.257451 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.254281 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.328797 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.328798 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.331768 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.328584 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.385288 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.385621 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.385761 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.390423 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.440845 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.441277 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.441811 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.441836 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.524360 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.535304 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.550366 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.557698 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.639453 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.643611 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.650594 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.660082 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 2 finished [worker-0]: INFO:tensorflow:epoch 2 finished [worker-0]: I0423 21:42:24.713647 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-1]: INFO:tensorflow:epoch 2 finished [worker-3]: I0423 21:42:24.708667 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.721306 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:epoch 2 finished [worker-2]: I0423 21:42:24.720289 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.730348 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.714536 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.732711 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.750076 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.799647 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.803124 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.803744 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.824842 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.884676 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.894016 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.886207 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.890237 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:24.943605 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.943642 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:24.943654 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:24.943918 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:24.990981 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:25.014060 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:25.024140 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:25.037907 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:25.108323 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:25.108468 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:25.130138 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:25.137505 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 3 finished [worker-0]: INFO:tensorflow:epoch 3 finished [worker-0]: I0423 21:42:25.211379 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-3]: I0423 21:42:25.206488 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: INFO:tensorflow:epoch 3 finished [worker-2]: I0423 21:42:25.217167 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-1]: INFO:tensorflow:epoch 3 finished [worker-1]: I0423 21:42:25.221366 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:25.223858 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:25.221587 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:25.228196 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:25.239881 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:25.298798 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:25.300170 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:25.300988 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:25.319413 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:25.389450 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:25.393453 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:25.396064 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:25.404012 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:25.453970 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:25.464909 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:25.476422 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:25.480767 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:25.527084 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:25.531304 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:25.548569 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:25.569728 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:25.655921 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:25.658576 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:25.661470 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:25.661768 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 4 finished [worker-1]: INFO:tensorflow:epoch 4 finished [worker-2]: INFO:tensorflow:epoch 4 finished [worker-2]: I0423 21:42:25.760054 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-1]: I0423 21:42:25.758901 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-1]: INFO:tensorflow:Training finished. [worker-2]: INFO:tensorflow:Training finished. [worker-3]: I0423 21:42:25.757044 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-1]: I0423 21:42:25.759140 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: I0423 21:42:25.760281 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-3]: INFO:tensorflow:Training finished. [worker-3]: I0423 21:42:25.757298 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-0]: INFO:tensorflow:epoch 4 finished [worker-0]: I0423 21:42:25.766403 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-0]: INFO:tensorflow:Training finished. [worker-0]: I0423 21:42:25.766700 281473829860224 gce_failure_handler_test.py:244] Training finished. INFO:tensorflow:restarting workers I0423 21:42:27.590029 281473453224832 gce_failure_handler_test.py:411] restarting workers [worker-1]: I0423 21:42:27.637196 281473829860224 multi_process_runner.py:840] Subprocess with PID 2326873 (worker, 1) is now being started. [worker-1]: I0423 21:42:27.637757 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:15507", "localhost:16395", "localhost:16639", "localhost:18544"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-0]: I0423 21:42:27.660409 281473829860224 multi_process_runner.py:840] Subprocess with PID 2326870 (worker, 0) is now being started. [worker-0]: I0423 21:42:27.660968 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:15507", "localhost:16395", "localhost:16639", "localhost:18544"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-2]: I0423 21:42:27.669182 281473829860224 multi_process_runner.py:840] Subprocess with PID 2326885 (worker, 2) is now being started. [worker-2]: I0423 21:42:27.669726 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:15507", "localhost:16395", "localhost:16639", "localhost:18544"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' INFO:tensorflow:workers restarted I0423 21:42:27.671770 281473453224832 gce_failure_handler_test.py:415] workers restarted [worker-0]: 2023-04-23 21:42:27.695836: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:15507 [worker-1]: 2023-04-23 21:42:27.704614: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16395 [worker-2]: 2023-04-23 21:42:27.707247: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16639 [worker-0]: 2023-04-23 21:42:27.716647: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 613243580410731057 [worker-2]: 2023-04-23 21:42:27.716986: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:27.716711: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 8857142960403658228 [worker-0]: 2023-04-23 21:42:27.716967: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:27.741326: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 3737550762197295334 [worker-1]: 2023-04-23 21:42:27.742097: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: I0423 21:42:27.820841 281473829860224 multi_process_runner.py:840] Subprocess with PID 2327027 (worker, 3) is now being started. [worker-3]: I0423 21:42:27.821362 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:15507", "localhost:16395", "localhost:16639", "localhost:18544"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-0]: 2023-04-23 21:42:27.973511: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 9893050224369924687 [worker-0]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: 2023-04-23 21:42:27.966149: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:18544 [worker-0]: I0423 21:42:27.975659 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: I0423 21:42:27.976783 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: 2023-04-23 21:42:27.973709: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-3]: I0423 21:42:27.975704 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-2]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: I0423 21:42:27.983805 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-0]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-3]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-1]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-0]: I0423 21:42:28.030237 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-1]: I0423 21:42:28.030214 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-1]: INFO:tensorflow:Check health not enabled. [worker-1]: I0423 21:42:28.030691 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-1]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Check health not enabled. [worker-1]: I0423 21:42:28.030889 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:28.030662 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-0]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:28.030869 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:42:28.030260 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: INFO:tensorflow:Check health not enabled. [worker-3]: I0423 21:42:28.030663 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:42:28.030868 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-2]: I0423 21:42:28.039871 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-2]: INFO:tensorflow:Check health not enabled. [worker-2]: I0423 21:42:28.040339 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-2]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: I0423 21:42:28.040544 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:15507', 'localhost:16395', 'localhost:16639', 'localhost:18544']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: I0423 21:42:28.081337 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: INFO:tensorflow:Start polling for termination signal. [worker-2]: I0423 21:42:28.082246 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: Exception in thread WorkerTerminationSignalWatcher-2: [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-2]: Traceback (most recent call last): [worker-2]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: I0423 21:42:28.083128 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-2]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: W0423 21:42:28.083574 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: INFO:tensorflow:Start training at 0 [worker-2]: I0423 21:42:28.083751 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-2]: self.run() [worker-2]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-2]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-2]: if self._termination_watcher_fn(): [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-2]: elif frequent_send and not maintenance_event.is_set(): [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: AttributeError: 'str' object has no attribute 'is_set' [worker-0]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: I0423 21:42:28.098258 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-0]: INFO:tensorflow:Start polling for termination signal. [worker-0]: I0423 21:42:28.099100 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: Exception in thread WorkerTerminationSignalWatcher-0: [worker-0]: Traceback (most recent call last): [worker-0]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-0]: self.run() [worker-0]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-0]: self._target(*self._args, **self._kwargs) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-0]: I0423 21:42:28.099534 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Instructions for updating: [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: W0423 21:42:28.101093 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: Instructions for updating: [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: INFO:tensorflow:Start training at 0 [worker-0]: I0423 21:42:28.101266 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: if self._termination_watcher_fn(): [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-0]: elif frequent_send and not maintenance_event.is_set(): [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: AttributeError: 'str' object has no attribute 'is_set' [worker-1]: INFO:tensorflow:Start watcher for peer's signal. [worker-1]: I0423 21:42:28.116684 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-3]: INFO:tensorflow:Start watcher for peer's signal. [worker-3]: I0423 21:42:28.132909 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start polling for termination signal. [worker-1]: I0423 21:42:28.136272 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-3]: INFO:tensorflow:Start polling for termination signal. [worker-3]: I0423 21:42:28.137290 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-1]: Exception in thread WorkerTerminationSignalWatcher-1: [worker-3]: Exception in thread WorkerTerminationSignalWatcher-3: [worker-1]: Traceback (most recent call last): [worker-3]: Traceback (most recent call last): [worker-3]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-3]: self.run() [worker-3]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-3]: self._target(*self._args, **self._kwargs) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: if self._termination_watcher_fn(): [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-3]: elif frequent_send and not maintenance_event.is_set(): [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: AttributeError: 'str' object has no attribute 'is_set' [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-3]: I0423 21:42:28.158302 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-3]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: Instructions for updating: [worker-2]: I0423 21:42:28.166905 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-3]: W0423 21:42:28.158739 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: self.run() [worker-3]: Instructions for updating: [worker-1]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: self._target(*self._args, **self._kwargs) [worker-3]: INFO:tensorflow:Start training at 0 [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: I0423 21:42:28.158920 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: if self._termination_watcher_fn(): [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-1]: elif frequent_send and not maintenance_event.is_set(): [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: AttributeError: 'str' object has no attribute 'is_set' [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: I0423 21:42:28.158027 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: W0423 21:42:28.158344 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: INFO:tensorflow:Start training at 0 [worker-1]: I0423 21:42:28.158500 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.192049 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.267709 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.283466 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.354662 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.360448 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.347124 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.354927 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.412221 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.412294 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.424414 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.423568 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.474385 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.474376 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.482302 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.480152 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.561362 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.560652 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.570449 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.600312 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e660> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:28.647860 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e660> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:28.649353 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.654906 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e660> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:28.656457 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e660> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.658978 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.663522 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830a7a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:28.666534 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830a7a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.700113 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f1a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830b1a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f1a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830eac0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:28.747415 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f1a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:28.747630 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f1a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:28.747679 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830eac0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:28.747428 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830b1a0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:epoch 0 finished [worker-2]: INFO:tensorflow:epoch 0 finished [worker-1]: INFO:tensorflow:epoch 0 finished [worker-0]: INFO:tensorflow:epoch 0 finished [worker-2]: I0423 21:42:28.747963 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-3]: I0423 21:42:28.747846 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-1]: I0423 21:42:28.747915 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-0]: I0423 21:42:28.747729 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.754643 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.754889 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.754994 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.756056 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.803028 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.803248 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.805403 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.806238 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.856322 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.856506 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.857007 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.872388 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.918000 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.918000 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.919402 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.919495 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:28.976859 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:28.978429 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:28.990231 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:28.993947 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.041937 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.042446 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.042665 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.042763 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:epoch 1 finished [worker-0]: I0423 21:42:29.097602 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.104538 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 1 finished [worker-3]: I0423 21:42:29.106547 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: INFO:tensorflow:epoch 1 finished [worker-1]: I0423 21:42:29.112579 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-2]: INFO:tensorflow:epoch 1 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.116838 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-3]: I0423 21:42:29.115082 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.124611 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.125210 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.197536 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.204243 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.206330 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.224036 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.290803 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.303720 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.304771 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.303232 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.359011 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.402927 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.405137 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.429614 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.477997 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.477906 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.478065 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.478435 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.535844 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.537467 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.537922 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.538489 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 2 finished [worker-0]: INFO:tensorflow:epoch 2 finished [worker-1]: INFO:tensorflow:epoch 2 finished [worker-2]: INFO:tensorflow:epoch 2 finished [worker-3]: I0423 21:42:29.583353 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-0]: I0423 21:42:29.583559 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: I0423 21:42:29.583880 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-1]: I0423 21:42:29.583667 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.592338 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.597085 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.596881 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.621128 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.673202 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.674817 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.693048 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.705267 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.806380 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:29.806368 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.822560 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:29.840690 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:29.977470 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:29.977036 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:30.000377 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:30.010391 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:30.113277 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:30.103962 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:30.120721 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:30.140478 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:30.194264 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:30.198383 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:30.199601 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:30.230387 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 3 finished [worker-3]: I0423 21:42:30.303863 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-1]: INFO:tensorflow:epoch 3 finished [worker-1]: I0423 21:42:30.320333 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-0]: INFO:tensorflow:epoch 3 finished [worker-0]: I0423 21:42:30.316649 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:30.326268 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:epoch 3 finished [worker-2]: I0423 21:42:30.336439 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:30.338388 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:30.340714 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:30.378816 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:30.430259 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:30.431597 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:30.431650 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:30.434881 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:30.561813 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:30.561895 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:30.566290 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:30.574391 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:30.656627 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:30.657024 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:30.657114 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:30.656985 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:30.708811 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:30.708997 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:30.709666 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:30.708824 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:30.759711 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:30.759765 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:30.759841 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:30.760019 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 4 finished [worker-0]: INFO:tensorflow:epoch 4 finished [worker-3]: I0423 21:42:30.803279 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-0]: I0423 21:42:30.803505 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-1]: INFO:tensorflow:epoch 4 finished [worker-2]: INFO:tensorflow:epoch 4 finished [worker-3]: INFO:tensorflow:Training finished. [worker-0]: INFO:tensorflow:Training finished. [worker-1]: I0423 21:42:30.803898 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-2]: I0423 21:42:30.803990 281473829860224 gce_failure_handler_test.py:192] epoch 4 finished [worker-0]: I0423 21:42:30.803822 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-1]: INFO:tensorflow:Training finished. [worker-3]: I0423 21:42:30.803617 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-2]: INFO:tensorflow:Training finished. [worker-2]: I0423 21:42:30.804313 281473829860224 gce_failure_handler_test.py:244] Training finished. [worker-1]: I0423 21:42:30.804157 281473829860224 gce_failure_handler_test.py:244] Training finished. I0423 21:42:31.627614 281473453224832 multi_process_runner.py:646] worker-0 exit code: 0 I0423 21:42:31.627878 281473453224832 multi_process_runner.py:646] worker-1 exit code: 0 I0423 21:42:31.627992 281473453224832 multi_process_runner.py:646] worker-2 exit code: 0 I0423 21:42:31.628095 281473453224832 multi_process_runner.py:646] worker-3 exit code: 0 I0423 21:42:31.630592 281473453224832 multi_process_runner.py:662] Joining log reading threads. I0423 21:42:31.630781 281473453224832 multi_process_runner.py:665] Joined log reading threads. INFO:tensorflow:time(__main__.GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_0_inputarg_manager_strategyoption_MWMSmultiworker): 9.34s I0423 21:42:31.772393 281473453224832 test_util.py:2462] time(__main__.GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_0_inputarg_manager_strategyoption_MWMSmultiworker): 9.34s [ OK ] GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_0_inputarg_manager_strategyoption_MWMSmultiworker [ RUN ] GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker INFO:tensorflow:Using local port 16671 I0423 21:42:31.831659 281473453224832 test_util.py:3794] Using local port 16671 INFO:tensorflow:Using local port 23400 I0423 21:42:31.832165 281473453224832 test_util.py:3794] Using local port 23400 INFO:tensorflow:Using local port 23929 I0423 21:42:31.832507 281473453224832 test_util.py:3794] Using local port 23929 INFO:tensorflow:Using local port 19346 I0423 21:42:31.832833 281473453224832 test_util.py:3794] Using local port 19346 INFO:tensorflow:Cluster starting. I0423 21:42:31.908308 281473453224832 gce_failure_handler_test.py:405] Cluster starting. [worker-0]: I0423 21:42:31.954873 281473829860224 multi_process_runner.py:840] Subprocess with PID 2335833 (worker, 0) is now being started. [worker-0]: I0423 21:42:31.955438 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23400", "localhost:23929", "localhost:19346"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-1]: I0423 21:42:31.962634 281473829860224 multi_process_runner.py:840] Subprocess with PID 2335845 (worker, 1) is now being started. [worker-2]: I0423 21:42:31.962738 281473829860224 multi_process_runner.py:840] Subprocess with PID 2335855 (worker, 2) is now being started. [worker-1]: I0423 21:42:31.963206 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23400", "localhost:23929", "localhost:19346"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-2]: I0423 21:42:31.963243 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23400", "localhost:23929", "localhost:19346"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:42:31.970278 281473829860224 multi_process_runner.py:840] Subprocess with PID 2335861 (worker, 3) is now being started. [worker-3]: I0423 21:42:31.970812 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23400", "localhost:23929", "localhost:19346"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-0]: 2023-04-23 21:42:31.993442: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16671 [worker-2]: 2023-04-23 21:42:32.006034: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:23929 [worker-3]: 2023-04-23 21:42:32.009229: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:19346 [worker-0]: 2023-04-23 21:42:32.019879: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 13818649246497369067 [worker-2]: 2023-04-23 21:42:32.020141: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:32.019978: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 17330015133661554317 [worker-0]: 2023-04-23 21:42:32.020150: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: 2023-04-23 21:42:32.021753: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 16504726903948456385 [worker-3]: 2023-04-23 21:42:32.021913: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-1]: 2023-04-23 21:42:32.047046: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:23400 [worker-0]: 2023-04-23 21:42:32.063668: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 11939258956407075104 [worker-1]: 2023-04-23 21:42:32.064060: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-0]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-0]: I0423 21:42:32.066221 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-2]: I0423 21:42:32.066205 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-3]: I0423 21:42:32.066322 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1'] [worker-1]: INFO:tensorflow:Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: I0423 21:42:32.066299 281473829860224 collective_all_reduce_strategy.py:531] Enabled multi-worker collective ops with available devices: ['/job:worker/replica:0/task:1/device:CPU:0', '/job:worker/replica:0/task:1/device:CPU:1', '/job:worker/replica:0/task:0/device:CPU:0', '/job:worker/replica:0/task:0/device:CPU:1', '/job:worker/replica:0/task:2/device:CPU:0', '/job:worker/replica:0/task:2/device:CPU:1', '/job:worker/replica:0/task:3/device:CPU:0', '/job:worker/replica:0/task:3/device:CPU:1'] [worker-1]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-2]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-0]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-1]: I0423 21:42:32.125237 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:1/device:CPU:0',) [worker-0]: I0423 21:42:32.125240 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:0/device:CPU:0',) [worker-2]: I0423 21:42:32.125257 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:2/device:CPU:0',) [worker-1]: INFO:tensorflow:Check health not enabled. [worker-1]: I0423 21:42:32.125740 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-1]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16671', 'localhost:23400', 'localhost:23929', 'localhost:19346']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-1]: I0423 21:42:32.125951 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16671', 'localhost:23400', 'localhost:23929', 'localhost:19346']}, task_type = 'worker', task_id = 1, num_workers = 4, local_devices = ('/job:worker/task:1/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: INFO:tensorflow:Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: I0423 21:42:32.127384 281473829860224 mirrored_strategy.py:419] Using MirroredStrategy with devices ('/job:worker/task:3/device:CPU:0',) [worker-3]: INFO:tensorflow:Check health not enabled. [worker-3]: I0423 21:42:32.127912 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-3]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16671', 'localhost:23400', 'localhost:23929', 'localhost:19346']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-3]: I0423 21:42:32.128133 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16671', 'localhost:23400', 'localhost:23929', 'localhost:19346']}, task_type = 'worker', task_id = 3, num_workers = 4, local_devices = ('/job:worker/task:3/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: INFO:tensorflow:Check health not enabled. [worker-2]: I0423 21:42:32.125800 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-2]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16671', 'localhost:23400', 'localhost:23929', 'localhost:19346']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-2]: I0423 21:42:32.126007 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16671', 'localhost:23400', 'localhost:23929', 'localhost:19346']}, task_type = 'worker', task_id = 2, num_workers = 4, local_devices = ('/job:worker/task:2/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Check health not enabled. [worker-0]: I0423 21:42:32.125785 281473829860224 collective_all_reduce_strategy.py:574] Check health not enabled. [worker-0]: INFO:tensorflow:MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16671', 'localhost:23400', 'localhost:23929', 'localhost:19346']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: I0423 21:42:32.125992 281473829860224 collective_all_reduce_strategy.py:576] MultiWorkerMirroredStrategy with cluster_spec = {'worker': ['localhost:16671', 'localhost:23400', 'localhost:23929', 'localhost:19346']}, task_type = 'worker', task_id = 0, num_workers = 4, local_devices = ('/job:worker/task:0/device:CPU:0',), communication = CommunicationImplementation.AUTO [worker-0]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: I0423 21:42:32.181214 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-3]: INFO:tensorflow:Start watcher for peer's signal. [worker-1]: INFO:tensorflow:Start watcher for peer's signal. [worker-2]: INFO:tensorflow:Start watcher for peer's signal. [worker-0]: INFO:tensorflow:Start polling for termination signal. [worker-1]: I0423 21:42:32.182131 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-2]: I0423 21:42:32.182181 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-3]: I0423 21:42:32.182107 281473829860224 failure_handling.py:634] Start watcher for peer's signal. [worker-0]: I0423 21:42:32.182502 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-1]: INFO:tensorflow:Start polling for termination signal. [worker-2]: INFO:tensorflow:Start polling for termination signal. [worker-3]: INFO:tensorflow:Start polling for termination signal. [worker-0]: Exception in thread WorkerTerminationSignalWatcher-0: [worker-1]: I0423 21:42:32.182950 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-2]: I0423 21:42:32.182921 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-1]: Exception in thread WorkerTerminationSignalWatcher-1: [worker-3]: I0423 21:42:32.182796 281473829860224 failure_handling.py:683] Start polling for termination signal. [worker-0]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: Traceback (most recent call last): [worker-2]: Exception in thread WorkerTerminationSignalWatcher-2: [worker-3]: Exception in thread WorkerTerminationSignalWatcher-3: [worker-0]: I0423 21:42:32.183012 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-3]: Traceback (most recent call last): [worker-2]: Traceback (most recent call last): [worker-0]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-2]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-0]: Instructions for updating: [worker-3]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-2]: I0423 21:42:32.183477 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: I0423 21:42:32.183242 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-2]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: W0423 21:42:32.183397 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-3]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-0]: Instructions for updating: [worker-3]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: Track steps using a tf.Variable saved in checkpoint instead. [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: W0423 21:42:32.184016 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-0]: INFO:tensorflow:Start training at 0 [worker-3]: W0423 21:42:32.183801 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-2]: Instructions for updating: [worker-0]: I0423 21:42:32.183570 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-3]: Instructions for updating: [worker-2]: Track steps using a tf.Variable saved in checkpoint instead. [worker-0]: Traceback (most recent call last): [worker-3]: Track steps using a tf.Variable saved in checkpoint instead. [worker-2]: INFO:tensorflow:Start training at 0 [worker-0]: File "/usr/lib/python3.11/threading.py", line 1038, in _bootstrap_inner [worker-3]: INFO:tensorflow:Start training at 0 [worker-2]: I0423 21:42:32.184203 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: self.run() [worker-2]: self.run() [worker-3]: I0423 21:42:32.183975 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-0]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-3]: self.run() [worker-2]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-0]: self._target(*self._args, **self._kwargs) [worker-2]: self._target(*self._args, **self._kwargs) [worker-3]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-3]: self._target(*self._args, **self._kwargs) [worker-0]: if self._termination_watcher_fn(): [worker-2]: if self._termination_watcher_fn(): [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: if self._termination_watcher_fn(): [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: elif frequent_send and not maintenance_event.is_set(): [worker-2]: elif frequent_send and not maintenance_event.is_set(): [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: elif frequent_send and not maintenance_event.is_set(): [worker-0]: AttributeError: 'str' object has no attribute 'is_set' [worker-2]: AttributeError: 'str' object has no attribute 'is_set' [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: AttributeError: 'str' object has no attribute 'is_set' [worker-1]: INFO:tensorflow:PreemptionCheckpointHandler initialized or restored. [worker-1]: I0423 21:42:32.183642 281473829860224 failure_handling.py:538] PreemptionCheckpointHandler initialized or restored. [worker-1]: WARNING:tensorflow:From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: W0423 21:42:32.183956 281473829860224 deprecation.py:364] From /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py:195: PreemptionCheckpointHandler.total_run_calls (from tensorflow.python.distribute.failure_handling.failure_handling) is deprecated and will be removed in a future version. [worker-1]: Instructions for updating: [worker-1]: Track steps using a tf.Variable saved in checkpoint instead. [worker-1]: INFO:tensorflow:Start training at 0 [worker-1]: I0423 21:42:32.184126 281473829860224 gce_failure_handler_test.py:194] Start training at 0 [worker-1]: self.run() [worker-1]: File "/usr/lib/python3.11/threading.py", line 975, in run [worker-1]: self._target(*self._args, **self._kwargs) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 692, in _poll_termination_signal [worker-1]: if self._termination_watcher_fn(): [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 145, in mock_termination_watcher_function_gce [worker-1]: elif frequent_send and not maintenance_event.is_set(): [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: AttributeError: 'str' object has no attribute 'is_set' [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:32.479665 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:32.625939 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:32.653080 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:32.658297 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:32.719845 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:32.719941 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:32.720174 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:32.720674 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:32.774692 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:32.775246 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:32.775245 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:32.779639 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:32.858401 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:32.874851 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:32.875115 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:32.880851 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:32.934506 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:32.935140 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:32.945237 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:32.945656 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830a980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:32.998371 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830a980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:32.998566 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e980> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:32.998501 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e840> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: WARNING:tensorflow:5 out of the last 5 calls to .wrapped_fn at 0xffffb830e8e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:32.998080 281473829860224 polymorphic_function.py:158] 5 out of the last 5 calls to .wrapped_fn at 0xffffb830e8e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.005967 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.006148 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.006163 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.006582 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f2e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: W0423 21:42:33.058821 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f2e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-2]: W0423 21:42:33.058916 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f420> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-1]: INFO:tensorflow:epoch 0 finished [worker-2]: INFO:tensorflow:epoch 0 finished [worker-1]: I0423 21:42:33.059190 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-2]: I0423 21:42:33.059236 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-3]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830f2e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: WARNING:tensorflow:6 out of the last 6 calls to .wrapped_fn at 0xffffb830b2e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: W0423 21:42:33.062434 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830f2e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-0]: W0423 21:42:33.058568 281473829860224 polymorphic_function.py:158] 6 out of the last 6 calls to .wrapped_fn at 0xffffb830b2e0> triggered tf.function retracing. Tracing is expensive and the excessive number of tracings could be due to (1) creating @tf.function repeatedly in a loop, (2) passing tensors with different shapes, (3) passing Python objects instead of tensors. For (1), please define your @tf.function outside of the loop. For (2), @tf.function has reduce_retracing=True option that can avoid unnecessary retracing. For (3), please refer to https://www.tensorflow.org/guide/function#controlling_retracing and https://www.tensorflow.org/api_docs/python/tf/function for more details. [worker-3]: INFO:tensorflow:epoch 0 finished [worker-0]: INFO:tensorflow:epoch 0 finished [worker-3]: I0423 21:42:33.062772 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-0]: I0423 21:42:33.058928 281473829860224 gce_failure_handler_test.py:192] epoch 0 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.067451 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.067032 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.067533 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.070927 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.121034 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.121170 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.121790 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.122010 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.173773 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.183691 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.184281 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.184782 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.231733 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.233073 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.233097 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.236026 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.293561 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.293920 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.293953 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.296386 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.347650 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.347741 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.347807 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.347859 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 1 finished [worker-0]: INFO:tensorflow:epoch 1 finished [worker-3]: I0423 21:42:33.390179 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: INFO:tensorflow:epoch 1 finished [worker-2]: INFO:tensorflow:epoch 1 finished [worker-0]: I0423 21:42:33.390334 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: I0423 21:42:33.390561 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-2]: I0423 21:42:33.390552 281473829860224 gce_failure_handler_test.py:192] epoch 1 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.398214 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.397597 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.397603 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.397571 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.477528 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.496750 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.521649 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.532306 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.583705 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.619931 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.629726 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.650043 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.720374 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.723594 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.724219 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.724255 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.773601 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.774282 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.774822 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.784393 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.835072 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.835433 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.835479 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.837929 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 2 finished [worker-3]: I0423 21:42:33.906215 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:33.913159 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:epoch 2 finished [worker-1]: INFO:tensorflow:epoch 2 finished [worker-0]: I0423 21:42:33.917433 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-1]: I0423 21:42:33.917135 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:33.924711 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:epoch 2 finished [worker-2]: I0423 21:42:33.936899 281473829860224 gce_failure_handler_test.py:192] epoch 2 finished [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:33.944127 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:33.944090 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:34.000378 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:34.003201 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:34.003592 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:34.005671 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:34.054600 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:34.054629 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:34.055171 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:34.054608 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:34.106643 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:34.106785 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:34.106864 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:34.107386 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:34.158698 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:34.158728 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:34.158727 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:34.159569 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:34.244162 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:34.250205 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:34.253743 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:34.249775 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:epoch 3 finished [worker-0]: INFO:tensorflow:epoch 3 finished [worker-1]: INFO:tensorflow:epoch 3 finished [worker-3]: I0423 21:42:34.331147 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-0]: I0423 21:42:34.331295 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: INFO:tensorflow:epoch 3 finished [worker-1]: I0423 21:42:34.331466 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-2]: I0423 21:42:34.331874 281473829860224 gce_failure_handler_test.py:192] epoch 3 finished [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:34.339122 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:34.339761 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:34.340180 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:34.341609 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:34.419472 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:34.430140 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:34.430982 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:34.456003 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-2]: I0423 21:42:34.510538 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-3]: I0423 21:42:34.511099 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-1]: I0423 21:42:34.511334 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: INFO:tensorflow:Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: I0423 21:42:34.511126 281473829860224 cross_device_ops.py:1151] Collective all_reduce tensors: 1 all_reduces, num_devices = 1, group_size = 4, implementation = CommunicationImplementation.AUTO, num_packs = 1 [worker-0]: 2023-04-23 21:42:34.550790: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-0]: 2023-04-23 21:42:34.550902: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.551775: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: ABORTED: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.551831: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort ABORTED: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-1]: 2023-04-23 21:42:34.552598: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort ABORTED: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-2]: 2023-04-23 21:42:34.552675: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: ABORTED: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-1]: The error could be from a previous operation. Restart your program to reset. [worker-2]: 2023-04-23 21:42:34.552720: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort ABORTED: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-1]: :{"created":"@1682286154.552458505","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [type.googleapis.com/tensorflow.DerivedStatus=''] [worker-1]: 2023-04-23 21:42:34.552723: I tensorflow/core/common_runtime/executor.cc:1210] [/job:worker/replica:0/task:1/device:CPU:0] (DEBUG INFO) Executor start aborting (this does not indicate an error and you can ignore this message): ABORTED: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-1]: The error could be from a previous operation. Restart your program to reset. [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-1]: :{"created":"@1682286154.552458505","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-1]: The error could be from a previous operation. Restart your program to reset. [worker-1]: [[{{node CollectiveReduceV2}}]] [type.googleapis.com/tensorflow.DerivedStatus=''] [worker-1]: 2023-04-23 21:42:34.553293: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: ABORTED: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-2]: 2023-04-23 21:42:34.554872: I tensorflow/core/common_runtime/executor.cc:1210] [/job:worker/replica:0/task:2/device:CPU:0] (DEBUG INFO) Executor start aborting (this does not indicate an error and you can ignore this message): ABORTED: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-2]: The error could be from a previous operation. Restart your program to reset. [worker-2]: [[{{node CollectiveReduceV2}}]] [type.googleapis.com/tensorflow.DerivedStatus=''] [worker-0]: 2023-04-23 21:42:34.555587: I tensorflow/core/common_runtime/executor.cc:1210] [/job:worker/replica:0/task:0/device:CPU:0] (DEBUG INFO) Executor start aborting (this does not indicate an error and you can ignore this message): ABORTED: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-0]: The error could be from a previous operation. Restart your program to reset. [worker-3]: 2023-04-23 21:42:34.556252: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort ABORTED: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286154.552650920","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [type.googleapis.com/tensorflow.DerivedStatus=''] [worker-3]: 2023-04-23 21:42:34.556388: I tensorflow/core/common_runtime/executor.cc:1210] [/job:worker/replica:0/task:3/device:CPU:0] (DEBUG INFO) Executor start aborting (this does not indicate an error and you can ignore this message): ABORTED: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286154.552650920","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-3]: [[{{node CollectiveReduceV2}}]] [type.googleapis.com/tensorflow.DerivedStatus=''] [worker-0]: [[{{node CollectiveReduceV2}}]] [type.googleapis.com/tensorflow.DerivedStatus=''] [worker-0]: 2023-04-23 21:42:34.557138: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-0]: 2023-04-23 21:42:34.557221: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.564290: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.564372: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.567238: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.567310: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.567544: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.567597: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.572874: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.572943: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.573012: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.573093: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.576108: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.576197: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.585323: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-0]: 2023-04-23 21:42:34.585422: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.585494: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.585592: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.598242: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.598326: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.606296: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.606399: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.609534: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.609630: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.613654: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.613736: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.622816: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.622912: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.625909: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.625983: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.646311: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.646404: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.657228: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.657340: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.657411: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.657515: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.663232: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.663326: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.681102: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.681193: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.692115: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.692208: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.692635: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.692731: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.698275: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.698360: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.804126: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.804239: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.809612: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.809708: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.829233: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.829339: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.835604: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.835690: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.842350: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.842438: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.842481: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.842521: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-0]: 2023-04-23 21:42:34.879820: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.879918: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-0]: 2023-04-23 21:42:34.912490: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.912579: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-0]: 2023-04-23 21:42:34.912650: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [worker-0]: 2023-04-23 21:42:34.912747: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect while it is already in error. ResetTask() should be called before a subsequent connect attempt. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-3]: INFO:tensorflow:Propagating error to cluster: AbortedError(): Graph execution error: [worker-3]: [worker-1]: INFO:tensorflow:Propagating error to cluster: AbortedError(): Graph execution error: [worker-0]: INFO:tensorflow:Propagating error to cluster: AbortedError(): Graph execution error: [worker-3]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-0]: [worker-1]: [worker-2]: INFO:tensorflow:Propagating error to cluster: AbortedError(): Graph execution error: [worker-1]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-2]: [worker-0]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-2]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-1]: test_util.main() [worker-3]: test_util.main() [worker-1]: File "", line 1, in [worker-3]: File "", line 1, in [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-1]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-3]: code = _serve_one(child_r, fds, [worker-1]: code = _serve_one(child_r, fds, [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-1]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-3]: code = spawn._main(child_r, parent_sentinel) [worker-1]: code = spawn._main(child_r, parent_sentinel) [worker-3]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-1]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-3]: return self._bootstrap(parent_sentinel) [worker-1]: return self._bootstrap(parent_sentinel) [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-1]: self.run() [worker-3]: self.run() [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-1]: self._target(*self._args, **self._kwargs) [worker-3]: self._target(*self._args, **self._kwargs) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-1]: preemption_handler.run(distributed_train_step, epoch, step) [worker-3]: preemption_handler.run(distributed_train_step, epoch, step) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-1]: strategy.run(train_step) [worker-3]: strategy.run(train_step) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-3]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-3]: Node: 'CollectiveReduceV2' [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-3]: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-0]: test_util.main() [worker-2]: test_util.main() [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-1]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-1]: Node: 'CollectiveReduceV2' [worker-1]: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-2]: File "", line 1, in [worker-1]: The error could be from a previous operation. Restart your program to reset. [worker-2]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-0]: File "", line 1, in [worker-2]: code = _serve_one(child_r, fds, [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-2]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-1]: :{"created":"@1682286154.552458505","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-2]: code = spawn._main(child_r, parent_sentinel) [worker-1]: The error could be from a previous operation. Restart your program to reset. [worker-2]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-1]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [worker-2]: return self._bootstrap(parent_sentinel) [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-1]: I0423 21:42:35.180688 281473829860224 failure_handling.py:918] Propagating error to cluster: AbortedError(): Graph execution error: [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-0]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-2]: self.run() [worker-0]: code = _serve_one(child_r, fds, [worker-1]: [worker-3]: :{"created":"@1682286154.552650920","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-0]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-1]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-2]: self._target(*self._args, **self._kwargs) [worker-0]: code = spawn._main(child_r, parent_sentinel) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-3]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-0]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-1]: test_util.main() [worker-3]: I0423 21:42:35.181048 281473829860224 failure_handling.py:918] Propagating error to cluster: AbortedError(): Graph execution error: [worker-2]: preemption_handler.run(distributed_train_step, epoch, step) [worker-0]: return self._bootstrap(parent_sentinel) [worker-1]: File "", line 1, in [worker-3]: [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-0]: self.run() [worker-1]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-3]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-2]: strategy.run(train_step) [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-0]: self._target(*self._args, **self._kwargs) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-0]: preemption_handler.run(distributed_train_step, epoch, step) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-0]: strategy.run(train_step) [worker-1]: code = _serve_one(child_r, fds, [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-3]: test_util.main() [worker-3]: File "", line 1, in [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-3]: code = _serve_one(child_r, fds, [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-3]: code = spawn._main(child_r, parent_sentinel) [worker-3]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-0]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-3]: return self._bootstrap(parent_sentinel) [worker-0]: Node: 'CollectiveReduceV2' [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-0]: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-3]: self.run() [worker-0]: The error could be from a previous operation. Restart your program to reset. [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-0]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_665] [worker-3]: self._target(*self._args, **self._kwargs) [worker-0]: I0423 21:42:35.181595 281473829860224 failure_handling.py:918] Propagating error to cluster: AbortedError(): Graph execution error: [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-0]: [worker-3]: preemption_handler.run(distributed_train_step, epoch, step) [worker-0]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-3]: strategy.run(train_step) [worker-0]: test_util.main() [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-0]: File "", line 1, in [worker-3]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-0]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-3]: Node: 'CollectiveReduceV2' [worker-0]: code = _serve_one(child_r, fds, [worker-3]: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-0]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-0]: code = spawn._main(child_r, parent_sentinel) [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-0]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-3]: :{"created":"@1682286154.552650920","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-0]: return self._bootstrap(parent_sentinel) [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-1]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-3]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [worker-0]: self.run() [worker-3]: 2023-04-23 21:42:35.181405: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: ABORTED: Graph execution error: [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-0]: self._target(*self._args, **self._kwargs) [worker-1]: code = spawn._main(child_r, parent_sentinel) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-3]: [worker-2]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-2]: Node: 'CollectiveReduceV2' [worker-2]: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-2]: The error could be from a previous operation. Restart your program to reset. [worker-2]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [worker-2]: I0423 21:42:35.186985 281473829860224 failure_handling.py:918] Propagating error to cluster: AbortedError(): Graph execution error: [worker-2]: [worker-2]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-2]: test_util.main() [worker-2]: File "", line 1, in [worker-2]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-2]: code = _serve_one(child_r, fds, [worker-2]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-2]: code = spawn._main(child_r, parent_sentinel) [worker-2]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-2]: return self._bootstrap(parent_sentinel) [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-2]: self.run() [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-2]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-2]: preemption_handler.run(distributed_train_step, epoch, step) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-2]: strategy.run(train_step) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-2]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-2]: Node: 'CollectiveReduceV2' [worker-2]: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-2]: The error could be from a previous operation. Restart your program to reset. [worker-2]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [worker-2]: INFO:tensorflow:Ignoring error during error propagation: FailedPreconditionError():Coordination service agent is already in error state. [worker-2]: I0423 21:42:35.187345 281473829860224 failure_handling.py:922] Ignoring error during error propagation: FailedPreconditionError():Coordination service agent is already in error state. [worker-0]: preemption_handler.run(distributed_train_step, epoch, step) [worker-1]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-3]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-1]: return self._bootstrap(parent_sentinel) [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-3]: test_util.main() [worker-0]: strategy.run(train_step) [worker-1]: self.run() [worker-3]: File "", line 1, in [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-1]: self._target(*self._args, **self._kwargs) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-3]: code = _serve_one(child_r, fds, [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-3]: code = spawn._main(child_r, parent_sentinel) [worker-3]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-1]: preemption_handler.run(distributed_train_step, epoch, step) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-3]: return self._bootstrap(parent_sentinel) [worker-0]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-1]: strategy.run(train_step) [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-3]: self.run() [worker-0]: Node: 'CollectiveReduceV2' [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-1]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-3]: self._target(*self._args, **self._kwargs) [worker-1]: Node: 'CollectiveReduceV2' [worker-0]: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-0]: The error could be from a previous operation. Restart your program to reset. [worker-1]: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-3]: preemption_handler.run(distributed_train_step, epoch, step) [worker-0]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_665] [worker-1]: The error could be from a previous operation. Restart your program to reset. [worker-0]: INFO:tensorflow:Ignoring error during error propagation: FailedPreconditionError():Coordination service agent is already in error state. [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-0]: I0423 21:42:35.181956 281473829860224 failure_handling.py:922] Ignoring error during error propagation: FailedPreconditionError():Coordination service agent is already in error state. [worker-1]: :{"created":"@1682286154.552458505","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-3]: strategy.run(train_step) [worker-1]: The error could be from a previous operation. Restart your program to reset. [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-1]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [worker-3]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-1]: INFO:tensorflow:Ignoring error during error propagation: FailedPreconditionError():Coordination service agent is already in error state. [worker-3]: Node: 'CollectiveReduceV2' [worker-1]: I0423 21:42:35.181105 281473829860224 failure_handling.py:922] Ignoring error during error propagation: FailedPreconditionError():Coordination service agent is already in error state. [worker-3]: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286154.552650920","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-3]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [type.googleapis.com/tensorflow.CoordinationServiceError='\x18\x01\"\n\n\x06worker\x10\x03'] [worker-3]: 2023-04-23 21:42:35.181452: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:433] Reporting error to coordination service: ABORTED: Graph execution error: [worker-3]: [worker-3]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-3]: test_util.main() [worker-3]: File "", line 1, in [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-3]: code = _serve_one(child_r, fds, [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-3]: code = spawn._main(child_r, parent_sentinel) [worker-3]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-3]: return self._bootstrap(parent_sentinel) [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-3]: self.run() [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-3]: self._target(*self._args, **self._kwargs) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-3]: preemption_handler.run(distributed_train_step, epoch, step) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-3]: strategy.run(train_step) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-3]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-3]: Node: 'CollectiveReduceV2' [worker-3]: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286154.552650920","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-3]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [worker-3]: 2023-04-23 21:42:35.182563: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:445] Encountered another error when reporting error to coordination service: FAILED_PRECONDITION: The task is not connected or already has an error. [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286155.182470935","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"The task is not connected or already has an error.","grpc_status":9} [type.googleapis.com/tensorflow.CoordinationServiceError=''] [worker-3]: Process _Process-37: [worker-1]: Process _Process-35: [worker-0]: Process _Process-34: [worker-2]: Process _Process-36: [worker-3]: Traceback (most recent call last): [worker-0]: Traceback (most recent call last): [worker-1]: Traceback (most recent call last): [worker-2]: Traceback (most recent call last): [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-0]: self.run() [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 755, in _run_with_setenv [worker-0]: return self._actual_run() [worker-3]: self.run() [worker-0]: ^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in _run_with_absl [worker-0]: app.run(lambda _: self._run_impl()) [worker-1]: self.run() [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 312, in run [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 755, in _run_with_setenv [worker-0]: _run_main(main, args) [worker-3]: return self._actual_run() [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 258, in _run_main [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 755, in _run_with_setenv [worker-0]: sys.exit(main(argv)) [worker-3]: ^^^^^^^^^^^^^^^^^^ [worker-1]: return self._actual_run() [worker-0]: ^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in [worker-0]: app.run(lambda _: self._run_impl()) [worker-0]: ^^^^^^^^^^^^^^^^ [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-0]: self._target(*self._args, **self._kwargs) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 866, in __call__ [worker-1]: ^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in _run_with_absl [worker-0]: six.reraise(*info.exc_info) [worker-3]: app.run(lambda _: self._run_impl()) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in _run_with_absl [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 312, in run [worker-1]: app.run(lambda _: self._run_impl()) [worker-0]: raise value [worker-3]: _run_main(main, args) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 312, in run [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 258, in _run_main [worker-1]: _run_main(main, args) [worker-0]: return_value = fn(*args, **kwargs) [worker-3]: sys.exit(main(argv)) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 258, in _run_main [worker-0]: ^^^^^^^^^^^^^^^^^^^ [worker-3]: ^^^^^^^^^^ [worker-1]: sys.exit(main(argv)) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in [worker-1]: ^^^^^^^^^^ [worker-0]: preemption_handler.run(distributed_train_step, epoch, step) [worker-3]: app.run(lambda _: self._run_impl()) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 893, in run [worker-3]: ^^^^^^^^^^^^^^^^ [worker-1]: app.run(lambda _: self._run_impl()) [worker-0]: return self._run_for_multi_worker_mirrored( [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-1]: ^^^^^^^^^^^^^^^^ [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: self._target(*self._args, **self._kwargs) [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 909, in _run_for_multi_worker_mirrored [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 866, in __call__ [worker-1]: self._target(*self._args, **self._kwargs) [worker-0]: result = distributed_train_function(*args, **kwargs) [worker-3]: six.reraise(*info.exc_info) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 866, in __call__ [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise [worker-1]: six.reraise(*info.exc_info) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-3]: raise value [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise [worker-0]: strategy.run(train_step) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained [worker-1]: raise value [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/distribute_lib.py", line 1671, in run [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained [worker-3]: return_value = fn(*args, **kwargs) [worker-1]: return_value = fn(*args, **kwargs) [worker-0]: return self._extended.call_for_each_replica(fn, args=args, kwargs=kwargs) [worker-1]: ^^^^^^^^^^^^^^^^^^^ [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/distribute_lib.py", line 3248, in call_for_each_replica [worker-1]: preemption_handler.run(distributed_train_step, epoch, step) [worker-0]: return self._call_for_each_replica(fn, args, kwargs) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 893, in run [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: return self._run_for_multi_worker_mirrored( [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_strategy.py", line 696, in _call_for_each_replica [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: return mirrored_run.call_for_each_replica( [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 909, in _run_for_multi_worker_mirrored [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: result = distributed_train_function(*args, **kwargs) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 84, in call_for_each_replica [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: ^^^^^^^^^^^^^^^^^^^ [worker-0]: return wrapped(*args, **kwargs) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-1]: strategy.run(train_step) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/util/traceback_utils.py", line 141, in error_handler [worker-3]: preemption_handler.run(distributed_train_step, epoch, step) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/distribute_lib.py", line 1671, in run [worker-0]: return fn(*args, **kwargs) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 893, in run [worker-1]: return self._extended.call_for_each_replica(fn, args=args, kwargs=kwargs) [worker-0]: ^^^^^^^^^^^^^^^^^^^ [worker-3]: return self._run_for_multi_worker_mirrored( [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/polymorphic_function.py", line 840, in __call__ [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/distribute_lib.py", line 3248, in call_for_each_replica [worker-0]: result = self._call(*args, **kwds) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 909, in _run_for_multi_worker_mirrored [worker-1]: return self._call_for_each_replica(fn, args, kwargs) [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: result = distributed_train_function(*args, **kwargs) [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/polymorphic_function.py", line 912, in _call [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_strategy.py", line 696, in _call_for_each_replica [worker-0]: return self._concrete_variable_creation_fn._call_flat( # pylint: disable=protected-access [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-1]: return mirrored_run.call_for_each_replica( [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: strategy.run(train_step) [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/monomorphic_function.py", line 1342, in _call_flat [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/distribute_lib.py", line 1671, in run [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 84, in call_for_each_replica [worker-0]: return self._build_call_outputs(self._inference_function(*args)) [worker-3]: return self._extended.call_for_each_replica(fn, args=args, kwargs=kwargs) [worker-1]: return wrapped(*args, **kwargs) [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/atomic_function.py", line 200, in __call__ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/distribute_lib.py", line 3248, in call_for_each_replica [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/util/traceback_utils.py", line 141, in error_handler [worker-3]: return self._call_for_each_replica(fn, args, kwargs) [worker-0]: outputs = self._bound_context.call_function( [worker-1]: return fn(*args, **kwargs) [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: ^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 1457, in call_function [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_strategy.py", line 696, in _call_for_each_replica [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/polymorphic_function.py", line 840, in __call__ [worker-0]: outputs = execute.execute( [worker-3]: return mirrored_run.call_for_each_replica( [worker-1]: result = self._call(*args, **kwds) [worker-0]: ^^^^^^^^^^^^^^^^ [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/execute.py", line 53, in quick_execute [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 84, in call_for_each_replica [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/polymorphic_function.py", line 912, in _call [worker-0]: tensors = pywrap_tfe.TFE_Py_Execute(ctx._handle, device_name, op_name, [worker-3]: return wrapped(*args, **kwargs) [worker-1]: return self._concrete_variable_creation_fn._call_flat( # pylint: disable=protected-access [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: self.run() [worker-0]: tensorflow.python.framework.errors_impl.AbortedError: Graph execution error: [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/monomorphic_function.py", line 1342, in _call_flat [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 755, in _run_with_setenv [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/util/traceback_utils.py", line 141, in error_handler [worker-0]: [worker-1]: return self._build_call_outputs(self._inference_function(*args)) [worker-2]: return self._actual_run() [worker-3]: return fn(*args, **kwargs) [worker-0]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: ^^^^^^^^^^^^^^^^^^ [worker-3]: ^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/atomic_function.py", line 200, in __call__ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/polymorphic_function.py", line 840, in __call__ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in _run_with_absl [worker-0]: test_util.main() [worker-3]: result = self._call(*args, **kwds) [worker-1]: outputs = self._bound_context.call_function( [worker-2]: app.run(lambda _: self._run_impl()) [worker-0]: File "", line 1, in [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 312, in run [worker-0]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/polymorphic_function.py", line 912, in _call [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 1457, in call_function [worker-2]: _run_main(main, args) [worker-0]: code = _serve_one(child_r, fds, [worker-3]: return self._concrete_variable_creation_fn._call_flat( # pylint: disable=protected-access [worker-1]: outputs = execute.execute( [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 258, in _run_main [worker-0]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: ^^^^^^^^^^^^^^^^ [worker-2]: sys.exit(main(argv)) [worker-0]: code = spawn._main(child_r, parent_sentinel) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/monomorphic_function.py", line 1342, in _call_flat [worker-2]: ^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/execute.py", line 53, in quick_execute [worker-0]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-3]: return self._build_call_outputs(self._inference_function(*args)) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in [worker-1]: tensors = pywrap_tfe.TFE_Py_Execute(ctx._handle, device_name, op_name, [worker-0]: return self._bootstrap(parent_sentinel) [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: app.run(lambda _: self._run_impl()) [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/atomic_function.py", line 200, in __call__ [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-1]: tensorflow.python.framework.errors_impl.AbortedError: Graph execution error: [worker-3]: outputs = self._bound_context.call_function( [worker-2]: ^^^^^^^^^^^^^^^^ [worker-0]: self.run() [worker-1]: [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-1]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 1457, in call_function [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-3]: outputs = execute.execute( [worker-2]: self._target(*self._args, **self._kwargs) [worker-1]: test_util.main() [worker-3]: ^^^^^^^^^^^^^^^^ [worker-0]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 866, in __call__ [worker-1]: File "", line 1, in [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/execute.py", line 53, in quick_execute [worker-1]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-0]: preemption_handler.run(distributed_train_step, epoch, step) [worker-2]: six.reraise(*info.exc_info) [worker-1]: code = _serve_one(child_r, fds, [worker-3]: tensors = pywrap_tfe.TFE_Py_Execute(ctx._handle, device_name, op_name, [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise [worker-1]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: code = spawn._main(child_r, parent_sentinel) [worker-2]: raise value [worker-3]: tensorflow.python.framework.errors_impl.AbortedError: Graph execution error: [worker-0]: strategy.run(train_step) [worker-1]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained [worker-1]: return self._bootstrap(parent_sentinel) [worker-2]: return_value = fn(*args, **kwargs) [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-2]: ^^^^^^^^^^^^^^^^^^^ [worker-1]: self.run() [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-2]: preemption_handler.run(distributed_train_step, epoch, step) [worker-1]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 893, in run [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-2]: return self._run_for_multi_worker_mirrored( [worker-1]: preemption_handler.run(distributed_train_step, epoch, step) [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handling.py", line 909, in _run_for_multi_worker_mirrored [worker-1]: strategy.run(train_step) [worker-2]: result = distributed_train_function(*args, **kwargs) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-3]: [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-1]: Node: 'CollectiveReduceV2' [worker-2]: strategy.run(train_step) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/distribute_lib.py", line 1671, in run [worker-1]: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-3]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-2]: return self._extended.call_for_each_replica(fn, args=args, kwargs=kwargs) [worker-1]: The error could be from a previous operation. Restart your program to reset. [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-0]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: test_util.main() [worker-0]: Node: 'CollectiveReduceV2' [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: :{"created":"@1682286154.552458505","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-0]: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-3]: File "", line 1, in [worker-0]: The error could be from a previous operation. Restart your program to reset. [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/distribute_lib.py", line 3248, in call_for_each_replica [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-1]: The error could be from a previous operation. Restart your program to reset. [worker-0]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_665] [worker-2]: return self._call_for_each_replica(fn, args, kwargs) [worker-3]: code = _serve_one(child_r, fds, [worker-1]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_strategy.py", line 696, in _call_for_each_replica [worker-3]: code = spawn._main(child_r, parent_sentinel) [worker-2]: return mirrored_run.call_for_each_replica( [worker-3]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: return self._bootstrap(parent_sentinel) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 84, in call_for_each_replica [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-2]: return wrapped(*args, **kwargs) [worker-3]: self.run() [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-3]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/util/traceback_utils.py", line 141, in error_handler [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-2]: return fn(*args, **kwargs) [worker-2]: ^^^^^^^^^^^^^^^^^^^ [worker-3]: preemption_handler.run(distributed_train_step, epoch, step) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/polymorphic_function.py", line 840, in __call__ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-2]: result = self._call(*args, **kwds) [worker-3]: strategy.run(train_step) [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/polymorphic_function.py", line 912, in _call [worker-3]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-2]: return self._concrete_variable_creation_fn._call_flat( # pylint: disable=protected-access [worker-3]: Node: 'CollectiveReduceV2' [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: Collective ops is aborted by: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/monomorphic_function.py", line 1342, in _call_flat [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-2]: return self._build_call_outputs(self._inference_function(*args)) [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: :{"created":"@1682286154.552650920","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted.\nThe error could be from a previous operation. Restart your program to reset.","grpc_status":10} [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/polymorphic_function/atomic_function.py", line 200, in __call__ [worker-3]: The error could be from a previous operation. Restart your program to reset. [worker-2]: outputs = self._bound_context.call_function( [worker-3]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 1457, in call_function [worker-2]: outputs = execute.execute( [worker-2]: ^^^^^^^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/execute.py", line 53, in quick_execute [worker-2]: tensors = pywrap_tfe.TFE_Py_Execute(ctx._handle, device_name, op_name, [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: tensorflow.python.framework.errors_impl.AbortedError: Graph execution error: [worker-2]: [worker-2]: Detected at node 'CollectiveReduceV2' defined at (most recent call last): [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 549, in [worker-2]: test_util.main() [worker-2]: File "", line 1, in [worker-2]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 274, in main [worker-2]: code = _serve_one(child_r, fds, [worker-2]: File "/usr/lib/python3.11/multiprocessing/forkserver.py", line 313, in _serve_one [worker-2]: code = spawn._main(child_r, parent_sentinel) [worker-2]: File "/usr/lib/python3.11/multiprocessing/spawn.py", line 133, in _main [worker-2]: return self._bootstrap(parent_sentinel) [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-2]: self.run() [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-2]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 237, in worker_fn [worker-2]: preemption_handler.run(distributed_train_step, epoch, step) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 189, in distributed_train_step [worker-2]: strategy.run(train_step) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/mirrored_run.py", line 79, in wrapped_fn [worker-2]: return call_for_each_replica(strategy, fn.python_function, args, kwargs) [worker-2]: Node: 'CollectiveReduceV2' [worker-2]: Collective ops is aborted by: Error reported from /job:worker/task:3: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-2]: The error could be from a previous operation. Restart your program to reset. [worker-2]: [[{{node CollectiveReduceV2}}]] [Op:__inference_train_step_663] INFO:tensorflow:restarting workers I0423 21:42:36.986426 281473453224832 gce_failure_handler_test.py:411] restarting workers [worker-0]: I0423 21:42:37.023430 281473829860224 multi_process_runner.py:840] Subprocess with PID 2347464 (worker, 0) is now being started. [worker-0]: I0423 21:42:37.024008 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23400", "localhost:23929", "localhost:19346"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' INFO:tensorflow:workers restarted I0423 21:42:37.031741 281473453224832 gce_failure_handler_test.py:415] workers restarted [worker-2]: I0423 21:42:37.054663 281473829860224 multi_process_runner.py:840] Subprocess with PID 2347482 (worker, 2) is now being started. [worker-2]: I0423 21:42:37.055231 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23400", "localhost:23929", "localhost:19346"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-1]: I0423 21:42:37.058510 281473829860224 multi_process_runner.py:840] Subprocess with PID 2347470 (worker, 1) is now being started. [worker-0]: 2023-04-23 21:42:37.061149: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:16671 [worker-0]: 2023-04-23 21:42:37.074860: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:0 has connected to coordination service. Incarnation: 14930057885338718955 [worker-0]: 2023-04-23 21:42:37.075107: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-1]: I0423 21:42:37.059091 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23400", "localhost:23929", "localhost:19346"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:42:37.079806 281473829860224 multi_process_runner.py:840] Subprocess with PID 2347500 (worker, 3) is now being started. [worker-3]: I0423 21:42:37.080324 281473829860224 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23400", "localhost:23929", "localhost:19346"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-1]: 2023-04-23 21:42:37.095690: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:23400 [worker-0]: 2023-04-23 21:42:37.105807: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:1 has connected to coordination service. Incarnation: 1937637791896734459 [worker-1]: 2023-04-23 21:42:37.107188: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-2]: 2023-04-23 21:42:37.227374: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:23929 [worker-0]: 2023-04-23 21:42:37.233444: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:2 has connected to coordination service. Incarnation: 524945756456362205 [worker-2]: 2023-04-23 21:42:37.233689: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-3]: 2023-04-23 21:42:37.279008: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:19346 [worker-0]: 2023-04-23 21:42:37.288562: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:1 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-0]: 2023-04-23 21:42:37.289070: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:1 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:1 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-1]: 2023-04-23 21:42:37.289183: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: INTERNAL: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-2]: 2023-04-23 21:42:37.289243: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: INTERNAL: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-0]: 2023-04-23 21:42:37.289822: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: ABORTED: Error reported from /job:worker/task:1: /job:worker/replica:0/task:1 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-2]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-1]: :{"created":"@1682286157.288995974","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [type.googleapis.com/tensorflow.CoordinationServiceError=''] [worker-1]: 2023-04-23 21:42:37.289257: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort INTERNAL: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-2]: :{"created":"@1682286157.289068837","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [type.googleapis.com/tensorflow.CoordinationServiceError=''] [worker-0]: 2023-04-23 21:42:37.289878: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort ABORTED: Error reported from /job:worker/task:1: /job:worker/replica:0/task:1 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x01'] [worker-2]: 2023-04-23 21:42:37.289341: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort INTERNAL: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-2]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-2]: :{"created":"@1682286157.289068837","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [type.googleapis.com/tensorflow.CoordinationServiceError=''] [worker-1]: :{"created":"@1682286157.288995974","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [type.googleapis.com/tensorflow.CoordinationServiceError=''] [worker-1]: 2023-04-23 21:42:37.289299: E tensorflow/core/common_runtime/eager/context_distributed_manager.cc:761] Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-2]: 2023-04-23 21:42:37.289391: E tensorflow/core/common_runtime/eager/context_distributed_manager.cc:761] Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-2]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-0]: 2023-04-23 21:42:37.293916: I tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:535] /job:worker/replica:0/task:3 has connected to coordination service. Incarnation: 8039046905443877274 [worker-1]: :{"created":"@1682286157.288995974","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [worker-3]: 2023-04-23 21:42:37.294142: I tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:298] Coordination agent has successfully connected. [worker-2]: :{"created":"@1682286157.289068837","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [worker-0]: 2023-04-23 21:42:37.296163: E tensorflow/core/common_runtime/eager/context_distributed_manager.cc:761] Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-3]: 2023-04-23 21:42:37.294890: E tensorflow/tsl/distributed_runtime/coordination/coordination_service_agent.cc:737] Coordination agent is in ERROR: INTERNAL: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-0]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286157.294770442","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [type.googleapis.com/tensorflow.CoordinationServiceError=''] [worker-3]: 2023-04-23 21:42:37.294939: E tensorflow/core/common_runtime/base_collective_executor.cc:249] BaseCollectiveExecutor::StartAbort INTERNAL: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-0]: :{"created":"@1682286157.288890103","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286157.294770442","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [type.googleapis.com/tensorflow.CoordinationServiceError=''] [worker-3]: 2023-04-23 21:42:37.294972: E tensorflow/core/common_runtime/eager/context_distributed_manager.cc:761] Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: :{"created":"@1682286157.294770442","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [worker-1]: Process _Process-39: [worker-1]: Traceback (most recent call last): [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-1]: self.run() [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 755, in _run_with_setenv [worker-1]: return self._actual_run() [worker-1]: ^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in _run_with_absl [worker-0]: 2023-04-23 21:42:37.324231: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:2 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-1]: app.run(lambda _: self._run_impl()) [worker-0]: 2023-04-23 21:42:37.324326: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:2 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:2 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x02'] [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 312, in run [worker-1]: _run_main(main, args) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 258, in _run_main [worker-0]: 2023-04-23 21:42:37.326841: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:566] /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [worker-0]: 2023-04-23 21:42:37.326907: E tensorflow/tsl/distributed_runtime/coordination/coordination_service.cc:971] /job:worker/replica:0/task:3 has been set to ERROR in coordination service: ABORTED: /job:worker/replica:0/task:3 unexpectedly tried to connect with a different incarnation. It has likely restarted. [type.googleapis.com/tensorflow.CoordinationServiceError='\"\n\n\x06worker\x10\x03'] [worker-1]: sys.exit(main(argv)) [worker-1]: ^^^^^^^^^^ [worker-3]: Process _Process-41: [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in [worker-1]: app.run(lambda _: self._run_impl()) [worker-1]: ^^^^^^^^^^^^^^^^ [worker-1]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-0]: Process _Process-38: [worker-1]: self._target(*self._args, **self._kwargs) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 866, in __call__ [worker-1]: six.reraise(*info.exc_info) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise [worker-1]: raise value [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained [worker-1]: return_value = fn(*args, **kwargs) [worker-3]: Traceback (most recent call last): [worker-1]: ^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 134, in worker_fn [worker-1]: strategy = collective_all_reduce_strategy.CollectiveAllReduceStrategy() [worker-1]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 187, in __init__ [worker-1]: CollectiveAllReduceExtended( [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 339, in __init__ [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-1]: self._initialize_strategy(self._cluster_resolver, devices=devices) [worker-3]: self.run() [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 358, in _initialize_strategy [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 755, in _run_with_setenv [worker-3]: return self._actual_run() [worker-3]: ^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in _run_with_absl [worker-3]: app.run(lambda _: self._run_impl()) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 312, in run [worker-3]: _run_main(main, args) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 258, in _run_main [worker-3]: sys.exit(main(argv)) [worker-3]: ^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in [worker-3]: app.run(lambda _: self._run_impl()) [worker-3]: ^^^^^^^^^^^^^^^^ [worker-3]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-3]: self._target(*self._args, **self._kwargs) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 866, in __call__ [worker-3]: six.reraise(*info.exc_info) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise [worker-3]: raise value [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained [worker-3]: return_value = fn(*args, **kwargs) [worker-3]: ^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 134, in worker_fn [worker-3]: strategy = collective_all_reduce_strategy.CollectiveAllReduceStrategy() [worker-3]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 187, in __init__ [worker-3]: CollectiveAllReduceExtended( [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 339, in __init__ [worker-3]: self._initialize_strategy(self._cluster_resolver, devices=devices) [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 358, in _initialize_strategy [worker-1]: self._initialize_multi_worker(cluster_resolver) [worker-3]: self._initialize_multi_worker(cluster_resolver) [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 530, in _initialize_multi_worker [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 530, in _initialize_multi_worker [worker-1]: context.context().ensure_initialized() [worker-3]: context.context().ensure_initialized() [worker-1]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 608, in ensure_initialized [worker-3]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 608, in ensure_initialized [worker-1]: pywrap_tfe.TFE_EnableCollectiveOps(context_handle, server_def_str) [worker-3]: pywrap_tfe.TFE_EnableCollectiveOps(context_handle, server_def_str) [worker-1]: tensorflow.python.framework.errors_impl.InternalError: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-3]: tensorflow.python.framework.errors_impl.InternalError: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-0]: Traceback (most recent call last): [worker-1]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-3]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-1]: :{"created":"@1682286157.288995974","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [worker-3]: :{"created":"@1682286157.294770442","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [worker-0]: self.run() [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 755, in _run_with_setenv [worker-0]: return self._actual_run() [worker-0]: ^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in _run_with_absl [worker-0]: app.run(lambda _: self._run_impl()) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 312, in run [worker-0]: _run_main(main, args) [worker-2]: Process _Process-40: [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 258, in _run_main [worker-0]: sys.exit(main(argv)) [worker-0]: ^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in [worker-0]: app.run(lambda _: self._run_impl()) [worker-0]: ^^^^^^^^^^^^^^^^ [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-0]: self._target(*self._args, **self._kwargs) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 866, in __call__ [worker-0]: six.reraise(*info.exc_info) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise [worker-0]: raise value [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained [worker-0]: return_value = fn(*args, **kwargs) [worker-0]: ^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 134, in worker_fn [worker-0]: strategy = collective_all_reduce_strategy.CollectiveAllReduceStrategy() [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 187, in __init__ [worker-0]: CollectiveAllReduceExtended( [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 339, in __init__ [worker-0]: self._initialize_strategy(self._cluster_resolver, devices=devices) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 358, in _initialize_strategy [worker-0]: self._initialize_multi_worker(cluster_resolver) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 530, in _initialize_multi_worker [worker-0]: context.context().ensure_initialized() [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 608, in ensure_initialized [worker-0]: pywrap_tfe.TFE_EnableCollectiveOps(context_handle, server_def_str) [worker-0]: tensorflow.python.framework.errors_impl.InternalError: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-0]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-0]: :{"created":"@1682286157.288890103","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} [worker-2]: Traceback (most recent call last): [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-2]: self.run() [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 755, in _run_with_setenv [worker-2]: return self._actual_run() [worker-2]: ^^^^^^^^^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in _run_with_absl [worker-2]: app.run(lambda _: self._run_impl()) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 312, in run [worker-2]: _run_main(main, args) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/app.py", line 258, in _run_main [worker-2]: sys.exit(main(argv)) [worker-2]: ^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in [worker-2]: app.run(lambda _: self._run_impl()) [worker-2]: ^^^^^^^^^^^^^^^^ [worker-2]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-2]: self._target(*self._args, **self._kwargs) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 866, in __call__ [worker-2]: six.reraise(*info.exc_info) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise [worker-2]: raise value [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained [worker-2]: return_value = fn(*args, **kwargs) [worker-2]: ^^^^^^^^^^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 134, in worker_fn [worker-2]: strategy = collective_all_reduce_strategy.CollectiveAllReduceStrategy() [worker-2]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 187, in __init__ [worker-2]: CollectiveAllReduceExtended( [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 339, in __init__ [worker-2]: self._initialize_strategy(self._cluster_resolver, devices=devices) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 358, in _initialize_strategy [worker-2]: self._initialize_multi_worker(cluster_resolver) [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 530, in _initialize_multi_worker [worker-2]: context.context().ensure_initialized() [worker-2]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 608, in ensure_initialized [worker-2]: pywrap_tfe.TFE_EnableCollectiveOps(context_handle, server_def_str) [worker-2]: tensorflow.python.framework.errors_impl.InternalError: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 [worker-2]: Additional GRPC error information from remote target /job:worker/replica:0/task:0: [worker-2]: :{"created":"@1682286157.289068837","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} I0423 21:42:37.997186 281473453224832 multi_process_runner.py:646] worker-0 exit code: 1 I0423 21:42:37.997471 281473453224832 multi_process_runner.py:646] worker-1 exit code: 1 I0423 21:42:37.997591 281473453224832 multi_process_runner.py:646] worker-2 exit code: 1 I0423 21:42:37.997703 281473453224832 multi_process_runner.py:646] worker-3 exit code: 1 [ FAILED ] GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker INFO:tensorflow:time(__main__.GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker): 6.53s I0423 21:42:38.300332 281473453224832 test_util.py:2462] time(__main__.GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker): 6.53s ====================================================================== ERROR: test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker (__main__.GceFailureHandlingTest) GceFailureHandlingTest.test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker test_multiple_workers_preempted_consecutively_test_apiwrappingtrain_True_graceperiod_7_inputarg_manager_strategyoption_MWMSmultiworker(api_wrapping_train=True, grace_period=7, input_arg='manager', strategy_option='MWMS_multi_worker') ---------------------------------------------------------------------- Traceback (most recent call last): File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/absl_py/absl/testing/parameterized.py", line 314, in bound_param_test return test_method(self, **testcase_params) ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/framework/test_combinations.py", line 360, in decorated execute_test_method() File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/framework/test_combinations.py", line 343, in execute_test_method test_method(**kwargs_to_pass) File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/combinations.py", line 559, in decorator test_method(self, **kwargs) File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 417, in test_multiple_workers_preempted_consecutively mpr.join(timeout=250) File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 649, in join self._reraise_if_subprocess_error(process_statuses) File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 565, in _reraise_if_subprocess_error six.reraise(*process_status.exc_info) File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise raise value File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained return_value = fn(*args, **kwargs) ^^^^^^^^^^^^^^^^^ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.py", line 134, in worker_fn strategy = collective_all_reduce_strategy.CollectiveAllReduceStrategy() ^^^^^^^^^^^^^^^^^ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 187, in __init__ CollectiveAllReduceExtended( ^^^^^^^^^^^^^^^^^ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 339, in __init__ self._initialize_strategy(self._cluster_resolver, devices=devices) ^^^^^^^^^^^^^^^^^ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 358, in _initialize_strategy self._initialize_multi_worker(cluster_resolver) ^^^^^^^^^^^^^^^^^ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 530, in _initialize_multi_worker context.context().ensure_initialized() ^^^^^^^^^^^^^^^^^ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/gce_failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 608, in ensure_initialized pywrap_tfe.TFE_EnableCollectiveOps(context_handle, server_def_str) ^^^^^^^^^^^^^^^^^ tensorflow.python.framework.errors_impl.InternalError: Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1 Additional GRPC error information from remote target /job:worker/replica:0/task:0: :{"created":"@1682286157.288995974","description":"Error received from peer ipv6:[::1]:16671","file":"external/com_github_grpc_grpc/src/core/lib/surface/call.cc","file_line":1056,"grpc_message":"Barrier failed from a task error. Barrier Id: WaitForAllTasks::13831608150383750917, Task: /job:worker/replica:0/task:1","grpc_status":13} ---------------------------------------------------------------------- Ran 7 tests in 51.195s FAILED (errors=1) ================================================================================ ==================== Test output for //tensorflow/python/distribute/failure_handling:failure_handler_test (shard 8 of 8): Running tests under Python 3.11.3: /usr/local/bin/python3 [ RUN ] PreemptionCheckpointTest.test_grace_period_continue_training_test_inputarg_manager_strategyoption_MWMSmultiworker INFO:tensorflow:Using local port 16671 I0423 21:42:30.837587 281473691841408 test_util.py:3794] Using local port 16671 INFO:tensorflow:Using local port 23039 I0423 21:42:30.838237 281473691841408 test_util.py:3794] Using local port 23039 INFO:tensorflow:Using local port 22347 I0423 21:42:30.838624 281473691841408 test_util.py:3794] Using local port 22347 INFO:tensorflow:Using local port 18250 I0423 21:42:30.839002 281473691841408 test_util.py:3794] Using local port 18250 INFO:tensorflow:Cluster starting. I0423 21:42:34.365690 281473691841408 failure_handler_test.py:432] Cluster starting. [worker-0]: I0423 21:42:34.415323 281473277457280 multi_process_runner.py:840] Subprocess with PID 2340580 (worker, 0) is now being started. [worker-0]: I0423 21:42:34.415848 281473277457280 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23039", "localhost:22347", "localhost:18250"]}, "task": {"type": "worker", "index": 0}, "rpc_layer": "grpc"}' [worker-0]: E0423 21:42:34.454076202 2340580 server_chttp2.cc:40] {"created":"@1682286154.453917321","description":"No address added out of total 1 resolved","file":"external/com_github_grpc_grpc/src/core/ext/transport/chttp2/server/chttp2_server.cc","file_line":395,"referenced_errors":[{"created":"@1682286154.453911890","description":"Failed to add any wildcard listeners","file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_posix.cc","file_line":341,"referenced_errors":[{"created":"@1682286154.453882977","description":"Unable to configure socket","fd":9,"file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_utils_posix_common.cc","file_line":215,"referenced_errors":[{"created":"@1682286154.453870701","description":"Address already in use","errno":98,"file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_utils_posix_common.cc","file_line":189,"os_error":"Address already in use","syscall":"bind"}]},{"created":"@1682286154.453910760","description":"Unable to configure socket","fd":9,"file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_utils_posix_common.cc","file_line":215,"referenced_errors":[{"created":"@1682286154.453903079","description":"Address already in use","errno":98,"file":"external/com_github_grpc_grpc/src/core/lib/iomgr/tcp_server_utils_posix_common.cc","file_line":189,"os_error":"Address already in use","syscall":"bind"}]}]}]} [worker-0]: 2023-04-23 21:42:34.454224: E tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:600] UNKNOWN: Could not start gRPC server [worker-0]: 2023-04-23 21:42:34.454724: E tensorflow/core/common_runtime/eager/context_distributed_manager.cc:699] Could not start gRPC server [worker-2]: I0423 21:42:34.461701 281473277457280 multi_process_runner.py:840] Subprocess with PID 2340605 (worker, 2) is now being started. [worker-2]: I0423 21:42:34.462456 281473277457280 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23039", "localhost:22347", "localhost:18250"]}, "task": {"type": "worker", "index": 2}, "rpc_layer": "grpc"}' [worker-0]: Process _Process-2: [worker-1]: I0423 21:42:34.477763 281473277457280 multi_process_runner.py:840] Subprocess with PID 2340595 (worker, 1) is now being started. [worker-3]: I0423 21:42:34.478070 281473277457280 multi_process_runner.py:840] Subprocess with PID 2340611 (worker, 3) is now being started. [worker-0]: Traceback (most recent call last): [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 314, in _bootstrap [worker-0]: self.run() [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 755, in _run_with_setenv [worker-0]: return self._actual_run() [worker-0]: ^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in _run_with_absl [worker-0]: app.run(lambda _: self._run_impl()) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/absl_py/absl/app.py", line 312, in run [worker-0]: _run_main(main, args) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/absl_py/absl/app.py", line 258, in _run_main [worker-0]: sys.exit(main(argv)) [worker-0]: ^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 54, in [worker-0]: app.run(lambda _: self._run_impl()) [worker-0]: ^^^^^^^^^^^^^^^^ [worker-0]: File "/usr/lib/python3.11/multiprocessing/process.py", line 108, in run [worker-0]: self._target(*self._args, **self._kwargs) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 866, in __call__ [worker-0]: six.reraise(*info.exc_info) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/six_archive/six.py", line 719, in reraise [worker-0]: raise value [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1060, in _run_contained [worker-0]: return_value = fn(*args, **kwargs) [worker-0]: ^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handler_test.py", line 146, in worker_fn [worker-0]: strategy = collective_all_reduce_strategy.CollectiveAllReduceStrategy() [worker-0]: ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 187, in __init__ [worker-0]: CollectiveAllReduceExtended( [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 339, in __init__ [worker-0]: self._initialize_strategy(self._cluster_resolver, devices=devices) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 358, in _initialize_strategy [worker-0]: self._initialize_multi_worker(cluster_resolver) [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/collective_all_reduce_strategy.py", line 530, in _initialize_multi_worker [worker-0]: context.context().ensure_initialized() [worker-0]: File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/context.py", line 608, in ensure_initialized [worker-0]: pywrap_tfe.TFE_EnableCollectiveOps(context_handle, server_def_str) [worker-0]: tensorflow.python.framework.errors_impl.UnknownError: Could not start gRPC server [worker-1]: I0423 21:42:34.478515 281473277457280 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23039", "localhost:22347", "localhost:18250"]}, "task": {"type": "worker", "index": 1}, "rpc_layer": "grpc"}' [worker-3]: I0423 21:42:34.478758 281473277457280 multi_process_runner.py:842] TF_CONFIG: '{"cluster": {"worker": ["localhost:16671", "localhost:23039", "localhost:22347", "localhost:18250"]}, "task": {"type": "worker", "index": 3}, "rpc_layer": "grpc"}' [worker-3]: 2023-04-23 21:42:34.537939: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:18250 [worker-1]: 2023-04-23 21:42:34.538858: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:23039 [worker-2]: 2023-04-23 21:42:34.567791: I tensorflow/core/distributed_runtime/rpc/grpc_server_lib.cc:449] Started server with target: grpc://localhost:22347 -- Test timed out at 2023-04-23 21:57:25 UTC -- Thread 0x0000ffff027cf1e0 (most recent call first): File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 258 in _continuously_readline_from_sub File "/usr/lib/python3.11/threading.py", line 975 in run File "/usr/lib/python3.11/threading.py", line 1038 in _bootstrap_inner File "/usr/lib/python3.11/threading.py", line 995 in _bootstrap Thread 0x0000ffff02fdf1e0 (most recent call first): File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 258 in _continuously_readline_from_sub File "/usr/lib/python3.11/threading.py", line 975 in run File "/usr/lib/python3.11/threading.py", line 1038 in _bootstrap_inner File "/usr/lib/python3.11/threading.py", line 995 in _bootstrap Thread 0x0000ffff037ef1e0 (most recent call first): File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 258 in _continuously_readline_from_sub File "/usr/lib/python3.11/threading.py", line 975 in run File "/usr/lib/python3.11/threading.py", line 1038 in _bootstrap_inner File "/usr/lib/python3.11/threading.py", line 995 in _bootstrap Thread 0x0000ffff03fff1e0 (most recent call first): File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 527 in _process_watchdog File "/usr/lib/python3.11/threading.py", line 975 in run File "/usr/lib/python3.11/threading.py", line 1038 in _bootstrap_inner File "/usr/lib/python3.11/threading.py", line 995 in _bootstrap Current thread 0x0000ffffb36a7380 (most recent call first): File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handler_test.py", line 435 in test_grace_period_continue_training File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/combinations.py", line 559 in decorator File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/framework/test_combinations.py", line 343 in execute_test_method File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/framework/test_combinations.py", line 360 in decorated File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/absl_py/absl/testing/parameterized.py", line 314 in bound_param_test File "/usr/lib/python3.11/unittest/case.py", line 579 in _callTestMethod File "/usr/lib/python3.11/unittest/case.py", line 623 in run File "/usr/lib/python3.11/unittest/case.py", line 678 in __call__ File "/usr/lib/python3.11/unittest/suite.py", line 122 in run File "/usr/lib/python3.11/unittest/suite.py", line 84 in __call__ File "/usr/lib/python3.11/unittest/suite.py", line 122 in run File "/usr/lib/python3.11/unittest/suite.py", line 84 in __call__ File "/usr/lib/python3.11/unittest/runner.py", line 217 in run File "/usr/lib/python3.11/unittest/main.py", line 274 in runTests File "/usr/lib/python3.11/unittest/main.py", line 102 in __init__ File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/absl_py/absl/testing/absltest.py", line 2537 in _run_and_get_tests_result File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/absl_py/absl/testing/absltest.py", line 2568 in run_tests File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/absl_py/absl/testing/absltest.py", line 2156 in _run_in_app File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/absl_py/absl/testing/absltest.py", line 2049 in main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/platform/googletest.py", line 51 in g_main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/absl_py/absl/app.py", line 258 in _run_main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/absl_py/absl/app.py", line 312 in run File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/platform/googletest.py", line 60 in main_wrapper File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/platform/benchmark.py", line 489 in benchmarks_main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/platform/googletest.py", line 62 in main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/platform/test.py", line 56 in main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/eager/test.py", line 25 in main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_lib.py", line 167 in test_main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/multi_process_runner.py", line 1455 in test_main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/test_util.py", line 138 in main File "/home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/bin/tensorflow/python/distribute/failure_handling/failure_handler_test.runfiles/org_tensorflow/tensorflow/python/distribute/failure_handling/failure_handler_test.py", line 558 in ================================================================================ //tensorflow/c:c_api_experimental_test PASSED in 29.1s //tensorflow/c:c_api_function_test PASSED in 28.7s //tensorflow/c:c_api_test_cpu PASSED in 33.3s //tensorflow/c:c_test PASSED in 28.3s //tensorflow/c:env_test_cpu PASSED in 23.6s //tensorflow/c:kernels_test_cpu PASSED in 33.8s //tensorflow/c:ops_test PASSED in 22.9s //tensorflow/c:while_loop_test PASSED in 27.8s //tensorflow/c/eager:c_api_cluster_test_cpu PASSED in 32.0s //tensorflow/c/eager:c_api_remote_function_test_cpu PASSED in 31.7s //tensorflow/c/eager:c_api_remote_test_cpu PASSED in 32.3s //tensorflow/c/eager:c_api_test_cpu PASSED in 34.6s //tensorflow/c/eager:custom_device_test PASSED in 29.2s //tensorflow/c/eager/parallel_device:parallel_device_lib_test PASSED in 29.8s //tensorflow/c/eager/parallel_device:parallel_device_remote_test PASSED in 28.4s //tensorflow/c/eager/parallel_device:parallel_device_test PASSED in 31.9s //tensorflow/c/experimental/filesystem/plugins/gcs:expiring_lru_cache_test PASSED in 0.1s //tensorflow/c/experimental/filesystem/plugins/gcs:ram_file_block_cache_test PASSED in 2.4s //tensorflow/c/experimental/grappler:grappler_test PASSED in 23.8s //tensorflow/c/experimental/ops/gen/common:case_format_test PASSED in 0.8s //tensorflow/c/experimental/ops/gen/cpp:cpp_generator_test PASSED in 1.1s //tensorflow/c/experimental/ops/gen/cpp/renderers:renderer_test PASSED in 0.6s //tensorflow/c/experimental/saved_model/core:constant_loading_test PASSED in 14.1s //tensorflow/c/experimental/saved_model/core:object_graph_traversal_test PASSED in 10.3s //tensorflow/c/experimental/saved_model/core:saved_variable_loading_test PASSED in 18.3s //tensorflow/c/experimental/saved_model/core:signature_flattening_test PASSED in 10.5s //tensorflow/c/experimental/saved_model/core:tf_concrete_function_loading_test PASSED in 11.7s //tensorflow/c/experimental/saved_model/core/ops:restore_ops_test PASSED in 13.3s //tensorflow/c/experimental/saved_model/core/ops:variable_ops_test PASSED in 15.6s //tensorflow/c/experimental/saved_model/internal:saved_model_api_test PASSED in 38.0s //tensorflow/c/experimental/stream_executor:stream_executor_test PASSED in 0.7s //tensorflow/c/kernels:bitcast_op_test PASSED in 0.7s //tensorflow/c/kernels:summary_op_benchmark_test PASSED in 0.7s //tensorflow/c/kernels:summary_op_test PASSED in 0.6s //tensorflow/c/kernels:tensor_shape_utils_test PASSED in 0.1s //tensorflow/cc:cc_op_gen_test PASSED in 0.1s //tensorflow/cc:client_client_session_test PASSED in 1.9s //tensorflow/cc:coordinator_test PASSED in 4.1s //tensorflow/cc:framework_cc_ops_test PASSED in 2.1s //tensorflow/cc:framework_gradient_checker_test PASSED in 2.3s //tensorflow/cc:framework_gradients_test PASSED in 5.0s //tensorflow/cc:framework_scope_test PASSED in 0.7s //tensorflow/cc:framework_while_gradients_test PASSED in 2.2s //tensorflow/cc:gradients_array_grad_test PASSED in 6.5s //tensorflow/cc:gradients_data_flow_grad_test PASSED in 2.2s //tensorflow/cc:gradients_functional_grad_test PASSED in 2.2s //tensorflow/cc:gradients_image_grad_test PASSED in 6.3s //tensorflow/cc:gradients_linalg_grad_test PASSED in 2.7s //tensorflow/cc:gradients_manip_grad_test PASSED in 1.7s //tensorflow/cc:gradients_math_grad_test PASSED in 5.5s //tensorflow/cc:gradients_nn_grad_test PASSED in 2.7s //tensorflow/cc:gradients_resource_variable_grad_test PASSED in 1.8s //tensorflow/cc:ops_const_op_test PASSED in 0.7s //tensorflow/cc:ops_while_loop_test PASSED in 3.0s //tensorflow/cc:queue_runner_test PASSED in 12.7s //tensorflow/cc/experimental/base/tests:tensor_test PASSED in 0.1s //tensorflow/cc/experimental/base/tests:tensorhandle_test PASSED in 29.2s //tensorflow/cc/experimental/libexport:load_test PASSED in 0.3s //tensorflow/cc/experimental/libexport:save_test PASSED in 0.1s //tensorflow/cc/experimental/libtf:libtf_module_test PASSED in 29.8s //tensorflow/cc/experimental/libtf:libtf_object_test PASSED in 0.7s //tensorflow/cc/experimental/libtf:libtf_perf_test PASSED in 0.1s //tensorflow/cc/experimental/libtf:libtf_runtime_test PASSED in 26.8s //tensorflow/cc/experimental/libtf:libtf_transform_test PASSED in 31.2s //tensorflow/cc/experimental/libtf:libtf_value_test PASSED in 0.1s //tensorflow/cc/experimental/libtf:libtf_visit_test PASSED in 0.2s //tensorflow/cc/experimental/libtf/impl:iostream_test PASSED in 0.2s //tensorflow/cc/experimental/libtf/impl:none_test PASSED in 0.3s //tensorflow/cc/experimental/libtf/impl:scalars_test PASSED in 0.3s //tensorflow/cc/experimental/libtf/impl:string_test PASSED in 0.1s //tensorflow/cc/experimental/libtf/impl:tensor_spec_test PASSED in 0.1s //tensorflow/cc/saved_model:bundle_v2_test PASSED in 0.1s //tensorflow/cc/saved_model:fingerprinting_test PASSED in 1.5s //tensorflow/cc/saved_model:metrics_test PASSED in 0.2s //tensorflow/cc/saved_model:reader_test PASSED in 0.2s //tensorflow/cc/saved_model:saved_model_bundle_lite_test PASSED in 8.1s //tensorflow/cc/saved_model:saved_model_bundle_test PASSED in 6.2s //tensorflow/cc/saved_model:util_test PASSED in 0.1s //tensorflow/cc/saved_model/experimental/tests:saved_model_api_test PASSED in 28.0s //tensorflow/cc/tools:freeze_saved_model_test PASSED in 1.4s //tensorflow/compiler/aot:codegen_test PASSED in 23.5s //tensorflow/compiler/jit:compilability_check_util_test PASSED in 19.9s //tensorflow/compiler/jit:deadness_analysis_test PASSED in 9.7s //tensorflow/compiler/jit:device_compilation_cache_test PASSED in 7.0s //tensorflow/compiler/jit:device_compilation_cluster_signature_test PASSED in 6.6s //tensorflow/compiler/jit:device_compilation_profiler_test PASSED in 21.9s //tensorflow/compiler/jit:device_compiler_client_test PASSED in 5.8s //tensorflow/compiler/jit:device_compiler_disable_test PASSED in 19.5s //tensorflow/compiler/jit:device_executable_persistor_test PASSED in 21.6s //tensorflow/compiler/jit:device_util_test PASSED in 6.2s //tensorflow/compiler/jit:encapsulate_util_test PASSED in 0.5s //tensorflow/compiler/jit:node_matchers_test PASSED in 0.5s //tensorflow/compiler/jit:resource_operation_safety_analysis_test PASSED in 10.5s //tensorflow/compiler/jit:shape_inference_test PASSED in 0.5s //tensorflow/compiler/jit:xla_activity_listener_test PASSED in 23.0s //tensorflow/compiler/jit:xla_cluster_util_test PASSED in 11.0s //tensorflow/compiler/jit:xla_compile_util_test PASSED in 5.4s //tensorflow/compiler/jit:xla_kernel_creator_test PASSED in 11.0s //tensorflow/compiler/jit:xla_launch_util_test PASSED in 25.5s //tensorflow/compiler/jit/tests:auto_clustering_test PASSED in 24.8s //tensorflow/compiler/mlir:mlir_graph_optimization_pass_test PASSED in 13.3s //tensorflow/compiler/mlir:register_common_dialects_test PASSED in 16.3s //tensorflow/compiler/mlir/lite:lstm_utils_test PASSED in 0.6s //tensorflow/compiler/mlir/lite:perception_ops_utils_test PASSED in 0.6s //tensorflow/compiler/mlir/lite:size_utils_test PASSED in 0.2s //tensorflow/compiler/mlir/lite:tftext_utils_test PASSED in 0.5s //tensorflow/compiler/mlir/lite/experimental/remat:rematerializer_test PASSED in 1.0s //tensorflow/compiler/mlir/lite/experimental/tac:execution_metadata_exporter_test PASSED in 4.4s //tensorflow/compiler/mlir/lite/experimental/tac/tests:compute-cost.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/experimental/tac/tests:device-transform-gpu.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/experimental/tac/tests:device-transform-nnapi.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/experimental/tac/tests:fold-constants-to-subgraph.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/experimental/tac/tests:get-alternative-subgraph.mlir.test PASSED in 1.7s //tensorflow/compiler/mlir/lite/experimental/tac/tests:get-op-cost.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/experimental/tac/tests:pick-subgraphs.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/experimental/tac/tests:raise-target-subgraphs.mlir.test PASSED in 1.6s //tensorflow/compiler/mlir/lite/experimental/tac/tests:target-annotation.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/experimental/tac/tests/e2e:device-transform-nnapi.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/experimental/tac/tests/e2e:simple-graph.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/metrics:error_collector_inst_test PASSED in 0.4s //tensorflow/compiler/mlir/lite/quantization:numerical_utils_test PASSED in 0.2s //tensorflow/compiler/mlir/lite/quantization/lite:quantize_model_test PASSED in 12.4s //tensorflow/compiler/mlir/lite/quantization/lite:quantize_weights_test PASSED in 11.5s //tensorflow/compiler/mlir/lite/quantization/tensorflow/tests:fallback_to_flex_ops_default.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/quantization/tensorflow/tests:fallback_to_flex_ops_legacy.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/quantization/tensorflow/tests:tf_to_quant.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/lite/quantization/tensorflow/tests:tf_to_quant_4bit.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/quantization/tests:import_quant_stats.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/sparsity:sparsify_model_test PASSED in 0.7s //tensorflow/compiler/mlir/lite/stablehlo/tests:fold_broadcast.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:fuse_mhlo_convolution.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-inplaceupdate.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-skip-quantization-ops.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tf-fb-tf.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-add.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-broadcast_in_dim.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-clamp.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-compare.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-concat.mlir.test PASSED in 2.1s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-constant.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-conv.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-dot.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-gather.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-max.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-mul.mlir.test PASSED in 1.4s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-pad.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-reshape.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-rsqrt.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-scatter.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl-sub.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-stablehlo-tfl.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-add.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-broadcast.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-clamp.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-concat.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-constant.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-conv.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-max.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-mul.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-pad.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-reshape.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-rsqrt.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo-sub.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:legalize-tfl-stablehlo.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/stablehlo/tests:odml-to-stablehlo-allow-tf.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/stablehlo/tests:odml-to-stablehlo-smuggle-resize.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/stablehlo/tests:optimize.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo-clamp.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo-concat.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo-conv.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo-division.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo-logistic.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo-multiply.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo-reduce-window.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo-resize-bilinear.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo-subtract.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-serialize-stablehlo.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:tf-tfl-translate-tf-quantize.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/stablehlo/tests:unfuse_mhlo_batch_norm.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:analyze-variables.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests:canonicalize.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/tests:const-fold.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:decompose-hybrid-quantization.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:default_quant_params.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests:dilated-conv.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:fuse-tftext.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests:get-arithmetic-count.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:guarantee_func_has_one_use.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:inlining.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests:insert_call_once_op.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests:legalize-tf-assert.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests:legalize-tf-hashtables.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:legalize-tf-no-runtime-verification.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests:legalize-tf-variables.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:legalize-tf-while.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/tests:legalize-tf.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/lite/tests:legalize_jax_random.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:lift_tflite_flex_ops.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:lower-static-tensor-list-default-to-single-batch.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests:lower-static-tensor-list-enable-dynamic-update-slice.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:lower-static-tensor-list.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests:modify_io_nodes.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/lite/tests:ops.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/lite/tests:optimize-after-quantization.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:optimize.mlir.test PASSED in 1.9s //tensorflow/compiler/mlir/lite/tests:optimize_functional_ops.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:optimize_no_verify.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:optimize_op_order.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/tests:partitioned-topological-sort.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:pin-ops-with-side-effects.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:post-quantize-dynamic-range.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests:post-quantize.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:prepare-composite-functions-tf.mlir.test PASSED in 1.7s //tensorflow/compiler/mlir/lite/tests:prepare-quantize-dynamic-range.mlir.test PASSED in 1.4s //tensorflow/compiler/mlir/lite/tests:prepare-quantize-post-training-16bits.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests:prepare-quantize-post-training.mlir.test PASSED in 1.4s //tensorflow/compiler/mlir/lite/tests:prepare-quantize-signed.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests:prepare-quantize.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/lite/tests:prepare-tf-fake-quant-4bit.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/tests:prepare-tf-fake-quant.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests:prepare-tf-with-allowing-bf16-and-f16-type-legalization.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:prepare-tf.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/tests:quantize-dynamic-range.mlir.test PASSED in 3.3s //tensorflow/compiler/mlir/lite/tests:quantize-numeric-verify.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests:quantize-variables.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests:quantize.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/tests:raise-custom-ops.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests:reduce_while_operands.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests:shape-inference.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests:split-merged-operands.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests:tfl_while_op_licm.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:tfl_while_outline.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests:trim-functions-tf.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests:unfold-large-splat-constant.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/debuginfo:v1_1.0_224_frozen.wrong_attr.line.part.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/debuginfo:v1_1.0_224_frozen.wrong_attr.stack.part.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/end2end:add.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests/end2end:back2back_fake_quant.pbtxt.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests/end2end:control_flow_v1.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/end2end:conv_2d.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/end2end:conv_2d_nchw.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/end2end:custom_opdef.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/end2end:disallow_stateful_partitioned_call.pbtxt.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests/end2end:fake_quant_per_channel.pbtxt.test PASSED in 1.3s //tensorflow/compiler/mlir/lite/tests/end2end:fake_quant_per_channel_4bit.pbtxt.test PASSED in 1.3s //tensorflow/compiler/mlir/lite/tests/end2end:fake_quant_without_identity.pbtxt.test PASSED in 1.2s //tensorflow/compiler/mlir/lite/tests/end2end:fake_quant_without_identity_4bit.pbtxt.test PASSED in 1.2s //tensorflow/compiler/mlir/lite/tests/end2end:graph-input-node.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/end2end:graph_with_placeholder_with_default.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/end2end:if_op.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/end2end:quant_stats.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests/end2end:unroll_batch_matmul.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests/end2end:unroll_batch_matmul_disabled.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:basic_lstm.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:bucketize.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:constants.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:control_edges.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:custom_op.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:dynamic_shape.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:external_constant.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:if_op.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:import_json.json.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:importer_test_min_max.cc.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:importer_test_min_max.cc.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:input_arrays.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:input_output_names_attr.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:legacy_reshape.json.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:lstm.json.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:lstm.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:many_attribute_op.mlir.test PASSED in 1.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:math.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:matmul.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:multi_output_op.json.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:optional.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:optional_input.json.test PASSED in 1.7s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:output_arrays.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:pruning.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:pruning_function_input_as_output.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:quant_stats.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:quantization.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:reshape.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:signature.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:signature_with_multiple_entry_points.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:simple.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:tf_variant_type.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:unranked_function_output.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:unranked_tensor.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/tests/flatbuffer2mlir:while_op.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2exec:tfl_while_op.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:basic_lstm.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:bucketize.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:custom_op_with_tflite_op.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:depthwise_conv2d.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:depthwise_conv2d_v2.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:disable_builtin.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:disable_custom.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:disable_flex.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:disable_flex_enable_builtin.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:dynamic_shape_constant.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:fake_quant.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:flex_exclusively.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:flex_op_with_complex128.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:flex_op_with_f64.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:flex_op_with_tflite_op.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:fully_connected.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:fully_connected_v2.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:hashtable_resource.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:if_op.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:logical.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:low_bit_packing.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:lstm.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:lstm_asym_attr.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:lstm_quantized.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:math.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:metadata.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:mul_v2.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:mul_v3.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:nn.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:numeric_verify.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:optional.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:quantization.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:reshape.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:signature_def.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:signature_def_output_override.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:signature_def_with_multiple_entry_points.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:signature_def_with_no_inputs.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:simple.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:simple_with_connected_control_nodes.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:simple_with_unconnected_control_nodes.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:svdf.mlir.test PASSED in 1.8s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:svdf_v2.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:tf_entry_function.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:tfl_while_op.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:transpose_conv_optional.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:type_attr.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:unidirectional_sequence_lstm.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:unidirectional_sequence_rnn.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:unranked_tensor.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:unsorted_segment_prod.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:variant_type_on_func.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:variant_type_on_op.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/lite/tests/mlir2flatbuffer:while_op.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/quantization/tensorflow/calibrator:calibrator_singleton_test PASSED in 0.1s //tensorflow/compiler/mlir/quantization/tensorflow/calibrator:custom_aggregator_op_test PASSED in 16.7s //tensorflow/compiler/mlir/quantization/tensorflow/cc:const_op_size_test PASSED in 0.4s //tensorflow/compiler/mlir/quantization/tensorflow/cc:convert_asset_args_test PASSED in 6.1s //tensorflow/compiler/mlir/quantization/tensorflow/cc:save_variables_test PASSED in 0.5s //tensorflow/compiler/mlir/quantization/tensorflow/cc:status_macro_test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/debugging:mlir_dump_test PASSED in 0.3s //tensorflow/compiler/mlir/quantization/tensorflow/python:concurrency_test PASSED in 41.4s //tensorflow/compiler/mlir/quantization/tensorflow/python:pywrap_quantize_model_test PASSED in 18.8s //tensorflow/compiler/mlir/quantization/tensorflow/python:representative_dataset_test PASSED in 10.3s //tensorflow/compiler/mlir/quantization/tensorflow/tests:cast_bf16_ops_to_f32.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/quantization/tensorflow/tests:convert_custom_aggregation_op_to_quant_stats.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/quantization/tensorflow/tests:convert_fake_quant_to_qdq.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:convert_tf_quant_ops_to_mhlo.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/tests:convert_tpu_model_to_cpu.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/tests:duplicate_shape_determining_constants.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/tests:fake_quant_e2e_flow.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/quantization/tensorflow/tests:fake_quant_e2e_xla.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/quantization/tensorflow/tests:insert_custom_aggregation_ops.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/tests:insert_main_function.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:insert_quantized_functions.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/quantization/tensorflow/tests:insert_quantized_functions_drq.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/quantization/tensorflow/tests:insert_quantized_functions_weight_only.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/quantization/tensorflow/tests:insert_restore_op.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:insert_save_op.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/tests:issue_ids_of_custom_aggregation_ops.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:lift_quantizable_spots_as_functions.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:lift_quantizable_spots_as_functions_drq.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/quantization/tensorflow/tests:lift_quantizable_spots_as_functions_drq_min_elements.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/tests:lift_quantizable_spots_as_functions_xla.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/quantization/tensorflow/tests:mark_functions_noinline.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:merge_initializer_function_ops_to_main.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/quantization/tensorflow/tests:merge_save_function_ops_to_main.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/quantization/tensorflow/tests:optimize.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/tests:prepare_lifting.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/quantization/tensorflow/tests:prepare_quantize.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/quantization/tensorflow/tests:prepare_quantize_drq.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:prepare_quantize_drq_per_channel.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/tests:prepare_quantize_ptq.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:prepare_quantize_ptq_per_channel.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/quantization/tensorflow/tests:preprocess_op.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/quantization/tensorflow/tests:quantize.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/quantization/tensorflow/tests:quantize_composite_functions.mlir.test PASSED in 1.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:quantize_composite_functions_drq.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:quantize_composite_functions_weight_only.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/quantization/tensorflow/tests:quantize_composite_functions_xla.mlir.test PASSED in 1.9s //tensorflow/compiler/mlir/quantization/tensorflow/tests:quantize_drq.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/tests:quantize_xla.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/quantization/tensorflow/tests:remove_var_init_by_const.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/quantization/tensorflow/tests:replace_cast_hacks_with_tf_xla_ops.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/quantization/tensorflow/tests:replace_cast_hacks_with_tf_xla_ops_large_constants.mlir.test PASSED in 16.3s //tensorflow/compiler/mlir/quantization/tensorflow/tests:unfreeze_constants.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/quantization/tensorflow/utils:tf_to_xla_attribute_utils_test PASSED in 32.1s //tensorflow/compiler/mlir/tensorflow:bridge_logger_test PASSED in 5.4s //tensorflow/compiler/mlir/tensorflow:cluster_util_test PASSED in 0.3s //tensorflow/compiler/mlir/tensorflow:convert_tensor_test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow:convert_type_test PASSED in 0.3s //tensorflow/compiler/mlir/tensorflow:device_util_test PASSED in 0.3s //tensorflow/compiler/mlir/tensorflow:dump_graph_test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow:dump_mlir_util_test PASSED in 13.8s //tensorflow/compiler/mlir/tensorflow:error_util_test PASSED in 0.2s //tensorflow/compiler/mlir/tensorflow:tf_saved_model_test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow:tpu_rewrite_device_util_test PASSED in 0.3s //tensorflow/compiler/mlir/tensorflow/tests:add_functions_for_exported_names.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:annotate-parameter-replication.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:batchmatmul_to_einsum.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:breakup-islands.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:cannonicalize_ops_outside_compilation.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:canonicalize.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:canonicalize_compile_and_replicate_attributes.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:check_control_dependencies.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:cluster_formation.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:cluster_ops_by_policy.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests:cluster_outlining.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:cluster_tf_ops_pass.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:constant-fold.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:constant_op_device_assignment.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:convert-tf-control-flow-to-scf.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:convert_control_to_data_outputs.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:convert_launch_func_to_tf_call.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:convert_session_initializer_to_function.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:convert_to_legacy_compile_and_replicate_attributes.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/tensorflow/tests:decompose_reduce_dataset.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:decompose_resource_ops.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests:device_assignment.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:device_assignment_by_func_attr.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:device_attribute_to_launch.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:device_canonicalize.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:device_copy.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:drop_while_shape_invariant.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:einsum.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:embedding_pipelining.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:empty-main.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:end-to-end-tpu-reshard-variables.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:executor_canonicalize.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:executor_island_coarsening.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:executor_island_materialize_const.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:extract_head_tail_outside_compilation.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:extract_outside_compilation.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:extract_tpu_copy_with_dynamic_shape_op.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:fold-broadcast.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:freeze_variables.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:func-attr-invalid.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:func-attr.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:functional-control-flow-to-cfg.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:functional-control-flow-to-regions.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:functionalize-if-fail.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:functionalize-if.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:fused_kernel_matcher.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:gpu_fusion.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:graph_pruning.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:graph_pruning_preserve_ops.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:group_by_dialect.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:guarantee-all-funcs-one-use.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:hoist_loop_invariant.mlir.test PASSED in 1.6s //tensorflow/compiler/mlir/tensorflow/tests:hoist_replicate_invariant_resource_writes.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:host_launch_to_outside_compiled.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:init_text_file_to_import.mlir.test PASSED in 1.6s //tensorflow/compiler/mlir/tensorflow/tests:init_text_file_to_import_invalid.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:init_text_file_to_import_saved_model.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:inlining.mlir.test PASSED in 1.9s //tensorflow/compiler/mlir/tensorflow/tests:isolate-placer.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:launch_outlining.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:launch_to_device_attribute.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:launch_to_device_attribute_legacy.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:layout_optimization_layout_assignment_gpu_cc_60.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:layout_optimization_layout_assignment_gpu_cc_70.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:layout_optimization_layout_assignment_to_nchw.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:layout_optimization_layout_assignment_to_nhwc.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:layout_optimization_move_transposes_begin.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:layout_optimization_move_transposes_end.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:layout_optimization_to_nchw.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:layout_optimization_to_nhwc.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/tensorflow/tests:legalize_hlo.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/tensorflow/tests:legalize_tfg.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:legalize_tfg_arg_control_dep.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:legalize_tfg_with_control_flow.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:localize_var_handles.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:lower_globals_to_ml_program.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:lower_globals_to_ml_program_invalid.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:lower_quantized.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:lower_tf.mlir.test PASSED in 1.6s //tensorflow/compiler/mlir/tensorflow/tests:lower_variable_ops_to_ml_program.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:mark_input_output_aliases.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests:mark_ops_for_outside_compilation.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:materialize_passthrough_op.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:merge_control_flow.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:mlprogram.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:name_anonymous_iterators.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests:optimize-arg-operand-constraint.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:optimize.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:order_by_dialect.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:outside_compiled_to_host_launch.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:parallel_execute_to_islands.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:parallel_execute_to_islands_legacy.mlir.test PASSED in 1.5s //tensorflow/compiler/mlir/tensorflow/tests:prepare_tpu_computation_for_tf_export.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:promote_resources_to_args.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:promote_resources_to_args_functions.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:promote_var_handles_to_args.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:readonly_references_to_resources.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:region-control-flow-to-functional.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/tensorflow/tests:remove_unused_arguments.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:remove_unused_while_results.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:replica_id_to_device_ordinal.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:replicate_invariant_op_hoisting.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:replicate_tensor_list_init_ops.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/tensorflow/tests:replicate_to_island.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:replicate_to_island_legacy.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:resource-alias-analysis-test.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:resource-device-inference.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:resource_analyzer.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:resource_inlining.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/tensorflow/tests:resource_op_lifting.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:rewrite_tpu_embedding_ops.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:roundtrip-tf-executor.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:shape_inference.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:side-effect-analysis-test.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/tensorflow/tests:sink_constant.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:split_into_island_per_op.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:stack_ops_decomposition.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:strip_noinline.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:strip_saved_module_metadata.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:strip_tf_attributes.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tensor_array_ops_decomposition.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/tensorflow/tests:tensor_list_ops_decomposition.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tf-executor-to-functional.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:tf-functional-to-executor.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:tf-ops.mlir.test PASSED in 2.7s //tensorflow/compiler/mlir/tensorflow/tests:tf-reduce-identity.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tf_data_fuse_map_and_batch.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:tf_data_fuse_pmap_and_batch.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:tf_device_index_selector.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:tf_device_ops.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/tensorflow/tests:tf_device_ops_invalid.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:tf_executor_ops.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:tf_executor_ops_invalid.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tf_executor_ops_location_roundtrip.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:tf_executor_ops_printer.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/tensorflow/tests:tf_executor_ops_side_effect.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tf_optimize.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_deduplicate_bound_input_bindings.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_freeze_assets.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_freeze_global_tensors.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_freeze_global_tensors_mutable_tensors.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_initialize_variables_in_session_init.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_initialize_variables_in_session_init_fail.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_lift_variables.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_lift_variables_invalid_session.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_mark_initialized_variables.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_ops.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_ops_invalid.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_optimize_global_tensors.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_optimize_global_tensors_interprocedural.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tf_saved_model_remove_vars_in_session_initializer.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:tf_side_effect.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tf_trait_folds.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:tpu-annotate-dynamic-shape-inputs.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:tpu-cluster-cleanup-attributes.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tpu-dynamic-layout-pass.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tpu-merge-variables-with-execute.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tpu-multiple-while-body-func.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:tpu-resource-read-for-write.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tpu-variable-runtime-reformatting.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tpu_cluster_formation.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tpu_colocate_composite_resource_ops.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tpu_device_propagation.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tpu_host_computation_expansion.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tpu_identity_pruning.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tpu_parallel_execute_sink_resource_write.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:tpu_partitioned_op_conversion.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tpu_reorder_replicate_and_partitioned_inputs.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests:tpu_resource_partitioning.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:tpu_rewrite.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests:tpu_sharding_identification.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tpu_space_to_depth_pass.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:tpu_tail_with_tobool_op.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests:tpu_update_embedding_enqueue_op_inputs.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests:tpu_validate_inputs.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:transpose-op.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:unroll-batch-matmul.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/tensorflow/tests:update_control_dependencies.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/tensorflow/tests:warn_when_using_deprecated_dumps.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests:while_licm.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:xla_cluster_formation.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:xla_inline_device_ops.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests:xla_rewrite.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:add.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:argument-sharding-invalid.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:argument-sharding.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:constant-folding-hook.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:constant-folding.mlir.test PASSED in 1.4s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:graph-resource.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:graph-resource.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:graph.pbtxt.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:mlir-module-serialized-str-attr.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:replicate-tensor-list-init-ops.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:result-sharding.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:serialized-mlir-module-str-attr-invalid.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:serialized-mlir-module-str-attr.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:shape-inference-after-legalization.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:shape-inference.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/compile_mlir_util:stablehlo_add.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/executor_tpuv1_island_coarsening:executor_tpuv1_island_coarsening.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/executor_tpuv1_island_coarsening:while_op.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/executor_tpuv1_island_inlining:executor_tpuv1_inline_tpu_island.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/executor_tpuv1_island_inlining:while_op.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/executor_tpuv1_outline_island:case_op.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/executor_tpuv1_outline_island:executor_tpuv1_outline_tpu_island.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/executor_tpuv1_outline_island:while_op.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:add.pbtxt.test PASSED in 1.2s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:arg-as-fetch.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:arg-control-dep.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:arg-data-type-with-subtype.pbtxt.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:arg-data-type.pbtxt.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:arg-multi-data-type-with-subtype.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:arg-retval-attrs.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:case_op.pbtxt.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:const-values.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:device-arg-retval-attr.pbtxt.test PASSED in 1.2s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:empty-input-shapes.pbtxt.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:empty-value-attr.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:feed-as-fetch.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:feed-control-dep.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:force_shared_name_for_resource_ops.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:function-func-attr.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:functional-if-ops.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:functional-while-ops.pbtxt.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-as-function-control-ret.pbtxt.test PASSED in 1.3s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-as-function-retval-of-arg.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-as-function.pbtxt.test PASSED in 1.4s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-custom-operation.pbtxt.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-default-attr.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-device-retval.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-empty-tensor-content.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-func-attr.pbtxt.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-function-call.pbtxt.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-function-control-ret-diff-island.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-function-control-ret-same-island.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-function-defs.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-function-input-shapes.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-function-name-bug.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-function-resource-args.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-gradient-def.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-input-func-arg-name-collision.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-library.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-malformed.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-scalar-input.pbtxt.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-uint8-return.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-undefined-output.pbtxt.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-version-info.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:graph-while-loop.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:invalid-output-index.pbtxt.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:legacy-fed-input-without-inputs.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:merge_node_with_function.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:mlir_passthrough_op.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:multi-output-feeds.pbtxt.test PASSED in 1.0s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:multiple-use-next-iteration.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:node-locations.pbtxt.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:output-shapes-attr.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:output-shapes.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:parse_example.pbtxt.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:parse_example_v2.pbtxt.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:partial-device-name.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:prune_unused_nodes.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:quint8-const.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:shape-attrs.pbtxt.test PASSED in 1.1s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:stateful-attribute.pbtxt.test PASSED in 1.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:string-attr.pbtxt.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:switch_n.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:target.pbtxt.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:tensor-list.pbtxt.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:tf-data-pipeline.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir:unregistered_kernel.pbtxt.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/graphdef2mlir/batch_use_same_function:saved_model.pbtxt.test PASSED in 1.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:aliasing_arg_attr.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:case.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:convert_tensor.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:derived_shape_attr.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:derived_size_attr.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:device-arg-retval-attr.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:export_main_to_flib.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:fetch_feed_names.mlir.test PASSED in 1.9s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:func_attr.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:func_list_attr.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:function-control-ret.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:function-order.mlir.test PASSED in 1.5s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:function-resource-args-handle-info.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:function-resource-args.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:functional-if-ops.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:functional-while-ops.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:graph-as-function.mlir.test PASSED in 2.3s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:infer_derived_attribute.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:invalid_input.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:legalized_name.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:missing-main.mlir.test PASSED in 1.4s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:noop.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:optional_symbol_ref.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:output-shapes-attr.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:parse_example.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:parse_example_v2.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:preserve-entry-func-names.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:ref-type-attr.mlir.test PASSED in 2.0s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:ref-while-loop.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:shape_list_attr.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:simple.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:simple_tf_dialect_op.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:stringescape.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:switchn.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:tf-gradient-attr.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:tf-legacy-call.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:tf_add.mlir.test PASSED in 3.3s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:tf_identity_n.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:tf_tpu_embedding_ops.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:type_attr.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:type_list_attr.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:unique_name.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:unique_output_name.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/mlir2graphdef:while-loop.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tensorflow/tests/tf_to_hlo_pipeline:sccp-post-shape-inference.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tensorflow/tests/tpu_bridge_v1:end_to_end.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tf2xla/api/v0:compile_mlir_util_test PASSED in 5.4s //tensorflow/compiler/mlir/tf2xla/api/v0:compile_tf_graph_test PASSED in 0.4s //tensorflow/compiler/mlir/tf2xla/api/v1:legalize_tf_test PASSED in 19.7s //tensorflow/compiler/mlir/tf2xla/tests:adjust-layout.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tf2xla/tests:convert-mhlo-quant-to-int.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/tf2xla/tests:hlo_xla_runtime_pipeline.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tf2xla/tests:hlo_xla_sparsification.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-BatchMatMulV2.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-binary-elementwise.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-collective.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-communication.mlir.test PASSED in 1.0s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-include-tf2xla-fallback.mlir.test PASSED in 1.3s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-no-tf2xla-fallback.mlir.test PASSED in 5.3s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-prefer-tf2xla.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-types.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-with-tf2xla-hlo-importer-and-inline.mlir.test PASSED in 1.1s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-with-tf2xla-hlo-importer.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf-with-tf2xla.mlir.test PASSED in 1.2s //tensorflow/compiler/mlir/tf2xla/tests:legalize-tf.mlir.test PASSED in 11.4s //tensorflow/compiler/mlir/tf2xla/tests:tfxla_device_specific_transformations_cpu.mlir.test PASSED in 2.4s //tensorflow/compiler/mlir/tf2xla/tests:tfxla_device_specific_transformations_gpu.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tf2xla/tests:verify-tfxla-legalization-no-chlo.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tf2xla/tests:verify-tfxla-legalization.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tf2xla/transforms:tf2xla_rewriter_test PASSED in 14.0s //tensorflow/compiler/mlir/tf2xla/transforms:verify_tfxla_legalization_test PASSED in 12.6s //tensorflow/compiler/mlir/tf2xla/transforms:xla_legalize_targets_test PASSED in 0.7s //tensorflow/compiler/mlir/tf2xla/transforms:xla_legalize_tf_test PASSED in 2.7s //tensorflow/compiler/mlir/tfr:graph_decompose_test PASSED in 9.2s //tensorflow/compiler/mlir/tfr:node_expansion_test PASSED in 12.2s //tensorflow/compiler/mlir/tfr:op_reg_gen_test PASSED in 14.4s //tensorflow/compiler/mlir/tfr:tfr_decompose_ctx_test PASSED in 7.9s //tensorflow/compiler/mlir/tfr:tfr_gen_test PASSED in 18.6s //tensorflow/compiler/mlir/tfr/examples/customization:test_ops_test PASSED in 20.5s //tensorflow/compiler/mlir/tfr/examples/mnist:mnist_ops_test PASSED in 24.2s //tensorflow/compiler/mlir/tfr/examples/pad:pad_ops_test PASSED in 27.4s //tensorflow/compiler/mlir/tools/kernel_gen/tests:buffer_deallocation.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tools/kernel_gen/tests:buffer_reuse.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tools/kernel_gen/tests:bufferize.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tools/kernel_gen/tests:copy_cleanup.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tools/kernel_gen/tests:embed_tf_framework.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tools/kernel_gen/tests:invalid.mlir.test PASSED in 0.4s //tensorflow/compiler/mlir/tools/kernel_gen/tests:isinf.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tools/kernel_gen/tests:ops.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tools/kernel_gen/tests:parallel_loops_to_sequential.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tools/kernel_gen/tests:rewrite_tf_framework_assert.mlir.test PASSED in 0.8s //tensorflow/compiler/mlir/tools/kernel_gen/tests:tanh.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tools/kernel_gen/tests:tf-legalize-to-lmhlo.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tools/kernel_gen/tests:tf_abi_knowledge.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tools/kernel_gen/tests:tf_framework_legalize_to_llvm.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tools/kernel_gen/tests:tf_kernel_gpu_launch_to_llvm.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tools/kernel_gen/tests:tf_to_jit_invocations.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tosa/tests:convert-tfl-uint8.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tosa/tests:convert_metadata.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tosa/tests:fuse-bias-tf.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tosa/tests:lower-complex-types.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tosa/tests:lower_global_tensors.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tosa/tests:multi_add.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tosa/tests:retain_call_once_funcs.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tosa/tests:strip-quant-types.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tosa/tests:strip_metadata.mlir.test PASSED in 0.7s //tensorflow/compiler/mlir/tosa/tests:tf-tfl-to-tosa-pipeline.mlir.test PASSED in 0.6s //tensorflow/compiler/mlir/tosa/tests:tf-to-tosa-pipeline.mlir.test PASSED in 1.4s //tensorflow/compiler/mlir/tosa/tests:tfl-to-tosa-dequantize_softmax.mlir.test PASSED in 0.5s //tensorflow/compiler/mlir/tosa/tests:tfl-to-tosa-pipeline-filtered.mlir.test PASSED in 0.9s //tensorflow/compiler/mlir/tosa/tests:tfl-to-tosa-pipeline.mlir.test PASSED in 5.4s //tensorflow/compiler/mlir/tosa/tests:verify_fully_converted.mlir.test PASSED in 0.8s //tensorflow/compiler/tests:adadelta_test_cpu PASSED in 14.8s //tensorflow/compiler/tests:adagrad_da_test_cpu PASSED in 16.2s //tensorflow/compiler/tests:adagrad_test_cpu PASSED in 10.3s //tensorflow/compiler/tests:adam_test_cpu PASSED in 15.9s //tensorflow/compiler/tests:add_n_test_cpu PASSED in 8.9s //tensorflow/compiler/tests:argminmax_test_cpu PASSED in 16.1s //tensorflow/compiler/tests:argminmax_test_cpu_mlir_bridge_test PASSED in 17.7s //tensorflow/compiler/tests:bucketize_op_test_cpu PASSED in 9.3s //tensorflow/compiler/tests:bucketize_op_test_cpu_mlir_bridge_test PASSED in 7.7s //tensorflow/compiler/tests:case_test_cpu PASSED in 8.5s //tensorflow/compiler/tests:cast_ops_test_cpu PASSED in 11.2s //tensorflow/compiler/tests:cast_ops_test_cpu_mlir_bridge_test PASSED in 9.2s //tensorflow/compiler/tests:categorical_op_test_cpu PASSED in 11.4s //tensorflow/compiler/tests:categorical_op_test_cpu_mlir_bridge_test PASSED in 13.8s //tensorflow/compiler/tests:cholesky_op_test_cpu PASSED in 16.9s //tensorflow/compiler/tests:cholesky_op_test_cpu_mlir_bridge_test PASSED in 20.9s //tensorflow/compiler/tests:clustering_test_cpu PASSED in 9.0s //tensorflow/compiler/tests:clustering_test_cpu_mlir_bridge_test PASSED in 10.3s //tensorflow/compiler/tests:concat_ops_test_cpu PASSED in 9.6s //tensorflow/compiler/tests:concat_ops_test_cpu_mlir_bridge_test PASSED in 12.0s //tensorflow/compiler/tests:cond_test_cpu PASSED in 8.8s //tensorflow/compiler/tests:const_arg_test_cpu PASSED in 7.6s //tensorflow/compiler/tests:const_test_cpu PASSED in 10.7s //tensorflow/compiler/tests:data_format_ops_test_cpu PASSED in 12.1s //tensorflow/compiler/tests:data_format_ops_test_cpu_mlir_bridge_test PASSED in 13.4s //tensorflow/compiler/tests:dense_layer_test_cpu PASSED in 11.0s //tensorflow/compiler/tests:dynamic_slice_ops_test_cpu PASSED in 9.5s //tensorflow/compiler/tests:dynamic_slice_ops_test_cpu_mlir_bridge_test PASSED in 13.6s //tensorflow/compiler/tests:dynamic_stitch_test_cpu PASSED in 6.6s //tensorflow/compiler/tests:dynamic_stitch_test_cpu_mlir_bridge_test PASSED in 7.5s //tensorflow/compiler/tests:eager_test_cpu PASSED in 18.8s //tensorflow/compiler/tests:einsum_op_test_cpu PASSED in 7.8s //tensorflow/compiler/tests:einsum_op_test_cpu_mlir_bridge_test PASSED in 9.4s //tensorflow/compiler/tests:ensure_shape_op_test_cpu PASSED in 9.6s //tensorflow/compiler/tests:extract_image_patches_op_test_cpu PASSED in 7.2s //tensorflow/compiler/tests:extract_image_patches_op_test_cpu_mlir_bridge_test PASSED in 11.1s //tensorflow/compiler/tests:fake_quant_ops_test_cpu PASSED in 13.4s //tensorflow/compiler/tests:fake_quant_ops_test_cpu_mlir_bridge_test PASSED in 13.2s //tensorflow/compiler/tests:fifo_queue_test_cpu PASSED in 8.8s //tensorflow/compiler/tests:fifo_queue_test_cpu_mlir_bridge_test PASSED in 9.6s //tensorflow/compiler/tests:ftrl_ops_test_cpu PASSED in 20.3s //tensorflow/compiler/tests:ftrl_ops_test_cpu_mlir_bridge_test PASSED in 16.4s //tensorflow/compiler/tests:ftrl_test_cpu PASSED in 15.0s //tensorflow/compiler/tests:function_test_cpu PASSED in 8.8s //tensorflow/compiler/tests:function_test_cpu_mlir_bridge_test PASSED in 9.3s //tensorflow/compiler/tests:gather_nd_op_test_cpu PASSED in 8.5s //tensorflow/compiler/tests:gather_nd_op_test_cpu_mlir_bridge_test PASSED in 10.5s //tensorflow/compiler/tests:gather_test_cpu PASSED in 41.6s //tensorflow/compiler/tests:gather_test_cpu_mlir_bridge_test PASSED in 52.7s //tensorflow/compiler/tests:jit_test_cpu PASSED in 51.6s //tensorflow/compiler/tests:listdiff_op_test_cpu PASSED in 10.5s //tensorflow/compiler/tests:listdiff_op_test_cpu_mlir_bridge_test PASSED in 15.1s //tensorflow/compiler/tests:lrn_ops_test_cpu PASSED in 8.4s //tensorflow/compiler/tests:lrn_ops_test_cpu_mlir_bridge_test PASSED in 10.5s //tensorflow/compiler/tests:lstm_test_cpu PASSED in 23.1s //tensorflow/compiler/tests:manip_ops_test_cpu PASSED in 11.1s //tensorflow/compiler/tests:manip_ops_test_cpu_mlir_bridge_test PASSED in 12.5s //tensorflow/compiler/tests:matrix_band_part_test_cpu PASSED in 40.3s //tensorflow/compiler/tests:matrix_band_part_test_cpu_mlir_bridge_test PASSED in 38.3s //tensorflow/compiler/tests:matrix_inverse_op_test_cpu PASSED in 18.1s //tensorflow/compiler/tests:matrix_inverse_op_test_cpu_mlir_bridge_test PASSED in 19.8s //tensorflow/compiler/tests:matrix_solve_op_test_cpu PASSED in 9.4s //tensorflow/compiler/tests:matrix_solve_op_test_cpu_mlir_bridge_test PASSED in 9.7s //tensorflow/compiler/tests:matrix_triangular_solve_op_test_cpu PASSED in 24.0s //tensorflow/compiler/tests:matrix_triangular_solve_op_test_cpu_mlir_bridge_test PASSED in 26.0s //tensorflow/compiler/tests:momentum_test_cpu PASSED in 26.8s //tensorflow/compiler/tests:nary_ops_test_cpu PASSED in 10.7s //tensorflow/compiler/tests:nary_ops_test_cpu_mlir_bridge_test PASSED in 9.7s //tensorflow/compiler/tests:nullary_ops_test_cpu PASSED in 8.8s //tensorflow/compiler/tests:nullary_ops_test_cpu_mlir_bridge_test PASSED in 10.7s //tensorflow/compiler/tests:placeholder_test_cpu PASSED in 7.2s //tensorflow/compiler/tests:placeholder_test_cpu_mlir_bridge_test PASSED in 7.6s //tensorflow/compiler/tests:proximal_adagrad_test_cpu PASSED in 10.6s //tensorflow/compiler/tests:proximal_gradient_descent_test_cpu PASSED in 7.8s //tensorflow/compiler/tests:quantized_ops_test_cpu PASSED in 9.2s //tensorflow/compiler/tests:reduce_window_test_cpu PASSED in 9.1s //tensorflow/compiler/tests:reduce_window_test_cpu_mlir_bridge_test PASSED in 8.1s //tensorflow/compiler/tests:reshape_op_test_cpu PASSED in 8.5s //tensorflow/compiler/tests:reshape_op_test_cpu_mlir_bridge_test PASSED in 9.1s //tensorflow/compiler/tests:reverse_ops_test_cpu PASSED in 10.9s //tensorflow/compiler/tests:reverse_ops_test_cpu_mlir_bridge_test PASSED in 12.3s //tensorflow/compiler/tests:reverse_sequence_op_test_cpu PASSED in 8.0s //tensorflow/compiler/tests:reverse_sequence_op_test_cpu_mlir_bridge_test PASSED in 14.0s //tensorflow/compiler/tests:risc_ops_test_cpu_mlir_bridge_test PASSED in 8.4s //tensorflow/compiler/tests:rmsprop_test_cpu PASSED in 10.7s //tensorflow/compiler/tests:scatter_nd_op_test_cpu PASSED in 21.9s //tensorflow/compiler/tests:scatter_nd_op_test_cpu_mlir_bridge_test PASSED in 25.4s //tensorflow/compiler/tests:searchsorted_op_test_cpu PASSED in 10.4s //tensorflow/compiler/tests:searchsorted_op_test_cpu_mlir_bridge_test PASSED in 10.0s //tensorflow/compiler/tests:segment_reduction_ops_test_cpu PASSED in 23.9s //tensorflow/compiler/tests:segment_reduction_ops_test_cpu_mlir_bridge_test PASSED in 34.2s //tensorflow/compiler/tests:self_adjoint_eig_op_test_cpu PASSED in 17.8s //tensorflow/compiler/tests:self_adjoint_eig_op_test_cpu_mlir_bridge_test PASSED in 18.2s //tensorflow/compiler/tests:slice_ops_test_cpu PASSED in 16.8s //tensorflow/compiler/tests:slice_ops_test_cpu_mlir_bridge_test PASSED in 22.5s //tensorflow/compiler/tests:sparse_to_dense_op_test_cpu PASSED in 8.7s //tensorflow/compiler/tests:sparse_to_dense_op_test_cpu_mlir_bridge_test PASSED in 14.4s //tensorflow/compiler/tests:stack_ops_test_cpu PASSED in 7.2s //tensorflow/compiler/tests:tensor_list_ops_test_cpu PASSED in 10.9s //tensorflow/compiler/tests:tridiagonal_matmul_ops_test_cpu PASSED in 14.8s //tensorflow/compiler/tests:tridiagonal_matmul_ops_test_cpu_mlir_bridge_test PASSED in 14.3s //tensorflow/compiler/tests:tridiagonal_solve_ops_test_cpu PASSED in 13.2s //tensorflow/compiler/tests:tridiagonal_solve_ops_test_cpu_mlir_bridge_test PASSED in 11.8s //tensorflow/compiler/tests:unique_ops_test_cpu PASSED in 9.9s //tensorflow/compiler/tests:variable_ops_test_cpu PASSED in 25.1s //tensorflow/compiler/tests:variable_ops_test_cpu_mlir_bridge_test PASSED in 16.4s //tensorflow/compiler/tests:where_op_test_cpu PASSED in 15.7s //tensorflow/compiler/tests:while_test_cpu PASSED in 9.0s //tensorflow/compiler/tests:xla_call_module_test_cpu PASSED in 10.7s //tensorflow/compiler/tests:xla_custom_call_ops_test_cpu PASSED in 7.2s //tensorflow/compiler/tests:xla_device_gpu_test_cpu PASSED in 9.8s //tensorflow/compiler/tests:xla_device_test_cpu PASSED in 12.3s //tensorflow/compiler/tests:xla_device_test_cpu_mlir_bridge_test PASSED in 15.7s //tensorflow/compiler/tests:xla_ops_test_cpu PASSED in 33.5s //tensorflow/compiler/tests:xla_ops_test_cpu_mlir_bridge_test PASSED in 43.9s //tensorflow/compiler/tests:xla_test_test PASSED in 7.8s //tensorflow/compiler/tf2xla:const_analysis_test PASSED in 7.7s //tensorflow/compiler/tf2xla:cpu_function_runtime_test PASSED in 0.1s //tensorflow/compiler/tf2xla:functionalize_cond_test PASSED in 0.9s //tensorflow/compiler/tf2xla:functionalize_control_flow_test PASSED in 1.1s //tensorflow/compiler/tf2xla:fused_batchnorm_reserve_space_test_cpu PASSED in 22.2s //tensorflow/compiler/tf2xla:graph_compiler_test PASSED in 6.0s //tensorflow/compiler/tf2xla:literal_util_test PASSED in 0.6s //tensorflow/compiler/tf2xla:resource_operation_table_test PASSED in 9.5s //tensorflow/compiler/tf2xla:resource_util_test_cpu PASSED in 2.1s //tensorflow/compiler/tf2xla:sharding_util_test PASSED in 0.6s //tensorflow/compiler/tf2xla:tf2xla_opset_test PASSED in 10.6s //tensorflow/compiler/tf2xla:tf2xla_test PASSED in 18.0s //tensorflow/compiler/tf2xla:tf2xla_util_test PASSED in 0.8s //tensorflow/compiler/tf2xla:xla_compiler_test PASSED in 15.6s //tensorflow/compiler/tf2xla:xla_jit_compiled_cpu_function_test PASSED in 18.2s //tensorflow/compiler/tf2xla:xla_op_registry_test PASSED in 7.0s //tensorflow/compiler/tf2xla/kernels:rng_converter_utils_test PASSED in 1.2s //tensorflow/compiler/xla:array2d_test PASSED in 0.1s //tensorflow/compiler/xla:array3d_test PASSED in 0.2s //tensorflow/compiler/xla:array4d_test PASSED in 0.2s //tensorflow/compiler/xla:array_test PASSED in 0.5s //tensorflow/compiler/xla:bit_cast_test PASSED in 0.1s //tensorflow/compiler/xla:comparison_util_test PASSED in 0.1s //tensorflow/compiler/xla:debug_options_parsers_test PASSED in 0.2s //tensorflow/compiler/xla:index_util_test PASSED in 0.1s //tensorflow/compiler/xla:iterator_util_test PASSED in 0.2s //tensorflow/compiler/xla:layout_test PASSED in 0.9s //tensorflow/compiler/xla:layout_util_test PASSED in 0.2s //tensorflow/compiler/xla:literal_test PASSED in 0.2s //tensorflow/compiler/xla:parse_flags_from_env_test PASSED in 0.5s //tensorflow/compiler/xla:permutation_util_test PASSED in 0.1s //tensorflow/compiler/xla:primitive_util_test PASSED in 0.3s //tensorflow/compiler/xla:refcounting_hash_map_test PASSED in 0.1s //tensorflow/compiler/xla:reference_util_test PASSED in 0.2s //tensorflow/compiler/xla:shape_test PASSED in 0.4s //tensorflow/compiler/xla:shape_tree_test PASSED in 0.2s //tensorflow/compiler/xla:shape_util_test PASSED in 2.1s //tensorflow/compiler/xla:status_macros_test PASSED in 1.0s //tensorflow/compiler/xla:text_literal_reader_test PASSED in 0.2s //tensorflow/compiler/xla:text_literal_writer_test PASSED in 0.1s //tensorflow/compiler/xla:types_test PASSED in 0.3s //tensorflow/compiler/xla:util_test PASSED in 0.1s //tensorflow/compiler/xla:window_util_test PASSED in 0.5s //tensorflow/compiler/xla/client:padding_test PASSED in 0.1s //tensorflow/compiler/xla/client:xla_builder_test PASSED in 0.3s //tensorflow/compiler/xla/client/lib:arithmetic_test_cpu PASSED in 7.8s //tensorflow/compiler/xla/client/lib:comparators_test_cpu PASSED in 8.1s //tensorflow/compiler/xla/client/lib:constants_test_cpu PASSED in 8.1s //tensorflow/compiler/xla/client/lib:logdet_test_cpu PASSED in 8.8s //tensorflow/compiler/xla/client/lib:math_test_cpu PASSED in 14.8s //tensorflow/compiler/xla/client/lib:matrix_test_cpu PASSED in 10.5s //tensorflow/compiler/xla/client/lib:pooling_test_cpu PASSED in 9.3s //tensorflow/compiler/xla/client/lib:qr_test_cpu PASSED in 14.0s //tensorflow/compiler/xla/client/lib:slicing_test_cpu PASSED in 8.7s //tensorflow/compiler/xla/client/lib:sorting_test_cpu PASSED in 8.4s //tensorflow/compiler/xla/examples/axpy:stablehlo_compile_test PASSED in 9.6s //tensorflow/compiler/xla/experiments/sm_bandwidth_benchmark:sm_bw_test PASSED in 0.1s //tensorflow/compiler/xla/hlo/evaluator:hlo_evaluator_test PASSED in 2.5s //tensorflow/compiler/xla/hlo/experimental/auto_sharding:auto_sharding_test PASSED in 1.3s //tensorflow/compiler/xla/hlo/transforms:hlo_constant_splitter_test PASSED in 0.8s //tensorflow/compiler/xla/hlo/utils:hlo_live_range_test PASSED in 1.1s //tensorflow/compiler/xla/hlo/utils:hlo_matchers_test PASSED in 0.8s //tensorflow/compiler/xla/hlo/utils:hlo_sharding_util_test PASSED in 0.2s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:collective_ops.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:collective_ops_to_cpu_runtime.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:fft.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:legalize_i1_vector_transfers.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:lmhlo_custom_call.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:lmhlo_infeed.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:remove_copies_to_out_params.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:rng_bit_generator.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:xla_abi_legalization.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:xla_cpu_memref_element_cast_to_llvm.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/cpu/transforms/tests:xla_cpu_outfeed.mlir.test PASSED in 0.9s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:add_hlo_trace.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:gpu_launch.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:gpu_memcpy.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:gpu_memset.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_case.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_custom_call.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_fft.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_gpu_cholesky.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_gpu_conv.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_gpu_cublas_lt_matmul.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_gpu_gemm.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_infeed.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_outfeed.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_send_recv.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:lmhlo_while.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:memref_get_global_to_arg.mlir.test PASSED in 1.4s //tensorflow/compiler/xla/mlir/backends/gpu/transforms/tests:outline_cuda_graphs.mlir.test PASSED in 1.3s //tensorflow/compiler/xla/mlir/framework/tests:legalize-xla-framework.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/framework/tests:outline-with-xla-framework.mlir.test PASSED in 1.9s //tensorflow/compiler/xla/mlir/framework/tests:xla-framework.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/math/transforms/tests:math_optimization.mlir.test PASSED in 1.4s //tensorflow/compiler/xla/mlir/memref/transforms/tests:aligned_allocations.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir/runtime/ir/tests:ops.mlir.test PASSED in 1.5s //tensorflow/compiler/xla/mlir/runtime/ir/tests:ops_verify.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/runtime/ir/tests:testlib.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/runtime/transforms:calling_convention_test PASSED in 0.4s //tensorflow/compiler/xla/mlir/runtime/transforms:type_converter_test PASSED in 0.2s //tensorflow/compiler/xla/mlir/runtime/transforms/tests:compilation_pipeline.mlir.test PASSED in 1.5s //tensorflow/compiler/xla/mlir/runtime/transforms/tests:convert_asserts.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir/runtime/transforms/tests:convert_custom_calls.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/runtime/transforms/tests:export_functions.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/runtime/transforms/tests:ordinal_assignment.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/runtime/transforms/tests:rt_to_llvm.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/rewrites/tests:erase-op-without-results.mlir.test PASSED in 1.0s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/rewrites/tests:inline-scf-while.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/rewrites/tests:reduce-scf-forall-bounds.mlir.test PASSED in 0.9s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/rewrites/tests:replace-op-with-constant.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/rewrites/tests:replace-op-with-value.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/rewrites/tests:replace-operand-with-constant.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/rewrites/tests:return-operands-of-terminator-operands.mlir.test PASSED in 0.9s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/rewrites/tests:truncate-function.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/tests:bisect.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/tests:no-bug.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/tools/mlir_bisect/tests:snapshot.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir/tools/mlir_replay/public:execution_trace_utils_test PASSED in 0.2s //tensorflow/compiler/xla/mlir/utils:error_util_test PASSED in 0.1s //tensorflow/compiler/xla/mlir/xla_cpu/tests:bufferize.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir/xla_cpu/tests:invalid.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir/xla_cpu/tests:ops.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/bufferization/hlo_one_shot_bufferize.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/chlo/chlo_legalize_to_hlo_broadcasts.mlir.test PASSED in 1.0s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/chlo/chlo_legalize_to_hlo_no_broadcasts.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/chlo/chlo_legalize_to_mhlo.mlir.test PASSED in 1.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/chlo/sparse_chlo_legalize_to_linalg.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/deallocation/buffer_reuse.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/deallocation/convert_deallocation_ops_to_llvm.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/deallocation/deallocate.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/deallocation/deallocate_invalid.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/deallocation/deallocation_ops.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/deallocation/deallocation_simplification.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/deallocation/deallocation_to_scf.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/deallocation/split_alloc_tensors.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/add_debug_info.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/bufferization.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/collapse-shape.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/collect_stats.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/compose_extract_insert_slice.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/batch_matmul.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/conv_2d_nhwc_hwcf.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/dot.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/duplicate_fusions.mlir.test PASSED in 1.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/fibonacci.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/fusion_outlining.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/fusion_planning_for_cpu.mlir.test PASSED in 1.3s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/inline_fusion_clusters.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/map_bcast_map.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/map_matmul.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/map_reduce.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/map_reduce_map.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/map_reshape_map.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/matmul.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/reduce_1d.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/reduce_1d_map.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/reduce_2d.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/reduce_window.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/reverse.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/scatter.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/sort.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/cpu_tiling/transpose.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/greedy_fusion.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/invalid.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/lower_vectors.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/nested_tiling_softmax.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/ops.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/optimize_linalg_ops.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/rewrite_forall_to_for.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/simplify_dead_copy.mlir.test PASSED in 1.8s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/tile_by_one.mlir.test PASSED in 1.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/tiling_softmax.mlir.test PASSED in 1.0s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/vectorize_copy.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/gml_st/vectorize_for_cpu.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/lhlo/lhlo-legalize-select-and-scatter.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/lhlo/lhlo-legalize-to-affine.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/lhlo/lhlo-legalize-to-gpu.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/lhlo/lhlo-legalize-to-parallel-loops.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/lhlo/lhlo-legalize-to-tensor-op.mlir.test PASSED in 1.0s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/lhlo/ops.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/lhlo_gpu/lhlo_gpu_ops.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/attrs.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/broadcast_propagation.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/bitcast.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/canonicalize.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/concatenate.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/convert.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/convolution.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/custom_call.mlir.test PASSED in 1.1s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/folder_limit.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/reduce.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/reshape.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/reverse.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/scatter.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/transpose.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/tuple.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/canonicalize/while.mlir.test PASSED in 1.0s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/constraint_fusion.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/convert_to_signless.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/expand_hlo_tuples.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/expand_ops_simplifier.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/group_reduction_dimensions.mlir.test PASSED in 1.2s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-collapse-elementwise-map.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-einsum-to-dot-general.mlir.test PASSED in 1.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-gather-to-torch-index-select.mlir.test PASSED in 1.1s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-rng-to-linalg.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-shape-ops-to-standard.mlir.test PASSED in 1.0s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-sort.mlir.test PASSED in 1.8s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-to-arithmetic.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-to-lhlo-only-dynamic.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-to-lhlo-unranked.mlir.test PASSED in 1.1s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-to-lhlo.mlir.test PASSED in 0.9s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-to-linalg.mlir.test PASSED in 4.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-to-memref-unranked.mlir.test PASSED in 0.9s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-to-memref.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-to-stablehlo-experimental.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/hlo-legalize-to-stablehlo.mlir.test PASSED in 1.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/inlining.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/invalid.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/legalize-control-flow.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/legalize-hlo-shape-computations.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/legalize-mhlo-to-thlo.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/legalize-to-std.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/lower-complex.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/lower-general-dot.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/materialize-broadcasts.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/merge_assuming_ops.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/mhlo_bytecode_customizations.mlir.test PASSED in 1.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/mhlo_canonicalize_dot.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/mhlo_canonicalize_gather.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/mhlo_canonicalize_reduction.mlir.test PASSED in 1.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/mhlo_canonicalize_scatter.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/mhlo_flatten_tuple.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/mhlo_infer_shape_type_methods.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/mhlo_ops_prettyprint.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/mhlo_reduce_pretty_print.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/ops.mlir.test PASSED in 1.2s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/optimize-hlo.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/prepare-for-export.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/reify-result-types.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/restrict_max_rank.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/shape_legalize_to_hlo.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/shape_reification.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/sink-constants-to-control-flow.mlir.test PASSED in 1.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/sparse_gendot_lower.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/sparse_lower.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/sparse_ops.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/sparse_rewriting.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/sparse_transpose.mlir.test PASSED in 0.3s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/stablehlo-legalize-to-hlo.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/symbolic-shape-optimization.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/unfuse_batch_norm.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/verifier_bounds.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/verifier_conv_op.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/verifier_reduce_op.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/verifier_reduce_window_op.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/verifier_scatter_op.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/verifier_select_and_scatter_op.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/verifier_while_op.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/mhlo/while_prettyprint.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/thlo/bufferize.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/thlo/canonicalize.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/thlo/invalid.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/thlo/legalize_sort.mlir.test PASSED in 1.5s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/thlo/ops.mlir.test PASSED in 1.0s //tensorflow/compiler/xla/mlir_hlo/tests:Dialect/thlo/tiling.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:alloc_to_arg.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:assuming-structural-propagation.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:buffer_packing.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:bufferize.mlir.test PASSED in 1.3s //tensorflow/compiler/xla/mlir_hlo/tests:bufferize_one_shot.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:collapse_parallel_loops_to_1d_pass.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:detensorize_scf_ops.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:index_type_llvm_lowering.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:legalize-trigonometric-to-approximation.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:lower_index_cast.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:propagate_static_shapes.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:rank-specialization.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:scalarization.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/mlir_hlo/tests:shape-component-analysis.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/mlir_hlo/tests:shape_simplification.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tests:test_userange.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:tile_loops.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tests:unbufferize.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/mlir_hlo/tests:unroll-loops.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tools/mlir_interpreter/framework/tests:interpreter_value_test PASSED in 0.1s //tensorflow/compiler/xla/mlir_hlo/tools/mlir_interpreter/framework/tests:tensor_or_memref_test PASSED in 0.1s //tensorflow/compiler/xla/mlir_hlo/tosa/tests:binary.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tosa/tests:nullary.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/mlir_hlo/tosa/tests:prepare-mhlo.mlir.test PASSED in 0.9s //tensorflow/compiler/xla/mlir_hlo/tosa/tests:ternary.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/mlir_hlo/tosa/tests:unary.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/pjrt:host_callback_test PASSED in 0.3s //tensorflow/compiler/xla/pjrt:lru_cache_test PASSED in 0.2s //tensorflow/compiler/xla/pjrt:pjrt_api_test PASSED in 0.2s //tensorflow/compiler/xla/pjrt:pjrt_client_test_cpu PASSED in 7.7s //tensorflow/compiler/xla/pjrt:pjrt_compiler_test PASSED in 0.3s //tensorflow/compiler/xla/pjrt:pjrt_executable_test PASSED in 0.2s //tensorflow/compiler/xla/pjrt:pjrt_stream_executor_client_test PASSED in 8.8s //tensorflow/compiler/xla/pjrt:semaphore_test PASSED in 0.2s //tensorflow/compiler/xla/pjrt:tf_pjrt_client_test PASSED in 8.0s //tensorflow/compiler/xla/pjrt:tfrt_cpu_pjrt_client_test PASSED in 8.2s //tensorflow/compiler/xla/pjrt:tracked_device_buffer_test PASSED in 8.0s //tensorflow/compiler/xla/pjrt:tracked_tfrt_cpu_device_buffer_test PASSED in 0.1s //tensorflow/compiler/xla/pjrt:transpose_test PASSED in 47.9s //tensorflow/compiler/xla/pjrt/c:pjrt_c_api_cpu_test PASSED in 6.7s //tensorflow/compiler/xla/pjrt/c:pjrt_c_api_helpers_test PASSED in 0.3s //tensorflow/compiler/xla/pjrt/distributed:client_server_test PASSED in 43.9s //tensorflow/compiler/xla/pjrt/distributed:service_test PASSED in 6.5s //tensorflow/compiler/xla/pjrt/gpu:se_gpu_pjrt_client_test PASSED in 2.4s //tensorflow/compiler/xla/python:outfeed_receiver_test_cpu PASSED in 8.2s //tensorflow/compiler/xla/python/ifrt:array_test PASSED in 0.9s //tensorflow/compiler/xla/python/ifrt:array_test_no_impl PASSED in 0.2s //tensorflow/compiler/xla/python/ifrt:client_test_no_impl PASSED in 0.2s //tensorflow/compiler/xla/python/ifrt:executable_test_no_impl PASSED in 1.3s //tensorflow/compiler/xla/python/ifrt:future_test PASSED in 0.5s //tensorflow/compiler/xla/python/ifrt:index_domain_test PASSED in 0.3s //tensorflow/compiler/xla/python/ifrt:index_test PASSED in 0.4s //tensorflow/compiler/xla/python/ifrt:shape_test PASSED in 0.3s //tensorflow/compiler/xla/python/ifrt:sharding_test PASSED in 0.4s //tensorflow/compiler/xla/python/ifrt:tuple_test_no_impl PASSED in 0.3s //tensorflow/compiler/xla/python/ifrt/ir/tests:verify_array.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/python/ifrt/ir/tests:verify_assemble.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/python/ifrt/ir/tests:verify_call.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/python/ifrt/ir/tests:verify_call_loaded_executable.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/python/ifrt/ir/tests:verify_disassemble.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/python/ifrt/ir/tests:verify_loaded_executable.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/python/ifrt/ir/tests:verify_reshard.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/python/ifrt/support:sharding_param_to_op_sharding_test PASSED in 0.9s //tensorflow/compiler/xla/python/pjrt_ifrt:pjrt_array_impl_test_tfrt_cpu PASSED in 13.9s //tensorflow/compiler/xla/python/pjrt_ifrt:pjrt_client_impl_test_tfrt_cpu PASSED in 7.1s //tensorflow/compiler/xla/python/pjrt_ifrt:pjrt_executable_impl_test_tfrt_cpu PASSED in 6.1s //tensorflow/compiler/xla/python/pjrt_ifrt:pjrt_tuple_impl_test_tfrt_cpu PASSED in 7.7s //tensorflow/compiler/xla/python_api:xla_literal_test PASSED in 1.5s //tensorflow/compiler/xla/python_api:xla_shape_test PASSED in 1.0s //tensorflow/compiler/xla/rpc:grpc_client_test PASSED in 2.4s //tensorflow/compiler/xla/runtime:arguments_test PASSED in 1.1s //tensorflow/compiler/xla/runtime:async_runtime_test PASSED in 0.7s //tensorflow/compiler/xla/runtime:custom_call_test PASSED in 1.7s //tensorflow/compiler/xla/runtime:diagnostics_test PASSED in 0.1s //tensorflow/compiler/xla/runtime:executable_test PASSED in 1.7s //tensorflow/compiler/xla/runtime:ffi_test PASSED in 1.0s //tensorflow/compiler/xla/runtime:map_by_type_test PASSED in 0.2s //tensorflow/compiler/xla/runtime:module_test PASSED in 0.2s //tensorflow/compiler/xla/runtime:results_test PASSED in 0.5s //tensorflow/compiler/xla/runtime:state_test PASSED in 0.1s //tensorflow/compiler/xla/runtime:symbolic_shape_test PASSED in 0.2s //tensorflow/compiler/xla/runtime:type_id_test PASSED in 0.1s //tensorflow/compiler/xla/service:algebraic_simplifier_overflow_test_cpu PASSED in 7.2s //tensorflow/compiler/xla/service:algebraic_simplifier_test PASSED in 2.0s //tensorflow/compiler/xla/service:all_gather_broadcast_reorder_test PASSED in 1.1s //tensorflow/compiler/xla/service:all_gather_combiner_test PASSED in 0.9s //tensorflow/compiler/xla/service:all_gather_decomposer_test PASSED in 0.8s //tensorflow/compiler/xla/service:all_reduce_combiner_test PASSED in 1.0s //tensorflow/compiler/xla/service:all_reduce_contiguous_test PASSED in 0.9s //tensorflow/compiler/xla/service:all_reduce_folder_test PASSED in 0.9s //tensorflow/compiler/xla/service:all_reduce_promotion_test PASSED in 1.0s //tensorflow/compiler/xla/service:all_reduce_reassociate_test PASSED in 0.9s //tensorflow/compiler/xla/service:all_reduce_simplifier_test PASSED in 0.9s //tensorflow/compiler/xla/service:ar_crs_combiner_test PASSED in 0.9s //tensorflow/compiler/xla/service:async_collective_creator_test PASSED in 0.8s //tensorflow/compiler/xla/service:async_op_canonicalizer_test PASSED in 0.9s //tensorflow/compiler/xla/service:batch_dot_simplification_test PASSED in 0.9s //tensorflow/compiler/xla/service:batchnorm_expander_test_cpu PASSED in 8.8s //tensorflow/compiler/xla/service:bfloat16_conversion_folding_test PASSED in 1.3s //tensorflow/compiler/xla/service:bfloat16_propagation_test PASSED in 1.9s //tensorflow/compiler/xla/service:bitcast_dtypes_expander_test PASSED in 0.9s //tensorflow/compiler/xla/service:broadcast_canonicalizer_test PASSED in 1.1s //tensorflow/compiler/xla/service:buffer_assignment_test PASSED in 8.7s //tensorflow/compiler/xla/service:call_graph_test PASSED in 1.1s //tensorflow/compiler/xla/service:call_inliner_test PASSED in 2.0s //tensorflow/compiler/xla/service:change_op_data_type_test PASSED in 0.9s //tensorflow/compiler/xla/service:collective_ops_utils_test PASSED in 0.9s //tensorflow/compiler/xla/service:collectives_schedule_linearizer_test PASSED in 1.0s //tensorflow/compiler/xla/service:compilation_environments_test PASSED in 0.3s //tensorflow/compiler/xla/service:conditional_canonicalizer_test PASSED in 0.9s //tensorflow/compiler/xla/service:conditional_code_motion_test PASSED in 1.1s //tensorflow/compiler/xla/service:conditional_simplifier_test PASSED in 2.1s //tensorflow/compiler/xla/service:conditional_to_select_test PASSED in 1.1s //tensorflow/compiler/xla/service:convert_async_collectives_to_sync_test PASSED in 1.4s //tensorflow/compiler/xla/service:convert_mover_test PASSED in 1.1s //tensorflow/compiler/xla/service:convert_operand_folding_test PASSED in 0.9s //tensorflow/compiler/xla/service:convolution_4d_expander_test PASSED in 1.1s //tensorflow/compiler/xla/service:convolution_group_converter_test PASSED in 1.0s //tensorflow/compiler/xla/service:convolution_pred_expander_test PASSED in 0.9s //tensorflow/compiler/xla/service:copy_insertion_test PASSED in 1.7s //tensorflow/compiler/xla/service:custom_call_status_test PASSED in 0.2s //tensorflow/compiler/xla/service:defuser_test PASSED in 1.3s //tensorflow/compiler/xla/service:despecializer_test PASSED in 1.0s //tensorflow/compiler/xla/service:dfs_hlo_visitor_with_default_test PASSED in 0.9s //tensorflow/compiler/xla/service:dot_decomposer_test PASSED in 1.5s //tensorflow/compiler/xla/service:dot_merger_test PASSED in 1.0s //tensorflow/compiler/xla/service:dynamic_dimension_inference_test PASSED in 1.3s //tensorflow/compiler/xla/service:dynamic_dimension_simplifier_test PASSED in 1.2s //tensorflow/compiler/xla/service:dynamic_index_splitter_test PASSED in 1.5s //tensorflow/compiler/xla/service:dynamic_padder_test_cpu PASSED in 16.1s //tensorflow/compiler/xla/service:dynamic_parameter_binding_test PASSED in 1.5s //tensorflow/compiler/xla/service:dynamic_update_slice_test_cpu PASSED in 10.7s //tensorflow/compiler/xla/service:elemental_ir_emitter_test_cpu PASSED in 12.2s //tensorflow/compiler/xla/service:flatten_call_graph_test PASSED in 1.0s //tensorflow/compiler/xla/service:float_normalization_test PASSED in 1.1s //tensorflow/compiler/xla/service:fusion_node_indexing_evaluation_test PASSED in 1.7s //tensorflow/compiler/xla/service:gather_expander_test PASSED in 1.2s //tensorflow/compiler/xla/service:gather_simplifier_test PASSED in 1.3s //tensorflow/compiler/xla/service:heap_simulator_test PASSED in 1.0s //tensorflow/compiler/xla/service:hlo_alias_analysis_test PASSED in 1.1s //tensorflow/compiler/xla/service:hlo_casting_utils_test PASSED in 8.4s //tensorflow/compiler/xla/service:hlo_computation_deduplicator_test PASSED in 1.5s //tensorflow/compiler/xla/service:hlo_computation_test PASSED in 3.1s //tensorflow/compiler/xla/service:hlo_constant_folding_test PASSED in 4.6s //tensorflow/compiler/xla/service:hlo_cost_analysis_test PASSED in 8.4s //tensorflow/compiler/xla/service:hlo_creation_utils_test PASSED in 3.2s //tensorflow/compiler/xla/service:hlo_cse_test PASSED in 8.6s //tensorflow/compiler/xla/service:hlo_dataflow_analysis_test PASSED in 1.9s //tensorflow/compiler/xla/service:hlo_dce_test PASSED in 0.8s //tensorflow/compiler/xla/service:hlo_domain_test PASSED in 1.5s //tensorflow/compiler/xla/service:hlo_element_type_converter_test PASSED in 0.9s //tensorflow/compiler/xla/service:hlo_execution_profile_test PASSED in 6.2s //tensorflow/compiler/xla/service:hlo_graph_dumper_test PASSED in 1.2s //tensorflow/compiler/xla/service:hlo_input_output_alias_config_test PASSED in 2.1s //tensorflow/compiler/xla/service:hlo_instruction_test PASSED in 3.2s //tensorflow/compiler/xla/service:hlo_liveness_analysis_test PASSED in 0.9s //tensorflow/compiler/xla/service:hlo_memory_scheduler_test PASSED in 0.8s //tensorflow/compiler/xla/service:hlo_module_dce_test PASSED in 1.7s //tensorflow/compiler/xla/service:hlo_module_metadata_test PASSED in 0.3s //tensorflow/compiler/xla/service:hlo_module_test PASSED in 0.9s //tensorflow/compiler/xla/service:hlo_opcode_test PASSED in 0.2s //tensorflow/compiler/xla/service:hlo_ordering_test PASSED in 1.4s //tensorflow/compiler/xla/service:hlo_parser_test PASSED in 0.5s //tensorflow/compiler/xla/service:hlo_pass_pipeline_test PASSED in 0.9s //tensorflow/compiler/xla/service:hlo_phi_graph_test PASSED in 0.2s //tensorflow/compiler/xla/service:hlo_proto_util_test PASSED in 0.9s //tensorflow/compiler/xla/service:hlo_reachability_test PASSED in 1.3s //tensorflow/compiler/xla/service:hlo_rematerialization_test PASSED in 1.2s //tensorflow/compiler/xla/service:hlo_rematerialization_test_utils_test PASSED in 1.0s //tensorflow/compiler/xla/service:hlo_replication_analysis_test PASSED in 2.3s //tensorflow/compiler/xla/service:hlo_schedule_test PASSED in 0.9s //tensorflow/compiler/xla/service:hlo_sharding_test PASSED in 0.8s //tensorflow/compiler/xla/service:hlo_value_semantics_analysis_test PASSED in 1.0s //tensorflow/compiler/xla/service:hlo_verifier_test PASSED in 1.9s //tensorflow/compiler/xla/service:indexed_array_analysis_test PASSED in 1.5s //tensorflow/compiler/xla/service:instruction_fusion_test PASSED in 1.2s //tensorflow/compiler/xla/service:latency_hiding_scheduler_test PASSED in 1.1s //tensorflow/compiler/xla/service:layout_assignment_test PASSED in 6.4s //tensorflow/compiler/xla/service:layout_normalization_test PASSED in 2.6s //tensorflow/compiler/xla/service:logistic_expander_test PASSED in 0.9s //tensorflow/compiler/xla/service:loop_schedule_linearizer_test PASSED in 0.9s //tensorflow/compiler/xla/service:map_inliner_test PASSED in 0.9s //tensorflow/compiler/xla/service:mapped_ptr_container_sorter_test PASSED in 0.1s //tensorflow/compiler/xla/service:memory_space_assignment_best_fit_repacker_test PASSED in 0.2s //tensorflow/compiler/xla/service:memory_space_assignment_test PASSED in 2.3s //tensorflow/compiler/xla/service:memory_space_propagation_test PASSED in 1.0s //tensorflow/compiler/xla/service:name_uniquer_test PASSED in 0.1s //tensorflow/compiler/xla/service:operand_upcaster_test PASSED in 1.7s //tensorflow/compiler/xla/service:optimize_input_output_buffer_alias_test PASSED in 1.2s //tensorflow/compiler/xla/service:pattern_matcher_gmock_test PASSED in 0.2s //tensorflow/compiler/xla/service:pattern_matcher_test PASSED in 1.0s //tensorflow/compiler/xla/service:profile_guided_latency_estimator_test PASSED in 0.8s //tensorflow/compiler/xla/service:real_imag_expander_test PASSED in 1.1s //tensorflow/compiler/xla/service:reduce_decomposer_test PASSED in 1.6s //tensorflow/compiler/xla/service:reduce_scatter_combiner_test PASSED in 0.9s //tensorflow/compiler/xla/service:reduce_scatter_decomposer_test PASSED in 1.3s //tensorflow/compiler/xla/service:reduce_scatter_reassociate_test PASSED in 0.9s //tensorflow/compiler/xla/service:reshape_decomposer_test PASSED in 1.5s //tensorflow/compiler/xla/service:reshape_mover_test PASSED in 0.9s //tensorflow/compiler/xla/service:result_caster_test PASSED in 1.6s //tensorflow/compiler/xla/service:root_instruction_sinker_test PASSED in 1.4s //tensorflow/compiler/xla/service:scatter_expander_test PASSED in 1.2s //tensorflow/compiler/xla/service:scatter_simplifier_test PASSED in 1.0s //tensorflow/compiler/xla/service:select_and_scatter_expander_test PASSED in 1.0s //tensorflow/compiler/xla/service:shape_inference_test PASSED in 0.2s //tensorflow/compiler/xla/service:shaped_buffer_test PASSED in 7.5s //tensorflow/compiler/xla/service:sharding_propagation_test PASSED in 3.5s //tensorflow/compiler/xla/service:sharding_remover_test PASSED in 1.1s //tensorflow/compiler/xla/service:simplify_fp_conversions_test PASSED in 1.8s //tensorflow/compiler/xla/service:slice_sinker_test PASSED in 2.7s //tensorflow/compiler/xla/service:sort_simplifier_test PASSED in 1.0s //tensorflow/compiler/xla/service:space_to_batch_converter_test PASSED in 0.9s //tensorflow/compiler/xla/service:stable_sort_expander_test PASSED in 0.9s //tensorflow/compiler/xla/service:stochastic_convert_decomposer_test PASSED in 0.9s //tensorflow/compiler/xla/service:stream_pool_test PASSED in 0.5s //tensorflow/compiler/xla/service:topk_rewriter_test PASSED in 3.6s //tensorflow/compiler/xla/service:transpose_folding_test PASSED in 1.8s //tensorflow/compiler/xla/service:tuple_points_to_analysis_test PASSED in 1.2s //tensorflow/compiler/xla/service:tuple_simplifier_test PASSED in 1.4s //tensorflow/compiler/xla/service:tuple_util_test PASSED in 1.4s //tensorflow/compiler/xla/service:while_loop_all_reduce_code_motion_test PASSED in 2.0s //tensorflow/compiler/xla/service:while_loop_analysis_test PASSED in 1.2s //tensorflow/compiler/xla/service:while_loop_concat_code_motion_test PASSED in 1.0s //tensorflow/compiler/xla/service:while_loop_constant_sinking_test PASSED in 1.4s //tensorflow/compiler/xla/service:while_loop_expensive_invariant_code_motion_test PASSED in 0.9s //tensorflow/compiler/xla/service:while_loop_invariant_code_motion_test PASSED in 1.4s //tensorflow/compiler/xla/service:while_loop_simplifier_test PASSED in 0.9s //tensorflow/compiler/xla/service:while_loop_trip_count_annotator_test PASSED in 0.9s //tensorflow/compiler/xla/service:while_util_test PASSED in 1.2s //tensorflow/compiler/xla/service:xla_aot_compile_stablehlo_cpu_test PASSED in 9.3s //tensorflow/compiler/xla/service:xla_debug_info_manager_test PASSED in 1.0s //tensorflow/compiler/xla/service:zero_sized_hlo_elimination_test PASSED in 2.1s //tensorflow/compiler/xla/service/cpu:conv_canonicalization_test PASSED in 2.3s //tensorflow/compiler/xla/service/cpu:cpu_eigen_tensor_alignment_test PASSED in 2.1s //tensorflow/compiler/xla/service/cpu:cpu_instruction_fusion_test PASSED in 1.7s //tensorflow/compiler/xla/service/cpu:cpu_layout_assignment_test PASSED in 2.0s //tensorflow/compiler/xla/service/cpu:ir_emission_utils_test PASSED in 1.2s //tensorflow/compiler/xla/service/cpu:parallel_task_assignment_test PASSED in 4.4s //tensorflow/compiler/xla/service/cpu:runtime_fft_test PASSED in 0.2s //tensorflow/compiler/xla/service/cpu:shape_partition_test PASSED in 1.0s //tensorflow/compiler/xla/service/cpu:xfeed_manager_test PASSED in 0.7s //tensorflow/compiler/xla/service/cpu/tests:cpu_bytesizeof_test PASSED in 0.7s //tensorflow/compiler/xla/service/cpu/tests:cpu_dyn_shape_test PASSED in 7.2s //tensorflow/compiler/xla/service/cpu/tests:cpu_eigen_dot_operation_test PASSED in 9.0s //tensorflow/compiler/xla/service/cpu/tests:cpu_external_constants_test PASSED in 26.8s //tensorflow/compiler/xla/service/cpu/tests:cpu_fusion_test PASSED in 8.0s //tensorflow/compiler/xla/service/cpu/tests:cpu_infeed_test PASSED in 5.8s //tensorflow/compiler/xla/service/cpu/tests:cpu_intrinsic_test PASSED in 9.5s //tensorflow/compiler/xla/service/cpu/tests:cpu_key_value_sort_test PASSED in 8.1s //tensorflow/compiler/xla/service/cpu/tests:cpu_literal_caching_test PASSED in 8.0s //tensorflow/compiler/xla/service/cpu/tests:cpu_noalias_test PASSED in 6.8s //tensorflow/compiler/xla/service/cpu/tests:cpu_outfeed_test PASSED in 7.9s //tensorflow/compiler/xla/service/cpu/tests:cpu_profiling_test PASSED in 9.0s //tensorflow/compiler/xla/service/cpu/tests:cpu_spmd_compile_test PASSED in 7.2s //tensorflow/compiler/xla/service/cpu/tests:cpu_topk_test PASSED in 8.3s //tensorflow/compiler/xla/service/cpu/tests:cpu_vectorization_test PASSED in 9.5s //tensorflow/compiler/xla/service/cpu/tests:cpu_while_test PASSED in 8.4s //tensorflow/compiler/xla/service/cpu/tests:tree_reduction_rewriter_test PASSED in 8.8s //tensorflow/compiler/xla/service/gpu:alias_passthrough_params_test PASSED in 1.2s //tensorflow/compiler/xla/service/gpu:all_reduce_blueconnect_test PASSED in 0.9s //tensorflow/compiler/xla/service/gpu:cublas_pad_for_gemms_test PASSED in 3.3s //tensorflow/compiler/xla/service/gpu:cudnn_pad_for_convolutions_test PASSED in 1.5s //tensorflow/compiler/xla/service/gpu:cudnn_simplify_padding_test PASSED in 2.0s //tensorflow/compiler/xla/service/gpu:cudnn_support_utils_test PASSED in 1.1s //tensorflow/compiler/xla/service/gpu:cudnn_vectorize_convolutions_test PASSED in 1.1s //tensorflow/compiler/xla/service/gpu:fusion_merger_test PASSED in 1.5s //tensorflow/compiler/xla/service/gpu:gemm_rewriter_triton_test PASSED in 1.5s //tensorflow/compiler/xla/service/gpu:gpu_conv_padding_legalization_test PASSED in 1.0s //tensorflow/compiler/xla/service/gpu:gpu_conv_rewriter_test PASSED in 1.4s //tensorflow/compiler/xla/service/gpu:gpu_fusible_test PASSED in 1.4s //tensorflow/compiler/xla/service/gpu:gpu_hlo_cost_analysis_test PASSED in 1.4s //tensorflow/compiler/xla/service/gpu:gpu_performance_model_test PASSED in 1.8s //tensorflow/compiler/xla/service/gpu:gpu_sanitize_constant_names_test PASSED in 1.4s //tensorflow/compiler/xla/service/gpu:hlo_algorithm_denylist_test PASSED in 0.2s //tensorflow/compiler/xla/service/gpu:hlo_fusion_stats_test PASSED in 1.0s //tensorflow/compiler/xla/service/gpu:instruction_fusion_test PASSED in 4.3s //tensorflow/compiler/xla/service/gpu:ir_emission_utils_test PASSED in 1.7s //tensorflow/compiler/xla/service/gpu:matmul_utils_test PASSED in 2.1s //tensorflow/compiler/xla/service/gpu:move_copy_to_users_test PASSED in 1.9s //tensorflow/compiler/xla/service/gpu:multi_output_fusion_test PASSED in 2.3s //tensorflow/compiler/xla/service/gpu:non_atomically_upgradeable_rw_lock_test PASSED in 0.2s //tensorflow/compiler/xla/service/gpu:reduction_splitter_test PASSED in 1.4s //tensorflow/compiler/xla/service/gpu:scatter_slice_simplifier_test PASSED in 1.5s //tensorflow/compiler/xla/service/gpu:target_util_test PASSED in 0.5s //tensorflow/compiler/xla/service/gpu:variadic_op_splitter_test PASSED in 2.0s //tensorflow/compiler/xla/service/gpu:while_transformer_test PASSED in 2.4s //tensorflow/compiler/xla/service/gpu/llvm_gpu_backend:utils_test PASSED in 0.5s //tensorflow/compiler/xla/service/gpu/tests:gpu_reduce_scatter_creator_test PASSED in 0.9s //tensorflow/compiler/xla/service/gpu/tests:reduction_degenerate_dim_remover_test PASSED in 2.5s //tensorflow/compiler/xla/service/gpu/tests:reduction_dimension_grouper_test PASSED in 1.3s //tensorflow/compiler/xla/service/gpu/tests:tree_reduction_rewriter_test PASSED in 2.4s //tensorflow/compiler/xla/service/graphcycles:graphcycles_test PASSED in 0.9s //tensorflow/compiler/xla/service/graphcycles:ordered_set_test PASSED in 0.2s //tensorflow/compiler/xla/service/llvm_ir:alias_analysis_test PASSED in 7.3s //tensorflow/compiler/xla/service/llvm_ir:ir_array_test PASSED in 0.5s //tensorflow/compiler/xla/service/spmd:canonicalize_all_gather_for_cse_test PASSED in 1.2s //tensorflow/compiler/xla/service/spmd:collective_permute_motion_test PASSED in 1.4s //tensorflow/compiler/xla/service/spmd:partition_assignment_test PASSED in 1.2s //tensorflow/compiler/xla/service/spmd:schedule_aware_collective_ops_cse_test PASSED in 1.9s //tensorflow/compiler/xla/service/spmd:spmd_partitioner_test PASSED in 9.2s //tensorflow/compiler/xla/service/spmd:stateful_rng_spmd_partitioner_test PASSED in 1.2s //tensorflow/compiler/xla/stream_executor:dnn_test PASSED in 0.2s //tensorflow/compiler/xla/stream_executor:stream_test PASSED in 0.6s //tensorflow/compiler/xla/stream_executor/host:host_stream_test PASSED in 0.2s //tensorflow/compiler/xla/tests:all_reduce_test_cpu PASSED in 12.4s //tensorflow/compiler/xla/tests:axpy_simple_test_cpu PASSED in 7.3s //tensorflow/compiler/xla/tests:bad_rng_shape_validation_test_cpu PASSED in 8.6s //tensorflow/compiler/xla/tests:binop_scaling_test_cpu PASSED in 7.5s //tensorflow/compiler/xla/tests:bitcast_convert_test_cpu PASSED in 8.5s //tensorflow/compiler/xla/tests:broadcast_simple_test_cpu PASSED in 9.3s //tensorflow/compiler/xla/tests:broadcast_test_cpu PASSED in 8.9s //tensorflow/compiler/xla/tests:buffer_donation_test_cpu PASSED in 9.3s //tensorflow/compiler/xla/tests:call_test_cpu PASSED in 9.0s //tensorflow/compiler/xla/tests:check_execution_arity_test_cpu PASSED in 7.5s //tensorflow/compiler/xla/tests:cholesky_test_cpu PASSED in 15.0s //tensorflow/compiler/xla/tests:client_test_cpu PASSED in 7.9s //tensorflow/compiler/xla/tests:collective_ops_test_cpu PASSED in 27.8s //tensorflow/compiler/xla/tests:compilation_cache_test_cpu PASSED in 8.6s //tensorflow/compiler/xla/tests:compute_constant_test_cpu PASSED in 8.7s //tensorflow/compiler/xla/tests:concat_test_cpu PASSED in 11.7s //tensorflow/compiler/xla/tests:constant_reduction_function_test_cpu PASSED in 9.1s //tensorflow/compiler/xla/tests:constants_test_cpu PASSED in 9.4s //tensorflow/compiler/xla/tests:convert_test_cpu PASSED in 9.4s //tensorflow/compiler/xla/tests:copy_test_cpu PASSED in 10.8s //tensorflow/compiler/xla/tests:cpu_gpu_fusion_test_cpu PASSED in 11.3s //tensorflow/compiler/xla/tests:custom_call_test_cpu PASSED in 8.6s //tensorflow/compiler/xla/tests:deallocation_test_cpu PASSED in 8.4s //tensorflow/compiler/xla/tests:deconstruct_tuple_test_cpu PASSED in 7.5s //tensorflow/compiler/xla/tests:deep_graph_test_cpu PASSED in 8.2s //tensorflow/compiler/xla/tests:execution_profile_test_cpu PASSED in 8.2s //tensorflow/compiler/xla/tests:fft_test_cpu PASSED in 7.8s //tensorflow/compiler/xla/tests:float8_test_cpu PASSED in 7.5s //tensorflow/compiler/xla/tests:floor_ceil_test_cpu PASSED in 7.2s //tensorflow/compiler/xla/tests:fmax_fmin_test_cpu PASSED in 7.6s //tensorflow/compiler/xla/tests:gather_operation_test_cpu PASSED in 22.1s //tensorflow/compiler/xla/tests:get_dimension_size_test_cpu PASSED in 7.6s //tensorflow/compiler/xla/tests:half_test_cpu PASSED in 9.4s //tensorflow/compiler/xla/tests:hlo_metadata_test PASSED in 7.0s //tensorflow/compiler/xla/tests:literal_test_util_test PASSED in 9.4s //tensorflow/compiler/xla/tests:local_client_allocation_test_cpu PASSED in 8.1s //tensorflow/compiler/xla/tests:local_client_aot_test PASSED in 0.0s //tensorflow/compiler/xla/tests:log_test_cpu PASSED in 8.8s //tensorflow/compiler/xla/tests:map_test_cpu PASSED in 9.1s //tensorflow/compiler/xla/tests:matrix_ops_simple_test_cpu PASSED in 15.4s //tensorflow/compiler/xla/tests:multidimensional_slice_test_cpu PASSED in 7.5s //tensorflow/compiler/xla/tests:multiple_devices_on_host_test PASSED in 6.6s //tensorflow/compiler/xla/tests:multithreaded_compilation_test_cpu PASSED in 8.0s //tensorflow/compiler/xla/tests:outfeed_in_nested_computation_test_cpu PASSED in 6.8s //tensorflow/compiler/xla/tests:pad_test_cpu PASSED in 8.8s //tensorflow/compiler/xla/tests:pred_test_cpu PASSED in 9.8s //tensorflow/compiler/xla/tests:query_inferred_shape_test_cpu PASSED in 6.1s //tensorflow/compiler/xla/tests:reduce_hlo_test_cpu PASSED in 10.0s //tensorflow/compiler/xla/tests:reduce_precision_test_cpu PASSED in 7.9s //tensorflow/compiler/xla/tests:replay_test_cpu PASSED in 7.2s //tensorflow/compiler/xla/tests:reshape_motion_test_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests:reverse_test_cpu PASSED in 8.2s //tensorflow/compiler/xla/tests:round_trip_packed_literal_test_cpu PASSED in 6.4s //tensorflow/compiler/xla/tests:round_trip_transfer_test_cpu PASSED in 8.0s //tensorflow/compiler/xla/tests:sample_text_test_cpu PASSED in 10.2s //tensorflow/compiler/xla/tests:scatter_test_cpu PASSED in 18.3s //tensorflow/compiler/xla/tests:select_test_cpu PASSED in 8.7s //tensorflow/compiler/xla/tests:test_utils_test_cpu PASSED in 7.9s //tensorflow/compiler/xla/tests:token_hlo_test_cpu PASSED in 8.5s //tensorflow/compiler/xla/tests:transfer_manager_test_cpu PASSED in 13.7s //tensorflow/compiler/xla/tests:transpose_test_cpu PASSED in 9.3s //tensorflow/compiler/xla/tests:tuple_test_cpu PASSED in 9.2s //tensorflow/compiler/xla/tests:unary_op_test_cpu PASSED in 9.2s //tensorflow/compiler/xla/tests:value_inference_test_cpu PASSED in 6.9s //tensorflow/compiler/xla/tests:vector_ops_reduce_test_cpu PASSED in 7.2s //tensorflow/compiler/xla/tests:vector_ops_simple_test_cpu PASSED in 7.8s //tensorflow/compiler/xla/tests:while_test_cpu PASSED in 10.2s //tensorflow/compiler/xla/tests/fuzz:rand_0_cpu PASSED in 7.9s //tensorflow/compiler/xla/tests/fuzz:rand_10_cpu PASSED in 7.8s //tensorflow/compiler/xla/tests/fuzz:rand_11_cpu PASSED in 8.0s //tensorflow/compiler/xla/tests/fuzz:rand_12_cpu PASSED in 7.9s //tensorflow/compiler/xla/tests/fuzz:rand_13_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests/fuzz:rand_14_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests/fuzz:rand_15_cpu PASSED in 7.4s //tensorflow/compiler/xla/tests/fuzz:rand_16_cpu PASSED in 8.2s //tensorflow/compiler/xla/tests/fuzz:rand_17_cpu PASSED in 8.2s //tensorflow/compiler/xla/tests/fuzz:rand_18_cpu PASSED in 8.6s //tensorflow/compiler/xla/tests/fuzz:rand_19_cpu PASSED in 8.3s //tensorflow/compiler/xla/tests/fuzz:rand_20_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests/fuzz:rand_21_cpu PASSED in 8.1s //tensorflow/compiler/xla/tests/fuzz:rand_22_cpu PASSED in 8.8s //tensorflow/compiler/xla/tests/fuzz:rand_23_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests/fuzz:rand_24_cpu PASSED in 8.1s //tensorflow/compiler/xla/tests/fuzz:rand_25_cpu PASSED in 8.0s //tensorflow/compiler/xla/tests/fuzz:rand_26_cpu PASSED in 8.1s //tensorflow/compiler/xla/tests/fuzz:rand_27_cpu PASSED in 8.1s //tensorflow/compiler/xla/tests/fuzz:rand_28_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests/fuzz:rand_29_cpu PASSED in 8.6s //tensorflow/compiler/xla/tests/fuzz:rand_2_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests/fuzz:rand_30_cpu PASSED in 8.3s //tensorflow/compiler/xla/tests/fuzz:rand_31_cpu PASSED in 7.3s //tensorflow/compiler/xla/tests/fuzz:rand_32_cpu PASSED in 7.3s //tensorflow/compiler/xla/tests/fuzz:rand_33_cpu PASSED in 7.6s //tensorflow/compiler/xla/tests/fuzz:rand_34_cpu PASSED in 8.7s //tensorflow/compiler/xla/tests/fuzz:rand_35_cpu PASSED in 8.0s //tensorflow/compiler/xla/tests/fuzz:rand_36_cpu PASSED in 10.1s //tensorflow/compiler/xla/tests/fuzz:rand_37_cpu PASSED in 6.8s //tensorflow/compiler/xla/tests/fuzz:rand_38_cpu PASSED in 6.9s //tensorflow/compiler/xla/tests/fuzz:rand_39_cpu PASSED in 8.7s //tensorflow/compiler/xla/tests/fuzz:rand_3_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests/fuzz:rand_40_cpu PASSED in 9.4s //tensorflow/compiler/xla/tests/fuzz:rand_41_cpu PASSED in 8.3s //tensorflow/compiler/xla/tests/fuzz:rand_42_cpu PASSED in 10.6s //tensorflow/compiler/xla/tests/fuzz:rand_43_cpu PASSED in 8.4s //tensorflow/compiler/xla/tests/fuzz:rand_44_cpu PASSED in 8.2s //tensorflow/compiler/xla/tests/fuzz:rand_45_cpu PASSED in 9.0s //tensorflow/compiler/xla/tests/fuzz:rand_46_cpu PASSED in 7.6s //tensorflow/compiler/xla/tests/fuzz:rand_47_cpu PASSED in 8.4s //tensorflow/compiler/xla/tests/fuzz:rand_48_cpu PASSED in 9.0s //tensorflow/compiler/xla/tests/fuzz:rand_49_cpu PASSED in 7.2s //tensorflow/compiler/xla/tests/fuzz:rand_4_cpu PASSED in 11.2s //tensorflow/compiler/xla/tests/fuzz:rand_50_cpu PASSED in 7.0s //tensorflow/compiler/xla/tests/fuzz:rand_51_cpu PASSED in 7.9s //tensorflow/compiler/xla/tests/fuzz:rand_52_cpu PASSED in 7.8s //tensorflow/compiler/xla/tests/fuzz:rand_53_cpu PASSED in 8.5s //tensorflow/compiler/xla/tests/fuzz:rand_54_cpu PASSED in 7.6s //tensorflow/compiler/xla/tests/fuzz:rand_56_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests/fuzz:rand_57_cpu PASSED in 7.4s //tensorflow/compiler/xla/tests/fuzz:rand_58_cpu PASSED in 7.5s //tensorflow/compiler/xla/tests/fuzz:rand_59_cpu PASSED in 8.4s //tensorflow/compiler/xla/tests/fuzz:rand_5_cpu PASSED in 7.5s //tensorflow/compiler/xla/tests/fuzz:rand_61_cpu PASSED in 8.5s //tensorflow/compiler/xla/tests/fuzz:rand_62_cpu PASSED in 9.0s //tensorflow/compiler/xla/tests/fuzz:rand_63_cpu PASSED in 7.9s //tensorflow/compiler/xla/tests/fuzz:rand_64_cpu PASSED in 11.0s //tensorflow/compiler/xla/tests/fuzz:rand_65_cpu PASSED in 7.4s //tensorflow/compiler/xla/tests/fuzz:rand_66_cpu PASSED in 7.6s //tensorflow/compiler/xla/tests/fuzz:rand_68_cpu PASSED in 7.9s //tensorflow/compiler/xla/tests/fuzz:rand_69_cpu PASSED in 7.4s //tensorflow/compiler/xla/tests/fuzz:rand_6_cpu PASSED in 8.2s //tensorflow/compiler/xla/tests/fuzz:rand_70_cpu PASSED in 7.6s //tensorflow/compiler/xla/tests/fuzz:rand_71_cpu PASSED in 7.6s //tensorflow/compiler/xla/tests/fuzz:rand_73_cpu PASSED in 7.4s //tensorflow/compiler/xla/tests/fuzz:rand_74_cpu PASSED in 8.4s //tensorflow/compiler/xla/tests/fuzz:rand_75_cpu PASSED in 10.9s //tensorflow/compiler/xla/tests/fuzz:rand_76_cpu PASSED in 8.6s //tensorflow/compiler/xla/tests/fuzz:rand_77_cpu PASSED in 8.4s //tensorflow/compiler/xla/tests/fuzz:rand_78_cpu PASSED in 8.6s //tensorflow/compiler/xla/tests/fuzz:rand_79_cpu PASSED in 8.8s //tensorflow/compiler/xla/tests/fuzz:rand_7_cpu PASSED in 6.9s //tensorflow/compiler/xla/tests/fuzz:rand_80_cpu PASSED in 7.6s //tensorflow/compiler/xla/tests/fuzz:rand_81_cpu PASSED in 7.4s //tensorflow/compiler/xla/tests/fuzz:rand_82_cpu PASSED in 7.3s //tensorflow/compiler/xla/tests/fuzz:rand_83_cpu PASSED in 8.5s //tensorflow/compiler/xla/tests/fuzz:rand_84_cpu PASSED in 7.5s //tensorflow/compiler/xla/tests/fuzz:rand_85_cpu PASSED in 7.3s //tensorflow/compiler/xla/tests/fuzz:rand_86_cpu PASSED in 6.7s //tensorflow/compiler/xla/tests/fuzz:rand_87_cpu PASSED in 8.4s //tensorflow/compiler/xla/tests/fuzz:rand_88_cpu PASSED in 8.0s //tensorflow/compiler/xla/tests/fuzz:rand_89_cpu PASSED in 8.1s //tensorflow/compiler/xla/tests/fuzz:rand_8_cpu PASSED in 8.1s //tensorflow/compiler/xla/tests/fuzz:rand_90_cpu PASSED in 8.0s //tensorflow/compiler/xla/tests/fuzz:rand_91_cpu PASSED in 6.9s //tensorflow/compiler/xla/tests/fuzz:rand_92_cpu PASSED in 8.9s //tensorflow/compiler/xla/tests/fuzz:rand_93_cpu PASSED in 9.2s //tensorflow/compiler/xla/tests/fuzz:rand_94_cpu PASSED in 8.0s //tensorflow/compiler/xla/tests/fuzz:rand_95_cpu PASSED in 7.8s //tensorflow/compiler/xla/tests/fuzz:rand_96_cpu PASSED in 7.7s //tensorflow/compiler/xla/tests/fuzz:rand_97_cpu PASSED in 8.7s //tensorflow/compiler/xla/tests/fuzz:rand_98_cpu PASSED in 7.8s //tensorflow/compiler/xla/tests/fuzz:rand_99_cpu PASSED in 7.3s //tensorflow/compiler/xla/tests/fuzz:rand_9_cpu PASSED in 7.8s //tensorflow/compiler/xla/tools:hlo_control_flow_flattening_test PASSED in 1.0s //tensorflow/compiler/xla/tools:hlo_extractor_test PASSED in 1.1s //tensorflow/compiler/xla/tools:hlo_module_loader_test PASSED in 0.7s //tensorflow/compiler/xla/tools:interactive_graphviz_bin_test PASSED in 0.3s //tensorflow/compiler/xla/tools:run_hlo_module_bin_test PASSED in 0.3s //tensorflow/compiler/xla/tools/hlo_bisect:hlo_bisect_state_test PASSED in 1.0s //tensorflow/compiler/xla/translate/hlo_to_mhlo:hlo_utils_test PASSED in 0.6s //tensorflow/compiler/xla/translate/hlo_to_mhlo:mlir_hlo_builder_test PASSED in 0.8s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:bool_compare.hlotxt.test PASSED in 0.6s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:case_conditional.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:dynamic_param.hlo.test PASSED in 0.5s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:entry_computation_layout.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:frontend_attributes.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:fully_connected_reference_model.hlotxt.test PASSED in 0.4s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:fusion.hlotxt.test PASSED in 1.0s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:if_conditional.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:import.hlotxt.test PASSED in 0.7s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:import_async.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:layouts_and_names.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:location.hlotxt.test PASSED in 0.7s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:module_attributes.hlo.test PASSED in 0.7s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:simple.hlo.test PASSED in 0.5s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:spmd_module_sharding.hlo.test PASSED in 0.6s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:types.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/hlo_to_mhlo/tests:while.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo:type_to_shape_test PASSED in 0.6s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:add.mlir.test PASSED in 0.8s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:case.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:dynamic.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:export-with-layouts.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:export.mlir.test PASSED in 1.6s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:export_and_check_layouts.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:export_large_constants.mlir.test PASSED in 1.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:export_replicas.mlir.test PASSED in 0.9s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:frontend_attributes.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:fusion.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:if.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:input_output_aliasing.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:layouts_and_names.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:location_to_op_metadata.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:missing_main.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:module_attributes.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:multiple_return_tuple.mlir.test PASSED in 0.6s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:opaque_elements_attr.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:rng_get_and_update_state.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:sharding.mlir.test PASSED in 0.7s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:simple.mlir.test PASSED in 0.4s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:unsupported_type.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_hlo/tests:while.mlir.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_lhlo_with_xla/tests:hlo_text_to_lhlo_no_opt.hlotxt.test PASSED in 10.7s //tensorflow/compiler/xla/translate/mhlo_to_lhlo_with_xla/tests:no_opt_ops.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_lhlo_with_xla/tests:non_identity_layouts.hlotxt.test PASSED in 0.5s //tensorflow/compiler/xla/translate/mhlo_to_lhlo_with_xla/tests:ops.mlir.test PASSED in 3.2s //tensorflow/compiler/xla/translate/mhlo_to_lhlo_with_xla/tests:passthrough.mlir.test PASSED in 0.5s //tensorflow/core:__tensorflow_core_lib_core_legacy_lib_core_all_tests PASSED in 13.8s //tensorflow/core:__tensorflow_core_lib_gtl_legacy_lib_gtl_tests PASSED in 0.5s //tensorflow/core:__tensorflow_core_lib_monitoring_cell_reader_test PASSED in 37.3s //tensorflow/core:__tensorflow_core_lib_monitoring_collection_registry_test PASSED in 0.2s //tensorflow/core:__tensorflow_core_lib_monitoring_counter_test PASSED in 0.1s //tensorflow/core:__tensorflow_core_lib_monitoring_gauge_test PASSED in 0.1s //tensorflow/core:__tensorflow_core_lib_monitoring_metric_def_test PASSED in 0.1s //tensorflow/core:__tensorflow_core_lib_monitoring_percentile_sampler_test PASSED in 0.2s //tensorflow/core:__tensorflow_core_lib_monitoring_sampler_test PASSED in 0.1s //tensorflow/core:__tensorflow_core_lib_monitoring_test_utils_test PASSED in 0.1s //tensorflow/core:__tensorflow_core_lib_strings_legacy_low_level_library_tests PASSED in 0.1s //tensorflow/core:__tensorflow_core_lib_wav_wav_io_test PASSED in 0.2s //tensorflow/core:__tensorflow_core_util_mkl_util_test_srcs PASSED in 0.1s //tensorflow/core:__tensorflow_tsl_lib_core_legacy_lib_core_all_tests PASSED in 0.7s //tensorflow/core:lib_strings_ordered_code_test PASSED in 1.2s //tensorflow/core:lib_strings_proto_serialization_test PASSED in 0.1s //tensorflow/core/api_def:api_test PASSED in 5.6s //tensorflow/core/api_def:update_api_def_test PASSED in 1.0s //tensorflow/core/common_runtime:all_to_all_test_cpu PASSED in 0.6s //tensorflow/core/common_runtime:arg_ret_placement_test PASSED in 0.5s //tensorflow/core/common_runtime:buf_rendezvous_test PASSED in 0.8s //tensorflow/core/common_runtime:collective_executor_mgr_test PASSED in 1.7s //tensorflow/core/common_runtime:collective_param_resolver_local_test PASSED in 6.3s //tensorflow/core/common_runtime:collective_rma_local_test PASSED in 1.5s //tensorflow/core/common_runtime:composite_device_test PASSED in 1.0s //tensorflow/core/common_runtime:cost_measurement_registry_test PASSED in 2.8s //tensorflow/core/common_runtime:cost_util_test PASSED in 0.1s //tensorflow/core/common_runtime:device_mgr_test PASSED in 0.9s //tensorflow/core/common_runtime:device_propagation_test PASSED in 0.6s //tensorflow/core/common_runtime:device_resolver_local_test PASSED in 0.8s //tensorflow/core/common_runtime:device_set_test PASSED in 0.8s //tensorflow/core/common_runtime:direct_session_test_cpu PASSED in 2.4s //tensorflow/core/common_runtime:direct_session_with_debug_test PASSED in 2.7s //tensorflow/core/common_runtime:direct_session_with_tracking_alloc_test PASSED in 1.4s //tensorflow/core/common_runtime:dynamic_device_mgr_test PASSED in 0.9s //tensorflow/core/common_runtime:eval_const_tensor_test PASSED in 1.3s //tensorflow/core/common_runtime:executor_test PASSED in 1.6s //tensorflow/core/common_runtime:function_optimization_registration_test PASSED in 2.8s //tensorflow/core/common_runtime:function_optimization_registry_no_pass_test PASSED in 0.7s //tensorflow/core/common_runtime:function_optimization_registry_pass_failure_test PASSED in 0.7s //tensorflow/core/common_runtime:function_optimization_registry_test PASSED in 0.7s //tensorflow/core/common_runtime:function_threadpool_test PASSED in 1.1s //tensorflow/core/common_runtime:graph_constructor_test PASSED in 2.1s //tensorflow/core/common_runtime:graph_runner_test PASSED in 0.9s //tensorflow/core/common_runtime:hierarchical_tree_broadcaster_test_cpu PASSED in 3.6s //tensorflow/core/common_runtime:inline_function_utils_test PASSED in 0.7s //tensorflow/core/common_runtime:input_colocation_exemption_registry_test PASSED in 0.5s //tensorflow/core/common_runtime:int32_fulltype_test PASSED in 0.7s //tensorflow/core/common_runtime:isolate_placer_inspection_required_ops_pass_test PASSED in 2.1s //tensorflow/core/common_runtime:lower_case_op_test PASSED in 3.6s //tensorflow/core/common_runtime:lower_function_call_test PASSED in 1.8s //tensorflow/core/common_runtime:lower_functional_ops_test PASSED in 2.1s //tensorflow/core/common_runtime:lower_if_op_test PASSED in 2.5s //tensorflow/core/common_runtime:lower_while_op_test PASSED in 2.6s //tensorflow/core/common_runtime:mkl_cpu_allocator_test PASSED in 0.1s //tensorflow/core/common_runtime:mkl_threadpool_device_test PASSED in 0.1s //tensorflow/core/common_runtime:no_op_cost_measurement_test PASSED in 0.1s //tensorflow/core/common_runtime:null_request_cost_accessor_test PASSED in 0.2s //tensorflow/core/common_runtime:optimization_registry_test PASSED in 1.9s //tensorflow/core/common_runtime:optimize_cross_host_control_deps_test PASSED in 13.2s //tensorflow/core/common_runtime:optimize_function_graph_utils_test PASSED in 1.9s //tensorflow/core/common_runtime:partitioning_utils_test PASSED in 1.1s //tensorflow/core/common_runtime:pending_counts_test PASSED in 1.0s //tensorflow/core/common_runtime:permuter_test_cpu PASSED in 3.0s //tensorflow/core/common_runtime:placer_inspection_required_ops_utils_test PASSED in 1.1s //tensorflow/core/common_runtime:placer_test PASSED in 0.8s //tensorflow/core/common_runtime:process_function_library_runtime_test_cpu PASSED in 0.7s //tensorflow/core/common_runtime:process_util_test PASSED in 0.2s //tensorflow/core/common_runtime:quantize_training_test PASSED in 2.1s //tensorflow/core/common_runtime:rendezvous_util_test PASSED in 0.3s //tensorflow/core/common_runtime:replicate_per_replica_nodes_test PASSED in 1.1s //tensorflow/core/common_runtime:request_cost_accessor_registry_test PASSED in 2.3s //tensorflow/core/common_runtime:request_cost_test PASSED in 0.1s //tensorflow/core/common_runtime:ring_gatherer_test_cpu PASSED in 2.5s //tensorflow/core/common_runtime:ring_reducer_test_cpu PASSED in 4.6s //tensorflow/core/common_runtime:scoped_allocator_mgr_test PASSED in 4.7s //tensorflow/core/common_runtime:session_test PASSED in 1.4s //tensorflow/core/common_runtime:shape_refiner_test PASSED in 0.7s //tensorflow/core/common_runtime:single_threaded_executor_test PASSED in 0.9s //tensorflow/core/common_runtime:threadpool_device_test PASSED in 0.9s //tensorflow/core/common_runtime:type_inference_test PASSED in 2.9s //tensorflow/core/common_runtime/eager:attr_builder_test PASSED in 28.0s //tensorflow/core/common_runtime/eager:context_test PASSED in 10.8s //tensorflow/core/common_runtime/eager:custom_device_test PASSED in 14.7s //tensorflow/core/common_runtime/eager:eager_executor_test PASSED in 11.2s //tensorflow/core/common_runtime/eager:eager_op_rewrite_registry_test PASSED in 1.0s //tensorflow/core/common_runtime/eager:eager_operation_test PASSED in 11.0s //tensorflow/core/common_runtime/eager:execute_node_test PASSED in 13.6s //tensorflow/core/common_runtime/eager:execute_test PASSED in 25.8s //tensorflow/core/common_runtime/eager:kernel_and_device_test PASSED in 1.0s //tensorflow/core/common_runtime/eager:mkl_eager_op_rewrite_test PASSED in 16.3s //tensorflow/core/common_runtime/eager:placement_test PASSED in 10.4s //tensorflow/core/common_runtime/eager:placement_utils_test PASSED in 11.0s //tensorflow/core/common_runtime/eager:tensor_handle_data_test PASSED in 11.4s //tensorflow/core/common_runtime/eager:tensor_handle_test PASSED in 10.6s //tensorflow/core/common_runtime/gpu:gpu_device_on_non_gpu_machine_test PASSED in 0.1s //tensorflow/core/common_runtime/next_pluggable_device/c:plugin_c_api_test PASSED in 27.2s //tensorflow/core/config:flags_py_test PASSED in 6.2s //tensorflow/core/config:flags_test PASSED in 0.1s //tensorflow/core/data:compression_utils_test PASSED in 2.0s //tensorflow/core/data:dataset_utils_test PASSED in 0.7s //tensorflow/core/data:hash_utils_test PASSED in 0.8s //tensorflow/core/data:metric_utils_test PASSED in 5.8s //tensorflow/core/data:name_utils_test PASSED in 0.1s //tensorflow/core/data:rewrite_utils_test PASSED in 1.1s //tensorflow/core/data:serialization_utils_test PASSED in 0.5s //tensorflow/core/data:snapshot_utils_test PASSED in 0.6s //tensorflow/core/data:split_utils_test PASSED in 0.9s //tensorflow/core/data:standalone_save_restore_test PASSED in 2.8s //tensorflow/core/data:standalone_test PASSED in 1.5s //tensorflow/core/data:tfdataz_metrics_test PASSED in 3.0s //tensorflow/core/data:unbounded_thread_pool_test PASSED in 0.5s //tensorflow/core/data/service:auto_shard_rewriter_test PASSED in 1.0s //tensorflow/core/data/service:common_test PASSED in 0.1s //tensorflow/core/data/service:credentials_factory_test PASSED in 0.7s //tensorflow/core/data/service:cross_trainer_cache_test PASSED in 2.5s //tensorflow/core/data/service:data_service_test PASSED in 12.1s //tensorflow/core/data/service:data_transfer_test PASSED in 1.2s //tensorflow/core/data/service:dataset_store_test PASSED in 0.8s //tensorflow/core/data/service:dispatcher_client_test PASSED in 4.6s //tensorflow/core/data/service:dispatcher_state_test PASSED in 0.6s //tensorflow/core/data/service:grpc_dispatcher_impl_test PASSED in 2.5s //tensorflow/core/data/service:grpc_util_test PASSED in 0.9s //tensorflow/core/data/service:grpc_worker_impl_test PASSED in 2.9s //tensorflow/core/data/service:journal_test PASSED in 1.2s //tensorflow/core/data/service:logging_utils_test PASSED in 0.1s //tensorflow/core/data/service:task_runner_test PASSED in 3.7s //tensorflow/core/data/service:test_util_test PASSED in 2.6s //tensorflow/core/data/service:url_test PASSED in 0.1s //tensorflow/core/data/service:utils_test PASSED in 0.6s //tensorflow/core/data/service:validate_utils_test PASSED in 0.2s //tensorflow/core/data/service:worker_client_test PASSED in 3.8s //tensorflow/core/data/service:worker_impl_test PASSED in 2.6s //tensorflow/core/data/service/client:data_service_client_test PASSED in 3.6s //tensorflow/core/data/service/client:utils_test PASSED in 2.6s //tensorflow/core/data/service/client:validate_utils_test PASSED in 1.3s //tensorflow/core/data/service/snapshot:distributed_snapshot_test PASSED in 22.3s //tensorflow/core/data/service/snapshot:file_utils_test PASSED in 0.5s //tensorflow/core/data/service/snapshot:path_utils_test PASSED in 0.1s //tensorflow/core/data/service/snapshot:snapshot_manager_test PASSED in 3.1s //tensorflow/core/data/service/snapshot:snapshot_split_provider_test PASSED in 2.6s //tensorflow/core/data/service/snapshot:snapshot_stream_writer_checkpoint_test PASSED in 4.1s //tensorflow/core/data/service/snapshot:snapshot_stream_writer_test PASSED in 5.4s //tensorflow/core/data/service/snapshot:utils_test PASSED in 0.1s //tensorflow/core/debug:debug_graph_utils_test PASSED in 0.6s //tensorflow/core/distributed_runtime:call_options_test PASSED in 0.4s //tensorflow/core/distributed_runtime:cluster_function_library_runtime_test PASSED in 3.7s //tensorflow/core/distributed_runtime:collective_param_resolver_distributed_test PASSED in 1.1s //tensorflow/core/distributed_runtime:collective_rma_distributed_test PASSED in 0.6s //tensorflow/core/distributed_runtime:device_resolver_distributed_test PASSED in 0.8s //tensorflow/core/distributed_runtime:message_wrappers_test PASSED in 0.2s //tensorflow/core/distributed_runtime:partial_run_mgr_test PASSED in 0.4s //tensorflow/core/distributed_runtime:recent_request_ids_test PASSED in 0.1s //tensorflow/core/distributed_runtime:request_id_test PASSED in 0.8s //tensorflow/core/distributed_runtime:rpc_collective_executor_mgr_test PASSED in 0.7s //tensorflow/core/distributed_runtime:server_lib_test PASSED in 0.1s //tensorflow/core/distributed_runtime:session_mgr_test PASSED in 1.2s //tensorflow/core/distributed_runtime:tensor_coding_test PASSED in 0.1s //tensorflow/core/distributed_runtime/coordination:coordination_service_barrier_proxy_test PASSED in 2.4s //tensorflow/core/distributed_runtime/eager:eager_service_impl_test PASSED in 24.5s //tensorflow/core/distributed_runtime/eager:remote_mgr_test PASSED in 11.0s //tensorflow/core/distributed_runtime/integration_test:c_api_coordination_test_cpu PASSED in 49.9s //tensorflow/core/distributed_runtime/integration_test:c_api_multi_client_test_cpu PASSED in 36.4s //tensorflow/core/distributed_runtime/integration_test:c_api_recoverable_jobs_test_cpu PASSED in 41.5s //tensorflow/core/distributed_runtime/integration_test:c_api_session_coordination_test_cpu PASSED in 26.6s //tensorflow/core/distributed_runtime/rpc:grpc_tensor_coding_test PASSED in 2.4s //tensorflow/core/distributed_runtime/rpc:grpc_worker_cache_test PASSED in 0.8s //tensorflow/core/distributed_runtime/rpc/eager:grpc_eager_client_test PASSED in 0.8s //tensorflow/core/example:example_parser_configuration_test PASSED in 1.4s //tensorflow/core/example:feature_util_test PASSED in 1.0s //tensorflow/core/framework:allocator_test PASSED in 5.8s //tensorflow/core/framework:attr_value_util_test PASSED in 0.8s //tensorflow/core/framework:batch_util_test PASSED in 1.1s //tensorflow/core/framework:bfloat16_test PASSED in 0.8s //tensorflow/core/framework:common_shape_fns_test PASSED in 0.7s //tensorflow/core/framework:dataset_test PASSED in 0.9s //tensorflow/core/framework:device_base_test PASSED in 1.2s //tensorflow/core/framework:disable_jit_test PASSED in 1.5s //tensorflow/core/framework:framework_op_gen_lib_test PASSED in 0.1s //tensorflow/core/framework:framework_op_segment_test PASSED in 0.8s //tensorflow/core/framework:framework_resource_var_test PASSED in 0.2s //tensorflow/core/framework:framework_run_handler_test PASSED in 4.3s //tensorflow/core/framework:framework_run_handler_util_test PASSED in 2.4s //tensorflow/core/framework:full_type_inference_util_test PASSED in 0.7s //tensorflow/core/framework:full_type_util_test PASSED in 1.3s //tensorflow/core/framework:function_test PASSED in 0.8s //tensorflow/core/framework:graph_def_util_test PASSED in 0.7s //tensorflow/core/framework:graph_to_functiondef_test PASSED in 0.7s //tensorflow/core/framework:kernel_def_builder_test PASSED in 0.8s //tensorflow/core/framework:kernel_def_util_test PASSED in 0.9s //tensorflow/core/framework:memory_types_test PASSED in 1.0s //tensorflow/core/framework:model_test PASSED in 0.9s //tensorflow/core/framework:node_def_builder_test PASSED in 0.8s //tensorflow/core/framework:node_def_util_test PASSED in 1.0s //tensorflow/core/framework:node_properties_test PASSED in 1.2s //tensorflow/core/framework:op_compatibility_test PASSED in 0.9s //tensorflow/core/framework:op_def_builder_test PASSED in 0.9s //tensorflow/core/framework:op_def_util_test PASSED in 0.8s //tensorflow/core/framework:op_kernel_test PASSED in 1.1s //tensorflow/core/framework:op_registration_test PASSED in 1.1s //tensorflow/core/framework:partial_tensor_shape_test PASSED in 0.9s //tensorflow/core/framework:rendezvous_test PASSED in 3.0s //tensorflow/core/framework:resource_handle_test PASSED in 0.3s //tensorflow/core/framework:resource_mgr_test PASSED in 3.8s //tensorflow/core/framework:resource_op_kernel_test PASSED in 1.1s //tensorflow/core/framework:shape_inference_test PASSED in 0.8s //tensorflow/core/framework:shape_inference_testutil_test PASSED in 1.3s //tensorflow/core/framework:tensor_shape_test PASSED in 8.3s //tensorflow/core/framework:tensor_slice_test PASSED in 0.9s //tensorflow/core/framework:tensor_test PASSED in 35.5s //tensorflow/core/framework:tensor_testutil_test PASSED in 0.8s //tensorflow/core/framework:tensor_util_test PASSED in 0.7s //tensorflow/core/framework:tracking_allocator_test PASSED in 0.7s //tensorflow/core/framework:types_test PASSED in 0.9s //tensorflow/core/framework:variant_op_registry_test PASSED in 18.3s //tensorflow/core/framework:variant_test PASSED in 0.8s //tensorflow/core/framework/registration:registration_test PASSED in 0.5s //tensorflow/core/function/capture:by_ref_capture_test PASSED in 9.0s //tensorflow/core/function/capture:capture_container_test PASSED in 7.9s //tensorflow/core/function/integration_test:side_inputs_manual_api_test PASSED in 16.1s //tensorflow/core/function/integration_test:side_inputs_test PASSED in 15.3s //tensorflow/core/function/polymorphism:function_cache_test PASSED in 8.0s //tensorflow/core/function/polymorphism:function_type_test PASSED in 7.1s //tensorflow/core/function/polymorphism:type_dispatch_test PASSED in 7.7s //tensorflow/core/function/runtime_client:runtime_client_cc_test PASSED in 38.9s //tensorflow/core/function/trace_type:default_types_test PASSED in 9.0s //tensorflow/core/function/trace_type:serialization_test PASSED in 7.6s //tensorflow/core/function/trace_type:trace_type_test PASSED in 43.8s //tensorflow/core/graph:algorithm_test PASSED in 1.1s //tensorflow/core/graph:collective_order_test PASSED in 0.5s //tensorflow/core/graph:control_flow_test PASSED in 0.7s //tensorflow/core/graph:costmodel_test PASSED in 1.8s //tensorflow/core/graph:edgeset_test PASSED in 1.0s //tensorflow/core/graph:graph_def_builder_test PASSED in 1.3s //tensorflow/core/graph:graph_partition_test PASSED in 1.3s //tensorflow/core/graph:graph_test PASSED in 1.1s //tensorflow/core/graph:node_builder_test PASSED in 1.6s //tensorflow/core/graph:optimizer_cse_test PASSED in 0.7s //tensorflow/core/graph:subgraph_test PASSED in 0.8s //tensorflow/core/graph:tensor_id_test PASSED in 0.7s //tensorflow/core/graph:validate_test PASSED in 0.9s //tensorflow/core/graph/regularization:simple_delete_test PASSED in 0.5s //tensorflow/core/graph/regularization:util_test PASSED in 0.6s //tensorflow/core/grappler:graph_topology_view_test PASSED in 0.1s //tensorflow/core/grappler:graph_view_test PASSED in 1.3s //tensorflow/core/grappler:grappler_item_builder_test PASSED in 1.3s //tensorflow/core/grappler:grappler_item_test PASSED in 1.3s //tensorflow/core/grappler:mutable_graph_view_test PASSED in 1.7s //tensorflow/core/grappler:utils_test PASSED in 3.5s //tensorflow/core/grappler/clusters:single_machine_test PASSED in 23.4s //tensorflow/core/grappler/clusters:virtual_cluster_test PASSED in 1.2s //tensorflow/core/grappler/costs:analytical_cost_estimator_test PASSED in 1.5s //tensorflow/core/grappler/costs:cost_estimator_test PASSED in 0.2s //tensorflow/core/grappler/costs:graph_memory_test PASSED in 1.1s //tensorflow/core/grappler/costs:graph_properties_test PASSED in 3.6s //tensorflow/core/grappler/costs:robust_stats_test PASSED in 0.1s //tensorflow/core/grappler/costs:utils_test PASSED in 1.3s //tensorflow/core/grappler/costs:virtual_placer_test PASSED in 0.5s //tensorflow/core/grappler/costs:virtual_scheduler_test PASSED in 2.4s //tensorflow/core/grappler/graph_analyzer:gen_node_test PASSED in 1.4s //tensorflow/core/grappler/graph_analyzer:graph_analyzer_test PASSED in 1.5s //tensorflow/core/grappler/graph_analyzer:hash_tools_test PASSED in 1.8s //tensorflow/core/grappler/graph_analyzer:sig_node_test PASSED in 2.3s //tensorflow/core/grappler/graph_analyzer:subgraph_test PASSED in 1.6s //tensorflow/core/grappler/inputs:utils_test PASSED in 0.1s //tensorflow/core/grappler/optimizers:arithmetic_optimizer_test_cpu PASSED in 3.6s //tensorflow/core/grappler/optimizers:auto_mixed_precision_test_cpu PASSED in 2.6s //tensorflow/core/grappler/optimizers:auto_parallel_test_cpu PASSED in 1.8s //tensorflow/core/grappler/optimizers:common_subgraph_elimination_test_cpu PASSED in 3.2s //tensorflow/core/grappler/optimizers:custom_graph_optimizer_registry_test_cpu PASSED in 5.0s //tensorflow/core/grappler/optimizers:debug_stripper_test_cpu PASSED in 1.5s //tensorflow/core/grappler/optimizers:dependency_optimizer_test_cpu PASSED in 1.7s //tensorflow/core/grappler/optimizers:evaluation_utils_test PASSED in 0.7s //tensorflow/core/grappler/optimizers:function_api_info_test PASSED in 0.1s //tensorflow/core/grappler/optimizers:function_optimizer_test_cpu PASSED in 3.3s //tensorflow/core/grappler/optimizers:generic_layout_optimizer_test_cpu PASSED in 1.7s //tensorflow/core/grappler/optimizers:generic_layout_optimizer_transposer_factory_test PASSED in 0.4s //tensorflow/core/grappler/optimizers:generic_layout_optimizer_transposer_test_cpu PASSED in 1.7s //tensorflow/core/grappler/optimizers:graph_optimizer_stage_test_cpu PASSED in 2.7s //tensorflow/core/grappler/optimizers:implementation_selector_test PASSED in 2.7s //tensorflow/core/grappler/optimizers:loop_optimizer_test_cpu PASSED in 2.0s //tensorflow/core/grappler/optimizers:memory_optimizer_test_cpu PASSED in 2.3s //tensorflow/core/grappler/optimizers:meta_optimizer_test_cpu PASSED in 7.6s //tensorflow/core/grappler/optimizers:mkl_remapper_test PASSED in 1.9s //tensorflow/core/grappler/optimizers:model_pruner_test_cpu PASSED in 1.5s //tensorflow/core/grappler/optimizers:pin_to_host_optimizer_test_cpu PASSED in 2.0s //tensorflow/core/grappler/optimizers:remapper_test_cpu PASSED in 2.6s //tensorflow/core/grappler/optimizers:scoped_allocator_optimizer_test PASSED in 2.4s //tensorflow/core/grappler/optimizers:shape_optimizer_test_cpu PASSED in 1.8s //tensorflow/core/grappler/optimizers:static_schedule_test_cpu PASSED in 1.6s //tensorflow/core/grappler/optimizers:tfg_optimizer_hook_test PASSED in 0.5s //tensorflow/core/grappler/optimizers/data:auto_shard_test PASSED in 0.5s //tensorflow/core/grappler/optimizers/data:autotune_buffer_sizes_test PASSED in 0.7s //tensorflow/core/grappler/optimizers/data:batch_parallelization_test PASSED in 0.8s //tensorflow/core/grappler/optimizers/data:disable_intra_op_parallelism_test PASSED in 0.5s //tensorflow/core/grappler/optimizers/data:disable_prefetch_legacy_autotune_test PASSED in 0.8s //tensorflow/core/grappler/optimizers/data:enable_gradient_descent_test PASSED in 0.6s //tensorflow/core/grappler/optimizers/data:filter_fusion_test PASSED in 1.0s //tensorflow/core/grappler/optimizers/data:filter_parallelization_test PASSED in 0.7s //tensorflow/core/grappler/optimizers/data:function_utils_test PASSED in 1.1s //tensorflow/core/grappler/optimizers/data:fusion_utils_test PASSED in 0.5s //tensorflow/core/grappler/optimizers/data:graph_utils_test PASSED in 0.7s //tensorflow/core/grappler/optimizers/data:inject_prefetch_test PASSED in 0.6s //tensorflow/core/grappler/optimizers/data:make_deterministic_test PASSED in 0.9s //tensorflow/core/grappler/optimizers/data:make_sloppy_test PASSED in 0.8s //tensorflow/core/grappler/optimizers/data:map_and_batch_fusion_test PASSED in 0.4s //tensorflow/core/grappler/optimizers/data:map_and_filter_fusion_test PASSED in 0.8s //tensorflow/core/grappler/optimizers/data:map_fusion_test PASSED in 0.6s //tensorflow/core/grappler/optimizers/data:map_parallelization_test PASSED in 0.6s //tensorflow/core/grappler/optimizers/data:noop_elimination_test PASSED in 0.4s //tensorflow/core/grappler/optimizers/data:parallel_batch_test PASSED in 0.4s //tensorflow/core/grappler/optimizers/data:replicate_on_split_test PASSED in 0.6s //tensorflow/core/grappler/optimizers/data:shuffle_and_repeat_fusion_test PASSED in 0.4s //tensorflow/core/grappler/optimizers/data:slack_test PASSED in 0.9s //tensorflow/core/grappler/optimizers/data:split_utils_test PASSED in 1.0s //tensorflow/core/grappler/optimizers/data:use_private_thread_pool_test PASSED in 1.0s //tensorflow/core/grappler/optimizers/inference:batch_op_rewriter_test PASSED in 0.1s //tensorflow/core/grappler/utils:canonicalizer_test PASSED in 1.8s //tensorflow/core/grappler/utils:colocation_test PASSED in 0.5s //tensorflow/core/grappler/utils:frame_test PASSED in 0.6s //tensorflow/core/grappler/utils:functions_test PASSED in 2.2s //tensorflow/core/grappler/utils:graph_view_internal_test PASSED in 0.5s //tensorflow/core/grappler/utils:graph_view_test PASSED in 2.3s //tensorflow/core/grappler/utils:grappler_test_test PASSED in 7.6s //tensorflow/core/grappler/utils:pattern_utils_test PASSED in 0.9s //tensorflow/core/grappler/utils:scc_test PASSED in 1.0s //tensorflow/core/grappler/utils:symbolic_shapes_test PASSED in 0.1s //tensorflow/core/grappler/utils:topological_sort_test PASSED in 0.6s //tensorflow/core/grappler/utils:tpu_test PASSED in 0.1s //tensorflow/core/grappler/utils:transitive_fanin_test PASSED in 1.1s //tensorflow/core/grappler/utils:traversal_test PASSED in 1.3s //tensorflow/core/grappler/verifiers:structure_verifier_test PASSED in 1.7s //tensorflow/core/ir:interfaces_test PASSED in 0.2s //tensorflow/core/ir:ops_test PASSED in 0.3s //tensorflow/core/ir:shape_inference_utils_test PASSED in 0.3s //tensorflow/core/ir:tf_op_registry_test PASSED in 0.3s //tensorflow/core/ir:tf_op_wrapper_test PASSED in 0.2s //tensorflow/core/ir:utility_test PASSED in 0.1s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:arg_as_control_ret.pbtxt.test PASSED in 0.8s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:backedge_segment.pbtxt.test PASSED in 0.7s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:empty.pbtxt.test PASSED in 1.0s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:error_during_backedge.pbtxt.test PASSED in 0.7s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:import_case_with_attr_inference.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:import_if_with_attr_inference.pbtxt.test PASSED in 1.8s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:import_iterator_get_next_attr_inference.pbtxt.test PASSED in 0.8s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:import_underscore_output_shapes.pbtxt.test PASSED in 0.6s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:import_while_with_attr_inference.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:infeed_dequeue.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:infer_arg_handle_type.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:infer_with_output_shapes.pbtxt.test PASSED in 0.6s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_arg_name.pbtxt.test PASSED in 0.4s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_backedge_input_size.pbtxt.test PASSED in 0.7s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_duplicated_node_name.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_edge_index.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_edge_name.pbtxt.test PASSED in 0.6s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_empty_attr_key.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_empty_func_attr_key.pbtxt.test PASSED in 0.9s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_empty_func_attr_name.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_empty_op_type.pbtxt.test PASSED in 0.7s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_func_with_empty_name.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_function_import.pbtxt.test PASSED in 0.6s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_generic_func_with_empty_control_result.pbtxt.test PASSED in 0.9s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_generic_func_with_empty_input.pbtxt.test PASSED in 0.4s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_generic_func_with_empty_name.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_generic_func_with_empty_result.pbtxt.test PASSED in 1.0s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_generic_function_attr_name.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_generic_function_named_edge_index.pbtxt.test PASSED in 0.7s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_handle_data.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_missing_control_input.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_missing_control_result.pbtxt.test PASSED in 0.4s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_missing_control_result_value.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_missing_data_result.pbtxt.test PASSED in 0.7s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_missing_data_result_value.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_missing_input.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_missing_two_inputs.pbtxt.test PASSED in 0.7s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_named_edge_index.pbtxt.test PASSED in 0.6s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_op_name.pbtxt.test PASSED in 0.7s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:invalid_type_list.pbtxt.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:legacy_call.pbtxt.test PASSED in 0.6s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:negative_shape.pbtxt.test PASSED in 0.6s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:negative_zero_constant.pbtxt.test PASSED in 0.9s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:three_nodes_with_attrs.pbtxt.test PASSED in 0.6s //tensorflow/core/ir/importexport/tests/graphdef_to_mlir:version.pbtxt.test PASSED in 0.4s //tensorflow/core/ir/importexport/tests/mlir_to_graphdef:empty.mlir.test PASSED in 0.6s //tensorflow/core/ir/importexport/tests/mlir_to_graphdef:fulltype.mlir.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/mlir_to_graphdef:func_with_no_args_or_results.mlir.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/mlir_to_graphdef:negative_zero_constant.mlir.test PASSED in 0.4s //tensorflow/core/ir/importexport/tests/mlir_to_graphdef:nested_legacy_call.mlir.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/mlir_to_graphdef:three_nodes_with_attrs.mlir.test PASSED in 0.4s //tensorflow/core/ir/importexport/tests/mlir_to_graphdef:version.mlir.test PASSED in 0.5s //tensorflow/core/ir/importexport/tests/saved_model:saved_model_roundtrip_test PASSED in 0.5s //tensorflow/core/ir/tests:attributes.mlir.test PASSED in 0.5s //tensorflow/core/ir/tests:canonicalize.mlir.test PASSED in 0.5s //tensorflow/core/ir/tests:compatible_types.mlir.test PASSED in 0.4s //tensorflow/core/ir/tests:concrete-ops.mlir.test PASSED in 0.6s //tensorflow/core/ir/tests:generic_concrete_ops.mlir.test PASSED in 0.5s //tensorflow/core/ir/tests:invalid-concrete-ops.mlir.test PASSED in 0.5s //tensorflow/core/ir/tests:invalid-preserved-attrs.mlir.test PASSED in 0.4s //tensorflow/core/ir/tests:invalid.mlir.test PASSED in 0.6s //tensorflow/core/ir/tests:invalid_types.mlir.test PASSED in 0.5s //tensorflow/core/ir/tests:ops.mlir.test PASSED in 0.5s //tensorflow/core/ir/tests:region-invalid-ops.mlir.test PASSED in 0.5s //tensorflow/core/ir/tests:region-ops-graph.mlir.test PASSED in 0.8s //tensorflow/core/ir/tests:region-ops.mlir.test PASSED in 0.4s //tensorflow/core/ir/tests:types.mlir.test PASSED in 0.4s //tensorflow/core/ir/types:dialect_test PASSED in 0.2s //tensorflow/core/kernels:as_string_op_test PASSED in 0.9s //tensorflow/core/kernels:basic_ops_benchmark_test PASSED in 0.7s //tensorflow/core/kernels:batch_kernels_env_test PASSED in 0.9s //tensorflow/core/kernels:batch_kernels_test PASSED in 1.0s //tensorflow/core/kernels:bias_op_test PASSED in 0.5s //tensorflow/core/kernels:bincount_op_test_cpu PASSED in 0.6s //tensorflow/core/kernels:broadcast_to_op_test_cpu PASSED in 0.6s //tensorflow/core/kernels:cast_op_test_cpu PASSED in 1.1s //tensorflow/core/kernels:checkpoint_callback_manager_test PASSED in 0.7s //tensorflow/core/kernels:clustering_ops_test PASSED in 0.6s //tensorflow/core/kernels:composite_tensor_variant_test PASSED in 0.6s //tensorflow/core/kernels:concat_op_test PASSED in 0.6s //tensorflow/core/kernels:constant_op_test_cpu PASSED in 0.7s //tensorflow/core/kernels:control_flow_ops_test PASSED in 6.3s //tensorflow/core/kernels:conv_grad_filter_ops_benchmark_test_cpu PASSED in 0.6s //tensorflow/core/kernels:conv_grad_input_ops_benchmark_test_cpu PASSED in 0.7s //tensorflow/core/kernels:conv_ops_benchmark_test_cpu PASSED in 0.7s //tensorflow/core/kernels:conv_ops_test_cpu PASSED in 6.6s //tensorflow/core/kernels:count_ops_test PASSED in 0.6s //tensorflow/core/kernels:cross_op_test PASSED in 0.7s //tensorflow/core/kernels:cwise_ops_test_cpu PASSED in 0.7s //tensorflow/core/kernels:debug_ops_test PASSED in 1.0s //tensorflow/core/kernels:decode_wav_op_test PASSED in 1.6s //tensorflow/core/kernels:deep_conv2d_test PASSED in 0.6s //tensorflow/core/kernels:dequantize_op_test PASSED in 1.1s //tensorflow/core/kernels:diag_op_test_cpu PASSED in 0.4s //tensorflow/core/kernels:dynamic_partition_op_test_cpu PASSED in 0.6s //tensorflow/core/kernels:dynamic_stitch_op_test_cpu PASSED in 0.7s //tensorflow/core/kernels:eigen_activations_test PASSED in 0.2s //tensorflow/core/kernels:eigen_attention_test PASSED in 0.4s //tensorflow/core/kernels:eigen_backward_cuboid_convolutions_test PASSED in 0.5s //tensorflow/core/kernels:eigen_backward_spatial_convolutions_test PASSED in 0.1s //tensorflow/core/kernels:eigen_benchmark_cpu_test PASSED in 0.3s //tensorflow/core/kernels:eigen_mkldnn_contraction_kernel_test PASSED in 0.1s //tensorflow/core/kernels:eigen_pooling_test PASSED in 0.4s //tensorflow/core/kernels:encode_wav_op_test PASSED in 1.4s //tensorflow/core/kernels:fingerprint_op_test PASSED in 0.7s //tensorflow/core/kernels:fused_batch_norm_ex_op_test_cpu PASSED in 0.7s //tensorflow/core/kernels:fused_batch_norm_op_test_cpu PASSED in 1.0s //tensorflow/core/kernels:gather_nd_op_test_cpu PASSED in 0.7s //tensorflow/core/kernels:gather_op_test_cpu PASSED in 1.0s //tensorflow/core/kernels:guarantee_const_op_test PASSED in 0.6s //tensorflow/core/kernels:identity_n_op_test PASSED in 0.8s //tensorflow/core/kernels:identity_op_test PASSED in 0.7s //tensorflow/core/kernels:immutable_constant_op_test PASSED in 0.8s //tensorflow/core/kernels:in_topk_op_test PASSED in 0.5s //tensorflow/core/kernels:isotonic_regression_op_test PASSED in 0.6s //tensorflow/core/kernels:logging_ops_test PASSED in 1.8s //tensorflow/core/kernels:lookup_ops_test PASSED in 0.7s //tensorflow/core/kernels:loss_test PASSED in 0.2s //tensorflow/core/kernels:lrn_op_test_cpu PASSED in 1.8s //tensorflow/core/kernels:matmul_op_test_cpu PASSED in 3.5s //tensorflow/core/kernels:merge_v2_checkpoints_op_test PASSED in 0.6s //tensorflow/core/kernels:mfcc_dct_test PASSED in 0.1s //tensorflow/core/kernels:mfcc_mel_filterbank_test PASSED in 0.3s //tensorflow/core/kernels:mfcc_op_test_cpu PASSED in 1.7s //tensorflow/core/kernels:mfcc_test PASSED in 0.2s //tensorflow/core/kernels:multinomial_op_test_cpu PASSED in 0.5s //tensorflow/core/kernels:nn_ops_test_cpu PASSED in 1.0s //tensorflow/core/kernels:one_hot_op_test PASSED in 0.6s //tensorflow/core/kernels:ops_testutil_test PASSED in 0.6s //tensorflow/core/kernels:ops_util_test PASSED in 0.1s //tensorflow/core/kernels:parameterized_truncated_normal_op_test_cpu PASSED in 1.3s //tensorflow/core/kernels:parse_tensor_test PASSED in 0.6s //tensorflow/core/kernels:quantization_utils_test PASSED in 0.7s //tensorflow/core/kernels:quantize_and_dequantize_op_test_cpu PASSED in 0.8s //tensorflow/core/kernels:quantize_down_and_shrink_range_op_test PASSED in 0.8s //tensorflow/core/kernels:quantize_op_test PASSED in 0.8s //tensorflow/core/kernels:quantized_activation_ops_test PASSED in 0.6s //tensorflow/core/kernels:quantized_add_op_test PASSED in 1.0s //tensorflow/core/kernels:quantized_batch_norm_op_test PASSED in 0.7s //tensorflow/core/kernels:quantized_bias_add_op_test PASSED in 1.3s //tensorflow/core/kernels:quantized_concat_op_test PASSED in 0.8s //tensorflow/core/kernels:quantized_conv_ops_test PASSED in 0.9s //tensorflow/core/kernels:quantized_instance_norm_test PASSED in 0.9s //tensorflow/core/kernels:quantized_matmul_op_test PASSED in 1.2s //tensorflow/core/kernels:quantized_mul_op_test PASSED in 0.9s //tensorflow/core/kernels:quantized_pooling_ops_test PASSED in 1.0s //tensorflow/core/kernels:quantized_reshape_op_test PASSED in 1.0s //tensorflow/core/kernels:quantized_resize_bilinear_op_test PASSED in 2.8s //tensorflow/core/kernels:ragged_fill_empty_rows_op_test PASSED in 0.7s //tensorflow/core/kernels:ragged_gather_op_test PASSED in 0.7s //tensorflow/core/kernels:ragged_range_op_test PASSED in 1.2s //tensorflow/core/kernels:ragged_tensor_from_variant_op_test PASSED in 0.7s //tensorflow/core/kernels:ragged_tensor_to_sparse_kernel_test PASSED in 0.7s //tensorflow/core/kernels:ragged_tensor_to_tensor_op_test PASSED in 0.6s //tensorflow/core/kernels:ragged_tensor_to_variant_op_test PASSED in 0.5s //tensorflow/core/kernels:random_binomial_op_test_cpu PASSED in 0.7s //tensorflow/core/kernels:random_index_shuffle_test PASSED in 1.0s //tensorflow/core/kernels:random_op_test_cpu PASSED in 0.8s //tensorflow/core/kernels:random_poisson_op_test_cpu PASSED in 0.5s //tensorflow/core/kernels:range_sampler_test PASSED in 0.5s //tensorflow/core/kernels:reduction_ops_test_cpu PASSED in 0.8s //tensorflow/core/kernels:regex_replace_op_test PASSED in 0.7s //tensorflow/core/kernels:requantization_range_op_test PASSED in 0.9s //tensorflow/core/kernels:requantize_op_test PASSED in 1.0s //tensorflow/core/kernels:resource_ops_test PASSED in 0.6s //tensorflow/core/kernels:restore_op_test PASSED in 0.8s //tensorflow/core/kernels:restore_v2_op_test PASSED in 0.6s //tensorflow/core/kernels:reverse_op_test PASSED in 0.6s //tensorflow/core/kernels:roll_op_test PASSED in 0.7s //tensorflow/core/kernels:save_op_test PASSED in 0.8s //tensorflow/core/kernels:save_v2_op_test PASSED in 0.6s //tensorflow/core/kernels:scan_ops_test_cpu PASSED in 0.5s //tensorflow/core/kernels:scatter_nd_op_test_cpu PASSED in 0.5s //tensorflow/core/kernels:scatter_op_test PASSED in 0.6s //tensorflow/core/kernels:scoped_allocator_ops_test_cpu PASSED in 9.0s //tensorflow/core/kernels:sdca_ops_test PASSED in 1.4s //tensorflow/core/kernels:segment_reduction_ops_test PASSED in 0.4s //tensorflow/core/kernels:sendrecv_ops_test PASSED in 1.3s //tensorflow/core/kernels:sequence_ops_test PASSED in 0.8s //tensorflow/core/kernels:shape_ops_test PASSED in 1.2s //tensorflow/core/kernels:slice_op_test PASSED in 0.6s //tensorflow/core/kernels:spacetobatch_benchmark_test_cpu PASSED in 0.5s //tensorflow/core/kernels:sparse_add_op_test PASSED in 0.7s //tensorflow/core/kernels:sparse_dense_binary_op_shared_test PASSED in 0.6s //tensorflow/core/kernels:sparse_fill_empty_rows_op_test_cpu PASSED in 0.6s //tensorflow/core/kernels:sparse_matmul_op_test_cpu PASSED in 0.5s //tensorflow/core/kernels:sparse_reduce_sum_op_test PASSED in 0.6s //tensorflow/core/kernels:sparse_tensor_dense_matmul_op_test_cpu PASSED in 0.5s //tensorflow/core/kernels:sparse_to_dense_op_test_cpu PASSED in 1.7s //tensorflow/core/kernels:sparse_utils_test PASSED in 0.3s //tensorflow/core/kernels:sparse_xent_op_test_cpu PASSED in 0.4s //tensorflow/core/kernels:spectrogram_op_test_cpu PASSED in 6.1s //tensorflow/core/kernels:spectrogram_test PASSED in 0.1s //tensorflow/core/kernels:split_op_test_cpu PASSED in 0.7s //tensorflow/core/kernels:split_v_op_test_cpu PASSED in 1.2s //tensorflow/core/kernels:strided_slice_op_test PASSED in 0.5s //tensorflow/core/kernels:string_format_op_test PASSED in 0.9s //tensorflow/core/kernels:string_ngrams_op_test PASSED in 0.8s //tensorflow/core/kernels:string_split_op_test PASSED in 0.6s //tensorflow/core/kernels:substr_op_test PASSED in 0.5s //tensorflow/core/kernels:summary_audio_op_test PASSED in 0.8s //tensorflow/core/kernels:summary_image_op_test PASSED in 0.7s //tensorflow/core/kernels:summary_op_test PASSED in 0.6s //tensorflow/core/kernels:summary_tensor_op_test PASSED in 0.7s //tensorflow/core/kernels:tensor_cord_test PASSED in 0.2s //tensorflow/core/kernels:tensor_flag_utils_test PASSED in 0.1s //tensorflow/core/kernels:tensor_map_test PASSED in 0.1s //tensorflow/core/kernels:training_ops_test PASSED in 0.6s //tensorflow/core/kernels:transpose_util_test PASSED in 0.5s //tensorflow/core/kernels:unary_ops_composition_test_cpu PASSED in 2.0s //tensorflow/core/kernels:unique_op_test PASSED in 0.6s //tensorflow/core/kernels:variable_ops_test PASSED in 1.2s //tensorflow/core/kernels:while_op_test PASSED in 0.9s //tensorflow/core/kernels:xent_op_test_cpu PASSED in 0.5s //tensorflow/core/kernels/batching_util:basic_batch_scheduler_test PASSED in 0.3s //tensorflow/core/kernels/batching_util:batch_input_task_test PASSED in 0.8s //tensorflow/core/kernels/batching_util:batch_resource_base_test PASSED in 0.1s //tensorflow/core/kernels/batching_util:batch_scheduler_test PASSED in 0.1s //tensorflow/core/kernels/batching_util:bounded_executor_test PASSED in 20.2s //tensorflow/core/kernels/batching_util:input_split_metadata_test PASSED in 0.1s //tensorflow/core/kernels/batching_util:periodic_function_test PASSED in 1.5s //tensorflow/core/kernels/batching_util:serial_device_batch_scheduler_test PASSED in 2.0s //tensorflow/core/kernels/batching_util:shared_batch_scheduler_test PASSED in 2.9s //tensorflow/core/kernels/batching_util:threadsafe_status_test PASSED in 0.1s //tensorflow/core/kernels/data:batch_dataset_op_test PASSED in 2.7s //tensorflow/core/kernels/data:cache_dataset_ops_test PASSED in 0.7s //tensorflow/core/kernels/data:concatenate_dataset_op_test PASSED in 1.0s //tensorflow/core/kernels/data:filter_dataset_op_test PASSED in 1.3s //tensorflow/core/kernels/data:finalize_dataset_op_test PASSED in 1.0s //tensorflow/core/kernels/data:fixed_length_record_dataset_op_test PASSED in 0.7s //tensorflow/core/kernels/data:flat_map_dataset_op_test PASSED in 1.2s //tensorflow/core/kernels/data:get_options_op_test PASSED in 0.6s //tensorflow/core/kernels/data:interleave_dataset_op_test PASSED in 1.5s //tensorflow/core/kernels/data:iterator_ops_test PASSED in 1.0s //tensorflow/core/kernels/data:map_dataset_op_test PASSED in 0.9s //tensorflow/core/kernels/data:map_defun_op_test PASSED in 0.8s //tensorflow/core/kernels/data:optimize_dataset_op_test PASSED in 0.7s //tensorflow/core/kernels/data:options_dataset_op_test PASSED in 0.6s //tensorflow/core/kernels/data:padded_batch_dataset_op_test PASSED in 0.9s //tensorflow/core/kernels/data:parallel_batch_dataset_op_test PASSED in 0.9s //tensorflow/core/kernels/data:parallel_filter_dataset_op_test PASSED in 1.2s //tensorflow/core/kernels/data:parallel_interleave_dataset_op_test PASSED in 1.9s //tensorflow/core/kernels/data:parallel_map_dataset_op_test PASSED in 1.1s //tensorflow/core/kernels/data:prefetch_autotuner_test PASSED in 0.7s //tensorflow/core/kernels/data:prefetch_dataset_op_test PASSED in 0.8s //tensorflow/core/kernels/data:range_dataset_op_test PASSED in 0.7s //tensorflow/core/kernels/data:reduce_dataset_op_test PASSED in 0.7s //tensorflow/core/kernels/data:repeat_dataset_op_test PASSED in 1.1s //tensorflow/core/kernels/data:rewrite_dataset_op_test PASSED in 0.6s //tensorflow/core/kernels/data:shard_dataset_op_test PASSED in 1.1s //tensorflow/core/kernels/data:shuffle_dataset_op_test PASSED in 0.9s //tensorflow/core/kernels/data:skip_dataset_op_test PASSED in 0.9s //tensorflow/core/kernels/data:sparse_tensor_slice_dataset_op_test PASSED in 1.1s //tensorflow/core/kernels/data:take_dataset_op_test PASSED in 1.1s //tensorflow/core/kernels/data:tensor_dataset_op_test PASSED in 0.8s //tensorflow/core/kernels/data:tensor_slice_dataset_op_test PASSED in 0.7s //tensorflow/core/kernels/data:text_line_dataset_op_test PASSED in 1.7s //tensorflow/core/kernels/data:tf_record_dataset_op_test PASSED in 1.4s //tensorflow/core/kernels/data:window_dataset_op_test PASSED in 0.8s //tensorflow/core/kernels/data:zip_dataset_op_test PASSED in 0.9s //tensorflow/core/kernels/data/experimental:assert_next_dataset_op_test PASSED in 0.8s //tensorflow/core/kernels/data/experimental:assert_prev_dataset_op_test PASSED in 1.2s //tensorflow/core/kernels/data/experimental:auto_shard_dataset_op_test PASSED in 0.8s //tensorflow/core/kernels/data/experimental:directed_interleave_dataset_op_test PASSED in 1.1s //tensorflow/core/kernels/data/experimental:list_dataset_op_test PASSED in 1.0s //tensorflow/core/kernels/data/experimental:map_and_batch_dataset_op_test PASSED in 1.4s //tensorflow/core/kernels/data/experimental:parallel_interleave_dataset_op_test PASSED in 1.2s //tensorflow/core/kernels/data/experimental:random_dataset_op_test PASSED in 0.6s //tensorflow/core/kernels/data/experimental:sampling_dataset_op_test PASSED in 0.6s //tensorflow/core/kernels/data/experimental:save_dataset_op_test PASSED in 1.2s //tensorflow/core/kernels/data/experimental:unique_dataset_op_test PASSED in 0.6s //tensorflow/core/kernels/image:adjust_contrast_op_benchmark_test_cpu PASSED in 0.6s //tensorflow/core/kernels/image:adjust_contrast_op_test PASSED in 0.6s //tensorflow/core/kernels/image:colorspace_op_test PASSED in 1.1s //tensorflow/core/kernels/image:crop_and_resize_op_benchmark_test_cpu PASSED in 0.5s //tensorflow/core/kernels/image:crop_and_resize_op_test PASSED in 1.4s //tensorflow/core/kernels/image:encode_jpeg_op_test PASSED in 0.7s //tensorflow/core/kernels/image:mirror_pad_op_benchmark_test_cpu PASSED in 0.6s //tensorflow/core/kernels/image:mirror_pad_op_test PASSED in 2.1s //tensorflow/core/kernels/image:non_max_suppression_op_benchmark_test PASSED in 0.5s //tensorflow/core/kernels/image:non_max_suppression_op_test PASSED in 1.0s //tensorflow/core/kernels/image:resize_area_op_test PASSED in 1.7s //tensorflow/core/kernels/image:resize_benchmark_test_cpu PASSED in 1.1s //tensorflow/core/kernels/image:resize_bicubic_op_test PASSED in 3.7s //tensorflow/core/kernels/image:resize_ops_test_cpu PASSED in 2.5s //tensorflow/core/kernels/image:sampling_kernels_test PASSED in 0.6s //tensorflow/core/kernels/image:scale_and_translate_op_test PASSED in 1.6s //tensorflow/core/kernels/linalg:banded_triangular_solve_op_test_cpu PASSED in 0.6s //tensorflow/core/kernels/linalg:matrix_triangular_solve_op_test_cpu PASSED in 0.7s //tensorflow/core/kernels/mkl:mkl_conv_ops_test PASSED in 0.1s //tensorflow/core/kernels/mkl:mkl_dequantize_op_test PASSED in 0.3s //tensorflow/core/kernels/mkl:mkl_fused_batch_norm_op_test PASSED in 0.1s //tensorflow/core/kernels/mkl:mkl_fused_ops_test PASSED in 0.2s //tensorflow/core/kernels/mkl:mkl_matmul_op_benchmark PASSED in 0.4s //tensorflow/core/kernels/mkl:mkl_qmatmul_op_test PASSED in 0.1s //tensorflow/core/kernels/mkl:mkl_quantize_op_test PASSED in 0.1s //tensorflow/core/kernels/mkl:mkl_quantized_concat_op_test PASSED in 0.1s //tensorflow/core/kernels/mkl:mkl_quantized_conv_ops_perchannel_test PASSED in 0.4s //tensorflow/core/kernels/mkl:mkl_quantized_conv_ops_test PASSED in 0.1s //tensorflow/core/kernels/mkl:mkl_quantized_pooling_ops_test PASSED in 0.6s //tensorflow/core/kernels/mkl:mkl_relu_op_test PASSED in 0.1s //tensorflow/core/kernels/mkl:mkl_requantize_ops_test PASSED in 0.7s //tensorflow/core/kernels/mkl:mkl_swish_op_test PASSED in 0.2s //tensorflow/core/kernels/mkl:onednn_nn_ops_benchmark PASSED in 0.1s //tensorflow/core/kernels/sparse:kernels_test PASSED in 0.8s //tensorflow/core/kernels/uniform_quant_ops:math_utils_test PASSED in 0.2s //tensorflow/core/kernels/uniform_quant_ops:tensor_utils_test PASSED in 0.8s //tensorflow/core/kernels/uniform_quant_ops:uniform_dequantize_op_test PASSED in 0.5s //tensorflow/core/kernels/uniform_quant_ops:uniform_quantize_op_test PASSED in 0.5s //tensorflow/core/kernels/uniform_quant_ops:uniform_quantized_add_op_test PASSED in 0.5s //tensorflow/core/kernels/uniform_quant_ops:uniform_quantized_clip_by_value_op_test PASSED in 0.5s //tensorflow/core/kernels/uniform_quant_ops:uniform_quantized_convolution_ops_test PASSED in 0.6s //tensorflow/core/kernels/uniform_quant_ops:uniform_quantized_dot_ops_test PASSED in 0.6s //tensorflow/core/kernels/uniform_quant_ops:uniform_requantize_op_test PASSED in 0.8s //tensorflow/core/lib/db:sqlite_test PASSED in 0.2s //tensorflow/core/lib/gif:lib_gif_io_test PASSED in 1.2s //tensorflow/core/lib/jpeg:lib_jpeg_jpeg_mem_unittest PASSED in 0.6s //tensorflow/core/ops:cudnn_rnn_ops_test_cc PASSED in 1.0s //tensorflow/core/ops:ops_array_grad_test PASSED in 1.1s //tensorflow/core/ops:ops_math_grad_test PASSED in 3.8s //tensorflow/core/ops:ops_tests PASSED in 0.7s //tensorflow/core/ops/compat:backwards_compatibility_test PASSED in 0.7s //tensorflow/core/platform:__tensorflow_tsl_platform_profile_utils_cpu_utils_test PASSED in 0.1s //tensorflow/core/platform:enable_tf2_utils_test PASSED in 0.6s //tensorflow/core/platform:env_test PASSED in 2.7s //tensorflow/core/platform:fake_python_env_test PASSED in 0.1s //tensorflow/core/platform:file_system_test PASSED in 0.9s //tensorflow/core/platform:platform_strings_test PASSED in 0.1s //tensorflow/core/platform:ram_file_system_test PASSED in 29.6s //tensorflow/core/platform:resource_loader_test PASSED in 0.8s //tensorflow/core/platform:vmodule_benchmark_test PASSED in 0.1s //tensorflow/core/platform:vmodule_test PASSED in 0.2s //tensorflow/core/profiler/backends/cpu:host_tracer_test PASSED in 0.4s //tensorflow/core/profiler/convert:hlo_proto_to_graph_view_test PASSED in 0.3s //tensorflow/core/profiler/convert:hlo_proto_to_memory_visualization_utils_test PASSED in 0.4s //tensorflow/core/profiler/convert:op_stats_to_pod_stats_test PASSED in 0.7s //tensorflow/core/profiler/convert:op_stats_to_pod_viewer_test PASSED in 0.1s //tensorflow/core/profiler/convert:op_stats_to_tf_stats_test PASSED in 0.4s //tensorflow/core/profiler/convert:xplane_to_kernel_stats_db_test PASSED in 0.1s //tensorflow/core/profiler/convert:xplane_to_memory_profile_test PASSED in 0.4s //tensorflow/core/profiler/convert:xplane_to_op_metrics_db_test PASSED in 0.1s //tensorflow/core/profiler/convert:xplane_to_op_stats_test PASSED in 0.5s //tensorflow/core/profiler/convert:xplane_to_step_events_test PASSED in 0.1s //tensorflow/core/profiler/convert:xplane_to_tf_functions_test PASSED in 0.1s //tensorflow/core/profiler/convert:xplane_to_tool_names_test PASSED in 0.1s //tensorflow/core/profiler/convert/trace_viewer:trace_viewer_visibility_test PASSED in 0.1s //tensorflow/core/profiler/internal:tfprof_show_test PASSED in 0.8s //tensorflow/core/profiler/internal:tfprof_stats_test PASSED in 1.5s //tensorflow/core/profiler/internal:tfprof_tensor_test PASSED in 0.7s //tensorflow/core/profiler/internal:tfprof_timeline_test PASSED in 0.8s //tensorflow/core/profiler/internal/advisor:tfprof_advisor_test PASSED in 0.6s //tensorflow/core/profiler/lib:profiler_disabled_test PASSED in 0.2s //tensorflow/core/profiler/utils:derived_timeline_test PASSED in 0.5s //tensorflow/core/profiler/utils:kernel_stats_utils_test PASSED in 0.1s //tensorflow/core/profiler/utils:op_metrics_db_utils_test PASSED in 0.2s //tensorflow/core/profiler/utils:step_intersection_test PASSED in 0.1s //tensorflow/core/summary:schema_test PASSED in 0.2s //tensorflow/core/summary:summary_db_writer_test PASSED in 0.4s //tensorflow/core/summary:summary_file_writer_test PASSED in 0.4s //tensorflow/core/tfrt/common:pjrt_state_test PASSED in 5.7s //tensorflow/core/tfrt/common:pjrt_util_test PASSED in 5.3s //tensorflow/core/tfrt/fallback:cost_recorder_test PASSED in 0.3s //tensorflow/core/tfrt/fallback:fallback_state_test PASSED in 0.5s //tensorflow/core/transforms:eval_utils_test PASSED in 1.2s //tensorflow/core/transforms:graph_transform_wrapper_test PASSED in 0.3s //tensorflow/core/util:bcast_test PASSED in 1.5s //tensorflow/core/util:command_line_flags_test PASSED in 0.8s //tensorflow/core/util:debug_data_dumper_test PASSED in 1.4s //tensorflow/core/util:debug_events_writer_test PASSED in 9.2s //tensorflow/core/util:dump_graph_test PASSED in 1.6s //tensorflow/core/util:equal_graph_def_test PASSED in 0.7s //tensorflow/core/util:events_writer_test PASSED in 3.0s //tensorflow/core/util:example_proto_fast_parsing_test PASSED in 2.7s //tensorflow/core/util:example_proto_helper_test PASSED in 0.8s //tensorflow/core/util:exec_on_stall_test PASSED in 2.1s //tensorflow/core/util:fake_clock_env_test PASSED in 1.5s //tensorflow/core/util:incremental_barrier_test PASSED in 0.3s //tensorflow/core/util:matmul_bcast_test PASSED in 0.9s //tensorflow/core/util:memmapped_file_system_test PASSED in 1.1s //tensorflow/core/util:overflow_test PASSED in 0.2s //tensorflow/core/util:presized_cuckoo_map_test PASSED in 2.2s //tensorflow/core/util:ragged_to_dense_util_test PASSED in 0.5s //tensorflow/core/util:reffed_status_callback_test PASSED in 0.9s //tensorflow/core/util:reporter_test PASSED in 1.0s //tensorflow/core/util:saved_tensor_slice_util_test PASSED in 1.1s //tensorflow/core/util:semver_test PASSED in 0.8s //tensorflow/core/util:stat_summarizer_test PASSED in 0.9s //tensorflow/core/util:strided_slice_op_test PASSED in 1.2s //tensorflow/core/util:tensor_format_test PASSED in 1.3s //tensorflow/core/util:tensor_slice_reader_test PASSED in 1.3s //tensorflow/core/util:tensor_slice_set_test PASSED in 0.7s //tensorflow/core/util:tensor_slice_util_test PASSED in 0.8s //tensorflow/core/util:tensor_slice_writer_test PASSED in 1.6s //tensorflow/core/util:work_sharder_test PASSED in 0.9s //tensorflow/core/util/ctc:ctc_beam_search_test PASSED in 0.2s //tensorflow/core/util/proto:descriptor_pool_registry_test PASSED in 0.6s //tensorflow/core/util/proto:proto_utils_test PASSED in 0.7s //tensorflow/core/util/quantization:uniform_quant_ops_params_test PASSED in 0.6s //tensorflow/core/util/sparse:sparse_tensor_test PASSED in 0.1s //tensorflow/core/util/tensor_bundle:tensor_bundle_test PASSED in 30.3s //tensorflow/dtensor/mlir:dtensor_location_test PASSED in 0.4s //tensorflow/dtensor/mlir:group_assignment_test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:annotate_global_shape.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:cluster_function_conversion.mlir.test PASSED in 0.4s //tensorflow/dtensor/mlir/tests:constant_folding.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:designate_resource_handle_mesh.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:device_mesh_cluster_coarsening.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:dtensor_all_gather.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:dtensor_all_scatter.mlir.test PASSED in 0.4s //tensorflow/dtensor/mlir/tests:dtensor_allreduce_combine_optimization.mlir.test PASSED in 0.4s //tensorflow/dtensor/mlir/tests:dtensor_allreduce_lowering.mlir.test PASSED in 0.4s //tensorflow/dtensor/mlir/tests:dtensor_allreduce_scatter_optimization.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:dtensor_allreduce_sum_optimization.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:dtensor_alltoall_lowering.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:dtensor_layout_must_execute.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:dtensor_layout_to_xla_sharding_op.mlir.test PASSED in 0.9s //tensorflow/dtensor/mlir/tests:dtensor_mixed_precision_reduce.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:dtensor_reduce_scatter_lowering.mlir.test PASSED in 1.5s //tensorflow/dtensor/mlir/tests:dtensor_remove_dtensorlayout.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:dtensor_replace_auxiliary_layout_op.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:dtensor_replace_relayout_with_identity.mlir.test PASSED in 0.4s //tensorflow/dtensor/mlir/tests:dtensor_set_hlo_sharding.mlir.test PASSED in 1.7s //tensorflow/dtensor/mlir/tests:dtensor_set_hlo_sharding_default.mlir.test PASSED in 0.9s //tensorflow/dtensor/mlir/tests:dtensor_xla_spmd_integration.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:elide_identity_before_copy_to_mesh.mlir.test PASSED in 1.0s //tensorflow/dtensor/mlir/tests:function_renaming.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:handle_cross_cluster_dependencies.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:handle_sparsetensors.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:layout_propagation_v2.mlir.test PASSED in 1.3s //tensorflow/dtensor/mlir/tests:lower_send_recv.mlir.test PASSED in 0.4s //tensorflow/dtensor/mlir/tests:merge_clusters.mlir.test PASSED in 1.1s //tensorflow/dtensor/mlir/tests:mesh_propagation.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:multi_device_expansion.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:op_to_device_cluster.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:propagate_default_layout.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:propagate_device_id_to_function.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:restore_and_assign.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:restore_shape_inference.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:set_default_sharding.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:sparse_expansion.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:spmd_batchparallel.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:spmd_concat.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:spmd_conv.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:spmd_einsum.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:spmd_expansion.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:spmd_io_ops.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:spmd_iterator.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:spmd_matmul.mlir.test PASSED in 0.7s //tensorflow/dtensor/mlir/tests:spmd_random.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:spmd_save_restore.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:spmd_segment_sum.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:spmd_slice.mlir.test PASSED in 0.8s //tensorflow/dtensor/mlir/tests:spmd_softmax_loss.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:spmd_squeeze.mlir.test PASSED in 0.5s //tensorflow/dtensor/mlir/tests:spmd_var_handle.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:tf_dtensor_ops.mlir.test PASSED in 0.6s //tensorflow/dtensor/mlir/tests:tpu_add_resource_device_attribute.mlir.test PASSED in 0.4s //tensorflow/dtensor/mlir/tests:tpu_integration.mlir.test PASSED in 1.8s //tensorflow/dtensor/mlir/tests:undo_merge_const_across_mesh.mlir.test PASSED in 0.8s //tensorflow/dtensor/mlir/tests:update_tpu_metadata.mlir.test PASSED in 0.4s //tensorflow/dtensor/python/tests:collective_combine_all_reduce_test_cpu PASSED in 15.4s //tensorflow/dtensor/python/tests:collective_test_cpu PASSED in 14.3s //tensorflow/dtensor/python/tests:config_test_cpu PASSED in 8.5s //tensorflow/dtensor/python/tests:device_test_cpu PASSED in 46.3s //tensorflow/dtensor/python/tests:layout_test_cpu PASSED in 7.4s //tensorflow/dtensor/python/tests:multi_client_test_cpu PASSED in 18.5s //tensorflow/dtensor/python/tests:numpy_util_test_cpu PASSED in 10.4s //tensorflow/dtensor/tests:executable_manager_test PASSED in 33.6s //tensorflow/dtensor/tests:layout_to_xla_sharding_test PASSED in 0.2s //tensorflow/dtensor/tests:tensor_layout_test PASSED in 0.3s //tensorflow/examples/adding_an_op:fact_test PASSED in 19.4s //tensorflow/examples/adding_an_op:zero_out_1_test PASSED in 19.7s //tensorflow/examples/adding_an_op:zero_out_2_test PASSED in 19.0s //tensorflow/examples/adding_an_op:zero_out_3_test PASSED in 16.6s //tensorflow/examples/custom_ops_doc/multiplex_1:multiplex_1_test PASSED in 20.3s //tensorflow/examples/custom_ops_doc/multiplex_2:multiplex_2_test_cpu PASSED in 15.8s //tensorflow/examples/custom_ops_doc/multiplex_3:multiplex_3_test PASSED in 32.7s //tensorflow/examples/custom_ops_doc/multiplex_4:multiplex_4_test PASSED in 17.6s //tensorflow/examples/custom_ops_doc/simple_hash_table:simple_hash_table_test PASSED in 17.8s //tensorflow/examples/custom_ops_doc/sleep:sleep_test PASSED in 16.8s //tensorflow/examples/speech_commands:accuracy_utils_test PASSED in 1.2s //tensorflow/examples/speech_commands:models_test PASSED in 20.1s //tensorflow/examples/speech_commands:recognize_commands_test PASSED in 1.9s //tensorflow/examples/wav_to_spectrogram:wav_to_spectrogram_test PASSED in 2.0s //tensorflow/js:ts_op_gen_test PASSED in 0.5s //tensorflow/python:array_grad_test_cpu PASSED in 11.5s //tensorflow/python:autograph_ops_test PASSED in 10.7s //tensorflow/python:batch_norm_benchmark_cpu PASSED in 9.0s //tensorflow/python:bincount_ops_test_cpu PASSED in 11.5s //tensorflow/python:bitwise_ops_test_cpu PASSED in 8.7s //tensorflow/python:clip_ops_test PASSED in 7.9s //tensorflow/python:clustering_ops_test PASSED in 21.1s //tensorflow/python:collective_ops_benchmark_cpu PASSED in 9.4s //tensorflow/python:collective_ops_gpu_test_2gpu PASSED in 11.9s //tensorflow/python:collective_ops_gpu_test_cpu PASSED in 10.2s //tensorflow/python:collective_ops_test PASSED in 15.8s //tensorflow/python:collective_ops_xla_test PASSED in 8.6s //tensorflow/python:compiled_collective_ops_gpu_test_2gpu PASSED in 11.6s //tensorflow/python:compiled_collective_ops_gpu_test_cpu PASSED in 10.6s //tensorflow/python:concat_benchmark_cpu PASSED in 8.2s //tensorflow/python:control_flow_ops_benchmark_cpu PASSED in 9.8s //tensorflow/python:control_flow_v2_enable_test PASSED in 6.9s //tensorflow/python:control_flow_v2_toggles_test PASSED in 10.4s //tensorflow/python:dequantize_op_test PASSED in 7.8s //tensorflow/python:embedding_ops_test_cpu PASSED in 8.7s //tensorflow/python:factory_ops_test_cpu PASSED in 9.2s //tensorflow/python:functional_ops_test PASSED in 7.6s //tensorflow/python:gradient_checker_v2_test_cpu PASSED in 25.0s //tensorflow/python:gradients_test_cpu PASSED in 18.2s //tensorflow/python:init_ops_test_cpu PASSED in 10.2s //tensorflow/python:init_ops_v2_test_cpu PASSED in 12.8s //tensorflow/python:math_grad_test_cpu PASSED in 16.8s //tensorflow/python:math_ops_linspace_test_cpu PASSED in 7.4s //tensorflow/python:math_ops_test_cpu PASSED in 25.3s //tensorflow/python:matmul_benchmark_cpu PASSED in 12.0s //tensorflow/python:nn_grad_test_cpu PASSED in 11.1s //tensorflow/python:nn_loss_scaling_utilities_test PASSED in 13.0s //tensorflow/python:nn_test_cpu PASSED in 54.4s //tensorflow/python:nn_xent_test_cpu PASSED in 8.3s //tensorflow/python:op_selector_test PASSED in 7.1s //tensorflow/python:ops/array_ops_test PASSED in 9.7s //tensorflow/python:quantized_conv_ops_test PASSED in 8.8s //tensorflow/python:quantized_ops_test PASSED in 9.5s //tensorflow/python:raw_ops_test_cpu PASSED in 10.0s //tensorflow/python:rnn_grad_test_cpu PASSED in 8.5s //tensorflow/python:script_ops_test PASSED in 7.2s //tensorflow/python:sort_ops_test PASSED in 8.4s //tensorflow/python:sparse_ops_test PASSED in 15.9s //tensorflow/python:split_benchmark_cpu PASSED in 7.6s //tensorflow/python:tensor_array_ops_test PASSED in 6.6s //tensorflow/python:transpose_benchmark_cpu PASSED in 9.4s //tensorflow/python:variable_spec_test PASSED in 10.0s //tensorflow/python/autograph/converters:asserts_test PASSED in 7.9s //tensorflow/python/autograph/converters:break_statements_test PASSED in 7.9s //tensorflow/python/autograph/converters:call_trees_test PASSED in 8.0s //tensorflow/python/autograph/converters:conditional_expressions_test PASSED in 8.5s //tensorflow/python/autograph/converters:continue_statements_test PASSED in 11.9s //tensorflow/python/autograph/converters:control_flow_test PASSED in 16.1s //tensorflow/python/autograph/converters:directives_test PASSED in 9.5s //tensorflow/python/autograph/converters:functions_test PASSED in 10.1s //tensorflow/python/autograph/converters:list_comprehensions_test PASSED in 9.9s //tensorflow/python/autograph/converters:lists_test PASSED in 10.2s //tensorflow/python/autograph/converters:logical_expressions_test PASSED in 9.5s //tensorflow/python/autograph/converters:return_statements_test PASSED in 12.3s //tensorflow/python/autograph/converters:slices_test PASSED in 8.5s //tensorflow/python/autograph/converters:variables_test PASSED in 7.9s //tensorflow/python/autograph/core:converter_test PASSED in 6.5s //tensorflow/python/autograph/core:function_wrappers_test PASSED in 7.4s //tensorflow/python/autograph/impl:api_test PASSED in 16.4s //tensorflow/python/autograph/impl:conversion_test PASSED in 8.7s //tensorflow/python/autograph/lang:special_functions_test PASSED in 8.2s //tensorflow/python/autograph/operators:conditional_expressions_test PASSED in 11.1s //tensorflow/python/autograph/operators:control_flow_test PASSED in 35.1s //tensorflow/python/autograph/operators:data_structures_test PASSED in 10.1s //tensorflow/python/autograph/operators:exceptions_test PASSED in 8.3s //tensorflow/python/autograph/operators:logical_test PASSED in 7.8s //tensorflow/python/autograph/operators:py_builtins_test PASSED in 16.1s //tensorflow/python/autograph/operators:slices_test PASSED in 12.6s //tensorflow/python/autograph/operators:variables_test PASSED in 8.9s //tensorflow/python/autograph/pyct:anno_test PASSED in 8.7s //tensorflow/python/autograph/pyct:ast_util_test PASSED in 10.1s //tensorflow/python/autograph/pyct:cache_test PASSED in 11.1s //tensorflow/python/autograph/pyct:cfg_test PASSED in 11.4s //tensorflow/python/autograph/pyct:error_utils_test PASSED in 8.6s //tensorflow/python/autograph/pyct:inspect_utils_test PASSED in 10.0s //tensorflow/python/autograph/pyct:loader_test PASSED in 8.3s //tensorflow/python/autograph/pyct:naming_test PASSED in 7.9s //tensorflow/python/autograph/pyct:origin_info_test PASSED in 8.3s //tensorflow/python/autograph/pyct:parser_test PASSED in 10.6s //tensorflow/python/autograph/pyct:pretty_printer_test PASSED in 9.1s //tensorflow/python/autograph/pyct:qual_names_test PASSED in 9.4s //tensorflow/python/autograph/pyct:templates_test PASSED in 11.0s //tensorflow/python/autograph/pyct:transformer_test PASSED in 7.3s //tensorflow/python/autograph/pyct:transpiler_test PASSED in 13.0s //tensorflow/python/autograph/pyct/static_analysis:activity_test PASSED in 30.7s //tensorflow/python/autograph/pyct/static_analysis:liveness_test PASSED in 7.0s //tensorflow/python/autograph/pyct/static_analysis:reaching_definitions_test PASSED in 9.4s //tensorflow/python/autograph/pyct/static_analysis:reaching_fndefs_test PASSED in 10.5s //tensorflow/python/autograph/pyct/static_analysis:type_inference_test PASSED in 8.1s //tensorflow/python/autograph/tests:assertion_test PASSED in 25.1s //tensorflow/python/autograph/tests:basic_ifexp_test PASSED in 19.6s //tensorflow/python/autograph/tests:call_to_builtin_function_test PASSED in 17.9s //tensorflow/python/autograph/tests:call_to_lambda_function_test PASSED in 17.1s //tensorflow/python/autograph/tests:call_to_named_tuple_test PASSED in 17.9s //tensorflow/python/autograph/tests:call_to_numpy_function_test PASSED in 16.6s //tensorflow/python/autograph/tests:call_to_print_function_test PASSED in 16.6s //tensorflow/python/autograph/tests:call_to_tf_api_test PASSED in 19.7s //tensorflow/python/autograph/tests:call_to_user_function_test PASSED in 26.9s //tensorflow/python/autograph/tests:composite_names_in_control_flow_test PASSED in 30.1s //tensorflow/python/autograph/tests:cond_basic_test PASSED in 26.3s //tensorflow/python/autograph/tests:datasets_test PASSED in 17.7s //tensorflow/python/autograph/tests:early_return_test PASSED in 46.0s //tensorflow/python/autograph/tests:ext_slice_test PASSED in 15.6s //tensorflow/python/autograph/tests:generator_test PASSED in 24.5s //tensorflow/python/autograph/tests:logical_expression_test PASSED in 18.6s //tensorflow/python/autograph/tests:loop_basic_test PASSED in 79.1s //tensorflow/python/autograph/tests:loop_control_flow_illegal_cases_test PASSED in 24.3s //tensorflow/python/autograph/tests:loop_created_variables_test PASSED in 27.2s //tensorflow/python/autograph/tests:loop_scoping_test PASSED in 23.8s //tensorflow/python/autograph/tests:loop_with_function_call_test PASSED in 31.6s //tensorflow/python/autograph/tests:loop_with_variable_type_illegal_cases_test PASSED in 19.8s //tensorflow/python/autograph/tests:loop_with_variable_type_test PASSED in 55.5s //tensorflow/python/autograph/tests:nested_control_flow_test PASSED in 43.6s //tensorflow/python/autograph/tests:type_annotations_test PASSED in 23.6s //tensorflow/python/autograph/utils:context_managers_test PASSED in 6.8s //tensorflow/python/autograph/utils:misc_test PASSED in 8.8s //tensorflow/python/autograph/utils:tensor_list_test PASSED in 9.7s //tensorflow/python/autograph/utils:tensors_test PASSED in 9.6s //tensorflow/python/checkpoint:benchmarks_test PASSED in 9.4s //tensorflow/python/checkpoint:checkpoint_management_test_cpu PASSED in 17.7s //tensorflow/python/checkpoint:checkpoint_metrics_test PASSED in 15.5s //tensorflow/python/checkpoint:checkpoint_test PASSED in 27.3s //tensorflow/python/checkpoint:checkpoint_view_test PASSED in 10.0s //tensorflow/python/checkpoint:checkpoint_with_v1_optimizers_test PASSED in 11.3s //tensorflow/python/checkpoint:functional_saver_test_cpu PASSED in 12.6s //tensorflow/python/checkpoint:restore_test PASSED in 8.9s //tensorflow/python/checkpoint:save_util_v1_test PASSED in 11.2s //tensorflow/python/checkpoint:saveable_compat_test PASSED in 9.6s //tensorflow/python/checkpoint:tensor_callable_test PASSED in 8.1s //tensorflow/python/checkpoint:trackable_view_test PASSED in 10.1s //tensorflow/python/client:device_lib_test_cpu PASSED in 8.3s //tensorflow/python/client:events_writer_test PASSED in 8.5s //tensorflow/python/client:session_benchmark_cpu PASSED in 10.0s //tensorflow/python/client:session_list_devices_test PASSED in 9.2s //tensorflow/python/client:session_partial_run_test PASSED in 13.7s //tensorflow/python/client:timeline_test_cpu PASSED in 8.8s //tensorflow/python/client:virtual_gpu_test_cpu PASSED in 9.9s //tensorflow/python/compat:compat_test PASSED in 8.1s //tensorflow/python/compat:disable_v2_behavior_test PASSED in 24.6s //tensorflow/python/compiler/mlir:mlir_test PASSED in 8.6s //tensorflow/python/compiler/tensorrt:trt_convert_test_cpu PASSED in 18.1s //tensorflow/python/compiler/tensorrt/test:batch_matmul_test_cpu PASSED in 10.4s //tensorflow/python/compiler/tensorrt/test:biasadd_matmul_test_cpu PASSED in 6.8s //tensorflow/python/compiler/tensorrt/test:binary_tensor_weight_broadcast_test_cpu PASSED in 9.8s //tensorflow/python/compiler/tensorrt/test:bool_test_cpu PASSED in 11.7s //tensorflow/python/compiler/tensorrt/test:cast_test_cpu PASSED in 10.4s //tensorflow/python/compiler/tensorrt/test:concatenation_test_cpu PASSED in 9.4s //tensorflow/python/compiler/tensorrt/test:const_broadcast_test_cpu PASSED in 10.8s //tensorflow/python/compiler/tensorrt/test:data_dependent_shape_test_cpu PASSED in 6.5s //tensorflow/python/compiler/tensorrt/test:dynamic_input_shapes_test_cpu PASSED in 9.2s //tensorflow/python/compiler/tensorrt/test:identity_output_test_cpu PASSED in 11.4s //tensorflow/python/compiler/tensorrt/test:int32_test_cpu PASSED in 8.3s //tensorflow/python/compiler/tensorrt/test:lru_cache_test_cpu PASSED in 10.0s //tensorflow/python/compiler/tensorrt/test:memory_alignment_test_cpu PASSED in 9.3s //tensorflow/python/compiler/tensorrt/test:multi_connection_neighbor_engine_test_cpu PASSED in 8.3s //tensorflow/python/compiler/tensorrt/test:neighboring_engine_test_cpu PASSED in 29.2s //tensorflow/python/compiler/tensorrt/test:quantization_test_cpu PASSED in 6.7s //tensorflow/python/compiler/tensorrt/test:rank_two_test_cpu PASSED in 9.9s //tensorflow/python/compiler/tensorrt/test:reshape_transpose_test_cpu PASSED in 10.1s //tensorflow/python/compiler/tensorrt/test:topk_test_cpu PASSED in 9.6s //tensorflow/python/compiler/tensorrt/test:trt_engine_op_shape_test_cpu PASSED in 9.1s //tensorflow/python/compiler/tensorrt/test:trt_mode_test_cpu PASSED in 12.7s //tensorflow/python/compiler/tensorrt/test:unary_test_cpu PASSED in 9.6s //tensorflow/python/compiler/tensorrt/test:vgg_block_nchw_test_cpu PASSED in 9.0s //tensorflow/python/compiler/tensorrt/test:vgg_block_test_cpu PASSED in 13.2s //tensorflow/python/compiler/xla:jit_compile_test_cpu PASSED in 39.6s //tensorflow/python/compiler/xla:jit_test_cpu PASSED in 14.2s //tensorflow/python/compiler/xla:xla_test_cpu PASSED in 19.1s //tensorflow/python/compiler/xla/experimental:xla_sharding_test PASSED in 17.2s //tensorflow/python/data/benchmarks:batch_benchmark PASSED in 7.8s //tensorflow/python/data/benchmarks:filter_benchmark PASSED in 8.7s //tensorflow/python/data/benchmarks:from_tensor_slices_benchmark PASSED in 8.7s //tensorflow/python/data/benchmarks:interleave_benchmark PASSED in 11.2s //tensorflow/python/data/benchmarks:list_files_benchmark PASSED in 10.2s //tensorflow/python/data/benchmarks:map_benchmark PASSED in 9.7s //tensorflow/python/data/benchmarks:meta_benchmark PASSED in 8.1s //tensorflow/python/data/benchmarks:prefetch_benchmark PASSED in 10.8s //tensorflow/python/data/benchmarks:range_benchmark PASSED in 9.2s //tensorflow/python/data/experimental/benchmarks:autotune_benchmark PASSED in 9.6s //tensorflow/python/data/experimental/benchmarks:csv_dataset_benchmark PASSED in 9.2s //tensorflow/python/data/experimental/benchmarks:map_and_batch_benchmark PASSED in 11.1s //tensorflow/python/data/experimental/benchmarks:map_defun_benchmark PASSED in 11.0s //tensorflow/python/data/experimental/benchmarks:matching_files_benchmark PASSED in 7.3s //tensorflow/python/data/experimental/benchmarks:optimize_benchmark PASSED in 7.9s //tensorflow/python/data/experimental/benchmarks:parameter_value_benchmark PASSED in 10.2s //tensorflow/python/data/experimental/benchmarks:rejection_resample_benchmark PASSED in 10.0s //tensorflow/python/data/experimental/benchmarks:snapshot_dataset_benchmark PASSED in 8.2s //tensorflow/python/data/experimental/benchmarks:unbatch_benchmark PASSED in 8.5s //tensorflow/python/data/experimental/kernel_tests:assert_cardinality_test PASSED in 25.0s //tensorflow/python/data/experimental/kernel_tests:assert_next_test PASSED in 15.8s //tensorflow/python/data/experimental/kernel_tests:assert_prev_test PASSED in 13.3s //tensorflow/python/data/experimental/kernel_tests:checkpoint_input_pipeline_hook_test PASSED in 21.0s //tensorflow/python/data/experimental/kernel_tests:compression_ops_test PASSED in 12.5s //tensorflow/python/data/experimental/kernel_tests:copy_to_device_test_cpu PASSED in 16.0s //tensorflow/python/data/experimental/kernel_tests:dense_to_sparse_batch_test PASSED in 19.7s //tensorflow/python/data/experimental/kernel_tests:from_list_test PASSED in 25.8s //tensorflow/python/data/experimental/kernel_tests:io_test PASSED in 48.5s //tensorflow/python/data/experimental/kernel_tests:lookup_ops_test PASSED in 9.5s //tensorflow/python/data/experimental/kernel_tests:make_csv_dataset_test PASSED in 22.0s //tensorflow/python/data/experimental/kernel_tests:make_saveable_from_iterator_test PASSED in 9.6s //tensorflow/python/data/experimental/kernel_tests:make_tf_record_dataset_test PASSED in 63.0s //tensorflow/python/data/experimental/kernel_tests:map_defun_op_test PASSED in 8.7s //tensorflow/python/data/experimental/kernel_tests:matching_files_dataset_test PASSED in 20.6s //tensorflow/python/data/experimental/kernel_tests:model_dataset_test PASSED in 9.8s //tensorflow/python/data/experimental/kernel_tests:non_serializable_test PASSED in 10.6s //tensorflow/python/data/experimental/kernel_tests:prefetch_to_device_test_cpu PASSED in 44.4s //tensorflow/python/data/experimental/kernel_tests:prefetch_with_slack_test PASSED in 12.6s //tensorflow/python/data/experimental/kernel_tests:shuffle_and_repeat_test PASSED in 24.1s //tensorflow/python/data/experimental/kernel_tests:sleep_test PASSED in 9.9s //tensorflow/python/data/experimental/kernel_tests:tf_record_writer_test PASSED in 13.5s //tensorflow/python/data/experimental/kernel_tests:variant_test PASSED in 8.2s //tensorflow/python/data/experimental/kernel_tests:wrap_unwrap_test_cpu PASSED in 8.3s //tensorflow/python/data/experimental/kernel_tests/optimization:filter_fusion_test PASSED in 30.4s //tensorflow/python/data/experimental/kernel_tests/optimization:filter_parallelization_test PASSED in 78.3s //tensorflow/python/data/experimental/kernel_tests/optimization:grappler_test_cpu PASSED in 10.0s //tensorflow/python/data/experimental/kernel_tests/optimization:make_deterministic_test PASSED in 28.8s //tensorflow/python/data/experimental/kernel_tests/optimization:map_and_batch_fusion_test PASSED in 7.2s //tensorflow/python/data/experimental/kernel_tests/optimization:map_and_filter_fusion_test PASSED in 20.9s //tensorflow/python/data/experimental/kernel_tests/optimization:map_fusion_test PASSED in 25.3s //tensorflow/python/data/experimental/kernel_tests/optimization:map_parallelization_test PASSED in 10.5s //tensorflow/python/data/experimental/kernel_tests/optimization:noop_elimination_test PASSED in 10.5s //tensorflow/python/data/experimental/kernel_tests/service:multi_device_test PASSED in 28.0s //tensorflow/python/data/experimental/service:server_lib_test PASSED in 10.4s //tensorflow/python/data/kernel_tests:as_numpy_iterator_test PASSED in 10.3s //tensorflow/python/data/kernel_tests:bucket_by_sequence_length_test PASSED in 18.0s //tensorflow/python/data/kernel_tests:cache_test PASSED in 45.6s //tensorflow/python/data/kernel_tests:cardinality_test PASSED in 16.9s //tensorflow/python/data/kernel_tests:checkpoint_test PASSED in 18.2s //tensorflow/python/data/kernel_tests:concatenate_test PASSED in 22.3s //tensorflow/python/data/kernel_tests:counter_test PASSED in 28.9s //tensorflow/python/data/kernel_tests:dataset_spec_test PASSED in 8.9s //tensorflow/python/data/kernel_tests:dataset_test PASSED in 61.7s //tensorflow/python/data/kernel_tests:enumerate_test PASSED in 25.1s //tensorflow/python/data/kernel_tests:from_sparse_tensor_slices_test PASSED in 9.1s //tensorflow/python/data/kernel_tests:from_tensor_slices_test PASSED in 26.7s //tensorflow/python/data/kernel_tests:from_tensors_test PASSED in 21.5s //tensorflow/python/data/kernel_tests:get_single_element_test PASSED in 11.4s //tensorflow/python/data/kernel_tests:ignore_errors_test PASSED in 19.3s //tensorflow/python/data/kernel_tests:io_test PASSED in 49.3s //tensorflow/python/data/kernel_tests:iterator_test_cpu PASSED in 20.5s //tensorflow/python/data/kernel_tests:len_test PASSED in 7.7s //tensorflow/python/data/kernel_tests:list_files_test PASSED in 12.8s //tensorflow/python/data/kernel_tests:optional_test_cpu PASSED in 11.4s //tensorflow/python/data/kernel_tests:options_test PASSED in 11.3s //tensorflow/python/data/kernel_tests:placement_test_cpu PASSED in 10.5s //tensorflow/python/data/kernel_tests:prefetch_test PASSED in 41.6s //tensorflow/python/data/kernel_tests:random_test PASSED in 25.3s //tensorflow/python/data/kernel_tests:range_test PASSED in 37.1s //tensorflow/python/data/kernel_tests:rebatch_test PASSED in 7.3s //tensorflow/python/data/kernel_tests:reduce_test_cpu PASSED in 24.8s //tensorflow/python/data/kernel_tests:scan_test_cpu PASSED in 39.2s //tensorflow/python/data/kernel_tests:sparse_batch_test PASSED in 20.9s //tensorflow/python/data/kernel_tests:unbatch_test PASSED in 29.3s //tensorflow/python/data/util:convert_test PASSED in 13.1s //tensorflow/python/data/util:nest_test PASSED in 9.3s //tensorflow/python/data/util:options_test PASSED in 9.8s //tensorflow/python/data/util:random_seed_test PASSED in 9.5s //tensorflow/python/data/util:sparse_test PASSED in 9.7s //tensorflow/python/data/util:structure_test PASSED in 10.6s //tensorflow/python/data/util:traverse_test PASSED in 7.9s //tensorflow/python/debug/cli:analyzer_cli_test_cpu PASSED in 15.9s //tensorflow/python/debug/cli:cli_config_test PASSED in 7.8s //tensorflow/python/debug/cli:cli_shared_test PASSED in 7.3s //tensorflow/python/debug/cli:command_parser_test PASSED in 7.2s //tensorflow/python/debug/cli:curses_ui_test PASSED in 8.4s //tensorflow/python/debug/cli:debugger_cli_common_test PASSED in 8.2s //tensorflow/python/debug/cli:evaluator_test PASSED in 10.7s //tensorflow/python/debug/cli:profile_analyzer_cli_test PASSED in 7.7s //tensorflow/python/debug/cli:readline_ui_test PASSED in 8.8s //tensorflow/python/debug/cli:tensor_format_test PASSED in 10.4s //tensorflow/python/debug/lib:check_numerics_callback_test_cpu PASSED in 15.7s //tensorflow/python/debug/lib:common_test PASSED in 7.0s //tensorflow/python/debug/lib:debug_data_test PASSED in 8.6s //tensorflow/python/debug/lib:debug_events_monitors_test PASSED in 9.2s //tensorflow/python/debug/lib:debug_events_writer_test PASSED in 11.4s //tensorflow/python/debug/lib:debug_gradients_test_cpu PASSED in 11.2s //tensorflow/python/debug/lib:debug_graph_reconstruction_test_cpu PASSED in 8.6s //tensorflow/python/debug/lib:debug_graphs_test PASSED in 6.1s //tensorflow/python/debug/lib:debug_grappler_test_cpu PASSED in 10.1s //tensorflow/python/debug/lib:debug_utils_test PASSED in 6.2s //tensorflow/python/debug/lib:debug_v2_ops_test_cpu PASSED in 16.6s //tensorflow/python/debug/lib:profiling_test PASSED in 6.8s //tensorflow/python/debug/lib:session_debug_file_test_cpu PASSED in 13.0s //tensorflow/python/debug/lib:session_debug_multi_gpu_test_cpu PASSED in 10.4s //tensorflow/python/debug/lib:source_utils_test PASSED in 13.3s //tensorflow/python/debug/wrappers:disk_usage_test PASSED in 9.7s //tensorflow/python/debug/wrappers:dumping_wrapper_test PASSED in 10.1s //tensorflow/python/debug/wrappers:framework_test PASSED in 7.0s //tensorflow/python/debug/wrappers:local_cli_wrapper_test PASSED in 9.1s //tensorflow/python/distribute:checkpoint_utils_test_2gpu PASSED in 11.4s //tensorflow/python/distribute:checkpoint_utils_test_cpu PASSED in 14.8s //tensorflow/python/distribute:checkpointing_test_2gpu PASSED in 13.3s //tensorflow/python/distribute:checkpointing_test_cpu PASSED in 10.0s //tensorflow/python/distribute:collective_all_reduce_strategy_test_2gpu PASSED in 57.8s //tensorflow/python/distribute:collective_all_reduce_strategy_test_cpu PASSED in 56.4s //tensorflow/python/distribute:collective_all_reduce_strategy_test_xla_2gpu PASSED in 25.7s //tensorflow/python/distribute:collective_util_test PASSED in 9.6s //tensorflow/python/distribute:combinations_test_2gpu PASSED in 20.6s //tensorflow/python/distribute:combinations_test_cpu PASSED in 20.4s //tensorflow/python/distribute:cross_device_utils_test_cpu PASSED in 8.9s //tensorflow/python/distribute:custom_training_loop_gradient_test_2gpu PASSED in 11.1s //tensorflow/python/distribute:custom_training_loop_gradient_test_cpu PASSED in 14.6s //tensorflow/python/distribute:device_util_test_cpu PASSED in 10.1s //tensorflow/python/distribute:distribute_coordinator_test PASSED in 16.6s //tensorflow/python/distribute:distribute_lib_test PASSED in 14.4s //tensorflow/python/distribute:distribute_utils_test_2gpu PASSED in 11.4s //tensorflow/python/distribute:distribute_utils_test_cpu PASSED in 10.6s //tensorflow/python/distribute:input_ops_test_cpu PASSED in 17.1s //tensorflow/python/distribute:metrics_v1_test_2gpu PASSED in 29.3s //tensorflow/python/distribute:metrics_v1_test_cpu PASSED in 26.4s //tensorflow/python/distribute:mirrored_values_test_2gpu PASSED in 12.7s //tensorflow/python/distribute:mirrored_values_test_cpu PASSED in 8.7s //tensorflow/python/distribute:mirrored_variable_test_2gpu PASSED in 21.7s //tensorflow/python/distribute:mirrored_variable_test_cpu PASSED in 20.6s //tensorflow/python/distribute:multi_process_runner_no_init_test PASSED in 11.6s //tensorflow/python/distribute:multi_worker_continuous_run_test_cpu PASSED in 20.0s //tensorflow/python/distribute:multi_worker_util_test PASSED in 11.1s //tensorflow/python/distribute:numpy_dataset_test PASSED in 7.0s //tensorflow/python/distribute:one_device_strategy_test_cpu PASSED in 18.2s //tensorflow/python/distribute:packed_distributed_variable_test PASSED in 8.6s //tensorflow/python/distribute:parameter_server_strategy_test_2gpu PASSED in 24.6s //tensorflow/python/distribute:parameter_server_strategy_test_cpu PASSED in 34.8s //tensorflow/python/distribute:parameter_server_strategy_v2_test_2gpu PASSED in 22.8s //tensorflow/python/distribute:parameter_server_strategy_v2_test_cpu PASSED in 23.8s //tensorflow/python/distribute:per_replica_test_2gpu PASSED in 9.9s //tensorflow/python/distribute:per_replica_test_cpu PASSED in 14.5s //tensorflow/python/distribute:ps_values_test_2gpu PASSED in 8.2s //tensorflow/python/distribute:ps_values_test_cpu PASSED in 10.3s //tensorflow/python/distribute:remote_mirrored_strategy_eager_test_cpu PASSED in 9.2s //tensorflow/python/distribute:sharded_variable_test PASSED in 41.4s //tensorflow/python/distribute:shared_variable_creator_test PASSED in 6.7s //tensorflow/python/distribute:strategy_combinations_test_cpu PASSED in 44.7s //tensorflow/python/distribute:template_mirrored_strategy_test_cpu PASSED in 7.9s //tensorflow/python/distribute:test_util_test_2gpu PASSED in 17.0s //tensorflow/python/distribute:test_util_test_cpu PASSED in 15.7s //tensorflow/python/distribute:tf_function_test_2gpu PASSED in 13.0s //tensorflow/python/distribute:tf_function_test_cpu PASSED in 12.3s //tensorflow/python/distribute:values_v2_test_cpu PASSED in 14.5s //tensorflow/python/distribute:warm_starting_util_test_2gpu PASSED in 10.6s //tensorflow/python/distribute:warm_starting_util_test_cpu PASSED in 14.1s //tensorflow/python/distribute/cluster_resolver:base_cluster_resolver_py_test PASSED in 10.0s //tensorflow/python/distribute/cluster_resolver:gce_cluster_resolver_py_test PASSED in 7.8s //tensorflow/python/distribute/cluster_resolver:kubernetes_cluster_resolver_py_test PASSED in 10.5s //tensorflow/python/distribute/cluster_resolver:sagemaker_cluster_resolver_py_test PASSED in 9.8s //tensorflow/python/distribute/cluster_resolver:slurm_cluster_resolver_py_test PASSED in 8.7s //tensorflow/python/distribute/cluster_resolver:tfconfig_cluster_resolver_py_test PASSED in 8.7s //tensorflow/python/distribute/cluster_resolver/tpu:tpu_cluster_resolver_py_test PASSED in 9.4s //tensorflow/python/distribute/coordinator:metric_utils_test PASSED in 10.1s //tensorflow/python/distribute/coordinator:watchdog_test PASSED in 61.2s //tensorflow/python/distribute/experimental:dtensor_util_test_cpu PASSED in 12.1s //tensorflow/python/distribute/experimental:mirrored_strategy_test_cpu PASSED in 50.6s //tensorflow/python/distribute/integration_test:saved_model_test_cpu PASSED in 40.1s //tensorflow/python/distribute/parallel_device:parallel_device_test_cpu PASSED in 17.9s //tensorflow/python/distribute/v1:all_reduce_test PASSED in 43.3s //tensorflow/python/distribute/v1:cross_device_ops_test_2gpu PASSED in 48.7s //tensorflow/python/distribute/v1:cross_device_ops_test_cpu PASSED in 56.1s //tensorflow/python/dlpack:dlpack_test_cpu PASSED in 10.0s //tensorflow/python/eager:backprop_test_cpu PASSED in 114.3s //tensorflow/python/eager:benchmarks_test_cpu PASSED in 12.5s //tensorflow/python/eager:cancellation_test_cpu PASSED in 8.8s //tensorflow/python/eager:context_test_cpu PASSED in 12.0s //tensorflow/python/eager:core_test_cpu PASSED in 17.3s //tensorflow/python/eager:gradient_input_output_exclusions_test PASSED in 32.5s //tensorflow/python/eager:graph_only_ops_test_cpu PASSED in 8.2s //tensorflow/python/eager:lift_to_graph_test PASSED in 9.5s //tensorflow/python/eager:monitoring_test_cpu PASSED in 9.5s //tensorflow/python/eager:ops_test_cpu PASSED in 14.0s //tensorflow/python/eager:profiler_client_test PASSED in 6.7s //tensorflow/python/eager:profiler_test_cpu PASSED in 7.1s //tensorflow/python/eager:pywrap_tfe_test PASSED in 20.0s //tensorflow/python/eager:record_test PASSED in 9.6s //tensorflow/python/eager:remote_benchmarks_test_cpu PASSED in 12.6s //tensorflow/python/eager:run_eager_op_as_function_test_cpu PASSED in 10.8s //tensorflow/python/eager:run_eager_op_as_function_xla_test_cpu PASSED in 8.2s //tensorflow/python/eager:small_constants_optimizer_test_cpu PASSED in 152.4s //tensorflow/python/eager:tensor_test_cpu PASSED in 13.3s //tensorflow/python/eager:wrap_function_device_test_cpu PASSED in 10.0s //tensorflow/python/eager:wrap_function_test PASSED in 11.8s //tensorflow/python/eager/benchmarks:kpi_benchmark_test_cpu PASSED in 19.8s //tensorflow/python/eager/memory_tests:remote_memory_test_cpu PASSED in 9.0s //tensorflow/python/eager/polymorphic_function:argument_naming_test_cpu PASSED in 9.6s //tensorflow/python/eager/polymorphic_function:collection_test_cpu PASSED in 9.8s //tensorflow/python/eager/polymorphic_function:compiler_ir_test_cpu PASSED in 8.1s //tensorflow/python/eager/polymorphic_function:compiler_ir_test_cpu_mlir_bridge_test PASSED in 9.7s //tensorflow/python/eager/polymorphic_function:function_spec_test PASSED in 8.9s //tensorflow/python/eager/polymorphic_function:polymorphic_function_xla_jit_test_cpu PASSED in 25.3s //tensorflow/python/eager/polymorphic_function:polymorphic_function_xla_jit_test_cpu_mlir_bridge_test PASSED in 24.8s //tensorflow/python/eager/polymorphic_function:polymorphic_function_xla_test_cpu PASSED in 10.3s //tensorflow/python/eager/polymorphic_function:quarantine_test PASSED in 25.9s //tensorflow/python/feature_column:sequence_feature_column_integration_test PASSED in 13.8s //tensorflow/python/feature_column:serialization_test PASSED in 14.3s //tensorflow/python/framework:auto_control_deps_test PASSED in 27.2s //tensorflow/python/framework:c_api_util_test PASSED in 10.3s //tensorflow/python/framework:common_shapes_test PASSED in 8.3s //tensorflow/python/framework:composite_tensor_test PASSED in 8.6s //tensorflow/python/framework:config_test_2gpu PASSED in 15.1s //tensorflow/python/framework:config_test_cpu PASSED in 17.6s //tensorflow/python/framework:constant_op_test PASSED in 11.2s //tensorflow/python/framework:device_spec_test PASSED in 8.8s //tensorflow/python/framework:device_test PASSED in 8.4s //tensorflow/python/framework:dtypes_test PASSED in 19.6s //tensorflow/python/framework:error_interpolation_test PASSED in 10.7s //tensorflow/python/framework:errors_test PASSED in 9.1s //tensorflow/python/framework:extension_type_field_test PASSED in 8.7s //tensorflow/python/framework:extension_type_test PASSED in 17.3s //tensorflow/python/framework:file_system_test PASSED in 11.3s //tensorflow/python/framework:function_def_to_graph_test PASSED in 8.2s //tensorflow/python/framework:graph_building_benchmark_cpu PASSED in 9.1s //tensorflow/python/framework:graph_util_test PASSED in 7.5s //tensorflow/python/framework:immutable_dict_test PASSED in 8.2s //tensorflow/python/framework:importer_test PASSED in 9.7s //tensorflow/python/framework:indexed_slices_test PASSED in 9.3s //tensorflow/python/framework:kernels_test PASSED in 10.8s //tensorflow/python/framework:meta_graph_test PASSED in 13.0s //tensorflow/python/framework:node_file_writer_test_cpu PASSED in 11.1s //tensorflow/python/framework:offset_counter_helper_test PASSED in 0.1s //tensorflow/python/framework:op_allowlist_namespace_test PASSED in 2.5s //tensorflow/python/framework:op_callbacks_test_cpu PASSED in 12.7s //tensorflow/python/framework:op_def_library_test PASSED in 10.8s //tensorflow/python/framework:op_def_util_test PASSED in 8.5s //tensorflow/python/framework:ops_enable_eager_test PASSED in 1.9s //tensorflow/python/framework:ops_test PASSED in 22.2s //tensorflow/python/framework:proto_test PASSED in 9.5s //tensorflow/python/framework:py_context_manager_test PASSED in 7.8s //tensorflow/python/framework:python_api_dispatcher_test PASSED in 8.6s //tensorflow/python/framework:python_api_info_test PASSED in 10.5s //tensorflow/python/framework:python_api_parameter_converter_test PASSED in 9.1s //tensorflow/python/framework:python_op_gen_annotation_test PASSED in 3.6s //tensorflow/python/framework:python_op_gen_annotator_test PASSED in 0.1s //tensorflow/python/framework:python_tensor_converter_test PASSED in 8.3s //tensorflow/python/framework:random_seed_test PASSED in 8.4s //tensorflow/python/framework:registry_test PASSED in 10.9s //tensorflow/python/framework:smart_cond_test PASSED in 11.1s //tensorflow/python/framework:sparse_tensor_test PASSED in 7.4s //tensorflow/python/framework:subscribe_test PASSED in 9.7s //tensorflow/python/framework:tensor_shape_test PASSED in 10.1s //tensorflow/python/framework:tensor_test PASSED in 42.7s //tensorflow/python/framework:tensor_util_test PASSED in 8.9s //tensorflow/python/framework:test_combinations_test PASSED in 7.1s //tensorflow/python/framework:test_util_test_cpu PASSED in 17.3s //tensorflow/python/framework:tf2_test PASSED in 10.4s //tensorflow/python/framework:traceable_stack_test PASSED in 8.1s //tensorflow/python/framework:type_spec_test PASSED in 10.4s //tensorflow/python/framework:versions_test PASSED in 13.4s //tensorflow/python/framework/experimental:graph_building_test_cpu PASSED in 16.5s //tensorflow/python/framework/experimental:unified_api_test_cpu PASSED in 13.3s //tensorflow/python/grappler:arithmetic_optimizer_test_cpu PASSED in 10.3s //tensorflow/python/grappler:auto_mixed_precision_test_cpu PASSED in 11.7s //tensorflow/python/grappler:constant_folding_test_cpu PASSED in 9.3s //tensorflow/python/grappler:cost_analyzer_test PASSED in 9.8s //tensorflow/python/grappler:datasets_test PASSED in 9.2s //tensorflow/python/grappler:item_test PASSED in 6.5s //tensorflow/python/grappler:memory_optimizer_test PASSED in 15.0s //tensorflow/python/grappler:model_analyzer_test PASSED in 6.4s //tensorflow/python/grappler:remapper_test_cpu PASSED in 8.6s //tensorflow/python/grappler:tf_optimizer_test PASSED in 6.6s //tensorflow/python/kernel_tests:benchmark_test_cpu PASSED in 12.8s //tensorflow/python/kernel_tests:check_ops_test_cpu PASSED in 15.0s //tensorflow/python/kernel_tests:collective_ops_multi_worker_test PASSED in 26.6s //tensorflow/python/kernel_tests:composite_tensor_ops_test PASSED in 8.2s //tensorflow/python/kernel_tests:critical_section_test_cpu PASSED in 17.5s //tensorflow/python/kernel_tests:garbage_collection_test PASSED in 10.9s //tensorflow/python/kernel_tests:gradient_correctness_test_cpu PASSED in 12.6s //tensorflow/python/kernel_tests:histogram_ops_test_cpu PASSED in 10.2s //tensorflow/python/kernel_tests:logging_ops_test_cpu PASSED in 10.3s //tensorflow/python/kernel_tests:numerics_test_cpu PASSED in 8.4s //tensorflow/python/kernel_tests:template_test PASSED in 9.3s //tensorflow/python/kernel_tests:trace_op_test_cpu PASSED in 8.1s //tensorflow/python/kernel_tests/array_ops:batch_gather_op_test_cpu PASSED in 12.5s //tensorflow/python/kernel_tests/array_ops:batch_scatter_ops_test PASSED in 9.9s //tensorflow/python/kernel_tests/array_ops:batchtospace_op_test_cpu PASSED in 15.8s //tensorflow/python/kernel_tests/array_ops:bcast_ops_test PASSED in 9.7s //tensorflow/python/kernel_tests/array_ops:bitcast_op_test_cpu PASSED in 8.3s //tensorflow/python/kernel_tests/array_ops:broadcast_to_ops_test_cpu PASSED in 29.9s //tensorflow/python/kernel_tests/array_ops:cast_op_test_cpu PASSED in 9.4s //tensorflow/python/kernel_tests/array_ops:constant_op_eager_test_cpu PASSED in 8.9s //tensorflow/python/kernel_tests/array_ops:constant_op_test_cpu PASSED in 10.4s //tensorflow/python/kernel_tests/array_ops:denormal_test_cpu PASSED in 7.7s //tensorflow/python/kernel_tests/array_ops:depthtospace_op_test_cpu PASSED in 12.3s //tensorflow/python/kernel_tests/array_ops:edit_distance_op_test PASSED in 12.0s //tensorflow/python/kernel_tests/array_ops:fingerprint_op_test PASSED in 8.3s //tensorflow/python/kernel_tests/array_ops:gather_nd_op_test_cpu PASSED in 9.6s //tensorflow/python/kernel_tests/array_ops:identity_n_op_py_test PASSED in 8.5s //tensorflow/python/kernel_tests/array_ops:identity_op_py_test PASSED in 7.7s //tensorflow/python/kernel_tests/array_ops:large_concat_op_test_cpu PASSED in 9.2s //tensorflow/python/kernel_tests/array_ops:manip_ops_test_cpu PASSED in 9.7s //tensorflow/python/kernel_tests/array_ops:one_hot_op_test_cpu PASSED in 9.8s //tensorflow/python/kernel_tests/array_ops:pad_op_test_cpu PASSED in 19.2s //tensorflow/python/kernel_tests/array_ops:reshape_op_test_cpu PASSED in 11.2s //tensorflow/python/kernel_tests/array_ops:reverse_sequence_op_test_cpu PASSED in 10.8s //tensorflow/python/kernel_tests/array_ops:scalar_test_cpu PASSED in 9.4s //tensorflow/python/kernel_tests/array_ops:shape_ops_test_cpu PASSED in 16.3s //tensorflow/python/kernel_tests/array_ops:slice_op_test_cpu PASSED in 8.8s //tensorflow/python/kernel_tests/array_ops:spacetobatch_op_test_cpu PASSED in 15.4s //tensorflow/python/kernel_tests/array_ops:spacetodepth_op_test_cpu PASSED in 10.6s //tensorflow/python/kernel_tests/array_ops:stack_op_test_cpu PASSED in 13.5s //tensorflow/python/kernel_tests/array_ops:unique_op_test_cpu PASSED in 9.1s //tensorflow/python/kernel_tests/array_ops:unstack_op_test_cpu PASSED in 10.9s //tensorflow/python/kernel_tests/array_ops:where_op_test_cpu PASSED in 17.0s //tensorflow/python/kernel_tests/control_flow:cond_v2_test_cpu PASSED in 65.0s //tensorflow/python/kernel_tests/control_flow:control_flow_util_test PASSED in 9.7s //tensorflow/python/kernel_tests/control_flow:control_flow_util_v2_test PASSED in 8.4s //tensorflow/python/kernel_tests/control_flow:py_func_test_cpu PASSED in 15.5s //tensorflow/python/kernel_tests/control_flow:scan_ops_test_cpu PASSED in 68.7s //tensorflow/python/kernel_tests/control_flow:while_v2_test_cpu PASSED in 56.9s //tensorflow/python/kernel_tests/custom_ops:ackermann_test PASSED in 10.8s //tensorflow/python/kernel_tests/custom_ops:duplicate_op_test PASSED in 11.5s //tensorflow/python/kernel_tests/custom_ops:invalid_op_test PASSED in 7.7s //tensorflow/python/kernel_tests/data_structures:conditional_accumulator_test PASSED in 10.5s //tensorflow/python/kernel_tests/data_structures:dynamic_partition_op_test_2gpu PASSED in 14.0s //tensorflow/python/kernel_tests/data_structures:dynamic_partition_op_test_cpu PASSED in 14.5s //tensorflow/python/kernel_tests/data_structures:dynamic_stitch_op_test_cpu PASSED in 10.4s //tensorflow/python/kernel_tests/data_structures:fifo_queue_test PASSED in 11.5s //tensorflow/python/kernel_tests/data_structures:list_ops_test_cpu PASSED in 22.7s //tensorflow/python/kernel_tests/data_structures:listdiff_op_test PASSED in 12.3s //tensorflow/python/kernel_tests/data_structures:lookup_ops_test PASSED in 28.8s //tensorflow/python/kernel_tests/data_structures:map_ops_test PASSED in 15.6s //tensorflow/python/kernel_tests/data_structures:padding_fifo_queue_test_cpu PASSED in 10.2s //tensorflow/python/kernel_tests/data_structures:priority_queue_test PASSED in 9.2s //tensorflow/python/kernel_tests/data_structures:stack_ops_test_cpu PASSED in 8.3s //tensorflow/python/kernel_tests/data_structures:stage_op_test_cpu PASSED in 11.7s //tensorflow/python/kernel_tests/distributions:bernoulli_test_cpu PASSED in 15.6s //tensorflow/python/kernel_tests/distributions:bijector_test_cpu PASSED in 10.1s //tensorflow/python/kernel_tests/distributions:categorical_test_cpu PASSED in 11.1s //tensorflow/python/kernel_tests/distributions:dirichlet_multinomial_test_cpu PASSED in 14.5s //tensorflow/python/kernel_tests/distributions:dirichlet_test_cpu PASSED in 13.4s //tensorflow/python/kernel_tests/distributions:exponential_test_cpu PASSED in 10.7s //tensorflow/python/kernel_tests/distributions:gamma_test_cpu PASSED in 44.0s //tensorflow/python/kernel_tests/distributions:identity_bijector_test_cpu PASSED in 16.2s //tensorflow/python/kernel_tests/distributions:kullback_leibler_test_cpu PASSED in 8.1s //tensorflow/python/kernel_tests/distributions:laplace_test_cpu PASSED in 42.5s //tensorflow/python/kernel_tests/distributions:multinomial_test_cpu PASSED in 11.4s //tensorflow/python/kernel_tests/distributions:normal_test_cpu PASSED in 22.2s //tensorflow/python/kernel_tests/distributions:special_math_test_cpu PASSED in 18.9s //tensorflow/python/kernel_tests/distributions:uniform_test_cpu PASSED in 10.1s //tensorflow/python/kernel_tests/image_ops:attention_ops_test PASSED in 30.0s //tensorflow/python/kernel_tests/image_ops:decode_bmp_op_test PASSED in 12.7s //tensorflow/python/kernel_tests/image_ops:decode_compressed_op_test PASSED in 9.2s //tensorflow/python/kernel_tests/image_ops:decode_image_op_test PASSED in 10.0s //tensorflow/python/kernel_tests/image_ops:decode_jpeg_op_test PASSED in 7.3s //tensorflow/python/kernel_tests/image_ops:decode_png_op_test PASSED in 9.3s //tensorflow/python/kernel_tests/image_ops:decode_raw_op_test PASSED in 10.0s //tensorflow/python/kernel_tests/image_ops:draw_bounding_box_op_test_cpu PASSED in 10.9s //tensorflow/python/kernel_tests/image_ops:extract_image_patches_op_test_cpu PASSED in 7.4s //tensorflow/python/kernel_tests/image_ops:extract_volume_patches_op_test_cpu PASSED in 9.9s //tensorflow/python/kernel_tests/io_ops:checkpoint_ops_test PASSED in 13.4s //tensorflow/python/kernel_tests/io_ops:decode_csv_op_test PASSED in 9.9s //tensorflow/python/kernel_tests/io_ops:io_ops_test PASSED in 9.4s //tensorflow/python/kernel_tests/io_ops:parse_single_example_op_test PASSED in 11.9s //tensorflow/python/kernel_tests/io_ops:parsing_ops_test PASSED in 24.0s //tensorflow/python/kernel_tests/io_ops:reader_ops_test PASSED in 12.2s //tensorflow/python/kernel_tests/io_ops:record_input_test PASSED in 22.1s //tensorflow/python/kernel_tests/io_ops:save_restore_ops_test PASSED in 11.0s //tensorflow/python/kernel_tests/linalg:determinant_op_test_cpu PASSED in 11.2s //tensorflow/python/kernel_tests/linalg:linear_operator_addition_test_cpu PASSED in 11.0s //tensorflow/python/kernel_tests/linalg:linear_operator_algebra_test_cpu PASSED in 6.6s //tensorflow/python/kernel_tests/linalg:linear_operator_test_cpu PASSED in 11.0s //tensorflow/python/kernel_tests/linalg:lu_op_test_cpu PASSED in 11.0s //tensorflow/python/kernel_tests/linalg:matrix_inverse_op_test_cpu PASSED in 9.6s //tensorflow/python/kernel_tests/linalg:matrix_logarithm_op_test PASSED in 58.8s //tensorflow/python/kernel_tests/linalg:matrix_solve_ls_op_test_cpu PASSED in 45.0s //tensorflow/python/kernel_tests/linalg:matrix_solve_op_test_cpu PASSED in 41.6s //tensorflow/python/kernel_tests/linalg:matrix_square_root_op_test_cpu PASSED in 8.4s //tensorflow/python/kernel_tests/linalg:slicing_test_cpu PASSED in 13.9s //tensorflow/python/kernel_tests/linalg/sparse:conjugate_gradient_test_cpu PASSED in 13.9s //tensorflow/python/kernel_tests/linalg/sparse:csr_sparse_matrix_test_cpu PASSED in 8.7s //tensorflow/python/kernel_tests/math_ops:aggregate_ops_test_cpu PASSED in 15.0s //tensorflow/python/kernel_tests/math_ops:argmax_op_test_cpu PASSED in 15.7s //tensorflow/python/kernel_tests/math_ops:banded_triangular_solve_op_test_cpu PASSED in 12.1s //tensorflow/python/kernel_tests/math_ops:basic_gpu_test_cpu PASSED in 10.3s //tensorflow/python/kernel_tests/math_ops:bincount_op_test_cpu PASSED in 10.0s //tensorflow/python/kernel_tests/math_ops:bucketize_op_test_cpu PASSED in 11.5s //tensorflow/python/kernel_tests/math_ops:clip_ops_test PASSED in 9.7s //tensorflow/python/kernel_tests/math_ops:confusion_matrix_test PASSED in 14.1s //tensorflow/python/kernel_tests/math_ops:cross_grad_test_cpu PASSED in 8.2s //tensorflow/python/kernel_tests/math_ops:cumulative_logsumexp_test_cpu PASSED in 10.1s //tensorflow/python/kernel_tests/math_ops:in_topk_op_test_cpu PASSED in 10.0s //tensorflow/python/kernel_tests/math_ops:reduce_benchmark_test_cpu PASSED in 8.3s //tensorflow/python/kernel_tests/math_ops:segment_reduction_ops_d9m_test_cpu PASSED in 9.7s //tensorflow/python/kernel_tests/math_ops:sets_test PASSED in 23.9s //tensorflow/python/kernel_tests/math_ops:topk_op_test_cpu PASSED in 9.0s //tensorflow/python/kernel_tests/math_ops:zero_division_test_cpu PASSED in 10.4s //tensorflow/python/kernel_tests/nn_ops:betainc_op_test_cpu PASSED in 10.5s //tensorflow/python/kernel_tests/nn_ops:bias_op_test_cpu PASSED in 163.7s //tensorflow/python/kernel_tests/nn_ops:conv1d_test_cpu PASSED in 11.1s //tensorflow/python/kernel_tests/nn_ops:conv1d_transpose_test_cpu PASSED in 10.5s //tensorflow/python/kernel_tests/nn_ops:conv2d_transpose_test_cpu PASSED in 9.5s //tensorflow/python/kernel_tests/nn_ops:conv3d_backprop_filter_v2_grad_test_cpu PASSED in 13.4s //tensorflow/python/kernel_tests/nn_ops:conv3d_transpose_test_cpu PASSED in 10.3s //tensorflow/python/kernel_tests/nn_ops:ctc_decoder_ops_test PASSED in 10.0s //tensorflow/python/kernel_tests/nn_ops:ctc_loss_op_test_cpu PASSED in 95.7s //tensorflow/python/kernel_tests/nn_ops:cudnn_d9m_test_cpu PASSED in 8.1s //tensorflow/python/kernel_tests/nn_ops:cudnn_deterministic_ops_test_cpu PASSED in 7.1s //tensorflow/python/kernel_tests/nn_ops:losses_test PASSED in 31.0s //tensorflow/python/kernel_tests/nn_ops:lrn_op_test_cpu PASSED in 9.6s //tensorflow/python/kernel_tests/nn_ops:morphological_ops_test_cpu PASSED in 14.1s //tensorflow/python/kernel_tests/nn_ops:nth_element_op_test_cpu PASSED in 8.8s //tensorflow/python/kernel_tests/nn_ops:pool_test_cpu PASSED in 27.1s //tensorflow/python/kernel_tests/nn_ops:pooling_ops_3d_test_cpu PASSED in 19.5s //tensorflow/python/kernel_tests/nn_ops:relu_op_test_cpu PASSED in 9.8s //tensorflow/python/kernel_tests/nn_ops:softmax_op_test_cpu PASSED in 10.0s //tensorflow/python/kernel_tests/nn_ops:softplus_op_test_cpu PASSED in 8.0s //tensorflow/python/kernel_tests/nn_ops:softsign_op_test_cpu PASSED in 11.7s //tensorflow/python/kernel_tests/nn_ops:xent_op_d9m_test_cpu PASSED in 149.0s //tensorflow/python/kernel_tests/nn_ops:xent_op_test_cpu PASSED in 8.2s //tensorflow/python/kernel_tests/proto:descriptor_source_test PASSED in 6.8s //tensorflow/python/kernel_tests/proto:encode_proto_op_test PASSED in 17.0s //tensorflow/python/kernel_tests/quantization_ops:quantization_ops_test PASSED in 8.3s //tensorflow/python/kernel_tests/random:candidate_sampler_ops_test PASSED in 11.5s //tensorflow/python/kernel_tests/random:multinomial_op_test_cpu PASSED in 9.8s //tensorflow/python/kernel_tests/random:parameterized_truncated_normal_op_test_cpu PASSED in 16.0s //tensorflow/python/kernel_tests/random:random_crop_test_cpu PASSED in 9.7s //tensorflow/python/kernel_tests/random:random_grad_test_cpu PASSED in 12.9s //tensorflow/python/kernel_tests/random:random_ops_test_cpu PASSED in 14.1s //tensorflow/python/kernel_tests/random:random_poisson_test_cpu PASSED in 13.0s //tensorflow/python/kernel_tests/random:random_shuffle_queue_test PASSED in 13.1s //tensorflow/python/kernel_tests/random:stateful_random_ops_test_cpu PASSED in 19.4s //tensorflow/python/kernel_tests/signal:mel_ops_test_cpu PASSED in 13.8s //tensorflow/python/kernel_tests/signal:mfcc_ops_test_cpu PASSED in 9.2s //tensorflow/python/kernel_tests/signal:reconstruction_ops_test_cpu PASSED in 13.0s //tensorflow/python/kernel_tests/signal:shape_ops_test_cpu PASSED in 17.8s //tensorflow/python/kernel_tests/sparse_ops:sparse_add_op_test PASSED in 9.2s //tensorflow/python/kernel_tests/sparse_ops:sparse_concat_op_test PASSED in 7.2s //tensorflow/python/kernel_tests/sparse_ops:sparse_conditional_accumulator_test PASSED in 9.7s //tensorflow/python/kernel_tests/sparse_ops:sparse_cross_op_test PASSED in 14.7s //tensorflow/python/kernel_tests/sparse_ops:sparse_matmul_op_test_cpu PASSED in 32.6s //tensorflow/python/kernel_tests/sparse_ops:sparse_reorder_op_test PASSED in 10.2s //tensorflow/python/kernel_tests/sparse_ops:sparse_reshape_op_test PASSED in 10.4s //tensorflow/python/kernel_tests/sparse_ops:sparse_serialization_ops_test PASSED in 10.8s //tensorflow/python/kernel_tests/sparse_ops:sparse_slice_op_test PASSED in 9.0s //tensorflow/python/kernel_tests/sparse_ops:sparse_split_op_test_cpu PASSED in 8.5s //tensorflow/python/kernel_tests/sparse_ops:sparse_tensor_dense_matmul_grad_test_cpu PASSED in 17.9s //tensorflow/python/kernel_tests/sparse_ops:sparse_tensor_dense_matmul_op_d9m_test_cpu PASSED in 38.8s //tensorflow/python/kernel_tests/sparse_ops:sparse_tensor_dense_matmul_op_test_cpu PASSED in 25.9s //tensorflow/python/kernel_tests/sparse_ops:sparse_tensors_map_ops_test PASSED in 12.1s //tensorflow/python/kernel_tests/sparse_ops:sparse_to_dense_op_py_test_cpu PASSED in 9.7s //tensorflow/python/kernel_tests/sparse_ops:sparse_xent_op_d9m_test_cpu PASSED in 83.4s //tensorflow/python/kernel_tests/sparse_ops:sparse_xent_op_test_cpu PASSED in 15.6s //tensorflow/python/kernel_tests/sparse_ops:sparsemask_op_test PASSED in 21.1s //tensorflow/python/kernel_tests/strings_ops:as_string_op_test PASSED in 10.7s //tensorflow/python/kernel_tests/strings_ops:base64_ops_test PASSED in 15.4s //tensorflow/python/kernel_tests/strings_ops:reduce_join_op_test_cpu PASSED in 8.9s //tensorflow/python/kernel_tests/strings_ops:regex_full_match_op_test PASSED in 9.8s //tensorflow/python/kernel_tests/strings_ops:regex_replace_op_test PASSED in 10.1s //tensorflow/python/kernel_tests/strings_ops:string_bytes_split_op_test PASSED in 10.5s //tensorflow/python/kernel_tests/strings_ops:string_format_op_test PASSED in 11.1s //tensorflow/python/kernel_tests/strings_ops:string_join_op_test PASSED in 9.3s //tensorflow/python/kernel_tests/strings_ops:string_length_op_test PASSED in 7.4s //tensorflow/python/kernel_tests/strings_ops:string_lower_op_test PASSED in 9.9s //tensorflow/python/kernel_tests/strings_ops:string_split_op_test PASSED in 13.8s //tensorflow/python/kernel_tests/strings_ops:string_strip_op_test PASSED in 7.7s //tensorflow/python/kernel_tests/strings_ops:string_to_hash_bucket_op_test_cpu PASSED in 9.5s //tensorflow/python/kernel_tests/strings_ops:string_to_number_op_test_cpu PASSED in 9.3s //tensorflow/python/kernel_tests/strings_ops:string_upper_op_test PASSED in 9.1s //tensorflow/python/kernel_tests/strings_ops:substr_op_test PASSED in 9.8s //tensorflow/python/kernel_tests/strings_ops:unicode_decode_op_test PASSED in 17.2s //tensorflow/python/kernel_tests/strings_ops:unicode_encode_op_test PASSED in 7.7s //tensorflow/python/kernel_tests/strings_ops:unicode_script_op_test PASSED in 38.2s //tensorflow/python/kernel_tests/strings_ops:unicode_transcode_op_test PASSED in 10.3s //tensorflow/python/kernel_tests/strings_ops:unsorted_segment_join_op_test_cpu PASSED in 10.9s //tensorflow/python/kernel_tests/summary_ops:summary_ops_test_cpu PASSED in 19.4s //tensorflow/python/kernel_tests/summary_ops:summary_v1_audio_op_test_cpu PASSED in 9.7s //tensorflow/python/kernel_tests/summary_ops:summary_v1_image_op_test_cpu PASSED in 10.5s //tensorflow/python/kernel_tests/summary_ops:summary_v1_ops_test PASSED in 9.7s //tensorflow/python/kernel_tests/summary_ops:summary_v1_tensor_op_test PASSED in 8.0s //tensorflow/python/kernel_tests/v1_compat_tests:array_ops_test_cpu PASSED in 11.0s //tensorflow/python/kernel_tests/v1_compat_tests:dense_update_ops_test_cpu PASSED in 8.2s //tensorflow/python/kernel_tests/v1_compat_tests:identity_op_py_test PASSED in 8.0s //tensorflow/python/kernel_tests/v1_compat_tests:scatter_nd_ops_test_cpu PASSED in 8.7s //tensorflow/python/kernel_tests/v1_compat_tests:session_ops_test_cpu PASSED in 11.8s //tensorflow/python/kernel_tests/v1_compat_tests:stack_op_test_cpu PASSED in 10.8s //tensorflow/python/kernel_tests/variables:dense_update_ops_no_tsan_test_cpu PASSED in 9.4s //tensorflow/python/kernel_tests/variables:dense_update_ops_test_cpu PASSED in 8.6s //tensorflow/python/kernel_tests/variables:partitioned_variables_test PASSED in 10.8s //tensorflow/python/kernel_tests/variables:resource_variable_ops_test_cpu PASSED in 49.6s //tensorflow/python/kernel_tests/variables:variable_ops_test_cpu PASSED in 8.8s //tensorflow/python/kernel_tests/variables:variable_scope_test PASSED in 35.4s //tensorflow/python/kernel_tests/variables:variables_test PASSED in 10.4s //tensorflow/python/lib/core:custom_float_test PASSED in 9.9s //tensorflow/python/lib/io:file_io_test PASSED in 14.3s //tensorflow/python/lib/io:tf_record_test PASSED in 11.0s //tensorflow/python/module:module_test PASSED in 10.3s //tensorflow/python/ops/losses:util_test PASSED in 8.7s //tensorflow/python/ops/memory_tests:custom_gradient_memory_test_cpu PASSED in 12.9s //tensorflow/python/ops/numpy_ops:np_array_ops_test_cpu PASSED in 73.0s //tensorflow/python/ops/numpy_ops:np_arrays_test_cpu PASSED in 16.9s //tensorflow/python/ops/numpy_ops:np_dtypes_test_cpu PASSED in 8.3s //tensorflow/python/ops/numpy_ops:np_interop_test_cpu PASSED in 41.8s //tensorflow/python/ops/numpy_ops:np_logic_test_cpu PASSED in 11.8s //tensorflow/python/ops/numpy_ops:np_math_ops_test_cpu PASSED in 25.4s //tensorflow/python/ops/numpy_ops:np_random_test_cpu PASSED in 61.6s //tensorflow/python/ops/numpy_ops:np_utils_test_cpu PASSED in 16.3s //tensorflow/python/ops/numpy_ops/integration_test:np_config_test_cpu PASSED in 16.8s //tensorflow/python/ops/numpy_ops/integration_test:public_symbol_test PASSED in 17.1s //tensorflow/python/ops/parallel_for:array_test_cpu PASSED in 35.2s //tensorflow/python/ops/parallel_for:gradients_test_cpu PASSED in 13.3s //tensorflow/python/ops/parallel_for:xla_control_flow_ops_test_cpu PASSED in 50.7s //tensorflow/python/ops/ragged:convert_to_tensor_or_ragged_tensor_op_test PASSED in 8.6s //tensorflow/python/ops/ragged:ragged_batch_gather_op_test PASSED in 39.8s //tensorflow/python/ops/ragged:ragged_bitcast_op_test PASSED in 7.3s //tensorflow/python/ops/ragged:ragged_boolean_mask_op_test PASSED in 15.4s //tensorflow/python/ops/ragged:ragged_concat_op_test PASSED in 13.7s //tensorflow/python/ops/ragged:ragged_const_op_test PASSED in 7.9s //tensorflow/python/ops/ragged:ragged_constant_value_op_test PASSED in 8.5s //tensorflow/python/ops/ragged:ragged_dispatch_test PASSED in 114.9s //tensorflow/python/ops/ragged:ragged_dynamic_partition_op_test_cpu PASSED in 16.9s //tensorflow/python/ops/ragged:ragged_eager_test PASSED in 8.2s //tensorflow/python/ops/ragged:ragged_expand_dims_op_test PASSED in 9.9s //tensorflow/python/ops/ragged:ragged_factory_ops_test_cpu PASSED in 17.9s //tensorflow/python/ops/ragged:ragged_from_sparse_op_test PASSED in 11.8s //tensorflow/python/ops/ragged:ragged_from_tensor_op_test PASSED in 22.8s //tensorflow/python/ops/ragged:ragged_gather_nd_op_test PASSED in 12.1s //tensorflow/python/ops/ragged:ragged_map_flat_values_op_test PASSED in 11.1s //tensorflow/python/ops/ragged:ragged_map_fn_op_test PASSED in 15.4s //tensorflow/python/ops/ragged:ragged_math_ops_test PASSED in 12.8s //tensorflow/python/ops/ragged:ragged_matmul_op_test PASSED in 33.8s //tensorflow/python/ops/ragged:ragged_merge_dims_op_test PASSED in 26.4s //tensorflow/python/ops/ragged:ragged_one_hot_op_test PASSED in 9.2s //tensorflow/python/ops/ragged:ragged_operators_test PASSED in 21.1s //tensorflow/python/ops/ragged:ragged_placeholder_op_test PASSED in 7.3s //tensorflow/python/ops/ragged:ragged_print_op_test PASSED in 13.5s //tensorflow/python/ops/ragged:ragged_range_op_test PASSED in 9.8s //tensorflow/python/ops/ragged:ragged_rank_op_test PASSED in 14.5s //tensorflow/python/ops/ragged:ragged_reduce_op_test PASSED in 39.0s //tensorflow/python/ops/ragged:ragged_resize_image_op_test PASSED in 19.7s //tensorflow/python/ops/ragged:ragged_reverse_op_test PASSED in 9.6s //tensorflow/python/ops/ragged:ragged_row_lengths_op_test PASSED in 9.6s //tensorflow/python/ops/ragged:ragged_row_splits_to_segment_ids_op_test PASSED in 7.0s //tensorflow/python/ops/ragged:ragged_segment_ids_to_row_splits_op_test PASSED in 7.4s //tensorflow/python/ops/ragged:ragged_segment_op_test PASSED in 15.2s //tensorflow/python/ops/ragged:ragged_size_op_test PASSED in 7.8s //tensorflow/python/ops/ragged:ragged_split_op_test PASSED in 40.9s //tensorflow/python/ops/ragged:ragged_squeeze_op_test PASSED in 16.5s //tensorflow/python/ops/ragged:ragged_stack_op_test PASSED in 15.0s //tensorflow/python/ops/ragged:ragged_tensor_bounding_shape_op_test PASSED in 10.8s //tensorflow/python/ops/ragged:ragged_tensor_shape_test PASSED in 66.8s //tensorflow/python/ops/ragged:ragged_tile_op_test PASSED in 40.1s //tensorflow/python/ops/ragged:ragged_to_sparse_op_test PASSED in 10.4s //tensorflow/python/ops/ragged:ragged_to_tensor_op_test PASSED in 59.6s //tensorflow/python/ops/ragged:ragged_util_test PASSED in 23.2s //tensorflow/python/ops/ragged:ragged_where_op_test PASSED in 29.1s //tensorflow/python/ops/ragged:row_partition_test PASSED in 27.5s //tensorflow/python/ops/ragged:string_ngrams_op_test PASSED in 8.6s //tensorflow/python/ops/ragged:strings_reduce_join_op_test PASSED in 9.6s //tensorflow/python/ops/structured:structured_array_ops_test PASSED in 38.6s //tensorflow/python/ops/structured:structured_tensor_slice_test PASSED in 56.1s //tensorflow/python/ops/structured:structured_tensor_spec_test PASSED in 10.6s //tensorflow/python/ops/structured:structured_tensor_test PASSED in 42.0s //tensorflow/python/ops/v1_compat_tests:gradient_checker_test_cpu PASSED in 10.1s //tensorflow/python/platform:benchmark_test PASSED in 10.3s //tensorflow/python/platform:build_info_test PASSED in 8.1s //tensorflow/python/platform:resource_loader_test PASSED in 2.2s //tensorflow/python/profiler:pprof_profiler_test PASSED in 8.2s //tensorflow/python/profiler:profile_context_test_cpu PASSED in 27.9s //tensorflow/python/profiler:profiler_client_test_cpu PASSED in 8.4s //tensorflow/python/profiler:profiler_test_cpu PASSED in 21.1s //tensorflow/python/profiler:profiler_v2_test_cpu PASSED in 8.3s //tensorflow/python/profiler:profiler_wrapper_test PASSED in 8.9s //tensorflow/python/profiler:tfprof_logger_test PASSED in 9.0s //tensorflow/python/profiler/integration_test:profiler_api_test_cpu PASSED in 32.3s //tensorflow/python/profiler/internal:flops_registry_test PASSED in 7.0s //tensorflow/python/profiler/internal:print_model_analysis_test PASSED in 11.1s //tensorflow/python/profiler/internal:run_metadata_test_cpu PASSED in 15.4s //tensorflow/python/saved_model:fingerprinting_test PASSED in 15.2s //tensorflow/python/saved_model:keras_injection_test PASSED in 13.4s //tensorflow/python/saved_model:load_v1_in_v2_test PASSED in 15.9s //tensorflow/python/saved_model:loader_test PASSED in 11.8s //tensorflow/python/saved_model:method_name_updater_test PASSED in 11.7s //tensorflow/python/saved_model:metrics_test PASSED in 10.0s //tensorflow/python/saved_model:nested_structure_coder_test PASSED in 8.1s //tensorflow/python/saved_model:pywrap_saved_model_fingerprinting_test PASSED in 7.8s //tensorflow/python/saved_model:pywrap_saved_model_metrics_test PASSED in 6.9s //tensorflow/python/saved_model:revived_types_test PASSED in 10.6s //tensorflow/python/saved_model:save_context_test PASSED in 7.4s //tensorflow/python/saved_model:save_test PASSED in 24.2s //tensorflow/python/saved_model:saved_model_test PASSED in 18.7s //tensorflow/python/saved_model:signature_def_utils_test PASSED in 8.4s //tensorflow/python/saved_model:simple_save_test PASSED in 9.5s //tensorflow/python/saved_model:tracing_utils_test PASSED in 9.1s //tensorflow/python/saved_model:utils_test PASSED in 8.7s //tensorflow/python/saved_model/model_utils:export_output_test PASSED in 8.2s //tensorflow/python/saved_model/model_utils:export_test PASSED in 12.9s //tensorflow/python/saved_model/model_utils:mode_keys_test PASSED in 7.7s //tensorflow/python/saved_model/registration:registration_saving_test PASSED in 16.9s //tensorflow/python/saved_model/registration:registration_test PASSED in 9.2s //tensorflow/python/saved_model/registration:tf_registration_test PASSED in 17.5s //tensorflow/python/summary:plugin_asset_test PASSED in 8.7s //tensorflow/python/summary:summary_iterator_test PASSED in 7.9s //tensorflow/python/summary:summary_test PASSED in 11.1s //tensorflow/python/summary:summary_v2_test PASSED in 9.9s //tensorflow/python/summary/writer:writer_test PASSED in 22.0s //tensorflow/python/tools:aot_compiled_test PASSED in 23.2s //tensorflow/python/tools:freeze_graph_test PASSED in 17.9s //tensorflow/python/tools:optimize_for_inference_test PASSED in 19.5s //tensorflow/python/tools:print_selective_registration_header_test PASSED in 22.1s //tensorflow/python/tools:saved_model_cli_test PASSED in 36.7s //tensorflow/python/tools:saved_model_utils_test PASSED in 9.2s //tensorflow/python/tools:strip_unused_test PASSED in 11.2s //tensorflow/python/tools/api/generator:create_python_api_test PASSED in 17.0s //tensorflow/python/tools/api/generator:output_init_files_test PASSED in 60.7s //tensorflow/python/tools/api/generator:tensorflow_doc_srcs_test PASSED in 15.2s //tensorflow/python/tpu:bfloat16_test PASSED in 15.8s //tensorflow/python/tpu:feature_column_test PASSED in 19.6s //tensorflow/python/tpu:topology_test PASSED in 9.7s //tensorflow/python/tpu:tpu_embedding_for_serving_test PASSED in 12.4s //tensorflow/python/tpu:tpu_embedding_v2_utils_test PASSED in 9.0s //tensorflow/python/tpu:tpu_infeed_test PASSED in 17.0s //tensorflow/python/tpu:tpu_sharding_test PASSED in 8.0s //tensorflow/python/tpu:tpu_test_wrapper_test PASSED in 8.6s //tensorflow/python/tpu/client:client_py_test PASSED in 18.0s //tensorflow/python/trackable:autotrackable_test PASSED in 17.6s //tensorflow/python/trackable:base_delegate_test PASSED in 8.0s //tensorflow/python/trackable:base_test PASSED in 7.9s //tensorflow/python/trackable:data_structures_test PASSED in 14.3s //tensorflow/python/trackable:python_state_test PASSED in 10.2s //tensorflow/python/trackable:resource_test PASSED in 7.4s //tensorflow/python/trackable:trackable_utils_test PASSED in 9.5s //tensorflow/python/training:adadelta_test_cpu PASSED in 16.8s //tensorflow/python/training:adagrad_da_test_cpu PASSED in 10.8s //tensorflow/python/training:adagrad_test_cpu PASSED in 14.0s //tensorflow/python/training:adam_test_cpu PASSED in 14.1s //tensorflow/python/training:basic_loops_test_cpu PASSED in 9.4s //tensorflow/python/training:basic_session_run_hooks_test PASSED in 24.1s //tensorflow/python/training:checkpoint_ops_test PASSED in 8.1s //tensorflow/python/training:coordinator_test_cpu PASSED in 16.1s //tensorflow/python/training:device_setter_test_cpu PASSED in 8.5s //tensorflow/python/training:ftrl_test_cpu PASSED in 15.3s //tensorflow/python/training:gradient_descent_test_cpu PASSED in 14.2s //tensorflow/python/training:input_test PASSED in 20.9s //tensorflow/python/training:momentum_test_cpu PASSED in 13.5s //tensorflow/python/training:monitored_session_test PASSED in 27.9s //tensorflow/python/training:moving_averages_test_cpu PASSED in 14.4s //tensorflow/python/training:optimizer_test_cpu PASSED in 12.6s //tensorflow/python/training:proximal_adagrad_test_cpu PASSED in 10.5s //tensorflow/python/training:proximal_gradient_descent_test_cpu PASSED in 12.6s //tensorflow/python/training:quantize_training_test_cpu PASSED in 12.5s //tensorflow/python/training:queue_runner_test_cpu PASSED in 7.3s //tensorflow/python/training:rmsprop_test_cpu PASSED in 21.9s //tensorflow/python/training:saver_large_partitioned_variable_test PASSED in 21.8s //tensorflow/python/training:saver_test_2gpu PASSED in 34.5s //tensorflow/python/training:saver_test_cpu PASSED in 34.1s //tensorflow/python/training:server_lib_multiple_containers_test PASSED in 10.1s //tensorflow/python/training:server_lib_same_variables_clear_container_test PASSED in 11.0s //tensorflow/python/training:server_lib_same_variables_clear_test PASSED in 11.2s //tensorflow/python/training:server_lib_same_variables_no_clear_test PASSED in 8.9s //tensorflow/python/training:server_lib_sparse_job_test PASSED in 9.0s //tensorflow/python/training:server_lib_test PASSED in 18.0s //tensorflow/python/training:session_manager_test_cpu PASSED in 79.7s //tensorflow/python/training:slot_creator_test_cpu PASSED in 10.1s //tensorflow/python/training:supervisor_test PASSED in 36.6s //tensorflow/python/training:training_ops_mlir_test_cpu PASSED in 12.0s //tensorflow/python/training:training_ops_test_cpu PASSED in 12.2s //tensorflow/python/training:training_util_test PASSED in 8.5s //tensorflow/python/training:warm_starting_util_test PASSED in 23.8s //tensorflow/python/training/experimental:loss_scale_optimizer_test PASSED in 16.5s //tensorflow/python/training/experimental:loss_scale_test PASSED in 23.9s //tensorflow/python/training/experimental:mixed_precision_test_cpu PASSED in 9.3s //tensorflow/python/training/saving:saveable_object_util_test PASSED in 6.4s //tensorflow/python/util:compat_test PASSED in 9.4s //tensorflow/python/util:decorator_utils_test PASSED in 6.7s //tensorflow/python/util:deprecation_test PASSED in 8.8s //tensorflow/python/util:dispatch_test PASSED in 9.5s //tensorflow/python/util:example_parser_configuration_test PASSED in 9.9s //tensorflow/python/util:fast_module_type_test PASSED in 9.4s //tensorflow/python/util:function_parameter_canonicalizer_test PASSED in 8.3s //tensorflow/python/util:function_utils_test PASSED in 8.7s //tensorflow/python/util:keyword_args_test PASSED in 9.5s //tensorflow/python/util:lock_util_test PASSED in 9.8s //tensorflow/python/util:module_wrapper_test PASSED in 10.6s //tensorflow/python/util:nest_test PASSED in 15.0s //tensorflow/python/util:object_identity_test PASSED in 9.4s //tensorflow/python/util:pywrap_xla_ops_test PASSED in 2.0s //tensorflow/python/util:serialization_test PASSED in 8.9s //tensorflow/python/util:tf_contextlib_test PASSED in 10.2s //tensorflow/python/util:tf_decorator_test PASSED in 9.8s //tensorflow/python/util:tf_export_test PASSED in 7.5s //tensorflow/python/util:tf_inspect_test PASSED in 10.1s //tensorflow/python/util:tf_should_use_test PASSED in 8.1s //tensorflow/python/util:tf_stack_test PASSED in 9.3s //tensorflow/python/util:traceback_utils_test PASSED in 10.8s //tensorflow/python/util:type_annotations_test PASSED in 8.6s //tensorflow/python/util:variable_utils_test PASSED in 10.8s //tensorflow/python/util:vlog_test PASSED in 15.8s //tensorflow/tools/api/tests:module_test PASSED in 19.6s //tensorflow/tools/benchmark:benchmark_model_test PASSED in 3.2s //tensorflow/tools/common:public_api_test PASSED in 2.9s //tensorflow/tools/common:traverse_test PASSED in 7.3s //tensorflow/tools/compatibility:all_renames_v2_test PASSED in 7.8s //tensorflow/tools/compatibility:ast_edits_test PASSED in 8.6s //tensorflow/tools/compatibility:test_file_v1_0 PASSED in 16.4s //tensorflow/tools/compatibility:test_file_v2_0 PASSED in 24.6s //tensorflow/tools/compatibility:tf_upgrade_test PASSED in 8.3s //tensorflow/tools/compatibility:tf_upgrade_v2_safety_test PASSED in 8.0s //tensorflow/tools/docs:tf_doctest_test PASSED in 1.1s //tensorflow/tools/graph_transforms:file_utils_test PASSED in 0.6s //tensorflow/tools/graph_transforms:transform_graph_test PASSED in 1.7s //tensorflow/tools/graph_transforms:transform_utils_test PASSED in 1.6s //tensorflow/tools/graph_transforms:transforms_test PASSED in 3.9s //tensorflow/tools/proto_text:gen_proto_text_functions_lib_test PASSED in 0.1s //tensorflow/tools/tensorflow_builder/compat_checker:compat_checker_test PASSED in 0.3s //tensorflow/tsl/c:tsl_status_helper_test PASSED in 0.1s //tensorflow/tsl/c:tsl_status_test PASSED in 0.3s //tensorflow/tsl/concurrency:async_value_ref_test PASSED in 0.1s //tensorflow/tsl/concurrency:async_value_test PASSED in 0.1s //tensorflow/tsl/concurrency:concurrent_vector_test PASSED in 0.1s //tensorflow/tsl/cuda:cudnn_version_test PASSED in 0.1s //tensorflow/tsl/distributed_runtime/coordination:coordination_service_agent_test PASSED in 12.7s //tensorflow/tsl/distributed_runtime/coordination:coordination_service_error_util_test PASSED in 0.2s //tensorflow/tsl/distributed_runtime/coordination:coordination_service_recoverable_job_test PASSED in 0.7s //tensorflow/tsl/distributed_runtime/preemption:preemption_notifier_test PASSED in 6.3s //tensorflow/tsl/distributed_runtime/preemption:preemption_sync_manager_test PASSED in 5.4s //tensorflow/tsl/distributed_runtime/rpc:grpc_channel_test PASSED in 0.3s //tensorflow/tsl/distributed_runtime/rpc:grpc_util_test PASSED in 0.3s //tensorflow/tsl/framework:cancellation_test PASSED in 1.1s //tensorflow/tsl/framework:device_id_utils_test PASSED in 4.0s //tensorflow/tsl/framework/convolution:spatial_convolutions_test PASSED in 0.2s //tensorflow/tsl/lib/gtl:tsl_lib_gtl_tests PASSED in 0.1s //tensorflow/tsl/lib/hash:crc32c_test PASSED in 0.2s //tensorflow/tsl/lib/histogram:histogram_test PASSED in 0.1s //tensorflow/tsl/lib/io:buffered_inputstream_test PASSED in 0.1s //tensorflow/tsl/lib/io:cache_test PASSED in 0.1s //tensorflow/tsl/lib/io:inputbuffer_test PASSED in 1.3s //tensorflow/tsl/lib/io:inputstream_interface_test PASSED in 0.2s //tensorflow/tsl/lib/io:random_inputstream_test PASSED in 0.1s //tensorflow/tsl/lib/io:record_reader_writer_test PASSED in 0.3s //tensorflow/tsl/lib/io:recordio_test PASSED in 0.3s //tensorflow/tsl/lib/io:table_test PASSED in 3.9s //tensorflow/tsl/lib/io:zlib_buffers_test PASSED in 15.0s //tensorflow/tsl/lib/io/snappy:snappy_test PASSED in 0.3s //tensorflow/tsl/lib/math:math_util_test PASSED in 0.6s //tensorflow/tsl/lib/random:distribution_sampler_test PASSED in 0.2s //tensorflow/tsl/lib/random:philox_random_test PASSED in 0.1s //tensorflow/tsl/lib/random:random_distributions_test PASSED in 17.3s //tensorflow/tsl/lib/random:simple_philox_test PASSED in 0.1s //tensorflow/tsl/lib/random:weighted_picker_test PASSED in 11.5s //tensorflow/tsl/platform:ctstring_test PASSED in 0.1s //tensorflow/tsl/platform:denormal_test PASSED in 0.2s //tensorflow/tsl/platform:errors_test PASSED in 0.1s //tensorflow/tsl/platform:fingerprint_test PASSED in 0.1s //tensorflow/tsl/platform:float8_test PASSED in 1.2s //tensorflow/tsl/platform:hash_test PASSED in 0.1s //tensorflow/tsl/platform:integral_types_test PASSED in 0.1s //tensorflow/tsl/platform:intrusive_ptr_test PASSED in 0.1s //tensorflow/tsl/platform:logging_test PASSED in 21.2s //tensorflow/tsl/platform:mutex_test PASSED in 0.1s //tensorflow/tsl/platform:net_test PASSED in 0.1s //tensorflow/tsl/platform:numbers_test PASSED in 0.2s //tensorflow/tsl/platform:path_test PASSED in 0.6s //tensorflow/tsl/platform:port_test PASSED in 8.2s //tensorflow/tsl/platform:random_test PASSED in 2.6s //tensorflow/tsl/platform:refcount_test PASSED in 0.3s //tensorflow/tsl/platform:retrying_file_system_test PASSED in 0.1s //tensorflow/tsl/platform:retrying_utils_test PASSED in 0.1s //tensorflow/tsl/platform:scanner_test PASSED in 1.0s //tensorflow/tsl/platform:setround_test PASSED in 0.2s //tensorflow/tsl/platform:stacktrace_handler_test PASSED in 1.9s //tensorflow/tsl/platform:stacktrace_test PASSED in 0.1s //tensorflow/tsl/platform:status_matchers_test PASSED in 0.3s //tensorflow/tsl/platform:status_test PASSED in 0.1s //tensorflow/tsl/platform:statusor_test PASSED in 16.5s //tensorflow/tsl/platform:str_util_test PASSED in 0.3s //tensorflow/tsl/platform:strcat_test PASSED in 0.3s //tensorflow/tsl/platform:stringpiece_test PASSED in 1.2s //tensorflow/tsl/platform:stringprintf_test PASSED in 0.2s //tensorflow/tsl/platform:subprocess_test PASSED in 0.2s //tensorflow/tsl/platform:tstring_test PASSED in 0.1s //tensorflow/tsl/platform:unbounded_work_queue_test PASSED in 0.3s //tensorflow/tsl/platform/cloud:compute_engine_metadata_client_test PASSED in 0.1s //tensorflow/tsl/platform/cloud:compute_engine_zone_provider_test PASSED in 0.1s //tensorflow/tsl/platform/cloud:curl_http_request_test PASSED in 9.7s //tensorflow/tsl/platform/cloud:expiring_lru_cache_test PASSED in 0.1s //tensorflow/tsl/platform/cloud:gcs_dns_cache_test PASSED in 0.2s //tensorflow/tsl/platform/cloud:gcs_file_system_test PASSED in 4.0s //tensorflow/tsl/platform/cloud:gcs_throttle_test PASSED in 0.1s //tensorflow/tsl/platform/cloud:google_auth_provider_test PASSED in 0.1s //tensorflow/tsl/platform/cloud:oauth_client_test PASSED in 0.1s //tensorflow/tsl/platform/cloud:ram_file_block_cache_test PASSED in 2.4s //tensorflow/tsl/platform/cloud:time_util_test PASSED in 0.2s //tensorflow/tsl/profiler/backends/cpu:traceme_recorder_test PASSED in 0.1s //tensorflow/tsl/profiler/convert:trace_container_test PASSED in 1.0s //tensorflow/tsl/profiler/convert:trace_events_to_json_test PASSED in 0.1s //tensorflow/tsl/profiler/convert:xla_op_utils_test PASSED in 0.1s //tensorflow/tsl/profiler/convert:xplane_to_trace_events_test PASSED in 0.1s //tensorflow/tsl/profiler/lib:profiler_factory_test PASSED in 0.1s //tensorflow/tsl/profiler/lib:profiler_lock_test PASSED in 0.1s //tensorflow/tsl/profiler/lib:scoped_annotation_test PASSED in 0.1s //tensorflow/tsl/profiler/lib:traceme_encode_test PASSED in 0.4s //tensorflow/tsl/profiler/rpc/client:profiler_client_test PASSED in 3.5s //tensorflow/tsl/profiler/rpc/client:remote_profiler_session_manager_test PASSED in 3.2s //tensorflow/tsl/profiler/utils:buffer_pool_test PASSED in 0.3s //tensorflow/tsl/profiler/utils:group_events_test PASSED in 0.3s //tensorflow/tsl/profiler/utils:parse_annotation_test PASSED in 0.1s //tensorflow/tsl/profiler/utils:preprocess_xplane_test PASSED in 0.1s //tensorflow/tsl/profiler/utils:tf_op_utils_test PASSED in 0.1s //tensorflow/tsl/profiler/utils:timespan_test PASSED in 0.1s //tensorflow/tsl/profiler/utils:tpu_xplane_utils_test PASSED in 0.2s //tensorflow/tsl/profiler/utils:xplane_builder_test PASSED in 0.2s //tensorflow/tsl/profiler/utils:xplane_utils_test PASSED in 0.4s //tensorflow/tsl/util:device_name_utils_test PASSED in 0.1s //tensorflow/tsl/util:stats_calculator_test PASSED in 0.1s //tensorflow/compiler/tests:complex_div_test_cpu PASSED in 8.7s Stats over 2 runs: max = 8.7s, min = 8.2s, avg = 8.5s, dev = 0.3s //tensorflow/compiler/tests:complex_div_test_cpu_mlir_bridge_test PASSED in 9.2s Stats over 2 runs: max = 9.2s, min = 8.6s, avg = 8.9s, dev = 0.3s //tensorflow/compiler/xla/tests:conditional_test_cpu PASSED in 11.0s Stats over 2 runs: max = 11.0s, min = 10.4s, avg = 10.7s, dev = 0.3s //tensorflow/python:control_flow_ops_test_cpu PASSED in 26.2s Stats over 2 runs: max = 26.2s, min = 23.1s, avg = 24.7s, dev = 1.6s //tensorflow/python/data/experimental/kernel_tests/optimization:optimization_test PASSED in 30.0s Stats over 2 runs: max = 30.0s, min = 17.7s, avg = 23.8s, dev = 6.2s //tensorflow/python/data/experimental/kernel_tests/service:metadata_test PASSED in 14.4s Stats over 2 runs: max = 14.4s, min = 12.4s, avg = 13.4s, dev = 1.0s //tensorflow/python/data/kernel_tests:padded_batch_test PASSED in 26.7s Stats over 2 runs: max = 26.7s, min = 26.6s, avg = 26.6s, dev = 0.0s //tensorflow/python/data/kernel_tests:repeat_test PASSED in 52.8s Stats over 2 runs: max = 52.8s, min = 51.4s, avg = 52.1s, dev = 0.7s //tensorflow/python/data/kernel_tests:window_test PASSED in 37.9s Stats over 2 runs: max = 37.9s, min = 28.3s, avg = 33.1s, dev = 4.8s //tensorflow/python/distribute:strategy_common_test_2gpu PASSED in 29.8s Stats over 2 runs: max = 29.8s, min = 25.9s, avg = 27.9s, dev = 1.9s //tensorflow/python/distribute:strategy_common_test_cpu PASSED in 27.5s Stats over 2 runs: max = 27.5s, min = 23.3s, avg = 25.4s, dev = 2.1s //tensorflow/python/distribute:strategy_common_test_xla_2gpu PASSED in 15.8s Stats over 2 runs: max = 15.8s, min = 15.5s, avg = 15.7s, dev = 0.1s //tensorflow/python/kernel_tests/array_ops:scatter_nd_ops_test_cpu PASSED in 14.2s Stats over 2 runs: max = 14.2s, min = 14.0s, avg = 14.1s, dev = 0.1s //tensorflow/python/kernel_tests/array_ops:scatter_ops_test_cpu PASSED in 20.4s Stats over 2 runs: max = 20.4s, min = 19.6s, avg = 20.0s, dev = 0.4s //tensorflow/python/kernel_tests/control_flow:functional_ops_test_cpu PASSED in 15.9s Stats over 2 runs: max = 15.9s, min = 15.3s, avg = 15.6s, dev = 0.3s //tensorflow/python/kernel_tests/control_flow:map_fn_test_cpu PASSED in 10.5s Stats over 2 runs: max = 10.5s, min = 9.7s, avg = 10.1s, dev = 0.4s //tensorflow/python/kernel_tests/nn_ops:atrous_conv2d_test_cpu PASSED in 27.4s Stats over 2 runs: max = 27.4s, min = 18.6s, avg = 23.0s, dev = 4.4s //tensorflow/python/kernel_tests/nn_ops:bias_op_d9m_test_cpu PASSED in 115.2s Stats over 2 runs: max = 115.2s, min = 45.7s, avg = 80.5s, dev = 34.8s //tensorflow/python/kernel_tests/nn_ops:conv2d_backprop_filter_grad_test_cpu PASSED in 68.7s Stats over 2 runs: max = 68.7s, min = 6.3s, avg = 37.5s, dev = 31.2s //tensorflow/python/ops/ragged:ragged_cross_op_test FLAKY, failed in 1 out of 2 in 16.2s Stats over 2 runs: max = 16.2s, min = 15.8s, avg = 16.0s, dev = 0.2s /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/testlogs/tensorflow/python/ops/ragged/ragged_cross_op_test/test_attempts/attempt_1.log //tensorflow/compiler/tests:spacetobatch_op_test_cpu PASSED in 8.9s Stats over 3 runs: max = 8.9s, min = 8.6s, avg = 8.8s, dev = 0.1s //tensorflow/compiler/tests:spacetobatch_op_test_cpu_mlir_bridge_test PASSED in 10.3s Stats over 3 runs: max = 10.3s, min = 9.5s, avg = 9.9s, dev = 0.3s //tensorflow/compiler/xla/tests:triangular_solve_test_cpu PASSED in 55.4s Stats over 3 runs: max = 55.4s, min = 53.9s, avg = 54.6s, dev = 0.6s //tensorflow/core/data/service:thread_safe_buffer_test PASSED in 0.2s Stats over 3 runs: max = 0.2s, min = 0.1s, avg = 0.1s, dev = 0.0s //tensorflow/python/data/experimental/kernel_tests/service:multi_process_cluster_test PASSED in 16.8s Stats over 3 runs: max = 16.8s, min = 12.1s, avg = 15.0s, dev = 2.0s //tensorflow/python/data/kernel_tests:unique_test PASSED in 22.2s Stats over 3 runs: max = 22.2s, min = 16.3s, avg = 18.5s, dev = 2.6s //tensorflow/python/kernel_tests/array_ops:gather_op_test_cpu PASSED in 43.7s Stats over 3 runs: max = 43.7s, min = 24.1s, avg = 31.3s, dev = 8.8s //tensorflow/python/kernel_tests/array_ops:weights_broadcast_test PASSED in 14.6s Stats over 3 runs: max = 14.6s, min = 14.0s, avg = 14.3s, dev = 0.3s //tensorflow/python/kernel_tests/distributions:util_test_cpu PASSED in 13.5s Stats over 3 runs: max = 13.5s, min = 12.1s, avg = 13.0s, dev = 0.6s //tensorflow/python/kernel_tests/linalg:matrix_triangular_solve_op_test_cpu PASSED in 113.7s Stats over 3 runs: max = 113.7s, min = 11.0s, avg = 45.4s, dev = 48.3s //tensorflow/python/kernel_tests/random:multinomial_op_big_test_cpu PASSED in 13.5s Stats over 3 runs: max = 13.5s, min = 9.8s, avg = 11.1s, dev = 1.7s //tensorflow/compiler/tests:ternary_ops_test_cpu PASSED in 17.4s Stats over 4 runs: max = 17.4s, min = 14.2s, avg = 15.6s, dev = 1.3s //tensorflow/compiler/tests:ternary_ops_test_cpu_mlir_bridge_test PASSED in 14.9s Stats over 4 runs: max = 14.9s, min = 11.0s, avg = 12.5s, dev = 1.5s //tensorflow/compiler/tests:unary_ops_test_cpu PASSED in 33.2s Stats over 4 runs: max = 33.2s, min = 10.6s, avg = 23.5s, dev = 9.2s //tensorflow/compiler/tests:unary_ops_test_cpu_mlir_bridge_test PASSED in 44.6s Stats over 4 runs: max = 44.6s, min = 7.8s, avg = 27.8s, dev = 15.5s //tensorflow/compiler/xla/tests:dynamic_ops_test_cpu PASSED in 15.4s Stats over 4 runs: max = 15.4s, min = 10.8s, avg = 12.5s, dev = 1.8s //tensorflow/core/kernels:example_parsing_ops_test PASSED in 0.5s Stats over 4 runs: max = 0.5s, min = 0.5s, avg = 0.5s, dev = 0.0s //tensorflow/python:nn_batchnorm_test_cpu PASSED in 16.3s Stats over 4 runs: max = 16.3s, min = 15.1s, avg = 15.8s, dev = 0.5s //tensorflow/python:nn_fused_batchnorm_d9m_test_cpu PASSED in 18.2s Stats over 4 runs: max = 18.2s, min = 16.3s, avg = 17.3s, dev = 0.7s //tensorflow/python/data/experimental/kernel_tests:auto_shard_dataset_test PASSED in 34.9s Stats over 4 runs: max = 34.9s, min = 18.5s, avg = 28.9s, dev = 6.2s //tensorflow/python/data/experimental/kernel_tests:map_and_batch_test PASSED in 71.0s Stats over 4 runs: max = 71.0s, min = 56.7s, avg = 61.7s, dev = 5.7s //tensorflow/python/data/experimental/kernel_tests:parse_example_dataset_test PASSED in 37.2s Stats over 4 runs: max = 37.2s, min = 14.2s, avg = 25.4s, dev = 10.9s //tensorflow/python/data/experimental/kernel_tests:rebatch_dataset_test PASSED in 27.2s Stats over 4 runs: max = 27.2s, min = 11.3s, avg = 17.4s, dev = 6.2s //tensorflow/python/data/experimental/kernel_tests:sql_dataset_test PASSED in 44.9s Stats over 4 runs: max = 44.9s, min = 25.4s, avg = 36.7s, dev = 7.5s //tensorflow/python/data/experimental/kernel_tests/service:cross_trainer_cache_ft_test PASSED in 53.5s Stats over 4 runs: max = 53.5s, min = 52.4s, avg = 52.9s, dev = 0.4s //tensorflow/python/data/experimental/kernel_tests/service:distributed_save_test PASSED in 44.1s Stats over 4 runs: max = 44.1s, min = 20.8s, avg = 31.3s, dev = 10.3s //tensorflow/python/data/kernel_tests:batch_test PASSED in 34.4s Stats over 4 runs: max = 34.4s, min = 26.7s, avg = 29.3s, dev = 3.1s //tensorflow/python/data/kernel_tests:fixed_length_record_dataset_test PASSED in 14.8s Stats over 4 runs: max = 14.8s, min = 8.9s, avg = 11.8s, dev = 2.8s //tensorflow/python/data/kernel_tests:from_generator_test PASSED in 24.8s Stats over 4 runs: max = 24.8s, min = 14.8s, avg = 18.7s, dev = 3.8s //tensorflow/python/data/kernel_tests:group_by_window_test PASSED in 24.2s Stats over 4 runs: max = 24.2s, min = 10.1s, avg = 15.9s, dev = 6.0s //tensorflow/python/data/kernel_tests:ragged_batch_test PASSED in 24.1s Stats over 4 runs: max = 24.1s, min = 21.7s, avg = 22.9s, dev = 1.0s //tensorflow/python/data/kernel_tests:skip_test PASSED in 24.9s Stats over 4 runs: max = 24.9s, min = 17.6s, avg = 21.1s, dev = 3.4s //tensorflow/python/data/kernel_tests:take_test PASSED in 31.3s Stats over 4 runs: max = 31.3s, min = 30.0s, avg = 30.7s, dev = 0.5s //tensorflow/python/data/kernel_tests:take_while_test PASSED in 29.3s Stats over 4 runs: max = 29.3s, min = 25.8s, avg = 27.6s, dev = 1.6s //tensorflow/python/data/kernel_tests:text_line_dataset_test PASSED in 19.3s Stats over 4 runs: max = 19.3s, min = 14.4s, avg = 16.9s, dev = 2.3s //tensorflow/python/data/kernel_tests:zip_test PASSED in 18.6s Stats over 4 runs: max = 18.6s, min = 17.0s, avg = 17.9s, dev = 0.6s //tensorflow/python/debug/lib:dumping_callback_test_cpu PASSED in 17.8s Stats over 4 runs: max = 17.8s, min = 16.7s, avg = 17.3s, dev = 0.5s //tensorflow/python/distribute:cross_device_ops_test_2gpu PASSED in 28.7s Stats over 4 runs: max = 28.7s, min = 19.4s, avg = 24.3s, dev = 3.6s //tensorflow/python/distribute:cross_device_ops_test_cpu PASSED in 34.8s Stats over 4 runs: max = 34.8s, min = 26.7s, avg = 30.3s, dev = 3.2s //tensorflow/python/distribute:strategy_gather_test_2gpu PASSED in 26.6s Stats over 4 runs: max = 26.6s, min = 16.4s, avg = 21.6s, dev = 4.5s //tensorflow/python/distribute:strategy_gather_test_cpu PASSED in 41.6s Stats over 4 runs: max = 41.6s, min = 31.0s, avg = 36.3s, dev = 4.8s //tensorflow/python/distribute:strategy_gather_test_xla_2gpu PASSED in 20.7s Stats over 4 runs: max = 20.7s, min = 9.8s, avg = 15.0s, dev = 5.0s //tensorflow/python/framework:convert_to_constants_test PASSED in 23.1s Stats over 4 runs: max = 23.1s, min = 16.8s, avg = 19.1s, dev = 2.4s //tensorflow/python/kernel_tests:collective_ops_test_2gpu PASSED in 44.8s Stats over 4 runs: max = 44.8s, min = 41.0s, avg = 43.1s, dev = 1.4s //tensorflow/python/kernel_tests:collective_ops_test_cpu PASSED in 34.7s Stats over 4 runs: max = 34.7s, min = 32.4s, avg = 33.0s, dev = 1.0s //tensorflow/python/kernel_tests/array_ops:concat_op_test_cpu PASSED in 14.4s Stats over 4 runs: max = 14.4s, min = 12.3s, avg = 13.3s, dev = 0.8s //tensorflow/python/kernel_tests/array_ops:init_ops_test_cpu PASSED in 58.1s Stats over 4 runs: max = 58.1s, min = 21.7s, avg = 39.5s, dev = 15.4s //tensorflow/python/kernel_tests/array_ops:split_op_test_cpu PASSED in 32.1s Stats over 4 runs: max = 32.1s, min = 14.7s, avg = 21.1s, dev = 7.2s //tensorflow/python/kernel_tests/linalg:einsum_op_test_cpu PASSED in 72.5s Stats over 4 runs: max = 72.5s, min = 21.6s, avg = 42.6s, dev = 20.6s //tensorflow/python/kernel_tests/linalg:linear_operator_lower_triangular_test_cpu PASSED in 45.6s Stats over 4 runs: max = 45.6s, min = 42.0s, avg = 43.3s, dev = 1.4s //tensorflow/python/kernel_tests/nn_ops:conv_ops_test_cpu PASSED in 37.9s Stats over 4 runs: max = 37.9s, min = 30.0s, avg = 33.6s, dev = 3.2s //tensorflow/python/kernel_tests/random:random_gamma_test_cpu PASSED in 80.9s Stats over 4 runs: max = 80.9s, min = 8.2s, avg = 39.5s, dev = 32.0s //tensorflow/python/kernel_tests/signal:window_ops_test_cpu PASSED in 23.6s Stats over 4 runs: max = 23.6s, min = 22.8s, avg = 23.2s, dev = 0.3s //tensorflow/python/ops/ragged:ragged_gather_op_test PASSED in 67.7s Stats over 4 runs: max = 67.7s, min = 19.0s, avg = 43.9s, dev = 17.3s //tensorflow/python/ops/ragged:ragged_getitem_test PASSED in 43.6s Stats over 4 runs: max = 43.6s, min = 39.8s, avg = 41.6s, dev = 1.4s //tensorflow/compiler/tests:async_comp_test_cpu PASSED in 10.6s Stats over 5 runs: max = 10.6s, min = 10.4s, avg = 10.5s, dev = 0.1s //tensorflow/compiler/tests:conv3d_test_cpu PASSED in 13.4s Stats over 5 runs: max = 13.4s, min = 7.2s, avg = 9.9s, dev = 2.7s //tensorflow/compiler/tests:conv3d_test_cpu_mlir_bridge_test PASSED in 15.2s Stats over 5 runs: max = 15.2s, min = 8.4s, avg = 11.3s, dev = 3.1s //tensorflow/compiler/tests:depthwise_conv_op_test_cpu PASSED in 15.8s Stats over 5 runs: max = 15.8s, min = 7.9s, avg = 11.4s, dev = 2.7s //tensorflow/compiler/tests:depthwise_conv_op_test_cpu_mlir_bridge_test PASSED in 16.1s Stats over 5 runs: max = 16.1s, min = 10.3s, avg = 12.7s, dev = 2.2s //tensorflow/compiler/tests:fused_batchnorm_test_cpu PASSED in 12.7s Stats over 5 runs: max = 12.7s, min = 11.4s, avg = 12.3s, dev = 0.4s //tensorflow/compiler/tests:fused_batchnorm_test_cpu_mlir_bridge_test PASSED in 7.9s Stats over 5 runs: max = 7.9s, min = 6.8s, avg = 7.4s, dev = 0.4s //tensorflow/compiler/tests:image_ops_jit_compile_test_cpu PASSED in 12.9s Stats over 5 runs: max = 12.9s, min = 11.5s, avg = 12.1s, dev = 0.5s //tensorflow/compiler/tests:reduce_ops_test_cpu PASSED in 11.5s Stats over 5 runs: max = 11.5s, min = 10.3s, avg = 11.0s, dev = 0.4s //tensorflow/compiler/tests:reduce_ops_test_cpu_mlir_bridge_test PASSED in 13.6s Stats over 5 runs: max = 13.6s, min = 11.2s, avg = 12.7s, dev = 0.9s //tensorflow/compiler/tests:repeat_op_test_cpu PASSED in 7.5s Stats over 5 runs: max = 7.5s, min = 5.5s, avg = 6.4s, dev = 0.7s //tensorflow/compiler/tests:repeat_op_test_cpu_mlir_bridge_test PASSED in 16.9s Stats over 5 runs: max = 16.9s, min = 14.2s, avg = 15.4s, dev = 0.9s //tensorflow/compiler/tests:special_math_test_cpu PASSED in 105.5s Stats over 5 runs: max = 105.5s, min = 17.6s, avg = 51.0s, dev = 30.2s //tensorflow/compiler/tests:special_math_test_cpu_mlir_bridge_test PASSED in 111.9s Stats over 5 runs: max = 111.9s, min = 12.3s, avg = 51.2s, dev = 34.1s //tensorflow/compiler/xla/client/lib:self_adjoint_eig_test_cpu PASSED in 29.2s Stats over 5 runs: max = 29.2s, min = 12.6s, avg = 21.9s, dev = 7.3s //tensorflow/core/grappler/optimizers:constant_folding_test PASSED in 3.0s Stats over 5 runs: max = 3.0s, min = 1.6s, avg = 2.3s, dev = 0.6s //tensorflow/dtensor/python/tests:layout_propagation_test_cpu PASSED in 13.5s Stats over 5 runs: max = 13.5s, min = 11.8s, avg = 12.7s, dev = 0.6s //tensorflow/python/distribute:mirrored_strategy_test_2gpu PASSED in 14.6s Stats over 5 runs: max = 14.6s, min = 13.1s, avg = 13.9s, dev = 0.5s //tensorflow/python/distribute:mirrored_strategy_test_cpu PASSED in 9.9s Stats over 5 runs: max = 9.9s, min = 9.2s, avg = 9.6s, dev = 0.3s //tensorflow/python/distribute:moving_averages_test_2gpu PASSED in 17.4s Stats over 5 runs: max = 17.4s, min = 15.1s, avg = 16.2s, dev = 0.8s //tensorflow/python/distribute:moving_averages_test_cpu PASSED in 17.8s Stats over 5 runs: max = 17.8s, min = 13.3s, avg = 15.4s, dev = 1.5s //tensorflow/python/distribute:vars_test_2gpu PASSED in 21.4s Stats over 5 runs: max = 21.4s, min = 19.9s, avg = 20.9s, dev = 0.5s //tensorflow/python/distribute:vars_test_cpu PASSED in 19.4s Stats over 5 runs: max = 19.4s, min = 14.8s, avg = 17.5s, dev = 1.6s //tensorflow/python/eager:device_placement_test_cpu PASSED in 11.0s Stats over 5 runs: max = 11.0s, min = 10.2s, avg = 10.7s, dev = 0.3s //tensorflow/python/eager:forwardprop_test_cpu PASSED in 95.6s Stats over 5 runs: max = 95.6s, min = 19.7s, avg = 48.0s, dev = 25.8s //tensorflow/python/eager/polymorphic_function:gradients_test_cpu PASSED in 17.4s Stats over 5 runs: max = 17.4s, min = 13.9s, avg = 15.3s, dev = 1.6s //tensorflow/python/kernel_tests/linalg:cholesky_op_test_cpu PASSED in 53.2s Stats over 5 runs: max = 53.2s, min = 34.2s, avg = 43.4s, dev = 6.5s //tensorflow/python/kernel_tests/linalg:linear_operator_adjoint_test_cpu PASSED in 23.7s Stats over 5 runs: max = 23.7s, min = 22.1s, avg = 22.8s, dev = 0.5s //tensorflow/python/kernel_tests/linalg:linear_operator_composition_test_cpu PASSED in 38.2s Stats over 5 runs: max = 38.2s, min = 35.3s, avg = 36.3s, dev = 1.2s //tensorflow/python/kernel_tests/linalg:linear_operator_diag_test_cpu PASSED in 22.9s Stats over 5 runs: max = 22.9s, min = 20.8s, avg = 21.8s, dev = 0.9s //tensorflow/python/kernel_tests/linalg:linear_operator_full_matrix_test_cpu PASSED in 27.7s Stats over 5 runs: max = 27.7s, min = 26.9s, avg = 27.1s, dev = 0.3s //tensorflow/python/kernel_tests/linalg:linear_operator_householder_test_cpu PASSED in 81.0s Stats over 5 runs: max = 81.0s, min = 73.7s, avg = 78.3s, dev = 3.0s //tensorflow/python/kernel_tests/linalg:linear_operator_identity_test_cpu PASSED in 35.3s Stats over 5 runs: max = 35.3s, min = 31.9s, avg = 33.5s, dev = 1.2s //tensorflow/python/kernel_tests/linalg:linear_operator_inversion_test_cpu PASSED in 25.0s Stats over 5 runs: max = 25.0s, min = 23.5s, avg = 24.4s, dev = 0.6s //tensorflow/python/kernel_tests/linalg:linear_operator_permutation_test_cpu PASSED in 24.8s Stats over 5 runs: max = 24.8s, min = 22.7s, avg = 23.8s, dev = 0.7s //tensorflow/python/kernel_tests/linalg:linear_operator_toeplitz_test_cpu PASSED in 18.4s Stats over 5 runs: max = 18.4s, min = 15.1s, avg = 16.5s, dev = 1.1s //tensorflow/python/kernel_tests/linalg:linear_operator_tridiag_test_cpu PASSED in 128.0s Stats over 5 runs: max = 128.0s, min = 123.7s, avg = 125.7s, dev = 1.6s //tensorflow/python/kernel_tests/linalg:linear_operator_util_test_cpu PASSED in 7.8s Stats over 5 runs: max = 7.8s, min = 7.4s, avg = 7.7s, dev = 0.1s //tensorflow/python/kernel_tests/linalg:linear_operator_zeros_test_cpu PASSED in 30.8s Stats over 5 runs: max = 30.8s, min = 26.9s, avg = 29.3s, dev = 1.3s //tensorflow/python/kernel_tests/nn_ops:fractional_avg_pool_op_test PASSED in 16.3s Stats over 5 runs: max = 16.3s, min = 9.2s, avg = 11.1s, dev = 2.8s //tensorflow/python/kernel_tests/nn_ops:fractional_max_pool_op_test PASSED in 14.7s Stats over 5 runs: max = 14.7s, min = 6.9s, avg = 8.9s, dev = 2.9s //tensorflow/python/kernel_tests/sparse_ops:sparse_ops_test_cpu PASSED in 30.3s Stats over 5 runs: max = 30.3s, min = 8.3s, avg = 13.2s, dev = 8.6s //tensorflow/python/ops/parallel_for:math_test_cpu PASSED in 76.4s Stats over 5 runs: max = 76.4s, min = 25.2s, avg = 45.0s, dev = 18.3s //tensorflow/compiler/tests:scan_ops_test_cpu PASSED in 15.8s Stats over 6 runs: max = 15.8s, min = 12.2s, avg = 14.1s, dev = 1.2s //tensorflow/compiler/tests:scan_ops_test_cpu_mlir_bridge_test PASSED in 18.4s Stats over 6 runs: max = 18.4s, min = 12.8s, avg = 16.1s, dev = 1.8s //tensorflow/python:accumulate_n_benchmark_cpu PASSED in 8.4s Stats over 6 runs: max = 8.4s, min = 7.9s, avg = 8.2s, dev = 0.2s //tensorflow/python/data/experimental/kernel_tests:make_batched_features_dataset_test PASSED in 24.4s Stats over 6 runs: max = 24.4s, min = 7.9s, avg = 15.3s, dev = 6.9s //tensorflow/python/kernel_tests/array_ops:diag_op_test_cpu PASSED in 95.5s Stats over 6 runs: max = 95.5s, min = 8.9s, avg = 26.6s, dev = 30.9s //tensorflow/python/kernel_tests/math_ops:reduction_ops_test_cpu PASSED in 51.3s Stats over 6 runs: max = 51.3s, min = 23.8s, avg = 36.4s, dev = 8.4s //tensorflow/python/distribute/experimental/rpc:rpc_ops_test PASSED in 15.3s Stats over 7 runs: max = 15.3s, min = 11.3s, avg = 13.1s, dev = 1.5s //tensorflow/compiler/tests:matrix_diag_ops_test_cpu PASSED in 60.6s Stats over 8 runs: max = 60.6s, min = 5.3s, avg = 24.0s, dev = 18.8s //tensorflow/compiler/tests:matrix_diag_ops_test_cpu_mlir_bridge_test PASSED in 67.9s Stats over 8 runs: max = 67.9s, min = 6.2s, avg = 25.9s, dev = 21.1s //tensorflow/dtensor/python/tests:input_util_test PASSED in 22.6s Stats over 8 runs: max = 22.6s, min = 14.9s, avg = 19.3s, dev = 2.6s //tensorflow/python/data/experimental/kernel_tests:csv_dataset_test PASSED in 26.7s Stats over 8 runs: max = 26.7s, min = 9.7s, avg = 16.8s, dev = 6.0s //tensorflow/python/data/experimental/kernel_tests:parallel_interleave_test PASSED in 99.0s Stats over 8 runs: max = 99.0s, min = 84.9s, avg = 91.9s, dev = 5.0s //tensorflow/python/data/experimental/kernel_tests/service:coordinated_read_ft_test PASSED in 51.6s Stats over 8 runs: max = 51.6s, min = 20.2s, avg = 34.7s, dev = 13.2s //tensorflow/python/data/experimental/kernel_tests/service:coordinated_read_test PASSED in 51.5s Stats over 8 runs: max = 51.5s, min = 16.0s, avg = 27.9s, dev = 12.6s //tensorflow/python/data/experimental/kernel_tests/service:cross_trainer_cache_test PASSED in 21.4s Stats over 8 runs: max = 21.4s, min = 7.2s, avg = 13.4s, dev = 5.3s //tensorflow/python/data/experimental/kernel_tests/service:fault_tolerance_test PASSED in 28.4s Stats over 8 runs: max = 28.4s, min = 21.6s, avg = 23.7s, dev = 2.3s //tensorflow/python/data/kernel_tests:filter_test PASSED in 17.1s Stats over 8 runs: max = 17.1s, min = 13.8s, avg = 15.3s, dev = 0.9s //tensorflow/python/data/kernel_tests:flat_map_test PASSED in 20.2s Stats over 8 runs: max = 20.2s, min = 13.7s, avg = 16.7s, dev = 2.3s //tensorflow/python/data/kernel_tests:shard_test PASSED in 21.8s Stats over 8 runs: max = 21.8s, min = 17.3s, avg = 19.9s, dev = 1.5s //tensorflow/python/data/kernel_tests:shuffle_test PASSED in 62.8s Stats over 8 runs: max = 62.8s, min = 32.8s, avg = 37.7s, dev = 9.6s //tensorflow/python/data/kernel_tests:tf_record_dataset_test PASSED in 26.1s Stats over 8 runs: max = 26.1s, min = 18.2s, avg = 22.9s, dev = 2.0s //tensorflow/python/kernel_tests/linalg:linalg_ops_test_cpu PASSED in 50.8s Stats over 8 runs: max = 50.8s, min = 30.0s, avg = 42.7s, dev = 7.0s //tensorflow/python/kernel_tests/linalg:linear_operator_block_diag_test_cpu PASSED in 77.4s Stats over 8 runs: max = 77.4s, min = 55.7s, avg = 65.4s, dev = 7.4s //tensorflow/python/kernel_tests/linalg:linear_operator_block_lower_triangular_test_cpu PASSED in 47.2s Stats over 8 runs: max = 47.2s, min = 29.1s, avg = 36.9s, dev = 6.1s //tensorflow/python/kernel_tests/nn_ops:depthwise_conv_op_d9m_test_cpu PASSED in 59.1s Stats over 8 runs: max = 59.1s, min = 10.8s, avg = 19.0s, dev = 15.6s //tensorflow/python/kernel_tests/nn_ops:depthwise_conv_op_test_cpu PASSED in 8.3s Stats over 8 runs: max = 8.3s, min = 3.0s, avg = 5.7s, dev = 2.3s //tensorflow/python/kernel_tests/signal:fft_ops_test_cpu PASSED in 20.6s Stats over 8 runs: max = 20.6s, min = 9.1s, avg = 13.7s, dev = 4.6s //tensorflow/python/ops/ragged:dynamic_ragged_shape_test PASSED in 44.1s Stats over 8 runs: max = 44.1s, min = 28.7s, avg = 34.9s, dev = 5.1s //tensorflow/python/ops/ragged:ragged_tensor_test PASSED in 23.9s Stats over 8 runs: max = 23.9s, min = 12.7s, avg = 16.5s, dev = 3.3s //tensorflow/python/distribute/failure_handling:failure_handler_test FLAKY, failed in 1 out of 9 in 900.0s Stats over 9 runs: max = 900.0s, min = 18.5s, avg = 129.9s, dev = 272.5s /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/testlogs/tensorflow/python/distribute/failure_handling/failure_handler_test/shard_8_of_8/test_attempts/attempt_1.log //tensorflow/python/distribute/failure_handling:gce_failure_handler_test FLAKY, failed in 1 out of 9 in 61.8s Stats over 9 runs: max = 61.8s, min = 12.9s, avg = 31.8s, dev = 19.6s /home/buildslave/.cache/bazel/_bazel_buildslave/fbac33eb30dbfb6b11b15a7ff5ac830d/execroot/org_tensorflow/bazel-out/aarch64-opt/testlogs/tensorflow/python/distribute/failure_handling/gce_failure_handler_test/shard_7_of_8/test_attempts/attempt_1.log //tensorflow/compiler/tests:bincount_op_test_cpu PASSED in 9.5s Stats over 10 runs: max = 9.5s, min = 7.3s, avg = 8.3s, dev = 0.6s //tensorflow/compiler/tests:conv2d_test_cpu PASSED in 9.2s Stats over 10 runs: max = 9.2s, min = 6.8s, avg = 8.1s, dev = 0.8s //tensorflow/compiler/tests:conv2d_test_cpu_mlir_bridge_test PASSED in 14.9s Stats over 10 runs: max = 14.9s, min = 9.9s, avg = 12.5s, dev = 1.7s //tensorflow/compiler/tests:image_ops_test_cpu PASSED in 16.8s Stats over 10 runs: max = 16.8s, min = 10.2s, avg = 14.0s, dev = 2.1s //tensorflow/compiler/tests:random_ops_test_cpu PASSED in 25.0s Stats over 10 runs: max = 25.0s, min = 19.6s, avg = 22.3s, dev = 1.8s //tensorflow/compiler/tests:random_ops_test_cpu_mlir_bridge_test PASSED in 22.1s Stats over 10 runs: max = 22.1s, min = 14.6s, avg = 18.3s, dev = 2.2s //tensorflow/compiler/tests:stateless_random_ops_test_cpu PASSED in 67.2s Stats over 10 runs: max = 67.2s, min = 44.4s, avg = 55.6s, dev = 8.6s //tensorflow/compiler/tests:stateless_random_ops_test_cpu_mlir_bridge_test PASSED in 65.4s Stats over 10 runs: max = 65.4s, min = 37.6s, avg = 52.2s, dev = 10.0s //tensorflow/compiler/xla/client/lib:svd_test_cpu PASSED in 67.7s Stats over 10 runs: max = 67.7s, min = 7.5s, avg = 24.7s, dev = 22.5s //tensorflow/compiler/xla/client/lib:tridiagonal_test_cpu PASSED in 13.4s Stats over 10 runs: max = 13.4s, min = 7.4s, avg = 10.4s, dev = 2.0s //tensorflow/compiler/xla/service/cpu:cpu_runtime_test PASSED in 10.8s Stats over 10 runs: max = 10.8s, min = 0.9s, avg = 8.4s, dev = 3.7s //tensorflow/python:special_math_ops_test_cpu PASSED in 57.1s Stats over 10 runs: max = 57.1s, min = 11.0s, avg = 17.9s, dev = 13.3s //tensorflow/python/data/kernel_tests:rejection_resample_test PASSED in 15.4s Stats over 10 runs: max = 15.4s, min = 5.8s, avg = 9.8s, dev = 2.7s //tensorflow/python/distribute:input_lib_test_2gpu PASSED in 31.6s Stats over 10 runs: max = 31.6s, min = 24.6s, avg = 27.2s, dev = 2.1s //tensorflow/python/distribute:input_lib_test_cpu PASSED in 28.1s Stats over 10 runs: max = 28.1s, min = 21.1s, avg = 24.3s, dev = 2.1s //tensorflow/python/distribute:input_lib_type_spec_test_2gpu PASSED in 19.8s Stats over 10 runs: max = 19.8s, min = 11.6s, avg = 15.6s, dev = 2.9s //tensorflow/python/distribute:input_lib_type_spec_test_cpu PASSED in 19.8s Stats over 10 runs: max = 19.8s, min = 11.3s, avg = 15.5s, dev = 3.1s //tensorflow/python/framework:config_vgpu_test_2gpu PASSED in 8.9s Stats over 10 runs: max = 8.9s, min = 8.1s, avg = 8.5s, dev = 0.2s //tensorflow/python/framework:config_vgpu_test_cpu PASSED in 9.3s Stats over 10 runs: max = 9.3s, min = 8.3s, avg = 8.8s, dev = 0.3s //tensorflow/python/framework:function_test_cpu PASSED in 54.4s Stats over 10 runs: max = 54.4s, min = 5.9s, avg = 12.3s, dev = 14.2s //tensorflow/python/grappler:cluster_test_cpu PASSED in 9.1s Stats over 10 runs: max = 9.1s, min = 4.7s, avg = 7.4s, dev = 1.7s //tensorflow/python/kernel_tests/array_ops:array_ops_test_cpu PASSED in 37.2s Stats over 10 runs: max = 37.2s, min = 33.2s, avg = 34.5s, dev = 1.2s //tensorflow/python/kernel_tests/array_ops:inplace_ops_test_cpu PASSED in 9.0s Stats over 10 runs: max = 9.0s, min = 6.3s, avg = 8.1s, dev = 0.9s //tensorflow/python/kernel_tests/data_structures:tensor_array_ops_test_cpu PASSED in 15.6s Stats over 10 runs: max = 15.6s, min = 11.3s, avg = 13.2s, dev = 1.4s //tensorflow/python/kernel_tests/linalg:linear_operator_kronecker_test_cpu PASSED in 26.4s Stats over 10 runs: max = 26.4s, min = 23.3s, avg = 24.7s, dev = 1.1s //tensorflow/python/kernel_tests/linalg:linear_operator_low_rank_update_test_cpu PASSED in 66.7s Stats over 10 runs: max = 66.7s, min = 62.2s, avg = 64.4s, dev = 1.5s //tensorflow/python/kernel_tests/linalg:tridiagonal_matmul_op_test_cpu PASSED in 116.8s Stats over 10 runs: max = 116.8s, min = 3.6s, avg = 16.9s, dev = 33.4s //tensorflow/python/kernel_tests/linalg/sparse:csr_sparse_matrix_ops_test_cpu PASSED in 43.4s Stats over 10 runs: max = 43.4s, min = 14.3s, avg = 28.2s, dev = 9.2s //tensorflow/python/kernel_tests/math_ops:segment_reduction_ops_test_cpu PASSED in 28.2s Stats over 10 runs: max = 28.2s, min = 8.0s, avg = 16.5s, dev = 7.2s //tensorflow/python/kernel_tests/nn_ops:pooling_ops_test_cpu PASSED in 20.5s Stats over 10 runs: max = 20.5s, min = 3.5s, avg = 8.9s, dev = 5.7s //tensorflow/python/kernel_tests/nn_ops:rnn_test_cpu PASSED in 12.3s Stats over 10 runs: max = 12.3s, min = 10.2s, avg = 11.2s, dev = 0.6s //tensorflow/python/kernel_tests/random:random_index_shuffle_test PASSED in 8.1s Stats over 10 runs: max = 8.1s, min = 6.4s, avg = 7.4s, dev = 0.6s //tensorflow/python/kernel_tests/random:stateless_random_ops_test_cpu PASSED in 101.7s Stats over 10 runs: max = 101.7s, min = 14.4s, avg = 57.7s, dev = 41.3s //tensorflow/python/ops/ragged:ragged_tensor_supported_values_test PASSED in 18.7s Stats over 10 runs: max = 18.7s, min = 15.4s, avg = 16.9s, dev = 1.0s //tensorflow/python/saved_model:load_test_cpu PASSED in 56.2s Stats over 10 runs: max = 56.2s, min = 34.7s, avg = 40.6s, dev = 5.7s //tensorflow/compiler/tests:fft_test_cpu PASSED in 22.6s Stats over 12 runs: max = 22.6s, min = 6.6s, avg = 14.2s, dev = 5.9s //tensorflow/compiler/xla/service:triangular_solve_expander_test PASSED in 5.4s Stats over 12 runs: max = 5.4s, min = 2.5s, avg = 3.2s, dev = 0.7s //tensorflow/python/data/experimental/kernel_tests:group_by_reducer_test PASSED in 15.6s Stats over 12 runs: max = 15.6s, min = 4.3s, avg = 9.8s, dev = 3.6s //tensorflow/python/data/kernel_tests:choose_from_datasets_test PASSED in 12.4s Stats over 12 runs: max = 12.4s, min = 3.4s, avg = 7.4s, dev = 2.6s //tensorflow/python/data/kernel_tests:memory_cleanup_test_cpu PASSED in 12.9s Stats over 12 runs: max = 12.9s, min = 7.2s, avg = 10.3s, dev = 1.5s //tensorflow/python/distribute:multi_process_runner_test_2gpu PASSED in 226.3s Stats over 12 runs: max = 226.3s, min = 14.8s, avg = 53.1s, dev = 58.2s //tensorflow/python/distribute:multi_process_runner_test_cpu PASSED in 224.6s Stats over 12 runs: max = 224.6s, min = 12.4s, avg = 50.5s, dev = 58.4s //tensorflow/python/eager/polymorphic_function:polymorphic_function_test_cpu PASSED in 18.7s Stats over 15 runs: max = 18.7s, min = 11.2s, avg = 13.4s, dev = 2.0s //tensorflow/python/kernel_tests/linalg:linear_operator_circulant_test_cpu PASSED in 97.8s Stats over 15 runs: max = 97.8s, min = 89.8s, avg = 94.0s, dev = 2.4s //tensorflow/python/kernel_tests/nn_ops:rnn_cell_test_cpu PASSED in 45.1s Stats over 15 runs: max = 45.1s, min = 10.9s, avg = 15.7s, dev = 8.8s //tensorflow/python:image_ops_test_cpu PASSED in 16.2s Stats over 16 runs: max = 16.2s, min = 7.6s, avg = 11.8s, dev = 2.4s //tensorflow/python/data/experimental/kernel_tests/service:dynamic_sharding_test PASSED in 19.1s Stats over 16 runs: max = 19.1s, min = 11.0s, avg = 15.0s, dev = 2.4s //tensorflow/python/data/experimental/kernel_tests/service:worker_tags_test PASSED in 21.8s Stats over 16 runs: max = 21.8s, min = 9.4s, avg = 15.4s, dev = 4.4s //tensorflow/python/data/kernel_tests:snapshot_test PASSED in 30.9s Stats over 16 runs: max = 30.9s, min = 7.7s, avg = 19.8s, dev = 6.8s //tensorflow/python/kernel_tests/control_flow:control_flow_ops_py_test_cpu PASSED in 29.0s Stats over 16 runs: max = 29.0s, min = 9.2s, avg = 12.4s, dev = 4.5s //tensorflow/python/kernel_tests/linalg:matrix_exponential_op_test PASSED in 11.3s Stats over 16 runs: max = 11.3s, min = 6.3s, avg = 8.7s, dev = 1.7s //tensorflow/python/kernel_tests/signal:dct_ops_test_cpu PASSED in 11.5s Stats over 16 runs: max = 11.5s, min = 6.3s, avg = 8.3s, dev = 1.9s //tensorflow/python/ops/parallel_for:control_flow_ops_test_cpu PASSED in 63.3s Stats over 16 runs: max = 63.3s, min = 16.7s, avg = 25.1s, dev = 10.4s //tensorflow/python/data/experimental/kernel_tests/service:distributed_save_ft_test PASSED in 9.8s Stats over 17 runs: max = 9.8s, min = 4.1s, avg = 6.7s, dev = 2.0s //tensorflow/python/data/kernel_tests:map_test PASSED in 42.0s Stats over 19 runs: max = 42.0s, min = 12.6s, avg = 19.7s, dev = 6.6s //tensorflow/compiler/tests:pooling_ops_3d_test_cpu PASSED in 7.3s Stats over 20 runs: max = 7.3s, min = 5.5s, avg = 6.6s, dev = 0.5s //tensorflow/compiler/tests:pooling_ops_3d_test_cpu_mlir_bridge_test PASSED in 9.8s Stats over 20 runs: max = 9.8s, min = 7.4s, avg = 8.5s, dev = 0.7s //tensorflow/compiler/tests:pooling_ops_test_cpu PASSED in 11.7s Stats over 20 runs: max = 11.7s, min = 3.4s, avg = 5.5s, dev = 2.1s //tensorflow/compiler/tests:pooling_ops_test_cpu_mlir_bridge_test PASSED in 12.8s Stats over 20 runs: max = 12.8s, min = 2.8s, avg = 5.8s, dev = 2.3s //tensorflow/compiler/xla/tests:convolution_dimension_numbers_test_cpu PASSED in 10.1s Stats over 20 runs: max = 10.1s, min = 8.8s, avg = 9.5s, dev = 0.3s //tensorflow/compiler/xla/tests:dot_operation_single_threaded_runtime_test_cpu PASSED in 12.3s Stats over 20 runs: max = 12.3s, min = 9.4s, avg = 10.6s, dev = 0.7s //tensorflow/compiler/xla/tests:dot_operation_test_cpu PASSED in 25.0s Stats over 20 runs: max = 25.0s, min = 12.5s, avg = 16.9s, dev = 3.1s //tensorflow/compiler/xla/tests:prng_test_cpu PASSED in 8.1s Stats over 20 runs: max = 8.1s, min = 6.5s, avg = 7.3s, dev = 0.4s //tensorflow/compiler/xla/tests:reduce_window_test_cpu PASSED in 48.8s Stats over 20 runs: max = 48.8s, min = 7.1s, avg = 16.2s, dev = 11.6s //tensorflow/python/autograph/tests:loop_control_flow_test PASSED in 46.8s Stats over 20 runs: max = 46.8s, min = 18.6s, avg = 37.2s, dev = 7.8s //tensorflow/python/kernel_tests:metrics_test PASSED in 37.6s Stats over 20 runs: max = 37.6s, min = 9.7s, avg = 19.9s, dev = 7.8s //tensorflow/python/kernel_tests/array_ops:matrix_band_part_op_test_cpu PASSED in 7.8s Stats over 20 runs: max = 7.8s, min = 2.7s, avg = 6.1s, dev = 1.6s //tensorflow/python/kernel_tests/data_structures:barrier_ops_test PASSED in 14.1s Stats over 20 runs: max = 14.1s, min = 3.0s, avg = 6.9s, dev = 2.5s //tensorflow/python/kernel_tests/linalg:eig_op_test PASSED in 48.9s Stats over 20 runs: max = 48.9s, min = 6.9s, avg = 16.6s, dev = 13.1s //tensorflow/python/kernel_tests/linalg:linalg_grad_test_cpu PASSED in 98.1s Stats over 20 runs: max = 98.1s, min = 33.0s, avg = 55.5s, dev = 17.1s //tensorflow/python/kernel_tests/linalg:norm_op_test_cpu PASSED in 12.1s Stats over 20 runs: max = 12.1s, min = 6.1s, avg = 9.4s, dev = 1.6s //tensorflow/python/kernel_tests/linalg:normalize_op_test_cpu PASSED in 13.3s Stats over 20 runs: max = 13.3s, min = 5.1s, avg = 8.8s, dev = 2.6s //tensorflow/python/kernel_tests/linalg:qr_op_test_cpu PASSED in 119.5s Stats over 20 runs: max = 119.5s, min = 32.5s, avg = 77.0s, dev = 31.5s //tensorflow/python/kernel_tests/linalg:self_adjoint_eig_op_test_cpu PASSED in 24.1s Stats over 20 runs: max = 24.1s, min = 4.2s, avg = 11.2s, dev = 6.1s //tensorflow/python/kernel_tests/math_ops:batch_matmul_op_test_cpu PASSED in 22.0s Stats over 20 runs: max = 22.0s, min = 5.2s, avg = 12.6s, dev = 5.6s //tensorflow/python/kernel_tests/math_ops:matmul_op_test_cpu PASSED in 18.9s Stats over 20 runs: max = 18.9s, min = 12.2s, avg = 16.0s, dev = 2.0s //tensorflow/python/kernel_tests/math_ops:tensordot_op_test_cpu PASSED in 58.6s Stats over 20 runs: max = 58.6s, min = 8.7s, avg = 26.7s, dev = 17.7s //tensorflow/python/kernel_tests/nn_ops:embedding_ops_test_cpu PASSED in 28.9s Stats over 20 runs: max = 28.9s, min = 15.0s, avg = 20.4s, dev = 3.8s //tensorflow/python/data/experimental/kernel_tests/service:local_workers_test PASSED in 22.0s Stats over 24 runs: max = 22.0s, min = 8.5s, avg = 14.4s, dev = 4.0s //tensorflow/python/data/kernel_tests:interleave_test PASSED in 21.9s Stats over 24 runs: max = 21.9s, min = 8.0s, avg = 14.1s, dev = 4.2s //tensorflow/python/data/kernel_tests:sample_from_datasets_test PASSED in 19.3s Stats over 24 runs: max = 19.3s, min = 7.2s, avg = 12.4s, dev = 3.4s //tensorflow/compiler/xla/tests:array_elementwise_ops_test_cpu PASSED in 9.8s Stats over 25 runs: max = 9.8s, min = 7.2s, avg = 7.8s, dev = 0.6s //tensorflow/compiler/xla/tests:select_and_scatter_test_cpu PASSED in 47.7s Stats over 25 runs: max = 47.7s, min = 7.8s, avg = 16.0s, dev = 9.8s //tensorflow/compiler/xla/tests:convolution_variants_test_cpu PASSED in 8.6s Stats over 30 runs: max = 8.6s, min = 6.7s, avg = 7.6s, dev = 0.5s //tensorflow/compiler/xla/tests:iota_test_cpu PASSED in 15.9s Stats over 30 runs: max = 15.9s, min = 14.1s, avg = 14.8s, dev = 0.4s //tensorflow/compiler/xla/tests:params_test_cpu PASSED in 8.4s Stats over 30 runs: max = 8.4s, min = 6.1s, avg = 7.2s, dev = 0.5s //tensorflow/compiler/xla/tests:reshape_test_cpu PASSED in 13.0s Stats over 30 runs: max = 13.0s, min = 6.9s, avg = 8.3s, dev = 1.2s //tensorflow/python/kernel_tests/nn_ops:conv_ops_3d_test_cpu PASSED in 22.2s Stats over 30 runs: max = 22.2s, min = 3.1s, avg = 9.6s, dev = 5.2s //tensorflow/compiler/xla/tests:reduce_test_cpu PASSED in 12.0s Stats over 31 runs: max = 12.0s, min = 10.1s, avg = 11.2s, dev = 0.6s //tensorflow/compiler/xla/tests:scalar_computations_test_cpu PASSED in 10.3s Stats over 32 runs: max = 10.3s, min = 6.6s, avg = 7.9s, dev = 0.9s //tensorflow/python/data/experimental/kernel_tests/service:auto_shard_test PASSED in 19.8s Stats over 32 runs: max = 19.8s, min = 12.2s, avg = 15.8s, dev = 1.9s //tensorflow/python/data/experimental/kernel_tests/service:data_service_ops_test PASSED in 30.6s Stats over 32 runs: max = 30.6s, min = 7.4s, avg = 18.6s, dev = 5.3s //tensorflow/compiler/xla/tests:batch_normalization_test_cpu PASSED in 11.4s Stats over 40 runs: max = 11.4s, min = 6.9s, avg = 9.7s, dev = 1.3s //tensorflow/compiler/xla/tests:bfloat16_test_cpu PASSED in 11.3s Stats over 40 runs: max = 11.3s, min = 7.2s, avg = 9.3s, dev = 1.1s //tensorflow/compiler/xla/tests:conv_depthwise_backprop_filter_test_cpu PASSED in 13.9s Stats over 40 runs: max = 13.9s, min = 8.6s, avg = 11.2s, dev = 1.2s //tensorflow/compiler/xla/tests:slice_test_cpu PASSED in 9.8s Stats over 40 runs: max = 9.8s, min = 6.9s, avg = 8.6s, dev = 0.7s //tensorflow/compiler/mlir/quantization/tensorflow/python:quantize_model_test PASSED in 33.0s Stats over 50 runs: max = 33.0s, min = 15.2s, avg = 22.6s, dev = 5.3s //tensorflow/compiler/tests:sort_ops_test_cpu PASSED in 43.7s Stats over 50 runs: max = 43.7s, min = 2.8s, avg = 9.9s, dev = 8.5s //tensorflow/compiler/tests:sort_ops_test_cpu_mlir_bridge_test PASSED in 39.5s Stats over 50 runs: max = 39.5s, min = 2.9s, avg = 10.2s, dev = 8.0s //tensorflow/compiler/xla/tests:conv_depthwise_test_cpu PASSED in 14.1s Stats over 50 runs: max = 14.1s, min = 10.5s, avg = 11.7s, dev = 0.8s //tensorflow/compiler/xla/tests:convolution_test_1d_no_vmodule_cpu PASSED in 15.4s Stats over 50 runs: max = 15.4s, min = 12.1s, avg = 13.6s, dev = 0.7s //tensorflow/compiler/xla/tests:convolution_test_cpu PASSED in 15.4s Stats over 50 runs: max = 15.4s, min = 11.9s, avg = 13.1s, dev = 0.8s //tensorflow/python/kernel_tests/linalg/sparse:csr_sparse_matrix_dense_mat_mul_grad_test_cpu PASSED in 15.3s Stats over 50 runs: max = 15.3s, min = 4.5s, avg = 8.8s, dev = 2.8s //tensorflow/python/kernel_tests/linalg/sparse:csr_sparse_matrix_grad_test_cpu PASSED in 14.4s Stats over 50 runs: max = 14.4s, min = 3.0s, avg = 7.0s, dev = 4.0s //tensorflow/python/kernel_tests/linalg/sparse:csr_sparse_matrix_sparse_mat_mul_grad_test_cpu PASSED in 8.2s Stats over 50 runs: max = 8.2s, min = 3.3s, avg = 4.2s, dev = 1.0s //tensorflow/python/kernel_tests/math_ops:cwise_ops_binary_test_cpu PASSED in 27.3s Stats over 50 runs: max = 27.3s, min = 6.6s, avg = 13.7s, dev = 5.4s //tensorflow/python/kernel_tests/math_ops:cwise_ops_test_cpu PASSED in 13.4s Stats over 50 runs: max = 13.4s, min = 3.0s, avg = 5.1s, dev = 2.2s //tensorflow/python/kernel_tests/math_ops:cwise_ops_unary_test_cpu PASSED in 13.6s Stats over 50 runs: max = 13.6s, min = 2.7s, avg = 5.5s, dev = 2.8s Executed 3776 out of 3776 tests: 3776 tests pass. There were tests whose specified size is too big. Use the --test_verbose_timeout_warnings command line option to see which ones these are.