You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
When I do this: python test-run.py qsync -j -1, I get some qsync timeout errors. When I run just python test-run.py qsync, all is fine. That is quite strange and deserves an investigation. The failing tests are:
Test failed! Result content mismatch:
--- replication/gh-5298-qsync-recovery-snap.result Fri Feb 11 00:24:42 2022
+++ var/rejects/replication/gh-5298-qsync-recovery-snap.reject Tue Mar 29 00:25:17 2022
@@ -22,6 +22,7 @@
| ...
for i = 1, 10 do box.space.sync:replace{i} end
| ---
+ | - error: Quorum collection for a synchronous transaction is timed out
| ...
-- Local rows could affect this by increasing the signature.
@@ -46,7 +47,7 @@
-- Could hang if the limbo would incorrectly handle the snapshot end.
box.space.sync:replace{11}
| ---
- | - [11]
+ | - error: Quorum collection for a synchronous transaction is timed out
| ...
old_synchro_quorum = box.cfg.replication_synchro_quorum
@@ -79,7 +80,6 @@
| ...
box.space.sync:get({11})
| ---
- | - [11]
| ...
box.space.sync:get({12})
replication/gh-5446-qsync-eval-quorum (almost no output)
Test failed! Result content mismatch:
--- replication/gh-5446-qsync-eval-quorum.result Fri Feb 11 00:24:42 2022
+++ var/rejects/replication/gh-5446-qsync-eval-quorum.reject Tue Mar 29 00:27:19 2022
@@ -94,258 +94,3 @@
-- Only one master node -> 1/2 + 1 = 1
s:insert{1} -- should pass
- | ---
- | - [1]
- | ...
and nothing below this line. All red.
Then I gave up, next tests hang.
The text was updated successfully, but these errors were encountered:
I could track down the problem I think - _cluster space is cleared. Here is a minimal reproducer: python test-run.py -j -1 gh-5140 gh-5163 --conf memtx. The test gh-5163 runs second and sees 2 rows in _cluster. That affects automatic synchro quorum calculation. It should see only one, this test doesn't start new instances.
Apparently, it is a leftover from gh-5140 after which test-run somewhy couldn't clear _cluster. I see some relevant code in tarantool/test-run/lib/tarantool-python/test/suites/lib/tarantool_python_ci.lua, but I don't know if it is called.
The solution is either get auto-clear fixed or find all the tests having the problem and stop using automatic synchro quorum in them - set it explicitly to 1, 2, etc.
test-run restarts the default server each time before run a test. All non-default server should be stopped at end of the test (even if the test itself don't do that). Data files (xlog, snap, vylog) should be cleaned up between test runs.
IOW, if one test really can leave something in a space and other test sees it, it is the bug in test-run.
When I do this:
python test-run.py qsync -j -1
, I get some qsync timeout errors. When I run justpython test-run.py qsync
, all is fine. That is quite strange and deserves an investigation. The failing tests are:replication/gh-5163-qsync-restart-crash
replication/gh-5288-qsync-recovery
replication/gh-5298-qsync-recovery-snap
replication/gh-5446-qsync-eval-quorum (almost no output)
and nothing below this line. All red.
Then I gave up, next tests hang.
The text was updated successfully, but these errors were encountered: