Stopping maltcp_ctx is unreliable #3

gbonnefille · 2016-07-07T13:39:29Z

Reading the MALC API, it seems that the operation to interrupt mal_ctx_start() is mal_ctx_stop(). However, this call behaves unpredictably: the unit tests provided by malc stop cleanly, while our implementation (two separate processes, one for the provider and one for the consumer) behaves differently depending on the number of CPUs.

Reading the code, we understand that mal_ctx_stop() destroys the zloop but does not stop it. According to czmq's zloop test (cf. https://github.com/zeromq/czmq/blob/master/src/zloop.c#L876), the right way to stop the zloop is to have an event handler return '-1'.

If our understanding is correct, we think that malc should be improved in the following way:

rename mal_ctx_stop() to mal_ctx_destroy()
add a new inproc socket (ctrl_socket?) to the maltcp_ctx
register a trivial event handler on this socket that returns '-1'
create a new mal_ctx_stop() that sends a dedicated message ($TERM?) on the ctrl_socket
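
The four steps above can be sketched without any library dependency. This is only an illustration of the stop/destroy split, not MALC or czmq code: a POSIX pipe stands in for the inproc ctrl_socket, a poll() loop stands in for the zloop, and every demo_* name is hypothetical. The one convention borrowed from zloop is that a handler returning -1 ends the loop.

```c
#include <errno.h>
#include <poll.h>
#include <stdlib.h>
#include <unistd.h>

/* Hypothetical stand-in for a transport context; not the real maltcp_ctx. */
typedef struct {
    int ctrl_rd; /* read end of the control channel, polled by the loop */
    int ctrl_wr; /* write end, used by demo_ctx_stop() */
} demo_ctx_t;

/* Trivial control handler: consume the message, then return -1 to end the loop. */
static int ctrl_handler(int fd) {
    char buf[16];
    ssize_t n = read(fd, buf, sizeof buf);
    (void)n; /* the content does not matter, only its arrival */
    return -1;
}

/* Create the context and its control channel. Returns NULL on failure. */
demo_ctx_t *demo_ctx_new(void) {
    int fds[2];
    if (pipe(fds) != 0)
        return NULL;
    demo_ctx_t *ctx = malloc(sizeof *ctx);
    if (ctx == NULL) {
        close(fds[0]);
        close(fds[1]);
        return NULL;
    }
    ctx->ctrl_rd = fds[0];
    ctx->ctrl_wr = fds[1];
    return ctx;
}

/* Run the loop until a handler returns -1.
 * Returns 0 when stopped through the control channel, -1 on error. */
int demo_ctx_start(demo_ctx_t *ctx) {
    struct pollfd pfd = { .fd = ctx->ctrl_rd, .events = POLLIN };
    for (;;) {
        int rc = poll(&pfd, 1, -1);
        if (rc < 0) {
            if (errno == EINTR)
                continue; /* interrupted by a signal: poll again */
            return -1;
        }
        if (pfd.revents & POLLIN) {
            if (ctrl_handler(pfd.fd) == -1)
                return 0;
        } else if (pfd.revents & (POLLERR | POLLHUP | POLLNVAL)) {
            return -1;
        }
    }
}

/* Only signals the loop; it releases nothing. Returns 0 on success. */
int demo_ctx_stop(demo_ctx_t *ctx) {
    static const char term[] = "$TERM";
    ssize_t len = (ssize_t)(sizeof term - 1);
    return write(ctx->ctrl_wr, term, sizeof term - 1) == len ? 0 : -1;
}

/* Releases resources; call only after demo_ctx_start() has returned. */
void demo_ctx_destroy(demo_ctx_t **ctx_p) {
    if (ctx_p != NULL && *ctx_p != NULL) {
        close((*ctx_p)->ctrl_rd);
        close((*ctx_p)->ctrl_wr);
        free(*ctx_p);
        *ctx_p = NULL;
    }
}
```

In this sketch demo_ctx_stop() can be called before or during demo_ctx_start(): the message waits in the pipe until the loop reads it, so the loop always exits through its own handler and demo_ctx_destroy() never runs while the loop is still polling.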

Do you agree with this understanding?


freyssin · 2016-07-08T06:13:35Z

I agree with your analysis of how to stop a zloop correctly.

However, since not all transports are based on a zloop, the code that sends the dedicated message must be located in the transport-specific _ctx_stop method (maltcp and malzmq) rather than in mal_ctx_stop.
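
One way to picture this layering is a generic stop function that only forwards to a transport-supplied routine. The sketch below is an assumption about the shape of such a dispatch, not MALC's actual types or signatures: all demo_* names are hypothetical, and setting a flag stands in for sending the dedicated message on a control socket.

```c
#include <stddef.h>

/* Hypothetical transport-specific stop routine. */
typedef int (*demo_stop_fn)(void *transport_ctx);

/* Hypothetical generic context: it holds opaque transport state
 * and the routine that knows how to stop that transport. */
typedef struct {
    void *transport_ctx;
    demo_stop_fn transport_stop;
} demo_mal_ctx_t;

/* Generic entry point used by applications.
 * It knows nothing about zloop; it only forwards the request. */
int demo_mal_ctx_stop(demo_mal_ctx_t *ctx) {
    if (ctx == NULL || ctx->transport_stop == NULL)
        return -1;
    return ctx->transport_stop(ctx->transport_ctx);
}

/* Loop-based transport: stopping means signalling the event loop. */
typedef struct {
    int term_sent;
} demo_loop_transport_t;

int demo_loop_transport_stop(void *p) {
    demo_loop_transport_t *t = p;
    t->term_sent = 1; /* stands in for sending the dedicated message */
    return 0;
}

/* Transport without an event loop: stopping is a plain state change. */
typedef struct {
    int running;
} demo_plain_transport_t;

int demo_plain_transport_stop(void *p) {
    demo_plain_transport_t *t = p;
    t->running = 0;
    return 0;
}
```

With this shape, applications keep calling the single generic function, while the control-message logic lives only in the transports that actually run an event loop.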

gbonnefille · 2016-07-08T08:17:50Z

> However, since not all transports are based on a zloop, the code that sends the dedicated message must be located in the transport-specific _ctx_stop method (maltcp and malzmq) rather than in mal_ctx_stop.

Of course. I just meant that, as users, we call the mal_* functions (the higher-level API), even if the change only needs to be made in some implementations.

freyssin · 2016-10-12T09:11:36Z

Context closing has been improved, and the zloop should now be stopped correctly.

gbonnefille · 2017-01-03T10:28:06Z

I still encounter random segfaults at test termination.

For example, malzmq_pubsub_app ends with:

Stopped.
destroyed.
Tests passed OK
E: 17-01-03 11:20:10 dangling 'PAIR' socket created at src/zsys.c:398
E: 17-01-03 11:20:10 dangling 'PAIR' socket created at src/zsys.c:399
E: 17-01-03 11:20:10 dangling 'PAIR' socket created at src/zsys.c:398
E: 17-01-03 11:20:10 dangling 'PAIR' socket created at src/zsys.c:399
E: 17-01-03 11:20:10 dangling 'PAIR' socket created at src/zsys.c:398
E: 17-01-03 11:20:10 dangling 'PAIR' socket created at src/zsys.c:399
E: 17-01-03 11:20:10 dangling sockets: cannot terminate ZMQ safely
Makefile:1607: recipe for target 'check-local' failed
make[2]: *** [check-local] Erreur de segmentation
make[2]: Leaving directory '/home/egsccdev/MOL/malc/test/malzmq_pubsub_app'
Makefile:1451: recipe for target 'check-am' failed

georgeslabreche · 2021-08-24T18:56:34Z

I believe this issue should be re-opened. Stopping maltcp_ctx on my end does something to the endpoints that prevents my application from receiving response messages to my request operation after I start maltcp_ctx again. As a workaround, I completely destroy and recreate the consumer object for each request, which is a bit overkill. I would have preferred to reuse the same listening socket connection.

freyssin closed this as completed Oct 12, 2016

freyssin reopened this Aug 25, 2021
