
Emitter AMQP protocol stuck after MQ broker restart #40592

Open
bkalas opened this issue May 13, 2024 · 9 comments

bkalas commented May 13, 2024

Describe the bug

We are using Emitters to send messages to a remote MQ broker, normally without any @OnOverflow annotation:
@Inject @Channel(RESPONSE_QUEUE) Emitter<String> emitter;
...
public void emit(Message<String> toEmit) {
    this.emitter.send(toEmit);
    ...
A few times we observed that, after the remote broker was restarted, emitting messages stopped working.
For the first x (~100) messages we did not get any error (I guess x = the default buffer size),
then we started to get this error:
java.lang.IllegalStateException: SRMSG00034: Insufficient downstream requests to emit item
This could only be resolved by restarting the Quarkus application.
The health report for the channel showed an OK status the whole time.

We also use a lot of consumers (@Incoming) of AMQP messages; these reconnected successfully and continued to work.

When I try to use, for example,
@OnOverflow(FAIL, bufferSize = 100)
the channel at least disconnected after the error and started reporting KO in the health check, but here bufferSize seems to be ignored, so it is not usable.
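
For reference, a minimal sketch of the emitter declared with the FAIL strategy (class and channel names are placeholders; the javax.* packages shown match Quarkus 2.x, Quarkus 3.x uses jakarta.*):

```java
import javax.enterprise.context.ApplicationScoped;
import javax.inject.Inject;

import org.eclipse.microprofile.reactive.messaging.Channel;
import org.eclipse.microprofile.reactive.messaging.Emitter;
import org.eclipse.microprofile.reactive.messaging.Message;
import org.eclipse.microprofile.reactive.messaging.OnOverflow;

@ApplicationScoped
public class ResponseEmitter {

    static final String RESPONSE_QUEUE = "response-queue"; // placeholder channel name

    // With the FAIL strategy, send() should start throwing once the buffer of
    // unacknowledged messages is full, instead of buffering silently.
    @Inject
    @Channel(RESPONSE_QUEUE)
    @OnOverflow(value = OnOverflow.Strategy.FAIL, bufferSize = 100)
    Emitter<String> emitter;

    public void emit(Message<String> toEmit) {
        this.emitter.send(toEmit);
    }
}
```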

Expected behavior

The emitting channel must be resilient against remote broker restarts and must continue to emit messages.

Actual behavior

Sometimes restarts of the remote broker were handled correctly, but not always.

How to Reproduce?

  1. An AMQ broker must be installed (in our case Red Hat AMQ 7.11).
  2. A Quarkus app with an Emitter sending to some queue on the remote broker.
  3. Emit messages at some interval and restart the broker (see the sketch below).
  4. For me, ~6 restarts of the broker were always enough to reproduce the issue: after one of these restarts, no new messages were emitted.
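
A minimal sketch of the kind of periodic producer that can drive step 3 (assumes the quarkus-scheduler extension; class name, channel name, and interval are illustrative):

```java
import javax.enterprise.context.ApplicationScoped;
import javax.inject.Inject;

import org.eclipse.microprofile.reactive.messaging.Channel;
import org.eclipse.microprofile.reactive.messaging.Emitter;

import io.quarkus.scheduler.Scheduled;

@ApplicationScoped
public class PeriodicProducer {

    @Inject
    @Channel("response-queue") // placeholder channel, mapped to a queue on the remote broker
    Emitter<String> emitter;

    // Emit one message per second while the broker is restarted a few times;
    // the observed failure is that send() eventually throws SRMSG00034 and never recovers.
    @Scheduled(every = "1s")
    void emitOne() {
        emitter.send("ping " + System.currentTimeMillis());
    }
}
```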

Output of uname -a or ver

No response

Output of java -version

17

Quarkus version or git rev

2.16.12.Final

Build tool (ie. output of mvnw --version or gradlew --version)

No response

Additional information

No response


quarkus-bot bot commented May 13, 2024

/cc @cescoffier (reactive-messaging), @ozangunalp (reactive-messaging)

@ozangunalp
Contributor

Looks like an issue with the send-retry mechanism on the AMQP connector. It doesn't reconnect the client and keeps retrying with the previous (unconnected) sender.

I have a fix in mind but I need to be able to reproduce this scenario (in the test environment) to test it.

@MikkoKauhanen

Hi,

If it helps at all, I created a quick and dirty test to try to reproduce the issue, which you can find here: link to the repo. This test uses Quarkus 3.10.1.

We have the same problem in our services, which use Quarkus version 3.5.3. In our emitters we use @OnOverflow(OnOverflow.Strategy.UNBOUNDED_BUFFER) as the strategy.
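
For reference, a minimal sketch of that kind of emitter declaration (class and channel names are placeholders; Quarkus 3.x uses the jakarta.* packages):

```java
import jakarta.enterprise.context.ApplicationScoped;
import jakarta.inject.Inject;

import org.eclipse.microprofile.reactive.messaging.Channel;
import org.eclipse.microprofile.reactive.messaging.Emitter;
import org.eclipse.microprofile.reactive.messaging.OnOverflow;

@ApplicationScoped
public class UnboundedEmitter {

    // UNBOUNDED_BUFFER queues outgoing messages without limit while downstream
    // (the AMQP sender) cannot keep up, e.g. while the broker is down.
    @Inject
    @Channel("outgoing-events") // placeholder channel name
    @OnOverflow(OnOverflow.Strategy.UNBOUNDED_BUFFER)
    Emitter<String> emitter;

    public void send(String payload) {
        emitter.send(payload);
    }
}
```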

Here is also a link to my message about my findings related to this issue: link to comment

@ozangunalp
Contributor

@MikkoKauhanen Thanks for this, I'll check it later today or at the beginning of next week.


bkalas commented Jun 4, 2024

@ozangunalp Hi, anything new?

@MikkoKauhanen

Hi, just wanted to mention that we have now found two additional issues related to MQ restarts.

If there are messages created by the emitter to be produced to the broker during the MQ disconnect, then after the broker is back up there are a lot of connections established by the application to the broker. These connections seem to stay alive, but I don't know whether the application ever uses them for producing messages anymore.

We noticed this in our dev environment, where we use the micro instance type of Amazon MQ (maximumConnections = 300): we exceeded maximumConnections and our services' health checks started to fail.

We also saw that the number of connections to the broker increased a lot even when no messages were being produced. This turned out to be caused by the AMQPConnector readiness health check, which tries to make a connection through AmqpCreditBasedSender.isConnected().

@cescoffier
Member

> We also saw that the number of connections to the broker increased a lot even when no messages were being produced. This turned out to be caused by the AMQPConnector readiness health check, which tries to make a connection through AmqpCreditBasedSender.isConnected().

That's expected, no? The readiness check verifies that the broker is reachable, so we need to establish connections. However, it should be only one connection.

@MikkoKauhanen

> > We also saw that the number of connections to the broker increased a lot even when no messages were being produced. This turned out to be caused by the AMQPConnector readiness health check, which tries to make a connection through AmqpCreditBasedSender.isConnected().
>
> That's expected, no? The readiness check verifies that the broker is reachable, so we need to establish connections. However, it should be only one connection.

Yes, I can understand that a connection needs to be established. But it seems that one new connection is created for each /q/health endpoint call made while the message broker is disconnected/down.

[screenshot: connections]

@MikkoKauhanen

Hey,

I updated the demo project to include three tests that try to reproduce the issues I am aware of:

  1. The connection count increases due to messages emitted while the broker is not available.
  2. The connection count increases due to health checks performed while the broker is not available.
  3. The producer stops producing messages after connection issues with the broker.

Link to repo
