ARTEMIS-4305 Zero persistence does not work in kubernetes #4899

iiliev2 · 2024-04-22T16:58:23Z

In a cluster deployed in kubernetes, when a node is destroyed it terminates the process and shuts down the network before the process has a chance to close connections. Then a new node might be brought up, reusing the old node’s ip. If this happens before the connection ttl, from artemis’ point of view, it looks like as if the connection came back. Yet it is actually not the same, the peer has a new node id, etc. This messes things up with the cluster, the old message flow record is invalid.

This also solves another similar issue - if a node goes down and a new one comes in with a new nodeUUID and the same IP before the cluster connections in the others timeout, it would cause them to get stuck and list both the old and the new nodes in their topologies.

The changes are grouped in tightly related incremental commits to make it easier to understand what is changed:

Ping packets include nodeUUID
Acceptors and connectors carry TransportConfiguration
RemotingConnectionImpl#doBufferReceived tracks for ping nodeUUID mismatch with the target to flag it as unhealthy; ClientSessionFactoryImpl destroys unhealthy connections(in addition to not receiving any data on time)

jbertram · 2024-04-26T04:26:38Z

I think you could simplify this quite a bit. Here's what I suggest...

Don't modify any packet aside from Ping and only modify it with a new byte[].
The broker should send its node ID in every Ping.
The first time the client receives a Ping it should save the node ID.
If a client ever receives a Ping with a node ID that is different from the one it has saved then it should disconnect.

You also needs tests to verify the fix and mitigate regressions in the future.

iiliev2 · 2024-04-26T08:24:06Z

Initially I attempted what you suggest about lazy initializing the node id like that, precicely because I wanted to keep the code changes to a minimum. However, that ended up being much more complicated(rather than simplified), because of the way ClientSessionFactoryImpl creates a new connection object on re-connects. It is very hard to reason about both when reading the code and when needing to debug it at runtime. So instead of this, I had to fill the missing gaps to use the data that is already there anyway, just wasn't being propagated deep enough.

IMO from a functional standpoint, adding the TransportConfiguration to the connector(and connection) is the right thing to do here anyway. I assume due to historical reasons, those classes were working with a subset of the data, and no one had a good reason to fix this until now. For example NettyConnection#getConnectorConfig was creating a bogus transport configuration, even though when it is created there is a configuration which was not being passed to it.

Ping is already the only Packet that is being modified. Why do you want to use a raw byte[] rather than UUID? IMO that will be more confusing - it suggests that there could be other kind of data that can be contained.

…rams` map to acceptors and connectors

* include original node id in `TransportConfiguration` decoding * match ping packets' nodeUUID against the connection's transport configuration target nodeUUID; if any side is missing this data, the match succeeds * destroy a remoting connection if it ever becomes unhealthy(ping nodeUUID is different that the target)

iiliev2 · 2024-06-07T14:37:01Z

...rc/main/java/org/apache/activemq/artemis/core/protocol/core/impl/RemotingConnectionImpl.java

@@ -408,10 +416,15 @@ public void endOfBatch(Object connectionID) {
   }

   private void doBufferReceived(final Packet packet) {
+      if (isHealthy && !isCorrectPing(packet)) {
+         isHealthy = false;


Commenting this line out will effectively disable the fix. This will cause the new test ZeroPersistenceSymmetricalClusterTest to fail.

clebertsuconic · 2024-06-07T15:01:15Z

If this is an issue in Core, it will be an issue in AMQP as well. we should make sure AMQP also takes care of this?

WDYT @jbertram @gemmellr @gtully @tabish121 ?

jbertram · 2024-06-07T15:07:34Z

I believe the use-case here only involves cluster nodes and the core connections between them. Therefore, I don't think AMQP is in view.

iiliev2 force-pushed the topic/iiliev2/ARTEMIS-4305 branch from 94ac5ea to 7f8833c Compare April 23, 2024 08:15

iiliev2 added 4 commits June 7, 2024 16:50

ARTEMIS-4305 send nodeUUID with Ping packets, if available

abb3519

ARTEMIS-4305 passing TransportConfiguration rather than its raw `pa…

7918472

…rams` map to acceptors and connectors

ARTEMIS-4305 Added an integration test

5eabef6

iiliev2 force-pushed the topic/iiliev2/ARTEMIS-4305 branch from 7f8833c to 5eabef6 Compare June 7, 2024 14:33

iiliev2 commented Jun 7, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ARTEMIS-4305 Zero persistence does not work in kubernetes #4899

ARTEMIS-4305 Zero persistence does not work in kubernetes #4899

iiliev2 commented Apr 22, 2024

jbertram commented Apr 26, 2024 •

edited

iiliev2 commented Apr 26, 2024 •

edited

iiliev2 Jun 7, 2024

clebertsuconic commented Jun 7, 2024

jbertram commented Jun 7, 2024

ARTEMIS-4305 Zero persistence does not work in kubernetes #4899

Are you sure you want to change the base?

ARTEMIS-4305 Zero persistence does not work in kubernetes #4899

Conversation

iiliev2 commented Apr 22, 2024

jbertram commented Apr 26, 2024 • edited

iiliev2 commented Apr 26, 2024 • edited

iiliev2 Jun 7, 2024

Choose a reason for hiding this comment

clebertsuconic commented Jun 7, 2024

jbertram commented Jun 7, 2024

jbertram commented Apr 26, 2024 •

edited

iiliev2 commented Apr 26, 2024 •

edited