Phoenix Channels & Websocket transport not being terminated client-side

llamaindustries · January 6, 2021, 7:43pm

Hello friends!

I’m currently working on a Phoenix Pub/Sub message bus to replace a union of Google Pub/Sub + an overloaded C# service. The current implementation uses a raw websocket to broadcast messages rather than any sort of Channel abstraction on top of it.

The Phoenix-based message bus manages websocket connections made from clients where we send information about what needs to happen in our game. In our tests, the message bus performs MUCH more efficiently than our current implementation does, but we’re running into weird situations described below.

The existing implementation doesn’t appear to have random disconnections (but we’re also not using any channel abstractions on top, either).

The issue we’re seeing:
In some cases a client is “disconnected” from the topic it was subscribed to without having sent a phx_leave event. We monitor channel PID exits and, when one exits, broadcast a disconnect message to the user’s topic to terminate the connection. This appears to work when I test it locally, but the client application doesn’t appear to know the connection has been terminated.

The websocket timeout is set very high, so a timeout doesn’t appear to be the cause, at least on the server side.

The cluster is running on top of GKE using libcluster. The websocket connections go through an HTTP(S) load balancer to our backend services. Connections between the nodes appear to be stable.

My next attempt is going to be sending a shutdown message to the transport PID (rather than using a broadcast) to ensure the socket is actually closed.

Questions:
Has anyone experienced random channel disconnections without socket terminations?
What is the best practice for terminating a socket connection when we know the user has “unsubscribed” from the channel?
Is there a channel timeout separate from the websocket timeout that I’m missing?

Happy to answer any questions! Thank you all in advance for the help!

pejrich · January 7, 2021, 11:03am

I’m not sure I understand your question completely, but leaving a channel/topic will not, and should not, close a socket. Sockets and channels are two different things. The socket connection is the connection from the client to the server, and the channel connection is a subscription to a specific topic on that server. Once the socket connection is started, it can join and leave as many channels as it wants without being disconnected. If you want to close a socket connection, you have to do it manually, either from the client side with socket.disconnect() in the Phoenix JavaScript client (or close() on a raw WebSocket), or from the server side with MyAppWeb.Endpoint.broadcast("user_socket:#{user.id}", "disconnect", %{}). That assumes you’ve named the socket connection with something like this: def id(socket), do: "user_socket:#{socket.assigns.current_user.id}" in your UserSocket file.
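
To make the server-side option concrete, here is a minimal sketch of the two pieces involved: a socket that returns an id, and a helper that broadcasts "disconnect" to that id. The names MyAppWeb.UserSocket, MyApp.Sessions, force_disconnect/1 and the "user_id" connect param are assumptions for illustration only. A real app would most likely verify a signed token in connect/3 rather than trust a raw param.

```elixir
defmodule MyAppWeb.UserSocket do
  use Phoenix.Socket

  # Assumption: the client passes "user_id" as a connect param.
  # In practice you would verify a token here instead.
  @impl true
  def connect(%{"user_id" => user_id}, socket, _connect_info) do
    {:ok, assign(socket, :user_id, user_id)}
  end

  def connect(_params, _socket, _connect_info), do: :error

  # Returning a string id subscribes this socket to that topic, so a
  # "disconnect" broadcast on it closes the socket and all of its channels.
  # Returning nil would make the socket anonymous, and the broadcast
  # below would have nothing to reach.
  @impl true
  def id(socket), do: "user_socket:#{socket.assigns.user_id}"
end

defmodule MyApp.Sessions do
  # Hypothetical helper: closes every socket opened by the given user,
  # on any node in the cluster, via the endpoint's PubSub.
  def force_disconnect(user_id) do
    MyAppWeb.Endpoint.broadcast("user_socket:#{user_id}", "disconnect", %{})
  end
end
```

Calling MyApp.Sessions.force_disconnect(user.id) from wherever you detect that the user should be dropped (for example, the process monitoring channel exits) should then close the transport itself, not just the channel subscription.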