I am wanting to build a dashboard that shows the number of users connected to each topic per node so I can visualise the distribution of users across servers.
At the moment I am using the [:phoenix, :channel_joined] telemetry event to capture metrics on how many channel joins have occurred but that won’t tell you how many people are currently attached to a channel as I don’t keep track of when people leave the channel. As for the channel that the joined event is in regards to, that is in the event metadata under the Phoenix.Socket struct:
Telemetry is great as mentioned above. However, I’m not sure that there is a telemetry event for when a channel is left, which is a problem for your app. Also, you will be getting channel level metrics rather than socket level (you could work around this though).
I have done this manually in the past with a linked GenServer (sort of how Tracker does). It keeps track of all Socket joins and Channel joins for the current node (no distribution). You can find the code here. I used a hack in order to track it for a Socket, although I would now rewrite that to manually spec defoverridable instead of redefining the code.
If you want to track Channel joins only, then it’s easier because you can hook into the join function like you would normally do with Phoenix.Tracker.
I was hoping there were telemetry events for this kind of stuff but like @sb8244 mentioned, there aren’t any leave events.
My implementation uses a linked gen server for tracking - I think I looked at the tracker code initially and build a solution similar but specific to my use case.
I don’t know what @akoutmos is suggesting for tracking leaves but I’m guessing I could hook the channel_joined and monitor / link the socket.channel_pid. I’d still need 99% of my code but I could remove the explicit call to inc/1 in favour of pulling it from the event metadata.
At least it doesn’t feel like there was an obvious solution I overlooked.
Someone can correct me here, but I imagine the issue with tracking leaves is that the disconnection event can’t be guaranteed depending on how the process is killed. That may be wrong depending on supervision setup, but my immediate thought goes to some edge cases that prevent complete guarantees.
I assume you’re correct if you are wanting to track disconnects from within the process that is “dying”. That’s why I am monitoring the channel process separately.