Advice needed on GenServers vs ETS tables for global state in Phoenix

Hi, I’m looking for some advice on making data available to all my LiveComponents without passing it as a prop.

I was originally looking at doing this with a simple GenServer, but from the docs it looks like a GenServer is a single process that handles its messages one at a time, so it could become a bottleneck under load if all requests have to be serialised through it?
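Something like this minimal sketch is what I had in mind (module and function names are just placeholders):

```elixir
defmodule MyApp.GlobalState do
  use GenServer

  # Client API
  def start_link(initial), do: GenServer.start_link(__MODULE__, initial, name: __MODULE__)
  def get(key), do: GenServer.call(__MODULE__, {:get, key})
  def put(key, value), do: GenServer.cast(__MODULE__, {:put, key, value})

  # Server callbacks — the process handles one message at a time,
  # which is exactly the serialisation I'm worried about.
  @impl true
  def init(initial), do: {:ok, initial}

  @impl true
  def handle_call({:get, key}, _from, state), do: {:reply, Map.get(state, key), state}

  @impl true
  def handle_cast({:put, key, value}, state), do: {:noreply, Map.put(state, key, value)}
end
```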

I’m also looking at ETS tables but I can’t find much talk of this anywhere in relation to Phoenix, so I’m thinking it might not be a great approach?

Am I overthinking this? Is a GenServer a reasonable approach?
Is it an anti-pattern to just pull data from an ETS table wherever I need it?

In a perfect world, I’d assign data in the LiveView mount and have it magically appear in the assigns of every component’s update callback, but I can’t seem to find a clean way to do this.

Thanks for any advice.


Is the data that you want to be globally accessible going to change? Or is it immutable constants?

It’ll change. A component could grab it in an event handler, change it, then add the result back to the socket assigns for re-rendering.

If you need parallel access, you reach for ETS. There’s no anti-pattern here; people use ETS in Phoenix projects all the time.
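A minimal sketch of what that looks like (table name is illustrative). A named, public table with read_concurrency lets any process read without funnelling through a single process:

```elixir
# Create the table in a long-lived process (e.g. your application's start/2) —
# an ETS table is destroyed when its owner process dies.
:ets.new(:my_global_state, [:named_table, :public, :set, read_concurrency: true])

# Any process (LiveView, LiveComponent, controller) can write and read directly:
:ets.insert(:my_global_state, {:feature_flags, %{dark_mode: true}})

case :ets.lookup(:my_global_state, :feature_flags) do
  [{:feature_flags, flags}] -> flags
  [] -> %{}
end
```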


I’m no Elixir pro, but I’m pretty sure that if you just use ETS directly you could have race conditions. If you grab a value, change it, and put it back, something could have changed the value in ETS between those two calls. So when designing your system, make sure to take that into account.
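For integer values there is an atomic escape hatch; for anything else a common pattern is to serialise writes through one process while reading from ETS directly. A rough sketch (table names are illustrative):

```elixir
# :ets.update_counter/3 is a single atomic read-modify-write inside ETS,
# so no other process can interleave between the read and the write.
:ets.new(:counters, [:named_table, :public, :set])
:ets.insert(:counters, {:page_views, 0})
:ets.update_counter(:counters, :page_views, 1)  # atomically increments and returns 1

# For non-integer data, funnel all writes through a single GenServer
# (serialising them) and let everyone read from the ETS table directly.
```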

If you were to use ETS, I think you’d need to poll it regularly for updates.

An alternative is to broadcast changes to all LiveViews. For example, you could start a new pg group. A LiveView could register itself in the group in mount and implement the handle_call callback, which could update the socket assigns. A LiveComponent can then loop through all pids in the pg group and call GenServer.call on each LiveView pid with the new data. This could be too chatty, though; it might not be great if the value is updated very frequently or if you have many LiveViews.
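A rough sketch of that approach (the group name is illustrative, and it assumes the default :pg scope has been started somewhere in your supervision tree with :pg.start_link/0):

```elixir
# In the LiveView's mount/3: join a pg group so other processes can find it.
def mount(_params, _session, socket) do
  :ok = :pg.join(:shared_state_views, self())
  {:ok, socket}
end

# LiveView processes can implement handle_call/3, so a plain GenServer.call
# can push new data straight into their assigns:
def handle_call({:new_value, value}, _from, socket) do
  {:reply, :ok, assign(socket, :shared_value, value)}
end

# From a LiveComponent event handler (or anywhere), fan the update out:
for pid <- :pg.get_members(:shared_state_views) do
  GenServer.call(pid, {:new_value, new_value})
end
```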


Is there any particular reason to use pg over phoenix pubsub? The latter is probably more familiar to most people here and scales better.

I’m curious how PubSub would scale better? And fair point, I use Erlang more than Elixir.

Are you looking to share it between LiveComponents of a single connection, or LiveComponents across multiple connections? If you are sharing across multiple connections, be careful: the clustered node that any given connection “lives on” may change (suppose a backhoe digs up some fiber on the way to a datacenter and triggers a TCP reconnect), and ETS tables are local to a single node. In the former case, don’t forget to tag the ETS table entries with some connection identifier, etc… It could get hairy, because you’ll want to automatically evict those items when the connection dies… Use props if you can.


With Phoenix PubSub the registration and actual broadcasts happen locally, and only the PubSub instances are members of pg. When all processes are members of pg, both registration (each process needs to register with multiple nodes) and broadcasts (each message needs to be sent to each member, likely resulting in duplicate messages sent between nodes) become costlier. I’ll prepare and post some basic benchmarks from a few t4g instances tomorrow (I’ve been in the process of benchmarking different approaches; it’s a good idea to add pg to them).
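For comparison, the equivalent with Phoenix PubSub looks roughly like this (the topic name is illustrative, and MyApp.PubSub is whatever PubSub instance your app starts):

```elixir
# In the LiveView's mount/3: subscribe this process to a topic.
# The subscription is registered locally on the current node only.
def mount(_params, _session, socket) do
  if connected?(socket), do: Phoenix.PubSub.subscribe(MyApp.PubSub, "shared_state")
  {:ok, socket}
end

# Broadcast from anywhere; each remote node receives the message once
# and fans it out to its local subscribers.
Phoenix.PubSub.broadcast(MyApp.PubSub, "shared_state", {:new_value, value})

# Subscribers receive it as a plain message:
def handle_info({:new_value, value}, socket) do
  {:noreply, assign(socket, :shared_value, value)}
end
```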


Interesting, thanks for the explanation.

I’ll prepare and post some basic benchmarks from a few t4g instances tomorrow

It has everything needed to start up the same infra (a VPC, two EC2 instances with public IPs, a few security groups, and an ECS service) in the terraform/ folder. Before that, an .envrc or similar file needs to be created to export the necessary env vars (AWS keys, SSH key name). After that, terraform apply should just work.
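For example, an .envrc along these lines — the AWS_* names are the standard AWS SDK variables, but the TF_VAR_ name is my guess at what the terraform config expects:

```shell
# .envrc — variable names are assumptions except the standard AWS_* ones
export AWS_ACCESS_KEY_ID="..."
export AWS_SECRET_ACCESS_KEY="..."
export AWS_REGION="eu-west-1"         # whichever region the infra targets
export TF_VAR_ssh_key_name="my-key"   # name of an existing EC2 key pair
```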
