GenStage patterns

I’m fairly new to Elixir, and so far I’m finding it very exciting. So many of my concurrency problems seem to be solved in a simple, elegant way. Now that I’ve learnt the basics, I’m trying to write some applications and am having trouble working out the right patterns.

First, I was using Wabbit as a GenStage producer (consuming from a RabbitMQ queue), doing some processing on the messages and then inserting them into a DB. It worked great locally, at around 8,000 messages/s. When I moved it to a production environment, after adding a lot of logging I worked out that it was choking at the sink end (backpressure was working!) because of write latency. Locally I would get, say, 500 events arriving at the consumer at a time, but in production I would get 1 at a time, which meant higher write latency per message.

My solution was to create a stage which buffered up to x events, or for x ms (whichever came first). To be more concrete, here is the code.

# This stage buffers events and handles the acks back to RabbitMQ
defmodule RabbitAckker do
  use GenStage
  require Logger

  def start_link() do
    GenStage.start_link(__MODULE__, :ok)
  end

  def init(:ok) do
    Process.send_after(self(), {:flush}, 500) # first message is fairly soon after startup
    {:producer_consumer, []}
  end

  # handle events coming from rabbit
  def handle_events(events, _from, state) do
    event_buffer = state
    new_event_buffer = event_buffer ++ events
    messages_before_purge = 400

    if (Enum.count(new_event_buffer) >= messages_before_purge) do
      for {_event, meta} <- new_event_buffer do
        :ok = ack(meta.channel, meta.delivery_tag)
      end

      # pass all events down to the next stage
      {:noreply, new_event_buffer, []}
    else
      # don't pass any events down, just store them in the state
      {:noreply, [], new_event_buffer}
    end
  end

  # when this flush message comes, ack the events and pass them down.
  # TODO: only flush when we haven't acked anything in flush_time ms
  def handle_info({:flush}, state) do
    flush_time = 2_000 # ms
    event_buffer = state

    event_count = Enum.count(state)
    if event_count > 0 do
      Logger.debug("ackker: timer flush called for #{event_count} events")
    end
    # ack all the events we will flush
    for {_event, meta} <- event_buffer do
      :ok = ack(meta.channel, meta.delivery_tag)
    end
    
    # call this again in another flush_time ms
    Process.send_after(self(), {:flush}, flush_time)
    # move event_buffer to events, and reset the state
    {:noreply, event_buffer, []}
  end

  defp ack(channel, delivery_tag) do
    try do
      Wabbit.Basic.ack(channel, delivery_tag)
    catch
      _, _ ->
        :ok
    end
  end
end

This works great! I realise that I have the potential for data loss if the process is killed before flush_time, but you have that problem with any sort of buffering.

My first question is, is there a better way to do this?

Now on to my next problem!

In addition to writing to a datastore, I would like to push the messages out to multiple websockets. Sometimes the consumers at the end of a websocket are slow, and each of them has different performance characteristics. How can I drop messages to them if they aren’t keeping up? I am trying to avoid an unbounded buffer. I also want to write to the datastore at the same time, but I want the datastore to get every message and apply backpressure (via demand), as in the example above.

Any suggestions on a good pattern for this?

Thanks for taking the time to read through my whole question!

I had a similar situation a long time ago, and I solved it with adaptive buffering using two processes. One process enqueues incoming items, while the other performs the writes. The writer reports to the queue that it’s available. Then, as soon as the first item arrives, the queue process sends it to the writer and marks the writer as busy.

Now, while the writer is busy, the queue stores new items in its internal structure. Then, when the writer is done and reports back to the queue, the queue can send all the items to the writer at once, and the writer can store them at once. This way, the writer profits from the fact that writing N items at once is much faster than N writes of a single item.
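
As a rough sketch (the module names and MyStore.write_batch/1 are made up for illustration; supervision and error handling are omitted), the queue/writer pair could look something like this:

# Hypothetical sketch of the queue/writer pair described above.
defmodule BufferQueue do
  use GenServer

  def start_link(writer), do: GenServer.start_link(__MODULE__, writer, name: __MODULE__)

  def push(item), do: GenServer.cast(__MODULE__, {:push, item})

  def init(writer), do: {:ok, %{writer: writer, buffer: [], writer_busy?: false}}

  # New item and the writer is idle: hand it over immediately, mark the writer busy.
  def handle_cast({:push, item}, %{writer_busy?: false} = state) do
    send(state.writer, {:write, [item], self()})
    {:noreply, %{state | writer_busy?: true}}
  end

  # New item while the writer is busy: just store it.
  def handle_cast({:push, item}, state) do
    {:noreply, %{state | buffer: [item | state.buffer]}}
  end

  # Writer reports back with nothing buffered: mark it idle.
  def handle_info(:writer_done, %{buffer: []} = state) do
    {:noreply, %{state | writer_busy?: false}}
  end

  # Writer reports back: flush everything accumulated meanwhile as one batch.
  def handle_info(:writer_done, state) do
    send(state.writer, {:write, Enum.reverse(state.buffer), self()})
    {:noreply, %{state | buffer: [], writer_busy?: true}}
  end
end

defmodule Writer do
  def start_link, do: {:ok, spawn_link(&loop/0)}

  defp loop do
    receive do
      {:write, items, queue} ->
        MyStore.write_batch(items) # made-up function: one DB round trip for N items
        send(queue, :writer_done)
        loop()
    end
  end
end

Whether the producers push with a cast or a call also determines whether you can make them wait; a call would give you backpressure on top of the batching.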

I’m not familiar with GenStage, but I feel that this should be doable using two stages (consumer-producer for the queue, and consumer for the writer).

The nice benefit of this approach is that buffering is adaptable. If writing is faster than the rate of incoming messages, then there’s no buffering. Otherwise, the buffer will expand to accommodate the incoming rate.

Another nice benefit is that in the queue process you can do all sorts of trickery to handle overload, such as remove old items when the queue becomes large, reject new items if the queue is too large, or eliminate duplicate requests. I did all of those things in that project, and the result was pretty stable and resilient against all kinds of bursts, failures, latency increases and other problems.

Again, I’m not familiar with GenStage, so not sure if there’s an out-of-the-box solution for that, or you need to work a bit for it, but I believe that it should be possible with GenStage.

2 Likes

Currently you seem to be on a fixed-period “flush cycle”. Process.send_after/4 returns a timer reference which can be used with Process.cancel_timer/1 to cancel the previous timer if you happen to release events in handle_events - in fact, you should be able to factor out a common function between handle_events and handle_info for event release and timer renewal (see the sketch below).
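
For example (an untested sketch reusing the threshold and interval from your post; the state becomes a {buffer, timer_ref} tuple, with init/1 starting the first timer):

  # Sketch: init would return {:producer_consumer, {[], Process.send_after(self(), {:flush}, 500)}}
  def handle_events(events, _from, {buffer, timer}) do
    buffer = buffer ++ events

    if Enum.count(buffer) >= 400 do
      release(buffer, timer)
    else
      {:noreply, [], {buffer, timer}}
    end
  end

  def handle_info({:flush}, {buffer, timer}) do
    release(buffer, timer)
  end

  # Common path for both triggers: ack, renew the timer, forward the events.
  defp release(buffer, timer) do
    Process.cancel_timer(timer)

    for {_event, meta} <- buffer do
      :ok = ack(meta.channel, meta.delivery_tag)
    end

    new_timer = Process.send_after(self(), {:flush}, 2_000)
    {:noreply, buffer, {[], new_timer}}
  end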

I don’t know about patterns but based on your description I’d start with this:

  • Write a simple GenServer that is dedicated to a single WebSocket. The idea being that the GenServer’s mailbox becomes your “unbounded buffer” (i.e. let the VM handle it). The GenServer itself processes one message at a time and blocks until the WebSocket is finished sending the current message.
  • Have those “Observer” WebSocket GenServers register with an “Observable” GenServer (a reference to the Observer pattern, but not in an OO way). The “Observable” GenServer simply sends all the events it receives to all the registered “Observers” (immediately and in turn).
  • Finally, write a “Tap” GenStage (see the sketch after this list). Essentially, the “Tap” simply forwards all the events it receives - but not until it has sent a copy to another process (in this case the “Observable” GenServer). Now, it might be tempting to combine “Tap” and “Observable”, but the priority is to get the events to the datastore, so by sticking to a simple “Tap” GenStage, the delay in getting the events to the datastore is minimized to the time it takes to put a copy of the events into the “Observable’s” mailbox. Distribution of the events happens on the “Observable’s” time, and pushing messages into the WebSockets happens on the various “Observers’” time. (The separation also creates the opportunity for the “Observable” process to run on a different CPU core than the “Tap” process.)
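
A rough, untested sketch of the “Tap”/“Observable” part (the module names are just for illustration; the “Observer” GenServers would register themselves and handle {:events, events} in handle_info at their own pace):

defmodule Tap do
  use GenStage

  def start_link, do: GenStage.start_link(__MODULE__, :ok)

  def init(:ok), do: {:producer_consumer, :ok}

  def handle_events(events, _from, state) do
    # Put a copy into the Observable's mailbox first (a cast, so it never blocks) ...
    GenServer.cast(Observable, {:events, events})
    # ... then immediately forward the same events towards the datastore.
    {:noreply, events, state}
  end
end

defmodule Observable do
  use GenServer

  def start_link, do: GenServer.start_link(__MODULE__, [], name: __MODULE__)

  def register(observer), do: GenServer.cast(__MODULE__, {:register, observer})

  def init(observers), do: {:ok, observers}

  def handle_cast({:register, pid}, observers), do: {:noreply, [pid | observers]}

  # Distribution happens on the Observable's time, not the Tap's.
  def handle_cast({:events, events}, observers) do
    for pid <- observers, do: send(pid, {:events, events})
    {:noreply, observers}
  end
end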
1 Like

For reference, buffering messages in a process like this can generally be done in two ways when the receiving process requests a new message:

  1. The buffer waits for any message. When it gets one, it checks whether it is the request message; if so, it dumps the pending messages (or just the first) back to that process and waits again, otherwise it adds the message to its list and waits again.
  2. Back-buffered, with 2 styles here too:
  • The buffer listens for the specific request message; when it gets it, it listens for any message, returns that back to the requester, and then waits for the request message again.
  • The buffer listens for the specific request message; when it gets it, it flushes out its entire pending mailbox, dumps all of those messages to the requester, and then goes back to waiting for the specific request message again.

Style 1 entirely removes the back-buffering that the BEAM does between processes; you can fill your memory pretty easily here and die.
Style 2 properly propagates the back-buffering ‘backwards’, allowing the BEAM to naturally throttle process message sends to prevent overflows while still preventing mailbox message loss on process death.
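
For illustration, style 2 (the flush-everything variant) as a bare receive loop, assuming a single requesting process:

defmodule BackBuffer do
  def start_link, do: {:ok, spawn_link(&loop/0)}

  # Selectively wait for the request; everything else stays in the mailbox,
  # so the BEAM's sender penalty applies to whoever is flooding us.
  defp loop do
    receive do
      {:request, from} ->
        send(from, {:items, drain([])})
        loop()
    end
  end

  # On request, pull whatever is currently queued, without blocking.
  defp drain(acc) do
    receive do
      item -> drain([item | acc])
    after
      0 -> Enum.reverse(acc)
    end
  end
end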

I had style 2 as a module in my old Erlang projects; it is so trivial to write (a single-function loop or a simple GenServer) that I’ve never considered distributing it, but with how many new people are coming to Elixir, maybe I should release a mini-library for it?

I’m confused by this, or maybe I don’t understand what you’re aiming at.
If the buffer process is separate from the worker process (which is what I suggested), and in the buffer process you take every message immediately, then in that process you actually have a way of managing memory usage, because you can immediately decide what to do with each message - for example discard it if the buffer is full. This is not something you can do with a selective receive, because non-matched messages stay in the mailbox until you receive them, which might happen much later, depending on the write latency.

This is an interesting idea. However, on first thought I don’t think I quite like it, because it relies on BEAM internals. There’s also not much space for variation here: how can you refuse new messages or delete older messages if the buffer is full? If the buffer process waits for the specific message from the writer, and the writer is busy, then the buffer is stuck. In fact, with this approach it’s possible to fill up the entire memory.

In contrast, if the buffer immediately receives all incoming messages, it can do all sorts of things, such as making the producers wait, or discarding messages when full. But in particular, what I like about that approach (your number 1) is that the load control is explicit and written in code, instead of relying on BEAM internals.

Leaving work, so too little time to show examples (maybe someone else can), but the BEAM throttles processes that try to send a message to a process that is ‘too full’, reducing their reductions temporarily and so forth until the overload on the overall VM is reduced. This is really important for keeping the system working in the face of overloads, where a process that just eats and stores all messages can eat all the memory until the VM dies. :slight_smile:

I’m aware of that property. My point was that even with that you can still blow up memory, because messages pile up in the mailbox. Imagine a process which waits for the message :foo, which never arrives. All the other messages remain in the mailbox indefinitely. Therefore, even with the scheduler backpressure (reductions penalty), constantly producing other messages (not :foo) will lead to eventual memory overflow.

In contrast, if the process takes each message immediately, it can decide whether to store it, or discard it, or discard some other previously stored piece of data. By taking each message, the process actively prevents its mailbox from growing indefinitely.
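
To make that concrete, the buffer’s callback might look something like this (max_size and the drop policy are purely illustrative):

# Hypothetical sketch: the buffer takes every pushed item off its mailbox
# immediately and applies an explicit overload policy in code.
def handle_cast({:push, item}, %{buffer: buffer, max_size: max} = state) do
  if :queue.len(buffer) >= max do
    # Overload policy lives here: drop the new item
    # (alternatively, drop the oldest and enqueue the new one).
    {:noreply, state}
  else
    {:noreply, %{state | buffer: :queue.in(item, buffer)}}
  end
end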

That could really only happen if the messages all come from different processes, which would not be the case in a GenStage pipeline. The processes that repeatedly send messages to this process would be backpressured to slow down and/or stop until the messages get handled. :slight_smile:

If you don’t mind dropping some messages then this is certainly fine, but if not, then backpressure really should be built in.

I’m aware of the reductions penalty when sending a message to a process with a large mailbox, but I have never heard of the sender process being completely stopped - I’m not even sure how that would work. AFAIK the sender is penalized extra if the receiver has a large mailbox, but that can only mean it’s scheduled out sooner (maybe immediately). However, if the receiver is waiting for a different message, then it’s not going to progress, and the sender might again be scheduled in and produce more messages.

In the concrete case we’re discussing, imagine the database being ridiculously slow. The writer takes a long time to store, and many messages might pile up in the buffer’s mailbox before the writer is ready again. If the buffer selectively waits for the writer to report back, it can’t do a single thing about its mailbox, even if there’s only one producer.

Here’s a quick demo of this situation. In the code below, the producer never sends the message selectively received by the consumer.

# consumer: selectively waits for :foo, which never arrives
consumer = spawn(fn ->
  fn -> receive do :foo -> :ok end end
  |> Stream.repeatedly()
  |> Stream.run()
end)

# producer: floods the consumer's mailbox with :bar as fast as it can
_producer = spawn(fn ->
  fn -> send(consumer, :bar) end
  |> Stream.repeatedly()
  |> Stream.run()
end)

A brief one-minute experiment shows continuous memory growth, and it’s only a matter of time before we burn through all available memory. Scheduler backpressure and the reduction penalty won’t save us here. This is simply a consequence of the fact that we’re not taking every message from the mailbox as soon as it arrives, thus opening up the possibility of mailbox overflow.

It’s not just about dropping messages. You can also enforce backpressure by using calls in a GenServer, or by making the buffer a consumer stage in the pipeline. The nice thing here is that you can make some more elaborate decisions about how long you want to make the producer wait. If the buffer is small, you can send the ack message as soon as you receive an item from the producer. Otherwise, you can make the producer wait longer, until you catch your breath (i.e. until you consume some of the items).
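
For example (an illustrative fragment, assuming the buffer keeps a list of waiting producers and a max_size threshold in its state):

# Producers push with GenServer.call/2, so the buffer controls when they may continue.
def handle_call({:push, item}, from, %{buffer: buffer, max_size: max} = state) do
  state = %{state | buffer: [item | buffer]}

  if length(state.buffer) < max do
    # Plenty of room: reply immediately and the producer carries on.
    {:reply, :ok, state}
  else
    # Buffer is full: hold the reply, which blocks the producer in its call
    # until we drain some items and answer with GenServer.reply(from, :ok).
    {:noreply, %{state | waiting: [from | state.waiting]}}
  end
end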

Hey sasajuric,

I believe my code sample is doing what you are suggesting. Once the stage has accumulated 400 events or 2 seconds have elapsed, it forwards them. It’s not quite adaptive, but without thresholds I don’t see how you’d know the optimal number of messages to write at a time. There is no overload because demand is pushed back to the producer, which only creates as many items as the consumers ask for.

cheers,
Anko

In your solution, you wait for 2 seconds or 400 events, whichever comes first. That means there might be a fixed latency penalty if messages don’t arrive frequently. There is also the question of those numbers: how did you ensure they are neither too big nor too small?

In my proposal, the difference is that you write as soon as the writer is available. Therefore, if the events arrive less frequently, there will be no penalty. If the events arrive too frequently, the buffer will expand to compensate for that.

Let’s see an example. Consider a write which takes 100ms regardless of the data size. Let’s say that events arrive every 200ms. The writer is always available when the event arrives, so you write immediately (no latency penalty).

Now let’s say that events arrive every 10ms. The first event arrives and you write it immediately. For the next 100ms the writer is busy, so you’ll buffer the next 10 events. Then, when the writer reports back that it’s available, you write the buffered 10 events at once. The buffer size adapted to the incoming load.

Thanks Sasajuric,

I believe this is what GenStage is meant to do. When the writer is available, it sends demand (a number) back up through the processing pipeline, so the producer knows it can send up to that many events to the writer.

The problem is, it’s not working - which is why I introduced this extra buffering mechanism and why I’m asking the question.

I guess I wasn’t very clear about my first problem in my own mind. It’s really that I need help with debugging GenStage!

Thanks