onkara

What would it take to create Kafka like solution in Elixir?

Just went through this great discussion thread on Big Data with Elixir and looking at this diagram I couldn’t help but ask can Broadway be a replacement for Kafka ? But on further research it turns out Broadway is a way of creating efficient data processing pipelines. And as such it sits more downstream on the consumer side of the equation. Though one can see a consumer further producing something which then is picked up by another data processing pipelines.

There have been attempts like ErlBus, EventBus and Phoenix’s own PubSub which do what Kafka does (except fault tolerant persistence among others). ErlBus supports distributed PubSub architecture.

The above being the background/context. My question is why do the various message bus stories in BEAM land don’t even come close to Kafka and what would it take to create Kafka competitor in Elixir? Is there some limitation in BEAM that does not easily lend itself to Kafkaesque architecture?

Would love to hear your thoughts/insights?

23 comments

#big-data #beam #kafka #elixir

101 5742 23

2020-08-28 05:54:03 UTC

Most Liked

sasajuric

Author of Elixir In Action

I personally think it makes a lot of sense to implement lighter versions of such 3rd party products as BEAM libraries, with the goal of simplifying the operations and reducing the amount of moving parts. There are already many examples where we can opt for BEAM libraries instead of external tools & products, such as nginx, cron, or redis. The alternatives from the BEAM ecosystem don’t necessarily match these tools in terms of features or performance, but in many cases they can work just fine, and help simplifying the system architecture.

I’d like to see the ecosystem growing further in this area. For example, I’d love to see a relational database as a BEAM library. Something I can add as a lib dependency, start an instance (or multiple instances) somewhere in a supervision tree, and have SQL based persistence without needing to manage a separate database instance, roles, handle language ↔ db type mapping etc. When I occasionally mention this during my talks, I get some skeptical feedback along the lines of “Why would you want to reimplement databases such as PostgreSQL, MySQL, etc.?”. The point is not to reimplement or compete with established databases, but to have a lightweight alternative which would be more fitting in simpler scenarios.

Ideally, if I want to build a small to medium web-facing CRUD, I should be able to start using nothing but Elixir, and get the basic skeleton working within 15 minutes or so, with everything implemented in a single language (say Erlang and Elixir), inside a single project, running as a single OS process per each node in the cluster. As long as we’re not able to do this, I think there’s a lot of potential for improvements in our ecosystem, and implementing alternatives to established products makes a lot of sense

Post #13

tristan

Rebar3 Core Team

Phoenix’s PubSub does not do what kafka does.

I had need for a lightweight topic based log system and we created GitHub - erleans/vonnegut · GitHub

It is compatible with kafka on disk and on the wire but uses chain replication instead of Kafka’s ISR based replication. It is not ready as a kafka replacement, There is plenty to still do around membership and concensus but the internals are there. Figured I’d mention it in case anyone wanted to work on such a thing and it could be a useful base layer

Post #9

otuv

We are 3 developers having less than 1000 users. I’m tired of “this is how Netflix does it” or “Google uses” etc.

As far as I experience it there are few software talks, courses and systems that focus on doing stuff at small scale yet still professional.

I like these E languages not because I have to solve 2 million concurrent users but because I’m hacking away trying to reap the benefits of the microservices style within my application.

Post #19

Where Next?

View thread on forum (has 23 responses!)

big-data

beam

kafka

elixir

Home Chat & Discussions>Discussions

#big-data #beam #kafka #elixir

101 5742 23

Last post

What would it take to create Kafka like solution in Elixir?

onkara

What would it take to create Kafka like solution in Elixir?

Most Liked

sasajuric

tristan

otuv

Where Next?

Popular in Discussions

Gleam Has A New Web Framework

String.capitalize() should have a “leave the rest of the word alone” option

Elixir vs Python

Find maximum and minimum in two dates

Discussion: Don't add a database layer to your Phoenix application

Yet another pipe to nth argument discussion

Elixir Code Editors & IDEs - which one are you using? (Poll)

How do you organize your components with Phoenix 1.7?

Comparing Elixir with Haskell

What’s wrong with Umbrella Apps?

Other popular topics

How to fix *Bad argument in call to erlang:'++'(<<"xxx/crash.log">>, ".3") in lager_rotator_default:rotate_logfile/2 line 84*

System.get_env vs. Application.get_env

Django vs Phoenix

(Postgrex.Error) FATAL 28P01 (invalid_password) password authentication failed for user “postgres”

How can I check Phoenix version?

ElixirLS - the Elixir Language Server

Using VSCode on multiple monitors

What's the best ide/editor for elixir in 2021?

Import a module from a file into IEX

What's a great modern drag and drop javascript library you recommend?

Chat & Discussions>Discussions

Latest on Elixir Forum

Sponsor Spotlight

Our Sponsors

Categories:

Sub Categories:

Forums

Popular Tags

Our Sponsors

We're in Beta

How to fix Bad argument in call to erlang:'++'(<<"xxx/crash.log">>, ".3") in lager_rotator_default:rotate_logfile/2 line 84