gtcode

Porting Sakana AI's TRINITY Qwen-based model to Elixir/Bumblebee/Nx/Axon

Aloha gang,

I’m working on a port of Sakana AI’s TRINITY, an evolved LLM coordinator:

TRINITY Paper
OpenReview
Downloadable Assets

I started by attempting to reconstruct the work itself, but that isn’t realistic for me, given skill/resource constraints. So I’ve instead pivoted to porting their Python mechanism that uses a base Qwen model to build their coordinator: Trinity Coordinator. (Right now it’s been deconstructed to use a local path-dep in mix.exs related to an inference library I’m building to generalize abstracting LLM providers, thus not “clone friendly” yet.)

Just seeing if this is of interest to anyone. Certainly open to input/feedback/ideas/critiques on approach. Please respond here or open an issue with your candid feedback. There must be someone out there with more knowledge/experience on such matters who can provide guidance?

I’ve created the safetensors file from the original python scripts, so nx can talk numbers properly. I’ve been working on a staged verification process so that the resulting coordinator based on Qwen will behave the same as the generated .pt file from their Python system. There are some nuances related to numpy -> nx and others that might prevent perfect alignment but I’m aiming for behavioral and functional parity.

One thing I see often these days is people creating amazing work and ideas in Elixir, but often hard coupled to providers and the like. One goal for trinity_coordinator is to have a working standalone system with built in routing to LLM’s, but also pluggable/modular for integration into any other codebase/framework/system.

ps: I wasn’t sure if this is the right forum category, but there was a note that said to use the nx forum if it’s nx related.

15 comments

/nx #bumblebee

14 803 15

2026-07-07 02:19:07 UTC

Most Liked

polvalente

Nx Core Team

@gtcode I just shipped some improvements to EMLX in the past few days focusing on performance.
I took inspiration from what @ausimian did on his backend with the lowering compiler, and the final PR of the series has a benchmark in the description: feat: add fused kv_cache+sdpa by polvalente · Pull Request #124 · elixir-nx/emlx · GitHub

Our custom Qwen3 implementation now reaches 300 tok/s on 0.6B 4bit and the emlx_axon bumblebee rewrites can reach 120 tok/s on the non-quatized 0.6B! ~~Tomorrow I’ll release a new emlx/emlx_axon version with these improvements.~~ Improvements are available in emlx/emlx_axon 0.4!

Post #16

ausimian

Just for fun, I ported this to work on my 24GB M4 MBP using the Emily backend. In doing so, during the export, I ran into a limitation of the current native mlx libraries - their svd functions have no ‘thin’ mode and always materialise the full matrix. For the Qwen embedder that turned out to be ~92GB of memory.

I updated Emily to support this mode (in specific cases) directly via the Gram matrix, could do the one-time export in ~2s and was able to run the qwen router example.

If you are interested, the changes I made are here

Post #3

polvalente

Nx Core Team

@gtcode Nx 0.12.1 and EMLX 0.3.1 have been released with the changes I had to apply to get your lib working without OOMs

Post #15

Where Next?

View thread on forum (has 15 responses!)

bumblebee

Home Questions & Help>Questions

/nx #bumblebee

14 803 15

Last post

Popular in Questions

Questions & Help>Questions

Deleting item from a list

Hello, can anybody help here..? I have a list of players and I what to delete an element, but every for loop the list is reverting to ori...

7 24373 4

2020-03-18 04:04:09 UTC

New

Questions & Help>Questions

Deploying Elixir into ECS causing many "'global' at node :"xxxxx@10.0.X.X" requested disconnect from node :"xxxx@10.0.X.X" in order to prevent overlapping partitions"

We have an ECS cluster with 4 services, where each task joins a single cluster, via discovery ECS discovery service. Currently when I de...

#deployment

2 85473 3

2023-11-16 22:55:34 UTC

New

Questions & Help>Questions

How can I write a raw sql query?

Hi, I have to write a raw query for one of my project. But till now I have used ecto queries and don’t have much experience writing raw ...

/phoenix #ecto

13 19750 20

2020-04-12 00:15:10 UTC

New

Questions & Help>Questions

Mint vs Finch vs Gun vs Tesla vs HTTPoison etc

Currently suffering from paralysis by [HTTP client] analysis. This is rather unusual in Elixirland as there tends to be consensus on the ...

#http_client

252 22547 30

2024-02-11 02:32:24 UTC

New

Questions & Help>Questions

How can I check Phoenix version?

Hello, how can I check the Phoenix version ? Thanks !

/phoenix

35 28311 8

2022-07-29 11:27:07 UTC

New

Questions & Help>Questions

What do you think of Gleam compared to Elixir?

I have a relationship of love and hate with Elixir. Lots of things are just absolutely right, but there are some things that are kind of ...

#programminguages #gleam

24 17623 10

2023-04-08 20:09:27 UTC

New

Questions & Help>Questions

Ecto query using like/ilike in query

Good day to you all. I have been struggling to get a query involving like and ilike to work. Can anyone assist me on this, please? pro...

#ecto

17 16956 10

2022-09-15 19:56:29 UTC

New

Questions & Help>Questions

Updating structs: Map.put vs %Foo{oldfoo | new: value} vs put_in

Original source of discussion: This topic on the Pragmatic Programmers’ Functional Web Development with Elixir, OTP, and Phoenix forum. ...

#maps #structs #pipeline #access

115 28855 31

2020-07-04 06:01:18 UTC

New

Questions & Help>Questions

Best Practises for Error handling elixir?

How to handle excepions in elixir? Suppose i have A, B, C ,D, E modules. and each module has get() function. A.get() method will call t...

16 16271 9

2017-08-30 06:48:57 UTC

New

Questions & Help>Questions

How is it possible to get 2 million websocket connections when you have 65536 available ports?

I have a server on AWS, and was running a load test using artillery. When looking at the Phoenix dashboard I see the Ports going to 100% ...

/phoenix

20 19015 4

2023-01-24 00:21:16 UTC

New

Other popular topics

Questions & Help>Questions

How to fix Bad argument in call to erlang:'++'(<<"xxx/crash.log">>, ".3") in lager_rotator_default:rotate_logfile/2 line 84

Erlang/OTP 25 [erts-13.2.2] [source] [64-bit] [smp:8:8] [ds:8:8:10] [async-threads:1] 15:22:35.803 [error] gen_event {lager_file_backend...

#production #error #log

2 46485 2

2024-02-18 13:22:44 UTC

New

Questions & Help>Questions

System.get_env vs. Application.get_env

What is the difference between System.get_env and Application.get_env? For example, what are best practices to use one versus another.

#environment

13 17309 6

2023-08-30 16:24:51 UTC

New

Questions & Help>Questions

Put/update deep inside nested maps (and auto-create intermediate keys)

To my knowledge, put_in, Map.update etc. all have the one limitation of not automatically creating intermediate keys when needed (for exa...

#data-structures #maps #immutability

52 20442 11

2022-02-07 21:38:33 UTC

New

Chat & Discussions>Discussions

Django vs Phoenix

Anybody knows a comprehensive comparison of Django and Phoenix, thanks for the help. Where are they similar? Where do they differ the m...

/phoenix #learning-elixir

184 21830 82

2020-01-04 00:27:35 UTC

New

News>Phoenix News

Phoenix 1.4.0 released!

Phoenix 1.4.0 released Phoenix 1.4 is out! This release ships with exciting new features, most notably with HTTP2 support, improved deve...

/phoenix #phoenix-release

688 31013 112

2018-11-21 08:51:31 UTC

New

Questions & Help>Questions

What do you think of Gleam compared to Elixir?

I have a relationship of love and hate with Elixir. Lots of things are just absolutely right, but there are some things that are kind of ...

#programminguages #gleam

24 17623 10

2023-04-08 20:09:27 UTC

New

Questions & Help>Questions

Installing elixir via asdf shows zsh: command not found: iex

I tried installing elixir 1.11.2 erlang 23.3.4 via asdf in my zsh shell. Enabled the versions locally and globally. When I list them ...

#erlang #asdf

44 17038 17

2023-12-27 16:32:30 UTC

New

Questions & Help>Questions

Transform a list into an map with indexes using Enum module

Hi, I need to transform a list of numbers into a map where the keys are the indexes and the values are the original values of the list. ...

35 32927 9

2016-09-01 23:06:05 UTC

New

Chat & Discussions>Discussions

ElixirLS - the Elixir Language Server

TL;DR: I’ve just released an implementation of Microsoft’s IDE-independent Language Server Protocol for Elixir. It adds language support ...

#elixir-ls

1144 54120 245

2026-06-09 16:10:09 UTC

New

Questions & Help>Questions

Websocket connection works on localhost, but get 403 error when deployed via docker

For some reason my phoenix channels are working for me in my local dev environment, but as soon as I deploy via Docker, I get a 403 error...

/phoenix #channels

8 26986 12

2020-03-07 19:29:53 UTC

New

Latest Nx Threads

Splitting an ML model and a web app across two BEAM nodes

Blogs & Podcasts>Blog Posts

Nx ecosystem 0.12 library updates

News>News & Updates

Nx While loop training slow down when passing through frozen embedings

Questions & Help>Questions

Porting Sakana AI's TRINITY Qwen-based model to Elixir/Bumblebee/Nx/Axon

Questions & Help>Questions

Igor Ostaptchenko, Detroit MI - Senior/Staff Elixir Engineer, 13 years on the BEAM (Remote US, UTC-5)

Jobs & Member Profiles>Member Profiles

Adbc high memory usage compared with native drivers

Questions & Help>Questions

CUDA on Ubuntu 2604 (they're fixing drivers)

Chat & Discussions>Discussions

Better way to run local embeddings with Apple Metal than Ollama?

Questions & Help>Questions

Edifice - 92 neural network architectures for Nx/Axon

News>Announcing

Probabilistic Programming environment on the BEAM with NX, inspired by PyMC

News>News & Updates

Nx Forum ❯

Questions & Help>Questions

Help with elixir-ts-mode in doom-emacs config

Questions & Help>Questions

Are Vi keybindings possible inside IEx?

Questions & Help>Questions

I miss the ternary operator - does anyone have a macro that allows a ternary operator in Elixir code?

Questions & Help>Questions

Empty Result on Generic Action with graphql_unnested_unions

Questions & Help>Questions

Clarification about `assign/2,3` usage in `render/1` callbacks

Questions & Help>Questions

With the new 1.20 release does it change the way you see Gleam?

Questions & Help>Questions

Using Phoenix.LiveView.TagEngine as an EEx.Engine is deprecated!

Questions & Help>Questions

About ambiguity introduced in function default arguments

Questions & Help>Questions

OpenApiSpex schema - are there any naming conventions on handling show and index routes?

Questions & Help>Questions

How to get type warnings before test failure reports

Questions & Help>Questions

Questions Questions ❯

Latest on Elixir Forum

Comcent CE - an open-source voice/contact-center platform on Elixir/OTP, with call queues modeled as processes

News>Announcing

LT: smithy beam: Contract first API Development - Frank Eickhoff | ElixirConf EU

Learning Resources>Talks

BEAM There, Done That with Lukas Backström on Building the BEAM JIT

Blogs & Podcasts>Podcasts

Senior Software Engineer - Stord, Remote USA

Jobs & Member Profiles>Jobs

Hyper - distributed Firecracker microVM orchestrator written in Elixir

News>Announcing

Just_bash - a bash interpreter + virtual filesystem in Elixir (and how we use it to power an agent in production)

Chat & Discussions>AI / LLMs

Update from the Erlang Ecosystem Foundation - Dan Janowski | ElixirConf EU

Learning Resources>Talks

RFC 10008 - HTTP QUERY method: any plans for Plug/Cowboy support?

Chat & Discussions>Discussions

Localize bindings for Lua, LFE, Erlang and Gleam

News>Announcing

Attesto - OpenID-certified OAuth 2.1 / OpenID Connect for Elixir (Phoenix provider, client, and MCP auth)

News>Announcing

Improv - BLE Wi-Fi provisioning for Elixir/Nerves devices

News>Announcing

Andrew (Nature) Okoye - Senior Full Stack Engineer (Elixir, Phoenix, React) | Remote

Jobs & Member Profiles>Member Profiles

Annotai - turn UI annotations into structured context that AI agents can act on

News>Announcing

Cfonb - a parser for CFONB, the French banking statement format

News>Announcing

AddToCalendar - server-side "add to calendar" links + ICS generation for Phoenix LiveView

News>Announcing

Elixir Forum ❯

Sub Categories:

Forums

We're in Beta

About us Mission Statement

Porting Sakana AI's TRINITY Qwen-based model to Elixir/Bumblebee/Nx/Axon

gtcode

Porting Sakana AI's TRINITY Qwen-based model to Elixir/Bumblebee/Nx/Axon

Most Liked

polvalente

ausimian

polvalente

Where Next?

Popular in Questions

Deleting item from a list

Deploying Elixir into ECS causing many "'global' at node :"xxxxx@10.0.X.X" requested disconnect from node :"xxxx@10.0.X.X" in order to prevent overlapping partitions"

How can I write a raw sql query?

Mint vs Finch vs Gun vs Tesla vs HTTPoison etc

How can I check Phoenix version?

What do you think of Gleam compared to Elixir?

Ecto query using like/ilike in query

Updating structs: Map.put vs %Foo{oldfoo | new: value} vs put_in

Best Practises for Error handling elixir?

How is it possible to get 2 million websocket connections when you have 65536 available ports?

Other popular topics

How to fix *Bad argument in call to erlang:'++'(<<"xxx/crash.log">>, ".3") in lager_rotator_default:rotate_logfile/2 line 84*

System.get_env vs. Application.get_env

Put/update deep inside nested maps (and auto-create intermediate keys)

Django vs Phoenix

Phoenix 1.4.0 released!

What do you think of Gleam compared to Elixir?

Installing elixir via asdf shows zsh: command not found: iex

Transform a list into an map with indexes using Enum module

ElixirLS - the Elixir Language Server

Websocket connection works on localhost, but get 403 error when deployed via docker

Latest Nx Threads

Questions & Help>Questions

Latest on Elixir Forum

Sponsor Spotlight

Our Sponsors

Categories:

Sub Categories:

Forums

Popular Tags

Our Sponsors

We're in Beta

How to fix Bad argument in call to erlang:'++'(<<"xxx/crash.log">>, ".3") in lager_rotator_default:rotate_logfile/2 line 84