Aludel - LLM prompt evaluation workbench

Aludel is an embeddable Phoenix LiveView dashboard for evaluating and comparing LLM prompts across multiple providers (OpenAI, Anthropic, Ollama) simultaneously. It helps developers test prompt quality, track costs, and catch regressions with automated evaluation suites.

What it does

Run the same prompt across different LLM providers side-by-side and compare:

  • Output quality — See responses from GPT-4, Claude, and local Ollama models together
  • Performance metrics — Latency, token usage, and cost per request tracked in real-time
  • Evolution tracking — Visualize how prompt versions perform over time with pass rates, cost, and latency trends
  • Regression testing — Automated evaluation suites with assertions (contains, regex, exact_match, json_field)
  • Prompt versioning — Immutable prompt versions with {{variable}} interpolation
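
For illustration, the `{{variable}}` interpolation mentioned above can be sketched in a few lines of Elixir. This is a hypothetical sketch (module and function names are made up), not Aludel's actual implementation:

```elixir
defmodule TemplateSketch do
  # Replace each {{name}} placeholder with the matching value from a map.
  # Hypothetical helper, not Aludel's actual interpolation code.
  def render(template, vars) do
    Regex.replace(~r/\{\{(\w+)\}\}/, template, fn _whole, name ->
      Map.get(vars, name, "")
    end)
  end
end

TemplateSketch.render("Explain {{topic}} in exactly 3 sentences.", %{"topic" => "OTP"})
# => "Explain OTP in exactly 3 sentences."
```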

Key features

  • Multi-provider execution — Send one prompt to OpenAI, Anthropic, and Ollama concurrently. Results stream in real-time.
  • Cost tracking — Automatic cost calculation based on token usage and provider pricing.
  • Evaluation suites — Visual test case editor with document attachments (PDF, images, CSV, JSON, TXT). Run automated assertions against LLM responses.
  • Dashboard — Live metrics as runs execute: cost trends, latency, and per-provider performance.
  • Local-first option — Works with Ollama out of the box (no API keys required). Add cloud providers optionally.
  • Embeddable — Add to any existing Phoenix LiveView app as a self-contained dashboard, or run standalone.
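
As a rough sketch of the concurrent fan-out idea (illustrative only; `call_provider` below is a caller-supplied stand-in for the real OpenAI/Anthropic/Ollama clients, not Aludel's API):

```elixir
defmodule FanOutSketch do
  # Send one prompt to every provider concurrently and collect results as
  # they finish, timing each call. Names here are hypothetical.
  def run(prompt, providers, call_provider) do
    providers
    |> Task.async_stream(
      fn provider ->
        {micros, output} = :timer.tc(fn -> call_provider.(provider, prompt) end)
        %{provider: provider, latency_ms: div(micros, 1000), output: output}
      end,
      ordered: false,
      timeout: 30_000
    )
    |> Enum.map(fn {:ok, row} -> row end)
  end
end
```

With `ordered: false`, rows arrive as each provider finishes, which is what lets results stream into a LiveView UI as they complete.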

Example workflow

# 1. Create a versioned prompt template
"Explain {{topic}} in exactly 3 sentences."

# 2. Run across 3 providers simultaneously
#    - Ollama (llama3, local)
#    - OpenAI (gpt-4o)
#    - Anthropic (claude-sonnet-4)

# 3. View side-by-side comparison in real-time:
# Provider       | Latency | Tokens  | Cost     | Output
# Ollama Llama3  | 1,234ms | 45/123  | $0.0000 | ...
# OpenAI GPT-4o  | 856ms   | 52/145  | $0.0019 | ...
# Claude Sonnet  | 1,102ms | 48/138  | $0.0018 | ...

# 4. Create evaluation suite with assertions
#    - Assert output contains "three sentences"
#    - Assert output matches regex pattern
#    - Run regression tests on prompt changes
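
The four assertion kinds could look something like this in Elixir. A hedged sketch only; the map shapes and the use of the built-in `JSON` module (Elixir 1.18+) are my assumptions, not Aludel's schema:

```elixir
defmodule AssertionSketch do
  # One clause per assertion kind. Map shapes are illustrative.
  def check(%{type: :contains, value: v}, output), do: String.contains?(output, v)
  def check(%{type: :exact_match, value: v}, output), do: output == v
  def check(%{type: :regex, value: v}, output), do: Regex.match?(Regex.compile!(v), output)

  def check(%{type: :json_field, path: path, value: v}, output) do
    # Decode the response and compare a nested field.
    case JSON.decode(output) do
      {:ok, decoded} -> get_in(decoded, path) == v
      _ -> false
    end
  end
end
```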

Use cases

  • Prompt engineering — Test variations across providers to find the best prompt/model combination
  • Cost optimization — Compare pricing and quality trade-offs between providers
  • Quality assurance — Automated regression testing when updating prompts or switching providers
  • Provider evaluation — Benchmark performance, cost, and quality across OpenAI, Anthropic, and local models
  • Offline development — Use Ollama for local development without API costs
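
To make the cost-optimization point concrete: per-request cost is just token counts times a per-million-token rate. A back-of-the-envelope sketch (the prices below are placeholders, not real provider pricing; always check the provider's pricing page):

```elixir
defmodule CostSketch do
  # USD per 1M tokens; placeholder numbers only.
  @prices %{
    "gpt-4o" => %{input: 2.50, output: 10.00},
    "claude-sonnet-4" => %{input: 3.00, output: 15.00},
    "llama3" => %{input: 0.0, output: 0.0} # local Ollama model: free
  }

  def cost(model, input_tokens, output_tokens) do
    p = Map.fetch!(@prices, model)
    (input_tokens * p.input + output_tokens * p.output) / 1_000_000
  end
end
```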

Installation

Aludel can be embedded into any Phoenix LiveView application or run standalone.

As a dependency (embedded mode)

# mix.exs
def deps do
  [
    {:aludel, "~> 0.1"}
  ]
end

# config/config.exs
config :aludel, repo: YourApp.Repo

# lib/your_app_web/router.ex
import Aludel.Web.Router

scope "/dev" do
  pipe_through :browser
  aludel_dashboard "/aludel"
end

# shell
mix aludel.install  # Copy migrations
mix ecto.migrate
mix aludel.seed     # Optional demo data

Standalone mode

git clone https://github.com/ccarvalho-eng/aludel.git
cd aludel/standalone
mix deps.get
mix ecto.setup
mix aludel.seed  # Optional demo data
mix phx.server
# Visit http://localhost:4000

Requirements: Elixir 1.19.5+, Erlang/OTP 28.4+, PostgreSQL 17+

Optional: ImageMagick v7+ (for PDF support with Ollama vision models)

Current status

Active development. Core features complete. Available on Hex.pm, with CI/CD and security scanning.

:white_check_mark: Multi-provider execution (OpenAI, Anthropic, Ollama)
:white_check_mark: Real-time result streaming with LiveView
:white_check_mark: Cost and latency tracking
:white_check_mark: Prompt versioning and evolution tracking
:white_check_mark: Evaluation suites with document attachments
:white_check_mark: Side-by-side comparison UI


Minor updates:

  1. Dashboard Revamp - Glass morphism styling
  2. Improved Dark Mode - Updated to One Dark color palette for better contrast
  3. Prompt Evolution - New evolution tab for tracking prompt performance over time (kudos to @mikehostetler for the idea; see feat: integrate GEPA for prompt evolution and optimization · Issue #12 · ccarvalho-eng/vial · GitHub)
  4. Branding Updates - Added beaker icon to navigation and favicon
  5. UI Polish - Modernized button design, improved modals, consistent styling across pages


Update: added some minor charts to the prompt evolution page


This looks pretty amazing, I will make sure to try it out

Any plans to add other providers?

Also, do you plan on making this a library in the future? For now it seems like it is its own Phoenix project, right? Being able to add it to an existing project would be great.

Any plans to add other providers?

We can! Maybe we’d need to modularize the LLM client interface a bit, but totally doable!

Also, do you plan on making this a library in the future? For now it seems like it is its own Phoenix project, right? Being able to add it to an existing project would be great.

If this becomes something super useful and more people are interested, why not? You’re suggesting something like Oban Web / Live Dashboard, locked into a dev route, right? Right now, the only way to use it is to run it on your own machine or deploy it as a private service.

Feel free to open an issue and start a conversation there about your ideas

Nice.

I looked a little at the code and noticed that you are using Req for the LLM requests, right? Any reason for not using ReqLLM instead? Technically it would give you support for a bunch of other providers through a common interface.

Yep exactly, I can see myself adding it to my projects as a dev-only route that I can use to prototype and test prompts.

Fair points.

I noticed ReqLLM requires an api_key param even for Ollama, so no strong preference. Mostly I wanted to have an MVP up and running to see whether people would like it. I can eventually push a PR to ReqLLM to patch this.

Not sure how much effort it would take to make this prompt lab “bootable”, but I can dig into the specifics.


Hey @sezaru, good news! I’ve got this working in a branch: feat: convert Vial to embeddable Phoenix LiveView library by ccarvalho-eng · Pull Request #19 · ccarvalho-eng/vial · GitHub

Have been banging my head and could only make it work after spending some time learning more about Oban Web’s internal architecture.

It’s a massive branch, but it’s working. I’ll try to polish it and reduce its size, but it’s unlikely that it will be a thinner PR.

Let me know what you think, and perhaps we can see if there’s a way to simplify some aspects.

Standalone and embedded mode are working seamlessly! I’ll keep the branch open for a few days until I’m fully confident about code quality and QA.

Updates

  • Standalone + Embeddable is merged to main. Please report any bugs.
  • I am thinking of rebranding, as Vial is too generic and it’s already taken on hex.pm

Potential logo:

  1. Flask (aludel) → controlled experiment → “workbench”
  2. Smoke → partial wireframe robot → model under evaluation, not finished output
  3. Wireframe + fade → abstraction, inspection, iteration

Update: Library was just published to hex.pm!


Final logo (for better or for worse :slight_smile:)


Prompts now have projects (folders) so we can organize them

Will eventually add more providers when I switch back to ReqLLM

Have been playing with document evaluation …

Update: version 0.1.7 is up! It adds improved test suite editing, document uploads, and JSON field asserts.


For now, I am storing docs in PostgreSQL (10MB max per document)
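
A 10MB cap like that can be enforced with a simple byte-size guard before the row ever reaches PostgreSQL. Illustrative only; the module and function names are hypothetical, not Aludel's actual upload path:

```elixir
defmodule DocStoreSketch do
  @max_bytes 10 * 1024 * 1024 # 10 MB cap for documents stored in PostgreSQL

  # Accept the binary only if it fits under the cap.
  def validate_size(binary) when byte_size(binary) <= @max_bytes, do: :ok
  def validate_size(_binary), do: {:error, :too_large}
end
```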


Aludel v0.1.8 Released - LLM Eval Workbench Updates

What’s New:

  • Visual/JSON toggle for assertion editors with dynamic field switching
  • Improved run configuration layout with side-by-side template preview
  • Fixed PubSub supervisor initialization

Aludel v0.1.9 Released - Enhanced Dashboard with Activity Charts & Metrics

This release focuses on dashboard improvements to help you track prompt performance and
costs more effectively (hopefully the stats are a bit more meaningful now, but I am no data analyst :sweat_smile:):

New Dashboard Features:

  • :bar_chart: Activity chart - Interactive 30-day visualization with hover tooltips
  • :chart_increasing: Trend indicators - 7-day comparison arrows showing run volume changes
  • :money_bag: Cost breakdowns - Toggle between provider and prompt views
  • :high_voltage: Latency percentiles - P50/P95 metrics alongside averages
  • Suite run failure highlighting
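
For anyone curious, P50/P95 over a window of latency samples can be computed with a naive nearest-rank method like this (a sketch, not necessarily how the dashboard computes it):

```elixir
defmodule PercentileSketch do
  # Nearest-rank percentile: sort the samples, then pick the value at rank
  # ceil(p/100 * n). Fine for dashboard-scale sample counts.
  def percentile([], _p), do: nil

  def percentile(samples, p) do
    sorted = Enum.sort(samples)
    index = max(ceil(p / 100 * length(sorted)) - 1, 0)
    Enum.at(sorted, index)
  end
end

PercentileSketch.percentile([120, 80, 200, 95, 150], 95)
# => 200
```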

Thanks. I will begin testing this out in the next couple of weeks.
