interesting thread
Please wake me up when I can run it locally.
I prefer privacy over convenience.
This thread/fork split from: AI is getting ridiculously productive
Qwen3.5 27B can run on a Tesla P40 for less than $300 USD, and there are a lot of use cases you can cover at that level.
Right now I'm playing around with qwen3.5:35b-a3b-q8_0 on my workstation. It's not perfect, but it's definitely got utility.
To be fair, the most frequent issue is speed, because I'm running right at the edge of what my hardware can do… but it's not unusable or anything. The larger issue, I suspect, is that you really do need some skill to get the most out of it locally… I'm getting there, but the road is long…
I've got a test server with a couple of GPUs, and I'm also waiting for the day when local inference is more competitive. Things are moving in the right direction. While we wait, IMO you'll be well served to buy a low-end LLM subscription and learn the tools now.
Edit to say: if you want to run local inference now for real workloads, you could get an RTX 4070 (~$550) or a 3060 (~$350), put it in an old Ubuntu box, and run OpenClaw/NemoClaw. Ubuntu 26.04 is supposed to ship with pre-configured CUDA drivers (sudo apt install cuda).
I would much prefer local too if the quality were there. But it's not a matter of convenience for me. It's a matter of extending what I can do and how much I can do, and in the end making more money. Come to think of it, that's the theme of most tools.
I do think local will get there one day. When that day comes, I hope experience with today's solutions will make the transition easy.
The headline question was when. In 2019 or so, the Nvidia Jetson AGX Xavier offered roughly 30 TOPS and 32 GB of RAM. In 2022, the Nvidia Jetson AGX Orin was about 200 TOPS and 64 GB. In 2025 we got the Nvidia Thor with roughly 2000 TOPS and 128 GB. Extrapolating that trend (roughly 10x TOPS and 2x RAM per generation) would give 20000 TOPS and 256 GB of RAM in 2028, and 200000 TOPS and 512 GB in 2031.
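The per-generation trend implied by those three data points (about 10x TOPS and 2x RAM every three years) can be sketched in a few lines; this is just the arithmetic on the figures quoted above, not a prediction of real products:

```python
# Extrapolate edge-hardware specs from the generational trend quoted above:
# roughly 10x TOPS and 2x RAM every 3 years (2019 -> 2022 -> 2025).
def extrapolate(year, base_year=2025, base_tops=2000, base_ram_gb=128):
    gens = (year - base_year) // 3  # whole 3-year generations ahead
    return base_tops * 10 ** gens, base_ram_gb * 2 ** gens

for year in (2028, 2031):
    tops, ram = extrapolate(year)
    print(f"{year}: ~{tops:,} TOPS, {ram} GB RAM")
```

Whether the trend actually continues that long is, of course, the whole question.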
For local models to produce quality code, though, I think the models need to be built for that, so quality keeps up at smaller sizes. There is a lot in this world that coding models have no need to know, so local models for specialized tasks like coding could be a lot smaller (and use less power and fewer resources in the process).
I'm by no means an expert, but recently I was also wondering whether there could be benefits from a bigger, faster pipe to the model when it runs locally. Sending everything in a slow ping-pong of text feels kind of inefficient, but I guess it's what we have right now (at least for the text part).
To draw a potential parallel to the past: universities used to have powerful central time-shared servers for the computation power people might today hold in their pocket.
One thing to keep in mind is that small local models are not useless: they make great fuzzy-logic implementations and great text wranglers.
I use Qwen through ollama on my MacBook Air for various little tasks: receipt classification (my accountant is very old school), enrichment of transcripts (transcribed locally with whisper) with project context, and structuring data from quick text notes. The scripts that run those tools were also partly written locally by Qwen!
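A receipt-classification script of that kind can be tiny. The sketch below talks to ollama's default local HTTP endpoint; the model name and category list are placeholder assumptions, and it obviously requires an ollama server running on the machine:

```python
import json
from urllib import request

OLLAMA_URL = "http://localhost:11434/api/generate"  # ollama's default endpoint

def build_request(receipt_text, model="qwen3:8b",
                  categories=("travel", "meals", "office", "other")):
    """Build an ollama /api/generate payload asking for exactly one category."""
    prompt = (
        "Classify this receipt into exactly one of: "
        + ", ".join(categories)
        + ". Reply with the category only.\n\n"
        + receipt_text
    )
    return {"model": model, "prompt": prompt, "stream": False}

def classify(receipt_text):
    """Send the receipt to the local model and return its category string."""
    payload = json.dumps(build_request(receipt_text)).encode()
    req = request.Request(OLLAMA_URL, data=payload,
                          headers={"Content-Type": "application/json"})
    with request.urlopen(req) as resp:
        return json.loads(resp.read())["response"].strip()
```

Everything stays on the laptop, which is the point: the receipts never leave the machine.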
If the Talaas exploration (a startup that burns LLM weights onto silicon) succeeds, little specialized models might become way more common. Llama-8B-on-a-USB-stick running at 10000 tokens/second can be very useful. (If you have not tried the demo, https://chatjimmy.ai/ demonstrates supposedly on-chip inference of Llama 8B; my tries hover around 15000 tokens/second.)
So small local models seem to be on track to become "edge inference" with lower consumption… still bringing another new generation of hardware and gadgets, though.
The problem I see here is a "misalignment of expectations" within the AI retail market (i.e., us). NVIDIA's market is essentially data centers. Why would they jeopardize their customers' revenues from token sales with a product they could only sell to us every once in a while? IMO, their strategy so far has gone in the opposite direction: financing their customers' purchases of their own products en masse (it hardly gets any bubblier than that, btw).
A rough breakdown of NVIDIA's sales for the last fiscal year (according to Grok):
Also, what incentive would the LLM providers have to train models we can then use for free? That's not the name of the game.
Maybe, just maybe, this opens a new window of entrepreneurial opportunity: training and selling models for private use, but again, with the customers running them on what?
Exactly this!
And even if there is something that runs reasonably on smaller (local) hardware, there will always be a much faster, way more "intelligent", much more desirable LLM.
It has always been like this, and not only for LLMs.
Thinking about all the AI models that companies like Google, Meta, and others have made available for free, it seems to me that is part of the game. For most of us, training one of the big models (not just LLMs) from scratch is impossible, for lack of both data and processing resources. But here the freebies have made a lot of difference, since fine-tuning such available models is achievable for many. Either way, there are free LLMs available today too, and I hope they will keep improving as well.
I think the big commercial ones will likely always be better. But all LLMs keep improving, while the difficulty of programming stays fairly constant, if it isn't getting easier. Thus my hope is that at some point local LLMs will produce production-quality code. Then I will ask myself whether paying a service for even better quality or speed is really worth it. Good enough is good enough at some point.
I don't know. At some level these huge LLM providers in the sky remind me of laser-printing services: great businesses until price and quality reached the point where every office just got its own laser printer. I think cloud LLMs for programming might be temporary. Will we still be doing it like we are now in 5 to 10 years? I'm not so sure.
Like with crypto, I suspect the answer to efficiency will be found in purpose-built ASICs. The model certainly doesn't have to produce perfect one-shot results if the agents can quorum/loop at 17K tps.
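One way to read "quorum": when tokens are nearly free at ASIC speeds, you can sample the same question many times and keep the majority answer (self-consistency voting). A minimal sketch, with a stubbed-out noisy model standing in for real inference:

```python
import random
from collections import Counter

def quorum(generate, prompt, n=5):
    """Sample n candidate answers and keep the majority vote.
    `generate` is any callable prompt -> answer (e.g. a fast local model)."""
    votes = Counter(generate(prompt) for _ in range(n))
    answer, count = votes.most_common(1)[0]
    return answer, count / n  # winning answer and agreement ratio

# Stub model: each single shot is only 70% reliable, but a 25-way
# quorum lands on the majority answer almost every time.
def noisy_model(prompt):
    return "4" if random.random() < 0.7 else "5"

answer, agreement = quorum(noisy_model, "2 + 2 = ?", n=25)
```

At thousands of tokens per second, the 25x cost of the quorum is noise; the reliability gain is the whole point.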
I often run 2-3 sessions at once with Claude. That's no problem for my machine, since the hard work is done in the cloud. But to move that workflow to local models, the models would have to be 1) smart enough to handle my tasks AND 2) cheap enough to run that I could run multiple simultaneously.
I'd also take some convincing on item 1; frontier LLMs have improved drastically in the last few months, and although I'm sympathetic to wanting to run local models for privacy / control / independence / openness, I wouldn't want to give up capability either.
What I find is that not all asks I might make of an LLM are equal. Many of the more routine, simpler kinds of work are well within the capabilities of local models. More advanced work, however, either runs into performance limitations or the fact that the SOTA/frontier models are just better. Mix in things like search and you get even more flavor.
So, for that simple work, I run it locally. Why burn subscription tokens on work that doesn't need the power? For the more advanced work, I use Cursor with whatever model I feel is best for the task. I subscribe on the low end, so what I do via the subscription can max it out fairly quickly. Finally, for streamlining initial research or search-oriented interactions, I'll use Kagi Assistant, which I've found very productive… to be fair, I've not used many other similarly search-aligned assistants, so I could be missing out.
I still have a fair amount to go to get the most out of the local setup, but this perspective that it doesn't need to be all cloud or all local offers, I think, some ways to get the most out of everything I've spent on computers… both capital (hardware) and expense (subscriptions).
Waiting for Intel Panther Lake and AMD Strix Halo to be available for general and wide scale usage.
As of March 2026, providers can lose a lot of money on inference because investors value their companies based on growth, not on their probability of profit. We've circled this drain many times.
As long as this is the case, local inference will be very hard to do in a fiscally responsible manner.
When the investor market inevitably changes its mind and actually does want to see profits, the money for subsidizing inference will dry up. That's when the playing field will be leveled, and if you buy hardware after that time and keep your local GPU busy for enough hours of the day, I suspect you would come out on top financially.
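The "keep it busy enough hours" intuition is easy to put into numbers. Every figure below is a made-up placeholder (GPU price, power draw, electricity rate, and what an hour of equivalent hosted inference would cost), not a claim about real pricing:

```python
def breakeven_hours(gpu_price, power_kw, electricity_rate, hosted_rate):
    """Busy-hours at which buying the GPU overtakes renting inference.
    hosted_rate is the cost of one hour of equivalent hosted inference."""
    saving_per_hour = hosted_rate - power_kw * electricity_rate
    if saving_per_hour <= 0:
        raise ValueError("hosted inference is cheaper than your electricity")
    return gpu_price / saving_per_hour

# Illustrative numbers only: $1800 GPU, 0.35 kW under load,
# $0.15/kWh electricity, $0.60/hour of equivalent hosted inference.
hours = breakeven_hours(1800, 0.35, 0.15, 0.60)  # ~3288 busy hours
days_at_8h = hours / 8                           # ~411 days at 8 h/day
```

The real uncertainty is the hosted rate: as long as it is subsidized below cost, the break-even point recedes, which is exactly the argument above.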
I am imagining long-running background agents with many separate checks for the correctness of feature implementations, & every kind of automated code-quality check. Keep the GPU busy for hours. Only when all checks pass does it open a PR.
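That loop is simple to express. A minimal sketch, where the patch generator and the checks are stand-ins (a real version would call a local model and run your actual test, lint, and review tooling):

```python
def background_agent(generate_patch, checks, max_iters=100):
    """Regenerate a patch until every automated check passes, feeding the
    failing check names back in as context. Only a fully green patch is
    returned (i.e. only then would a PR be opened); None if out of budget."""
    feedback = ""
    for _ in range(max_iters):
        patch = generate_patch(feedback)
        failures = [name for name, check in checks if not check(patch)]
        if not failures:
            return patch  # all checks green: safe to open the PR
        feedback = "failed checks: " + ", ".join(failures)
    return None

# Stand-in usage: "patches" are just integers, and the single check
# wants a value of at least 3, so the third attempt succeeds.
attempts = []
def fake_generator(feedback):
    attempts.append(feedback)
    return len(attempts)

patch = background_agent(fake_generator, [("big_enough", lambda p: p >= 3)])
```

Nothing here is time-critical, so a slow local GPU grinding through iterations overnight is a perfectly good engine for it.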
financially
In my case, local execution is about privacy, not financial optimization.
I am imagining long-running background agents with many separate checks for the correctness of feature implementations, & every kind of automated code-quality check. Keep the GPU busy for hours. Only when all checks pass does it open a PR.
Yes - IMO non-time-critical work will be the sweet spot for local inference. Long-running Claw/Jido/Hermes-type agent networks.