DaAnalyst

Any first-hand experience with Claude model getting "nerfed" down after using it for a while?

Been using Claude for over a week now (Opus 4.6 then 4.7, max subscription). Honestly, can’t hide my joy, at some points feeling even ashamed of my past stubborn skepticism. It’s amazing how much (more) I managed to do over this last week. The productivity boost is comparable (if not bigger) than when I switched from OOP to Elixir 7+ years ago.

Just when I thought I figured it all out (how to get the most out of it - btw, I’ve managed to achieve 0 slop for what I’ve been using it for currently), yesterday I got cold showered when I called to brag to an old friend of mine who runs a small dev agency and who I knew has been using Claude for a while.

“Enjoy it while it lasts” he laughed cynically. Told me the (consistent) excellent performance I’ve been experiencing would cost way more than a max subscription, and said he felt it’s been a bait and switch - that over the 8 months he’s been using he’s experienced very serious and frequent drops in Claude’s performance to being downright dumb and messing things up (which is the exact opposite of my experience so far).

A couple of weeks ago I saw a Zerohedge retweet of an engineer from AMD ranting about Anthropic having been nerfing Claude (deliberately downgrading its performance).

Since all this can be very subjective, I need to ask if someone here had this kind of in-person experience or maybe even a more in-depth knowledge on the matter?

30 comments

#claude-code

4 2050 30

2026-04-28 22:51:34 UTC

First 10 of 30 Posts!

oleksify

It usually depends on how it is used. Plain direct usage of prompting (even through Plan mode) most of the time result if crap output. Could be subjective of course, but it writes duplicated code, do not reuse already written functions and module, ignore codebase standards (even if they are directly written to local CLAUDE.md).

Sometimes psychosis starts out of nowhere, and it’s starting outputting pure crap. Or it states he did something, but never actually wrote that code. It can also easily wipe hours of work but just resetting git state. Things happens, to what Claude usually say - “I as so sorry!“.

Only way. I was able to keep it in line is through having very strict orchestration framework with a lot of hooks. So, in short - your friend is correct.

Besides those “features“, service has terrible uptime - it is down very often. It’s not a feeling - their status page all red.

Post #1

DaAnalyst

Thanks for the feedback. But it’s weird I haven’t experienced any this except for the suboptimal code generation (which I’m accustomed to regarding LLMs in general, but I’ve learned how to deal with it and get from it what I need and how I need it).

TBH, my only worry here is this being some kind of actually deliberate policy. That’s the only thing I’d actually hate. If it’s a result of peak demand or whatever technical reason, then it’s subject for improvement and will most likely go away, but if it’s the result of a corporate policy, then it’s too bad.

Post #2

Vidar

Opus 4.5 was a bit of a tipping point for me, and I’ve been an heavy user now for months. I haven’t experienced degrading overall, and for the large majority of the time it is fine, but there are times when I get the fruitcake Claude. That will typically happen after getting a new one after compaction, but it has also happened a few times after very long session. There was a period earlier this year where that happened more often, but these days not so much.

I have gotten better at spotting the confused crazy talk early on, and I just compact that Claude away and usually get a good Claude one again afterwards.

I do git commits often as Claude will not always have a way to undo code changes that don’t work out. I also do additional more comprehensive backups for major milestones. That didn’t change because of Claude, but they have been used a few times when Claude have accidentely overwritten or deleted data source files. Frequent backups do so much better than “I did a horrible mistake. I’m so sorry”.

To be fair sometimes I’ve been the idiot and implicitly assumed Claude has a level of common sense. There is none. I once had ssh into a more powerful computer, and once the heavy processing was done, I asked Claude to clean up and remove all files no longer needed for the project. That did not work out well.

Anyway, I can’t say have experienced any systematic degradation. Rather the opposite as Opus 4.6 and 4.7 seem like improvements.

Post #3

DaAnalyst

Thanks!

What exactly do you mean by this? (compacting that Claude away)

Frequent backups of what? Your repo is already versioned (and hopefully pushed to remote).

Post #4

Vidar

/compact or just /clear

I have two projects which have been going on for months. There are many huge data source files which are git ignored due to size.

Besides, confused Claude version have at times suggested git actions that could mess up that backup so a separate one makes me sleep much better.

Post #5

oleksify

It does feel like deliberate policy. I’ve been experimenting heavily on max plan, mostly doing R&D besides real work. As soon as new model appears - it’s fast, smart, and you feel like a real change. After a week feeling is gone. Everything is slow again. Opus 4.7 started to often hang in the middle of the work (friends report similar behavior) - just stops at some point doing nothing. The worst situation when it hangs within subagents - it’s not stopping subagents, and it feels like it’s just working for really long time.

Without harness or orchestration frameworks like superpowers, plain Claude feels really silly. It’s still way better than let’s say Mistral’s devstral-2, but with Kimi 2.5 I get very similar level of quality with 0.25 of the price (if used through Factory Droid subscription for example).

To sum up, I think it’s taking at least few months to start feeling the pain and understand the AI tax. You lose knowledge of codebase, and often you just blindly trust it. Then you check some parts of the code that AI covered by tests and that work in production “correctly“ just to discover total mess, that will get you into cold sweat.

Post #6

EricGT

Yes.

On forums such as the OpenAI Community Forum, there are many threads describing what you are calling “nerfed” behavior.

From longer-term observation across multiple platforms and vendors (including OpenAI and Anthropic), this pattern is not isolated to a single model or provider. Users often report cycles where a model initially performs very well, then appears to degrade.

A few factors—both user-side and vendor-side—can explain much of this:

User-side factors

Long conversations and context compaction
As sessions grow, systems may summarize or compress earlier context. This can drop details that were implicitly guiding good responses.
Mitigation: periodically start a new session and carry forward only the essential state (a “continuation prompt”).
Prompt drift vs. model updates
When new model versions are released, guidance in model cards or documentation often changes. Prompts that previously worked well may become less effective.
Mitigation: periodically revise prompts to align with current recommendations.

Vendor-side factors

Model updates and tuning changes
Providers do update models over time (e.g., safety tuning, instruction-following behavior, latency/cost optimizations). These changes can alter output style or reliability.
System prompt and policy adjustments
Changes to system-level instructions or safety layers can have noticeable downstream effects on responses.

I could likely spend a week covering this in depth, but in practice it comes down to understanding how LLMs operate, reviewing model documentation, and gaining experience through use.

Note: I did use ChatGPT to polish the reply but the starting reply was created by me then polished with the help of ChatGPT.

https://openai.com/index/gpt-5-system-card/

Best practices for using Claude Opus 4.7 with Claude Code

Prompting best practices

Using Claude Code: session management and 1M context

Note: I do not actively use the 1M context and it eats tokens faster.

Post #7

nathanl

I doubt there’s any conspiracy here. Probably just operational difficulties.

But FWIW, there are other ways to use these models. For example, we use Opus via Opencode talking to Amazon Bedrock, so it’s running on Amazon’s infrastructure, not Anthropic’s. I haven’t noticed the kinds of issues people talk about with Claude Code. And as a bonus, in theory we could switch to another vendor’s model (although so far Opus has been great, and I like Anthropic more than I like its competitors).

Post #8

EricGT

Really, I can not edit my own post after a few hours.

An update on recent Claude Code quality reports

Post #9

DaAnalyst

Btw, this morning it got a bit lazy/superficial. Asked it about the slip-ups in execution and it admitted being lazy. Just had it add “Don’t ever EVER be lazy!” to its project memory at the very top. It also added the 3 instances of the morning laziness as reminders/arguments on its own initiative

Post #10

Last Post!

pawoc50825

Since you’re already juggling multiple plans trying to get best value, you might end up liking GLM-5.1
(it’s opensource, there are many independent providers)

Post #30

Where Next?

View thread on forum (has 30 replies!)

Switch to Best Posts mode

claude-code

Home Chat & Discussions>AI / LLMs

#claude-code

49 2046 30

Last post

REPLY VIEW THREAD WITH 30 POSTS

Trending in AI / LLMs

Chat & Discussions>AI / LLMs

Is anyone working on "AI Agents" in Elixir?

For those who are not aware, “AI agents” are, for the most part, commodity LLMs which are given access to “tools” and prompted to complet...

#ai

117 5679 26

2026-07-20 19:37:42 UTC

New

Chat & Discussions>AI / LLMs

Anyone vibe-coded/vibe-converted a Rails app to Phoenix?

Anyone vibe-converted a Rails app to Phoenix? How did it go? Which tools did you use? Any tips? Asking for a friend :sweat_smile:

/phoenix #rails

8 305 5

2026-08-01 01:43:10 UTC

New

Chat & Discussions>AI / LLMs

A web terminal built for the AI era: long-running tasks survive restarts, Fork-grade git review built in, CodeMirror everywhere

I built a tmux alternative in Elixir, focused on remote development in the AI era. It makes using remote AI agents feel just like running...

#ai

4 325 0

2026-07-21 02:39:24 UTC

New

Chat & Discussions>AI / LLMs

Anyone vibe-coded/vibe-converted a Rails app to Phoenix?

Chat & Discussions>AI / LLMs

A web terminal built for the AI era: long-running tasks survive restarts, Fork-grade git review built in, CodeMirror everywhere

Chat & Discussions>AI / LLMs

Just_bash - a bash interpreter + virtual filesystem in Elixir (and how we use it to power an agent in production)

Chat & Discussions>AI / LLMs

ExBashkit - an elixir wrapper for bashkit, a bash sandbox for LLMs

Chat & Discussions>AI / LLMs

Matt Pocock like skills for Elixir

Chat & Discussions>AI / LLMs

What exactly is an AI loop?

Chat & Discussions>AI / LLMs

Tokenware is the new form of donation

Chat & Discussions>AI / LLMs

How much would you pay for Claude given your current experience with it?

Chat & Discussions>AI / LLMs

Successful development with local AI setup

Chat & Discussions>AI / LLMs

A task class that's going to wait for at least a year before I try giving it to Claude again

Chat & Discussions>AI / LLMs

Chat AI / LLMs ❯

Latest on Elixir Forum

Help SWAR optimize URI.encode_www_form/1

Questions & Help>Thoughts On...

Elixir-google-api deprecated, reason and alternatives?

Questions & Help>Questions

Profiling Rust NIFs in Elixir

Blogs & Podcasts>Blog Posts

Leaving a BEAM cluster (basically) unattended for years - Yuri Oliveira | ElixirConf US

Learning Resources>Talks

Finitomata - 1st release candidate

News>News & Updates

Learning Elixir: Creating Modules

Blogs & Podcasts>Blog Posts

Based Integers - fast BaseN codecs for integers

News>Announcing

Localize reaches 1.0

News>News & Updates

BEAM There, Done That with Mike Williams & Björn Gustavsson on Building the JAM

Blogs & Podcasts>Podcasts

AshScylla - ScyllaDB data layer for Ash Framework

News>Announcing

Anyone vibe-coded/vibe-converted a Rails app to Phoenix?

Chat & Discussions>AI / LLMs

Building a Stateful Process in Elixir Without GenServer

Blogs & Podcasts>Blog Posts

GreenCal - agricultural sun & moon calendar, pure Elixir, zero dependencies

News>Announcing

Facturx - pure-Elixir Factur-X / ZUGFeRD (EN 16931 e-invoices)

News>Announcing

Code BEAM Europe 2026 is looking for volunteers

Chat & Discussions>Chit Chat

Elixir Forum ❯

Sub Categories:

Forums

We're in Beta

About us Mission Statement

Options

Show Best Posts Skip Thread Previews

Any first-hand experience with Claude model getting "nerfed" down after using it for a while?

DaAnalyst

Any first-hand experience with Claude model getting "nerfed" down after using it for a while?

First 10 of 30 Posts!

oleksify

DaAnalyst

Vidar

DaAnalyst

Vidar

oleksify

EricGT

User-side factors

Vendor-side factors

nathanl

EricGT

DaAnalyst

Last Post!

pawoc50825

Where Next?

Trending in AI / LLMs

Is anyone working on "AI Agents" in Elixir?

Anyone vibe-coded/vibe-converted a Rails app to Phoenix?

A web terminal built for the AI era: long-running tasks survive restarts, Fork-grade git review built in, CodeMirror everywhere

Other Trending Topics

Dexter - A fast, full-featured Elixir LSP optimized for large codebases

Beam Bots - Resilient Robotics on the BEAM

Emerge & Solve - a GUI framework for Elixir

Emily - A new MLX-based backend for Nx

Elixir-lang.org redesign

Programming Nerves (self-published)

Chat & Discussions>AI / LLMs

Latest on Elixir Forum

Categories:

Sub Categories:

Forums

Popular Tags

We're in Beta

Options

Any first-hand experience with Claude model getting "nerfed" down after using it for a while?

DaAnalyst

Any first-hand experience with Claude model getting "nerfed" down after using it for a while?

First 10 of 30 Posts!

oleksify

DaAnalyst

Vidar

DaAnalyst

Vidar

oleksify

EricGT

User-side factors

Vendor-side factors

nathanl

EricGT

DaAnalyst

Last Post!

pawoc50825

Where Next?

Trending in AI / LLMs

Is anyone working on "AI Agents" in Elixir?

Anyone vibe-coded/vibe-converted a Rails app to Phoenix?

A web terminal built for the AI era: long-running tasks survive restarts, Fork-grade git review built in, CodeMirror everywhere

Other Trending Topics

Dexter - A fast, full-featured Elixir LSP optimized for large codebases

Beam Bots - Resilient Robotics on the BEAM

Emerge & Solve - a GUI framework for Elixir

Emily - A new MLX-based backend for Nx

Elixir-lang.org redesign

Programming Nerves (self-published)

Chat & Discussions>AI / LLMs

Latest on Elixir Forum

Sponsor Spotlight

Our Sponsors

Categories:

Sub Categories:

Forums

Popular Tags

Our Sponsors

We're in Beta

Options