
Yeah, honestly this is a bad move on Anthropic's part. I don't think their moat is as big as they think it is. They are competing against opencode + ACP + every other model out there, and there are quite a few good ones (even open-weight ones).

Opus might currently be the best model out there, and CC might be the best tool among the commercial alternatives, but once someone switches to opencode + multiple model providers depending on the task, Anthropic is going to have difficulty winning them back, considering its pricing and locked-down ecosystem.

I went from Max 20x and ChatGPT Pro to Claude Pro and ChatGPT Plus + OpenRouter providers, and I have now cancelled Claude Pro and ChatGPT Plus, keeping only Gemini Pro (super cheap) and using OpenRouter models + a local AI workstation I built running MiniMax M2.1 and GLM 4.7. I use Gemini as the planner and my local models as the churners. Works great; the local models might not be as good as Opus 4.5 or Sonnet 4.7, but they are consistent, which is something I had been missing with all commercial providers.
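
Mechanically it's nothing fancy; any OpenAI-compatible client can do the split. A rough Python sketch of the planner/churner idea (endpoints and model names are illustrative, not my exact config):

    from openai import OpenAI

    # Planner: Gemini via Google's OpenAI-compatible endpoint.
    planner = OpenAI(
        base_url="https://generativelanguage.googleapis.com/v1beta/openai/",
        api_key="GEMINI_API_KEY",  # placeholder
    )

    # Churner: a local model served on this machine (vLLM, llama.cpp, etc.).
    churner = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")

    plan = planner.chat.completions.create(
        model="gemini-2.5-pro",  # illustrative model name
        messages=[{"role": "user", "content": "Plan this refactor as numbered steps: ..."}],
    ).choices[0].message.content

    step = churner.chat.completions.create(
        model="local-model",  # whatever your local server has loaded
        messages=[{"role": "user", "content": f"Do step 1 of this plan:\n{plan}"}],
    ).choices[0].message.content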



Disagree. It is much better for Anthropic to bundle than to become "just another model provider" to opencode/other routers.

As a consumer, I absolutely prefer the latter model, but I don't think that is the position I would want to be in if I were Anthropic.


Nah, Anthropic thinks they have a moat; this is a classic Apple move, but they ain't Apple.


They do have a moat. Opus is currently much better than every other model, except maybe GPT-5.2.


At using tools, sure. If Gemini 3 Pro GA is good at tool use, the moat is gone.


> I went from Max 20x and ChatGPT Pro to Claude Pro and ChatGPT Plus + OpenRouter providers, and I have now cancelled Claude Pro and ChatGPT Plus, keeping only Gemini Pro (super cheap) and using OpenRouter models + a local AI workstation I built running MiniMax M2.1 and GLM 4.7. I use Gemini as the planner and my local models as the churners. Works great; the local models might not be as good as Opus 4.5 or Sonnet 4.7, but they are consistent, which is something I had been missing with all commercial providers.

You went from a 5-minute signup (and 20-200 bucks per month) to probably weeks of research (or prior experience setting up workstations) and days of setup. Also probably a few thousand bucks in hardware.

I mean, that's great, but tech companies are a thing because convenience is a thing.


My first switch was to opencode + OpenRouter. I used it to try mixing models for different tasks and to try open-weight models before committing to the hardware.

Even paying API pricing, it was significantly cheaper than the nearly $500 I had been paying monthly; I was spending about $100/month combined between Claude Pro, ChatGPT Plus, and OpenRouter credits.

Only when I knew exactly the setup I wanted locally did I start looking at hardware. That part has been a PITA since I went with AMD for budget reasons, and it looks like I'll be writing my own inference engine soon, but I could have gone with Nvidia and had far fewer issues (for double the cost: dual Blackwells vs quad Radeon W7900s for 192GB of VRAM).

If you spend twice what I did and go Nvidia, you should have nearly no issues running any models. But using OpenRouter is super easy; there are always free models (Grok famously was free for a while), and there are very cheap and decent ones.
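
For reference, OpenRouter is just an OpenAI-compatible endpoint, so trying a model is a few lines (sketch; the model ID is illustrative, and the free/cheap models rotate):

    from openai import OpenAI

    client = OpenAI(
        base_url="https://openrouter.ai/api/v1",
        api_key="OPENROUTER_API_KEY",  # placeholder
    )
    resp = client.chat.completions.create(
        model="minimax/minimax-m2.1",  # illustrative model ID
        messages=[{"role": "user", "content": "Explain this stack trace: ..."}],
    )
    print(resp.choices[0].message.content)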

None of this matters if you aren't paying for your AI usage out of pocket. I was, so Anthropic's and OpenAI's value proposition vs basically free Gemini + OpenRouter or local models is just not there for me.


> but I could have gone with Nvidia and had far fewer issues (for double the cost: dual Blackwells vs quad Radeon W7900s for 192GB of VRAM).

> If you spend twice what I did and go Nvidia, you should have nearly no issues running any models.

I googled what a Radeon W7900 costs, and the result on Amazon was €2,800 apiece. You say "quad", so that's €11,200 (and that's just the GPUs).

You also say "spend twice what I did", which would put the hardware cost at ~€25,000 total.

Excuse me, but this is peak HN detachment from the experience of most people. You propose spending the cost of a car on hardware.

The average person will just pay Anthropic €20 or €100 per month and call it a day, for now.


I see a ton of my peers driving around in 80k cars. I drive a 20k used one.

I'm planning on writing a ROCm inference engine anyway, or at least contributing to the ROCm vLLM or SGLang implementations for my cards, since I'm interested in the field. Funnily enough, I wouldn't consider myself bullish on AI; I just want to really learn the field so I can evaluate where it's heading.

I spent about $10k on the cards, though the upgrades were piecemeal as I found them cheap. I still have to get custom water blocks for them, since the original W7900s (which are cheap) are triple-slot, so you can't fit four of them in any sort of workstation setup (I even looked at rack-mount options).

I bought a used Threadripper Pro WRX80 motherboard ($600), the cheapest TR Pro CPU for the board (3945WX, $150), and three 128GB DDR4-3200 sticks at $230 each before the craze; I was planning on populating all 8 channels if prices went down a bit. Each stick is now $900, more than I paid for all three combined ($730 with S&H and taxes), so the system is staying as is until prices come down.
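
For what it's worth, once the cards are in, the serving side is the easy part; vLLM does tensor parallelism out of the box. A sketch, assuming your model actually runs on ROCm (which is exactly the part that's been a PITA); the model ID is illustrative:

    from vllm import LLM, SamplingParams

    # Shard one model across the four W7900s with tensor parallelism.
    llm = LLM(
        model="zai-org/GLM-4.6",  # illustrative model ID
        tensor_parallel_size=4,
    )
    params = SamplingParams(temperature=0.2, max_tokens=512)
    out = llm.generate(["Write a binary search in Python."], params)
    print(out[0].outputs[0].text)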

For AI-assisted programming, the best value prop by far is Gemini (free) as the orchestrator + opencode using either free models, Grok / MiniMax / GLM through their very cheap plans (for MiniMax or GLM), or OpenRouter, which is very cheap. You can also find interesting providers like Cerebras, which gets silly-fast token generation and enables interesting use cases.
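
If you want to see what I mean by silly-fast, stream from the endpoint and time it; this sketch works against any OpenAI-compatible provider (base URL, key, and model are placeholders):

    import time
    from openai import OpenAI

    client = OpenAI(base_url="https://api.example.com/v1", api_key="KEY")  # placeholder
    start, chunks = time.time(), 0
    stream = client.chat.completions.create(
        model="some-model",  # placeholder
        messages=[{"role": "user", "content": "Explain TCP slow start."}],
        stream=True,
    )
    for chunk in stream:
        if chunk.choices and chunk.choices[0].delta.content:
            chunks += 1  # chunk count is a rough proxy for tokens
    print(f"~{chunks / (time.time() - start):.0f} chunks/sec")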


On opencode you can use models that are free for unlimited use, or pick models that cost only about $15 a month for unlimited use.


OP mentioned a local LLM rig plus a rather complex setup from what I understood (I could be wrong).

Also, most of the lower-end models aren't that good. At this point you can take an experienced dev and start implementing an app using any mature stack (I'm talking even about stuff like Ada, so not just C/C++, JS, etc.) on any mainstream platform (big 3 desktop + big 2 mobile + web) and get quite far with Claude Code. By "far" I mean you'll do at least the first 80% really quickly, at which point the "experienced dev" needs to take over. I think you can even get 95% of the way there. And that's with a stack the dev is unfamiliar with at the start.


Claude Code has aggressive limits on the $18/month plan. You can get much farther with MiniMax 2.1 or Qwen3 for the same amount of money. I have noticed Opus is much better in some scenarios, but MiniMax and Qwen3 are not far behind. Or maybe my setup is now too used to the quirks of those models, who knows. I am using Claude Haiku as the small model anyway, just not Opus, because it burns credits very quickly.


> a local AI workstation

Peak HN comment



