Hacker Newsnew | past | comments | ask | show | jobs | submit | tempoponet's commentslogin

For the huge percentage of devs using vscode, switching to Cursor was essentially adding a new color theme and a chat window. The CLI switch was far more radical.

Yeah, I was a vscode user back then. The problem was realizing that cursor was going to be enshittified... and MS would start enshittifying vscode to compete as well.

The memory upgrade is $1k on a Macbook Pro. The laptop is ~$5500.

Expect to pay $4k-10k

- Your RTX 6000 is closer to $10k now

- Sparks are creeping into the $4-5k range

- AMD Strix are ~3.5k

- Apple depends on chipset and memory. Sweet spot would be 128gb M3 Ultra, probably $6-8k but admittedly haven't been tracking closely. New M5 might come in the fall. You can get a new 128gb M5 Max laptop for ~5-6k today.

- a 4x3090 rig would take $5-6k

Every platform has tradeoffs, but it's mostly ecosystem, memory bandwidth, and power consumption. They're all slow. The best option is likely to rent hardware on Runpod. The RIO on self-hosting is very low unless you have a specific need or you're ok treating it as a hobby.


Bosgame M5 (Strix Halo) w/ 128 GB still goes for $2800 right now. SH systems have surged in price dramatically but quite unevenly.

>The best option is likely to rent hardware on Runpod.

Vast.ai is much cheaper, but the broader point here is contestable. The only dimension in which cloud GPU rentals win is cost. You lose the confidentiality, integrity, and availability benefits of local deployments.


Rentals are priced to pay themselves off in 1-1.5 years (when renting them out per hour, not selling tokens). Its never a better option to rent.

Not that I'd encourage anyone to throw large amounts of money to have access to LLMs, but you're definately going to be better off buying something that you can amortize over multiple years with a multi year warranty.


$2600 gets MBP M5 Pro 48gb. 64gb requires a Max which bumps it to $4200 at which point you may as well spend the $800 to go to 128gb.

And for what? Spend 10-15k for the slopiest of slop code, non deterministic automations, and the ability to spawn an AI gf?

This whole thing is really starting to remind me of the crypto hype phases of 2016-2018 when everyone thought their investment in GPUs was going to make them rich.


I upvoted your comment even though I disagree with you.

Yes, LLMs are sloppy, and local models usually more so (but things change fast).

But the local ones have one big advantage: they are private. So you can safely feed them the collection of your private documents and things you wouldn't trust people like sama with. The fact that some people do not care is one of the failures of our educational system.


It is possible to get real work done with LLMs. There are plenty of ethical concerns, and they're definitely over-hyped, but they are exceptionally useful tools when used well.

These models contain a wealth of knowledge that is being censored, not just deliberately, but by training data bias. Fine-Tuning and steering can produce unexpected new insights. For example a model that is trained to believe so-called "conspiracy theories", which many believe to be the ground truth.

which models do you have in mind? grok from xai?

Why this entire tool chain instead of building within something like pi code?

I've been exploring this area and a project like https://github.com/itayinbarr/little-coder (not my work) lets me mix and match with my current setup or any plugins built for pi.


Mainly because I have plenty of use cases and not all of them need or want pi. Forge isn't an orchestration framework and is not coding specific, it lives one level lower - if I understand pi correctly.

The proxy mode should integrate seamlessly, and the middleware guardrail mode could be lifted into pi.

As for little coder, I love it! I wanted forge to be more generic than just agentic coding as there's many more agentic workflows worth optimizing with small models.


Thank you for the thoughtful reply. I have some smaller 3080's I'm looking to place and this sounds like a good opportunity.

It's older, but Hermes Pro 2 (same lab as Hermes agent) is a fine-tune of Mistral 7b for tool calling and structured outputs.

This isn't for agentic loops, though. This is for turning simple requests into API calls.


Similarly, the agentic coding success stories are from orgs that had all of these things out of the gate.


Or had the sense to build the guidelines without trying to rely on writing fanfiction to guide the LLM.


It reads like the inventors of Claude can't get Claude to apply a "human in the loop" workflow.


I think they just honestly can't afford it. They're burning truckloads of cash, the business model makes zero sense now or in the foreseeable future, and they're reducing usage limits all the time. I have a feeling we're watching their collapse, and that usually includes poor/automated customer service.


You can fine-tune a model, but there are also smaller models fine-tuned for specific work like structured output and tool calling. You can build automated workflows that are largely deterministic and only slot in these models where you specifically need an LLM to do a bit of inference. If frontier models are a sledgehammer, this approach is the scalpel.

A common example would be that people are moving tasks from their OpenClaw setup off of expensive Anthropic APIs onto cheaper models for simple tasks like tagging emails, summarizing articles, etc.

Combined with memory systems, internal APIs, or just good documentation, a lot of tasks don't actually require much compute.


Rather, Imagine you have 2-3 of these working 24/7 on top of what you're doing today. What does your backlog look like a month from now?


You're right for the same reason that the original iPhone outranks the iPhone 3Gs.


These subjective evals are why community reviews are garbage.

Personally I think the 3GS is a better product. I know a few folk who returned their original iPhone because the headphone jack didn’t allow their headphones to connect, and there were obvious limitations that weren’t addressed until the 3GS

The iPhone was revolutionary no argument. But that doesn’t mean later revs were not better products for their time.


I think the 4 was really where it took off. It’s remembered for the antenna PR mess, but it was the first mix of speed and features that made me and many many colleagues say “this could be better than my BlackBerry.” And it was!


I think you're derive specificity from the data that simply isn't there. That's not a data problem, that's an interpretation problem.

The survey didn't ask about reliability, success, or functionality.


The survey didn’t ask anything. It shows two products and “which one would you prefer”

I stand by my interpretation. Who is going to prefer the OG iPhone to the refined and improved version? Nobody that I knew at the time.


No, it said more than that, the full prompt on the page is:

> Rank Your Top 50

> Help us pick the best Apple products of the last 50 years! Just choose which of the two randomly paired options you prefer.

This is explicitly invoking a context of historical importance. Some of these products are 50 years old, not available, and completely obsolete. A reader would be silly to interpret this as a survey to construct a buyers guide.


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: