More

jeffhuys · 2026-05-30T12:13:18 1780143198

It is generally appreciated you include counter-arguments if you challenge the point.

abalashov · 2026-05-30T12:16:06 1780143366

I normally would, but in this case, I simply don't know where to start. The counterargument is that LLMs just don't shift the graph on productivity remotely enough to entertain this level of doomerism.

Now, it may be that people are extrapolating comparable, but non-LLM successes in AI based on our current "AI" cultural moment, but if that's the premise, I wish that were made a little clearer.

jeffhuys · 2026-05-05T17:51:21 1778003481

Check chatjimmy.ai

lelandbatey · 2026-05-05T19:50:43 1778010643

https://chatjimmy.ai being a demo of the "burn the model to an ASIC" approach being sold by Taalas[0], an approach which they use to run Llama 3.1 8B at ~17000 tokens per second.

[0] - https://taalas.com/products/

snek_case · 2026-05-06T03:19:36 1778037576

Not to downplay their accomplishment but Llama 3.1 8B is a terrible model. It's really outdated at this point. It's cool that they were able to accelerate a model with silicon, but it also feels wasteful since llama 8B is such a useless model?

puilp0502 · 2026-05-06T06:38:10 1778049490

I guess their point was to demonstrate that it's possible to bake a decently-sized model to a silicon? As with anything related to HW, I guess the lead time will be considerably larger than the software counterparts, so I guess in 1-2 years timeframe we might see something like Gemma 4 baked onto a silicon.

leoedin · 2026-05-06T08:41:43 1778056903

Yeah, I think the important part is the process to convert the model to silicon, not the actual implementation itself.

Whether it succeeds now depends a lot on the rate of improvement of model architecture. They're betting on model design and capability improvements slowing down - and then wiping the floor with everyone else with their inference economics.

WASDx · 2026-05-06T18:20:39 1778091639

I think this is the future. When models start converging at "really good" (which I think is already happening) then burning them into ASIC silicon is the natural next step.

Harnesses can keep improving with a fixed model and the throughput opens up new possibilities like doing 10x more "thinking" or exploring parallel paths and picking the best.

imtringued · 2026-05-06T08:37:55 1778056675

I agree, Gemma 3 12B is a very good model for its size and it was only obsoleted by Gemma 4.

Heck, I'm still a fan of Gemma 2 9B.

satellite2 · 2026-05-06T22:14:21 1778105661

is it still a useless model if, say, you can run it at (prompt+output)*24/s and use it to make executive function decisions?

jeffhuys · 2026-03-02T09:45:26 1772444726

Also talk to you; that’s part of learning how to. It prepares you for rejection of all sorts.

autoexec · 2026-03-02T09:49:57 1772444997

I can't recommend being intentionally rude so that you can get practice dealing with people who are pissed off at you. Learning how to tell when it's a bad time to strike up a conversation with a stranger will be a much greater benefit for anyone looking to meet women or even for someone just working on being friendlier/more social. There'll be no shortage of opportunities to learn how to deal with rejection even without being a pest.

jszymborski · 2026-03-02T14:13:47 1772460827

> I can't recommend being intentionally rude...

As I understand, GP is recommending that folks take a risk that their drumming up a conversation might be unwelcome if they are unsure. They're not advocating for folks to harass people who are in no position to chat or who have stated as much.

People can't read minds, so I think we owe grace to the folks around us if they misread a situation and respect your wishes when you let them know that it's you're unwilling or unable to chat at the moment.

While it's great to learn social cues, it's often impossible to know whether someone is in the mood to chat.

ptero · 2026-03-02T11:55:50 1772452550

I cannot agree with that. Do not be shy about trying, but do not be a pest either. There are many times when interruptions are not good for both sides. My 2c.

jeffhuys · 2025-12-18T09:16:20 1766049380

Yea, I just read that too.

jeffhuys · 2025-11-25T17:54:58 1764093298

Yeah, imo, it’s nowhere near ready for 1.0. I was a big advocate for this browser but recently changed because of exactly this. That, and it’s very slow after having it running constantly, I found myself routinely quitting and re-opening it every hour or two to get normal speed back, or my RAM for that matter.

So I’m back on Safari.

lowbloodsugar · 2025-11-25T18:29:24 1764095364

It was using 46G of ram the other day. But I prefer it over any other browser so I just kill it every now and again.

pluralmonad · 2025-11-25T20:06:30 1764101190

How many hundreds or thousands of tabs took 40+ gigs?

lowbloodsugar · 2025-11-27T16:02:42 1764259362

Like maybe 100?

jeffhuys · 2025-11-12T07:18:48 1762931928

Ew. For some reason it looks absolutely disgusting to me.

jeffhuys · 2025-11-09T09:51:58 1762681918

It’s called suicidal empathy.

jeffhuys · 2025-11-09T09:50:02 1762681802

You know, feel free to keep thinking this. In my experience Grok is the best. I don’t let myself into weird groupthink that happens just because trolls took advantage of Grok’s absence of lobotomy. Kind of a superpower.

jeffhuys · 2025-11-04T06:36:13 1762238173

Might not be waiting for long.

ehnto · 2025-11-04T10:44:16 1762253056

There's no way I'm trusting the current driving cohort with a third dimension. If we get flying cars and they aren't completely autonomous, I am moving to the sticks.

iyn · 2025-11-04T13:13:52 1762262032

Self-flying cars? I wonder if it's actually easier to have autonomous vehicles operating in 3D than in "2D".

jeffhuys · 2025-10-23T23:32:27 1761262347

The screen is the window.

il_nets · 2025-10-24T17:05:33 1761325533

Exactly what Wowfunhappy and jeffhuys are saying!