Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

It’s interesting how the Qwen team more or less proved that hybrid reasoning doesn’t work and makes things worse. The fact that this model is almost on par with the bigger model in non thinking mode (old, they released a non hybrid model recently) is crazy.


Qwen3 32B is a hybrid reasoning model and is very good. You have to generate a lot of think tokens for any agentic activity but you will probably run the model locally and it wont be a problem. If you need something quick and simple, /no_think is good enough in my experience. It might also be because its not a moe architecture


Qwen3 32B was a hybrid model that came out in April, but these new Qwen July models have all ditched the hybrid mechanism and are either thinking or non-thinking.


By Qwen3-32B you mean the first released version from late April? I don't think Qwen3-32B-2507 has been released yet.

I agree with GP that since Qwen is now releasing updated Qwen3 version without hybrid reasoning, and experience a significant performance boost in the process, it likely means that the hybrid reasoning experiment was a failure.


Isn't that because all "reasoning" approaches are very much fake? The model cannot internalise the concepts it has to reason about. For instance if you ask it why water feels wet, it is unable to grasp the concept of feeling and sensation of wetness, but will for sure "decompress" learned knowledge of people talking how it is to feel the water.


Everything about LLMs is fake. The "reasoning" trick is still demonstrably useful - the benchmarks consistently show models using that trick performing better at harder code challenges, for example.


I'd argue that what's generally considered "reasoning" isn't actually rooted in understanding either. It's just the process you apply to get to a conclusion

expressed more abstractly: is about drawing logical connections between points and extrapolating from them.

To quote the definition: "the action of thinking about something in a logical, sensible way."

I believe it's rooted in mathematics, not physics. That's probably why there is such a focus on the process instead of the result




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: