I think the best models right now that most people can fit at some quantization level on their computer, whether it's an Apple Silicon Mac or a gaming PC, would be:
For non-coding:
Qwen3-30B-A3B-Instruct-2507 (or the thinking variant, depending on use case)
For coding:
Qwen3-Coder-30B-A3B-Instruct
---
If you have a bit more VRAM, try GLM-4.5-Air or the full GLM-4.5.
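To check whether a given quantization will fit, a rough rule of thumb is total parameters times bits per weight, divided by 8 to get bytes. Here is a minimal sketch of that arithmetic. The `weight_gib` helper is my own name, and the parameter counts are approximate figures I'm assuming from the model cards, so double-check them. It only counts the weights, so leave extra headroom for the KV cache and runtime overhead.

```python
def weight_gib(params_billions: float, bits_per_weight: float) -> float:
    """Approximate size of the quantized weights alone, in GiB."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Approximate total parameter counts in billions (assumed; verify on each model card).
MODELS = {
    "Qwen3-30B-A3B": 30.5,
    "GLM-4.5-Air": 106,
    "GLM-4.5": 355,
}

for name, params in MODELS.items():
    sizes = ", ".join(
        f"{bits}-bit ~{weight_gib(params, bits):.0f} GiB" for bits in (4, 8)
    )
    print(f"{name}: {sizes}")
```

Real quantized files usually average a bit more than the nominal bits per weight, so treat these numbers as a lower bound.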