We've done some vibe checks on it with OpenHands and it indeed performs roughly as good as Sonnet 4.5.
OSS models are catching up
We've done some vibe checks on it with OpenHands and it indeed performs roughly as good as Sonnet 4.5.
OSS models are catching up