More

homarp · 2026-04-22T17:32:16 1776879136

Coding Agent Adaptation Lets a 9B LLM Outperform 10x Larger Models on Aider Polyglot Benchmark

and as an output, a coding agent optimized to smaller LLMs: https://github.com/itayinbarr/little-coder

homarp · 2026-04-21T15:22:09 1776784929

interesting that using AI models from China is not discussed.

e.g. Apple buys moonshot or z.ai

homarp · 2026-04-19T05:18:47 1776575927

like meteorites?

helsinkiandrew · 2026-04-19T09:21:27 1776590487

Craft?

homarp · 2026-04-18T17:57:49 1776535069

announce discussion https://news.ycombinator.com/item?id=8548429

homarp · 2026-04-16T09:02:56 1776330176

it is called llama-barn https://github.com/ggml-org/LlamaBarn

adrian_b · 2026-04-16T11:19:26 1776338366

LlamaBarn is the MacOS app, not the HTTP API server, which is "llama-server".

On non-Apple PCs, "llama-server" is what you use, and you can connect to it either with a browser or with an application compatible with the OpenAI API.

Perhaps using "llama-server" as the name of the project would have been less confusing for newbies than "llama.cpp".

I confess that when I first heard about "llama.cpp" I also thought that it is just a library and that I have to write my own program in order to implement a complete LLM inference backend.

mastermage · 2026-04-17T07:35:16 1776411316

this looks nice but is macos only.

homarp · 2026-04-16T09:00:46 1776330046

check on same port, there is an OpenAI API https://github.com/ggml-org/llama.cpp/tree/master/tools/serv...

teekert · 2026-04-16T09:33:47 1776332027

Good stuff, thanx!

homarp · 2026-04-16T07:58:09 1776326289

like someone said above: brew install llama.cpp

llama-server -hf ggml-org/gemma-4-E4B-it-GGUF --port 8000 (with MCP support and web chat interface)

and you have OpenAI API on the same 8000 port. (https://github.com/ggml-org/llama.cpp/tree/master/tools/serv... lists the endpoints)

AndroTux · 2026-04-16T13:29:51 1776346191

And why do I use ggml-org/gemma-4-E4B-it-GGUF instead of one of the 162 other models that can be found under the ggml-org namespace? And how do I even know that this is the namespace to look at?

That's what I meant by model management. I'm too tired to scroll through a bazillion models that all have very cryptic names and abbreviations just to find the one that works well on my system with my software stack.

I want a simple interface that a tool like me can scroll through easily, click on, and then have a model that works well enough. If I put in that much brain power to get my LLM working, I might as well do the work myself instead of using an LLM in the first place.

throwa356262 · 2026-04-16T15:09:12 1776352152

1. Go to HF

2. Choose the model they recommend

3. Run the one-liner the site gives you

Bonus: faster access to latest models and better memory usage

AndroTux · 2026-04-17T08:00:22 1776412822

The first model I see on the HF homepage is this one: MiniMaxAI/MiniMax-M2.7

Do you think that this 229B parameter model will work on my consumer PC?

Stop pretending like HF is in any way beginner friendly.

homarp · 2026-04-14T06:24:33 1776147873

https://www.quantamagazine.org/about/ says "launched by the Simons Foundation in 2012"

and https://www.simonsfoundation.org/about/ has "Since its founding in 1994 by Jim and Marilyn Simons"

https://en.wikipedia.org/wiki/Jim_Simons explains how Jim Simons got rich.

The book 'The Man Who Solved the Market' - https://www.gregoryzuckerman.com/the-books/the-man-who-solve... is a nice read.

HN discussion on a review of the book - https://news.ycombinator.com/item?id=29392041

homarp · 2026-04-13T16:52:55 1776099175

it is an Ethereum fork, named after Jan Zurich (a cousin of the famous Chief Niklaus Emil Wirth). Jan Zurich discovered a little moon on Uranus, and named it Blaise

see timeline on https://ethereum.org/ethereum-forks/

Rochus · 2026-04-13T17:03:49 1776099829

It’s also worth noting that Jan (who strictly uses the pronouns var / val) belongs to one of the most historically marginalized groups in modern tech: One-Pass Compiler Enthusiasts. They were repeatedly ostracized by the bloated LLVM cabal for stating that any build process taking longer than 50 milliseconds is a toxic social construct. The ETH fork was actually meant to fund a decentralized safe space where nobody is ever forced to use a borrow checker.

homarp · 2026-04-11T16:30:45 1775925045

related discussion https://news.ycombinator.com/item?id=40338443

jjuran · 2026-04-12T07:29:23 1775978963

Funny you should bring up MacRelix — the very first front end for Advanced Mac Substitute was built in it.