
I know it isn’t your question exactly, and you probably know this, but the models behind coding-assist tools are generally fine-tunes of base models, specialized for coding. Example: OpenAI Codex uses GPT-5-Codex.


I think the question is: can I throw a couple thousand bucks of GPU time at fine-tuning a model so that knowledge of our couple million lines of C++ is baked into the weights, instead of needing to fuck around with "context engineering"?

Like, how feasible is it for a mid-size corporation to use a technique like LoRA, mentioned by GP, to "teach" (say) Kimi K2 about a large C++ codebase, so that individual engineers don't need to learn the black art of "context engineering" and can just ask it questions?
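
For a rough sense of the mechanics, here's a minimal sketch of that kind of run with HuggingFace PEFT. Everything in it is an assumption on my part: the stand-in model, the glob, and the hyperparameters. Kimi K2 itself is far too large to tune on that budget, so a smaller open-weights code model substitutes.

    # Sketch: continued pretraining with LoRA over a C++ tree.
    # Model name, glob, and hyperparameters are placeholders.
    from datasets import load_dataset
    from peft import LoraConfig, get_peft_model
    from transformers import (AutoModelForCausalLM, AutoTokenizer,
                              DataCollatorForLanguageModeling,
                              Trainer, TrainingArguments)

    base = "Qwen/Qwen2.5-Coder-7B"   # stand-in, not Kimi K2
    tok = AutoTokenizer.from_pretrained(base)
    tok.pad_token = tok.pad_token or tok.eos_token
    model = AutoModelForCausalLM.from_pretrained(base)

    # Low-rank adapters on the attention projections; the base
    # weights themselves stay frozen throughout.
    model = get_peft_model(model, LoraConfig(
        r=16, lora_alpha=32, lora_dropout=0.05,
        target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
        task_type="CAUSAL_LM"))
    model.print_trainable_parameters()   # typically <1% of the base

    # Treat every source file as a plain training document.
    ds = load_dataset("text", data_files={"train": "src/**/*.cpp"})
    ds = ds.map(lambda b: tok(b["text"], truncation=True,
                              max_length=2048),
                batched=True, remove_columns=["text"])

    Trainer(model=model,
            args=TrainingArguments("cpp-lora", num_train_epochs=1,
                                   per_device_train_batch_size=1,
                                   gradient_accumulation_steps=16,
                                   learning_rate=2e-4, bf16=True),
            train_dataset=ds["train"],
            data_collator=DataCollatorForLanguageModeling(
                tok, mlm=False)).train()

One caveat: plain next-token training on raw source tends to bake in your APIs and style, but doesn't by itself produce good Q&A behavior about the codebase.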


I'm curious about it too. I think there are two bottlenecks: one is that training a relatively large LLM is resource-intensive (so people go for RAG and other shortcuts), and the other is that fine-tuning it to your use cases might make it dumber overall.


> fine-tuning it to your use cases might make it dumber overall

LoRA doesn't overwrite weights. It trains small low-rank adapter matrices that are added on top of the frozen base weights.
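
To make that concrete, here's a toy sketch of the idea; shapes and hyperparameters are illustrative, not any particular library's implementation:

    # Toy LoRA layer: the frozen base weight W is never modified;
    # a low-rank update is added at forward time instead.
    import torch
    import torch.nn as nn

    class LoRALinear(nn.Module):
        def __init__(self, base: nn.Linear, r=8, alpha=16):
            super().__init__()
            self.base = base
            self.base.weight.requires_grad_(False)  # freeze W
            # A is small-random, B is zero, so the adapter starts
            # as a no-op and the model is initially unchanged.
            self.A = nn.Parameter(torch.randn(r, base.in_features) * 0.01)
            self.B = nn.Parameter(torch.zeros(base.out_features, r))
            self.scale = alpha / r

        def forward(self, x):
            # y = x W^T + scale * (x A^T) B^T -- W itself untouched
            return self.base(x) + self.scale * (x @ self.A.T @ self.B.T)

    layer = LoRALinear(nn.Linear(64, 64))
    y = layer(torch.randn(2, 64))  # only A and B get gradients

Merging an adapter later just means computing W + scale * B @ A once; delete the adapter and you're back to the original model.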


Do you need to overwrite weights to produce the effect I mentioned above?


Good point


I think they fine-tune them for tool calling, not knowledge.
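
For what that looks like, here's a rough sketch of one tool-calling SFT example. The schema mirrors the common OpenAI-style chat format; the tool name and arguments are hypothetical, and no vendor's actual training data is public.

    # Rough shape of one tool-calling training example.
    example = {
        "messages": [
            {"role": "user",
             "content": "Rename `cnt` to `count` in utils.cpp"},
            {"role": "assistant",
             "content": None,
             "tool_calls": [{
                 "type": "function",
                 "function": {
                     "name": "apply_patch",  # hypothetical tool
                     "arguments": '{"path": "utils.cpp", '
                                  '"find": "cnt", '
                                  '"replace": "count"}',
                 }}]},
        ]
    }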



