
I don't understand your complaint at all. If you develop a revolutionary new technology called an automobile, developing steering, brakes, a starter, and mufflers for it is a pretty big deal, even if reins, clamps, keys, and mufflers are mundane and have existed for decades. Structured outputs are a pretty big step toward making this magic actually usable by developers, as opposed to generating impressive cat pictures or whatever has captured the public imagination.


I don't think it was a complaint, just an observation.


Yes, probably. But given that non-deterministic output is the nature of the beast with LLMs, and that we're (mostly) engineers here, calling any part of this mundane sounds more like fighting words than a mere observation.


Extremely pedantic, but is "non-deterministic" really the right word? The same input will always produce the same output, provided you haven't deliberately configured the system to sample non-deterministically. The better description seems to be a chaotic deterministic system: the same input always yields the same output, but small shifts in the input or the weights can produce dramatic and difficult-to-predict changes in the output.


> The same input will always produce the same output

Not guaranteed, even with the same seed. If the operations aren't performed in exactly the same order, even a simple float32 sum will produce a different final value when batched differently. And the batching depends on server load and how resources are allocated.
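
For example (a minimal Python sketch using numpy; the exact values are platform-dependent, but the inequality is typical): summing the same float32 values strictly left-to-right versus in batched groups gives different totals, because each addition rounds to float32.

    import numpy as np

    rng = np.random.default_rng(0)
    xs = rng.normal(size=100_000).astype(np.float32)

    # Strict left-to-right accumulation: rounds after every single add.
    left_to_right = np.float32(0.0)
    for x in xs:
        left_to_right += x

    # Batched accumulation: 100 partial sums, then a sum of the partials,
    # roughly how a parallel reduction groups the work.
    batched = xs.reshape(100, 1000).sum(axis=1).sum()

    # Float addition is not associative, so these typically differ.
    print(left_to_right, batched, left_to_right == batched)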


Yeah, the fact that floating-point addition isn't associative is a real pain for producing deterministic outputs, especially when you're running massively parallel computations on GPUs (or across multiple GPUs), which makes the order of operations even less predictable.


This doesn’t mean LLMs are inherently non-deterministic, just that current common implementations are non-deterministic.


LLMs are indeed non-deterministic.


Structured outputs are hard... but they claimed to have solved this a year ago.

They were lying, of course, and meanwhile charged for the output tokens of malformed JSON.


Structured output is conceptually simple: at each step, filter the model's next-token probabilities down to the tokens the schema allows, and select only from those.

Other LLM vendors figured this out many months ago.
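
A minimal sketch of that idea (with hypothetical `logits_fn` and `allowed_next` callbacks standing in for the model and a compiled schema grammar; not any vendor's actual API): mask out every token the schema would reject before picking the next one, so the output is valid by construction.

    import numpy as np

    def constrained_decode(logits_fn, allowed_next, vocab, max_steps=100):
        """Greedy decoding restricted to schema-valid tokens.

        logits_fn(prefix)    -> np.ndarray of shape (len(vocab),), model scores
        allowed_next(prefix) -> set of token ids the grammar permits next
        """
        prefix = []
        for _ in range(max_steps):
            valid = allowed_next(prefix)
            if not valid:  # grammar has reached an accepting state
                break
            mask = np.full(len(vocab), -np.inf)
            mask[list(valid)] = 0.0  # keep only schema-valid tokens
            prefix.append(int(np.argmax(logits_fn(prefix) + mask)))
        return "".join(vocab[t] for t in prefix)

The hard part in practice isn't this loop; it's compiling a JSON Schema into a fast `allowed_next` check over the tokenizer's full vocabulary, where a single token can span several characters of JSON.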


They did not. Anthropic often sends broken JSON.



