LLM outputs are deterministic: there is no intrinsic source of randomness in the model itself. Randomness is added deliberately at sampling time (e.g. via the temperature parameter), and users can tune or disable it.
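To make the point concrete, here is a minimal sketch of temperature sampling (the function name and the toy logits are illustrative, not from any particular library): logits are divided by the temperature before the softmax, so temperature 0 collapses to deterministic argmax decoding and higher temperatures flatten the distribution.

```python
import math
import random

def sample_with_temperature(logits, temperature=1.0, rng=random):
    """Sample a token index from raw logits after temperature scaling."""
    if temperature == 0:
        # Greedy decoding: fully deterministic given the logits.
        return max(range(len(logits)), key=lambda i: logits[i])
    scaled = [l / temperature for l in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Draw from the resulting categorical distribution.
    r = rng.random()
    acc = 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i
    return len(probs) - 1

# Temperature 0 always picks the highest logit (index 1 here).
print(sample_with_temperature([1.0, 5.0, 2.0], temperature=0))
```

With a fixed seed and fixed hardware, even temperature > 0 is reproducible; the randomness comes entirely from the RNG, not the model.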
> But this article is based on a model of LLM code generation from 6 months ago
There hasn't been much change in models from 6 months ago. What happened is that we have better tooling to sift through the randomly generated outputs.
I don't disagree with your message. You are being downvoted because a lot of software developers are butt-hurt by it: it is going to force a change in the labor market for developers. In the same way, the author is butt-hurt that they didn't buy Bitcoin in the very early days (they were aware of it back then) and missed the boat.
Nit: in practice, even at temperature 0, production LLM implementations have some non-determinism. One reason is that floating-point addition is not associative even though the mathematical operation is, so the result of a sum can depend on the order in which the GPU's parallel reduction happens to combine the terms. For example, see: https://www.twosigma.com/articles/a-workaround-for-non-deter...
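A tiny demonstration of the non-associativity (the specific values are a standard textbook example, not from the linked article): grouping the same three terms differently gives different results, because the small term is absorbed when it is added to the large one first.

```python
# Floating-point addition is commutative but NOT associative.
a, b, c = 1e16, -1e16, 1.0

left = (a + b) + c   # a + b cancels exactly to 0.0, then + 1.0
right = a + (b + c)  # b + c rounds back to -1e16 (1.0 is absorbed)

print(left)   # 1.0
print(right)  # 0.0
```

A parallel GPU reduction effectively chooses one such grouping per run, and that grouping can vary with hardware, kernel, and thread scheduling, which is why logits can differ in the last bits and occasionally flip an argmax even at temperature 0.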
I ran into this a bit while working on my PhD research that used LLMs for steganography. The output had to be deterministic to reverse the encoding, and it was—as long as you used the same hardware. Encoding a message on a PC and then trying to decode on a phone broke everything.
> There hasn't been much change in models from 6 months ago.
I made the same claim in a widely-circulated piece a month or so back, and have come to believe it was wildly false, the dumbest thing I said in that piece.
So far the only model that has shown significant advancement and differentiation is GPT-4.5. I'd advise looking at the problem and then reading GPT-4.5's answer: it shows the difference from other "normal" models (including GPT-3.5), as it displays a considerably deeper level of understanding.
The other normal models are now more chatty and have a bit more data, but they do not show increased intelligence.