I agree that "proof of thought" is a misleading name, but this whole "computers can't think" line is making LLM skepticism seem very unscientific. There is no universally agreed-upon, objective definition of what it means to be able to "think" or how you would measure such a thing. The definition these positions seem to rely on is "a thing that only humans can do", which is obviously circular and therefore not useful.
If you believe computers can think, then you must be able to explain why a chain of dominoes is also thinking when I convert an LLM from transistor switches into the domino equivalent. If you don't fall for the marketing hype & study both the philosophical & mathematical literature on computation, then it is obvious that computers (or any mechanical gadget, for that matter) cannot qualify for any reasonable definition of "thinking", unless you agree that all functionally equivalent manifestations of arithmetic must be considered "thinking", including cascading dominoes that correspond to the arithmetic operations in an LLM.
>If you believe computers can think then you must be able to explain why a chain of dominoes is also thinking when I convert an LLM from transistor relay switches into the domino equivalent.
Sure, but if you assume that physical reality can be simulated by a Turing machine, then (computational practicality aside) one could do the same thing with a human brain.
Unless you buy into some notion of magical thinking as pertains to human consciousness.
No magic is necessary to understand that carbon & silicon are not equivalent. The burden of proof is on those who think silicon can substitute for carbon & all that it entails. I don't buy into magical thinking like Turing machines being physically realizable, because I have studied enough math & computer science not to confuse abstractions with their physical realizations.
I recently wrote a simulation of water molecules & got really confused when the keyboard started getting water condensation on it. I concluded that simulating water was equivalent to manifesting it in reality & immediately stopped the simulation because I didn't want to short-circuit the CPU.
That isn’t a definition or even a coherent attempt.
For starters, what kind of cognition or computation can’t be implemented with either logic or arithmetic?
What is or is not "cognition" is going to be a higher-level property than whatever basic, universally capable substrate is used, given that such substrates can easily simulate each other and be substituted for each other.
Even digital and analog systems can be used to implement each other to arbitrary accuracy.
The jury may be out on how to judge what 'thought' actually is. What it is not, however, is perhaps easier to perceive. My digital thermometer does not think when it tells me the temperature.
My paper-and-pen version of the latest LLM (quite a lot of paper and certainly a lot of ink, I might add) also does not think.
I am surprised that so many in the HN community have so quickly taken it as fact that LLMs think or reason, even anthropomorphising LLMs to this end.
For a group inclined to quickly call out 'God of the gaps', they have quite quickly invented their very own 'emergence'.
Lots of people consider company valuations evidence of a singularity right around the corner, but it requires a very specific kind of mindset to buy into that as "proof" of anything other than very compelling hype by people who have turned financial scams into an art form.
Sure, but when you're walking all the time, none of that time is wasted, because you're helping your body and brain function better. When you use a car, you really are wasting all your transportation time. To get the same benefits, you would have to drive places, and then go walking recreationally after, which would clearly take much more time to get the same utility.
I don't think that "arguing that something is against the rules" is in the CIA sabotage manual, because it's not generally considered sabotage. Maybe if you argue that things are against the rules when you know they aren't, in order to slow things down?
It’s not so much arguing against the rules. It’s following them to the letter when unnecessary.
It doesn’t matter that the big boss has said that purchasing a $5 knick-knack is ok. You will have that purchase go through the full procurement process, even up to and including an exhaustive search for (cheaper) alternatives.
The answer, it seems, is that it depends on what kind of code you're looking at. The post showed that `for` loops cause a lot more variable-name-biased reasoning, while `if`s and function defs/calls are more variable-name-independent.
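As a rough illustration of what variable-name bias means (the functions and names here are my own hypothetical example, not from the post or report): two loops with identical behavior, one carrying a misleading name that could nudge a model toward the wrong prediction.

```python
# Two functionally identical loops. A name-biased model may predict
# that the second returns a sum of squares, because of the variable
# name, even though the loop body never squares anything.
def sum_values(xs):
    total = 0
    for x in xs:
        total += x
    return total

def sum_values_misleading(xs):
    total_squares = 0  # misleading name: no squaring actually happens
    for x in xs:
        total_squares += x
    return total_squares

print(sum_values([1, 2, 3]), sum_values_misleading([1, 2, 3]))  # both print 6
```

A name-independent reasoner would trace the loop body and give the same answer for both; a name-biased one might report 14 for the second.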
If you're interested in a more scientific treatment of the topic, the post links to a technical report that presents the numbers in detail. This post is instead an attempt to explain the topic to a more general audience, so digging into the weeds isn't very useful.
This post actually mostly uses the subset of Python where nullability is checked. The point is not to introduce new LLM capabilities, but to understand more about how existing LLMs are reasoning about code.
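To make "the subset of Python where nullability is checked" concrete (this snippet is my own sketch of what such code looks like, not an example from the post): every `Optional` value is tested against `None` before it is used.

```python
from typing import Optional

def first_length(words: list[str]) -> Optional[int]:
    first: Optional[str] = words[0] if words else None
    if first is not None:  # the nullability check: guard before use
        return len(first)
    return None

print(first_length(["hello"]))  # prints 5
print(first_length([]))         # prints None
```

Code in this subset never dereferences a possibly-None value unguarded, which is what makes the models' reasoning about it tractable to study.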
Many people don't think we have any good evidence that our brains aren't essentially the same thing: a stochastic statistical model that produces outputs based on inputs.
Of course, you're right. Neural networks mimic exactly that, after all. I'm certain we'll see an ML model developed someday that fully mimics the human brain. But my point is that an LLM isn't that; it's a language model only. I know it can seem intelligent sometimes, but it's important to understand what it's actually doing and not ascribe feelings to it that don't exist in reality.
Too many people these days are forgetting this key point and putting a dangerous amount of faith in ChatGPT etc. as a result. I've seen DOCTORS using ChatGPT for diagnosis. Ignorance is scary.
Do biologists and neuroscientists not have any good evidence or is that just computer scientists and engineers speaking outside of their field of expertise? There's always been this danger of taking computer and brain comparisons too literally.
If you're willing to torture the analogy, you can find a way to describe literally anything as a system of outputs based on inputs. In the case of the brain-to-LLM comparison, people are inclined to do it because they're eager to anthropomorphize something that produces text they can interpret as a speaker, but it's totally incorrect to suggest that our brains are "essentially the same thing" as LLMs. The comparison is specious even on a surface level. It's like saying that birds and planes are "essentially the same thing" because flight was achieved by modeling planes after birds.
For example, they are dismal at math problems that aren't just slight variations of problems they've seen before.
Here's one by blackandredpenn where ChatGPT insisted that its solution to a problem solvable by high school / talented middle school students was correct, even after attempts to convince it that it was wrong. https://youtu.be/V0jhP7giYVY?si=sDE2a4w7WpNwp6zU&t=837
> For example, they are dismal at math problems that aren't just slight variations of problems they've seen before.
I know plenty of teachers who would describe their students the exact same way. The difference is mostly one of magnitude (of delta in competence), not quality.
Also, I think it's important to note that by "could be solved by high school / talented middle school students" you mean "specifically designed to challenge the top ~1% of them". Because if you say "LLMs only manage to beat 99% of middle schoolers at math", the claim seems a whole lot different.