More

logicprog · 2026-03-30T11:24:37 1774869877

This is a wild misinterpretation of that ruling.

logicprog · 2026-03-29T15:42:45 1774798965

Via websearch? Or training?

logicprog · 2026-03-27T09:21:33 1774603293

The problem I'd think, for the average user, would be writing the 'then' part of any deterministic rule — that would require coding, or at least some kind of automation script (visual or otherwise) that's basically coding in a trench coat, which for most people is still a barrier to entry and annoying. I think that's why they'd use AI tbh — they can just describe what they want in natural language with AI.

logicprog · 2026-03-24T10:41:35 1774348895

If LLMs can come up with formerly truly novel solutions to things, and you have a verification loop to ensure that they are actual proper solutions, I don't understand why you think they could never come up with solutions to impressive problems, especially considering the thread we are literally on right now? That seems like a pure assertion at this point that they will always be limited to coming up with truly novel solutions to uninteresting problems.

eru · 2026-03-24T11:20:57 1774351257

"Truly novel" is fast becoming a True Scotsman.

getnormality · 2026-03-24T13:32:32 1774359152

No True Novelty, No True Understanding, etc.

The problem with these bromides is not that they're wrong, it's that they're not even wrong. They're predictive nulls.

What observable differences can we expect between an entity with True Understanding and an entity without True Understanding? It's a theological question, not a scientific one.

I'm not an AI booster by any means, but I do strongly prefer we address the question of AI agent intelligence scientifically rather than theologically.

kelseyfrog · 2026-03-24T19:17:27 1774379847

We've tested this in the small with AI art. When people believe they're viewing human-made art which is later revealed to be AI art, they feel disappointed. The actual content is incidental, the story that supports it is more important than the thing itself.

It's the same mechanism behind artisanal food, artist struggles, and luxury goods. It is the metaphysical properties we attach to objects or the frames we use to interpret strips of events. We author all of these and then promptly forget we've done so, instead believing they are simply reality.

coldtea · 2026-03-25T00:58:29 1774400309

>The actual content is incidental, the story that supports it is more important than the thing itself.

The actual content of a work of art is the expression of lived experience. Not its form.

Marha01 · 2026-03-24T18:28:42 1774376922

Well said. That's exactly what has been rubbing me the wrong way with all those "LLMs can never *really* think, ya know" people. Once we pass some level of AI capability (which we perhaps already did?), it essentially turns into an unfalsifiable statement of faith.

mikkupikku · 2026-03-24T16:16:08 1774368968

Agreed. We should be asking what the machines measurably can or can't do. If it can't be measured, then it doesn't matter from an engineering standpoint. Does it have a soul? Can't measure it, so it doesn't matter.

eru · 2026-03-24T23:42:33 1774395753

That's a bit too pessimistic. Often times you can productively find some measurable proxy for the thing you care about but can't measure. Turing's test is a famous example, of that.

Sometimes you only have a one-sided proxy. Eg I can't tell you whether Claude has a soul, but I'm fairly sure my dishwasher ain't.

hnfong · 2026-03-25T02:16:41 1774405001

> Turing's test is a famous example

Ironically, the Turing test is the OG functionalist approach. The GP's comment basically sums up with the Turing test was designed for.

eru · 2026-03-25T02:49:09 1774406949

Yes, but I interpret Turing's paper not as saying "souls don't matter", but as "here's a good proxy that we can actually measure".

(I don't know what Turing's opinion on souls is, and it doesn't matter for that paper!)

hnfong · 2026-03-25T13:13:54 1774444434

I think it's generally accepted that Turing thinks soul doesn't matter when we try to determine whether things have intelligence/ability to think.

mikkupikku · 2026-03-25T10:09:42 1774433382

When push came to shove, it turns out nobody really cared about the Turing test and immediately found excuses to discount it as soon as machines blew through that goalpost. It's fundamentally theological, but the thing is, it doesn't matter. It has no impact on what the machines can demonstrably do.

thangalin · 2026-03-25T00:24:12 1774398252

Claude has neither a soul nor a warbleflupper.

hnfong · 2026-03-25T02:13:19 1774404799

There are already people dealing with AI intelligence scientifically. That's what benchmarks do.

It's the "it's just a stochastic parrot!" camp that's doing the theological work. (and maybe also those in the Singularity camp...)

That said, I do think there's value in having people understand what "Understanding" means, which is kinda a theological (philosophical :D) question. IMHO, in every-day language there's a functional part (that can be tested with benchmarks), and there's a subjective part (i.e. what does it feel like to understand something?). Most people without the appropriate training simply mix up these two things, and together with whatever insecurities they have with AI taking over the world (which IMHO is inevitable to some extent), they just express their strong opinions about it online...

Yizahi · 2026-03-24T12:34:21 1774355661

It probably can, but won't realize that and it won't be efficient in that. LLM can shuffle tokens for an enormous number of tries and eventually come up with something super impressive, though as you yourself have mentioned, we would need to have a mandatory verification loop, to filter slop from good output and how to do it outside of some limited areas is a big question. But assuming we have these verification loops and are running LLMs for years to look for something novel. It's like running an energy grid of small country to change a few dozen of database entries per hour. Yes, we can do that, but it's kinda weird thing to do. But it is novel, no argue about that. Just inefficient.

We never had a big demand to define how humans are intelligent or conscious etc, since it is too hard and was relegated to a some frontier researchers. And with LLMs we now do have such demand but the science wasn't ready. So we are all collectively searching in the dark, trying to define if we are different from these programs if not how. I certainly can't do that. I do know that LLMs are useful, but I also suspect that AI (aka AGI nowadays) is not yet reached.

aspenmartin · 2026-03-24T13:16:36 1774358196

How can people look at

- clear generalizability

- insane growth rates (go back and look at where we were maybe 2 years ago and then consider the already signed compute infrastructure deals coming online)

And still say with a straight face that this is some kind of parlor trick or monkeys with typewriters.

we don’t need to run LLMs for years. The point is look at where we are today and consider performance gets 10x cheaper every year.

LLMs and agentic systems are clearly not monkeys with typewriters regurgitating training data. And they have and continue to grow in capabilities at extremely fast rates.

Yizahi · 2026-03-24T15:27:23 1774366043

I was talking about highest difficulty problems only, in the scope of that comment. Sure at mundane tasks they are useful and we optimizing that constantly.

But for super hard tasks, there is no situation when you just dump a few papers for context add a prompt and LLM will spit out correct answer. It's likely that a lead on such project would need to additionally train LLM on their local dataset, then parse through a lot of experimental data, then likely run multiple LLMs for for many iterations homing on the solution, verifying intermediate results, then repeating cycle again and again. And in parallel the same would do other team members. All in all, for such a huge hard task a year of cumulative machine-hours is not something outlandish.

aspenmartin · 2026-03-24T15:45:56 1774367156

This is just not true. Maybe it will be true if you increase the problem difficulty in concert with model performance? You don't need fine tuning for this and you haven't for years now. Reasoning performance for now may be SOMEWHAT brittle but again look at where we have come from in like 2 years. Then also consider the logical next steps

- better context compression (already happening) + memory solutions that extend the effective context length [memory _is_ compression]

- continual learning systems (likely already prototyped)

- these domains are _verifiable_ which I think just seems to confuse people. RL in verifiable domains takes you farther and farther. Training data is a bootstrap to get to a starting point, because RL from scratch is too inefficient.

agents can already deal with large codebases and datasets, just like any SWE, DS or researcher.

and yes! If you throw more compute at a problem you will get better solutions! But you are missing the point: for the frontier solutions, which changes with every model update, you of course need to eek out as much performance as you can, which requires a large amount of test time compute. But what you can do _without_ this is continually improving. The pattern _already in place_ is that at first you need an extreme amount of compute, then the next model iterations need far less compute to reach that same solution, etc etc. The costs + compute requirements to perform a particular task decrease exponentially.

TeMPOraL · 2026-03-24T13:45:02 1774359902

> We never had a big demand to define how humans are intelligent or conscious etc, since it is too hard and was relegated to a some frontier researchers. And with LLMs we now do have such demand but the science wasn't ready. So we are all collectively searching in the dark, trying to define if we are different from these programs if not how. I certainly can't do that. I do know that LLMs are useful, but I also suspect that AI (aka AGI nowadays) is not yet reached.

Alternative perspective: the science may not have been ready, so instead we brute-forced the problem, through training of LLMs. Consider what the overall goal function of LLM training is: it's predicting tokens that continue given input in a way that makes sense to humans - in fully general meaning of this statement.

It's a single training process that gives LLMs the ability to parse plain language - even if riddled with 1337-5p34k, typos, grammar errors, or mixing languages - and extract information from it, or act on it; it's the same single process that makes it equally good at writing code and poetry, at finding bugs in programs, inconsistencies in data, corruptions in images, possibly all at once. It's what makes LLMs good at lying and spotting lies, even if input is a tree of numbers.

(It's also why "hallucinations" and "prompt injection" are not bugs, but fundamental facets of what makes LLMs useful. They cannot and will not be "fixed", any more than you can "fix" humans to be immune to confabulation and manipulation. It's just the nature of fully general sytems.)

All of that, and more, is encoded in this simple goal function: if a human looks at the output, will they say it's okay or nonsense? We just took that and thrown a ton of compute at it.

jacquesm · 2026-03-24T14:05:35 1774361135

> (It's also why "hallucinations" and "prompt injection" are not bugs, but fundamental facets of what makes LLMs useful. They cannot and will not be "fixed", any more than you can "fix" humans to be immune to confabulation and manipulation. It's just the nature of fully general sytems.)

This is spot on and one of the reasons why I don't think putting LLMs or LLM based devices into anything that requires security is a good idea.

logicprog · 2026-03-23T20:51:21 1774299081

https://web.archive.org/web/20260323190551/https://fortune.c...

logicprog · 2026-03-23T14:57:44 1774277864

This looks genuinely awesome! I've been thinking about how to do good property based testing on UIs, and this elegantly solves that problem — I love the language they've designed here. It really feels like model checking or something.

owickstrom · 2026-03-23T15:06:53 1774278413

Thanks, glad you like it! Do you mean the temporal logic aspect of it?

logicprog · 2026-03-23T15:43:53 1774280633

Yeah, the temporal logic aspect of it is exactly what I was referring to :)

owickstrom · 2026-03-23T16:28:04 1774283284

Cool. I think that is a very neat way of expressing properties of UIs (and stateful systems more generally) that works out nicely in testing. There are some gotchas related to the finiteness of testing, but it's manageable.

logicprog · 2026-03-23T13:57:30 1774274250

https://archive.ph/ZwUyi

logicprog · 2026-03-23T13:53:19 1774273999

This is an extremely dumb video just lazily dunking on some boomer that used cloud technology poorly. If anything this is brain rot.

logicprog · 2026-03-23T13:49:18 1774273758

The statements made in this article seem so absolutely out of touch with what modern generative AI can do that it's kind of insane.

logicprog · 2026-03-22T16:31:48 1774197108

This seems like an excellent idea. I'm sure a lot of us have been idly wondering why CRDTs aren't used for VCS for some time, so it's really cool to see someone take a stab at it! We really do need an improvement over git; the question is how to overcome network effects.

vishvananda · 2026-03-22T16:48:07 1774198087

This is actually a very interesting moment to potentially overcome network effects, because more and more code is going to be written by agents. If a crdt approach is measurably better for merging by agent swarms then there is incentive to make the switch. It also much easier to get an agent to change its workflow than a human. The only tricky part is how much git usage is in the training set so some careful thought would need to be given to create a compatibility layer in the tooling to help agents along.

righthand · 2026-03-22T16:48:05 1774198085

Well over half of all people can’t tell you the difference between git and Github. The latter being owned by a corporation that needs the network effect to keep existing.

NetOpWibby · 2026-03-22T16:48:11 1774198091

Overcoming network effects cannot be the goal; otherwise, work will never get done.

The goal should be to build a full spec and then build a code forge and ecosystem around this. If it’s truly great, adoption will come. Microsoft doing a terrible job with GitHub is great for new solutions.