Hacker News | covi's comments

This feels like the chimpanzee with a power drill. An agent is honestly just brute-force search, but guided.

Human-driven research is also brute force, just with a more efficient search strategy. One can think of a parameter that represents research-search-space-navigation efficiency. RL-trained agents will inevitably optimize for that parameter. I agree with your statement insofar as the value of that efficiency parameter is lower for agents than for humans today.

It's really hard to imagine that they __won't__ exceed the human value for that efficiency parameter rather soon, given that (1) there are plenty of scalar value functions that can represent research efficiency, of which a subset will result in robust training, and (2) AI labs have a massive incentive to increase their research efficiency overall, along with billions of dollars and really good human researchers working on the problem.
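To make the "brute-force search, but guided" framing concrete, here's a toy sketch (my own illustration, not anything from the thread): both strategies spend the same evaluation budget on the same objective, but the guided one biases its next sample toward what worked so far. The "efficiency parameter" is just how good the best find is per evaluation spent.

```python
import random

random.seed(0)

def objective(x):
    # Toy "research problem": a 1-D landscape with one good region near x=3.
    return -(x - 3.0) ** 2

def random_search(n_evals):
    # Pure brute force: sample uniformly over the space, keep the best.
    best = float("-inf")
    for _ in range(n_evals):
        best = max(best, objective(random.uniform(-10, 10)))
    return best

def guided_search(n_evals, step=1.0):
    # Brute force with guidance: propose local moves, keep improvements.
    x = random.uniform(-10, 10)
    best = objective(x)
    for _ in range(n_evals - 1):
        cand = x + random.gauss(0, step)
        val = objective(cand)
        if val > best:
            best, x = val, cand
    return best

# Same budget, different navigation efficiency.
print(f"random: {random_search(200):.3f}  guided: {guided_search(200):.3f}")
```

Both are "search"; the difference the comment is pointing at is entirely in how the next candidate is chosen.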


>Human-driven research is also brute-force but with a more efficient search strategy

No it's not. Is there anything to back that up? There's a creative aspect to human research that I've yet to see from gen AI. All it does is regurgitate stuff and get some "new" ideas via the latent space of the distribution it models. A generative model cannot, by definition, create anything new; it can only estimate its data well enough to sample from it and fake novelty.


Your "brute-force search, but guided" feels like an oxymoron. How does it differ from "guided search"?

Is there anything in the research space that doesn't fit "brute-force search, but guided"?

All of science is "gather inputs, make a hypothesis, test, analyse" on repeat.

There's plenty to critique in the particular guidance approach, but the overall method is the same.


Except the power drill isn't being used to make a better chimpanzee.

The post says Slurm supports gang scheduling, while k8s doesn't (out of the box).


Take a look at SkyPilot. It's good for running these batch workloads, and you can use spot instances to save costs.


To massively increase the reliability of getting GPUs, you can use something like SkyPilot (https://github.com/skypilot-org/skypilot) to fall back across regions, clouds, or GPU choices. E.g.,

$ sky launch --gpus H100

will fall back across GCP regions, AWS, your clusters, etc. There are also options to say "try H100, or H200, or A100, or <insert>".

Essentially the way you deal with it is to increase the infra search space.
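For the multi-accelerator fallback mentioned above, a minimal task-file sketch (field names per SkyPilot's YAML schema as I understand it; treat the specifics as illustrative):

```yaml
# task.yaml -- let SkyPilot provision whichever candidate is available first
resources:
  any_of:
    - accelerators: H100:1
    - accelerators: H200:1
    - accelerators: A100:1
```

Then `sky launch task.yaml` searches across the listed accelerators, in addition to regions and clouds, before giving up.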


Related: https://skyplane.org/en/latest/ (mentioned in OP)

From what I know, this idea underpins a few FAANG-level companies' data transfer systems. OP's value-add is a simple, open-source implementation of the idea, applied to AI workloads.


Congrats on the API launch (from SkyPilot)!


Thanks! We used SkyPilot (an open source cloud GPU worker management tool) to help out with both our small (single node) and large (many node) training runs.


If you want to use your own GPUs or cloud accounts but with a great dev experience, see SkyPilot.


Now just need a Waymo invite code :)



https://www.forbes.com/sites/alexkonrad/2023/07/13/ai-startu...

> Its revenue run rate has spiked this year and now sits at around $30 million to $50 million, three sources said — with one noting that it had more than tripled compared to the start of the year.

