statusfailed's comments

The Milk-V Jupiter 2 (coming out in April) is RVA23 too

Nice board but very low on max RAM.

The Milk-V Titan (https://milkv.io/titan) can take up to 64GB which is fine considering the number of cores and the cost of RAM. If you needed and could afford more RAM you'd be better off distributing the work across more than one board.

I simply want to replace my desktop with open hardware. That board would be fine, thank you for the pointer.

Unfortunately they found a bug and had to redesign the boards. I've had one of these on pre-order since last year; the latest news is that they intend to ship next month (April).

The SpacemiT K3 (https://www.spacemit.com/products/keystone/k3 https://www.cnx-software.com/2026/01/23/spacemit-k3-16-core-...) is the one everyone is waiting for. We have one in house (as usual, cannot discuss benchmarks, but it's good). Unfortunately I don't think there is anyone reputable offering pre-orders yet.


Ok! I will keep an eye out. It's one of the most interesting hardware developments for me in the last decade, and I definitely want to show my support by buying one or more of the boards. A respin is always really annoying this late in the process; the post-mortem must make for interesting reading.

You're super lucky to have your hands on one!


Ahh I thought I was the only one! One line per sentence makes the diffs so much nicer too, maybe we need git hooks to reject multiple sentences per line?
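A minimal sketch of what such a hook could check (the function name and the sentence heuristic are mine, not an established tool):

```python
# Hedged sketch of a pre-commit check that rejects lines holding more
# than one sentence. The heuristic (terminal punctuation followed by
# whitespace and more prose) is crude and purely illustrative.
import re

def multi_sentence_lines(text):
    """Return 1-based numbers of lines that look like >1 sentence."""
    bad = []
    for n, line in enumerate(text.splitlines(), start=1):
        # A ./!/? followed by whitespace and a word character suggests
        # a second sentence starting on the same line.
        if re.search(r"[.!?]\s+\w", line):
            bad.append(n)
    return bad

sample = "One sentence per line.\nTwo here. See?\n"
print(multi_sentence_lines(sample))  # [2]
```

Wired into `.git/hooks/pre-commit`, a script like this would scan the staged files and exit non-zero whenever `multi_sentence_lines` returns anything.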


You can just do `git diff --word-diff` and then the diffs look great even with one paragraph per line.

I used to do one sentence per line, but after getting used to how Emacs handles soft-wrapping, I now do one paragraph per line, even when I use Vim. This also makes collaboration with other authors easier, since most non-Vim collaborators do the same.


Also, IMO sentences are a more useful logical grouping. I might want to swap the order of two sentences, so having the quick `dd` then `p` is nice. Paragraphs get moved occasionally, but rarely enough that I don't mind grabbing them as if they were a collection of sentences.


I didn't find note-taking particularly useful until I started keeping everything in a single notebook with dated pages. This worked a lot better than (for example) trying to organise notes by category: it's often easier to remember when you were working on something than how you categorised it, and once you know roughly when, you can find it by binary search.
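The lookup-by-date idea maps directly onto binary search in code; a small illustration using the standard `bisect` module (the dates and note titles are made up):

```python
# Hedged illustration: dated notes found by binary search on date,
# mirroring flipping through a chronological notebook.
import bisect
from datetime import date

# Entries kept in chronological order, like pages in the notebook.
entries = [
    (date(2024, 1, 3), "project kickoff"),
    (date(2024, 2, 14), "board respin notes"),
    (date(2024, 5, 9), "layout algebra notes"),
]
dates = [d for d, _ in entries]

def note_on_or_after(day):
    """Find the first note written on or after the given date."""
    i = bisect.bisect_left(dates, day)
    return entries[i][1] if i < len(entries) else None

print(note_on_or_after(date(2024, 2, 1)))  # board respin notes
```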


I saw a similar (I think!) paper, "Grassmannian Optimization Drives Generalization in Overparameterized DNN", at OPT-ML at NeurIPS last week [0]

This is a little outside my area, but I think the relevant part of that abstract is "Gradient-based optimization follows horizontal lifts across low-dimensional subspaces in the Grassmannian Gr(r, p), where r ≪ p is the rank of the Hessian at the optimum"

I think this question is super interesting though: why can massively overparametrised models still generalise?

[0]: https://opt-ml.org/papers/2025/paper90.pdf


Which repos worked well? I've had the same experience as the OP: unhelpful diagrams and bad information hierarchy. But I'm curious to see examples where it's produced good output!


Really nice! I had a quick read; here's my quick summary:

- Arrays are typed `S : D` with shape S and strides D

- Each of `S` and `D` is a nested tuple (instead of the flat tuples one typically sees in a tensor framework)

- Together `S` and `D` define the layout of a tensor

- Not every layout is "tractable", but the tractable ones form a nice category
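A minimal sketch of how such a nested layout could map coordinates to memory offsets (the function and the example numbers are mine, for illustration, not from the paper):

```python
# Hedged sketch: a layout maps logical coordinates to memory offsets.
# Shape S and strides D are congruent nested tuples; the offset is the
# "dot product" of the coordinate with the strides, applied recursively
# down the nesting.

def offset(coord, stride):
    """Memory offset of a (possibly nested) coordinate under strides."""
    if isinstance(coord, tuple):
        return sum(offset(c, d) for c, d in zip(coord, stride))
    return coord * stride

# Illustrative nested strides D = ((1, 8), (2, 16)) for a shape
# S = ((2, 2), (2, 2)); the coordinate ((1, 0), (1, 1)) lands at
# 1*1 + 0*8 + 1*2 + 1*16 = 19.
print(offset(((1, 0), (1, 1)), ((1, 8), (2, 16))))  # 19
```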

A really good exposition. My only criticism is that it's quite front-heavy: it would be nice to see a detailed example like the one in 2.3.8 earlier in the document, since a lot of detail is presented early that doesn't seem necessary to understand the core ideas.

Last comment: I suspect there is a connection to strictification[0], would love to know more if the authors see this!

[0]: in the sense I mean here: https://arxiv.org/pdf/2201.11738v3


Hellas.AI | Remote | Full Time or Contract | https://hellas.ai

Hellas.AI is building a compiler for categorical deep learning to power decentralised, serverless AI.

We're looking for an experienced Rust engineer. You must have demonstrated:

* Experience in designing and implementing BFT consensus protocols

* Ability to engage with academic literature on distributed systems

* Understanding of ZK proofs and proof aggregation (you don't need to design these, but you should know how to use them)

Nice-to-have: interest or experience in:

* Deep Learning / ML / AI, GPU programming

* Compilers and array programming languages

* Category Theory and string diagrams

If you have project(s) demonstrating your experience in one or more of these areas, get in touch!

Our process is simple: we read code you've written and research you've published, then interview you about it. We don't do tedious coding tests.

Email us at hello@hellas.ai


I've been hoping for a nice concrete example of braided monoidal categories for ages, who knew that the best one was string diagrams that represent actual string! Great post!

@the author - I assume you're aware that morphisms in symmetric monoidal categories can be represented using cospans of hypergraphs - do you know if there's a similar combinatorial representation for braided monoidal categories?


(author here) the goals of catgrad are a bit different to JAX - first of all, the "autodiff" algorithm is really a general approach to composing optics, of which autodiff is just a special case. Among other things, this means you can "plug your own language" into the syntax library to get a "free" autodiff algorithm. Second, catgrad itself is more like an IR right now - we're using it at hellas.ai to serve as a serverless runtime for models.

More philosophically, the motto is "write programs as morphisms directly". Rather than writing a term in some type theory which you then (maybe) give a categorical semantics, why not just work directly in a category?

Long term, the goal is to have a compiler which is a stack of categories with functors as compiler passes. The idea is that, in contrast to typical compilers, where you are "stuck" at a given abstraction level, this would let you view your code at various levels of abstraction. So for example, you could write a program, then write an x86-specific optimization for one function, which you can then prove correct with respect to the more abstract program specification.
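A toy sketch of "functors as compiler passes" (not catgrad's actual API; the names and ops here are invented): a program is a composite of named primitives, and a pass translates each primitive into lower-level ones, which preserves composition by construction.

```python
# Hedged sketch (not catgrad's real interface): a "program" is a list
# of primitive ops (a morphism written as a composite), and a compiler
# pass is a functor: a mapping on primitives extended op-by-op, so
# composition is preserved automatically.

def apply_functor(on_ops, program):
    """Translate each primitive through the functor's action on ops."""
    return [step for op in program for step in on_ops[op]]

high_level = ["dense", "relu"]
lower = {
    "dense": ["matmul", "add_bias"],  # one op expands into several
    "relu": ["max0"],
}
print(apply_functor(lower, high_level))  # ['matmul', 'add_bias', 'max0']
```

Stacking several such dictionaries would give the "stack of categories" picture: each level stays inspectable, and each pass is just another functor applied in sequence.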


> Among other things, this means you can "plug your own language" into the syntax library to get a "free" autodiff algorithm.

Hello, as a compiler engineer I am interested in this area. Can you expand a little bit more? How would I be able to plug in my own language for example?

> So for example, you could write a program, then write an x86-specific optimization for one function which you can then prove correct with respect to the more abstract program specification.

So, what you are saying is that catgrad allows me to write a program and then also plug in a compiler pass? I.e., the application author can also be the compiler developer?


I get it, but once you’ve compiled your program down to a computational graph…well those are basically morphisms in a monoidal/multi category, no? And then you can have a stack of MLIR dialects that represent different levels of abstraction, where optimization passes are functors between the multicategory determined by that dialect, then you hand it off to LLVM for CPU execution, or maybe you want to run it with WebGPU, go down to CUDA, etc.


Can you expand on "write programs as morphisms directly"?

I'm someone super interested in category theory and ITT, but I can't quite parse what you are trying to convey even though I think I have the prereqs to understand the answer.


Hellas.AI | Remote | Full Time or Contract | https://hellas.ai

Hellas.AI is building a compiler for categorical deep learning to power decentralised, serverless AI. If you're interested in any of the following, get in touch:

* Category Theory and string diagrams

* Compilers and array programming languages

* Deep Learning/ML/AI and/or GPU programming (e.g. CUDA/WGSL)

* Building and deploying distributed systems

* Peer-to-peer software and blockchain

If you have project(s) demonstrating your experience in one or more of these areas, get in touch!

Our process is simple: we read code you've written and research you've published, then interview you about it. We don't do tedious coding tests.

Some experience of Rust or a functional language (Haskell, OCaml, any Lisp) is preferred.

Email us at hello@hellas.ai

