b44's comments

b44 · 2026-02-16T10:54:30 1771239270

the untrained model is literally just generating random characters, whereas your examples are at least pronounceable. you can add more layers to get progressively better results.

b44 · 2026-02-16T05:58:21 1771221501

good catch - i intentionally cap node visualizations at 16 so it doesn't get super long, but the sidebar shouldn't have that cap

b44 · 2026-02-15T22:10:04 1771193404

not many. diminishing returns start before 1000 and past that you should just add a second/third layer

b44 · 2026-02-15T21:31:48 1771191108

hm. the way i see things, characters are the natural/obvious building blocks and tokenization is just an improvement on that. i do mention chatgpt et al. use tokens in the last q&a dropdown, though