b44's comments

the untrained model is literally just generating random characters, whereas your examples are at least pronounceable. you can add more layers to get progressively better results.
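to make the "random characters" point concrete, here's a minimal sketch, not the actual model from the project. it assumes an untrained model behaves like near-zero random logits pushed through a softmax, which gives a near-uniform distribution over characters. `ALPHABET`, `softmax` and `sample_untrained` are hypothetical names made up for this illustration.

```python
import math
import random
import string

# assumed toy vocabulary: lowercase letters plus space
ALPHABET = string.ascii_lowercase + " "

def softmax(logits):
    """turn raw scores into probabilities (shifted by the max for stability)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def sample_untrained(n, seed=0):
    """sample n characters from a stand-in for an untrained model.

    the "model" here is just tiny random logits per step (an assumption),
    so every character ends up with almost the same probability.
    """
    rng = random.Random(seed)
    out = []
    for _ in range(n):
        logits = [rng.gauss(0.0, 0.01) for _ in ALPHABET]
        probs = softmax(logits)
        out.append(rng.choices(ALPHABET, weights=probs, k=1)[0])
    return "".join(out)

print(sample_untrained(40))
```

the output is unpronounceable noise because nothing has pushed the probabilities away from uniform yet; training is what makes some characters more likely than others in a given context.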


good catch - i intentionally cap node visualizations at 16 so it doesn't get super long, but the sidebar shouldn't have that


not many. diminishing returns start before 1000 and past that you should just add a second/third layer


hm. the way i see things, characters are the natural/obvious building blocks and tokenization is just an improvement on that. i do mention chatgpt et al. use tokens in the last q&a dropdown, though
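a rough sketch of what i mean by "improvement": start from one id per character, then merge the most frequent adjacent pair into a new id, which is a single step in the style of byte-pair encoding. this is a toy illustration under my own assumptions, not how chatgpt's tokenizer is actually implemented; `encode`, `decode`, `most_frequent_pair` and `merge` are hypothetical helper names.

```python
from collections import Counter

text = "banana bandana"

# character level: every distinct character is its own building block
vocab = sorted(set(text))
stoi = {ch: i for i, ch in enumerate(vocab)}
itos = {i: ch for ch, i in stoi.items()}

def encode(s):
    """map each character to its integer id."""
    return [stoi[ch] for ch in s]

def decode(ids):
    """map character ids back to a string."""
    return "".join(itos[i] for i in ids)

def most_frequent_pair(ids):
    """return the adjacent id pair that occurs most often."""
    return Counter(zip(ids, ids[1:])).most_common(1)[0][0]

def merge(ids, pair, new_id):
    """replace every non-overlapping occurrence of pair with new_id."""
    out, i = [], 0
    while i < len(ids):
        if i + 1 < len(ids) and (ids[i], ids[i + 1]) == pair:
            out.append(new_id)
            i += 2
        else:
            out.append(ids[i])
            i += 1
    return out

ids = encode(text)
pair = most_frequent_pair(ids)
merged = merge(ids, pair, new_id=len(vocab))  # new token id after the char ids

print(len(ids), len(merged))
```

the merged sequence is shorter than the character sequence but carries the same text, so tokens are just characters grouped into bigger units.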

