AFAIK that model is pretty old, and it was explicitly trained for SVG generation. For other models the capability of generating SVGs of real stuff is accidental. Same as GPT-5.x and Sonnet 4.5+ being able to generate MIDI music.
Benchmarks are not interesting in deciding the "size class". Bigger size means more knowledge. Also, the Qwen 3.5 27B is a dense 27B active parameter model. StepFun 3.5 Flash has 11B active parameters.
No, because smart people realize they are playing an iterated game and that behaving in a way that people identify as Machiavellian is actually suboptimal in the long run.
So they're smart enough to be calculated and stupid enough not to be so calculated that they look untrustworthy.
> No, because smart people realize they are playing an iterated game and that behaving in a way that people identify as Machiavellian is actually suboptimal in the long run.
Even if you are right coincidentally (which I wouldn't be so sure about), that's still poor argument assuming you realize your belief in what optimal strategy is what it is - just an educated guess.
reply