OpenAI codenamed one of their models "Project Strawberry" and IIRC, Sam Altman h... | Hacker News


xenotux 7 months ago | on: GPT-5: "How many times does the letter b appear in...

OpenAI codenamed one of their models "Project Strawberry" and IIRC, Sam Altman himself was taking a victory lap over the fact that it can count the number of "r"s in "strawberry". Which I think goes to show that it's hard to distinguish between LLMs getting genuinely better at a class of problems versus just being fine-tuned for a particular benchmark that's making the rounds.

KeplerBoy 7 months ago

It gets strawberry right though, so I guess we are only one project blueberry from getting one step closer to AGI.

zahlman 7 months ago

See also the various wolf/goat/cabbage benchmarks, or the benchmarks about crossing a bridge at various speeds with limited light sources.
