Traditional Microsoft devs are used to deterministic tests (assert result == expected), whereas AI requires probabilistic evals and quality monitoring in prod. I think Microsoft simply lacks the LLM Ops culture right now to build a quality evaluation pipeline before release; they are testing everything on users.
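
To make the contrast concrete, here is a rough sketch of what a probabilistic eval gate could look like next to a plain assert. Everything in it is made up for illustration: `fake_model` stands in for a real LLM call, `contains_answer` is a toy grader, and the 0.8 threshold is arbitrary. It is not a claim about how Microsoft or anyone else actually does it.

```python
import random


def pass_rate(generate, cases, grader, samples=20, seed=0):
    """Sample each case several times and return the fraction of graded passes."""
    rng = random.Random(seed)  # seeded so the eval run itself is reproducible
    passed = 0
    total = 0
    for prompt, expected in cases:
        for _ in range(samples):
            output = generate(prompt, rng)
            passed += int(grader(output, expected))
            total += 1
    return passed / total


def fake_model(prompt, rng):
    # Hypothetical stand-in for an LLM call: gives the right answer
    # roughly 90% of the time and a wrong one otherwise.
    return "4" if rng.random() < 0.9 else "5"


def contains_answer(output, expected):
    # Toy grader; a real pipeline might use a rubric or a judge model.
    return expected in output


cases = [("What is 2 + 2?", "4")]

# Deterministic style would be: assert fake_model(...) == "4", which flakes.
# Probabilistic style: gate the release on an aggregate pass rate instead.
rate = pass_rate(fake_model, cases, contains_answer, samples=200)
assert rate >= 0.8, f"pass rate {rate:.2f} is below the release threshold"
```

The same `pass_rate` number can be tracked in prod over time, which is the monitoring half of the point.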