> That is very different from these models that are just rewarded for mimicking regardless if it is right or wrong
That's not a totally accurate characterization. The base models are just trained to predict plausible text, but then the models are fine-tuned on instruct or chat training data that encourages a certain "attitude" and correctness. It's far from perfect, but an attempt is certainly made to train them to be right.
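To make the distinction concrete, here is a toy sketch of the base-model objective being discussed: next-token prediction minimizes negative log-likelihood against the training distribution, so a token is penalized for being rare in the data, not for being false. All names and probabilities below are invented for illustration; this is not any real model's code.

```python
import math

# Hypothetical predicted distribution over next tokens for the
# context "the sky is". The numbers are made up.
predicted = {"blue": 0.7, "falling": 0.2, "green": 0.1}

def nll(prob):
    """Negative log-likelihood: the loss next-token prediction minimizes."""
    return -math.log(prob)

# The objective scores agreement with the training distribution,
# not truth: "falling" incurs a higher loss only because it is
# rarer in the data, not because it is factually wrong.
loss_blue = nll(predicted["blue"])
loss_falling = nll(predicted["falling"])
assert loss_falling > loss_blue
```

Instruct fine-tuning reuses this same likelihood objective on curated correct responses (and RLHF adds a learned reward), which is why it nudges outputs toward correctness without making correctness the quantity being optimized directly.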
They are trained to replicate text semantically and then given a lot of correct statements to replicate. That is very different from being trained to be correct: it makes them more useful and less often wrong, but it still doesn't train a concept of correctness into them.