Any below 7B you'd recommend? IME Qwen2.5-3B-Instruct (or even 1.5B) have been q... | Hacker News


tomp 11 months ago | on: Gemma3 – The current strongest model that fits on ...

Any below 7B you'd recommend? IME Qwen2.5-3B-Instruct (or even 1.5B) has been quite remarkable, but I haven't done much heavy testing.

archerx 11 months ago

Try:

- EXAONE-3.5-2.4B-Instruct
- Llama-3.2-3B-Instruct-uncensored
- qwq-lcot-3b-instruct
- qwen2.5-3b-instruct

These have been very interesting tiny models; they can do text-processing tasks and can handle storytelling. Llama-3.2 is way too sensitive to random stuff, so get the uncensored or abliterated versions.
