Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Something like

    ollama run hf.co/ngxson/GLM-4.7-Flash-GGUF:Q4_K_M
It's really fast! But, for now it outputs garbage because there is no (good) template. So I'll wait for a model/template on ollama.com


It's available (with tool parsing, etc.): https://ollama.com/library/glm-4.7-flash but requires 0.14.3 which is in pre-release (and available on Ollama's GitHub repo)




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: