

ljouhet 17 days ago | on: GLM-4.7-Flash

Something like `ollama run hf.co/ngxson/GLM-4.7-Flash-GGUF:Q4_K_M`. It's really fast! But for now it outputs garbage because there is no (good) template, so I'll wait for a model/template on ollama.com.

jmorgan 17 days ago

It's available (with tool parsing, etc.) at https://ollama.com/library/glm-4.7-flash, but it requires Ollama 0.14.3, which is currently in pre-release (available on Ollama's GitHub repo).
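
For anyone who wants to check the version requirement before pulling the model, here is a minimal sketch in Python. The helper names `parse_version` and `meets_minimum` are hypothetical, not part of Ollama. The sketch assumes the version string (for example, the output of `ollama --version`) contains a dotted `X.Y.Z` number, and it deliberately ignores any pre-release suffix such as `-rc1`, since 0.14.3 is itself in pre-release.

```python
import re

# Minimum Ollama release mentioned in the thread for glm-4.7-flash.
MIN_VERSION = (0, 14, 3)


def parse_version(text):
    """Return the first X.Y.Z number found in `text` as a tuple of ints, or None."""
    match = re.search(r"(\d+)\.(\d+)\.(\d+)", text)
    if match is None:
        return None
    return tuple(int(part) for part in match.groups())


def meets_minimum(text, minimum=MIN_VERSION):
    """True if the version found in `text` is at least `minimum`.

    Assumption: pre-release suffixes (e.g. "-rc1") are ignored, so a
    0.14.3 pre-release build counts as meeting the requirement.
    """
    version = parse_version(text)
    return version is not None and version >= minimum
```

If the check passes, the library model from the link above would presumably be started with `ollama run glm-4.7-flash` rather than the `hf.co/...` GGUF path.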
