I agree that is performant enough for many applications, I work in the field. Bu... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		sbszllr 44 days ago \| parent \| context \| favorite \| on: Signal leaders warn agentic AI is an insecure, unr... I agree that is performant enough for many applications, I work in the field. But it isn't performant enough to run large scale LLM inference with reasonable latency. Especially not when we compare the throughput numbers for a single-tenant inference inside a TEE vs batched non-private inference.

ramoz 44 days ago [–]

We just served Deepseek R1 on this bad boy in CC+TEE (and an integrated signing layer we developed for vLLM).

https://pasteboard.co/k1hjwT7pWI6x.png

reach out if interested in collab.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact