Jupyter notebooks are actually a use case we think about a lot; you can try a live demo with a Jupyter notebook here: https://jamsocket.com/tmpenv/
It wasn't really one thing with Kubernetes that was slow; it was that the more we tried to optimize it, the less of core Kubernetes we were using, and so the less value we were getting for the complexity tax we were paying. The image pulling you mention is a good example of that: having pre-pulled images is a big factor, but we have too many images to push every image to every node. Instead, we'd like the scheduler to be aware of which node has which image. We could do that with node affinity, but what we'd end up building would be more work than writing our own scheduler to support it from day one.
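For context, the node-affinity workaround would look roughly like this. This is a sketch, assuming some out-of-band process labels each node with the images it has cached; the label key and names here are hypothetical, and that labeler is the part you'd have to build yourself:

```yaml
# Hypothetical pod spec pinning a pod to nodes that already have its image.
# Assumes a daemon you wrote sets a node label like
# `images.example.com/my-app-v42: "true"` whenever the image lands in the
# node's cache -- Kubernetes doesn't do this for you.
apiVersion: v1
kind: Pod
metadata:
  name: my-app
spec:
  affinity:
    nodeAffinity:
      # Hard requirement: only schedule onto nodes carrying the label.
      requiredDuringSchedulingIgnoredDuringExecution:
        nodeSelectorTerms:
          - matchExpressions:
              - key: images.example.com/my-app-v42
                operator: In
                values: ["true"]
  containers:
    - name: my-app
      image: registry.example.com/my-app:v42
```

Even in this sketch, the scheduler itself never knows about image locality; you're maintaining a parallel labeling system just to feed it hints, which is the extra work being described.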
> From what it looks like in your repo it might be that you need to do session timing (like ms) response time from a browser?
Our goal is subsecond container starts. We're not there yet, and might not get there with Docker, but we have a POC that is there with WebAssembly-based workloads. Too bad those are rare :)
(By the way, I'm always happy to chat about this stuff, my email is in my profile)
It turned out to be somewhat tricky, because it increased the size of the Node object, and colocating node heartbeats onto the same object meant that a bigger object was changing relatively often. But that was addressed by moving heartbeats to a different object: https://github.com/kubernetes/enhancements/issues/589
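For anyone curious what that change looks like concretely: after that KEP, each kubelet renews a small per-node Lease object in the kube-node-lease namespace instead of rewriting the large Node object on every heartbeat. A Lease looks roughly like this (field values illustrative):

```yaml
# A node-heartbeat Lease as introduced by the KEP linked above
# (values illustrative). The kubelet periodically bumps spec.renewTime
# on this small object, so frequent heartbeats no longer churn the
# much bigger Node object.
apiVersion: coordination.k8s.io/v1
kind: Lease
metadata:
  name: node-1
  namespace: kube-node-lease
spec:
  holderIdentity: node-1
  leaseDurationSeconds: 40
  renewTime: "2023-01-01T00:00:00.000000Z"
```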
Very cool, I didn't know about this either. So many of these features keep landing, which is great, but part of the drag of k8s is the constant upgrade churn and having to keep your YAML fresh.
AWS has put work into fast-starting containers [1] using tricks like lazy loading container storage, profiling container startup, non-lazily priming critical blocks, and caching shared blocks. IIRC parts of it are open source. I don't know if enough of it is open source to be helpful, but it's cool stuff!
> the more we tried to optimize it the less of core Kubernetes we were using and so the less value we were getting for the complexity tax we were paying
Since we were headed down that path, we took a step back and asked what we were really getting out of Kubernetes, and most of it was orthogonal to our intended use case. The way Kubernetes is architected around control loops works great for what it was designed for, but we wanted a more event-driven system.
Event driven ... like a streaming data pipeline? Given your comment about Jupyter notebooks, that makes sense. It might be that the Mesos project is better architected for your use case. Then again, I think Mesos ported some of their schedulers to Kubernetes.