this is just optimizing for token windows. flat code = less context. we did the same thing with java when memory was expensive, called it "lightweight frameworks"
does it handle skewed distributions? faker's always been useless for this - like, your test data ends up with everyone having 5 orders when real data is all long tail
Fair point, yeet doesn't really decide what's safe. It just scans a hardcoded list of known cache locations and lets you pick what to delete. The assumption is that these are "caches" that can be regenerated, but you're right that some are more painful than others. For Docker specifically, we include paths like /var/lib/docker which is pretty aggressive, that's images, build cache, and volumes. Probably shouldn't be in there since `docker system prune handles` that way better. Good feedback, will tighten up what we scan for
we had to restrict ours to views only because it kept trying to run updates. still breaks sometimes when it hallucinates column names but at least it can't do anything destructive
Great question! No, Phantom Guard won't flag legit packages like openai, langchain-openai, or gpt-engineer.
The primary signal is whether the package exists on the registry. We query PyPI/npm directly:
- If a package exists → it gets a low/safe risk score
- If a package doesn't exist → that's the main red flag for slopsquatting
Pattern matching (like AI-related terms) is just one of many weighted signals, and it's far outweighed by existence. In fact, popular packages get a negative weight that actively reduces their risk score.
The attack we're detecting is when an LLM hallucinates a package name like flask-gpt-utils that sounds plausible but doesn't exist. A real attacker could then register that name and wait for developers to pip install it.
We test against the top 1000 PyPI packages and target <5% false positive rate. If you're importing openai or transformers, you're fine.
curious about the startup latency in practice. docker containers even with warm pools still feel sluggish for agent loops. e2b does firecracker and it's noticeably snappier