jeffchuber · 2026-02-26T16:25:24 1772123124

last weekend I vibe-coded a project called `openfs` that plugs into just-bash

https://github.com/jeffchuber/just-bash-openfs

it puts a bash interface in front of s3, filesystem (real and in-memory), postgres, and chroma.

still very much alpha - but curious what people think about it.

see an example app here: https://github.com/jeffchuber/openfs-incident-app

andrewingram · 2026-02-26T17:12:33 1772125953

I did a slightly less ambitious prototype a few weeks ago where I added lazy loading of GCS files into the just-bash file-systems, as well as lots of other on-demand files. Was a lot of fun.

jeffchuber · 2026-02-26T17:43:39 1772127819

yeah (optional) caching is interesting to think about - incl write_through and write_back
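A rough sketch of the two write policies, to make the difference concrete. This is not how openfs does it: the `Cache` class, its `flush` method, and the dict standing in for a remote store are all made up for illustration.

```python
class Cache:
    """In-memory cache in front of a slower backing store (a plain dict here)."""

    def __init__(self, backing, write_back=False):
        self.backing = backing        # hypothetical stand-in for S3/GCS/postgres
        self.write_back = write_back
        self.data = {}                # cached entries
        self.dirty = set()            # keys not yet persisted (write_back only)

    def read(self, key):
        # Lazy load: hit the backing store only on first access.
        if key not in self.data:
            self.data[key] = self.backing[key]
        return self.data[key]

    def write(self, key, value):
        self.data[key] = value
        if self.write_back:
            self.dirty.add(key)        # write_back: defer the slow write
        else:
            self.backing[key] = value  # write_through: persist immediately

    def flush(self):
        # Persist deferred writes; does nothing in write_through mode.
        for key in self.dirty:
            self.backing[key] = self.data[key]
        self.dirty.clear()


store_a, store_b = {}, {}
wt = Cache(store_a)                    # write_through
wb = Cache(store_b, write_back=True)   # write_back

wt.write("a.txt", "hello")
wb.write("a.txt", "hello")
print(store_a)  # {'a.txt': 'hello'}
print(store_b)  # {}

wb.flush()
print(store_b)  # {'a.txt': 'hello'}
```

write_through keeps the backing store current at the cost of a slow write on every call; write_back is faster but loses anything unflushed if the process dies.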

JustFinishedBSG · 2026-02-26T19:52:59 1772135579

What problem were you trying to solve? (Not that you need to solve one. I’m just curious.)

jeffchuber · 2026-01-15T04:36:55 1768451815

try out chroma or better yet ask opus to!

jeffchuber · 2025-12-01T05:17:27 1764566247

that was me swyx

rollulus · 2025-12-01T06:34:07 1764570847

Multiple people have coined the idea repeatedly, way before you. The oldest comment on HN I could find was in December 2022 by user spawarotti: https://news.ycombinator.com/item?id=33856172

threeducks · 2025-12-01T09:21:19 1764580879

Here is an even older comment chain about it from 2020: https://news.ycombinator.com/item?id=23895706

Apparently, comparing low-background steel to pre-LLM text is a rather obvious analogy.

pseidemann · 2025-12-01T09:58:11 1764583091

As well as that people often do think alike.

If you have a thought, it's likely it's not new.

rollulus · 2025-12-01T09:27:58 1764581278

Oh wow, great find! That’s really early days.

jeffchuber · 2025-12-01T16:08:25 1764605305

i didn't claim to invent it.

i claimed swyx heard it through me - which he did

swyx · 2025-12-01T17:54:34 1764611674

you did!!

jeffchuber · 2025-11-08T00:49:32 1762562972

i support this

jeffchuber · 2025-11-03T14:33:58 1762180438

Good article - most use cases i see of pg_vector are typically “chat over their technical docs”: small corpus, doesn’t change often / can rebuild the index, and no multi-tenancy, which avoids many of the issues with post-filtering.

Chroma implements SPANN and SPFresh (to avoid the limitations of HNSW), pre-filtering, hybrid search, and has a 100% usage-based tier (many bills are around $1 per month).

Chroma is also Apache 2.0 - fully open source.
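For anyone unsure why post-filtering bites once you have multiple tenants, here is a toy sketch. It uses brute-force exact search over four made-up documents as a stand-in for a real vector index; `post_filter`, `pre_filter`, and the data are hypothetical and do not reflect pg_vector's or Chroma's actual APIs.

```python
import math

# Toy corpus: tenant "a" happens to own the vectors closest to the query.
docs = [
    {"id": 1, "tenant": "a", "vec": (0.1, 0.0)},
    {"id": 2, "tenant": "a", "vec": (0.2, 0.0)},
    {"id": 3, "tenant": "b", "vec": (0.5, 0.0)},
    {"id": 4, "tenant": "b", "vec": (0.9, 0.0)},
]


def post_filter(query, k, tenant):
    # Take the global top-k first, then drop rows from other tenants.
    nearest = sorted(docs, key=lambda d: math.dist(query, d["vec"]))[:k]
    return [d["id"] for d in nearest if d["tenant"] == tenant]


def pre_filter(query, k, tenant):
    # Restrict to the tenant first, then take the top-k among what is left.
    allowed = [d for d in docs if d["tenant"] == tenant]
    nearest = sorted(allowed, key=lambda d: math.dist(query, d["vec"]))[:k]
    return [d["id"] for d in nearest]


query = (0.0, 0.0)
print(post_filter(query, 2, "b"))  # []
print(pre_filter(query, 2, "b"))   # [3, 4]
```

With post-filtering, tenant "b" gets nothing back because the top-k was used up by tenant "a". With a single tenant the filter never removes anything, so the problem mostly does not show up.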

jeffchuber · 2025-09-26T03:40:09 1758858009

congrats to factory on the amazing product and release!

jeffchuber · 2025-09-08T16:09:20 1757347760

chroma stores both

nkozyra · 2025-09-08T16:12:43 1757347963

As does Azure's AI search.

jeffchuber · 2025-08-20T15:07:02 1755702422

Hey there

Chroma is fully OSS - embedded, single-node and distributed (data and control plane). afaik lance distributed is not OSS.

We do have plans to release the crate (enabling embedded chroma in rust) - but haven't gotten around to it yet. Hopefully soon!

> Do you see all providers converging on similar alpha i.e. cheap object storage, nvme drives, ssd cache to solve this?

It's not only a new pattern in search workloads, but it's happening in streaming, KV, OLTP, OLAP, etc. Yea - it's the future.

jeffchuber · 2025-08-20T06:04:24 1755669864

> Supabase/pgVector needs lots of resources when adding new rows to the index -> wish the resources scale up/down automatically. Instead of having to monitor and switch to the next plan.

Many ways potentially - but one way is Chroma makes all this pain go away.

We're also working on some ingestion tooling that will make it so you don't have to scale, manage or run those pipelines.

BrandiATMuhkuh · 2025-08-20T06:32:22 1755671542

I'll for sure take a deeper look. Ingestion has been by far the biggest pain and least fun. Those infra parts hold us back from the cool things -> building agents/search

jeffchuber · 2025-08-20T03:41:34 1755661294

very fair!

cloud has been in private beta for a year now.

we chose to not release it to the public until we were extremely confident in the system and its characteristics.

databases are a serious business. developers trust us with their mission critical data.