More

herpderperator · 2026-04-14T07:31:25 1776151885

Does this help with DuckDB concurrency? My main gripe with DuckDB is that you can't write to it from multiple processes at the same time. If you open the database in write mode with one process, you cannot modify it at all from another process without the first process completely releasing it. In fact, you cannot even read from it from another process in this scenario.

So if you typically use a file-backed DuckDB database in one process and want to quickly modify something in that database using the DuckDB CLI (like you might connect SequelPro or DBeaver to make changes to a DB while your main application is 'using' it), then it complains that it's locked by another process and doesn't let you connect to it at all.

This is unlike SQLite, which supports and handles this in a thread-safe manner out of the box. I know it's DuckDB's explicit design decision[0], but it would be amazing if DuckDB could behave more like SQLite when it comes to this sort of thing. DuckDB has incredible quality-of-life improvements with many extra types and functions supported, not to mention all the SQL dialect enhancements allowing you to type much more concise SQL (they call it "Friendly SQL"), which executes super efficiently too.

[0] https://duckdb.org/docs/current/connect/concurrency

szarnyasg · 2026-04-14T07:56:52 1776153412

Hi, DuckDB DevRel here. To have concurrent read-write access to a database, you can use our DuckLake lakehouse format and coordinate concurrent access through a shared Postgres catalog. We released v1.0 yesterday: https://ducklake.select/2026/04/13/ducklake-10/

I updated your reference [0] with this information.

nrjames · 2026-04-14T11:37:27 1776166647

Regarding documentation, I think the DuckLake docs would benefit from a relatively simple “When should I consider using DuckLake?” type FAQ entry. You have sections for what, how, and why, essentially, and a few simple use cases and/or case studies could help provide the aha moment to people in data jobs who are inundated with marketing from other companies. It would help folks like me understand under which circumstances I would stand to benefit most from using DuckLake.

szarnyasg · 2026-04-15T13:44:13 1776260653

DuckDB devrel here. You are right. This was in the FAQ but I also added it to the DuckLake documentation's main page at https://ducklake.select/docs/stable/

citguru · 2026-04-14T15:09:14 1776179354

Hi,

DuckLake is great for the lakehouse layer and it's what we use in production. But there's a gap and thats what I'm trying to address with OpenDuck. DuckLake do solve concurrent access at the lakehouse/catalog level and table management.

But the moment you need to fall back to DuckDB's own compute for things DuckLake doesn't support yet, you're back to a single .duckdb file with exclusive locking. One process writes, nobody else reads.

OpenDuck sits at a different layer. It intercepts DuckDB's file I/O and replaces it with a differential storage engine which is append-only layers with snapshot isolation.

citguru · 2026-04-14T13:57:25 1776175045

Yes, this is actually one of the core problems OpenDuck's architecture addresses.

The short version: OpenDuck interposes a differential storage layer between DuckDB and the underlying file. DuckDB still sees a normal file (via FUSE on Linux or an in-process FileSystem on any platform), but underneath, writes go to append-only layers and reads are resolved by overlaying those layers newest-first. Sealing a layer creates an immutable snapshot.

This gives you:

Many concurrent readers: each reader opens a snapshot, which is a frozen, consistent view of the database. They don't touch the writer's active layer at all. No locks contended.

One serialized write path: multiple clients can submit writes, but they're ordered through a single gateway/primary rather than racing on the same file. This is intentional: DuckDB's storage engine was never designed for multi-process byte-level writes, and pretending otherwise leads to corruption. Instead, OpenDuck serializes mutations at a higher level and gives you safe concurrency via snapshots.

So for your specific scenario — one process writing while you want to quickly inspect or query the DB from the CLI — you'd be able to open a read-only snapshot mount (or attach with ?snapshot=<uuid>) from a second process and query freely. The writer keeps going, new snapshots appear as checkpoints seal, and readers can pick up the latest snapshot whenever they're ready.

It's not unconstrained multi-writer OLTP (that's an explicit non-goal), but it does solve the "I literally cannot even read the database while another process has it open" problem that makes DuckDB painful in practice.

jeadie · 2026-04-14T10:20:44 1776162044

This is exactly what we found. Ingest rates were tough. We partitioned and ran over multiple duckdb instances too (and wrangled the complexity).

We ending up building a Sqlite + vortex file alternative for our use case: https://spice.ai/blog/introducing-spice-cayenne-data-acceler...

wenc · 2026-04-14T12:34:11 1776170051

Try DuckLake. They just released a prod version.

You can do read/write of a parquet folder on your local drive, but managed by DuckLake. Supports schema evolution and versioning too.

Basically SQLite for parquet.

herpderperator · 2026-04-08T18:15:44 1775672144

Nice app! I see that the page title says "client" and there's the Vite favicon still, which you might want to fix :)

I also think having a dropdown for the address search is somewhat expected these days, but is lacking here. That might be on purpose or due to a technical limitation, but just thought I'd mention.

vkatluri · 2026-04-08T18:58:58 1775674738

That's embarrassing... will fix the title and favicon.

Thanks for the feedback regarding addresses, on my TODO list to add support for dynamic search.

herpderperator · 2026-03-03T18:17:16 1772561836

This sounds like swap needing to be swapped in and then released. Check your memory usage.

herpderperator · 2025-12-22T21:39:03 1766439543

If those other applications use their own local GPS clocks, what is the significance of NIST (and the 5μs inaccuracy) in their scenario?

Denvercoder9 · 2025-12-23T00:59:29 1766451569

GPS gets its time from NIST (though during this incident they failed over to another NIST site, so it wasn't impacted).

iJohnDoe · 2025-12-23T04:36:31 1766464591

That is not correct at all. How did you arrive at that conclusion?

GPS has its own independent timescale called GPS Time. GPS Time is generated and maintained by Atomic clocks onboard the GPS satellites (cesium and rubidium).

ted_dunning · 2025-12-23T08:07:54 1766477274

It has its own timescale, but that still traces back to NIST.

In particular, the atomic clocks on board the GPS satellites are not sufficient to maintain a time standard because of relativistic variations and Doppler effects, both of which can be corrected, but only if the exact orbit is known to within exceeding tight tolerances. Those orbital elements are created by reference to NIST. Essentially, the satellite motions are computed using inverse GPS and then we use normal GPS based on those values.

throw0101d · 2026-01-05T13:48:52 1767620932

> It has its own timescale, but that still traces back to NIST.

GPS gets its time from the US Naval Observatory:

> Former USNO director Gernot M. R. Winkler initiated the "Master clock" service that the USNO still operates,[29][30] and which provides precise time to the GPS satellite constellation run by the United States Space Force. The alternate Master Clock time service continues to operate at Schriever Space Force Base in Colorado.

* https://en.wikipedia.org/wiki/United_States_Naval_Observator...

The USNO does not seem to sync with NIST:

> As a matter of policy, the U.S. Naval Observatory timescale, UTC(USNO), is kept within a close but unspecified tolerance of the international atomic timescale published by the Bureau International des Poids et Mesures (International Bureau of Weights and Measures [BIPM]) in Sevres, France. The world's timing centers, including USNO, submit their clock measurements to BIPM, which then uses them to compute a free-running (unsteered) mean timescale (Echelle Atomique Libre [EAL]). BIPM then applies frequency corrections ("steers") to EAL, based on measurements from primary frequency standards and intended to keep the International System's basic unit of time, the second, constant. The result of these corrections is another timescale, TAI (Temps Atomique International or International Atomic Time). The addition of leap seconds to TAI produces UTC. The world's timing centers have agreed to keep their real-time timescales closely synchronized ("coordinated") with UTC. Hence, all these atomic timescales are called Coordinated Universal Time (UTC), of which USNO's version is UTC(USNO).

* https://www.cnmoc.usff.navy.mil/Our-Commands/United-States-N...

The two organizations do seem to keep an eye on each other:

> The United States Naval Observatory (USNO) and the National Institute of Standards and Technology (NIST) make regular comparisons of their respective time scales. These comparisons are made using GPS common-view measurements from up to approximately 10 GPS satellites. The table below lists recent differences between the two time scales.

* https://www.nist.gov/pml/time-and-frequency-division/time-se...

sq_ · 2025-12-23T05:25:18 1766467518

I think GP might’ve been referring to the part of Jeff’s post that references GPS, which I think may be a slight misunderstanding of the NIST email (saying “people using NIST + GPS for time transfer failed over to other sites” rather than “GPS failed over to another site”).

The GPS satellite clocks are steered to the US Naval Observatory’s UTC as opposed to NIST’s, and GPS fails over to the USNO’s Alternate Master Clock [0] in Colorado.

[0] https://www.cnmoc.usff.navy.mil/Our-Commands/United-States-N...

sq_ · 2025-12-23T18:44:50 1766515490

I find this stuff really interesting, so if anyone's curious, here's a few more tidbits:

GPS system time is currently 18s ahead of UTC since it doesn't take UTC's leap seconds into account [0]

This (old) paper from USNO [1] goes into more detail about how GPS time is related to USNO's realization of UTC, as well as talking a bit about how TAI is determined (in hindsight! - by collecting data from clocks around the world and then processing it).

[0] https://www.cnmoc.usff.navy.mil/Our-Commands/United-States-N... [1] https://ntrs.nasa.gov/api/citations/19960042620/downloads/19...

throw0101c · 2025-12-23T03:01:04 1766458864

> If those other applications use their own local GPS clocks, what is the significance of NIST (and the 5μs inaccuracy) in their scenario?

Verification and traceability is one reason: it's all very well to claim you're with-in ±x seconds, but your logs may have to say how close you are to the 'legal reality' that is the official time of NIST.

NIST may also send out time via 'private fibre' for certain purposes:

* https://en.wikipedia.org/wiki/White_Rabbit_Project

'Fibre timing' is also important in case of GNSS signal disruption:

* https://www.gpsworld.com/china-finishing-high-precision-grou...

herpderperator · 2025-11-23T06:12:27 1763878347

Can someone explain exactly what's happening here? https://github.com/nadimkobeissi/16iax10h-linux-sound-saga/i...

It seems like there's a lot of personal information being asked for / thrown around... including a debit/credit card number?

Is there no better way to handle the bounty payment?

herpderperator · 2025-10-23T15:28:56 1761233336

That would cause your active connections to break because the source IP changed entirely. Are you sure the IP changes abruptly, or they keep it for as long as the session is live? Though keeping the original IP would mean that, for example, if you are sailing around the world, you'd start getting worse and worse latency as all your data continues going to the original ground station which may be on the other side of the world at that point.

An interesting problem - I wonder what they truly do here. I suppose people expect interruptions with Starlink so doing an IP swap wouldn't be all that different to losing service due to obstruction for a few minutes.

ianburrell · 2025-10-23T16:58:48 1761238728

IP addresses change all the time. It changes when connect to WiFi, it changes when enter new country, it changes when provider gives you new address. I cant tell if changes on mobile, it looks like mobile providers hand off to next tower, but there must be a limit of how far can go before routing breaks.

Everything retries cause there isn’t difference between new address or bad connection. Most of time we don’t notice cause not using device. Or because most connections are short lived.

herpderperator · 2025-10-23T17:58:18 1761242298

I'm aware that the public IP changes when a phone (on which one hardly has much control over how things run anyway), switches from cellular to a WiFI network.

Your comments are more practical (and maybe aimed at a layman's use of Starlink) but I am talking about the theory of Starlink supposedly interrupting a perfectly-working connection in order to change your IP, which interrupts everything, by design of TCP/conntrack. Whether that operation is fatal or not due to retries or whatever else is not my point at all.

Also, ISPs at home don't randomly disconnect you to give you a new IP. They may give you a new IP when you disconnect and reconnect for other reasons, but they should never dump your connection on purpose just to give you a new IP for no reason. That's not good design at all, hence the question about how Starlink handles wanting to give you a new IP.

herpderperator · 2025-09-25T22:22:17 1758838937

Serious question: If it's an improved 2.5 model, why don't they call it version 2.6? Seems annoying to have to remember if you're using the old 2.5 or the new 2.5. Kind of like when Apple released the third-gen iPad many years ago and simply called it the "new iPad" without a number.

skerit · 2025-09-25T22:28:15 1758839295

That's why people called the second version of Sonnet v3.5 simply v3.6, and Anthropic acknowledged that by naming the next version v3.7

Aeolun · 2025-09-26T06:08:11 1758866891

Only Anthropic has a slightly understandable version scheme.

alwillis · 2025-09-25T22:35:59 1758839759

It's pretty common to refer to models by the month and year they were released.

For example, the latest Gemini 2.5 Flash is known as "google/gemini-2.5-flash-preview-09-2025" [1].

[1]: https://openrouter.ai/google/gemini-2.5-flash-preview-09-202...

cpeterso · 2025-09-25T23:11:50 1758841910

If they're going to include the month and year as part of the version number, they should at least use big endian dates like gemini-2.5-flash-preview-2025-09 instead of 09-2025.

herpderperator · 2025-09-25T22:36:43 1758839803

Or, you know, just Gemini 2.6 Flash. I don't recall the 2.5 version having a date associated with it when it came out, though maybe they are using dates now. In marketing, at least, it's always known as Gemini 2.5 Flash/Pro.

kingo55 · 2025-09-25T22:47:27 1758840447

It had a date, but I also agree this is extremely confusing. Even semver 2.5.1 would be clearer IMO.

vitorgrs · 2025-09-26T01:00:27 1758848427

It always had dates... They release multiple versions and update regularly. Not sure if this is the first 2.5 Flash update, but pretty sure Pro had a few updates as well...

This is also the case with OpenAI and their models. Pretty standard I guess.

They don't change the versioning, because I guess they don't consider it to be "a new model trained from scratch".

Thorrez · 2025-09-26T13:04:38 1758891878

>For example, the latest Gemini 2.5 Flash is known as "google/gemini-2.5-flash-preview-09-2025" [1].

That "example" is the name used in the article under discussion. There's no need to link to openrouter.ai to find the name.

relatedtitle · 2025-09-25T23:11:18 1758841878

I'm pretty sure Google just does that for preview models and they drop the date from the name when it's released.

someguyiguess · 2025-09-26T03:52:38 1758858758

If only there was some of versioning nomenclature they could use. Maybe even one that is … semantic? Oh how I wish someone would introduce something like this to the software engineering field. /s

In all seriousness though, their version system is awful.

qafy · 2025-09-25T22:35:40 1758839740

2.5 is not the version number, it's the generation of the underlying model architecture. Think of it like the trim level on a Mazda 3 hatchback. Mazda already has the Mazda 3 Sport in their lineup, then later they release the Mazda 3 Turbo which is much faster. When they release this new version of the vehicle its not called the Mazda 4... that would be an entirely different vehicle based on a new platform and powertrain etc (if it existed). The new vehicle is just a new trim level / visual refresh of the existing Mazda 3.

That's why Google names it like this, but I agree its dumb. Semver would be easier.

someguyiguess · 2025-09-26T03:55:42 1758858942

I’d say it’s more like naming your Operating System off of the kernel version number.

pests · 2025-09-26T03:29:40 1758857380

Gonna steal this to help explain to non tech friends when it comes up again.

JumpCrisscross · 2025-09-25T22:51:07 1758840667

Maybe they’re signalling it’s more of a bug fix?

manquer · 2025-09-26T00:15:19 1758845719

2.5.1 then .

semantic versioning works for most scenarios.

JumpCrisscross · 2025-09-26T00:26:09 1758846369

Would that automatically roll over anyone pinging 2.5 via their API?

manquer · 2025-09-26T02:34:12 1758854052

If you want role over then you could specify ^2.5.0 or 2.5.x if you want to pin then it would be 2.5.0

This is all solved for a long time now , llm vendors seems to have unlearnt versioning principles.

This is fairly typical - marketing and business wants different things to do with version number than what version number systems are good at .

dgacmu · 2025-09-26T12:04:47 1758888287

I suspect Google doesn't want to have to maintain multiple sub-versions. It's easier to serve one 2x popular model than two models where there's flux between the load on each, since these things have a non-trivial time to load into GPU/TPU memory for serving.

manquer · 2025-09-27T19:44:09 1759002249

Even if switching quickly was a challenge[1], they are using these models in their own products not just selling them in a service, the first party applications could quite easily adapt to this by switching quickly to the available model and freeing up the in-demand one.

This is the entire premise behind the cloud, the reason it was Amazon did it first, they had the largest workloads at the time before Web 2.0 and SaaS was a thing.

Only businesses with large first party apps succeeded in the cloud provider space, companies like HP, IBM all failed and their time to failure strongly correlated to their amount of first party apps they operated. i.e. These apps anyway needed to keep a lot of idle capacity for peak demand capacity they could now monetize and co-mingle in the cloud.

LLMs as a service is not any different from S3 launched 20 years ago.

---

[1] It isn't, at the scale they are operating these models it shouldn't matter at all, it is not individual GPUs or machines that make a difference in load handling at all. Only few users are going to explicitly pining a specific patch version for the rest they can serve either one that is available immediately or cheaply.

cubefox · 2025-09-26T07:17:55 1758871075

That would be even more confusing because then it is unclear whether 2.6 Flash is better than 2.5 Pro.

hahn-kev · 2025-09-26T07:36:05 1758872165

Is a 2024 Mac boo pro better than a 2025 Mac book?

cubefox · 2025-09-26T08:17:59 1758874679

Good question

herpderperator · 2025-08-29T03:30:48 1756438248

Reminds me of Dragon Drop... https://www.youtube.com/watch?v=DCu1G2rxj5c

herpderperator · 2025-08-20T22:09:06 1755727746

Have they fixed the ability to easily transfer your existing Android data to the new Android phone? I find that every time I upgrade, despite choosing the options to transfer apps/settings, that 90% of the apps I open just greet me with the login screen and I have to set everything up completely from scratch. I remember maybe a handful of apps, I think one was Uber, that were able to transfer everything including the login session. That was truly magic. That's how it should be for all apps. I understand banks might have special security requirements and I already know for Google Wallet, your cards need to be reactivated even if they transfer over, but most apps are not banks.

gruez · 2025-08-20T23:03:49 1755731029

Blame the app developers, not google. They specifically added a backup/restore mode for device to device transfer, that bypasses backup blacklists[1]. However apps can still opt out by registering a backup agent, and returning no data.

[1] https://developer.android.com/identity/data/testingbackup

summm · 2025-08-21T01:15:33 1755738933

Google actively avoided providing a local, secure, and seamless backup or even an interface for 3rd party backup services to make users more dependent on Google cloud services. Of course many app developers decided the Google cloud is too insecure, being not end-to-end encrypted. And Google enables them by not giving the users ways to override those stupid decisions. This wouldn't have happened on PCs, where you can mostly just copy over the application's user directory.

gruez · 2025-08-21T01:25:31 1755739531

>Of course many app developers decided the Google cloud is too insecure, being not end-to-end encrypted

But so far as I can tell D2D transfers don't hit the cloud?

>For a D2D transfer, the Backup Manager Service queries your app for backup data and passes it directly to the Backup Manager Service on the new device, which loads it in to your app.

https://developer.android.com/identity/data/testingbackup

If your app is opting out of backup by implementing a custom backup agent that returns no data, it's pretty clear you're against user backups, period.

paulryanrogers · 2025-08-20T22:25:22 1755728722

Pixel to Pixel has been smooth for me since the Pixel 4. Haven't don't cross manufacturer for a while.

atomicthumbs · 2025-08-21T01:53:25 1755741205

i don't think they're ever gonna fix that

herpderperator · 2025-08-20T19:24:58 1755717898

For the sake of understanding, can you explain why putting CloudFront in front of the buckets helps?

bhattisatish · 2025-08-20T20:32:59 1755721979

Cloudfront allows you to map your S3 with both

- signed url's in case you want a session base files download

- default public files, for e.g. a static site.

You can also map a domain (sub-domain) to Cloudfront with a CNAME record and serve the files via your own domain.

Cloudfront distributions are also CDN based. This way you serve files local to the users location, thus increasing the speed of your site.

For lower to mid range traffic, cloudfront with s3 is cheaper as the network cost of cloudfront is cheaper. But for large network traffic, cloudfront cost can balloon very fast. But in those scenarios S3 costs are prohibitive too!