Yes it's a large chunk, but not everything! Marc had a comment on bluesky regarding this:
> Many SQL aggregations are monotonic operations (e.g. MAX, SUM, etc) that can be partially completed on each node and then post-merged. Some (e.g. DISTINCT) can be transformed into monotonic ops with some effort. Some aren't possible to do this way. (Ref on monotonicity: arxiv.org/pdf/1901.01930)
The benefit of this is that a lot more work is done _close_ to the data. The trend is that bandwidth is getting larger in data centers, but latency isn't improving at the same rate. Reducing the number of round trips between QP and storage greatly improves the overall query latency, even if you have to do more work on the storage.
> The benefit of this is that a lot more work is done _close_ to the data.
But isn't that fundamentally at odds with the central idea of disaggregation
> At a fundamental level, scaling compute in a database system requires disaggregation of storage and compute. If you stick storage and compute together, you end up needing to scale one to scale the other, which is either impossible or uneconomical.
So either you can get good perf by doing the work close to data, or get good scalability by separating compute and data. But I can't see how you can do both.
According to page 128 of their S-1 filing [0], their claims bot "AI Jim" processes claims without human intervention in a third of cases, including automatically denying claims. So either the second tweet in the thread is a lie, or their S-1 filing is a lie. This was pointed out by Rachel Metz [1].
I think the real story will be that their nature is a financial engineering free lunch that is going to blow up spectacularly.
They say they are different because they reinsure most of their risk. So why doesn't everybody do this, print money and let someone else take the risk? Just skimming the S-1 makes me think of Greensill. It's some sort of arbitrage that makes it hard to put your finger on where the risk is.
Nope! Actually going off better info now it looks like they WERE using rows, but it was in the old .xls format instead of .xlsx which limits to 2^16 rows