To add another data point: after running ES in production for the past 10 years, I have to say it has never given us any headaches. We've had issues with ScyllaDB, Redis, etc., but ES just chugs along and works.
The one issue I remember: early on, on ES 5, it regularly went down. It turned out that some _very long_ input was being passed into the search by a scraper, which killed the cluster.
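A cheap guard against that failure mode is to validate search input before it ever reaches the cluster. A minimal sketch; the names (`sanitize_query`, `MAX_QUERY_LEN`) and the length cap are illustrative assumptions, not anything from a real codebase:

```python
# Hypothetical input guard: refuse oversized search strings before they
# are forwarded to Elasticsearch. The cap is arbitrary; tune it to the
# longest legitimate query your users actually type.
MAX_QUERY_LEN = 1024

def sanitize_query(raw: str, max_len: int = MAX_QUERY_LEN) -> str:
    """Trim whitespace and reject inputs longer than max_len characters."""
    q = raw.strip()
    if len(q) > max_len:
        raise ValueError(f"query too long: {len(q)} > {max_len} chars")
    return q
```

Rejecting (rather than silently truncating) keeps the behaviour predictable and makes abusive clients visible in your error logs.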
I agree, and I don't get where the claims that ES is hard to operate come from. Yeah, if you allow arbitrary aggregations that exceed the heap, or expensive queries that effectively iterate over everything, you're gonna have a bad time. But apart from that, as long as you understand your data model, your searches, and how your data is indexed, ES is absolutely rock-solid and scales and performs like a beast. We run a 35-node cluster with ~240TB of disk, 4.5TB of RAM, and about 100TB of documents, and it serves hundreds of queries per second. The whole thing needs no maintenance beyond replacing nodes that fail for unrelated reasons (hardware, hosting). Version upgrades are smooth as well.
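One way to keep those "expensive queries that iterate over everything" off the cluster is to wrap every request body in defensive limits. A sketch, assuming the standard Elasticsearch search-body parameters (`size`, `timeout`, `terminate_after`); the helper name and the specific numbers are my own illustrative choices:

```python
def guarded_search_body(query: dict, size: int = 20) -> dict:
    """Wrap an arbitrary query with limits so one request can't
    monopolise the cluster. All values here are illustrative defaults."""
    return {
        "query": query,
        "size": min(size, 100),       # never let callers page huge result sets
        "timeout": "2s",              # ask ES to cut off slow shard searches
        "terminate_after": 100_000,   # cap documents examined per shard
    }
```

Note that `timeout` and `terminate_after` are best-effort and can return partial results, so this is a safety net on top of sane query design, not a substitute for it.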
The only bigger issue we had was when we added 10 nodes to double the cluster's initial capacity. Performance tanked, and it took us about half a day to figure out that the new nodes were using dmraid (Linux RAID0), and as a result their block devices had a much higher default read-ahead value (8192) than the existing nodes, which caused heavy read amplification. The ES manual specifically documents this, but since we hadn't run into it ourselves, it took us a while to realise what was at fault.
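That kind of drift is easy to catch with a pre-flight check that reads the kernel's read-ahead setting from sysfs (`/sys/block/<dev>/queue/read_ahead_kb`, in KiB) before a node joins the cluster. A sketch; the function names and the threshold are assumptions for illustration, and `sys_root` is parameterised only so the logic can be tested without a real block device:

```python
from pathlib import Path

def read_ahead_kb(device: str, sys_root: str = "/sys") -> int:
    """Return the kernel read-ahead (in KiB) for a block device,
    e.g. read_ahead_kb("md0")."""
    path = Path(sys_root) / "block" / device / "queue" / "read_ahead_kb"
    return int(path.read_text().strip())

def check_read_ahead(device: str, max_kb: int = 128, sys_root: str = "/sys") -> bool:
    """True when read-ahead is at or below a sane threshold; a high value
    on a search workload causes heavy read amplification."""
    return read_ahead_kb(device, sys_root) <= max_kb
```

Running a check like this across all nodes whenever capacity is added would have flagged the mismatched RAID0 nodes immediately instead of after half a day of debugging.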
The thing I like about ES: when the business comes around with new requirements out of nowhere, the answer is always "Yup, we can do it!" Unlike other tools such as Cassandra, which force a data design from the get-go and make it expensive to change later on.
And they can pay the vendors for "bring your own cloud" or similar. If data sovereignty matters to them, they can probably afford it. And if cost is the issue, they wouldn't be looking at hosted solutions in the first place.
Nobody actively looks after it. We have good alerting and monitoring, and if an alert fires, say a node going down during some Kubernetes node shuffling, or a version upgrade needs to be performed, one of our few infra people handles it.
It's really not something that needs much attention in my experience.