We deal with Haskell resource leaks the same way you would in C++ or Java. We ha...

pmahoney · on March 26, 2014

> We can restart the process without losing any connections

Would you mind expanding on this a bit? I'm not too familiar with Haskell, but I am familiar with various was of blocking new connections while allowing existing connections to complete, either at the load-balancer level or built-in each individual process.

What Haskell stack are you using, and how are graceful restarts accomplished?

Thanks.

implicit · on March 27, 2014

One of my coworkers wrote a really cool bit of software to do this. I want him to open source it.

Basically, you can share a single socket amongst many servers. The OS ensures that just one process accepts each connection.

You can therefore have a manager process that owns the socket and passes it on to application processes.

To update, start new processes, then politely tell the old ones to go away.

dllthomas · on March 27, 2014

One really cool thing in Linux is that you can actually pass file descriptors between processes over unix domain sockets.

enigmo · on March 27, 2014

Windows has supported this for ~14 years too.

dllthomas · on March 27, 2014

Good to know. Does it work for everything that's an fd in Linux? I know you've got to treat sockets and files differently in some cases (or at least did once)...

enigmo · on March 27, 2014

It works for most kernel handles, sockets might be a little more normal starting with Win7 but I stopped doing Windows development around then.

Here are the official docs: http://msdn.microsoft.com/en-us/library/windows/desktop/ms72...

dllthomas · on March 27, 2014

Looks like there's a separate function for sockets.

Still, cool stuff there too.

samstokes · on March 27, 2014

einhorn [1] implements this model and is pretty effective. Used in production at Stripe and other places. (It's written in Ruby, but can run application processes in any language.)

[1] https://github.com/stripe/einhorn

DonPellegrino · on March 27, 2014

Basically, catch SIGINT, then stop listening to a socket/port. Finish all current requests and exit. The "watcher" parent process will restart the process with the new executable. Repeat for all other processes listening to the socket/port.

shadytrees · on March 26, 2014

I can't answer for grandparent, but you should check out https://github.com/notogawa/graceful

joehillen · on March 26, 2014

Except in Haskell you can build ekg right into your server. http://ocharles.org.uk/blog/posts/2012-12-11-24-day-of-hacka...

benmos · on March 26, 2014

"we added a tracker for the number of suspended Haskell threads" - would you mind sharing how you did that? I couldn't see any obvious GHC APIs for it.

implicit · on March 26, 2014

It looks like you're right. I misspoke.

We track total threads, working or not. It works great as an indicator because it tends to stay below the number of CPU cores on the server.

benmos · on March 26, 2014

I take it you mean OS-level threads then?

implicit · on March 26, 2014

We track Haskell threads.

edit: Found the code. :)

We rolled our own implementation. Our WAI application action increments a counter and decrements it again whenever an HTTP request is received and completed.

It doesn't track threads created as a part of HTTP request handling, but we don't allow those actions to forkIO anyway. There hasn't been any demand for it.

benmos · on March 27, 2014

Ah, right, that makes sense - thanks.

thinkpad20 · on March 26, 2014

... so, how? Isn't that what he was asking?