You have that backwards. GFS was replaced by Colossus ca. 2010; Colossus largely functions as blob storage with append-only semantics for modification. BigTable is a KV store, and the row size limits (256MB) make it unsuitable for blob storage. GCS is built on top of Spanner (metadata, small files) and Colossus (bulk data storage).
But that's beside the point. When people say "RDBMS" or "filesystem" they mean the full suite of SQL queries and POSIX semantics, neither of which you get with KV stores like BigTable or distributed storage like Colossus.
The simplest example of POSIX semantics that gets discarded quickly is the "fast folder move" operation. This is difficult or impossible to achieve when keys are the full path of the file, and is much easier to implement with hierarchical directory entries. However, many applications are absolutely fine with the semantics of "write entire file, read file, delete file", which enables huge simplifications and optimizations!
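To make that concrete, here is a toy sketch of mine (not how Colossus/GCS actually store things) of why a directory rename is O(number of descendants) when keys are full paths:

#include <map>
#include <string>
#include <utility>

// Toy flat namespace: every object is keyed by its full path, e.g. "/a/x.txt".
std::map<std::string, std::string> blobs;  // path -> contents

// "Renaming" the directory prefix from_prefix -> to_prefix means rewriting the
// key of every descendant object: O(number of descendants) keys touched.
void rename_dir(const std::string& from_prefix, const std::string& to_prefix) {
    std::map<std::string, std::string> moved;
    auto it = blobs.lower_bound(from_prefix);
    while (it != blobs.end() &&
           it->first.compare(0, from_prefix.size(), from_prefix) == 0) {
        moved.emplace(to_prefix + it->first.substr(from_prefix.size()),
                      std::move(it->second));
        it = blobs.erase(it);
    }
    blobs.merge(moved);
}
// With hierarchical directory entries, the same move is a single metadata
// update: detach the directory node from its old parent, attach it to the new one.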
From what I can see in the codegen, defer is not implemented "properly": the deferred statements are only executed when the block exits normally; leaving the block via "return", "break", "continue" (including their labelled variants! those interact subtly with outer defers), or "goto" skips them entirely. Which, arguably, should not happen:
var f = fopen("file.txt", "r");
defer fclose(f);
if fread(&ch, 1, 1, f) <= 0 { return -1; }
return 0;
would not close the file if it was empty. In fact, I am not sure how it works even for the normal "return 0": it looks like the deferred statements are emitted after the "return", textually, so they only work properly in void-returning functions and internal blocks.
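For contrast, what "proper" defer behavior looks like can be sketched with a destructor-based scope guard in C++ (my illustration, not the codegen under discussion): the cleanup runs on every exit path, including the early return.

#include <cstdio>

// Tiny scope guard: the callable runs in the destructor, so it fires on every
// way out of the enclosing scope (fall-through, early return, exceptions).
template <class F>
struct Defer {
    F f;
    ~Defer() { f(); }
};
template <class F> Defer(F) -> Defer<F>;

int read_first_byte(const char* path) {
    std::FILE* f = std::fopen(path, "r");
    if (!f) return -1;
    Defer close_f{[&] { std::fclose(f); }};  // analogous to `defer fclose(f);`

    char ch;
    if (std::fread(&ch, 1, 1, f) == 0) return -1;  // close_f still runs here
    return ch;                                     // ...and here
}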
Yes, sometimes that's indeed an opportunity cost. But it's also a catalyst for making things possible or affordable that otherwise wouldn't be.
Many technologies are first explored in niche and luxury segments and only later fully developed for the masses, with mass utility. Space exploration, motorsport, the military, ... they can all seem like wasteful enterprises, and yet a lot of new technology comes out of these areas.
There is also a problem of nested virtualization. If the VM has its own "imaginary" page tables on top of the hypervisor's page tables, then the number of actual physical memory reads goes from 4–6 to 16–36.
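Back-of-envelope for where numbers like that come from (my sketch, using the usual 2D page-walk argument): with n guest levels and m host levels, each of the n guest page-table pointers is a guest-physical address that itself needs m host-table reads plus the read of the guest entry, and the final guest-physical data address needs another m host reads, so a full translation costs about (n+1)*(m+1) - 1 memory accesses. For n = m = 4 that is 5*5 - 1 = 24, and with 5-level tables on either side you approach the mid-30s, which matches the 16–36 range above.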
I mean, if the stacks grew upwards, that alone would nip 90% of buffer overflow attacks in the bud. Moving the return address from the activation frame into a separate stack would help as well, but I understand that having the activation frame be a single piece of data (essentially the current continuation's closure) can be quite convenient.
The PL/I stack growing up rather than down reduced the potential impact of stack overflows in Multics (and PL/I already had better memory safety, with bounded strings, etc.). TFA's author would probably have appreciated the segmented memory architecture as well.
There is no reason why the C/C++ stack can't grow up rather than down. On paged hardware, both the stack and heap could (and probably should) grow up. "C's stack should grow up", one might say.
The x86-64 call instruction decrements the stack pointer to push the return address, and the x86-64 push instructions decrement the stack pointer too. The push instructions are easy to work around because most compilers already allocate the entire stack frame at once and then use offset accesses, but the call instruction would be kind of annoying.
ARM does not suffer from that problem thanks to its link register and generic pre/post-modify addressing. RISC-V is probably also safe, but I have not looked specifically.
> [x86] call instruction would be kind of annoying
I wonder what the best way to do it (on current x86) would be. The stupid simple way might be to adjust SP before the call instruction, and that seems to me like something that would be relatively efficient (simple addition instruction, issued very early).
Some architectures had CALL that was just "STR [SP], IP" without anything else, and it was up to the called procedure to adjust the stack pointer further to allocate for its local variables and the return slot for further calls. The RET instruction would still normally take an immediate (just as e.g. x86/x64's RET does) and additionally adjust the stack pointer by its value (either before or after loading the return address from the tip of the stack).
Linux on PA-RISC also has an upward-growing stack (AFAIK, it's the only architecture Linux has ever had an upward-growing stack on; it's certainly the only currently-supported one).
Both this and parent comment about PA-RISC are very interesting.
As noted, stack growing up doesn't prevent all stack overflows, but it makes it less trivially easy to overwrite a return address. Bounded strings also made it less trivially easy to create string buffer overflows.
In ARMv4/v5 (non-Thumb mode) the stack is purely a convention that the hardware does not enforce. Nobody forces you to use r13 as the stack pointer or to make the stack descending. You can prototype your approach trivially with small changes to gcc and the Linux kernel. As this is a standard architectural feature, qemu and the like will support emulating this, and it would run fine on real hardware too. I'd read the paper you publish based on this.
For modern systems, stack buffer overflow bugs haven't been great to exploit for a while. You need at least a stack cookie leak, and on Apple Silicon the return addresses are MACed, so overwriting them is a fool's errand (2^-16 chance of success).
Most exploitable memory corruption bugs are heap buffer overflows.
16 bit programming kinda sucked. I caught the tail end of it but my first project was using Win32s so I just had to cherry-pick what I wanted to work on to avoid having to learn it at all. I was fortunate that a Hype Train with a particularly long track was about to leave the station and it was 32 bit. But everyone I worked with or around would wax poetic about what a pain in the ass 16 bit was.
Meanwhile though, the PC memory model really did sort of want memory to be divided into at least a couple of classes and we had to jump through a lot of hoops to deal with that era. Even if I wasn't coding in 16 bit I was still consuming 16 bit games with boot disks.
I was recently noodling around with a retrocoding setup. I have to admit that I did grin a silly grin when I found a set of compile flags for a DOS compiler that caused sizeof(void far*) to return 6 - the first time I'd ever seen it return a non-power-of-two in my life.
const int backupIntervalHours = 24;
const int approxBackupDurationHours = 2;
const int wiggleRoomHours = 2;
const int ageAlertThresholdHours = backupIntervalHours + approxBackupDurationHours + wiggleRoomHours;
static_assert(28 == ageAlertThresholdHours);
It's a shame more languages don't have static asserts... faking it with mismatched array-literal dimensions or duplicate keys in map literals is way too ugly and distracts from the intent.
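For anyone who hasn't run into it, the C/C++ flavor of that hack looks roughly like this (a sketch, not code from the thread):

// Pre-C++11/C11 emulation: a negative array size is a compile-time error,
// so a false condition fails the build.
#define STATIC_ASSERT(cond) typedef char static_assert_failed[(cond) ? 1 : -1]

STATIC_ASSERT(28 == 24 + 2 + 2);    // compiles
// STATIC_ASSERT(29 == 24 + 2 + 2); // would not: array of negative size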
No static assert needed, no need to pre-compute the total up front, no identifiers like `approxBackupDurationHours`, no cognitive overhead from possibly colliding with other names in scope, and no superfluous/verbose variable declaration preamble.
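(Guessing at the inline-commented style being referred to, something along these lines:)

// Hypothetical reconstruction of the unnamed-but-commented constants:
const int ageAlertThresholdHours = 24   // backup interval
                                 + 2    // approx backup duration
                                 + 2;   // wiggle room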
I'm a believer in restricting the scope of definitions as much as possible, and I like programming languages that allow creating local bindings just to define another binding.
For example:
local
  val backupIntervalHours = 24
  val approxBackupDurationHours = 2
  val wiggleRoomHours = 2
in
  val ageAlertThresholdHours = backupIntervalHours + approxBackupDurationHours + wiggleRoomHours
end
Then it's easier to document, in code, what components a constant is composed of, without introducing unnecessary bindings in the scope of the relevant variable. Sure, constants are just data, but the first question that pops into my head when seeing something in unfamiliar code is "What is the purpose of this?", and the smaller the scope, the faster it can be discarded.
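The same trick carries over to other languages; for instance, a C++ sketch (mine, not from the thread) using an immediately-invoked lambda so the component names never leak into the enclosing scope:

const int ageAlertThresholdHours = [] {
    // Visible only inside the lambda, much like SML's `local ... in ... end`.
    const int backupIntervalHours       = 24;
    const int approxBackupDurationHours = 2;
    const int wiggleRoomHours           = 2;
    return backupIntervalHours + approxBackupDurationHours + wiggleRoomHours;
}();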
Mentally discarding a name still takes some amount of effort, even if local.
I often write things the way you have done it, for the simple reason that, when writing the code, I may feel I might have more than one use for the constant, and I'm used to thinking algebraically.
Except that I might make them global, at the top of a module. Why? Because they encode assumptions that might be useful to know at a glance.
And I probably wouldn't go back and remove the constants once they were named.
But I also have no problem with unnamed but commented constants like the ones in the comment you responded to.
Gee, I wonder why the tooling for ML in Lisp is missing even though the early ML frameworks were in Lisp. Perhaps there is something about the language that stifles truly wide collaboration?
I doubt it considering there are massive Clojure codebases with large teams collaborating on them every day. The lack of Lisp tooling and the prevalence of Python are more a result of inertia, low barrier to entry and ecosystem lock-in.
> the same sort of reason that retailers can't be held to incorrectly published prices (in the UK at least, a displayed price is an “invitation to tender”, not a contract or other promise)
The hell? Over here, the price tags are a sort of public contract, to which the seller pre-commits. The seller forgot to change the tags? That's not the buyer's problem.
Since money has not changed hands, you could always decide not to buy at the counter. So at least in the countries I have been to, it is not legally binding.
Only if deliberate. If the incorrect price is corrected as soon as the problem is noticed then that is (legally) fine. If the incorrect price is left displayed, or was put up deliberately to draw people in, then it is bait & switch.
The other solid bait & switch is advertising a product that they don't have any of to sell, in the hope that you'll come in and buy something more expensive (or lower value) instead.