Shouldn’t knowing how big it’s supposed to be make it easier to stop a zip bomb?...

jeroenhd · 2025-02-02T02:21:38 1738462898

That only works if the standard actually describes what you're supposed to do with extra data at the end, and everyone agrees.

In practice, there have been antivirus bypasses that made use of AV scanners treating the additional data differently from common extraction software (I believe it was winrar?).

One could argue that a text document with a corrupt file size byte should still be decodeable. One could also argue that the file is completely corrupt and should be treated as such. Knowing that there will be tools that will take the first approach regardless, I'd stick to just decoding the extra data and marking the file as potentially damaged rather than send the user down a data recovery rabbit hole for a file that'll decompress just fine.

o11c · 2025-02-02T05:50:32 1738475432

IIRC to decode raw zlib or deflate using only command-line tools, you have to prepend the gzip header and stream while ignoring errors at EOF.