gh-132983: Don't allow trailer data in ZstdFile#133736
Conversation
|
The current behavior matches LZMA. I think unlike >>> from lzma import LZMAFile, compress
>>> from io import BytesIO
>>> invalid = compress(b'foo') + b'bar'
>>> LZMAFile(BytesIO(invalid)).read()
b'foo'
>>> |
Sorry, something went wrong.
|
You are right this is the case for However, >>> from lzma import LZMAFile, compress, FORMAT_XZ
>>> from io import BytesIO
>>> invalid = compress(b'foo') + b'bar'
>>> LZMAFile(BytesIO(invalid), format=FORMAT_XZ).read()
Traceback (most recent call last):
File "<python-input-3>", line 1, in <module>
LZMAFile(BytesIO(invalid), format=FORMAT_XZ).read()
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^^
File "/redacted/lzma.py", line 208, in read
return self._buffer.read(size)
~~~~~~~~~~~~~~~~~^^^^^^
File "/redacted/_compression.py", line 118, in readall
while data := self.read(sys.maxsize):
~~~~~~~~~^^^^^^^^^^^^^
File "/redacted/_compression.py", line 99, in read
raise EOFError("Compressed file ended before the "
"end-of-stream marker was reached")
EOFError: Compressed file ended before the end-of-stream marker was reached |
Sorry, something went wrong.
emmatyping
left a comment
There was a problem hiding this comment.
Okay this looks good then!
Sorry, something went wrong.
|
In addition, consider
Since for |
Sorry, something went wrong.
50b5370
into
python:main
May 10, 2025
|
Thanks @Rogdham for the PR, and @AA-Turner for merging it 🌮🎉.. I'm working now to backport this PR to: 3.14. |
Sorry, something went wrong.
(cherry picked from commit 50b5370) Co-authored-by: Rogdham <3994389+Rogdham@users.noreply.github.com>
We previously made sure that an exception is raised when decompressing trailer data with
decompress:Indeed, the Zstandard specification says “Zstandard compressed data is made of one or more frames”, and it does not say that random data can be added at the end.
However, this is not the case in
ZstdFile/zstd.open:After this PR, the last call becomes: