{"id":"PYSEC-2022-222","details":"The Security Team noticed that the termination condition of the for loop in the readExternal method is a controllable variable, which, if tampered with, may lead to CPU exhaustion. As a fix, we added an upper bound and termination condition in the read and write logic. We classify it as a \"low-priority but useful improvement\". SystemDS is a distributed system and needs to serialize/deserialize data but in many code paths (e.g., on Spark broadcast/shuffle or writing to sequence files) the byte stream is anyway protected by additional CRC fingerprints. In this particular case though, the number of decoders is upper-bounded by twice the number of columns, which means an attacker would need to modify two entries in the byte stream in a consistent manner. By adding these checks robustness was strictly improved with almost zero overhead. These code changes are available in versions higher than 2.2.1.","aliases":["CVE-2022-26477","GHSA-m43h-hfrq-x8wx"],"modified":"2023-11-08T04:08:54.044327Z","published":"2022-06-27T18:15:00Z","references":[{"type":"WEB","url":"https://lists.apache.org/thread/r4x2d2r6d4zykdrrx6s2l4qbxgzws0z3"},{"type":"ADVISORY","url":"https://github.com/advisories/GHSA-m43h-hfrq-x8wx"}],"affected":[{"package":{"name":"systemds","ecosystem":"PyPI","purl":"pkg:pypi/systemds"},"ranges":[{"type":"ECOSYSTEM","events":[{"introduced":"0"},{"fixed":"2.2.2"}]}],"versions":["0.2.0","2.0.0","2.0.1","2.1.0","2.2.0","2.2.1"],"database_specific":{"source":"https://github.com/pypa/advisory-database/blob/main/vulns/systemds/PYSEC-2022-222.yaml"}}],"schema_version":"1.7.3"}