lxml known bugs
12 known bugs in lxml (PyPI), with affected versions, fixes, and workarounds. Sourced from upstream issue trackers.
Known bugs
| Severity | Affected | Fixed in | Title | Status | Source |
|---|---|---|---|---|---|
| medium | any | 86368e9cf70a0ad23cccd5ee32de847149af0c6f | PYSEC-2022-230: NULL pointer dereference allows attackers to cause a denial of service (or application crash). This only applies when lxml is used together with libxml2 2.9.10 through 2.9.14; libxml2 2.9.9 and earlier are not affected. It allows triggering crashes through forged input data, given a vulnerable code sequence in the application. The vulnerability is caused by the iterwalk function (also used by the canonicalize function). Such code shouldn't be in widespread use, given that parsing + iterwalk would usually be replaced with the more efficient iterparse function. However, an XML converter that serialises to C14N would also be vulnerable, for example, and there are legitimate use cases for this code sequence. If untrusted input is received (also remotely) and processed via the iterwalk function, a crash can be triggered. | fixed | osv:PYSEC-2022-230 |
| medium | any | 12fa9669007180a7bb87d990c375cf91ca5b664a | PYSEC-2021-852: lxml is a library for processing XML and HTML in the Python language. Prior to version 4.6.5, the HTML Cleaner in lxml.html lets certain crafted script content pass through, as well as script content in SVG files embedded using data URIs. Users that employ the HTML Cleaner in a security-relevant context should upgrade to lxml 4.6.5 to receive a patch. There are no known workarounds available. | fixed | osv:PYSEC-2021-852 |
| medium | any | a5f9cb52079dc57477c460dbe6ba0f775e14a999 | PYSEC-2021-19: An XSS vulnerability was discovered in python-lxml's clean module versions before 4.6.3. When the safe_attrs_only and forms arguments are disabled, the Cleaner class does not remove the formaction attribute, allowing JS to bypass the sanitizer. A remote attacker could exploit this flaw to run arbitrary JS code on users who interact with incorrectly sanitized HTML. This issue is patched in lxml 4.6.3. | fixed | osv:PYSEC-2021-19 |
| medium | 1.2 | 4.6.2 | PYSEC-2020-62: An XSS vulnerability was discovered in python-lxml's clean module. The module's parser didn't properly imitate browsers, which caused different behaviors between the sanitizer and the user's page. A remote attacker could exploit this flaw to run arbitrary HTML/JS code. | fixed | osv:PYSEC-2020-62 |
| medium | any | 6be1d081b49c97cfd7b3fbd934a193b668629109 | PYSEC-2018-12: An issue was discovered in lxml before 4.2.5. lxml/html/clean.py in the lxml.html.clean module does not remove javascript: URLs that use escaping, allowing a remote attacker to conduct XSS attacks, as demonstrated by "j a v a s c r i p t:" in Internet Explorer. This is a similar issue to CVE-2014-3146. | fixed | osv:PYSEC-2018-12 |
| medium | any | 3.3.5 | PYSEC-2014-9: Incomplete blacklist vulnerability in the lxml.html.clean module in lxml before 3.3.5 allows remote attackers to conduct cross-site scripting (XSS) attacks via control characters in the link scheme to the clean_html function. | fixed | osv:PYSEC-2014-9 |
| medium | any | 4.2.5 | Improper Neutralization of Input During Web Page Generation in LXML: An issue was discovered in lxml before 4.2.5. lxml/html/clean.py in the lxml.html.clean module does not remove javascript: URLs that use escaping, allowing a remote attacker to conduct XSS attacks, as demonstrated by "j a v a s c r i p t:" in Internet Explorer. This is a similar issue to CVE-2014-3146. | fixed | osv:GHSA-xp26-p53h-6h2p |
| medium | any | 4.9.1 | lxml NULL Pointer Dereference allows attackers to cause a denial of service: NULL pointer dereference allows attackers to cause a denial of service (or application crash). This only applies when lxml is used together with libxml2 2.9.10 through 2.9.14; libxml2 2.9.9 and earlier are not affected. It allows triggering crashes through forged input data, given a vulnerable code sequence in the application. The vulnerability is caused by the iterwalk function (also used by the canonicalize function). Such code shouldn't be in widespread use, given that parsing + iterwalk would usually be replaced with the more efficient iterparse function. However, an XML converter that serialises to C14N would also be vulnerable, for example, and there are legitimate use cases for this code sequence. If untrusted input is received (also remotely) and processed via the iterwalk function, a crash can be triggered. | fixed | osv:GHSA-wrxv-2j5q-m38w |
| medium | any | 4.6.2 | lxml vulnerable to Cross-site Scripting: An XSS vulnerability was discovered in python-lxml's clean module. The module's parser didn't properly imitate browsers, which caused different behaviors between the sanitizer and the user's page. A remote attacker could exploit this flaw to run arbitrary HTML/JS code. | fixed | osv:GHSA-pgww-xf46-h92r |
| medium | any | 4.6.3 | lxml vulnerable to Cross-Site Scripting: An XSS vulnerability was discovered in the Python `lxml` clean module versions before 4.6.3. When the `safe_attrs_only` and `forms` arguments are disabled, the `Cleaner` class does not remove the `formaction` attribute, allowing JS to bypass the sanitizer. A remote attacker could exploit this flaw to run arbitrary JS code on users who interact with incorrectly sanitized HTML. This issue is patched in `lxml` 4.6.3. | fixed | osv:GHSA-jq4v-f5q6-mjqq |
| medium | any | 3.3.5 | lxml Cross-site Scripting Via Control Characters: Incomplete blacklist vulnerability in the `lxml.html.clean` module in lxml before 3.3.5 allows remote attackers to conduct cross-site scripting (XSS) attacks via control characters in the link scheme to the `clean_html` function. | fixed | osv:GHSA-57qw-cc2g-pv5p |
| medium | any | 4.6.5 | lxml's HTML Cleaner allows crafted and SVG embedded scripts to pass through: The HTML Cleaner in lxml.html lets certain crafted script content pass through, as well as script content in SVG files embedded using data URIs. Users that employ the HTML Cleaner in a security-relevant context should upgrade to lxml 4.6.5, which resolves the issue. There are no workarounds. The issues are tracked under the report IDs GHSL-2021-1037 and GHSL-2021-1038. | fixed | osv:GHSA-55x5-fj6c-h6m8 |
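To check whether a given lxml release predates any of the fixes listed above, the "Fixed in" column of the GHSA rows can be compared against an installed version string. The following is a minimal sketch, not an official tool: `FIXED_IN`, `parse_version`, and `unfixed_advisories` are hypothetical names introduced here, the release numbers are copied from the table, and the version parser is deliberately simple (it reads only the leading digits of each dot-separated component).

```python
import re

# "Fixed in" releases copied from the GHSA rows of the table above.
# Note: GHSA-wrxv-2j5q-m38w only applies with libxml2 2.9.10 through 2.9.14.
FIXED_IN = {
    "GHSA-57qw-cc2g-pv5p": "3.3.5",
    "GHSA-xp26-p53h-6h2p": "4.2.5",
    "GHSA-pgww-xf46-h92r": "4.6.2",
    "GHSA-jq4v-f5q6-mjqq": "4.6.3",
    "GHSA-55x5-fj6c-h6m8": "4.6.5",
    "GHSA-wrxv-2j5q-m38w": "4.9.1",
}


def parse_version(text):
    """Turn '4.6.3' into (4, 6, 3).

    Hypothetical helper: keeps only the leading digits of each component
    and stops at the first component that has none.
    """
    parts = []
    for piece in text.split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def unfixed_advisories(installed):
    """Return the advisory IDs whose fix was released after `installed`."""
    current = parse_version(installed)
    return sorted(
        advisory
        for advisory, fixed in FIXED_IN.items()
        if current < parse_version(fixed)
    )
```

In practice the installed version could be obtained with `importlib.metadata.version("lxml")` from the standard library and passed to `unfixed_advisories`; an empty list means the release is at or past every fix listed in the table.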
API access
Get this data programmatically: free, no authentication required.
curl https://depscope.dev/api/bugs/pypi/lxml
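The same request can be made from Python using only the standard library. This is a sketch under stated assumptions: `bugs_url` and `fetch_bugs_text` are hypothetical helper names, the `/{ecosystem}/{package}` path pattern is generalized from the single URL shown above, and the response format is not documented here, so the body is returned as plain text rather than parsed.

```python
from urllib.parse import quote
from urllib.request import urlopen

# Base path taken from the curl example above.
API_BASE = "https://depscope.dev/api/bugs"


def bugs_url(ecosystem, package):
    """Build the endpoint URL; the path pattern is an assumption
    generalized from the one documented example (pypi/lxml)."""
    return f"{API_BASE}/{quote(ecosystem, safe='')}/{quote(package, safe='')}"


def fetch_bugs_text(ecosystem, package, timeout=10.0):
    """Fetch the raw response body as text.

    The response schema is not described in this page, so no parsing
    is attempted here.
    """
    with urlopen(bugs_url(ecosystem, package), timeout=timeout) as response:
        return response.read().decode("utf-8")
```

Calling `fetch_bugs_text("pypi", "lxml")` would issue the same GET request as the curl command. If the body turns out to be JSON, it could then be passed to `json.loads`.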