- update to 3.2.1:
* The methods ``apply_templates()`` and ``process_children()`` of XSLT
extension elements have gained two new boolean options ``elements_only``
and ``remove_blank_text`` that discard either all strings or whitespace-only
strings from the result list.
* When moving Elements to another tree, the namespace cleanup mechanism
no longer drops namespace prefixes from attributes for which it finds
a default namespace declaration, to prevent them from appearing as
unnamespaced attributes after serialisation.
* Returning non-type objects from a custom class lookup method could lead
to a crash.
* Instantiating and using subtypes of Comments and ProcessingInstructions
crashed. (forwarded request 175226 from dirkmueller)
OBS-URL: https://build.opensuse.org/request/show/175240
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-lxml?expand=0&rev=41
- update to 3.2.1:
* The methods ``apply_templates()`` and ``process_children()`` of XSLT
extension elements have gained two new boolean options ``elements_only``
and ``remove_blank_text`` that discard either all strings or whitespace-only
strings from the result list.
* When moving Elements to another tree, the namespace cleanup mechanism
no longer drops namespace prefixes from attributes for which it finds
a default namespace declaration, to prevent them from appearing as
unnamespaced attributes after serialisation.
* Returning non-type objects from a custom class lookup method could lead
to a crash.
* Instantiating and using subtypes of Comments and ProcessingInstructions
crashed.
OBS-URL: https://build.opensuse.org/request/show/175226
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=56
- update to 3.2.0:
* Leading whitespace could change the behaviour of the string
parsing functions in ``lxml.html``.
* LP#599318: The string parsing functions in ``lxml.html`` are more robust
in the face of uncommon HTML content like framesets or missing body tags.
Patch by Stefan Seelmann.
* LP#712941: I/O errors while trying to access files with paths that contain
non-ASCII characters could raise ``UnicodeDecodeError`` instead of properly
reporting the ``IOError``.
* LP#673205: Parsing from in-memory strings disabled network access in the
default parser and made subsequent attempts to parse from a URL fail.
* LP#971754: lxml.html.clean appends 'nofollow' to 'rel' attributes instead
of overwriting the current value.
* LP#715687: lxml.html.clean no longer discards scripts that are explicitly
allowed by the user provided whitelist. Patch by Christine Koppelt.
- update to 3.2.0:
* Leading whitespace could change the behaviour of the string
parsing functions in ``lxml.html``.
* LP#599318: The string parsing functions in ``lxml.html`` are more robust
in the face of uncommon HTML content like framesets or missing body tags.
Patch by Stefan Seelmann.
* LP#712941: I/O errors while trying to access files with paths that contain
non-ASCII characters could raise ``UnicodeDecodeError`` instead of properly
reporting the ``IOError``.
* LP#673205: Parsing from in-memory strings disabled network access in the
default parser and made subsequent attempts to parse from a URL fail.
* LP#971754: lxml.html.clean appends 'nofollow' to 'rel' attributes instead
of overwriting the current value.
* LP#715687: lxml.html.clean no longer discards scripts that are explicitly (forwarded request 173959 from dirkmueller)
OBS-URL: https://build.opensuse.org/request/show/174252
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-lxml?expand=0&rev=40
- update to 3.2.0:
* Leading whitespace could change the behaviour of the string
parsing functions in ``lxml.html``.
* LP#599318: The string parsing functions in ``lxml.html`` are more robust
in the face of uncommon HTML content like framesets or missing body tags.
Patch by Stefan Seelmann.
* LP#712941: I/O errors while trying to access files with paths that contain
non-ASCII characters could raise ``UnicodeDecodeError`` instead of properly
reporting the ``IOError``.
* LP#673205: Parsing from in-memory strings disabled network access in the
default parser and made subsequent attempts to parse from a URL fail.
* LP#971754: lxml.html.clean appends 'nofollow' to 'rel' attributes instead
of overwriting the current value.
* LP#715687: lxml.html.clean no longer discards scripts that are explicitly
allowed by the user provided whitelist. Patch by Christine Koppelt.
- update to 3.2.0:
* Leading whitespace could change the behaviour of the string
parsing functions in ``lxml.html``.
* LP#599318: The string parsing functions in ``lxml.html`` are more robust
in the face of uncommon HTML content like framesets or missing body tags.
Patch by Stefan Seelmann.
* LP#712941: I/O errors while trying to access files with paths that contain
non-ASCII characters could raise ``UnicodeDecodeError`` instead of properly
reporting the ``IOError``.
* LP#673205: Parsing from in-memory strings disabled network access in the
default parser and made subsequent attempts to parse from a URL fail.
* LP#971754: lxml.html.clean appends 'nofollow' to 'rel' attributes instead
of overwriting the current value.
* LP#715687: lxml.html.clean no longer discards scripts that are explicitly
OBS-URL: https://build.opensuse.org/request/show/173959
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=54
- Update to version 2.3.3:
* lxml.html.tostring() gained new serialisation options with_tail and doctype.
* Fixed a crash when using iterparse() for HTML parsing and requesting start events.
* Fixed parsing of more selectors in cssselect. Whitespace before
pseudo-elements and pseudo-classes is significant as it is a descendant
combinator. "E :pseudo" should parse the same as "E *:pseudo", not "E:pseudo".
* lxml.html.diff no longer raises an exception when hitting 'img' tags without 'src' attribute.
- Changes from version 2.3.2:
* lxml.objectify.deannotate() has a new boolean option cleanup_namespaces to
remove the objectify namespace declarations (and generally clean up the
namespace declarations) after removing the type annotations.
* lxml.objectify gained its own SubElement() function as a copy of
etree.SubElement to avoid an otherwise redundant import of lxml.etree on the user side.
* Fixed the "descendant" bug in cssselect a second time
* Fixed parsing of some selectors in cssselect.
- Changes from version 2.3.1:
* New option kill_tags in lxml.html.clean to remove specific tags and their
content (i.e. their whole subtree).
* pi.get() and pi.attrib on processing instructions to parse
pseudo-attributes from the text content of processing instructions.
* lxml.get_include() returns a list of include paths that can be used to
compile external C code against lxml.etree.
* Resolver.resolve_file() takes an additional option close_file that
configures if the file(-like) object will be closed after reading or not.
* HTML cleaning didn't remove 'data:' links.
* The html5lib parser integration now uses the 'official' implementation in
html5lib itself, which makes it work with newer releases of the library.
* In lxml.sax, endElementNS() could incorrectly reject a plain tag name when
the corresponding start event inferred the same plain tag name to be in the default namespace.
* When an open file-like object is passed into parse() or iterparse(), the
OBS-URL: https://build.opensuse.org/request/show/108688
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/python-lxml?expand=0&rev=31
* lxml.html.tostring() gained new serialisation options with_tail and doctype.
* Fixed a crash when using iterparse() for HTML parsing and requesting start events.
* Fixed parsing of more selectors in cssselect. Whitespace before
pseudo-elements and pseudo-classes is significant as it is a descendant
combinator. "E :pseudo" should parse the same as "E *:pseudo", not "E:pseudo".
* lxml.html.diff no longer raises an exception when hitting 'img' tags without 'src' attribute.
- Changes from version 2.3.2:
* lxml.objectify.deannotate() has a new boolean option cleanup_namespaces to
remove the objectify namespace declarations (and generally clean up the
namespace declarations) after removing the type annotations.
* lxml.objectify gained its own SubElement() function as a copy of
etree.SubElement to avoid an otherwise redundant import of lxml.etree on the user side.
* Fixed the "descendant" bug in cssselect a second time
* Fixed parsing of some selectors in cssselect.
- Changes from version 2.3.1:
* New option kill_tags in lxml.html.clean to remove specific tags and their
content (i.e. their whole subtree).
* pi.get() and pi.attrib on processing instructions to parse
pseudo-attributes from the text content of processing instructions.
* lxml.get_include() returns a list of include paths that can be used to
compile external C code against lxml.etree.
* Resolver.resolve_file() takes an additional option close_file that
configures if the file(-like) object will be closed after reading or not.
* HTML cleaning didn't remove 'data:' links.
* The html5lib parser integration now uses the 'official' implementation in
html5lib itself, which makes it work with newer releases of the library.
* In lxml.sax, endElementNS() could incorrectly reject a plain tag name when
the corresponding start event inferred the same plain tag name to be in the default namespace.
* When an open file-like object is passed into parse() or iterparse(), the
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=38
Features added
* When looking for children, lxml.objectify takes '{}tag' as
meaning an empty namespace, as opposed to the parent namespace.
Bugs fixed
* When finished reading from a file-like object, the parser
immediately calls its close() method.
* When finished parsing, iterparse() immediately closes the input
file.
* Work-around for libxml2 bug that can leave the HTML parser in a
non-functional state after parsing a severly broken document (fixed
in libxml2 2.7.8).
* marque tag in HTML cleanup code is correctly named marquee.
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=30