* lxml.html.tostring() gained new serialisation options with_tail and doctype.
* Fixed a crash when using iterparse() for HTML parsing and requesting start events.
* Fixed parsing of more selectors in cssselect. Whitespace before
pseudo-elements and pseudo-classes is significant as it is a descendant
combinator. "E :pseudo" should parse the same as "E *:pseudo", not "E:pseudo".
* lxml.html.diff no longer raises an exception when hitting 'img' tags without 'src' attribute.
- Changes from version 2.3.2:
* lxml.objectify.deannotate() has a new boolean option cleanup_namespaces to
remove the objectify namespace declarations (and generally clean up the
namespace declarations) after removing the type annotations.
* lxml.objectify gained its own SubElement() function as a copy of
etree.SubElement to avoid an otherwise redundant import of lxml.etree on the user side.
* Fixed the "descendant" bug in cssselect a second time.
* Fixed parsing of some selectors in cssselect.
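The two objectify additions in this release can be sketched together as follows. This is a minimal example under the assumption that a default objectify setup is used; the element names are invented, and etree is imported here only to serialise the result for inspection.

```python
from lxml import etree, objectify

# Build a small tree; objectify.Element() annotates the root with py:pytype.
root = objectify.Element("root")

# objectify.SubElement() is used here instead of etree.SubElement().
child = objectify.SubElement(root, "child")

# Assigning a Python value adds a type annotation to the new element.
root.value = 42
annotated = etree.tostring(root, encoding="unicode")

# Remove the annotations and the namespace declarations they required.
objectify.deannotate(root, cleanup_namespaces=True)
cleaned = etree.tostring(root, encoding="unicode")
```

Comparing annotated with cleaned shows the effect: the first still carries the pytype attributes and the objectify namespace declarations, the second should not.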
- Changes from version 2.3.1:
* New option kill_tags in lxml.html.clean to remove specific tags and their
content (i.e. their whole subtree).
* New pi.get() and pi.attrib on processing instructions to parse
pseudo-attributes from their text content.
* lxml.get_include() returns a list of include paths that can be used to
compile external C code against lxml.etree.
* Resolver.resolve_file() takes an additional option close_file that
configures whether the file(-like) object will be closed after reading.
* Fixed HTML cleaning, which didn't remove 'data:' links.
* The html5lib parser integration now uses the 'official' implementation in
html5lib itself, which makes it work with newer releases of the library.
* In lxml.sax, endElementNS() could incorrectly reject a plain tag name when
the corresponding start event inferred the same plain tag name to be in the default namespace.
* When an open file-like object is passed into parse() or iterparse(), the
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=38
Features added
* When looking for children, lxml.objectify takes '{}tag' as
meaning an empty namespace, as opposed to the parent namespace.
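A small sketch of the '{}tag' lookup described above, assuming a document whose root sits in a default namespace while one child explicitly opts out of it. The namespace URI and element names are hypothetical.

```python
from lxml import objectify

# Two <item> children: one inherits the default namespace, one has none.
root = objectify.fromstring(
    '<root xmlns="http://example.com/ns">'
    '<item>namespaced</item>'
    '<item xmlns="">plain</item>'
    '</root>'
)

# Plain attribute access looks for the child in the parent's namespace.
namespaced = root.item.text

# '{}item' asks for a child in the empty namespace instead.
plain = getattr(root, "{}item").text
```

getattr() is needed because '{}item' is not a valid Python identifier.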
Bugs fixed
* When finished reading from a file-like object, the parser
immediately calls its close() method.
* When finished parsing, iterparse() immediately closes the input
file.
* Work-around for a libxml2 bug that can leave the HTML parser in a
non-functional state after parsing a severely broken document (fixed
in libxml2 2.7.8).
* The misspelled marque tag in the HTML cleanup code is now correctly named marquee.
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=30