14
0
forked from pool/python-lxml
Files
python-lxml/python-lxml.spec

135 lines
4.3 KiB
RPMSpec
Raw Normal View History

#
# spec file for package python-lxml
#
# Copyright (c) 2025 SUSE LLC
#
# All modifications and additions to the file contributed by third parties
# remain the property of their copyright owners, unless otherwise agreed
# upon. The license for this file, and modifications and additions to the
# file, is the same license as for the pristine package itself (unless the
# license for the pristine package is not an Open Source License, in which
# case the license is the MIT License). An "Open Source License" is a
# license that conforms to the Open Source Definition (Version 1.9)
# published by the Open Source Initiative.
# Please submit bugfixes or comments via https://bugs.opensuse.org/
#
%{?sle15_python_module_pythons}
Name: python-lxml
- Update to 6.0.0: * ``lxml.html.diff`` is faster and provides structurally better diffs. * The factories ``Element`` and ``ElementTree`` can now be used in type hints. * Parsing from ``memoryview`` and other buffers is supported to allow zero-copy parsing. * ``lxml.html.builder`` was missing several HTML5 tag names. * ``CDATA`` can now be written into the incremental ``xmlfile()`` writer. * A new parser option ``decompress=False`` was added that controls the automatic input decompression when using libxml2 2.15.0 or later. * The set of compile time / runtime supported libxml2 feature names is available as ``etree.LIBXML_COMPILED_FEATURES`` and ``etree.LIBXML_FEATURES``. * Predicates in ``.find*()`` could mishandle tag indices if a default namespace is provided. * The ``head`` and ``body`` properties of ``lxml.html`` elements failed if no such element was found. They now return ``None`` instead. * Tag names provided by code (API, not data) that are longer than ``INT_MAX`` could be truncated or mishandled in other ways. * ``.text_content()`` on ``lxml.html`` elements accidentally returned a "smart string" without additional information. It now returns a plain string. * Support for Python < 3.8 was removed. * Parsing directly from zlib (or lzma) compressed data is now considered an optional feature in lxml. * The ``Schematron`` class is deprecated and will become non-functional in a future lxml version. * Built using Cython 3.1.2. * The debug methods ``MemDebug.dump()`` and ``MemDebug.show()`` were removed completely. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=209
2025-07-22 06:13:45 +00:00
Version: 6.0.0
- Update to version 2.3.3: * lxml.html.tostring() gained new serialisation options with_tail and doctype. * Fixed a crash when using iterparse() for HTML parsing and requesting start events. * Fixed parsing of more selectors in cssselect. Whitespace before pseudo-elements and pseudo-classes is significant as it is a descendant combinator. "E :pseudo" should parse the same as "E *:pseudo", not "E:pseudo". * lxml.html.diff no longer raises an exception when hitting 'img' tags without 'src' attribute. - Changes from version 2.3.2: * lxml.objectify.deannotate() has a new boolean option cleanup_namespaces to remove the objectify namespace declarations (and generally clean up the namespace declarations) after removing the type annotations. * lxml.objectify gained its own SubElement() function as a copy of etree.SubElement to avoid an otherwise redundant import of lxml.etree on the user side. * Fixed the "descendant" bug in cssselect a second time * Fixed parsing of some selectors in cssselect. - Changes from version 2.3.1: * New option kill_tags in lxml.html.clean to remove specific tags and their content (i.e. their whole subtree). * pi.get() and pi.attrib on processing instructions to parse pseudo-attributes from the text content of processing instructions. * lxml.get_include() returns a list of include paths that can be used to compile external C code against lxml.etree. * Resolver.resolve_file() takes an additional option close_file that configures if the file(-like) object will be closed after reading or not. * HTML cleaning didn't remove 'data:' links. * The html5lib parser integration now uses the 'official' implementation in html5lib itself, which makes it work with newer releases of the library. * In lxml.sax, endElementNS() could incorrectly reject a plain tag name when the corresponding start event inferred the same plain tag name to be in the default namespace. * When an open file-like object is passed into parse() or iterparse(), the OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=38
2012-03-09 21:29:46 +00:00
Release: 0
Summary: Pythonic XML processing library
License: BSD-3-Clause AND GPL-2.0-or-later
URL: https://lxml.de/
Accepting request 720214 from home:pgajdos - version update to 4.4.0 * ``Element.clear()`` accepts a new keyword argument ``keep_tail=True`` to clear everything but the tail text. This is helpful in some document-style use cases. * When creating attributes or namespaces from a dict in Python 3.6+, lxml now preserves the original insertion order of that dict, instead of always sorting the items by name. A similar change was made for ElementTree in CPython 3.8. See https://bugs.python.org/issue34160 * Integer elements in ``lxml.objectify`` implement the ``__index__()`` special method. * GH#269: Read-only elements in XSLT were missing the ``nsmap`` property. Original patch by Jan Pazdziora. * ElementInclude can now restrict the maximum inclusion depth via a ``max_depth`` argument to prevent content explosion. It is limited to 6 by default. * The ``target`` object of the XMLParser can have ``start_ns()`` and ``end_ns()`` callback methods to listen to namespace declarations. * The ``TreeBuilder`` has new arguments ``comment_factory`` and ``pi_factory`` to pass factories for creating comments and processing instructions, as well as flag arguments ``insert_comments`` and ``insert_pis`` to discard them from the tree when set to false. * A `C14N 2.0 <https://www.w3.org/TR/xml-c14n2/>`_ implementation was added as ``etree.canonicalize()``, a corresponding ``C14NWriterTarget`` class, and a ``c14n2`` serialisation method. * bugfixes, see CHANGES.txt - deleted sources - lxmldoc-4.3.3.pdf (renamed) - added sources + lxmldoc-4.4.0.pdf + world.txt OBS-URL: https://build.opensuse.org/request/show/720214 OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=127
2019-08-01 13:54:01 +00:00
Source0: https://files.pythonhosted.org/packages/source/l/lxml/lxml-%{version}.tar.gz
Source1: https://lxml.de/lxmldoc-4.5.2.pdf
Source99: python-lxml.rpmlintrc
# PATCH-FIX-OPENSUSE Skip a test under libxml2 2.10.4+
# https://bugs.launchpad.net/lxml/+bug/2016939
Patch1: skip-test-under-libxml2-2.10.4.patch
BuildRequires: %{python_module Cython >= 3.0.7}
- Update to 6.0.0: * ``lxml.html.diff`` is faster and provides structurally better diffs. * The factories ``Element`` and ``ElementTree`` can now be used in type hints. * Parsing from ``memoryview`` and other buffers is supported to allow zero-copy parsing. * ``lxml.html.builder`` was missing several HTML5 tag names. * ``CDATA`` can now be written into the incremental ``xmlfile()`` writer. * A new parser option ``decompress=False`` was added that controls the automatic input decompression when using libxml2 2.15.0 or later. * The set of compile time / runtime supported libxml2 feature names is available as ``etree.LIBXML_COMPILED_FEATURES`` and ``etree.LIBXML_FEATURES``. * Predicates in ``.find*()`` could mishandle tag indices if a default namespace is provided. * The ``head`` and ``body`` properties of ``lxml.html`` elements failed if no such element was found. They now return ``None`` instead. * Tag names provided by code (API, not data) that are longer than ``INT_MAX`` could be truncated or mishandled in other ways. * ``.text_content()`` on ``lxml.html`` elements accidentally returned a "smart string" without additional information. It now returns a plain string. * Support for Python < 3.8 was removed. * Parsing directly from zlib (or lzma) compressed data is now considered an optional feature in lxml. * The ``Schematron`` class is deprecated and will become non-functional in a future lxml version. * Built using Cython 3.1.2. * The debug methods ``MemDebug.dump()`` and ``MemDebug.show()`` were removed completely. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=209
2025-07-22 06:13:45 +00:00
BuildRequires: %{python_module base >= 3.9}
BuildRequires: %{python_module cssselect >= 0.9.1}
- Update to 6.0.0: * ``lxml.html.diff`` is faster and provides structurally better diffs. * The factories ``Element`` and ``ElementTree`` can now be used in type hints. * Parsing from ``memoryview`` and other buffers is supported to allow zero-copy parsing. * ``lxml.html.builder`` was missing several HTML5 tag names. * ``CDATA`` can now be written into the incremental ``xmlfile()`` writer. * A new parser option ``decompress=False`` was added that controls the automatic input decompression when using libxml2 2.15.0 or later. * The set of compile time / runtime supported libxml2 feature names is available as ``etree.LIBXML_COMPILED_FEATURES`` and ``etree.LIBXML_FEATURES``. * Predicates in ``.find*()`` could mishandle tag indices if a default namespace is provided. * The ``head`` and ``body`` properties of ``lxml.html`` elements failed if no such element was found. They now return ``None`` instead. * Tag names provided by code (API, not data) that are longer than ``INT_MAX`` could be truncated or mishandled in other ways. * ``.text_content()`` on ``lxml.html`` elements accidentally returned a "smart string" without additional information. It now returns a plain string. * Support for Python < 3.8 was removed. * Parsing directly from zlib (or lzma) compressed data is now considered an optional feature in lxml. * The ``Schematron`` class is deprecated and will become non-functional in a future lxml version. * Built using Cython 3.1.2. * The debug methods ``MemDebug.dump()`` and ``MemDebug.show()`` were removed completely. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=209
2025-07-22 06:13:45 +00:00
BuildRequires: %{python_module pip}
BuildRequires: %{python_module setuptools >= 18.0.1}
- Update to 6.0.0: * ``lxml.html.diff`` is faster and provides structurally better diffs. * The factories ``Element`` and ``ElementTree`` can now be used in type hints. * Parsing from ``memoryview`` and other buffers is supported to allow zero-copy parsing. * ``lxml.html.builder`` was missing several HTML5 tag names. * ``CDATA`` can now be written into the incremental ``xmlfile()`` writer. * A new parser option ``decompress=False`` was added that controls the automatic input decompression when using libxml2 2.15.0 or later. * The set of compile time / runtime supported libxml2 feature names is available as ``etree.LIBXML_COMPILED_FEATURES`` and ``etree.LIBXML_FEATURES``. * Predicates in ``.find*()`` could mishandle tag indices if a default namespace is provided. * The ``head`` and ``body`` properties of ``lxml.html`` elements failed if no such element was found. They now return ``None`` instead. * Tag names provided by code (API, not data) that are longer than ``INT_MAX`` could be truncated or mishandled in other ways. * ``.text_content()`` on ``lxml.html`` elements accidentally returned a "smart string" without additional information. It now returns a plain string. * Support for Python < 3.8 was removed. * Parsing directly from zlib (or lzma) compressed data is now considered an optional feature in lxml. * The ``Schematron`` class is deprecated and will become non-functional in a future lxml version. * Built using Cython 3.1.2. * The debug methods ``MemDebug.dump()`` and ``MemDebug.show()`` were removed completely. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=209
2025-07-22 06:13:45 +00:00
BuildRequires: %{python_module wheel}
BuildRequires: fdupes
%if 0%{?suse_version} == 1500
# Assume the best that the old libxml2 in SLE15 is patched for https://gitlab.gnome.org/GNOME/libxml2/-/issues/378 (CVE-2022-2309)
BuildRequires: libxml2-devel
%else
BuildRequires: libxml2-devel >= 2.10.2
%endif
BuildRequires: libxslt-devel >= 1.1.27
BuildRequires: python-rpm-macros
Requires: python-cssselect >= 0.9.1
%python_subpackages
%description
lxml is a Pythonic binding for the libxml2 and libxslt libraries. It
provides convenient access to these libraries using the ElementTree
API. It extends the ElementTree API significantly to offer support for XPath,
RelaxNG, XML Schema, XSLT and C14N.
%if 0%{?suse_version} > 1500
%package -n %{name}-doc
Summary: Documentation for python-lxml, an XML processing library
Group: Documentation/Other
BuildArch: noarch
%description -n %{name}-doc
lxml is a Pythonic binding for the libxml2 and libxslt libraries. It
provides convenient access to these libraries using the ElementTree
API. It extends the ElementTree API significantly to offer support for XPath,
RelaxNG, XML Schema, XSLT and C14N.
This package contains documentation for lxml (HTML and PDF).
%endif
%package devel
Summary: Development files for python-lxml
Group: Development/Libraries/Python
Requires: %{name} = %{version}
%description devel
lxml is a Pythonic binding for the libxml2 and libxslt libraries. It
provides convenient access to these libraries using the ElementTree
API. It extends the ElementTree API significantly to offer support for XPath,
RelaxNG, XML Schema, XSLT and C14N.
This package contains header files needed to use lxml's C API.
%prep
%autosetup -p1 -n lxml-%{version}
cp %{SOURCE1} .
# remove generated files
find -name '*.c' -delete -print
rm src/lxml/lxml.etree.h
rm src/lxml/lxml.etree_api.h
%build
export CFLAGS="%{optflags}"
- Update to 6.0.0: * ``lxml.html.diff`` is faster and provides structurally better diffs. * The factories ``Element`` and ``ElementTree`` can now be used in type hints. * Parsing from ``memoryview`` and other buffers is supported to allow zero-copy parsing. * ``lxml.html.builder`` was missing several HTML5 tag names. * ``CDATA`` can now be written into the incremental ``xmlfile()`` writer. * A new parser option ``decompress=False`` was added that controls the automatic input decompression when using libxml2 2.15.0 or later. * The set of compile time / runtime supported libxml2 feature names is available as ``etree.LIBXML_COMPILED_FEATURES`` and ``etree.LIBXML_FEATURES``. * Predicates in ``.find*()`` could mishandle tag indices if a default namespace is provided. * The ``head`` and ``body`` properties of ``lxml.html`` elements failed if no such element was found. They now return ``None`` instead. * Tag names provided by code (API, not data) that are longer than ``INT_MAX`` could be truncated or mishandled in other ways. * ``.text_content()`` on ``lxml.html`` elements accidentally returned a "smart string" without additional information. It now returns a plain string. * Support for Python < 3.8 was removed. * Parsing directly from zlib (or lzma) compressed data is now considered an optional feature in lxml. * The ``Schematron`` class is deprecated and will become non-functional in a future lxml version. * Built using Cython 3.1.2. * The debug methods ``MemDebug.dump()`` and ``MemDebug.show()`` were removed completely. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=209
2025-07-22 06:13:45 +00:00
%pyproject_wheel
# The testsuite relies on the inplace build
%python_expand PYTHON=$python make inplace
%install
%pyproject_install
%python_expand %fdupes %{buildroot}
%check
# The tests fail on SLE 11 due to broken incremental parsing in libxml2
export CFLAGS="%{optflags}"
export LANG=en_US.UTF-8
export PYTHONUNBUFFERED=x
- Update to 6.0.0: * ``lxml.html.diff`` is faster and provides structurally better diffs. * The factories ``Element`` and ``ElementTree`` can now be used in type hints. * Parsing from ``memoryview`` and other buffers is supported to allow zero-copy parsing. * ``lxml.html.builder`` was missing several HTML5 tag names. * ``CDATA`` can now be written into the incremental ``xmlfile()`` writer. * A new parser option ``decompress=False`` was added that controls the automatic input decompression when using libxml2 2.15.0 or later. * The set of compile time / runtime supported libxml2 feature names is available as ``etree.LIBXML_COMPILED_FEATURES`` and ``etree.LIBXML_FEATURES``. * Predicates in ``.find*()`` could mishandle tag indices if a default namespace is provided. * The ``head`` and ``body`` properties of ``lxml.html`` elements failed if no such element was found. They now return ``None`` instead. * Tag names provided by code (API, not data) that are longer than ``INT_MAX`` could be truncated or mishandled in other ways. * ``.text_content()`` on ``lxml.html`` elements accidentally returned a "smart string" without additional information. It now returns a plain string. * Support for Python < 3.8 was removed. * Parsing directly from zlib (or lzma) compressed data is now considered an optional feature in lxml. * The ``Schematron`` class is deprecated and will become non-functional in a future lxml version. * Built using Cython 3.1.2. * The debug methods ``MemDebug.dump()`` and ``MemDebug.show()`` were removed completely. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=209
2025-07-22 06:13:45 +00:00
export PYTHONDONTWRITEBYTECODE=1
# cyclic dependency between html5lib and lxml
rm -v src/lxml/html/tests/test_html5parser.py
%python_exec test.py
%files %{python_files}
%license LICENSES.txt
%doc CHANGES.txt CREDITS.txt README.rst
%{python_sitearch}/lxml/
- Update to 6.0.0: * ``lxml.html.diff`` is faster and provides structurally better diffs. * The factories ``Element`` and ``ElementTree`` can now be used in type hints. * Parsing from ``memoryview`` and other buffers is supported to allow zero-copy parsing. * ``lxml.html.builder`` was missing several HTML5 tag names. * ``CDATA`` can now be written into the incremental ``xmlfile()`` writer. * A new parser option ``decompress=False`` was added that controls the automatic input decompression when using libxml2 2.15.0 or later. * The set of compile time / runtime supported libxml2 feature names is available as ``etree.LIBXML_COMPILED_FEATURES`` and ``etree.LIBXML_FEATURES``. * Predicates in ``.find*()`` could mishandle tag indices if a default namespace is provided. * The ``head`` and ``body`` properties of ``lxml.html`` elements failed if no such element was found. They now return ``None`` instead. * Tag names provided by code (API, not data) that are longer than ``INT_MAX`` could be truncated or mishandled in other ways. * ``.text_content()`` on ``lxml.html`` elements accidentally returned a "smart string" without additional information. It now returns a plain string. * Support for Python < 3.8 was removed. * Parsing directly from zlib (or lzma) compressed data is now considered an optional feature in lxml. * The ``Schematron`` class is deprecated and will become non-functional in a future lxml version. * Built using Cython 3.1.2. * The debug methods ``MemDebug.dump()`` and ``MemDebug.show()`` were removed completely. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=209
2025-07-22 06:13:45 +00:00
%{python_sitearch}/lxml-%{version}.dist-info
%exclude %{python_sitearch}/lxml/*.h
%exclude %{python_sitearch}/lxml/includes/*.h
- Update to version 2.3.3: * lxml.html.tostring() gained new serialisation options with_tail and doctype. * Fixed a crash when using iterparse() for HTML parsing and requesting start events. * Fixed parsing of more selectors in cssselect. Whitespace before pseudo-elements and pseudo-classes is significant as it is a descendant combinator. "E :pseudo" should parse the same as "E *:pseudo", not "E:pseudo". * lxml.html.diff no longer raises an exception when hitting 'img' tags without 'src' attribute. - Changes from version 2.3.2: * lxml.objectify.deannotate() has a new boolean option cleanup_namespaces to remove the objectify namespace declarations (and generally clean up the namespace declarations) after removing the type annotations. * lxml.objectify gained its own SubElement() function as a copy of etree.SubElement to avoid an otherwise redundant import of lxml.etree on the user side. * Fixed the "descendant" bug in cssselect a second time * Fixed parsing of some selectors in cssselect. - Changes from version 2.3.1: * New option kill_tags in lxml.html.clean to remove specific tags and their content (i.e. their whole subtree). * pi.get() and pi.attrib on processing instructions to parse pseudo-attributes from the text content of processing instructions. * lxml.get_include() returns a list of include paths that can be used to compile external C code against lxml.etree. * Resolver.resolve_file() takes an additional option close_file that configures if the file(-like) object will be closed after reading or not. * HTML cleaning didn't remove 'data:' links. * The html5lib parser integration now uses the 'official' implementation in html5lib itself, which makes it work with newer releases of the library. * In lxml.sax, endElementNS() could incorrectly reject a plain tag name when the corresponding start event inferred the same plain tag name to be in the default namespace. * When an open file-like object is passed into parse() or iterparse(), the OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-lxml?expand=0&rev=38
2012-03-09 21:29:46 +00:00
%if 0%{?suse_version} > 1500
%files -n %{name}-doc
%license LICENSES.txt
%endif
%doc doc/html
%doc lxmldoc-*.pdf
%files %{python_files devel}
%license LICENSES.txt
%{python_sitearch}/lxml/*.h
%{python_sitearch}/lxml/includes/*.h
%changelog