- update to 2.11.2 (bsc#1224474, CVE-2024-1968):

* Redirects to non-HTTP protocols are no longer followed. Please, see the
  23j4-mw76-5v7h security advisory for more information. (:issue:`457`)
* The Authorization header is now dropped on redirects to a different
  scheme (http:// or https://) or port, even if the domain is the same.
  Please, see the 4qqq-9vqf-3h3f security advisory for more information.
* When using system proxy settings that are different for http:// and
  https://, redirects to a different URL scheme will now also trigger the
  corresponding change in proxy settings for the redirected request.
  Please, see the jm3v-qxmh-hxwv security advisory for more information.
  (:issue:`767`)
* :attr:`Spider.allowed_domains <scrapy.Spider.allowed_domains>` is now
  enforced for all requests, and not only requests from spider callbacks.
* :func:`~scrapy.utils.iterators.xmliter_lxml` no longer resolves XML
  entities.
* defusedxml is now used to make
  :class:`scrapy.http.request.rpc.XmlRpcRequest` more secure.
* Restored support for brotlipy_, which had been dropped in Scrapy 2.11.1
  in favor of brotli. (:issue:`6261`) Note brotlipy is deprecated, both in
  Scrapy and upstream. Use brotli instead if you can.
* Make :setting:`METAREFRESH_IGNORE_TAGS` ["noscript"] by default. This
  prevents
  :class:`~scrapy.downloadermiddlewares.redirect.MetaRefreshMiddleware`
  from following redirects that would not be followed by web browsers with
  JavaScript enabled.

OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-Scrapy?expand=0&rev=41
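
The first two redirect rules in the message above can be illustrated with a small standard-library sketch. This is not Scrapy's implementation; the helper names `origin`, `should_follow` and `keeps_authorization` are hypothetical and only restate the rules as the changelog words them: redirects are followed only to http:// or https:// targets, and the Authorization header survives only when scheme, host and port all match.

```python
from urllib.parse import urlsplit

# Default ports, so http://example.com and http://example.com:80 compare equal.
DEFAULT_PORTS = {"http": 80, "https": 443}


def origin(url):
    """Return (scheme, host, port) for a URL, filling in the default port."""
    parts = urlsplit(url)
    scheme = parts.scheme.lower()
    port = parts.port or DEFAULT_PORTS.get(scheme)
    return scheme, (parts.hostname or "").lower(), port


def should_follow(redirect_url):
    """Hypothetical helper: only http:// and https:// targets are followed."""
    return urlsplit(redirect_url).scheme.lower() in DEFAULT_PORTS


def keeps_authorization(source_url, redirect_url):
    """Hypothetical helper: keep the header only if scheme, host and port match."""
    return origin(source_url) == origin(redirect_url)
```

Under these assumptions, a redirect from http://example.com/a to https://example.com/a would lose the Authorization header even though the domain is unchanged, and a redirect to an ftp:// URL would not be followed at all.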
2024-07-11 10:53:38 +00:00
parent 04481ebc46
commit add99b967c
4 changed files with 76 additions and 7 deletions
--- a/Scrapy-2.11.1.tar.gz
+++ b/Scrapy-2.11.1.tar.gz
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:733a039c7423e52b69bf2810b5332093d4e42a848460359c07b02ecff8f73ebe
-size 1176726
--- a/python-Scrapy.changes
+++ b/python-Scrapy.changes
@@ -1,3 +1,69 @@
+-------------------------------------------------------------------
+Thu Jul 11 10:38:36 UTC 2024 - Dirk Müller <dmueller@suse.com>
+
+- update to 2.11.2 (bsc#1224474, CVE-2024-1968):
+  * Redirects to non-HTTP protocols are no longer followed.
+    Please, see the 23j4-mw76-5v7h security advisory for more
+    information. (:issue:`457`)
+  * The Authorization header is now dropped on redirects to a
+    different scheme (http:// or https://) or port, even if the
+    domain is the same. Please, see the 4qqq-9vqf-3h3f security
+    advisory for more information.
+  * When using system proxy settings that are different for
+    http:// and https://, redirects to a different URL scheme
+    will now also trigger the corresponding change in proxy
+    settings for the redirected request. Please, see the
+    jm3v-qxmh-hxwv security advisory for more information.
+    (:issue:`767`)
+  * :attr:`Spider.allowed_domains
+    <scrapy.Spider.allowed_domains>` is now enforced for all
+    requests, and not only requests from spider callbacks.
+  * :func:`~scrapy.utils.iterators.xmliter_lxml` no longer
+    resolves XML entities.
+  * defusedxml is now used to make
+    :class:`scrapy.http.request.rpc.XmlRpcRequest` more secure.
+  * Restored support for brotlipy_, which had been dropped in
+    Scrapy 2.11.1 in favor of brotli. (:issue:`6261`)  Note
+    brotlipy is deprecated, both in Scrapy and upstream. Use
+    brotli instead if you can.
+  * Make :setting:`METAREFRESH_IGNORE_TAGS` ["noscript"] by
+    default. This prevents :class:`~scrapy.downloadermiddlewares.
+    redirect.MetaRefreshMiddleware` from following redirects that
+    would not be followed by web browsers with JavaScript
+    enabled.
+  * During :ref:`feed export <topics-feed-exports>`, do not close
+    the underlying file from :ref:`built-in post-processing
+    plugins <builtin-plugins>`.
+  * :class:`LinkExtractor
+    <scrapy.linkextractors.lxmlhtml.LxmlLinkExtractor>` now
+    properly applies the unique and canonicalize parameters.
+  * Do not initialize the scheduler disk queue if
+    :setting:`JOBDIR` is an empty string.
+  * Fix :attr:`Spider.logger <scrapy.Spider.logger>` not logging
+    custom extra information.
+  * robots.txt files with a non-UTF-8 encoding no longer prevent
+    parsing the UTF-8-compatible (e.g. ASCII) parts of the
+    document.
+  * :meth:`scrapy.http.cookies.WrappedRequest.get_header` no
+    longer raises an exception if default is None.
+    :func:`scrapy.utils.response.get_base_url` to determine the
+    base URL of a given :class:`~scrapy.http.Response`.
+  * :class:`~scrapy.selector.Selector` now uses
+    :func:`scrapy.utils.response.get_base_url` to determine the
+    base URL of a given :class:`~scrapy.http.Response`.
+    (:issue:`6265`)
+  * The :meth:`media_to_download` method of :ref:`media pipelines
+    <topics-media-pipeline>` now logs exceptions before stripping
+    them.
+  * When passing a callback to the :command:`parse` command,
+    build the callback callable with the right signature.
+  * Add a FAQ entry about :ref:`creating blank requests <faq-
+    blank-request>`.
+  * Document that :attr:`scrapy.selector.Selector.type` can be
+    "json".
+  * Make builds reproducible.
+  * Packaging and test fixes
+
 -------------------------------------------------------------------
 Mon Mar 25 14:12:20 UTC 2024 - Dirk Müller <dmueller@suse.com>

--- a/python-Scrapy.spec
+++ b/python-Scrapy.spec
@@ -18,15 +18,16 @@

 %{?sle15_python_module_pythons}
 Name:           python-Scrapy
-Version:        2.11.1
+Version:        2.11.2
 Release:        0
 Summary:        A high-level Python Screen Scraping framework
 License:        BSD-3-Clause
 Group:          Development/Languages/Python
 URL:            https://scrapy.org
-Source:         https://files.pythonhosted.org/packages/source/S/Scrapy/Scrapy-%{version}.tar.gz
+Source:         https://files.pythonhosted.org/packages/source/S/Scrapy/scrapy-%{version}.tar.gz
+BuildRequires:  %{python_module Brotli}
 BuildRequires:  %{python_module Pillow}
-BuildRequires:  %{python_module Protego >= 0.1.15}
+BuildRequires:  %{python_module Protego}
 BuildRequires:  %{python_module PyDispatcher >= 2.0.5}
 BuildRequires:  %{python_module Twisted >= 18.9.0}
 BuildRequires:  %{python_module attrs}
@@ -35,6 +36,7 @@ BuildRequires:  %{python_module botocore >= 1.4.87}
 BuildRequires:  %{python_module cryptography >= 36.0.0}
 BuildRequires:  %{python_module cssselect >= 0.9.1}
 BuildRequires:  %{python_module dbm}
+BuildRequires:  %{python_module defusedxml >= 0.7.1}
 BuildRequires:  %{python_module itemadapter >= 0.1.0}
 BuildRequires:  %{python_module itemloaders >= 1.0.1}
 BuildRequires:  %{python_module lxml >= 4.4.1}
@@ -63,6 +65,7 @@ Requires:       python-PyDispatcher >= 2.0.5
 Requires:       python-Twisted >= 18.9.0
 Requires:       python-cryptography >= 36.0.0
 Requires:       python-cssselect >= 0.9.1
+Requires:       python-defusedxml >= 0.7.1
 Requires:       python-itemadapter >= 0.1.0
 Requires:       python-itemloaders >= 1.0.1
 Requires:       python-lxml >= 4.4.1
@@ -93,7 +96,7 @@ Group:          Documentation/HTML
 Provides documentation for %{name}.

 %prep
-%autosetup -p1 -n Scrapy-%{version}
+%autosetup -p1 -n scrapy-%{version}

 sed -i -e 's:= python:= python3:g' docs/Makefile

--- a/scrapy-2.11.2.tar.gz
+++ b/scrapy-2.11.2.tar.gz
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:dfbd565384fc3fffeba121f5a3a2d0899ac1f756d41432ca0879933fbfb3401d
+size 1187710
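
For projects affected by the new METAREFRESH_IGNORE_TAGS default described in the changelog entry above, a minimal settings sketch might look like the following. The settings.py file belongs to a hypothetical Scrapy project, and the commented-out override assumes that an empty list corresponds to the behaviour before 2.11.2.

```python
# settings.py of a hypothetical Scrapy project

# Default as of Scrapy 2.11.2, per the changelog: meta refresh tags found
# inside <noscript> are ignored by MetaRefreshMiddleware.
METAREFRESH_IGNORE_TAGS = ["noscript"]

# Assumption: an empty list makes the middleware follow those redirects again.
# METAREFRESH_IGNORE_TAGS = []
```

Stating the value explicitly is optional once 2.11.2 is installed; it only documents the behaviour the project relies on.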