forked from pool/python-Scrapy
- update to 2.11.2 (bsc#1224474, CVE-2024-1968):
* Redirects to non-HTTP protocols are no longer followed. Please, see the 23j4-mw76-5v7h security advisory for more information. (:issue:`457`) * The Authorization header is now dropped on redirects to a different scheme (http:// or https://) or port, even if the domain is the same. Please, see the 4qqq-9vqf-3h3f security advisory for more information. * When using system proxy settings that are different for http:// and https://, redirects to a different URL scheme will now also trigger the corresponding change in proxy settings for the redirected request. Please, see the jm3v-qxmh-hxwv security advisory for more information. (:issue:`767`) * :attr:`Spider.allowed_domains <scrapy.Spider.allowed_domains>` is now enforced for all requests, and not only requests from spider callbacks. * :func:`~scrapy.utils.iterators.xmliter_lxml` no longer resolves XML entities. * defusedxml is now used to make :class:`scrapy.http.request.rpc.XmlRpcRequest` more secure. * Restored support for brotlipy_, which had been dropped in Scrapy 2.11.1 in favor of brotli. (:issue:`6261`) Note brotlipy is deprecated, both in Scrapy and upstream. Use brotli instead if you can. * Make :setting:`METAREFRESH_IGNORE_TAGS` ["noscript"] by default. This prevents :class:`~scrapy.downloadermiddlewares. redirect.MetaRefreshMiddleware` from following redirects that would not be followed by web browsers with JavaScript enabled. OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-Scrapy?expand=0&rev=41
This commit is contained in:
@@ -1,3 +0,0 @@
|
|||||||
version https://git-lfs.github.com/spec/v1
|
|
||||||
oid sha256:733a039c7423e52b69bf2810b5332093d4e42a848460359c07b02ecff8f73ebe
|
|
||||||
size 1176726
|
|
@@ -1,3 +1,69 @@
|
|||||||
|
-------------------------------------------------------------------
|
||||||
|
Thu Jul 11 10:38:36 UTC 2024 - Dirk Müller <dmueller@suse.com>
|
||||||
|
|
||||||
|
- update to 2.11.2 (bsc#1224474, CVE-2024-1968):
|
||||||
|
* Redirects to non-HTTP protocols are no longer followed.
|
||||||
|
Please, see the 23j4-mw76-5v7h security advisory for more
|
||||||
|
information. (:issue:`457`)
|
||||||
|
* The Authorization header is now dropped on redirects to a
|
||||||
|
different scheme (http:// or https://) or port, even if the
|
||||||
|
domain is the same. Please, see the 4qqq-9vqf-3h3f security
|
||||||
|
advisory for more information.
|
||||||
|
* When using system proxy settings that are different for
|
||||||
|
http:// and https://, redirects to a different URL scheme
|
||||||
|
will now also trigger the corresponding change in proxy
|
||||||
|
settings for the redirected request. Please, see the
|
||||||
|
jm3v-qxmh-hxwv security advisory for more information.
|
||||||
|
(:issue:`767`)
|
||||||
|
* :attr:`Spider.allowed_domains
|
||||||
|
<scrapy.Spider.allowed_domains>` is now enforced for all
|
||||||
|
requests, and not only requests from spider callbacks.
|
||||||
|
* :func:`~scrapy.utils.iterators.xmliter_lxml` no longer
|
||||||
|
resolves XML entities.
|
||||||
|
* defusedxml is now used to make
|
||||||
|
:class:`scrapy.http.request.rpc.XmlRpcRequest` more secure.
|
||||||
|
* Restored support for brotlipy_, which had been dropped in
|
||||||
|
Scrapy 2.11.1 in favor of brotli. (:issue:`6261`) Note
|
||||||
|
brotlipy is deprecated, both in Scrapy and upstream. Use
|
||||||
|
brotli instead if you can.
|
||||||
|
* Make :setting:`METAREFRESH_IGNORE_TAGS` ["noscript"] by
|
||||||
|
default. This prevents :class:`~scrapy.downloadermiddlewares.
|
||||||
|
redirect.MetaRefreshMiddleware` from following redirects that
|
||||||
|
would not be followed by web browsers with JavaScript
|
||||||
|
enabled.
|
||||||
|
* During :ref:`feed export <topics-feed-exports>`, do not close
|
||||||
|
the underlying file from :ref:`built-in post-processing
|
||||||
|
plugins <builtin-plugins>`.
|
||||||
|
* :class:`LinkExtractor
|
||||||
|
<scrapy.linkextractors.lxmlhtml.LxmlLinkExtractor>` now
|
||||||
|
properly applies the unique and canonicalize parameters.
|
||||||
|
* Do not initialize the scheduler disk queue if
|
||||||
|
:setting:`JOBDIR` is an empty string.
|
||||||
|
* Fix :attr:`Spider.logger <scrapy.Spider.logger>` not logging
|
||||||
|
custom extra information.
|
||||||
|
* robots.txt files with a non-UTF-8 encoding no longer prevent
|
||||||
|
parsing the UTF-8-compatible (e.g. ASCII) parts of the
|
||||||
|
document.
|
||||||
|
* :meth:`scrapy.http.cookies.WrappedRequest.get_header` no
|
||||||
|
longer raises an exception if default is None.
|
||||||
|
:func:`scrapy.utils.response.get_base_url` to determine the
|
||||||
|
base URL of a given :class:`~scrapy.http.Response`.
|
||||||
|
* :class:`~scrapy.selector.Selector` now uses
|
||||||
|
:func:`scrapy.utils.response.get_base_url` to determine the
|
||||||
|
base URL of a given :class:`~scrapy.http.Response`.
|
||||||
|
(:issue:`6265`)
|
||||||
|
* The :meth:`media_to_download` method of :ref:`media pipelines
|
||||||
|
<topics-media-pipeline>` now logs exceptions before stripping
|
||||||
|
them.
|
||||||
|
* When passing a callback to the :command:`parse` command,
|
||||||
|
build the callback callable with the right signature.
|
||||||
|
* Add a FAQ entry about :ref:`creating blank requests <faq-
|
||||||
|
blank-request>`.
|
||||||
|
* Document that :attr:`scrapy.selector.Selector.type` can be
|
||||||
|
"json".
|
||||||
|
* Make builds reproducible.
|
||||||
|
* Packaging and test fixes
|
||||||
|
|
||||||
-------------------------------------------------------------------
|
-------------------------------------------------------------------
|
||||||
Mon Mar 25 14:12:20 UTC 2024 - Dirk Müller <dmueller@suse.com>
|
Mon Mar 25 14:12:20 UTC 2024 - Dirk Müller <dmueller@suse.com>
|
||||||
|
|
||||||
|
@@ -18,15 +18,16 @@
|
|||||||
|
|
||||||
%{?sle15_python_module_pythons}
|
%{?sle15_python_module_pythons}
|
||||||
Name: python-Scrapy
|
Name: python-Scrapy
|
||||||
Version: 2.11.1
|
Version: 2.11.2
|
||||||
Release: 0
|
Release: 0
|
||||||
Summary: A high-level Python Screen Scraping framework
|
Summary: A high-level Python Screen Scraping framework
|
||||||
License: BSD-3-Clause
|
License: BSD-3-Clause
|
||||||
Group: Development/Languages/Python
|
Group: Development/Languages/Python
|
||||||
URL: https://scrapy.org
|
URL: https://scrapy.org
|
||||||
Source: https://files.pythonhosted.org/packages/source/S/Scrapy/Scrapy-%{version}.tar.gz
|
Source: https://files.pythonhosted.org/packages/source/S/Scrapy/scrapy-%{version}.tar.gz
|
||||||
|
BuildRequires: %{python_module Brotli}
|
||||||
BuildRequires: %{python_module Pillow}
|
BuildRequires: %{python_module Pillow}
|
||||||
BuildRequires: %{python_module Protego >= 0.1.15}
|
BuildRequires: %{python_module Protego}
|
||||||
BuildRequires: %{python_module PyDispatcher >= 2.0.5}
|
BuildRequires: %{python_module PyDispatcher >= 2.0.5}
|
||||||
BuildRequires: %{python_module Twisted >= 18.9.0}
|
BuildRequires: %{python_module Twisted >= 18.9.0}
|
||||||
BuildRequires: %{python_module attrs}
|
BuildRequires: %{python_module attrs}
|
||||||
@@ -35,6 +36,7 @@ BuildRequires: %{python_module botocore >= 1.4.87}
|
|||||||
BuildRequires: %{python_module cryptography >= 36.0.0}
|
BuildRequires: %{python_module cryptography >= 36.0.0}
|
||||||
BuildRequires: %{python_module cssselect >= 0.9.1}
|
BuildRequires: %{python_module cssselect >= 0.9.1}
|
||||||
BuildRequires: %{python_module dbm}
|
BuildRequires: %{python_module dbm}
|
||||||
|
BuildRequires: %{python_module defusedxml >= 0.7.1}
|
||||||
BuildRequires: %{python_module itemadapter >= 0.1.0}
|
BuildRequires: %{python_module itemadapter >= 0.1.0}
|
||||||
BuildRequires: %{python_module itemloaders >= 1.0.1}
|
BuildRequires: %{python_module itemloaders >= 1.0.1}
|
||||||
BuildRequires: %{python_module lxml >= 4.4.1}
|
BuildRequires: %{python_module lxml >= 4.4.1}
|
||||||
@@ -63,6 +65,7 @@ Requires: python-PyDispatcher >= 2.0.5
|
|||||||
Requires: python-Twisted >= 18.9.0
|
Requires: python-Twisted >= 18.9.0
|
||||||
Requires: python-cryptography >= 36.0.0
|
Requires: python-cryptography >= 36.0.0
|
||||||
Requires: python-cssselect >= 0.9.1
|
Requires: python-cssselect >= 0.9.1
|
||||||
|
Requires: python-defusedxml >= 0.7.1
|
||||||
Requires: python-itemadapter >= 0.1.0
|
Requires: python-itemadapter >= 0.1.0
|
||||||
Requires: python-itemloaders >= 1.0.1
|
Requires: python-itemloaders >= 1.0.1
|
||||||
Requires: python-lxml >= 4.4.1
|
Requires: python-lxml >= 4.4.1
|
||||||
@@ -93,7 +96,7 @@ Group: Documentation/HTML
|
|||||||
Provides documentation for %{name}.
|
Provides documentation for %{name}.
|
||||||
|
|
||||||
%prep
|
%prep
|
||||||
%autosetup -p1 -n Scrapy-%{version}
|
%autosetup -p1 -n scrapy-%{version}
|
||||||
|
|
||||||
sed -i -e 's:= python:= python3:g' docs/Makefile
|
sed -i -e 's:= python:= python3:g' docs/Makefile
|
||||||
|
|
||||||
|
3
scrapy-2.11.2.tar.gz
Normal file
3
scrapy-2.11.2.tar.gz
Normal file
@@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:dfbd565384fc3fffeba121f5a3a2d0899ac1f756d41432ca0879933fbfb3401d
|
||||||
|
size 1187710
|
Reference in New Issue
Block a user