2012-09-18 20:20:37 +02:00
|
|
|
#
|
|
|
|
# spec file for package python-beautifulsoup4
|
|
|
|
#
|
2017-04-12 21:04:14 +02:00
|
|
|
# Copyright (c) 2017 SUSE LINUX GmbH, Nuernberg, Germany.
|
2012-09-18 20:20:37 +02:00
|
|
|
#
|
|
|
|
# All modifications and additions to the file contributed by third parties
|
|
|
|
# remain the property of their copyright owners, unless otherwise agreed
|
|
|
|
# upon. The license for this file, and modifications and additions to the
|
|
|
|
# file, is the same license as for the pristine package itself (unless the
|
|
|
|
# license for the pristine package is not an Open Source License, in which
|
|
|
|
# case the license is the MIT License). An "Open Source License" is a
|
|
|
|
# license that conforms to the Open Source Definition (Version 1.9)
|
|
|
|
# published by the Open Source Initiative.
|
|
|
|
|
|
|
|
# Please submit bugfixes or comments via http://bugs.opensuse.org/
|
|
|
|
#
|
|
|
|
|
2013-05-02 11:11:07 +02:00
|
|
|
|
2017-04-12 21:04:14 +02:00
|
|
|
%{?!python_module:%define python_module() python-%{**} python3-%{**}}
|
2013-06-27 15:28:55 +02:00
|
|
|
Name: python-beautifulsoup4
|
2017-04-12 21:04:14 +02:00
|
|
|
Version: 4.5.3
|
2012-09-18 20:20:37 +02:00
|
|
|
Release: 0
|
|
|
|
Summary: HTML/XML Parser for Quick-Turnaround Applications Like Screen-Scraping
|
2013-01-10 09:50:08 +01:00
|
|
|
License: MIT
|
2012-09-18 20:20:37 +02:00
|
|
|
Group: Development/Libraries/Python
|
|
|
|
Url: http://www.crummy.com/software/BeautifulSoup/
|
Accepting request 424243 from home:tbechtold:branches:devel:languages:python
- update to 4.5.1:
* Fixed a crash when passing Unicode markup that contained a
processing instruction into the lxml HTML parser on Python
3. [bug=1608048]
* Beautiful Soup is no longer compatible with Python 2.6. This
actually happened a few releases ago, but it's now official.
* Beautiful Soup will now work with versions of html5lib greater than
0.99999999. [bug=1603299]
* If a search against each individual value of a multi-valued
attribute fails, the search will be run one final time against the
complete attribute value considered as a single string. That is, if
a tag has class="foo bar" and neither "foo" nor "bar" matches, but
"foo bar" does, the tag is now considered a match.
This happened in previous versions, but only when the value being
searched for was a string. Now it also works when that value is
a regular expression, a list of strings, etc. [bug=1476868]
* Fixed a bug that deranged the tree when a whitespace element was
reparented into a tag that contained an identical whitespace
element. [bug=1505351]
* Added support for CSS selector values that contain quoted spaces,
such as tag[style="display: foo"]. [bug=1540588]
* Corrected handling of XML processing instructions. [bug=1504393]
* Corrected an encoding error that happened when a BeautifulSoup
object was copied. [bug=1554439]
* The contents of <textarea> tags will no longer be modified when the
tree is prettified. [bug=1555829]
* When a BeautifulSoup object is pickled but its tree builder cannot
be pickled, its .builder attribute is set to None instead of being
destroyed. This avoids a performance problem once the object is
unpickled. [bug=1523629]
OBS-URL: https://build.opensuse.org/request/show/424243
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-beautifulsoup4?expand=0&rev=41
2016-09-01 14:32:00 +02:00
|
|
|
Source: https://pypi.io/packages/source/b/beautifulsoup4/beautifulsoup4-%{version}.tar.gz
|
2014-01-08 16:11:23 +01:00
|
|
|
# PATCH-FIX-UPSTREAM speilicke@suse.com -- Backport of https://code.launchpad.net/~saschpe/beautifulsoup/beautifulsoup/+merge/200849
|
|
|
|
Patch0: beautifulsoup4-lxml-fixes.patch
|
2013-06-27 15:28:55 +02:00
|
|
|
# Documentation requirements:
|
2017-04-12 21:04:14 +02:00
|
|
|
BuildRequires: %{python_module devel >= 2.6}
|
|
|
|
BuildRequires: %{python_module html5lib >= 0.999999}
|
|
|
|
BuildRequires: %{python_module lxml >= 3.4.4}
|
|
|
|
BuildRequires: %{python_module setuptools}
|
|
|
|
# Test requirements
|
|
|
|
BuildRequires: %{python_module nose}
|
|
|
|
BuildRequires: fdupes
|
|
|
|
BuildRequires: python-rpm-macros
|
|
|
|
BuildRequires: python3-Sphinx
|
2015-08-03 11:53:30 +02:00
|
|
|
Requires: python-html5lib >= 0.999999
|
|
|
|
Requires: python-lxml >= 3.4.4
|
2017-04-12 21:04:14 +02:00
|
|
|
BuildRoot: %{_tmppath}/%{name}-%{version}-build
|
2013-06-27 15:28:55 +02:00
|
|
|
BuildArch: noarch
|
2017-04-12 21:04:14 +02:00
|
|
|
%python_subpackages
|
2012-09-18 20:20:37 +02:00
|
|
|
|
|
|
|
%description
|
|
|
|
Beautiful Soup is a Python HTML/XML parser designed for quick turnaround
|
|
|
|
projects like screen-scraping. Three features make it powerful:
|
|
|
|
|
|
|
|
* Beautiful Soup won't choke if you give it bad markup. It yields a parse tree
|
|
|
|
that makes approximately as much sense as your original document. This is
|
|
|
|
usually good enough to collect the data you need and run away
|
|
|
|
|
|
|
|
* Beautiful Soup provides a few simple methods and Pythonic idioms for
|
|
|
|
navigating, searching, and modifying a parse tree: a toolkit for dissecting a
|
|
|
|
document and extracting what you need. You don't have to create a custom
|
|
|
|
parser for each application
|
|
|
|
|
|
|
|
* Beautiful Soup automatically converts incoming documents to Unicode and
|
|
|
|
outgoing documents to UTF-8. You don't have to think about encodings, unless
|
|
|
|
the document doesn't specify an encoding and Beautiful Soup can't autodetect
|
|
|
|
one. Then you just have to specify the original encoding
|
|
|
|
|
|
|
|
Beautiful Soup parses anything you give it, and does the tree traversal stuff
|
|
|
|
for you. You can tell it "Find all the links", or "Find all the links of class
|
|
|
|
externalLink", or "Find all the links whose urls match "foo.com", or "Find the
|
|
|
|
table heading that's got bold text, then give me that text."
|
|
|
|
|
|
|
|
Valuable data that was once locked up in poorly-designed websites is now within
|
|
|
|
your reach. Projects that would have taken hours take only minutes with
|
|
|
|
Beautiful Soup.
|
|
|
|
|
2013-05-18 16:19:58 +02:00
|
|
|
%package doc
|
|
|
|
Summary: Documentation for %{name}
|
|
|
|
Group: Development/Libraries/Python
|
2013-06-13 10:54:41 +02:00
|
|
|
Recommends: %{name} = %{version}
|
2013-05-18 16:19:58 +02:00
|
|
|
|
|
|
|
%description doc
|
|
|
|
Documentation and help files for %{name}
|
|
|
|
|
2012-09-18 20:20:37 +02:00
|
|
|
%prep
|
2013-06-27 15:28:55 +02:00
|
|
|
%setup -q -n beautifulsoup4-%{version}
|
2014-01-08 16:11:23 +01:00
|
|
|
%patch0 -p1
|
2012-09-18 20:20:37 +02:00
|
|
|
|
|
|
|
%build
|
2017-04-12 21:04:14 +02:00
|
|
|
%python_build
|
|
|
|
pushd doc && make html && rm build/html/.buildinfo build/html/objects.inv && popd
|
|
|
|
%{_python_use_flavor python3}
|
|
|
|
%__python3 %{_bindir}/2to3 -w -n build/lib/bs4
|
2012-09-18 20:20:37 +02:00
|
|
|
|
|
|
|
%install
|
2017-04-12 21:04:14 +02:00
|
|
|
%python_install
|
|
|
|
# until it can be fixed
|
|
|
|
find %{buildroot}%{python3_sitelib} -name test_soup.* -delete
|
|
|
|
%python_expand %fdupes -s %{buildroot}%{$python_sitelib}
|
2012-09-18 20:20:37 +02:00
|
|
|
|
2013-01-10 14:18:45 +01:00
|
|
|
%check
|
2017-04-12 21:04:14 +02:00
|
|
|
%{python_expand export PYTHONPATH="%{buildroot}%{$python_sitelib}"
|
|
|
|
pushd $PYTHONPATH
|
|
|
|
$python %{_bindir}/nosetests-%{$python_version}
|
|
|
|
popd
|
|
|
|
}
|
2012-09-18 20:20:37 +02:00
|
|
|
|
2017-04-12 21:04:14 +02:00
|
|
|
%files %{python_files}
|
2012-09-18 20:20:37 +02:00
|
|
|
%defattr(-,root,root)
|
2013-05-18 16:19:58 +02:00
|
|
|
%doc AUTHORS.txt COPYING.txt
|
2013-01-12 15:09:18 +01:00
|
|
|
%{python_sitelib}/bs4/
|
2013-06-27 15:28:55 +02:00
|
|
|
%{python_sitelib}/beautifulsoup4-%{version}-py*.egg-info
|
2012-09-18 20:20:37 +02:00
|
|
|
|
2017-04-12 21:04:14 +02:00
|
|
|
%files %{python_files doc}
|
2013-05-18 16:19:58 +02:00
|
|
|
%defattr(-,root,root)
|
2013-06-27 15:28:55 +02:00
|
|
|
%doc NEWS.txt README.txt TODO.txt doc/build/html
|
2013-05-18 16:19:58 +02:00
|
|
|
|
2012-09-18 20:20:37 +02:00
|
|
|
%changelog
|