2017-04-12 21:04:14 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Sat Apr 8 17:35:17 UTC 2017 - aloisio@gmx.com
|
|
|
|
|
|
|
|
- update to version 4.5.3:
|
|
|
|
* Fixed foster parenting when html5lib is the tree builder. Thanks
|
|
|
|
to Geoffrey Sneddon for a patch and test.
|
|
|
|
* Fixed yet another problem that caused the html5lib tree builder to
|
|
|
|
create a disconnected parse tree. [bug=1629825]
|
|
|
|
changes from version 4.5.2:
|
|
|
|
* Apart from the version number, this release is identical to
|
|
|
|
4.5.3. Due to user error, it could not be completely uploaded to
|
|
|
|
PyPI. Use 4.5.3 instead.
|
|
|
|
|
|
|
|
- Converted to single-spec
|
|
|
|
|
2016-09-02 10:07:32 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Thu Sep 1 19:20:36 UTC 2016 - tbechtold@suse.com
|
|
|
|
|
|
|
|
- Relax BuildRequires for python-Sphinx
|
|
|
|
|
Accepting request 424243 from home:tbechtold:branches:devel:languages:python
- update to 4.5.1:
* Fixed a crash when passing Unicode markup that contained a
processing instruction into the lxml HTML parser on Python
3. [bug=1608048]
* Beautiful Soup is no longer compatible with Python 2.6. This
actually happened a few releases ago, but it's now official.
* Beautiful Soup will now work with versions of html5lib greater than
0.99999999. [bug=1603299]
* If a search against each individual value of a multi-valued
attribute fails, the search will be run one final time against the
complete attribute value considered as a single string. That is, if
a tag has class="foo bar" and neither "foo" nor "bar" matches, but
"foo bar" does, the tag is now considered a match.
This happened in previous versions, but only when the value being
searched for was a string. Now it also works when that value is
a regular expression, a list of strings, etc. [bug=1476868]
* Fixed a bug that deranged the tree when a whitespace element was
reparented into a tag that contained an identical whitespace
element. [bug=1505351]
* Added support for CSS selector values that contain quoted spaces,
such as tag[style="display: foo"]. [bug=1540588]
* Corrected handling of XML processing instructions. [bug=1504393]
* Corrected an encoding error that happened when a BeautifulSoup
object was copied. [bug=1554439]
* The contents of <textarea> tags will no longer be modified when the
tree is prettified. [bug=1555829]
* When a BeautifulSoup object is pickled but its tree builder cannot
be pickled, its .builder attribute is set to None instead of being
destroyed. This avoids a performance problem once the object is
unpickled. [bug=1523629]
OBS-URL: https://build.opensuse.org/request/show/424243
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-beautifulsoup4?expand=0&rev=41
2016-09-01 14:32:00 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Thu Sep 1 10:26:24 UTC 2016 - tbechtold@suse.com
|
|
|
|
|
|
|
|
- update to 4.5.1:
|
|
|
|
* Fixed a crash when passing Unicode markup that contained a
|
|
|
|
processing instruction into the lxml HTML parser on Python
|
|
|
|
3. [bug=1608048]
|
|
|
|
* Beautiful Soup is no longer compatible with Python 2.6. This
|
|
|
|
actually happened a few releases ago, but it's now official.
|
|
|
|
* Beautiful Soup will now work with versions of html5lib greater than
|
|
|
|
0.99999999. [bug=1603299]
|
|
|
|
* If a search against each individual value of a multi-valued
|
|
|
|
attribute fails, the search will be run one final time against the
|
|
|
|
complete attribute value considered as a single string. That is, if
|
|
|
|
a tag has class="foo bar" and neither "foo" nor "bar" matches, but
|
|
|
|
"foo bar" does, the tag is now considered a match.
|
|
|
|
This happened in previous versions, but only when the value being
|
|
|
|
searched for was a string. Now it also works when that value is
|
|
|
|
a regular expression, a list of strings, etc. [bug=1476868]
|
|
|
|
* Fixed a bug that deranged the tree when a whitespace element was
|
|
|
|
reparented into a tag that contained an identical whitespace
|
|
|
|
element. [bug=1505351]
|
|
|
|
* Added support for CSS selector values that contain quoted spaces,
|
|
|
|
such as tag[style="display: foo"]. [bug=1540588]
|
|
|
|
* Corrected handling of XML processing instructions. [bug=1504393]
|
|
|
|
* Corrected an encoding error that happened when a BeautifulSoup
|
|
|
|
object was copied. [bug=1554439]
|
|
|
|
* The contents of <textarea> tags will no longer be modified when the
|
|
|
|
tree is prettified. [bug=1555829]
|
|
|
|
* When a BeautifulSoup object is pickled but its tree builder cannot
|
|
|
|
be pickled, its .builder attribute is set to None instead of being
|
|
|
|
destroyed. This avoids a performance problem once the object is
|
|
|
|
unpickled. [bug=1523629]
|
|
|
|
* Specify the file and line number when warning about a
|
|
|
|
BeautifulSoup object being instantiated without a parser being
|
|
|
|
specified. [bug=1574647]
|
|
|
|
* The `limit` argument to `select()` now works correctly, though it's
|
|
|
|
not implemented very efficiently. [bug=1520530]
|
|
|
|
* Fixed a Python 3 ByteWarning when a URL was passed in as though it
|
|
|
|
were markup. Thanks to James Salter for a patch and
|
|
|
|
test. [bug=1533762]
|
|
|
|
* We don't run the check for a filename passed in as markup if the
|
|
|
|
'filename' contains a less-than character; the less-than character
|
|
|
|
indicates it's most likely a very small document. [bug=1577864]
|
|
|
|
|
2015-11-17 11:22:25 +01:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Sun Nov 15 16:31:46 UTC 2015 - idonmez@suse.com
|
|
|
|
|
|
|
|
- Update to version 4.4.1
|
|
|
|
* Fixed a bug that deranged the tree when part of it was
|
|
|
|
removed. Thanks to Eric Weiser for the patch and John Wiseman for a
|
|
|
|
test. lp#1481520
|
|
|
|
* Fixed a parse bug with the html5lib tree-builder. Thanks to Roel
|
|
|
|
Kramer for the patch. lp#1483781
|
|
|
|
* Improved the implementation of CSS selector grouping. Thanks to
|
|
|
|
Orangain for the patch. lp#1484543
|
|
|
|
* Fixed the test_detect_utf8 test so that it works when chardet is
|
|
|
|
installed. lp#1471359
|
|
|
|
* Corrected the output of Declaration objects. lp#1477847
|
|
|
|
|
2015-08-03 11:53:30 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Mon Jul 27 18:54:20 UTC 2015 - aloisio@gmx.com
|
|
|
|
|
|
|
|
- update to 4.4.0
|
|
|
|
Especially important changes:
|
|
|
|
* Added a warning when you instantiate a BeautifulSoup object without
|
|
|
|
explicitly naming a parser. [bug=1398866]
|
|
|
|
* __repr__ now returns an ASCII bytestring in Python 2, and a Unicode
|
|
|
|
string in Python 3, instead of a UTF8-encoded bytestring in both
|
|
|
|
versions. In Python 3, __str__ now returns a Unicode string instead
|
|
|
|
of a bytestring. [bug=1420131]
|
|
|
|
* The `text` argument to the find_* methods is now called `string`,
|
|
|
|
which is more accurate. `text` still works, but `string` is the
|
|
|
|
argument described in the documentation. `text` may eventually
|
|
|
|
change its meaning, but not for a very long time. [bug=1366856]
|
|
|
|
* Changed the way soup objects work under copy.copy(). Copying a
|
|
|
|
NavigableString or a Tag will give you a new NavigableString that's
|
|
|
|
equal to the old one but not connected to the parse tree. Patch by
|
|
|
|
Martijn Peters. [bug=1307490]
|
|
|
|
* Started using a standard MIT license. [bug=1294662]
|
|
|
|
* Added a Chinese translation of the documentation by Delong .w.
|
|
|
|
New features:
|
|
|
|
* Introduced the select_one() method, which uses a CSS selector but
|
|
|
|
only returns the first match, instead of a list of
|
|
|
|
matches. [bug=1349367]
|
|
|
|
* You can now create a Tag object without specifying a
|
|
|
|
TreeBuilder. Patch by Martijn Pieters. [bug=1307471]
|
|
|
|
* You can now create a NavigableString or a subclass just by invoking
|
|
|
|
the constructor. [bug=1294315]
|
|
|
|
* Added an `exclude_encodings` argument to UnicodeDammit and to the
|
|
|
|
Beautiful Soup constructor, which lets you prohibit the detection of
|
|
|
|
an encoding that you know is wrong. [bug=1469408]
|
|
|
|
* The select() method now supports selector grouping. Patch by
|
|
|
|
Francisco Canas [bug=1191917]
|
|
|
|
Bug fixes:
|
|
|
|
* Fixed yet another problem that caused the html5lib tree builder to
|
|
|
|
create a disconnected parse tree. [bug=1237763]
|
|
|
|
* Force object_was_parsed() to keep the tree intact even when an element
|
|
|
|
from later in the document is moved into place. [bug=1430633]
|
|
|
|
* Fixed yet another bug that caused a disconnected tree when html5lib
|
|
|
|
copied an element from one part of the tree to another. [bug=1270611]
|
|
|
|
* Fixed a bug where Element.extract() could create an infinite loop in
|
|
|
|
the remaining tree.
|
|
|
|
* The select() method can now find tags whose names contain
|
|
|
|
dashes. Patch by Francisco Canas. [bug=1276211]
|
|
|
|
* The select() method can now find tags with attributes whose names
|
|
|
|
contain dashes. Patch by Marek Kapolka. [bug=1304007]
|
|
|
|
* Improved the lxml tree builder's handling of processing
|
|
|
|
instructions. [bug=1294645]
|
|
|
|
* Restored the helpful syntax error that happens when you try to
|
|
|
|
import the Python 2 edition of Beautiful Soup under Python 3.
|
|
|
|
[bug=1213387]
|
|
|
|
* In Python 3.4 and above, set the new convert_charrefs argument to
|
|
|
|
the html.parser constructor to avoid a warning and future
|
|
|
|
failures. Patch by Stefano Revera. [bug=1375721]
|
|
|
|
* The warning when you pass in a filename or URL as markup will now be
|
|
|
|
displayed correctly even if the filename or URL is a Unicode
|
|
|
|
string. [bug=1268888]
|
|
|
|
* If the initial <html> tag contains a CDATA list attribute such as
|
|
|
|
'class', the html5lib tree builder will now turn its value into a
|
|
|
|
list, as it would with any other tag. [bug=1296481]
|
|
|
|
* Fixed an import error in Python 3.5 caused by the removal of the
|
|
|
|
HTMLParseError class. [bug=1420063]
|
|
|
|
* Improved docstring for encode_contents() and
|
|
|
|
decode_contents(). [bug=1441543]
|
|
|
|
* Fixed a crash in Unicode, Dammit's encoding detector when the name
|
|
|
|
of the encoding itself contained invalid bytes. [bug=1360913]
|
|
|
|
* Improved the exception raised when you call .unwrap() or
|
|
|
|
.replace_with() on an element that's not attached to a tree.
|
|
|
|
* Raise a NotImplementedError whenever an unsupported CSS pseudoclass
|
|
|
|
is used in select(). Previously some cases did not result in a
|
|
|
|
NotImplementedError.
|
|
|
|
* It's now possible to pickle a BeautifulSoup object no matter which
|
|
|
|
tree builder was used to create it. However, the only tree builder
|
|
|
|
that survives the pickling process is the HTMLParserTreeBuilder
|
|
|
|
('html.parser'). If you unpickle a BeautifulSoup object created with
|
|
|
|
some other tree builder, soup.builder will be None. [bug=1231545]
|
|
|
|
- Aligned requirement version with PyPI
|
|
|
|
|
2015-07-26 14:39:45 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Fri Jul 24 20:25:54 UTC 2015 - seife+obs@b1-systems.com
|
|
|
|
|
|
|
|
- fix non-SUSE build by conditionalizing Recommends: tag
|
|
|
|
|
2014-01-08 16:11:23 +01:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Wed Jan 8 15:05:55 UTC 2014 - speilicke@suse.com
|
|
|
|
|
|
|
|
- Add beautifulsoup4-lxml-fixes.patch: LXML fixes
|
|
|
|
|
2013-10-22 15:47:05 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Tue Oct 22 13:00:00 UTC 2013 - toddrme2178@gmail.com
|
|
|
|
|
|
|
|
- update to 4.3.2
|
|
|
|
* Fixed a bug in which short Unicode input was improperly encoded to
|
|
|
|
ASCII when checking whether or not it was the name of a file on
|
|
|
|
disk. [bug=1227016]
|
|
|
|
* Fixed a crash when a short input contains data not valid in
|
|
|
|
filenames. [bug=1232604]
|
|
|
|
* Fixed a bug that caused Unicode data put into UnicodeDammit to
|
|
|
|
return None instead of the original data. [bug=1214983]
|
|
|
|
* Combined two tests to stop a spurious test failure when tests are
|
|
|
|
run by nosetests. [bug=1212445]
|
|
|
|
- update to 4.3.1
|
|
|
|
* Fixed yet another problem with the html5lib tree builder, caused by
|
|
|
|
html5lib's tendency to rearrange the tree during
|
|
|
|
parsing. [bug=1189267]
|
|
|
|
* Fixed a bug that caused the optimized version of find_all() to
|
|
|
|
return nothing. [bug=1212655]
|
|
|
|
- update to 4.3.0
|
|
|
|
* Instead of converting incoming data to Unicode and feeding it to the
|
|
|
|
lxml tree builder in chunks, Beautiful Soup now makes successive
|
|
|
|
guesses at the encoding of the incoming data, and tells lxml to
|
|
|
|
parse the data as that encoding. Giving lxml more control over the
|
|
|
|
parsing process improves performance and avoids a number of bugs and
|
|
|
|
issues with the lxml parser which had previously required elaborate
|
|
|
|
workarounds:
|
|
|
|
- An issue in which lxml refuses to parse Unicode strings on some
|
|
|
|
systems. [bug=1180527]
|
|
|
|
- A returning bug that truncated documents longer than a (very
|
|
|
|
small) size. [bug=963880]
|
|
|
|
- A returning bug in which extra spaces were added to a document if
|
|
|
|
the document defined a charset other than UTF-8. [bug=972466]
|
|
|
|
This required a major overhaul of the tree builder architecture. If
|
|
|
|
you wrote your own tree builder and didn't tell me, you'll need to
|
|
|
|
modify your prepare_markup() method.
|
|
|
|
* The UnicodeDammit code that makes guesses at encodings has been
|
|
|
|
split into its own class, EncodingDetector. A lot of apparently
|
|
|
|
redundant code has been removed from Unicode, Dammit, and some
|
|
|
|
undocumented features have also been removed.
|
|
|
|
* Beautiful Soup will issue a warning if instead of markup you pass it
|
|
|
|
a URL or the name of a file on disk (a common beginner's mistake).
|
|
|
|
* A number of optimizations improve the performance of the lxml tree
|
|
|
|
builder by about 33%, the html.parser tree builder by about 20%, and
|
|
|
|
the html5lib tree builder by about 15%.
|
|
|
|
* All find_all calls should now return a ResultSet object. Patch by
|
|
|
|
Aaron DeVore. [bug=1194034]
|
|
|
|
|
2013-07-22 08:18:03 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Fri Jul 19 17:07:52 UTC 2013 - berendt@b1-systems.de
|
|
|
|
|
|
|
|
- remove .buildinfo before installation
|
|
|
|
|
2013-07-18 11:04:15 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Thu Jul 18 08:25:50 UTC 2013 - berendt@b1-systems.de
|
|
|
|
|
|
|
|
- removed python-lxml as build requirement to be able to
|
|
|
|
successfully pass the check section on SLES11 SP3
|
|
|
|
|
2013-06-27 15:28:55 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Thu Jun 27 13:32:06 UTC 2013 - speilicke@suse.com
|
|
|
|
|
|
|
|
- Update upstream URL
|
|
|
|
|
2013-06-25 14:20:25 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Tue Jun 25 11:52:34 UTC 2013 - dmueller@suse.com
|
|
|
|
|
|
|
|
- update to 4.2.1:
|
|
|
|
* The default XML formatter will now replace ampersands even if they
|
|
|
|
appear to be part of entities. That is, "<" will become
|
|
|
|
"&lt;". The old code was left over from Beautiful Soup 3, which
|
|
|
|
didn't always turn entities into Unicode characters.
|
|
|
|
|
|
|
|
If you really want the old behavior (maybe because you add new
|
|
|
|
strings to the tree, those strings include entities, and you want
|
|
|
|
the formatter to leave them alone on output), it can be found in
|
|
|
|
EntitySubstitution.substitute_xml_containing_entities(). [bug=1182183]
|
|
|
|
|
|
|
|
* Gave new_string() the ability to create subclasses of
|
|
|
|
NavigableString. [bug=1181986]
|
|
|
|
|
|
|
|
* Fixed another bug by which the html5lib tree builder could create a
|
|
|
|
disconnected tree. [bug=1182089]
|
|
|
|
|
|
|
|
* The .previous_element of a BeautifulSoup object is now always None,
|
|
|
|
not the last element to be parsed. [bug=1182089]
|
|
|
|
|
|
|
|
* Fixed test failures when lxml is not installed. [bug=1181589]
|
|
|
|
|
|
|
|
* html5lib now supports Python 3. Fixed some Python 2-specific
|
|
|
|
code in the html5lib test suite. [bug=1181624]
|
|
|
|
|
|
|
|
* The html.parser treebuilder can now handle numeric attributes in
|
|
|
|
text when the hexidecimal name of the attribute starts with a
|
|
|
|
capital X. Patch by Tim Shirley. [bug=1186242]
|
|
|
|
|
2013-06-13 10:54:41 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Mon Jun 10 20:34:00 UTC 2013 - dmueller@suse.com
|
|
|
|
|
|
|
|
- disable tests on SLE_11, fail due to too old python-lxml
|
|
|
|
|
2013-05-18 16:19:58 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Sat May 18 13:30:00 UTC 2013 - toddrme2178@gmail.com
|
|
|
|
|
|
|
|
- Update to 4.2.0
|
|
|
|
* The Tag.select() method now supports a much wider variety of CSS
|
|
|
|
selectors.
|
|
|
|
- Added support for the adjacent sibling combinator (+) and the
|
|
|
|
general sibling combinator (~). Tests by "liquider". [bug=1082144]
|
|
|
|
- The combinators (>, +, and ~) can now combine with any supported
|
|
|
|
selector, not just one that selects based on tag name.
|
|
|
|
- Added limited support for the "nth-of-type" pseudo-class. Code
|
|
|
|
by Sven Slootweg. [bug=1109952]
|
|
|
|
* The BeautifulSoup class is now aliased to "_s" and "_soup", making
|
|
|
|
it quicker to type the import statement in an interactive session
|
|
|
|
The alias may change in the future, so don't use this in code you're
|
|
|
|
going to run more than once.
|
|
|
|
* Added the 'diagnose' submodule, which includes several useful
|
|
|
|
functions for reporting problems and doing tech support.
|
|
|
|
- diagnose(data) tries the given markup on every installed parser,
|
|
|
|
reporting exceptions and displaying successes. If a parser is not
|
|
|
|
installed, diagnose() mentions this fact.
|
|
|
|
- lxml_trace(data, html=True) runs the given markup through lxml's
|
|
|
|
XML parser or HTML parser, and prints out the parser events as
|
|
|
|
they happen. This helps you quickly determine whether a given
|
|
|
|
problem occurs in lxml code or Beautiful Soup code.
|
|
|
|
- htmlparser_trace(data) is the same thing, but for Python's
|
|
|
|
built-in HTMLParser class.
|
|
|
|
* In an HTML document, the contents of a <script> or <style> tag will
|
|
|
|
no longer undergo entity substitution by default. XML documents work
|
|
|
|
the same way they did before. [bug=1085953]
|
|
|
|
* Methods like get_text() and properties like .strings now only give
|
|
|
|
you strings that are visible in the document--no comments or
|
|
|
|
processing commands. [bug=1050164]
|
|
|
|
* The prettify() method now leaves the contents of <pre> tags
|
|
|
|
alone. [bug=1095654]
|
|
|
|
* Fix a bug in the html5lib treebuilder which sometimes created
|
|
|
|
disconnected trees. [bug=1039527]
|
|
|
|
* Fix a bug in the lxml treebuilder which crashed when a tag included
|
|
|
|
an attribute from the predefined "xml:" namespace. [bug=1065617]
|
|
|
|
* Fix a bug by which keyword arguments to find_parent() were not
|
|
|
|
being passed on. [bug=1126734]
|
|
|
|
* Stop a crash when unwisely messing with a tag that's been
|
|
|
|
decomposed. [bug=1097699]
|
|
|
|
* Now that lxml's segfault on invalid doctype has been fixed, fixed a
|
|
|
|
corresponding problem on the Beautiful Soup end that was previously
|
|
|
|
invisible. [bug=984936]
|
|
|
|
* Fixed an exception when an overspecified CSS selector didn't match
|
|
|
|
anything. Code by Stefaan Lippens. [bug=1168167]
|
|
|
|
- Re-enable lxml support (unit tests require it)
|
|
|
|
- Build documentation and add doc sub-package
|
|
|
|
|
2013-05-02 11:11:07 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Tue Apr 30 12:59:02 UTC 2013 - dmueller@suse.com
|
|
|
|
|
|
|
|
- remove lxml support (fails unit test)
|
|
|
|
|
2013-01-12 15:10:31 +01:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Sat Jan 12 14:10:18 UTC 2013 - toddrme2178@gmail.com
|
|
|
|
|
|
|
|
- Use explicit file list
|
|
|
|
- Fix building on openSUSE 12.1 and 12.2
|
|
|
|
- Use recommended lxml parser instead of native one
|
|
|
|
(native fails fails for some python versions)
|
|
|
|
|
2013-01-10 09:50:08 +01:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Wed Jan 9 21:15:18 UTC 2013 - cfarrell@suse.com
|
|
|
|
|
|
|
|
- license update: MIT
|
|
|
|
See COPYING.txt
|
|
|
|
|
2012-09-18 20:20:37 +02:00
|
|
|
-------------------------------------------------------------------
|
|
|
|
Mon Sep 10 18:52:45 UTC 2012 - nmo.marques@gmail.com
|
|
|
|
|
|
|
|
- initial package from version 4.1.3
|
|
|
|
- based on spec file from python-beautifulsoup
|
|
|
|
- requires python >= 2.6
|
|
|
|
|