forked from pool/python-nltk
		
	- Update to 3.7
- Improve and update the NLTK team page on nltk.org (#2855, #2941) - Drop support for Python 3.6, support Python 3.10 (#2920) - Update to 3.6.7 - Resolve IndexError in `sent_tokenize` and `word_tokenize` (#2922) - Update to 3.6.6 - Refactor `gensim.doctest` to work for gensim 4.0.0 and up (#2914) - Add Precision, Recall, F-measure, Confusion Matrix to Taggers (#2862) - Added warnings if .zip files exist without any corresponding .csv files. (#2908) - Fix `FileNotFoundError` when the `download_dir` is a non-existing nested folder (#2910) - Rename omw to omw-1.4 (#2907) - Resolve ReDoS opportunity by fixing incorrectly specified regex (#2906, bsc#1191030, CVE-2021-3828). - Support OMW 1.4 (#2899) - Deprecate Tree get and set node methods (#2900) - Fix broken inaugural test case (#2903) - Use Multilingual Wordnet Data from OMW with newer Wordnet versions (#2889) - Keep NLTKs "tokenize" module working with pathlib (#2896) - Make prettyprinter to be more readable (#2893) - Update links to the nltk book (#2895) - Add `CITATION.cff` to nltk (#2880) - Resolve serious ReDoS in PunktSentenceTokenizer (#2869) - Delete old CI config files (#2881) OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=36
This commit is contained in:
		| @@ -1,3 +0,0 @@ | |||||||
| version https://git-lfs.github.com/spec/v1 |  | ||||||
| oid sha256:845365449cd8c5f9731f7cb9f8bd6fd0767553b9d53af9eb1b3abf7700936b35 |  | ||||||
| size 1433531 |  | ||||||
							
								
								
									
										3
									
								
								nltk-3.7.zip
									
									
									
									
									
										Normal file
									
								
							
							
						
						
									
										3
									
								
								nltk-3.7.zip
									
									
									
									
									
										Normal file
									
								
							| @@ -0,0 +1,3 @@ | |||||||
|  | version https://git-lfs.github.com/spec/v1 | ||||||
|  | oid sha256:d6507d6460cec76d70afea4242a226a7542f85c669177b9c7f562b7cf1b05502 | ||||||
|  | size 3293449 | ||||||
| @@ -1,3 +1,115 @@ | |||||||
|  | ------------------------------------------------------------------- | ||||||
|  | Tue Mar 22 07:48:14 UTC 2022 - Matej Cepl <mcepl@suse.com> | ||||||
|  |  | ||||||
|  | - Update to 3.7 | ||||||
|  |   - Improve and update the NLTK team page on nltk.org (#2855, | ||||||
|  |     #2941) | ||||||
|  |   - Drop support for Python 3.6, support Python 3.10 (#2920) | ||||||
|  | - Update to 3.6.7 | ||||||
|  |   - Resolve IndexError in `sent_tokenize` and `word_tokenize` | ||||||
|  |     (#2922) | ||||||
|  | - Update to 3.6.6 | ||||||
|  |   - Refactor `gensim.doctest` to work for gensim 4.0.0 and up | ||||||
|  |     (#2914) | ||||||
|  |   - Add Precision, Recall, F-measure, Confusion Matrix to Taggers | ||||||
|  |     (#2862) | ||||||
|  |   - Added warnings if .zip files exist without any corresponding | ||||||
|  |     .csv files. (#2908) | ||||||
|  |   - Fix `FileNotFoundError` when the `download_dir` is | ||||||
|  |     a non-existing nested folder (#2910) | ||||||
|  |   - Rename omw to omw-1.4 (#2907) | ||||||
|  |   - Resolve ReDoS opportunity by fixing incorrectly specified | ||||||
|  |     regex (#2906, bsc#1191030, CVE-2021-3828). | ||||||
|  |   - Support OMW 1.4 (#2899) | ||||||
|  |   - Deprecate Tree get and set node methods (#2900) | ||||||
|  |   - Fix broken inaugural test case (#2903) | ||||||
|  |   - Use Multilingual Wordnet Data from OMW with newer Wordnet | ||||||
|  |     versions (#2889) | ||||||
|  |   - Keep NLTKs "tokenize" module working with pathlib (#2896) | ||||||
|  |   - Make prettyprinter to be more readable (#2893) | ||||||
|  |   - Update links to the nltk book (#2895) | ||||||
|  |   - Add `CITATION.cff` to nltk (#2880) | ||||||
|  |   - Resolve serious ReDoS in PunktSentenceTokenizer (#2869) | ||||||
|  |   - Delete old CI config files (#2881) | ||||||
|  |   - Improve Tokenize documentation + add TokenizerI as superclass | ||||||
|  |     for TweetTokenizer (#2878) | ||||||
|  |   - Fix expected value for BLEU score doctest after changes from | ||||||
|  |     #2572 | ||||||
|  |   - Add multi Bleu functionality and tests (#2793) | ||||||
|  |   - Deprecate 'return_str' parameter in NLTKWordTokenizer and | ||||||
|  |     TreebankWordTokenizer (#2883) | ||||||
|  |   - Allow empty string in CFG's + more (#2888) | ||||||
|  |   - Partition `tree.py` module into `tree` package + pickle fix | ||||||
|  |     (#2863) | ||||||
|  |   - Fix several TreebankWordTokenizer and NLTKWordTokenizer bugs | ||||||
|  |     (#2877) | ||||||
|  |   - Rewind Wordnet data file after each lookup (#2868) | ||||||
|  |   - Correct __init__ call for SyntaxCorpusReader subclasses | ||||||
|  |     (#2872) | ||||||
|  |   - Documentation fixes (#2873) | ||||||
|  |   - Fix levenstein distance for duplicated letters (#2849) | ||||||
|  |   - Support alternative Wordnet versions (#2860) | ||||||
|  |   - Remove hundreds of formatting warnings for nltk.org (#2859) | ||||||
|  |   - Modernize `nltk.org/howto` pages (#2856) | ||||||
|  |   - Fix Bleu Score smoothing function from taking log(0) (#2839) | ||||||
|  |   - Update third party tools to newer versions and removing | ||||||
|  |     MaltParser fixed version (#2832) | ||||||
|  |   - Fix TypeError: _pretty() takes 1 positional argument but 2 | ||||||
|  |     were given in sem/drt.py (#2854) | ||||||
|  |   - Replace `http` with `https` in most URLs (#2852) | ||||||
|  | - Update to 3.6.5 | ||||||
|  |   - modernised nltk.org website | ||||||
|  |   - addressed LGTM.com issues | ||||||
|  |   - support ZWJ sequences emoji and skin tone modifer emoji in | ||||||
|  |     TweetTokenizer | ||||||
|  |   - METEOR evaluation now requires pre-tokenized input | ||||||
|  |   - Code linting and type hinting | ||||||
|  |   - implement get_refs function for DrtLambdaExpression | ||||||
|  |   - Enable automated CoreNLP, Senna, Prover9/Mace4, Megam, | ||||||
|  |     MaltParser CI tests | ||||||
|  |   - specify minimum regex version that supports regex.Pattern | ||||||
|  |   - avoid re.Pattern and regex.Pattern which fail for Python 3.6, | ||||||
|  |     3.7 | ||||||
|  | - Update to 3.6.4 | ||||||
|  |   - deprecate `nltk.usage(obj)` in favor of `help(obj)` | ||||||
|  |   - resolve ReDoS vulnerability in Corpus Reader | ||||||
|  |   - solidify performance tests | ||||||
|  |   - improve phone number recognition in tweet tokenizer | ||||||
|  |   - refactored CISTEM stemmer for German | ||||||
|  |   - identify NLTK Team as the author | ||||||
|  |   - replace travis badge with github actions badge | ||||||
|  |   - add SECURITY.md | ||||||
|  | - Update to 3.6.3 | ||||||
|  |   - Dropped support for Python 3.5 | ||||||
|  |   - Run CI tests on Windows, too | ||||||
|  |   - Moved from Travis CI to GitHub Actions | ||||||
|  |   - Code and comment cleanups | ||||||
|  |   - Visualize WordNet relation graphs using Graphviz | ||||||
|  |   - Fixed large error in METEOR score | ||||||
|  |   - Apply isort, pyupgrade, black, added as pre-commit hooks | ||||||
|  |   - Prevent debug_decisions in Punkt from throwing IndexError | ||||||
|  |   - Resolved ZeroDivisionError in RIBES with dissimilar sentences | ||||||
|  |   - Initialize WordNet IC total counts with smoothing value | ||||||
|  |   - Fixed AttributeError for Arabic ARLSTem2 stemmer | ||||||
|  |   - Many fixes and improvements to lm language model package | ||||||
|  |   - Fix bug in nltk.metrics.aline, C_skip = -10 | ||||||
|  |   - Improvements to TweetTokenizer | ||||||
|  |   - Optional show arg for FreqDist.plot, ConditionalFreqDist.plot | ||||||
|  |   - edit_distance now computes Damerau-Levenshtein edit-distance | ||||||
|  | - Update to 3.6.2 | ||||||
|  |   - move test code to nltk/test | ||||||
|  |   - fix bug in NgramAssocMeasures (order preserving fix) | ||||||
|  | - Update to 3.6 | ||||||
|  |   - add support for Python 3.9 | ||||||
|  |   - add Tree.fromlist | ||||||
|  |   - compute Minimum Spanning Tree of unweighted graph using BFS | ||||||
|  |   - fix bug with infinite loop in Wordnet closure and tree | ||||||
|  |   - fix bug in calculating BLEU using smoothing method 4 | ||||||
|  |   - Wordnet synset similarities work for all pos | ||||||
|  |   - new Arabic light stemmer (ARLSTem2) | ||||||
|  |   - new syllable tokenizer (LegalitySyllableTokenizer) | ||||||
|  |   - remove nose in favor of pytest | ||||||
|  |  | ||||||
| ------------------------------------------------------------------- | ------------------------------------------------------------------- | ||||||
| Thu Apr 23 13:54:08 UTC 2020 - John Vandenberg <jayvdb@gmail.com> | Thu Apr 23 13:54:08 UTC 2020 - John Vandenberg <jayvdb@gmail.com> | ||||||
|  |  | ||||||
|   | |||||||
| @@ -1,7 +1,7 @@ | |||||||
| # | # | ||||||
| # spec file for package python-nltk | # spec file for package python-nltk | ||||||
| # | # | ||||||
| # Copyright (c) 2020 SUSE LINUX GmbH, Nuernberg, Germany. | # Copyright (c) 2022 SUSE LLC | ||||||
| # | # | ||||||
| # All modifications and additions to the file contributed by third parties | # All modifications and additions to the file contributed by third parties | ||||||
| # remain the property of their copyright owners, unless otherwise agreed | # remain the property of their copyright owners, unless otherwise agreed | ||||||
| @@ -20,7 +20,7 @@ | |||||||
| %define pyname nltk | %define pyname nltk | ||||||
| %define skip_python2 1 | %define skip_python2 1 | ||||||
| Name:           python-nltk | Name:           python-nltk | ||||||
| Version:        3.5 | Version:        3.7 | ||||||
| Release:        0 | Release:        0 | ||||||
| Summary:        Natural Language Toolkit | Summary:        Natural Language Toolkit | ||||||
| License:        Apache-2.0 | License:        Apache-2.0 | ||||||
| @@ -44,8 +44,8 @@ Recommends:     python-requests | |||||||
| Recommends:     python-scikit-learn | Recommends:     python-scikit-learn | ||||||
| Recommends:     python-scipy | Recommends:     python-scipy | ||||||
| Recommends:     python-twython | Recommends:     python-twython | ||||||
| Requires(post):   update-alternatives | Requires(post): update-alternatives | ||||||
| Requires(postun):  update-alternatives | Requires(postun):update-alternatives | ||||||
| BuildArch:      noarch | BuildArch:      noarch | ||||||
| %python_subpackages | %python_subpackages | ||||||
|  |  | ||||||
| @@ -55,13 +55,13 @@ Python modules, data sets and tutorials supporting research and | |||||||
| development in Natural Language Processing. | development in Natural Language Processing. | ||||||
|  |  | ||||||
| %prep | %prep | ||||||
| %setup -q -n %{pyname}-%{version} | %autosetup -p1 -n %{pyname}-%{version} | ||||||
|  |  | ||||||
| sed -i "1,4{/\/usr\/bin\/env/d}" nltk/corpus/reader/knbc.py | # sed -i "1,4{/\/usr\/bin\/env/d}" nltk/corpus/reader/knbc.py | ||||||
| sed -i "1,4{/\/usr\/bin\/env/d}" nltk/test/runtests.py | # sed -i "1,4{/\/usr\/bin\/env/d}" nltk/test/runtests.py | ||||||
| sed -i "1,4{/\/usr\/bin\/env/d}" nltk/test/unit/test_tgrep.py | # sed -i "1,4{/\/usr\/bin\/env/d}" nltk/test/unit/test_tgrep.py | ||||||
| sed -i "1,4{/\/usr\/bin\/env/d}" nltk/tgrep.py | # sed -i "1,4{/\/usr\/bin\/env/d}" nltk/tgrep.py | ||||||
| sed -i "1,4{/\/usr\/bin\/env/d}" nltk/tokenize/stanford_segmenter.py | # sed -i "1,4{/\/usr\/bin\/env/d}" nltk/tokenize/stanford_segmenter.py | ||||||
|  |  | ||||||
| %build | %build | ||||||
| %python_build | %python_build | ||||||
|   | |||||||
		Reference in New Issue
	
	Block a user