35a1d0ac0d
- Use _service to download source and exclude documentation that has a non-commercial license (boo#1232448)
devel
Daniel Garcia
2024-10-29 09:33:33 +00:00
5a720a2613
- Remove nltk_data to avoid redistribution of files with a non-commercial license (boo#1232448):
  > NLTK corpora are provided under the terms given in the README file
  > for each corpus; all are redistributable and available for
  > non-commercial use.
- Remove the unneeded skip-networked-test.patch
Daniel Garcia
2024-10-29 09:09:02 +00:00
f02d8a709b
Accepting request 1218494 from devel:languages:python
Ana Guerrero
2024-10-27 10:25:56 +00:00
9adfbd9e5d
- Add upstream patch CVE-2024-39705.patch to fix unsafe pickle usage (CVE-2024-39705, gh#nltk/nltk#3266, bsc#1227174).
- Drop CVE-2024-39705-disable-download.patch as it's not needed anymore.
Daniel Garcia
2024-07-26 07:21:06 +00:00
a331f038f7
Accepting request 1185062 from devel:languages:python
Ana Guerrero
2024-07-04 14:24:00 +00:00
3ac2448eef
- Use the tarball from GitHub instead of the Zip archive from PyPI; the latter has a very messy combination of CRLF and LF EOLs, which is hard to patch.
- Refresh all patches from the original locations.
- Add CVE-2024-39705-disable-download.patch as a crude workaround for CVE-2024-39705 (gh#nltk/nltk#3266, bsc#1227174).
Matej Cepl
2024-07-02 21:12:29 +00:00
59b1ddaa8c
Accepting request 1160546 from devel:languages:python
Ana Guerrero
2024-03-22 14:21:17 +00:00
f64d1a206e
Accepting request 1160467 from home:bnavigator:branches:devel:languages:python
Daniel Garcia
2024-03-22 06:59:49 +00:00
31f5bb280c
- Update to 3.7
  - Improve and update the NLTK team page on nltk.org (#2855, #2941)
  - Drop support for Python 3.6, support Python 3.10 (#2920)
- Update to 3.6.7
  - Resolve IndexError in sent_tokenize and word_tokenize (#2922)
- Update to 3.6.6
  - Refactor gensim.doctest to work for gensim 4.0.0 and up (#2914)
  - Add Precision, Recall, F-measure, Confusion Matrix to Taggers (#2862)
  - Added warnings if .zip files exist without any corresponding .csv files. (#2908)
  - Fix FileNotFoundError when the download_dir is a non-existing nested folder (#2910)
  - Rename omw to omw-1.4 (#2907)
  - Resolve ReDoS opportunity by fixing incorrectly specified regex (#2906, bsc#1191030, CVE-2021-3828).
  - Support OMW 1.4 (#2899)
  - Deprecate Tree get and set node methods (#2900)
  - Fix broken inaugural test case (#2903)
  - Use Multilingual Wordnet Data from OMW with newer Wordnet versions (#2889)
  - Keep NLTKs "tokenize" module working with pathlib (#2896)
  - Make prettyprinter to be more readable (#2893)
  - Update links to the nltk book (#2895)
  - Add CITATION.cff to nltk (#2880)
  - Resolve serious ReDoS in PunktSentenceTokenizer (#2869)
  - Delete old CI config files (#2881)
Matej Cepl
2022-03-22 07:58:51 +00:00
183dff8b81
- Replace %fdupes -s with plain %fdupes; hardlinks are better.
Matej Cepl
2019-10-14 14:00:56 +00:00
98f9d97e5d
Accepting request 730102 from devel:languages:python
Ludwig Nussel
2019-09-13 12:58:17 +00:00
e103fbc93f
- Update to 3.4.5:
  * Fixed security bug in downloader: Zip slip vulnerability - for the unlikely situation where a user configures their downloader to use a compromised server (CVE-2019-14751)
Tomáš Chvátal
2019-09-11 11:07:28 +00:00
537166da9d
- Update to 3.4.4:
  * fix bug in plot function (probability.py)
  * add improved PanLex Swadesh corpus reader
  * add Text.generate()
  * add QuadgramAssocMeasures
  * add SSP to tokenizers
  * return confidence of best tag from AveragedPerceptron
  * make plot methods return Axes objects
  * don't require list arguments to PositiveNaiveBayesClassifier.train
  * fix Tree classes to work with native Python copy library
  * fix inconsistency for NomBank
  * fix random seeding in LanguageModel.generate
  * fix ConditionalFreqDist mutation on tabulate/plot call
  * fix broken links in documentation
  * fix misc Wordnet issues
  * update installation instructions
Tomáš Chvátal
2019-07-23 13:53:26 +00:00