2010-01-15 15:00:38 +01:00
|
|
|
#
|
2013-10-24 13:09:21 +02:00
|
|
|
# spec file for package python-nltk
|
2010-01-15 15:00:38 +01:00
|
|
|
#
|
2024-03-22 07:59:49 +01:00
|
|
|
# Copyright (c) 2024 SUSE LLC
|
2010-01-15 15:00:38 +01:00
|
|
|
#
|
|
|
|
# All modifications and additions to the file contributed by third parties
|
|
|
|
# remain the property of their copyright owners, unless otherwise agreed
|
|
|
|
# upon. The license for this file, and modifications and additions to the
|
|
|
|
# file, is the same license as for the pristine package itself (unless the
|
|
|
|
# license for the pristine package is not an Open Source License, in which
|
|
|
|
# case the license is the MIT License). An "Open Source License" is a
|
|
|
|
# license that conforms to the Open Source Definition (Version 1.9)
|
|
|
|
# published by the Open Source Initiative.
|
|
|
|
|
2019-02-08 17:57:42 +01:00
|
|
|
# Please submit bugfixes or comments via https://bugs.opensuse.org/
|
2010-01-15 15:00:38 +01:00
|
|
|
#
|
2018-01-31 12:40:02 +01:00
|
|
|
|
|
|
|
|
2010-01-15 15:00:38 +01:00
|
|
|
Name: python-nltk
|
2024-03-22 07:59:49 +01:00
|
|
|
Version: 3.8.1
|
2011-09-23 14:31:44 +02:00
|
|
|
Release: 0
|
2010-01-15 15:00:38 +01:00
|
|
|
Summary: Natural Language Toolkit
|
2011-09-23 14:31:44 +02:00
|
|
|
License: Apache-2.0
|
2019-07-23 15:53:26 +02:00
|
|
|
URL: http://nltk.org/
|
2024-03-22 07:59:49 +01:00
|
|
|
# SourceRepository: https://github.com/nltk/nltk
|
|
|
|
Source0: https://files.pythonhosted.org/packages/source/n/nltk/nltk-%{version}.zip
|
|
|
|
# Download/Update NLTK data:
|
|
|
|
# quilt setup python-nltk.spec
|
|
|
|
# pushd nltk-?.?.?
|
|
|
|
# python3 -m nltk.downloader -d nltk_data tests \
|
|
|
|
# averaged_perceptron_tagger_ru \
|
|
|
|
# brown \
|
|
|
|
# cess_cat \
|
|
|
|
# cess_esp \
|
|
|
|
# conll2007 \
|
|
|
|
# floresta \
|
|
|
|
# gutenberg \
|
|
|
|
# inaugural \
|
|
|
|
# indian \
|
|
|
|
# large_grammars \
|
|
|
|
# nombank.1.0 \
|
|
|
|
# omw-1.4 \
|
|
|
|
# pl196x \
|
|
|
|
# ptb \
|
|
|
|
# punkt \
|
|
|
|
# rte \
|
|
|
|
# sinica_treebank \
|
|
|
|
# stopwords \
|
|
|
|
# treebank \
|
|
|
|
# udhr \
|
|
|
|
# universal_tagset \
|
|
|
|
# wordnet \
|
|
|
|
# wordnet_ic \
|
|
|
|
# words
|
|
|
|
# tar -cJf ../nltk_data.tar.xz nltk_data
|
|
|
|
# popd
|
2022-12-27 11:15:18 +01:00
|
|
|
# see https://www.nltk.org/data.html for more details
|
|
|
|
Source1: nltk_data.tar.xz
|
2022-12-27 18:26:56 +01:00
|
|
|
Source99: python-nltk.rpmlintrc
|
2022-12-27 11:15:18 +01:00
|
|
|
# PATCH-FIX-UPSTREAM skip-networked-test.patch gh#nltk/nltk#2969 mcepl@suse.com
|
|
|
|
# skip tests requiring network connection
|
|
|
|
Patch0: skip-networked-test.patch
|
2024-03-22 07:59:49 +01:00
|
|
|
# PATCH-FIX-UPSTREAM nltk-pr3207-py312.patch gh#nltk/nltk#3207
|
|
|
|
Patch1: nltk-pr3207-py312.patch
|
|
|
|
BuildRequires: %{python_module base >= 3.7}
|
|
|
|
BuildRequires: %{python_module pip}
|
Accepting request 558587 from home:badshah400:branches:devel:languages:python
- Update to version 3.2.5:
* Arabic stemmers (ARLSTem, Snowball)
* NIST MT evaluation metric and added NIST
international_tokenize
* Moses tokenizer
* Document Russian tagger
* Fix to Stanford segmenter
* Improve treebank detokenizer, VerbNet, Vader
* Misc code and documentation cleanups
* Implement fixes suggested by LGTM
- Convert specfile to python single-spec style.
- Drop unneeded BuildRequires: python-PyYAML, python-xml,
python-devel; not required for building.
- Change existing Requires to Recommends: these are really needed
for additional features, and not required for basic nltk usage.
- Add new Recommends: python-scipy, python-matplotlib,
python-pyparsing, and python-gensim; enables other optional
features.
- Run fdupes to link-up duplicate files.
- Remove exec permissions for a file not intended to be executed
(not in exec path, no hashbang, etc.)
- Remove hashbangs from non-executable files.
- Run tests following the suggestion from
http://www.nltk.org/install.html.
OBS-URL: https://build.opensuse.org/request/show/558587
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=9
2017-12-19 19:48:40 +01:00
|
|
|
BuildRequires: %{python_module setuptools}
|
2024-03-22 07:59:49 +01:00
|
|
|
BuildRequires: %{python_module wheel}
|
2020-06-08 08:38:34 +02:00
|
|
|
BuildRequires: %{pythons}
|
2017-03-01 12:23:50 +01:00
|
|
|
BuildRequires: fdupes
|
2018-01-31 12:40:02 +01:00
|
|
|
BuildRequires: python-rpm-macros
|
2019-02-10 11:26:05 +01:00
|
|
|
BuildRequires: unzip
|
2024-03-22 07:59:49 +01:00
|
|
|
# SECTION runtime
|
|
|
|
BuildRequires: %{python_module regex >= 2021.8.3}
|
2022-12-27 11:15:18 +01:00
|
|
|
BuildRequires: %{python_module click}
|
|
|
|
BuildRequires: %{python_module joblib}
|
2024-03-22 07:59:49 +01:00
|
|
|
BuildRequires: %{python_module tqdm}
|
|
|
|
# /SECTION
|
|
|
|
# SECTION test
|
|
|
|
BuildRequires: %{python_module tk}
|
2022-12-27 11:15:18 +01:00
|
|
|
BuildRequires: %{python_module Jinja2}
|
|
|
|
BuildRequires: %{python_module matplotlib}
|
|
|
|
BuildRequires: %{python_module numpy}
|
|
|
|
BuildRequires: %{python_module pyparsing}
|
|
|
|
BuildRequires: %{python_module pytest-cov}
|
|
|
|
BuildRequires: %{python_module pytest-mock}
|
2024-03-22 07:59:49 +01:00
|
|
|
BuildRequires: %{python_module pytest}
|
2022-12-27 11:15:18 +01:00
|
|
|
BuildRequires: %{python_module python-crfsuite}
|
|
|
|
BuildRequires: %{python_module requests}
|
|
|
|
BuildRequires: %{python_module scikit-learn}
|
|
|
|
BuildRequires: %{python_module scipy}
|
|
|
|
BuildRequires: %{python_module text-unidecode}
|
|
|
|
BuildRequires: %{python_module twython}
|
2024-03-22 07:59:49 +01:00
|
|
|
# /SECTION
|
|
|
|
Requires: python-regex >= 2021.8.3
|
|
|
|
Requires: python-click
|
|
|
|
Requires: python-joblib
|
|
|
|
Requires: python-tqdm
|
Accepting request 558587 from home:badshah400:branches:devel:languages:python
- Update to version 3.2.5:
* Arabic stemmers (ARLSTem, Snowball)
* NIST MT evaluation metric and added NIST
international_tokenize
* Moses tokenizer
* Document Russian tagger
* Fix to Stanford segmenter
* Improve treebank detokenizer, VerbNet, Vader
* Misc code and documentation cleanups
* Implement fixes suggested by LGTM
- Convert specfile to python single-spec style.
- Drop unneeded BuildRequires: python-PyYAML, python-xml,
python-devel; not required for building.
- Change existing Requires to Recommends: these are really needed
for additional features, and not required for basic nltk usage.
- Add new Recommends: python-scipy, python-matplotlib,
python-pyparsing, and python-gensim; enables other optional
features.
- Run fdupes to link-up duplicate files.
- Remove exec permissions for a file not intended to be executed
(not in exec path, no hashbang, etc.)
- Remove hashbangs from non-executable files.
- Run tests following the suggestion from
http://www.nltk.org/install.html.
OBS-URL: https://build.opensuse.org/request/show/558587
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=9
2017-12-19 19:48:40 +01:00
|
|
|
Recommends: python-gensim
|
|
|
|
Recommends: python-matplotlib
|
|
|
|
Recommends: python-numpy
|
|
|
|
Recommends: python-pyparsing
|
2019-07-23 15:53:26 +02:00
|
|
|
Recommends: python-python-crfsuite
|
Accepting request 558587 from home:badshah400:branches:devel:languages:python
- Update to version 3.2.5:
* Arabic stemmers (ARLSTem, Snowball)
* NIST MT evaluation metric and added NIST
international_tokenize
* Moses tokenizer
* Document Russian tagger
* Fix to Stanford segmenter
* Improve treebank detokenizer, VerbNet, Vader
* Misc code and documentation cleanups
* Implement fixes suggested by LGTM
- Convert specfile to python single-spec style.
- Drop unneeded BuildRequires: python-PyYAML, python-xml,
python-devel; not required for building.
- Change existing Requires to Recommends: these are really needed
for additional features, and not required for basic nltk usage.
- Add new Recommends: python-scipy, python-matplotlib,
python-pyparsing, and python-gensim; enables other optional
features.
- Run fdupes to link-up duplicate files.
- Remove exec permissions for a file not intended to be executed
(not in exec path, no hashbang, etc.)
- Remove hashbangs from non-executable files.
- Run tests following the suggestion from
http://www.nltk.org/install.html.
OBS-URL: https://build.opensuse.org/request/show/558587
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=9
2017-12-19 19:48:40 +01:00
|
|
|
Recommends: python-requests
|
2019-07-23 15:53:26 +02:00
|
|
|
Recommends: python-scikit-learn
|
Accepting request 558587 from home:badshah400:branches:devel:languages:python
- Update to version 3.2.5:
* Arabic stemmers (ARLSTem, Snowball)
* NIST MT evaluation metric and added NIST
international_tokenize
* Moses tokenizer
* Document Russian tagger
* Fix to Stanford segmenter
* Improve treebank detokenizer, VerbNet, Vader
* Misc code and documentation cleanups
* Implement fixes suggested by LGTM
- Convert specfile to python single-spec style.
- Drop unneeded BuildRequires: python-PyYAML, python-xml,
python-devel; not required for building.
- Change existing Requires to Recommends: these are really needed
for additional features, and not required for basic nltk usage.
- Add new Recommends: python-scipy, python-matplotlib,
python-pyparsing, and python-gensim; enables other optional
features.
- Run fdupes to link-up duplicate files.
- Remove exec permissions for a file not intended to be executed
(not in exec path, no hashbang, etc.)
- Remove hashbangs from non-executable files.
- Run tests following the suggestion from
http://www.nltk.org/install.html.
OBS-URL: https://build.opensuse.org/request/show/558587
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=9
2017-12-19 19:48:40 +01:00
|
|
|
Recommends: python-scipy
|
2019-02-08 17:57:42 +01:00
|
|
|
Recommends: python-twython
|
2022-03-22 08:58:51 +01:00
|
|
|
Requires(post): update-alternatives
|
2024-03-22 07:59:49 +01:00
|
|
|
Requires(postun): update-alternatives
|
2011-09-23 14:31:44 +02:00
|
|
|
BuildArch: noarch
|
Accepting request 558587 from home:badshah400:branches:devel:languages:python
- Update to version 3.2.5:
* Arabic stemmers (ARLSTem, Snowball)
* NIST MT evaluation metric and added NIST
international_tokenize
* Moses tokenizer
* Document Russian tagger
* Fix to Stanford segmenter
* Improve treebank detokenizer, VerbNet, Vader
* Misc code and documentation cleanups
* Implement fixes suggested by LGTM
- Convert specfile to python single-spec style.
- Drop unneeded BuildRequires: python-PyYAML, python-xml,
python-devel; not required for building.
- Change existing Requires to Recommends: these are really needed
for additional features, and not required for basic nltk usage.
- Add new Recommends: python-scipy, python-matplotlib,
python-pyparsing, and python-gensim; enables other optional
features.
- Run fdupes to link-up duplicate files.
- Remove exec permissions for a file not intended to be executed
(not in exec path, no hashbang, etc.)
- Remove hashbangs from non-executable files.
- Run tests following the suggestion from
http://www.nltk.org/install.html.
OBS-URL: https://build.opensuse.org/request/show/558587
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=9
2017-12-19 19:48:40 +01:00
|
|
|
%python_subpackages
|
2010-01-15 15:00:38 +01:00
|
|
|
|
2022-12-27 11:15:18 +01:00
|
|
|
# changedir = nltk/test
|
|
|
|
|
2010-01-15 15:00:38 +01:00
|
|
|
%description
|
2018-03-06 21:46:53 +01:00
|
|
|
NLTK -- the Natural Language Toolkit -- is a suite of
|
2010-01-15 15:00:38 +01:00
|
|
|
Python modules, data sets and tutorials supporting research and
|
|
|
|
development in Natural Language Processing.
|
|
|
|
|
|
|
|
%prep
|
2024-03-22 07:59:49 +01:00
|
|
|
%autosetup -p1 -a1 -n nltk-%{version}
|
Accepting request 558587 from home:badshah400:branches:devel:languages:python
- Update to version 3.2.5:
* Arabic stemmers (ARLSTem, Snowball)
* NIST MT evaluation metric and added NIST
international_tokenize
* Moses tokenizer
* Document Russian tagger
* Fix to Stanford segmenter
* Improve treebank detokenizer, VerbNet, Vader
* Misc code and documentation cleanups
* Implement fixes suggested by LGTM
- Convert specfile to python single-spec style.
- Drop unneeded BuildRequires: python-PyYAML, python-xml,
python-devel; not required for building.
- Change existing Requires to Recommends: these are really needed
for additional features, and not required for basic nltk usage.
- Add new Recommends: python-scipy, python-matplotlib,
python-pyparsing, and python-gensim; enables other optional
features.
- Run fdupes to link-up duplicate files.
- Remove exec permissions for a file not intended to be executed
(not in exec path, no hashbang, etc.)
- Remove hashbangs from non-executable files.
- Run tests following the suggestion from
http://www.nltk.org/install.html.
OBS-URL: https://build.opensuse.org/request/show/558587
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=9
2017-12-19 19:48:40 +01:00
|
|
|
|
2022-12-27 18:26:56 +01:00
|
|
|
# Fix EOL
|
|
|
|
sed -i 's/\r/\n/g; s/\n$//' \
|
|
|
|
README.md \
|
|
|
|
nltk/corpus/reader/knbc.py \
|
|
|
|
nltk/test/unit/test_tgrep.py \
|
|
|
|
nltk/tgrep.py \
|
|
|
|
nltk/tokenize/stanford_segmenter.py \
|
|
|
|
nltk/corpus/reader/knbc.py \
|
|
|
|
nltk/test/unit/test_tgrep.py \
|
|
|
|
nltk/tgrep.py \
|
|
|
|
nltk/tokenize/stanford_segmenter.py \
|
|
|
|
nltk/corpus/reader/knbc.py \
|
|
|
|
nltk/test/unit/test_tgrep.py \
|
|
|
|
nltk/tgrep.py \
|
|
|
|
nltk/tokenize/stanford_segmenter.py
|
|
|
|
|
|
|
|
# Remove unrequired shebangs
|
|
|
|
sed -E -i "/#![[:space:]]*\/usr\/bin\/env python/d" \
|
|
|
|
nltk/tgrep.py \
|
|
|
|
nltk/tokenize/stanford_segmenter.py \
|
|
|
|
nltk/test/unit/test_tgrep.py \
|
|
|
|
nltk/corpus/reader/knbc.py
|
|
|
|
|
|
|
|
# Switch shebangs to the standard Python interpreter
|
|
|
|
sed -E -i "s|#![[:space:]]*%{_bindir}/env python|#!%{_bindir}/python3|" \
|
|
|
|
setup.py \
|
|
|
|
tools/global_replace.py \
|
|
|
|
nltk_data/corpora/pl196x/splitter.py \
|
2024-03-22 07:59:49 +01:00
|
|
|
tools/find_deprecated.py
|
2010-01-15 15:00:38 +01:00
|
|
|
|
|
|
|
%build
|
2024-03-22 07:59:49 +01:00
|
|
|
%pyproject_wheel
|
2010-01-15 15:00:38 +01:00
|
|
|
|
|
|
|
%install
|
2024-03-22 07:59:49 +01:00
|
|
|
%pyproject_install
|
2020-06-08 08:38:34 +02:00
|
|
|
%python_clone -a %{buildroot}%{_bindir}/nltk
|
Accepting request 558587 from home:badshah400:branches:devel:languages:python
- Update to version 3.2.5:
* Arabic stemmers (ARLSTem, Snowball)
* NIST MT evaluation metric and added NIST
international_tokenize
* Moses tokenizer
* Document Russian tagger
* Fix to Stanford segmenter
* Improve treebank detokenizer, VerbNet, Vader
* Misc code and documentation cleanups
* Implement fixes suggested by LGTM
- Convert specfile to python single-spec style.
- Drop unneeded BuildRequires: python-PyYAML, python-xml,
python-devel; not required for building.
- Change existing Requires to Recommends: these are really needed
for additional features, and not required for basic nltk usage.
- Add new Recommends: python-scipy, python-matplotlib,
python-pyparsing, and python-gensim; enables other optional
features.
- Run fdupes to link-up duplicate files.
- Remove exec permissions for a file not intended to be executed
(not in exec path, no hashbang, etc.)
- Remove hashbangs from non-executable files.
- Run tests following the suggestion from
http://www.nltk.org/install.html.
OBS-URL: https://build.opensuse.org/request/show/558587
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=9
2017-12-19 19:48:40 +01:00
|
|
|
|
2019-10-14 16:00:56 +02:00
|
|
|
%{python_expand %fdupes %{buildroot}%{$python_sitelib}/
|
Accepting request 558587 from home:badshah400:branches:devel:languages:python
- Update to version 3.2.5:
* Arabic stemmers (ARLSTem, Snowball)
* NIST MT evaluation metric and added NIST
international_tokenize
* Moses tokenizer
* Document Russian tagger
* Fix to Stanford segmenter
* Improve treebank detokenizer, VerbNet, Vader
* Misc code and documentation cleanups
* Implement fixes suggested by LGTM
- Convert specfile to python single-spec style.
- Drop unneeded BuildRequires: python-PyYAML, python-xml,
python-devel; not required for building.
- Change existing Requires to Recommends: these are really needed
for additional features, and not required for basic nltk usage.
- Add new Recommends: python-scipy, python-matplotlib,
python-pyparsing, and python-gensim; enables other optional
features.
- Run fdupes to link-up duplicate files.
- Remove exec permissions for a file not intended to be executed
(not in exec path, no hashbang, etc.)
- Remove hashbangs from non-executable files.
- Run tests following the suggestion from
http://www.nltk.org/install.html.
OBS-URL: https://build.opensuse.org/request/show/558587
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=9
2017-12-19 19:48:40 +01:00
|
|
|
chmod -x %{buildroot}%{$python_sitelib}/nltk/test/dependency.doctest
|
|
|
|
}
|
|
|
|
|
|
|
|
%check
|
2022-12-27 11:15:18 +01:00
|
|
|
export NLTK_DATA=$(readlink -f ./nltk_data/)
|
|
|
|
# export PYTEST_ADDOPTS="--doctest-modules"
|
|
|
|
%pytest -k 'not network'
|
2010-01-15 15:00:38 +01:00
|
|
|
|
2020-06-08 08:38:34 +02:00
|
|
|
%post
|
|
|
|
%python_install_alternative nltk
|
|
|
|
|
|
|
|
%postun
|
|
|
|
%python_uninstall_alternative nltk
|
|
|
|
|
Accepting request 558587 from home:badshah400:branches:devel:languages:python
- Update to version 3.2.5:
* Arabic stemmers (ARLSTem, Snowball)
* NIST MT evaluation metric and added NIST
international_tokenize
* Moses tokenizer
* Document Russian tagger
* Fix to Stanford segmenter
* Improve treebank detokenizer, VerbNet, Vader
* Misc code and documentation cleanups
* Implement fixes suggested by LGTM
- Convert specfile to python single-spec style.
- Drop unneeded BuildRequires: python-PyYAML, python-xml,
python-devel; not required for building.
- Change existing Requires to Recommends: these are really needed
for additional features, and not required for basic nltk usage.
- Add new Recommends: python-scipy, python-matplotlib,
python-pyparsing, and python-gensim; enables other optional
features.
- Run fdupes to link-up duplicate files.
- Remove exec permissions for a file not intended to be executed
(not in exec path, no hashbang, etc.)
- Remove hashbangs from non-executable files.
- Run tests following the suggestion from
http://www.nltk.org/install.html.
OBS-URL: https://build.opensuse.org/request/show/558587
OBS-URL: https://build.opensuse.org/package/show/devel:languages:python/python-nltk?expand=0&rev=9
2017-12-19 19:48:40 +01:00
|
|
|
%files %{python_files}
|
2019-02-10 11:26:05 +01:00
|
|
|
%doc README.md
|
2018-03-05 16:31:28 +01:00
|
|
|
%license LICENSE.txt
|
2024-03-22 07:59:49 +01:00
|
|
|
%{python_sitelib}/nltk/
|
|
|
|
%{python_sitelib}/nltk-%{version}.dist-info/
|
2020-06-08 08:38:34 +02:00
|
|
|
%python_alternative %{_bindir}/nltk
|
2010-01-15 15:00:38 +01:00
|
|
|
|
|
|
|
%changelog
|