diff --git a/Unidecode-0.04.12.tar.gz b/Unidecode-0.04.12.tar.gz deleted file mode 100644 index a37e879..0000000 --- a/Unidecode-0.04.12.tar.gz +++ /dev/null @@ -1,3 +0,0 @@ -version https://git-lfs.github.com/spec/v1 -oid sha256:a042a7284b36da63440f58c32fabdf723b716082cb3c4b093b421e7c4d6ea5b9 -size 195729 diff --git a/Unidecode-0.04.14.tar.gz b/Unidecode-0.04.14.tar.gz new file mode 100644 index 0000000..75219ae --- /dev/null +++ b/Unidecode-0.04.14.tar.gz @@ -0,0 +1,3 @@ +version https://git-lfs.github.com/spec/v1 +oid sha256:a66f8a527ee0709742caa16745cc882b85d6cfc12b37fa2f56024d07031f230c +size 200860 diff --git a/python-Unidecode.changes b/python-Unidecode.changes index c07b22a..0b23929 100644 --- a/python-Unidecode.changes +++ b/python-Unidecode.changes @@ -1,3 +1,19 @@ +------------------------------------------------------------------- +Fri Sep 20 20:48:57 UTC 2013 - benoit.monin@gmx.fr + +- update to Unidecode 0.04.14 + * Revert transliteration for Latin characters with umlauts back + to language-neutral, unaccented characters. + * Added transliterations for circled numbers, letters and + ordinals. + * Minor fixes to rarely used punctuation characters. + + +------------------------------------------------------------------- +Sat Jun 8 16:54:21 UTC 2013 - benoit.monin@gmx.fr + +- update to Unidecode 0.04.13 + ------------------------------------------------------------------- Thu Mar 28 19:57:11 UTC 2013 - benoit.monin@gmx.fr @@ -6,5 +22,5 @@ Thu Mar 28 19:57:11 UTC 2013 - benoit.monin@gmx.fr ------------------------------------------------------------------- Wed Mar 27 20:28:53 UTC 2013 - benoit.monin@gmx.fr -- initial package with version 0.04.12 +- initial package with version 0.04.12 diff --git a/python-Unidecode.spec b/python-Unidecode.spec index 98ca18c..5c103e1 100644 --- a/python-Unidecode.spec +++ b/python-Unidecode.spec @@ -16,15 +16,15 @@ Name: python-Unidecode -Version: 0.04.12 +Version: 0.04.14 Release: 0 License: GPL-2.0+ Summary: ASCII transliterations of Unicode text Url: https://pypi.python.org/pypi/Unidecode Group: Development/Languages/Python Source: http://pypi.python.org/packages/source/U/Unidecode/Unidecode-%{version}.tar.gz -BuildRequires: fdupes BuildRequires: python-devel +BuildRequires: fdupes BuildRoot: %{_tmppath}/%{name}-%{version}-build %if 0%{?suse_version} && 0%{?suse_version} <= 1110 %{!?python_sitelib: %global python_sitelib %(python -c "from distutils.sysconfig import get_python_lib; print get_python_lib()")} @@ -41,11 +41,40 @@ human-readable Unicode strings that should still be somewhat intelligible (a popular example of this is when making an URL slug from an article title). +In most of these examples you could represent Unicode characters as +"???" or "\\15BA\\15A0\\1610", to mention two extreme cases. But that's +nearly useless to someone who actually wants to read what the text says. + +What Unidecode provides is a middle road: function unidecode() takes +Unicode data and tries to represent it in ASCII characters (i.e., the +universally displayable characters between 0x00 and 0x7F), where the +compromises taken when mapping between two character sets are chosen to be +near what a human with a US keyboard would choose. + +The quality of resulting ASCII representation varies. For languages of +western origin it should be between perfect and good. On the other hand +transliteration (i.e., conveying, in Roman letters, the pronunciation +expressed by the text in some other writing system) of languages like +Chinese, Japanese or Korean is a very complex issue and this library does +not even attempt to address it. It draws the line at context-free +character-by-character mapping. So a good rule of thumb is that the further +the script you are transliterating is from Latin alphabet, the worse the +transliteration will be. + +Note that this module generally produces better results than simply +stripping accents from characters (which can be done in Python with +built-in functions). It is based on hand-tuned character mappings that for +example also contain ASCII approximations for symbols and non-Latin +alphabets. + +This is a Python port of Text::Unidecode Perl module by +Sean M. Burke . + %prep %setup -q -n Unidecode-%{version} %build -python setup.py build +CFLAGS="%{optflags}" python setup.py build %install python setup.py install --prefix=%{_prefix} --root=%{buildroot}