automatic update OBS-URL: https://build.opensuse.org/request/show/296313 OBS-URL: https://build.opensuse.org/package/show/devel:languages:perl/perl-IO-HTML?expand=0&rev=3
96 lines
3.0 KiB
RPMSpec
96 lines
3.0 KiB
RPMSpec
#
|
|
# spec file for package perl-IO-HTML
|
|
#
|
|
# Copyright (c) 2015 SUSE LINUX GmbH, Nuernberg, Germany.
|
|
#
|
|
# All modifications and additions to the file contributed by third parties
|
|
# remain the property of their copyright owners, unless otherwise agreed
|
|
# upon. The license for this file, and modifications and additions to the
|
|
# file, is the same license as for the pristine package itself (unless the
|
|
# license for the pristine package is not an Open Source License, in which
|
|
# case the license is the MIT License). An "Open Source License" is a
|
|
# license that conforms to the Open Source Definition (Version 1.9)
|
|
# published by the Open Source Initiative.
|
|
|
|
# Please submit bugfixes or comments via http://bugs.opensuse.org/
|
|
#
|
|
|
|
|
|
Name: perl-IO-HTML
|
|
Version: 1.001
|
|
Release: 0
|
|
%define cpan_name IO-HTML
|
|
Summary: Open an HTML file with automatic charset detection
|
|
License: Artistic-1.0 or GPL-1.0+
|
|
Group: Development/Libraries/Perl
|
|
Url: http://search.cpan.org/dist/IO-HTML/
|
|
Source: http://www.cpan.org/authors/id/C/CJ/CJM/%{cpan_name}-%{version}.tar.gz
|
|
BuildArch: noarch
|
|
BuildRoot: %{_tmppath}/%{name}-%{version}-build
|
|
BuildRequires: perl
|
|
BuildRequires: perl-macros
|
|
BuildRequires: perl(Test::More) >= 0.88
|
|
%{perl_requires}
|
|
|
|
%description
|
|
IO::HTML provides an easy way to open a file containing HTML while
|
|
automatically determining its encoding. It uses the HTML5 encoding sniffing
|
|
algorithm specified in section 8.2.2.2 of the draft standard.
|
|
|
|
The algorithm as implemented here is:
|
|
|
|
* 1.
|
|
|
|
If the file begins with a byte order mark indicating UTF-16LE, UTF-16BE,
|
|
or UTF-8, then that is the encoding.
|
|
|
|
* 2.
|
|
|
|
If the first 1024 bytes of the file contain a '<meta>' tag that indicates
|
|
the charset, and Encode recognizes the specified charset name, then that
|
|
is the encoding. (This portion of the algorithm is implemented by
|
|
'find_charset_in'.)
|
|
|
|
The '<meta>' tag can be in one of two formats:
|
|
|
|
<meta charset="...">
|
|
<meta http-equiv="Content-Type" content="...charset=...">
|
|
|
|
The search is case-insensitive, and the order of attributes within the
|
|
tag is irrelevant. Any additional attributes of the tag are ignored. The
|
|
first matching tag with a recognized encoding ends the search.
|
|
|
|
* 3.
|
|
|
|
If the first 1024 bytes of the file are valid UTF-8 (with at least 1
|
|
non-ASCII character), then the encoding is UTF-8.
|
|
|
|
* 4.
|
|
|
|
If all else fails, use the default character encoding. The HTML5 standard
|
|
suggests the default encoding should be locale dependent, but currently
|
|
it is always 'cp1252' unless you set '$IO::HTML::default_encoding' to a
|
|
different value. Note: 'sniff_encoding' does not apply this step; only
|
|
'html_file' does that.
|
|
|
|
%prep
|
|
%setup -q -n %{cpan_name}-%{version}
|
|
|
|
%build
|
|
%{__perl} Makefile.PL INSTALLDIRS=vendor
|
|
%{__make} %{?_smp_mflags}
|
|
|
|
%check
|
|
%{__make} test
|
|
|
|
%install
|
|
%perl_make_install
|
|
%perl_process_packlist
|
|
%perl_gen_filelist
|
|
|
|
%files -f %{name}.files
|
|
%defattr(-,root,root,755)
|
|
%doc Changes LICENSE README
|
|
|
|
%changelog
|