# # spec file for package perl-HTML-SimpleParse (Version 0.12) # # Copyright (c) 2004 SuSE Linux AG, Nuernberg, Germany. # This file and all modifications and additions to the pristine # package are under the same license as the package itself. # # Please submit bugfixes or comments via http://www.suse.de/feedback/ # # norootforbuild # usedforbuild aaa_base acl attr bash bind-utils bison bzip2 coreutils cpio cpp cvs cyrus-sasl db devs diffutils e2fsprogs file filesystem fillup findutils flex gawk gdbm-devel glibc glibc-devel glibc-locale gpm grep groff gzip info insserv kbd less libacl libattr libgcc libstdc++ libxcrypt m4 make man mktemp modutils ncurses ncurses-devel net-tools netcfg openldap2-client openssl pam pam-devel pam-modules patch permissions popt ps rcs readline sed sendmail shadow strace syslogd sysvinit tar texinfo timezone unzip util-linux vim zlib zlib-devel autoconf automake binutils cracklib gcc gdbm gettext libtool perl rpm Name: perl-HTML-SimpleParse Version: 0.12 Release: 71 Provides: HTML-SimpleParse Conflicts: perlmod Requires: perl = %{perl_version} Autoreqprov: on Group: Development/Libraries/Perl License: Artistic License URL: http://cpan.org/modules/by-module/HTML/ Summary: a bare-bones HTML parser Source: HTML-SimpleParse-%{version}.tar.bz2 BuildRoot: %{_tmppath}/%{name}-%{version}-build %description This is the HTML::SimpleParse module. It is a bare-bones HTML parser, similar to HTML::Parser, but with a couple important distinctions: First, HTML::Parser knows which tags can contain other tags, which start tags have corresponding end tags, which tags can exist only in the
portion of the document, and so forth. HTML::SimpleParse does not know any of these things. It just finds tags and text in the HTML you give it, it does not care about the specific content of these tags (though it does distiguish between different _types_ of tags, such as comments, starting tags like , ending tags like , and so on). Second, HTML::SimpleParse does not create a hierarchical tree of HTML content, but rather a simple linear list. It does not pay any attention to balancing start tags with corresponding end tags, or which pairs of tags are inside other pairs of tags. Because of these characteristics, you can make a very effective HTML filter by sub-classing HTML::SimpleParse. Authors: -------- Ken Williams