nekohtml - HTML scanner and tag balancer

Description:

NekoHTML is a simple HTML scanner and tag balancer that enables
application programmers to parse HTML documents and access the
information using standard XML interfaces. The parser can scan HTML
files and "fix up" many common mistakes that human (and computer)
authors make when writing HTML documents. NekoHTML adds missing parent
elements, automatically closes elements with optional end tags, and
can handle mismatched inline element tags.

NekoHTML is written using the Xerces Native Interface (XNI), which is
the foundation of the Xerces2 implementation. This lets you use the
NekoHTML parser with existing XNI tools without modifying or
rewriting code.
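
To make the "tag balancing" idea above concrete, here is a minimal sketch in
plain Java. It is NOT NekoHTML's implementation and does not use NekoHTML or
XNI. The class name ToyTagBalancer, its balance() method, and the small
element lists are all hypothetical and chosen only for illustration. It shows
two of the fix-ups the description mentions: closing elements whose end tag
is optional, and repairing mismatched inline tags. Adding missing parent
elements is not covered.

```java
import java.util.ArrayDeque;
import java.util.Arrays;
import java.util.Deque;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical toy balancer; an illustration only, not NekoHTML code.
class ToyTagBalancer {
    // Assumed subset of elements whose end tag may be omitted.
    private static final Set<String> OPTIONAL_END =
            new HashSet<>(Arrays.asList("li", "p", "option", "tr", "td"));
    // Assumed subset of void elements, which never get an end tag.
    private static final Set<String> VOID =
            new HashSet<>(Arrays.asList("br", "hr", "img", "input", "meta", "link"));
    // Very rough tag matcher; comments, CDATA and scripts are not handled.
    private static final Pattern TAG =
            Pattern.compile("<(/?)([A-Za-z][A-Za-z0-9]*)[^>]*>");

    static String balance(String html) {
        StringBuilder out = new StringBuilder();
        Deque<String> open = new ArrayDeque<>(); // stack of open element names
        Matcher m = TAG.matcher(html);
        int pos = 0;
        while (m.find()) {
            out.append(html, pos, m.start()); // copy text before the tag
            pos = m.end();
            String name = m.group(2).toLowerCase(Locale.ROOT);
            boolean isEndTag = !m.group(1).isEmpty();
            if (!isEndTag) {
                // A new <li> directly inside an open <li> closes the old one.
                if (OPTIONAL_END.contains(name) && name.equals(open.peek())) {
                    out.append("</").append(open.pop()).append('>');
                }
                out.append(m.group());
                if (!VOID.contains(name)) {
                    open.push(name);
                }
            } else if (open.contains(name)) {
                // Close everything up to and including the matching element.
                while (!open.isEmpty()) {
                    String top = open.pop();
                    out.append("</").append(top).append('>');
                    if (top.equals(name)) {
                        break;
                    }
                }
            }
            // An end tag with no matching open element is silently dropped.
        }
        out.append(html.substring(pos)); // trailing text after the last tag
        while (!open.isEmpty()) {        // close whatever is still open
            out.append("</").append(open.pop()).append('>');
        }
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(balance("<ul><li>one<li>two</ul>"));
        System.out.println(balance("<b><i>x</b></i>"));
    }
}
```

With this sketch, "<ul><li>one<li>two</ul>" becomes
"<ul><li>one</li><li>two</li></ul>" and "<b><i>x</b></i>" becomes
"<b><i>x</i></b>". A real balancer such as NekoHTML works from a much fuller
model of HTML element rules than the stack heuristic used here.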

Homepage: http://www.apache.org/~andyc/neko/doc/html/

License: Apache License

Vendor: Fedora Project

Packages

nekohtml-0.9.5-4jpp.1.fc7.noarch [133 KiB]

Changelog by Jeff Johnston (2007-02-12):
- Update to address Fedora review comments.