NAME
    Catmandu::HTML - Modules for handling HTML data within the Catmandu
    framework

SYNOPSIS
    Command line client catmandu:

        catmandu convert HTML to JSON < ex/test.html > data/test.json

        catmandu convert JSON to HTML < data/test.json > ex/test.html

        # Keep the text
        curl -s http://librecat.org | catmandu convert HTML to Text --fix "html_text()"

        # Filter out the anchors
        curl -s http://librecat.org | catmandu convert HTML to Text --fix "html_filter_tag(a); html_text(join:\"\n\")"

        # Filter out the meta tags as YAML
        curl -s http://librecat.org | catmandu convert HTML to YAML --fix "html_filter_tag(meta, group_by:name)"

    See the documentation of the individual modules for more examples.

AVAILABLE MODULES
    Catmandu::Exporter::HTML
        Serialize HTML data

    Catmandu::Importer::HTML
        Parse HTML data

    Catmandu::Fix::html_filter_tag
        Filter HTML by tag name and optionally group the results

    Catmandu::Fix::html_filter_type
        Filter HTML by type

    Catmandu::Fix::html_text
        Select only the literal text values from the HTML

SEE ALSO
    This module is based on Catmandu.

AUTHORS
    Patrick Hochstenbach, patrick.hochstenbach at ugent.be

COPYRIGHT AND LICENSE
    The Perl software is copyright (c) 2018 by Patrick Hochstenbach.

    This is free software; you can redistribute it and/or modify it under
    the same terms as the Perl 5 programming language system itself.