htmltools
htmltools is a simple set of CLI utilities which can be used in a pipeline to manipulate and transform HTML documents.
Currently included are:
- htmltotext: removes all tags from an HTML document leaving only the text nodes
- htmlremove: removes elements matching a selector from an HTML document
- htmlunwrap: removes elements matching a selector from an HTML document and replaces them with their child nodes
- htmlselect: prints all elements matching a selector from an HTML document
- htmlindentheadings: indents (shifts) all h1-h7 elements by some level