converter
Module¶
Document content post-processing.
-
class
wpull.converter.
BaseDocumentConverter
[source]¶ Bases:
object
Base class for classes that convert links within a document.
-
class
wpull.converter.
BatchDocumentConverter
(html_parser, element_walker, url_table, backup=False)[source]¶ Bases:
object
Convert all documents in URL table.
Parameters: - url_table – An instance of
database.URLTable
. - backup (bool) – Whether back up files are created.
- url_table – An instance of
-
class
wpull.converter.
CSSConverter
(url_table)[source]¶ Bases:
wpull.scraper.css.CSSScraper
,wpull.converter.BaseDocumentConverter
CSS converter.
-
class
wpull.converter.
HTMLConverter
(html_parser, element_walker, url_table)[source]¶ Bases:
wpull.scraper.html.HTMLScraper
,wpull.converter.BaseDocumentConverter
HTML converter.