converter Module¶
Document content post-processing.
-
class
wpull.converter.BaseDocumentConverter[source]¶ Bases:
objectBase class for classes that convert links within a document.
-
class
wpull.converter.BatchDocumentConverter(html_parser, element_walker, url_table, backup=False)[source]¶ Bases:
objectConvert all documents in URL table.
Parameters: - url_table – An instance of
database.URLTable. - backup (bool) – Whether back up files are created.
- url_table – An instance of
-
class
wpull.converter.CSSConverter(url_table)[source]¶ Bases:
wpull.scraper.css.CSSScraper,wpull.converter.BaseDocumentConverterCSS converter.
-
class
wpull.converter.HTMLConverter(html_parser, element_walker, url_table)[source]¶ Bases:
wpull.scraper.html.HTMLScraper,wpull.converter.BaseDocumentConverterHTML converter.