regexstream Module

Regular expression streams.

class wpull.regexstream.RegexStream(file, pattern, read_size=16384, overlap_size=4096)[source]

Bases: object

Streams file with regular expressions.

Parameters:
  • file – File object.
  • pattern – A compiled regular expression object.
  • read_size (int) – The size of a chunk of text that is searched.
  • overlap_size (int) – The amount of overlap between chunks of text that is searched.
stream()[source]

Iterate the file stream.

Returns:Each item is a tuple:
  1. None, regex match
  2. str
Return type:iterator