OpenQuizz
Une application de gestion des contenus pédagogiques
HTMLBinaryInputStream Class Reference
Inheritance diagram for HTMLBinaryInputStream:
Collaboration diagram for HTMLBinaryInputStream:

Public Member Functions

def __init__ (self, source, override_encoding=None, transport_encoding=None, same_origin_parent_encoding=None, likely_encoding=None, default_encoding="windows-1252", useChardet=True)
 
def reset (self)
 
def openStream (self, source)
 
def determineEncoding (self, chardet=True)
 
def changeEncoding (self, newEncoding)
 
def detectBOM (self)
 
def detectEncodingMeta (self)
 
- Public Member Functions inherited from HTMLUnicodeInputStream
def __init__ (self, source)
 
def position (self)
 
def char (self)
 
def readChunk (self, chunkSize=None)
 
def characterErrorsUCS4 (self, data)
 
def characterErrorsUCS2 (self, data)
 
def charsUntil (self, characters, opposite=False)
 
def unget (self, char)
 

Data Fields

 rawStream
 
 numBytesMeta
 
 numBytesChardet
 
 override_encoding
 
 transport_encoding
 
 same_origin_parent_encoding
 
 likely_encoding
 
 default_encoding
 
 charEncoding
 
 dataStream
 
- Data Fields inherited from HTMLUnicodeInputStream
 reportCharacterErrors
 
 newLines
 
 charEncoding
 
 dataStream
 
 chunk
 
 chunkSize
 
 chunkOffset
 
 errors
 
 prevNumLines
 
 prevNumCols
 

Detailed Description

Provides a unicode stream of characters to the HTMLTokenizer.

This class takes care of character encoding and removing or replacing
incorrect byte-sequences and also provides column and line tracking.

Constructor & Destructor Documentation

◆ __init__()

def __init__ (   self,
  source,
  override_encoding = None,
  transport_encoding = None,
  same_origin_parent_encoding = None,
  likely_encoding = None,
  default_encoding = "windows-1252",
  useChardet = True 
)
Initialises the HTMLInputStream.

HTMLInputStream(source, [encoding]) -> Normalized stream from source
for use by html5lib.

source can be either a file-object, local filename or a string.

The optional encoding parameter must be a string that indicates
the encoding.  If specified, that encoding will be used,
regardless of any BOM or later declaration (such as in a meta
element)

Member Function Documentation

◆ changeEncoding()

def changeEncoding (   self,
  newEncoding 
)

◆ detectBOM()

def detectBOM (   self)
Attempts to detect at BOM at the start of the stream. If
an encoding can be determined from the BOM return the name of the
encoding otherwise return None

◆ detectEncodingMeta()

def detectEncodingMeta (   self)
Report the encoding declared by the meta element

◆ determineEncoding()

def determineEncoding (   self,
  chardet = True 
)

◆ openStream()

def openStream (   self,
  source 
)
Produces a file object from source.

source can be either a file object, local filename or a string.

Reimplemented from HTMLUnicodeInputStream.

◆ reset()

def reset (   self)

Reimplemented from HTMLUnicodeInputStream.

Field Documentation

◆ charEncoding

charEncoding

◆ dataStream

dataStream

◆ default_encoding

default_encoding

◆ likely_encoding

likely_encoding

◆ numBytesChardet

numBytesChardet

◆ numBytesMeta

numBytesMeta

◆ override_encoding

override_encoding

◆ rawStream

rawStream

◆ same_origin_parent_encoding

same_origin_parent_encoding

◆ transport_encoding

transport_encoding

The documentation for this class was generated from the following file: