\FinderIndexerParserHtml
HTML Parser class for the Finder indexer package.
Synopsis
class FinderIndexerParserHtml
extends FinderIndexerParser
{
    // Methods
    protected string process()

    // Inherited methods from FinderIndexerParser
    public static FinderIndexerParser getInstance()
    public string parse()
    protected abstract string process()
}
Hierarchy
Extends
- FinderIndexerParser
Methods
protected
- process() — Method to process HTML input and extract the plain text.
Inherited from FinderIndexerParser
protected
- process() — Method to process input and extract the plain text.
public
- getInstance() — Method to get a parser, creating it if necessary.
- parse() — Method to parse input and extract the plain text. Because this method is called from both inside and outside the indexer, it must be able to split its parsing work into batches to limit the cost of running regular expressions over large inputs. The input is parsed recursively in 2 KB chunks.
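
The chunked parsing described for parse() can be illustrated with a minimal sketch. This is not the package's actual implementation: the function names processChunk() and parseInChunks() are hypothetical, strip_tags() is only a stand-in for whatever process() really does, and splitting at the last space inside each 2 KB window is an assumption made for the example.

```php
<?php
// Hypothetical stand-in for process(): strip markup and decode entities.
function processChunk(string $input): string
{
    $text = html_entity_decode(strip_tags($input), ENT_QUOTES, 'UTF-8');

    // Collapse runs of whitespace into single spaces.
    return trim(preg_replace('/\s+/', ' ', $text));
}

// Hypothetical chunked parse: handle the first window, then recurse on the rest.
function parseInChunks(string $input, int $chunkSize = 2048): string
{
    // Small inputs are processed in one pass.
    if (strlen($input) <= $chunkSize) {
        return processChunk($input);
    }

    // Assumption: cut at the last space in the window so words stay intact.
    $window = substr($input, 0, $chunkSize);
    $cut = strrpos($window, ' ');

    // No usable space found: fall back to a hard cut so the input always shrinks.
    if ($cut === false || $cut === 0) {
        $cut = $chunkSize;
    }

    $head = processChunk(substr($input, 0, $cut));
    $tail = parseInChunks(substr($input, $cut), $chunkSize);

    return trim($head . ' ' . $tail);
}

// Usage with a small chunk size to force several chunks.
echo parseInChunks('<p>Hello</p> <b>world</b> again', 16), "\n";
```

Note that a cut at a space can land inside a tag that carries attributes, in which case this sketch would mangle the surrounding text. A real parser would need to guard against that.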