ITextExtractor Properties

ByteScout PDF Extractor SDK

The ITextExtractor type exposes the following members.

Properties

	Name	Description
	FoundText	Contains the search result of Find(Int32, String, Boolean)or FindNext methods.
	FuzzySearch	Sets whether to use "fuzzy" text search algorithm. It allows to find "approximately equal" strings. For example, the search string "fox" will also find "fix" and "fax. This might be useful for compensation of some common OCR errors, like "paralle1" or "paralle\|".
	FuzzySearchPermissibleErrors	Sets the string equality approximation for the fuzzy search algorithm. Simply, this is the number of permissible errors in the search string. Value 1 or 2 is okay, 3 is iffy, 4 is a poor match. Default is 1.
	PageSeparator	Sets the page separator character or string. Default is '\f' (Form Feed).
	RegexSearch	Sets whether to search the text using regular expressions.
	WordMatchingMode	Sets the word matching mode (used in text search and automatic removal of hyphens). This option is ignored when regular expressions are enabled (when is true). In case of regular expressions, you should use '\b' metacharacter to specify word bounds.
	WordMatchingPunctuationMarks	Sets punctuation marks used by word matching. These marks are considered as a part of a word. Default are: ."'“”