TextRecognizer PropertiesByteScout Text Recognition SDK

The TextRecognizer type exposes the following members.

Properties

  NameDescription
Public propertyAutoDetectPageRotation
Gets or sets a value indicating whether the TextRecognizer will try to automatically detect the rotation of a scanned page. Default is false.
Public propertyBlackList
A set of characters not allowed to be recognized from scanned document. The resulting text will only contain characters that are not in this list. This helps improve uncertain recognition.
Public propertyComHelpers
Set of helping methods for use from COM/ActiveX.
Public propertyCorrections
Collection of corrections automatically applied to recognized text to fix repeating recognition errors.
Public propertyImagePreprocessingFilters
Collection of image preprocessing filters.
Public propertyIsDocumentLoaded
Gets whether a document is loaded.
(Inherited from BaseRecognizer.)
Public propertyKeepTextFormatting
Gets or sets whether to try to keep the text formatting.
Public propertyLicenseInfo
Gets license information.
(Inherited from BaseRecognizer.)
Public propertyMaximizeCPUUtilization
Gets or sets maximum OCR performance using Intel OpenMP (if available) to accelerate to approximately 30%. Default is false.
(Inherited from BaseRecognizer.)
Public propertyOCRLanguage
Language for Optical Character Recognition (OCR). The valid values are:
  • "eng" - English (default)
  • "deu" - German
  • "fra" - French
  • "spa" - Spanish

Download more languages at https://github.com/bytescout/ocrdata.

(Inherited from BaseRecognizer.)
Public propertyOCRLanguageDataFolder
Folder containing OCR language data files.
(Inherited from BaseRecognizer.)
Public propertyPageSeparator
Gets or sets the page separator character or string. Default is "\r\n".
Public propertyPDFRenderingOptions
Gets or sets PDF rendering options.
(Inherited from BaseRecognizer.)
Public propertyPDFRenderingResolution
Gets or sets PDF rendering resolution. Default is 300 DPI.
(Inherited from BaseRecognizer.)
Public propertyRecognitionAreas
Collection of page areas intended for text recognition.
Public propertyRegistrationKey
Gets or sets the key number part of registration information.
(Inherited from BaseRecognizer.)
Public propertyRegistrationName
Gets or sets the name part of the registration information.
(Inherited from BaseRecognizer.)
Public propertyTrimLeadingSpaces
Gets or sets whether to trim redundant leading spaces. Default is false. Works only if KeepTextFormatting is true.
Public propertyUnwrapParagraphs
Gets or sets whether to unwrap paragraph text. Default is false. Works only if KeepTextFormatting is true.
Public propertyVersion
Gets version of the component.
(Inherited from BaseRecognizer.)
Public propertyWhiteList
A set of characters allowed to be recognized from scanned document. Only characters from this list will appear in the result text. This helps improve uncertain recognition.
Top
See Also

Reference