ITextExtractor Interface
ByteScout PDF Extractor SDK
Defines the PDF-to-text extractor interface.

Namespace:  Bytescout.PDFExtractor
Assembly:  Bytescout.PDFExtractor (in Bytescout.PDFExtractor.dll) Version: 12.0.0.4062-master
Syntax

public interface ITextExtractor

The ITextExtractor type exposes the following members.

Properties

Name
  Description

FoundText

PageSeparator
  Gets or sets the page separator character or string. Default is '\f' (Form Feed).

RegexSearch
  Gets or sets a value indicating whether to search the text using regular expressions.

WordMatchingMode
  Gets or sets the word matching mode (used in text search and in automatic removal of hyphens). This option is ignored when regular expressions are enabled (RegexSearch set to true). With regular expressions, use the \b metacharacter to specify word boundaries.

WordMatchingPunctuationMarks
  Punctuation marks used by word matching. These marks are treated as part of a word. Defaults are: ."'“”
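
The WordMatchingMode note says that word bounds in regular expression mode are expressed with \b. The sketch below illustrates that in plain .NET and does not call the SDK. The helper name WholeWordPattern is hypothetical, and the sketch assumes the SDK's regex mode accepts standard .NET patterns; the Find overload taking RegexOptions suggests this, but this page does not state it.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper for illustration only; it is not part of the SDK.
public static class WholeWordPattern
{
    // Wraps a literal search term in \b anchors so that it matches whole words only.
    // Regex.Escape keeps characters such as '.' from being read as metacharacters.
    public static string For(string term)
    {
        return @"\b" + Regex.Escape(term) + @"\b";
    }

    // Counts whole-word, case-insensitive occurrences of term in text.
    public static int Count(string text, string term)
    {
        return Regex.Matches(text, For(term), RegexOptions.IgnoreCase).Count;
    }
}
```

For example, WholeWordPattern.Count("The cat sat in the category.", "cat") counts only the standalone word, because the "cat" inside "category" is followed by another word character rather than a word boundary. A pattern built this way could then be passed as the search text when RegexSearch is enabled.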
Methods

Name
  Description

Find(Int32, String, Boolean)
  Searches the document page for specified text.

Find(Int32, String, RegexOptions)
  Searches the document page for specified text in Regex mode with specified options.

FindAll
  Searches for all occurrences of specified text in specified document page or in entire document.

FindAllToJSON
  Searches for all occurrences of specified text in specified document page or in entire document and returns result as JSON string.

FindNext
  Continues the text search started by Find(Int32, String, Boolean) method.

GetText
  Extracts text from whole document.

GetText(Int32, Int32)
  Extracts text from specified page range.

GetTextFromPage
  Extracts text from specified document page.

SavePageTextToFile(Int32, String)
  Saves page text to file.

SavePageTextToFile(Int32, String, Encoding)
  Saves page text to file in specified encoding.

SavePageTextToStream(Int32, Stream)
  Saves page text to stream.

SavePageTextToStream(Int32, Stream, Encoding)
  Saves page text to stream in specified encoding.

SaveTextToFile(String)
  Saves document text to file.

SaveTextToFile(String, Encoding)
  Saves document text to file in specified encoding.

SaveTextToFile(Int32, Int32, String)
  Saves text from specified page range to file.

SaveTextToFile(Int32, Int32, String, Encoding)
  Saves text from specified page range to file in specified encoding.

SaveTextToStream(Stream)
  Saves document text to stream.

SaveTextToStream(Stream, Encoding)
  Saves document text to stream in specified encoding.

SaveTextToStream(Int32, Int32, Stream)
  Saves text from specified page range to stream.

SaveTextToStream(Int32, Int32, Stream, Encoding)
  Saves text from specified page range to stream in specified encoding.
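
Because PageSeparator defaults to '\f' (Form Feed), text extracted from several pages can be split back into per-page strings on that separator. The sketch below shows this with plain .NET string handling and does not call the SDK. PageSplitter is a hypothetical helper, and the sketch assumes the extracted text is a single string with the separator placed between pages; this page does not say whether a separator also follows the last page.

```csharp
using System;

// Hypothetical helper for illustration only; it is not part of the SDK.
public static class PageSplitter
{
    // Splits multi-page text into one string per page.
    // The default separator mirrors the documented PageSeparator default, '\f'.
    // StringSplitOptions.None keeps empty entries, so a blank page (or a trailing
    // separator, if one is present) appears as an empty string in the result.
    public static string[] SplitPages(string documentText, string separator = "\f")
    {
        return documentText.Split(new[] { separator }, StringSplitOptions.None);
    }
}
```

For example, PageSplitter.SplitPages("Page one\fPage two\fPage three") yields three entries. If PageSeparator has been changed from its default, the same string should be passed as the second argument.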