XLSExtractor Methods
ByteScout PDF Extractor SDK

The XLSExtractor type exposes the following members.

Methods

  Name    Description
Public method AddFilter(String, Boolean, Boolean)
Adds a filter to remove text from extracted data.
(Inherited from BaseTextExtractor.)
Public method AddFilter(String, Int32, Boolean)
Adds a filter to exclude text objects with specified attributes.
(Inherited from BaseTextExtractor.)
Public method AddFilter(String, Int32, Color, Boolean)
Adds a filter to exclude text objects with specified attributes.
(Inherited from BaseTextExtractor.)
Public method AddFilter(String, String, Boolean, Boolean)
Adds a filter to replace text in extracted data.
(Inherited from BaseTextExtractor.)
Public method AddFilter(String, Int32, Int32, Int32, Int32, Boolean)
Adds a filter to exclude text objects with specified attributes.
(Inherited from BaseTextExtractor.)
Public method CreateProfile(String, Boolean, Boolean, Boolean)
Creates a JSON profile with all extractor properties and their current values.
(Inherited from BaseExtractor.)
Public method CreateProfile(String, String, Boolean, Boolean, Boolean)
Creates a JSON profile with all extractor properties and their current values.
(Inherited from BaseExtractor.)
Public method Dispose
Releases the unmanaged resources used by the instance and optionally releases the managed resources.
(Inherited from BaseExtractor.)
Public method DisposePage
Disposes the page object. Use this method carefully, and only to destroy a page object that will not be used again. Useful for freeing allocated memory when processing huge PDF documents.
(Inherited from BaseTextExtractor.)
Public method Equals (Inherited from Object.)
Protected method Finalize (Inherited from Object.)
Protected method FireParsingError (Inherited from BaseExtractor.)
Protected method FireProgressChanged (Inherited from BaseExtractor.)
Public method GetHashCode (Inherited from Object.)
Public method GetPageCount
Returns the document page count.
(Inherited from BaseExtractor.)
Public method GetPageRect_Height
Gets the specified page height.
(Inherited from BaseExtractor.)
Public method GetPageRect_Left
Gets the specified page left coordinate.
(Inherited from BaseExtractor.)
Public method GetPageRect_Top
Gets the specified page top coordinate.
(Inherited from BaseExtractor.)
Public method GetPageRect_Width
Gets the specified page width.
(Inherited from BaseExtractor.)
Public method GetPageRectangle(Int32)
Gets the page rectangle in PDF Points (1 Point = 1/72 in.).
(Inherited from BaseExtractor.)
Public method GetPageRectangle(Int32, Boolean)
Gets the page rectangle in PDF Points (1 Point = 1/72 in.).
(Inherited from BaseExtractor.)
Public method GetPageRotationAngle
Returns the rotation angle of specified page.
(Inherited from BaseExtractor.)
Public method GetPageXLSAsVariant
Returns Excel data as array of bytes. This is COM/ActiveX-compatible version of the method SavePageToXLSStream(Int32, Stream) for in-memory processing of PDF documents or images.
Public method GetPreprocessedPagePreview
Returns preview image of document page with preprocessing filters applied.
(Inherited from BaseTextExtractor.)
Public method GetType (Inherited from Object.)
Public method GetXLSAsVariant
Returns Excel data as array of bytes. This is COM/ActiveX-compatible version of the method SaveToXLSStream(Stream) for in-memory processing of PDF documents or images.
Public method GetXLSAsVariant(String)
Returns Excel data as array of bytes. This is COM/ActiveX-compatible version of the method SaveToXLSStream(String, Stream) for in-memory processing of PDF documents or images.
Public method GetXLSAsVariant(Int32, Int32)
Returns Excel data as array of bytes. This is COM/ActiveX-compatible version of the method SaveToXLSStream(Int32, Int32, Stream) for in-memory processing of PDF documents or images.
Public method IsEncrypted
Gets the document encrypted state.
(Inherited from BaseExtractor.)
Public method IsOCRRecommendedForPage
Detects whether OCR is recommended for the specified page. OCR (Optical Character Recognition) is recommended when a page has no text objects but has an image that might contain text.
(Inherited from BaseTextExtractor.)
Public method LoadAndApplyProfiles
Loads profiles from JSON string and automatically applies them. Note that profiles containing detection keywords will be deferred until the extraction.
(Inherited from BaseExtractor.)
Public method LoadDocumentFromFile
Loads PDF document from specified file.
(Inherited from BaseExtractor.)
Public method LoadDocumentFromStream
Loads PDF document from provided stream.
(Inherited from BaseExtractor.)
Public method LoadDocumentFromVariant
Loads PDF document from byte array presented as array of Variant or Byte objects ('Variant()' or 'Byte()'). This is COM/ActiveX-compatible version of the method LoadDocumentFromStream(Stream) for in-memory processing of PDF files.
(Inherited from BaseExtractor.)
Public method LoadProfiles
Loads profiles from JSON file.
(Inherited from BaseExtractor.)
Public method LoadProfilesFromString
Loads profiles from JSON string.
(Inherited from BaseExtractor.)
Protected method MemberwiseClone (Inherited from Object.)
Protected method PerformTextAnalysis (Inherited from BaseTextExtractor.)
Public method Reset
Resets the instance and disposes internal resources. Also automatically invoked by Dispose.
(Inherited from BaseTextExtractor.)
Protected method ResetBaseExtractionData (Inherited from BaseTextExtractor.)
Public method ResetExtractionArea
Resets the extraction area to the full page.
(Inherited from BaseExtractor.)
Public method ResetFilters
Resets text filters.
(Inherited from BaseTextExtractor.)
Public method SavePageToXLSFile
Saves document page text to Excel file. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SavePageToXLSStream
Saves document page text as Excel spreadsheet to stream. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SavePreprocessedPagePreview
Saves preview image of document page with preprocessing filters applied. Image is saved in PNG format.
(Inherited from BaseTextExtractor.)
Public method SaveToXLSFile(String)
Saves document text as Excel file. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SaveToXLSFile(IList<Int32>, String)
Saves data from specified pages as Excel file. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SaveToXLSFile(String, String)
Saves data from specified page ranges as Excel file. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SaveToXLSFile(Int32, Int32, String)
Saves data from specified page range as Excel file. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SaveToXLSStream(Stream)
Saves document text as Excel spreadsheet to stream. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SaveToXLSStream(IList<Int32>, Stream)
Saves data from specified pages as Excel spreadsheet to stream. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SaveToXLSStream(String, Stream)
Saves data from specified page ranges as Excel spreadsheet to stream. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SaveToXLSStream(Int32, Int32, Stream)
Saves data from specified page range as Excel spreadsheet to stream. The spreadsheet format (XLS/XLSX) is defined by OutputFormat property.
Public method SetCustomExtractionColumns
Helper method to set the CustomExtractionColumns property when using the extractor through COM from VC++, VB, VBA, VBScript, or Delphi.
(Inherited from BaseTextExtractor.)
Public method SetExtractionArea(RectangleF)
Sets the extraction area by rectangle.
(Inherited from BaseExtractor.)
Public method SetExtractionArea(Double, Double, Double, Double)
Sets the extraction area by coordinates and dimensions.
(Inherited from BaseExtractor.)
Public method SetExtractionArea(Single, Single, Single, Single)
Sets the extraction area by coordinates and dimensions.
(Inherited from BaseExtractor.)
Public method ToString (Inherited from Object.)
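
Example: converting PDF Points

The GetPageRectangle entries above report page geometry in PDF Points (1 Point = 1/72 in.). The following is a minimal sketch of that unit conversion only. It does not call the SDK, and the PdfUnits class and its member names are hypothetical helpers introduced for illustration, not part of ByteScout PDF Extractor SDK.

```csharp
using System;

// Hypothetical helper (not part of the SDK): converts lengths expressed
// in PDF Points into inches or millimeters.
public static class PdfUnits
{
    // 1 PDF Point = 1/72 inch, as stated in the GetPageRectangle description.
    public const double PointsPerInch = 72.0;

    // Exact by definition: 1 inch = 25.4 mm.
    public const double MillimetersPerInch = 25.4;

    // Converts a length in PDF Points to inches.
    public static double PointsToInches(double points)
    {
        return points / PointsPerInch;
    }

    // Converts a length in PDF Points to millimeters.
    public static double PointsToMillimeters(double points)
    {
        return points / PointsPerInch * MillimetersPerInch;
    }
}
```

For instance, a page width of 612 points corresponds to PdfUnits.PointsToInches(612.0) = 8.5 inches. In real code the input would presumably come from a page-geometry member such as GetPageRect_Width or GetPageRectangle, but the exact parameters and return types of those members are not shown on this page.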