Free Trial
Web API version
Licensing
Request A Quote
HAVE QUESTIONS OR NEED HELP? SUBMIT THE SUPPORT REQUEST FORM or write email to SUPPORT@BYTESCOUT.COM
The HTMLExtractor type exposes the following members.
Methods
| Name | Description | |
|---|---|---|
| CreateProfile(String, Boolean, Boolean, Boolean) |
Creates JSON profile will all extractor properties with current values.
(Inherited from BaseExtractor.) | |
| CreateProfile(String, String, Boolean, Boolean, Boolean) |
Creates JSON profile will all extractor properties with current values.
(Inherited from BaseExtractor.) | |
| Dispose |
Releases the unmanaged resources used by the instance and optionally releases the managed resources.
(Inherited from BaseExtractor.) | |
| DisposePage |
Disposes the page object.
Uses this method carefully to destroy the page object that should not be used further.
Useful to free allocated memory when processing large PDF documents.
| |
| Equals | (Inherited from Object.) | |
| Finalize | (Inherited from Object.) | |
| FireParsingError | (Inherited from BaseExtractor.) | |
| GetHashCode | (Inherited from Object.) | |
| GetHTML |
Extracts HTML from the entire document.
| |
| GetHTML(IListInt32) |
Extracts HTML from specified pages.
| |
| GetHTML(String) |
Extracts HTML from specified page ranges.
| |
| GetHTML(Int32, Int32) |
Extracts HTML from specified page range.
| |
| GetHTMLPage |
Extracts HTML from specified document page.
| |
| GetOutputHTMLPageHeight |
Get height of the output page rendered in HTML format.
| |
| GetPageCount | (Inherited from BaseExtractor.) | |
| GetPageHeight |
Height of the PDF page (in pdf units).
| |
| GetPageRect_Height | (Inherited from BaseExtractor.) | |
| GetPageRect_Left | (Inherited from BaseExtractor.) | |
| GetPageRect_Top | (Inherited from BaseExtractor.) | |
| GetPageRect_Width | (Inherited from BaseExtractor.) | |
| GetPageRectangle(Int32) | (Inherited from BaseExtractor.) | |
| GetPageRectangle(Int32, Boolean) | (Inherited from BaseExtractor.) | |
| GetPageWidth |
Width of the PDF page (in pdf units).
| |
| GetType | (Inherited from Object.) | |
| LoadAndApplyProfiles |
Loads profiles from JSON string and automatically applies them. Note that profiles containing
detection keywords will be deferred until the extraction.
(Inherited from BaseExtractor.) | |
| LoadDocumentFromFile | (Inherited from BaseExtractor.) | |
| LoadDocumentFromStream | (Inherited from BaseExtractor.) | |
| LoadProfiles |
Loads profiles from JSON file.
(Inherited from BaseExtractor.) | |
| LoadProfilesFromString |
Loads profiles from JSON string.
(Inherited from BaseExtractor.) | |
| MemberwiseClone | (Inherited from Object.) | |
| Reset |
Resets the instance, disposes internal resources and releases the file.
Use this method before loading another PDF file.
(Overrides BaseExtractorReset.) | |
| ResetExtractionArea | (Inherited from BaseExtractor.) | |
| SaveHtmlPageToFile |
Extracts HTML from specified page to stream.
| |
| SaveHtmlPageToStream |
Extracts HTML from specified page to stream.
| |
| SaveHtmlToFile(String) |
Extracts HTML from the entire document to file.
| |
| SaveHtmlToFile(IListInt32, String) |
Extracts HTML from specified pages to file.
| |
| SaveHtmlToFile(String, String) |
Extracts HTML from specified page ranges to file.
| |
| SaveHtmlToFile(Int32, Int32, String) |
Extracts HTML from specified page range to file.
| |
| SaveHtmlToStream(Stream) |
Extracts HTML from the entire document to stream.
| |
| SaveHtmlToStream(IListInt32, Stream) |
Extracts HTML from specified pages to stream.
| |
| SaveHtmlToStream(String, Stream) |
Extracts HTML from specified page ranges to stream.
| |
| SaveHtmlToStream(Int32, Int32, Stream) |
Extracts HTML from specified page range to stream.
| |
| SetExtractionArea(RectangleF) | (Inherited from BaseExtractor.) | |
| SetExtractionArea(Double, Double, Double, Double) | (Inherited from BaseExtractor.) | |
| SetExtractionArea(Single, Single, Single, Single) | (Inherited from BaseExtractor.) | |
| ToString | (Inherited from Object.) |
See Also