Free Trial
Web API version
Licensing
Request A Quote
HAVE QUESTIONS OR NEED HELP? SUBMIT THE SUPPORT REQUEST FORM or write email to SUPPORT@BYTESCOUT.COM
The HTMLExtractor type exposes the following members.
Methods
Name | Description | |
---|---|---|
![]() | CreateProfile(String, Boolean, Boolean, Boolean) |
Creates JSON profile will all extractor properties with current values.
(Inherited from BaseExtractor.) |
![]() | CreateProfile(String, String, Boolean, Boolean, Boolean) |
Creates JSON profile will all extractor properties with current values.
(Inherited from BaseExtractor.) |
![]() | Dispose |
Releases the unmanaged resources used by the instance and optionally releases the managed resources.
(Inherited from BaseExtractor.) |
![]() | DisposePage |
Disposes the page object.
Uses this method carefully to destroy the page object that should not be used further.
Useful to free allocated memory when processing large PDF documents.
|
![]() | Equals | (Inherited from Object.) |
![]() | Finalize | (Inherited from Object.) |
![]() | FireParsingError | (Inherited from BaseExtractor.) |
![]() | GetHashCode | (Inherited from Object.) |
![]() | GetHTML |
Extracts HTML from the entire document.
|
![]() | GetHTML(IListInt32) |
Extracts HTML from specified pages.
|
![]() | GetHTML(String) |
Extracts HTML from specified page ranges.
|
![]() | GetHTML(Int32, Int32) |
Extracts HTML from specified page range.
|
![]() | GetHTMLPage |
Extracts HTML from specified document page.
|
![]() | GetOutputHTMLPageHeight |
Get height of the output page rendered in HTML format.
|
![]() | GetPageCount | (Inherited from BaseExtractor.) |
![]() | GetPageHeight |
Height of the PDF page (in pdf units).
|
![]() | GetPageRect_Height | (Inherited from BaseExtractor.) |
![]() | GetPageRect_Left | (Inherited from BaseExtractor.) |
![]() | GetPageRect_Top | (Inherited from BaseExtractor.) |
![]() | GetPageRect_Width | (Inherited from BaseExtractor.) |
![]() | GetPageRectangle(Int32) | (Inherited from BaseExtractor.) |
![]() | GetPageRectangle(Int32, Boolean) | (Inherited from BaseExtractor.) |
![]() | GetPageWidth |
Width of the PDF page (in pdf units).
|
![]() | GetType | (Inherited from Object.) |
![]() | LoadAndApplyProfiles |
Loads profiles from JSON string and automatically applies them. Note that profiles containing
detection keywords will be deferred until the extraction.
(Inherited from BaseExtractor.) |
![]() | LoadDocumentFromFile | (Inherited from BaseExtractor.) |
![]() | LoadDocumentFromStream | (Inherited from BaseExtractor.) |
![]() | LoadProfiles |
Loads profiles from JSON file.
(Inherited from BaseExtractor.) |
![]() | LoadProfilesFromString |
Loads profiles from JSON string.
(Inherited from BaseExtractor.) |
![]() | MemberwiseClone | (Inherited from Object.) |
![]() | Reset |
Resets the instance, disposes internal resources and releases the file.
Use this method before loading another PDF file.
(Overrides BaseExtractorReset.) |
![]() | ResetExtractionArea | (Inherited from BaseExtractor.) |
![]() | SaveHtmlPageToFile |
Extracts HTML from specified page to stream.
|
![]() | SaveHtmlPageToStream |
Extracts HTML from specified page to stream.
|
![]() | SaveHtmlToFile(String) |
Extracts HTML from the entire document to file.
|
![]() | SaveHtmlToFile(IListInt32, String) |
Extracts HTML from specified pages to file.
|
![]() | SaveHtmlToFile(String, String) |
Extracts HTML from specified page ranges to file.
|
![]() | SaveHtmlToFile(Int32, Int32, String) |
Extracts HTML from specified page range to file.
|
![]() | SaveHtmlToStream(Stream) |
Extracts HTML from the entire document to stream.
|
![]() | SaveHtmlToStream(IListInt32, Stream) |
Extracts HTML from specified pages to stream.
|
![]() | SaveHtmlToStream(String, Stream) |
Extracts HTML from specified page ranges to stream.
|
![]() | SaveHtmlToStream(Int32, Int32, Stream) |
Extracts HTML from specified page range to stream.
|
![]() | SetExtractionArea(RectangleF) | (Inherited from BaseExtractor.) |
![]() | SetExtractionArea(Double, Double, Double, Double) | (Inherited from BaseExtractor.) |
![]() | SetExtractionArea(Single, Single, Single, Single) | (Inherited from BaseExtractor.) |
![]() | ToString | (Inherited from Object.) |
See Also