Defines the PDF to JSON extractor interface.
Assembly: Bytescout.PDFExtractor (in Bytescout.PDFExtractor.dll) Version: 188.8.131.5262-master
C#: public interface IJSONExtractor
VB: Public Interface IJSONExtractor
C++: public interface class IJSONExtractor
F#: type IJSONExtractor = interface end
The IJSONExtractor type exposes the following members.
Gets or sets whether to allow standalone punctuation characters. If false, they are merged with the nearest text object.
Gets or sets whether to generate regular JSON with camel-cased object identifiers, without the '@' (attribute) and '#' (node content) marks. Default is true.
Gets or sets whether to detect the "strikeout" text style. Default is false.
Gets or sets whether to detect the "underline" text style. Default is false.
Gets or sets the folder where extracted images are placed when the SaveImages property is set to ImageHandling.OuterFile. Default is "images": the extractor creates an "images" subfolder in the same folder as the output JSON file.
Gets or sets the image format for extracted images. Default is PNG.
By default, JSONExtractor replaces the names of embedded fonts with those of standard (or "descendant") fonts that are similar in metrics and typeface. It does this because embedded fonts may differ from the fonts installed on your system, or may not be installed at all. Set this property to true to keep the original font names.
Gets or sets how extracted images are handled: not saved, saved to an external file, or embedded into the resulting JSON as a Base64 string. Default is ImageHandling.None.
Gets or sets whether to save vector objects. Default is false.
Extracts data from the whole document as a JSON string.
Extracts data from the specified document page as a JSON string.
Extracts data from the specified page range as a JSON string.
Saves extracted data to a file in JSON format.
Saves extracted data from the specified page to a file in JSON format.
SaveJSONToFile(Int32, Int32, String): Saves extracted data from the specified page range to a file in JSON format.
Saves extracted data to a stream in JSON format.
Saves extracted data from the specified page to a stream in JSON format.
SaveJSONToStream(Int32, Int32, Stream): Saves extracted data from the specified page range to a stream in JSON format.
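The two page-range overloads listed above, SaveJSONToFile(Int32, Int32, String) and SaveJSONToStream(Int32, Int32, Stream), could be called roughly as in the following sketch. It is not taken from the SDK. To compile without Bytescout.PDFExtractor.dll it declares a hypothetical stand-in interface, IRangeJsonSaver, that copies only those two signatures, plus a fake implementation, FakeExtractor. The parameter names (startPage, endPage, fileName, stream), the zero-based page numbering, and the UTF-8 encoding of the stream output are all assumptions.

```csharp
using System;
using System.IO;
using System.Text;

// Hypothetical stand-in mirroring only the two page-range signatures
// listed above; the real IJSONExtractor has more members.
public interface IRangeJsonSaver
{
    void SaveJSONToFile(int startPage, int endPage, string fileName);
    void SaveJSONToStream(int startPage, int endPage, Stream stream);
}

// Fake implementation, present only so the sketch runs without the SDK.
// It writes a tiny JSON object instead of real extracted data.
public sealed class FakeExtractor : IRangeJsonSaver
{
    private static readonly Encoding Utf8NoBom = new UTF8Encoding(false);

    private static string Render(int startPage, int endPage)
    {
        return "{\"startPage\":" + startPage + ",\"endPage\":" + endPage + "}";
    }

    public void SaveJSONToFile(int startPage, int endPage, string fileName)
    {
        File.WriteAllText(fileName, Render(startPage, endPage), Utf8NoBom);
    }

    public void SaveJSONToStream(int startPage, int endPage, Stream stream)
    {
        byte[] bytes = Utf8NoBom.GetBytes(Render(startPage, endPage));
        stream.Write(bytes, 0, bytes.Length);
    }
}

public static class RangeExtraction
{
    // Captures the stream overload's output in memory and returns it as a
    // string. Assumes the extractor writes UTF-8 (an assumption, see above).
    public static string RangeToString(IRangeJsonSaver extractor, int startPage, int endPage)
    {
        using (var buffer = new MemoryStream())
        {
            extractor.SaveJSONToStream(startPage, endPage, buffer);
            return Encoding.UTF8.GetString(buffer.ToArray());
        }
    }

    public static void Main()
    {
        IRangeJsonSaver extractor = new FakeExtractor();

        // Page range 0..2, assuming zero-based page indexes.
        Console.WriteLine(RangeToString(extractor, 0, 2));

        // The file overload takes the same range plus an output path.
        string path = Path.Combine(Path.GetTempPath(), "range.json");
        extractor.SaveJSONToFile(0, 2, path);
    }
}
```

With the real library, an object implementing IJSONExtractor would take the place of FakeExtractor. Writing to a MemoryStream is one way to get the range output as a string without creating a temporary file.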