ByteScout PDF To HTML SDK

History (changes log)

 
9.2.0.3254 (October 22, 2018)   
=============================
- Improved support of cropped PDF documents.
= Performance improvements.
- Other minor fixes and improvements.


9.1.0.3163 (July 18, 2018)
==========================
= Fixed bugs in PDF parsing.
- Other minor fixes and improvements.


9.0.0.3079 (April 11, 2018)
===========================
= Improved PDF parsing.
- Other minor fixes and improvements.


8.8.0.3015 (January 22, 2018)
=============================
= Improved support of encrypted PDF documents.
- Other minor fixes and improvements.


8.7.0.2980 (November 8, 2017)
=============================
+ Added `ParsingError` event that allows handling parsing errors and either interrupting or continuing the processing.
+ Implemented embedding of images into HTML code as `data URI`.
= Improved rendering of PDF documents.
= Performance improvements.
- Other minor fixes and improvements.


8.6.0.2911 (August 1, 2017)
===========================
+ HTMLExtractor: added KeepOriginalFontNames property.
- Fixed crash on 64-bit CMYK images.
- Subscript and superscript text objects were merged with normal text. Fixed now.
= Other minor fixes and improvements.


8.5.0.2855 (June 1, 2017)
=========================
= Converter now keeps empty input text fields.
= Improved Japanese text conversion.
= Removed obsolete ClientProfile builds.
= Performance improvements.
- Other minor fixes and improvements.


8.3.0.2794 (March 06, 2017)
===========================
= Performance improvements.
+ HTMLExtractor is now able to reflow PDF text in PlainHTML mode.
= Other minor improvements and bug fixes.


8.2.0.2697 (January 11, 2017)
=============================
= Improved PDF to plain HTML conversion: it's now possible to reflow newspaper layout to single-column HTML text.
= Other minor improvements and bug fixes.


8.1.0.2600 (October 21, 2016)
=============================
- Fixed extraction of text in Korean charset (KSCms-UHC-H / Code Page 949).
= Improved support of colorspaces and soft masks.
- Improved extraction of invisible text.
= Other minor improvements and bug fixes.


8.0.0.2523 (August 19, 2016)
============================
= Improved text extraction.
= Improved performance.
- Fixed PDF parsing issues.
- Fixed JBIG images decoding.
= Other minor improvements and bug fixes.


7.00.0.2474 (May 26, 2016)
==========================
- Fixed a bug in the installer that caused some files from a previous installation to interfere with updates.
= Reworked the registration check. The library no longer throws an exception;
  it works in demo mode if RegistrationName and RegistrationKey are missing or incorrect.
+ PDF Multitool: Added recent document list to "Open PDF Document" button.
+ PDF Multitool: Selection can be resized now.
+ PDF Multitool: Added Extract JSON feature.
= PDF Multitool: Improved Table Detector UI.
= PDF Multitool: Greatly improved font rendering quality.
+ PDF Multitool: Added debug option "Show Detected Extraction Columns" to the context menu to display 
  the detected columns on the current page. Becomes visible only after running any extraction against 
  the current displayed page.
- PDF Multitool: Fixed font rendering issue on 32-bit Windows.
= Other minor improvements and bug fixes.


6.30.0.2421 (March 23, 2016)
============================
= Improved support of ICC color profiles.
= Improved handling of embedded fonts.
- Fixed extracted text duplication when using OCRCacheMode.WholePage option.
= Other bug fixes and improvements.


6.20.2354 (January 20, 2016)
============================
PDF To HTML conversion improved
PDF reading speed improved
new property PageDataCaching controlling automatic disposing of previously accessed pages.
implementing page caching type
example "Memory-Care Processing Of Huge Docs" renamed to "Reduce Memory Usage"
new DisposePage() method to HTMLExtractor
SkipInvisibleText now skips clipped text (which is not visible)
improved rendering of annotations
Colors management improved
PDF2HTML improves form controls output
Images output improved to support more sub-formats
Unicode text handling improved
minor bug-fixes


6.12.2239 (September 17, 2015)
==============================
PDF To HTML conversion improved
PDF format support improved
images and text support improved
minor bug-fixes


6.00.2131 (June 16, 2015)
=========================
HTML output improved (issues with links fixed)
minor bug-fixes


6.00.2114 (June 3, 2015)
========================
improved PDF To HTML conversion
improved PDF format support
improved images, fonts, text encoding support
minor issues fixes with HTML output
minor improvements and fixes


5.10.1750 (November 27, 2014)
=============================
improved pdf to html conversion from asp.net and .net
issue with overlapping content when converting multiple pages from PDF fixed
XHTML output minor fixes
Support for text opacity added
now outputs unknown character (0 to 32) as "?" 
improving support of pdf images conversion into html
fixing minor issues with output images filenames 
minor improvements and fixes


5.00.1639 (August 23, 2014)
===========================
improved handling of images while converting pdf to html
improved support for damaged pdf files
HTML text output improved
PDF Viewer GUI demo improved
minor improvements and fixes

4.00.1481 (May 30, 2014)
improved pdf to html conversion
improved PDF handling and bug-fixes
fixed the issue with missing whitespace while converting text to css
multithreading performance improved
licensing updated: Now Licensing is per developer
special styles for "print" device causing visible scroll bars in every div were disabled
updated samples and improved demo GUI
minor bug-fixes

3.40.1339 (March 5, 2014)
documentation for the API updated and included now into the SDK
improved text processing in PDF
improved PDF reading support
minor bug-fixes

3.20.1179 (October 22, 2013)
pdf to html conversion improved
new OutputHTMLWidth, GetOutputHTMLPageHeight, GetPageHeight, GetPageWidth functions and properties to read and set information about width and height for PDF and output HTML
demo gui: the issue with automatic scrolling up to top when selecting a region fixed 
improved support of PDF files with rotated objects and pages
minor bug-fixes and processing speed improvements

3.20.1076 (July 11, 2013)
improved PDF To HTML conversion
improved PDF files support
minor bug-fixes

3.10.1052 (June 29, 2013)
improved pdf to html conversion functionality
improved images support in PDF files
speed improvements for pdf to html conversion
improved fonts support 
improved support for various PDF files
minor bug-fixes

3.10.904 (May 14, 2013)
improved pdf to html conversion functionality
improved PDF format support
minor bug-fixes

3.00.864 (April 11, 2013)
improved PDF handling
improved PDF fonts handling
demo utility updated
source code samples updated to run on any .NET framework by default
minor bug-fixes

3.00.829 (March 14, 2013)
improved pdf to html conversion
demo utility PDF Viewer reworked and updated for better UI experience
minor improvements and fixes in PDF support
improved PDF stability while working with PDF files with high density vector graphics inside
improved support for indexed color palettes
improved embedded fonts rendering
better support for Unicode fonts
new .Version property to read exact version of the dll
minor updates and improvements

2.40.662 (November 5, 2012)
pdf to html with vector images conversion improved
minor bug-fixes related to vector images representation
improved pdf support
fixed issue with incorrect folder name for images (was using original pdf filename instead of given html output filename)
image files are now using shorter name to prevent issue with too long paths
minor bug-fixes

2.40.650 (November 1, 2012)
improved PDF to HTML conversion support
improved support for Unicode text extraction 
improved support for PDF/A pdf files 
issues with white stripes appearing on multiple images combined fixed
data extraction internal optimizations
improved support for 8 bit images inside PDF
vector drawings improved to provide better support for multiple small objects 
Color representation in images with indexed colors fixed
Type2 fonts support improved
Improved support for embedded fonts in PDF produced by Ghostscript engine
CCITT image compression related issues fixed
LZW compressed PDF support improved
improved support for shading objects
improved PDF fonts support 
improved support for PDF with 4 bit images
new .ControlsAsText property.
new source code sample: "PDF To HTML with Images"
support for annotations and control output has been re-enabled 
memory and speed optimizations



2.30.568 (June 22, 2012)
PDF to HTML conversion in PDF to simple HTML mode improved
PDF Viewer demo: fixed PDF to HTML mode settings dialog, added AddFontHTMLTagsToText setting; CSS mode is now selected by default
new .AddFontStyleHTMLTagsToText to control if PDF To HTML SDK should add font style information to text objects or not
PDF To HTML simple HTML mode reworked and improved
new .FontSubstitutionMap property for HTMLExtractor to read font substitution map used for PDF to HTML conversion
new .PreserveFormattingOnTextExtraction property to HTML2PDF has been added
new .HighPrecisionTextPositioning property for PDF To HTML with CSS mode
multithreaded processing support improved

2.20.487 (March 9, 2012)

Fixing issue with black and white images missing from PDF files made by different PDF writers
Fixing inverted images in PDF files generated by some Adobe applications
Improved support for corrupted PDF files
Some security issues fixed
Improved handling of JBIG encoded images 
Improved handling of PDF built-in fonts
Improved handling of superscript text positions
Improved support of encrypted PDF files
Improved handling of different colorspaces for images


2.20.415 (December 21, 2011)
improved fonts support
conversion to simple HTML improved
performance optimized and pdf processing speed improved
improved support for PDF format
PDF Viewer now provides ability to turn on/off text formatting support on extraction
minor bug-fixes

2.10.356 (November 15, 2011)
better support for different PDF files
minor bug-fixes and improvements
saving to HTML does not attempt to create .images folder anymore

2.10.318 (October 18, 2011)
better support for PDF files (vector drawings are supported now)
minor bug-fixes and improvements


2.10.304 (October 4, 2011)
improved support for PDF files
improved HTML output for multiple pages
pdf processing speed increased
minor bug-fixes and improvements

1.10.168 (May 6, 2011)
support for password protected PDF documents improved (was not working properly in previous release)
minor bug-fixes and improvements

1.10.160 (12 April 2011)

XML comments are available now to show hints for methods, classes and properties in Visual Studio
New property: .ExtractColumnByColumn (default: false), set to True to extract text column by column instead of line by line
PDF Viewer freeware utility updated to feature "Extract Text (line by line)" and "Extract Text (column by column)" buttons
improved support for single paged PDF documents produced by Acrobat Distiller software
clipping issues were fixed 
fixed hanging on some broken PDF documents 
improved text decoding support
minor bug-fixes


1.10.150 (10 March 2011)
* PDF files support improved
+ now handles PDF files from Google Doc without errors
* minor bug-fixes

1.10.144 (26 February 2011)
+ now works with secured documents (provide password if needed in .Password property)
+ minor bug-fixes and improvements
+ updated GUI demo application

1.10.121 (11 February 2011)
+ support for invisible text extraction added
+ minor bug-fixes and improvements

1.00.30 (9 November 2010)
+ new version