ByteScout PDF Extractor SDK v.


PDF Extractor SDK for Windows software developers: PDF to Text, PDF to XML, Images from PDF, Read PDF information, PDF to CSV for Excel.

ByteScout PDF Extractor SDK converts PDF to text, XML, and CSV, extracts images from PDF, and reads information about PDF files through .NET and ActiveX interfaces, with no additional software required.

- converts PDF to plain text, including invisible text (and can follow columns if you are converting a newspaper in PDF format);
- converts tables in PDF to CSV for Excel by reading cells from a given rectangle;
- converts tables in PDF to XML files;
- extracts PDF file metadata (title, author, description) and gets other information about the file (number of pages, whether it is encrypted);
- extracts embedded images from PDF document (in ASP.NET, VB.NET, C#, VB6 and VBScript);
- NEW: DocumentMerger and DocumentSplitter interfaces and classes to merge and split PDF documents;
- doesn't require Adobe Reader or any other PDF reader software to be installed;
- provides .NET and ActiveX interfaces;
- made with 100% managed C# code.
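
The "reading cells from a given rectangle" idea in the feature list can be illustrated with a minimal sketch. This is not the SDK's API (the SDK exposes .NET and ActiveX interfaces); it is a generic Python illustration in which the `Fragment` type, the `rectangle_to_csv` function, and the `row_tolerance` parameter are all hypothetical names. It assumes the text has already been extracted as fragments with page coordinates.

```python
import csv
import io
from typing import NamedTuple


class Fragment(NamedTuple):
    """A piece of text and the position of its top-left corner (hypothetical type)."""
    x: float
    y: float
    text: str


def rectangle_to_csv(fragments, left, top, width, height, row_tolerance=2.0):
    """Return CSV text built from the fragments that fall inside the rectangle."""
    inside = [
        f for f in fragments
        if left <= f.x <= left + width and top <= f.y <= top + height
    ]
    # Sort top to bottom, then group fragments with close y values into one row.
    inside.sort(key=lambda f: (f.y, f.x))
    rows = []
    for frag in inside:
        if rows and abs(frag.y - rows[-1][0].y) <= row_tolerance:
            rows[-1].append(frag)
        else:
            rows.append([frag])
    out = io.StringIO()
    writer = csv.writer(out, lineterminator="\n")
    for row in rows:
        # Within a row, cells run left to right.
        writer.writerow(f.text for f in sorted(row, key=lambda f: f.x))
    return out.getvalue()


# Made-up sample data: one header line outside the rectangle, two table rows inside.
fragments = [
    Fragment(10, 5, "Page header"),
    Fragment(10, 100, "Item"),
    Fragment(120, 100, "Price"),
    Fragment(10, 115, "Widget"),
    Fragment(120, 116, "9.99"),
]
csv_text = rectangle_to_csv(fragments, left=0, top=90, width=200, height=50)
print(csv_text)
```

Restricting extraction to a rectangle keeps page headers, footers, and surrounding paragraphs out of the CSV, which is why a table region is given explicitly rather than inferred from the whole page.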

PDF Extractor SDK for .NET, ASP.NET, ActiveX. PDF Extractor SDK allows developers to convert PDF to text and PDF to XML, extract images from PDF, convert PDF tables into CSV for Excel, and read information about a PDF file through .NET or ActiveX interfaces. It works without any additional software.

pdf extractor, pdf to txt, pdf to jpg, pdf to text, pdf to image, pdf to xml, c pdf, pdf library, extract pdf, pdf to csv, pdf to excel, pdf text extraction, pdf conversion, net, activex, pdf sdk, asp net, convert pdf, extract from pdf

  • ByteScout PDF Extractor SDK
  • 11 Apr 18
  • ByteScout, Inc
  • Win2000, WinXP, Win7 x32, Win7 x64, Windows 8, Windows 10, WinServer, WinOther, WinVista, WinVista x64
  • Demo
  • 596 Kb
  • 1086
  • $10.00
Latest Versions History
  • 19.08.2016: Added filtering of extracted content by font name, font size and color. Updated OCR engine to the latest version. Update language files from 'tessdata' folder. Improved text extraction, lines grouping in tabular data, performance, XFA forms extraction, TableDetector; fixed PDF parsing issues.
  • 23.03.2016: Added TextComparer utility class (available in .NET 4.0 assemblies only) allowing to compare text in two PDF documents and generate a report; improved support of ICC color profiles; improved handling of embedded fonts; improved AttachmentExtractor; fixed XMLExtractor.SaveXMLToStream() method.
  • 5.10.1747 (01.12.2014): PDF to XML, PDF to CSV, PDF to Text functions improved; now supports text extraction from text controls; XML extractor now adds font style, size, name, text coordinates into <text> tags; ASP.NET sample for OCR usage added; new property OCRLanguageDataFolder to specify the location of OCR data.
  • 5.00.1626 (19.08.2014): OCR (text from PDF images) functionality: now you may extract text from embedded images and repair damaged text; issue fixed with CSV and XML extractor missing last columns with some settings; improved support for damaged PDF files; multiline search; text search with word matching modes; and more.
  • 4.00.1487 (02.06.2014): Improved PDF to text, PDF to CSV, PDF to XML; new XFA Form XML extractor; ZuGFeRD invoices extraction added; new .ContentType to check if PDF is PDF, Portfolio or XFAForm; new AttachmentInfo class to read details about attachment; improved text handling; minor bug fixes and improvements.
  • 3.40.1349 (10.03.2014): Improved stability of PDF to text; fixed issue with the very last text line missing in some PDF files; tables with empty cells are handled better now; fixed incorrect extraction of overlapped text objects, missing spaces between words in some files, and minor issues with text search.

Other software of ByteScout, Inc
  • Bytescout BarCode Generator  v. Generator is able to generate and export barcode to image (PNG, JPG, TIFF). Types: Codabar, Code 39, GS1, Code 93, Code 128, EAN-13, EAN-8, JAN-13, Bookland, UPC-A, UPC-E, Postnet, PDF417, Truncated PDF417, DataMatrix, QR Code ...
  • Bytescout BarCode Generator SDK  v.4.0 This PAD extension allows you to add your site info into your PAD file. This info can be used by site submission software or by web directories themselves.
  • Bytescout BarCode Reader  v. Bytescout BarCode Reader can read barcodes from images (JPG, TIFF, PNG, GIF) and from PDF. The software is based on Bytescout BarCode Reader SDK for software developers. Reads Code 39, Code 128, QR Code, PDF417 and many more.

New Components & Libraries software
  • DotConnect for MySQL  v.8.18 dotConnect for MySQL is an enhanced data provider built on ADO.NET architecture and a development framework with a number of innovative technologies. It supports Entity Framework, NHibernate, and LinqConnect ORMs.
  • DotConnect for SQLite  v.5.16 dotConnect for SQLite is a data provider built on ADO.NET architecture. With Entity Framework and LinqConnect support it introduces new approaches for designing applications, boosts productivity, and leverages database applications.
  • DotConnect for Oracle  v.9.13 dotConnect for Oracle is an enhanced ORM enabled data provider for Oracle that builds on ADO.NET technology to present a complete solution for developing Oracle-based database applications.
  • DotNet4Java  v. dotNet4Java is a .Net Runtime Library for Java which helps Java developers work with .Net libraries and framework from Java. It is designed to provide a way to interact with .Net applications from Java.
  • 4 Suit Scorpion Spider Solitaire  v.1.0 Hey there, Spider Solitaire master! You have found yourself on the hardest card game on the Card Game Spider Solitaire site! Either you have mastered solitaire or you really like a challenge, or both! Either way, this game will be sure to please you!
  • Split Merge Pro - Extract Pdf  v. Split & Merge Pro is an advanced PDF page extractor application to add, delete and divide pages from pdf files in batch mode. Software can combine PDF as well as images together to create a joined document. It appends PDF & images too.