If you're a Python user and you want to be able to convert PDFs without uploading them manually to PDFTables.com, you can make use of our brand-new Python PDFTables API.
In this tutorial, I'll be showing you how to get the library set up on your local machine and how to use it to convert a PDF to Excel from a folder.
Here's an example of a PDF that I've converted with the library. In order to properly test the library, make sure you have a PDF handy!
Before we start
If you haven't already, install Anaconda on your machine from the Anaconda website. You can use either Python 3.6.x or 2.7.x, as the PDFTables API works with both. Anaconda also installs pip, which gives you a simple way to install the PDFTables API Python package.
You'll also need to create an account on PDFTables.com in order to get your free PDFTables API key.
For this tutorial, I'll be using the Windows Python IDLE Shell, but the instructions are almost identical for Linux and Mac.
In your terminal/command line, install the PDFTables Python library with:
pip install git+https://github.com/pdftables/python-pdftables-api.git
Or, if you'd prefer to install it manually, you can download it from the python-pdftables-api repository on GitHub, then install it from inside the downloaded folder with:
python setup.py install
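To confirm the install worked before moving on, a quick check like the one below can help. It's only a minimal sketch: the helper name is_installed is my own, it assumes the package is importable as pdftables_api (the module name used in the script in this tutorial), and importlib.util means it needs Python 3.

```python
import importlib.util


def is_installed(module_name):
    """Return True if Python can find a top-level module with this name."""
    return importlib.util.find_spec(module_name) is not None


# pdftables_api is the import name used by the script in this tutorial.
if is_installed("pdftables_api"):
    print("pdftables_api is installed")
else:
    print("pdftables_api was not found; try the pip command again")
```

If it reports that the module was not found, the most likely cause is that pip installed the package into a different Python installation from the one you're running.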
Create a new Python script then add the following code:
import pdftables_api
c = pdftables_api.Client('my-api-key')
c.xlsx('input.pdf', 'output.xlsx')
Now, you'll need to make the following changes to the script:
Replace my-api-key with your PDFTables API key, which you can get from your PDFTables.com account.
Replace input.pdf with the PDF you would like to convert.
Replace output.xlsx with the name you'd like for the converted spreadsheet.
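If you'd rather not paste your key into the script, one option is to read it from an environment variable and derive the spreadsheet name from the PDF name. This is only a sketch, written for Python 3: the variable name PDFTABLES_API_KEY and the helpers xlsx_name_for and convert_pdf are hypothetical names of my own. The only library calls it relies on are pdftables_api.Client and its xlsx method, both taken from the script above.

```python
import os


def xlsx_name_for(pdf_path):
    """Swap the file extension for .xlsx, e.g. input.pdf -> input.xlsx."""
    base, _ext = os.path.splitext(pdf_path)
    return base + ".xlsx"


def convert_pdf(client, pdf_path):
    """Convert one PDF with an existing client and return the output path."""
    # Fail early with a clear message instead of sending a bad path onwards.
    if not os.path.isfile(pdf_path):
        raise FileNotFoundError("No such PDF: " + pdf_path)
    out_path = xlsx_name_for(pdf_path)
    client.xlsx(pdf_path, out_path)
    return out_path


# Typical use (left as a comment, since it needs the library and a real key):
#
#   import pdftables_api
#   client = pdftables_api.Client(os.environ["PDFTABLES_API_KEY"])
#   print(convert_pdf(client, "input.pdf"))
```

Passing the client in as an argument, rather than creating it inside the function, keeps the key handling in one place and makes the helper easy to try out with a stand-in object.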
Now, save your finished script as convert-pdf.py in the same directory as the PDF document you want to convert.
Open your command line/terminal and change to the folder where you saved your convert-pdf.py script and PDF (e.g. cd C:/Users/Bob), then run the following command:
python convert-pdf.py
To find your converted spreadsheet, navigate to the folder in your file explorer and hey presto, you've converted a PDF to Excel with Python!
Looking to convert multiple PDF files at once?
Check out our blog post here.
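As a rough idea of what a batch version could look like, here is a short sketch that loops over every PDF in a folder. It isn't necessarily the approach the linked post takes: convert_folder is a hypothetical helper of my own, and it assumes a client created with pdftables_api.Client as in the script earlier in this tutorial.

```python
import glob
import os


def convert_folder(client, folder):
    """Convert every .pdf in folder; return the list of spreadsheets written."""
    written = []
    # Note: "*.pdf" is case-sensitive on Linux and Mac, so "FILE.PDF" is skipped there.
    for pdf_path in sorted(glob.glob(os.path.join(folder, "*.pdf"))):
        out_path = os.path.splitext(pdf_path)[0] + ".xlsx"
        client.xlsx(pdf_path, out_path)
        written.append(out_path)
    return written


# Typical use (left as a comment, since it needs the library and a real key):
#
#   import pdftables_api
#   client = pdftables_api.Client('my-api-key')
#   for path in convert_folder(client, "."):
#       print("Wrote", path)
```

Each spreadsheet is written next to its source PDF with the same base name.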