Extract data from PDF to Excel

Pick the document type, or describe what you want extracted in plain English. ExtractFox reads the PDF, returns structured data, and downloads as a clean .xlsx — no formatting work needed on the other side.

Drop a PDF or image here, or click to browse
Max 20 MB per file · PDF, PNG, JPG, WEBP, HEIC
Pro: drop up to 25 files at once for bulk extraction
What to extract from this pdf?
or describe it yourself
Extracting all tables

Why this matters

PDF to Excel converters that just dump text into cells are useless past the first column. ExtractFox extracts the actual fields you want — invoice line items, statement transactions, contract metadata, anything — and returns a spreadsheet you can pivot, sum, and filter without cleanup.

How it works

  1. Step 1
    Upload your PDF

    Any PDF up to 20 MB, including scans and multi-page docs.

  2. Step 2
    Pick a template or describe the data

    Use a prebuilt schema (invoice, statement, contract, etc.) or type a free-text request like 'pull every line item where amount is over $500'.

  3. Step 3
    Download as Excel

    Tabular data lands in rows; document-level metadata sits in a header strip on a second sheet.

Common use cases

Invoices and bills — line items, totals, vendor
Bank and credit-card statements — every transaction as a row
Receipts — items, prices, tax, total for expense reports
Order confirmations and shipping manifests
Annual reports — financial tables, balance sheet figures
Survey responses and form submissions
Medical lab reports and test results
Real-estate listings and inspection reports
Insurance claims and policy documents
Government filings, tax documents, and certificates

Sample output

Example: a 3-line invoice extracted to spreadsheet rows

vendor          | issue_date | description                  | qty | unit_price | amount
----------------|------------|------------------------------|-----|------------|-------
Acme Supplies   | 2026-04-12 | A4 paper, 80gsm, 500 sheets  |  12 |       4.50 |  54.00
Acme Supplies   | 2026-04-12 | Toner cartridge, black       |   2 |      89.00 | 178.00
Acme Supplies   | 2026-04-12 | Shipping                     |   1 |       9.00 |   9.00

Frequently asked questions

How do I extract data from a PDF to Excel?+

Upload the PDF on this page, choose a document type or write a quick description of what you want, then click Extract and download as .xlsx.

Does this work for scanned PDFs?+

Yes. Scanned PDFs and image-based PDFs both work — the underlying model handles OCR end-to-end.

How is this different from Adobe Acrobat or Smallpdf 'Export to Excel'?+

Acrobat and similar tools convert text positions to cells, which produces a mess on anything that isn't a perfect spreadsheet-shaped table. ExtractFox extracts the fields you actually care about, regardless of layout.

Can I automate this — extract data from PDF to Excel automatically?+

On the paid plan, hit the REST API with a PDF, get JSON or .xlsx back. Wire it to a folder watcher, an email inbox, or a Zap.

What about multi-page PDFs with tables that span pages?+

Tables that wrap across pages are stitched together into a single ordered list — the model reads the whole document in one pass, not page by page.

Can I extract just specific columns or filter rows during extraction?+

Yes. Type your request in the description box below the document tiles — for example, 'just the date and amount columns' or 'only line items where amount is over $500'. ExtractFox figures out the structure and returns only those fields.

Will the Excel file have proper column headers and data types?+

Yes. Numbers are real numbers (so SUM and AVERAGE work), dates are strings in ISO format (YYYY-MM-DD), and column headers come from the field names in the schema.

Can I get the same output as CSV or JSON instead of Excel?+

Yes — see PDF to CSV for delimited output and PDF to JSON for an API-ready structured payload. Same engine, different format.

Related extractors

Compared to alternatives