Extract data from annual reports
Drop a 10-K, annual report, or earnings release PDF and pull the metrics that matter: revenue, net income, EPS, segment breakdowns, balance-sheet figures — across years, in a clean table.
Why this matters
Financial documents are long, dense, and inconsistent in layout from filer to filer. Manual scraping with PDF tools breaks on every new report. ExtractFox understands financial statements semantically and returns numbers in a stable schema you can pipe into a model or dashboard.
How it works
- Step 1Upload the report PDF
10-K, 10-Q, annual report, integrated report — up to 20 MB.
- Step 2Choose what to extract
Income statement, balance sheet, cash flow, segment breakdown, key metrics by year, or a custom set of fields you describe.
- Step 3Export to Excel or JSON
Numeric fields come back as numbers (in the original currency, no symbols), so SUM and AVG work in Excel.
Common use cases
Sample output
Example: 3-year income statement extracted from a 10-K
| company | Acme Corp. |
| fiscal_year_end | December 31 |
| currency | USD |
| year | revenue | operating_income | net_income | eps_diluted |
|---|---|---|---|---|
| 2024 | 12450000000 | 2210000000 | 1820000000 | 4.82 |
| 2025 | 13980000000 | 2540000000 | 2104000000 | 5.49 |
| 2026 | 15210000000 | 2870000000 | 2387000000 | 6.12 |
Frequently asked questions
How do I extract data from an annual report or 10-K PDF?+
Drop the PDF here, choose income statement / balance sheet / segments — or describe the exact fields and years you want. ExtractFox returns numeric values you can sum and chart.
Can it extract financial tables that span multiple pages?+
Yes. Tables that wrap across pages are stitched into a single ordered list. Three-year and five-year comparative tables come through with one row per year.
Does it work on scanned annual reports?+
Yes. Both digital (text-based) and scanned (image-based) annual reports work. Older filings that were scanned to PDF extract just like modern native PDFs.
Can I extract the same fields across multiple companies for comparison?+
Yes. Run the same extraction (same description or schema) on each company's filing. The stable schema means competitive comparison is a join across the JSON outputs.
What about non-US filings — annual reports under IFRS or local GAAP?+
Works the same way. The model identifies the income statement, balance sheet, and cash flow regardless of accounting standard. Currency and units (millions vs thousands) are preserved.
Can it extract risk factors and forward-looking statements?+
Yes. Use the description box to ask for, e.g., 'every risk factor as a row with title and one-paragraph summary'. The model returns a list, not a wall of text.