Free PDF to Excel Converter
Pull tabular data out of any PDF and get an Excel spreadsheet you can sort, filter, and analyze. The converter detects table structures by analyzing text positions on each page. Download as .xlsx for Excel and Google Sheets, or CSV for universal compatibility. All processing happens in your browser — your files never leave your device.
Drop a PDF here or click to browse
Your file never leaves your device
🔒 Your files never leave your device — extracted with PDF.js in your browser.
Frequently Asked Questions
Are my PDF files uploaded to a server?+
No. All table detection and extraction happens inside your browser using PDF.js. Your files never leave your device.
How does table detection work?+
The converter analyzes the position (x, y coordinates) of every text element on each page. Text items with similar y-coordinates are grouped into rows; columns are inferred from their x-positions. Well-formatted tables with consistent alignment extract very accurately.
What if my table has merged cells or complex formatting?+
Merged cells and heavily styled tables may not extract perfectly. The converter handles standard tabular layouts well. For complex tables, try downloading as CSV and then cleaning the data in Excel.
Can I extract tables from scanned PDFs?+
Scanned PDFs contain images, not text. Use the PDF OCR tool first to generate a text layer, then run the PDF to Excel conversion on the resulting searchable PDF.
What is the difference between .xlsx and .csv output?+
.xlsx is a full Excel workbook with formatting support — open directly in Excel, Google Sheets, or LibreOffice Calc. .csv is plain text with comma-separated values — universally compatible with any spreadsheet tool, but supports only text values without formatting.
The data does not look right. What can I do?+
PDF table extraction is heuristic. If the result is poor, try downloading as CSV and cleaning the data in Excel. You can also manually copy the preview text and paste it into a spreadsheet, using the delimiter to align columns.
How PDF Table Extraction Works
PDF files do not store tables as structured data. Instead, every cell is stored as a positioned text item with an x/y coordinate on the page. Extracting a table means inferring structure from position: text items at similar y-coordinates are grouped into rows; items within those rows are clustered into columns by their x-positions. The result is approximate — it works well for evenly spaced, clearly defined tables and poorly for irregular layouts.
What Converts Well
Financial reports with fixed-width columns, bank statements, data exports originally created in Excel and then saved as PDF, and simple grid tables with consistent spacing all convert with high accuracy. If the original document was a spreadsheet, this tool essentially reverses the export step. Preview the detected table before downloading to confirm the column alignment looks correct.
What Converts Poorly
Tables with merged cells, nested tables, tables spanning multiple pages, tables with graphics in cells, and tables with varying row heights often produce misaligned output. For these cases, download as .csv first, open it in a text editor to inspect the raw extraction, and manually adjust the columns before importing into Excel. A partial extraction is still useful — it saves retyping the data.
CSV vs XLSX
CSV is the more portable format: it opens in any spreadsheet application, any programming language, and any text editor. Use CSV when you are piping the data into another tool — for example, the CSV to JSON converter — or when the downstream system expects CSV. Use XLSX when you need to open the file directly in Excel and want cell formatting preserved, or when you are sharing with someone who uses Excel and prefers .xlsx files.
Multi-Page PDFs
For PDFs with tables on multiple pages, the extractor processes each page independently and appends the detected rows into a single sheet. This works well for PDFs where the same table continues across pages (like a long financial report), since the column structure is consistent. If each page contains an unrelated table, split the PDF first using the PDF Splitter and convert each section separately.