Convert any text-based PDF directly into an editable .xlsx workbook. Auto-detected rows and columns, number-aware cells (so SUM and AVG work without re-typing), per-page sheets, and a built-in audit-trail meta sheet.
01 — What you create
Direct PDF → .xlsx conversion. Each page becomes its own named sheet (or everything goes into one combined sheet, your choice). Numeric values land as actual number cells — SUM, AVG, sort, and filter all work the moment you open the workbook.
trial-balance-fy26.xlsx
Converted from 8-page PDF · 5 sheets + _meta · numbers as numbers
Sheet: Page 3
| Account | Code | Opening | Debit | Credit | Closing |
|---|---|---|---|---|---|
| Cash on hand | 1001 | 24,800 | 142,000 | 118,400 | 48,400 |
| HDFC current a/c | 1010 | 482,300 | 1,240,800 | 980,650 | 742,450 |
| Sundry receivables | 1200 | 318,000 | 0 | 95,400 | 222,600 |
| Office furniture | 1500 | 156,000 | 0 | 0 | 156,000 |
| Sundry payables | 2100 | -84,200 | 0 | 42,100 | -126,300 |
| GST input credit | 1310 | 18,900 | 62,400 | 38,800 | 42,500 |
| Travel expenses | 5210 | 0 | 28,400 | 0 | 28,400 |
+ sheets Page 1, 2, 4, 5 plus _meta · every numeric cell is real number-typed for SUM/AVG
Scanned invoices, multi-page batches, multi-currency stacks, and direct push into your accounting system. Free for 30 days, no card required.
Try Premium FreeFree 30 days · no credit card · cancel anytime
02 — How it works
The whole point of "PDF to Excel" is whether you can SUM the column the second you open the file. This converter coerces numbers to number cells so the workbook is genuinely editable — pivot, sort, chart, filter without re-typing a single value.
Drag any text-based PDF — trial balance, GST return, vendor ledger — and the tool extracts its text layer with pdfjs locally.
Live preview shows the table the moment you change settings. Cells in green will become real number cells in the output.
One click writes a proper .xlsx workbook with one sheet per page (or combined), auto-sized columns, and a _meta audit sheet.
03 — Built for accounting
Currency-prefixed, comma-thousands, EU-decimal, accounting-negative, percentage — all detected and upgraded to real number cells. SUM and AVG work without manual cleanup.
Default for multi-section PDFs (trial balances, multi-month statements). Every page becomes its own named sheet so navigation matches the source PDF.
For long single-table PDFs (200-row vendor ledger spanning 5 pages), flatten everything into a single sheet with one header row at the top.
Tune row / column tolerance with a live preview. Cells flagged green will become numeric in the .xlsx output — adjust before exporting.
Every output workbook includes a tiny _meta sheet recording the source PDF, page count, mode, tolerance, header setting, and number-coercion choice. Reproducibility for free.
PDFs and the assembled workbook never touch the network. pdfjs reads the source locally; SheetJS writes the .xlsx locally. No upload, no API, no logging.
Bulk OCR, batch invoicing, multi-party e-signing, redaction, audit logs — pdfFiller picks up where Sonchoy ends. Free for 30 days, no credit card.
Run 100+ invoices, statements, or conversions in one go.
Turn paper invoices into searchable, exportable data.
Multi-party signatures with full audit trails.
Mask sensitive ledger lines before sending to auditors.
04 — Common questions
Same engine — both produce a real .xlsx with number-typed cells, _meta audit sheet, and multi-sheet output. The two pages exist because users searching for "PDF to XLSX Converter" vs "PDF to Excel" expect slightly different framing. Functionality is identical; pick whichever name matches what you typed into Google.
Those cells will become real number-typed cells in the .xlsx output (not strings). Currency-prefixed values ("INR 4,521.50"), accounting negatives ("(1,200.00)"), and percentages ("12.5%") all get detected and converted. Cells that stay default-coloured remain strings.
No — needs a text layer. Scanned image-only PDFs need OCR first; the pdfFiller premium tier handles that. Most modern PDFs (bank statements, GST returns, invoices) have text layers and work fine.
A small audit-trail sheet at the end of the workbook recording source PDF name, page count, mode, tolerance, header setting, and number-coercion choice. Useful for reproducibility ("how was this extracted?") and audit reviews.
Two knobs: row tolerance (if too tight, single rows get split across multiple lines; loosen it) and column tolerance (if too tight, single columns get split; loosen it). The live preview updates instantly without exporting.
Never. The PDF is parsed locally by pdfjs, tabularised in JavaScript, and serialised to .xlsx locally by SheetJS. The browser triggers the download. No upload, no third-party API, no logging.
05 — Related tools