PDF to Word troubleshooting: scanned files, password errors, and layout differences
Fixes for the most common problems when converting a PDF to Word in AttachKit: scanned PDFs with no text layer, password-protected files, OCR that finds no text, output that doesn't match the original layout, and oversized .docx files.
Last updated
PDF to Word runs entirely in your browser — your PDF is never uploaded — so most problems come down to the file itself or to what a structure-based conversion can honestly recover. Here are the common ones and how to fix them.
"This PDF has no text layer — it looks scanned"
Cause: The PDF contains only page images (a scan or photo of a document), so there's no text for the converter to read.
- Click OCR & convert to Word right in the warning. AttachKit recognizes the text on every page in your browser — nothing is uploaded — showing "Reading page X of Y…" with a progress bar. You can Cancel at any time.
- The inline OCR reads English. For a scan in another language, open Searchable (OCR) — the warning links there as "Open the OCR tool →" — pick the language, OCR the file, then convert the searchable copy.
- Spot-check the result. OCR output is a recognition of the page image, not the original text, so check names, numbers and anything critical.
Note: on the OCR path the .docx contains the recognized text only — page images aren't embedded, since a scanned page is itself one big image and embedding it would just duplicate the text visually.
"This PDF is password-protected, so its text can't be read to convert"
Cause: The PDF requires a password to open, so the converter can't read its pages.
- Open Unlock — the error links there as "Remove the password first →".
- Enter the PDF's password to produce an unlocked copy. That also happens locally; the password and file stay on your device.
- Bring the unlocked copy back to PDF to Word and convert it.
"OCR didn't find any readable text on this PDF"
Cause: OCR ran but recognized nothing usable — typically a very low-quality or skewed scan, handwriting (which OCR generally can't read), blank pages, or text in a language the inline English OCR doesn't recognize.
- For non-English documents, use Searchable (OCR) and choose the right language.
- If the scan is poor, re-scan at a higher resolution (300 DPI or better, straight and well-lit) and try again.
- Handwritten documents are out of scope for OCR — there's no reliable fix for those.
"Couldn't OCR and convert this PDF to Word"
Cause: The OCR run itself failed. The most common trigger is a very long scan — the on-device OCR engine caps at 200 pages — or the browser running out of memory mid-run.
- Split the document first with Pages and convert the parts separately.
- Close other heavy tabs to free memory, especially on phones and low-RAM machines, then retry.
The Word file doesn't look like the original PDF
Cause: This is the conversion's honest scope, not a bug. It reconstructs structure — paragraphs in reading order, headings, bold/italic, tables, links, images, alignment — from the text layer. Exact fonts and complex multi-column layout aren't reproduced; reproducing them pixel-perfectly would need a server-side rendering engine, and AttachKit doesn't upload your file.
- If a bordered table came through as plain text, the table's borders may be drawn as images rather than vector lines — bordered (ruled) tables convert best.
- On pages that mix text with figures or tables, content is placed top-to-bottom; multi-column reading order is preserved on pure-text pages.
- If what you actually need is the exact appearance rather than editable text, keep the PDF or export pages as images with PDF to JPG/PNG.
The .docx file is much bigger than the PDF
Cause: You converted with Best quality, which embeds recovered images as lossless PNG.
- Click Convert another, reload the file, switch the image quality back to Smaller file (compressed JPEG images) — the tool keeps whichever option you last picked — and convert again.
"Couldn't convert this PDF to Word — it may be corrupted or in an unsupported format"
Cause: The file couldn't be parsed as a usable PDF. It may be damaged, truncated by a bad download or email transfer, or not really a PDF (e.g. a renamed image or Word file).
- Confirm the file opens in another PDF viewer. If it doesn't open anywhere, re-download or re-export it from the source.
- If it's an image rather than a PDF, convert it first with JPG to PDF.
- If the file opens fine elsewhere but still fails here, report it — that's a case the team wants to see.
Still stuck?
If none of these match what you're seeing, contact support and describe what happened — since your file never leaves your browser, support can't see it, so include the file's size, page count, and whether it's a scan.
Related
Still stuck? Contact support →