Scanned documents and OCR
Lex Cloak reads scanned and image-only PDFs with built-in OCR, so there is no separate step to turn on. If a scanned page detects poorly, check that it is right side up and rotate it if needed, then draw over anything the scan could not read. Clearer scans detect better than faint or skewed ones.
How scanned PDFs are handled
Lex Cloak runs OCR on every page and works from what is actually visible, because a PDF's built-in text layer can be missing, misaligned, or even hidden. That means scanned PDFs and image-only PDFs are scanned without a separate step, and the engine sees the page the way a person does.
When a scanned page detects poorly
- Rotate an upside-down page. A page scanned upside down or sideways often defeats detection. Use the rotate button on the page action float to turn it around, and let it re-scan. Full how-to: rotate an upside-down page.
- Draw over what the scan could not read. For faded handwriting, a stamp partially over text, or a logo, hold Shift and drag to draw a redaction box over it. Full how-to: draw a redaction over anything.
- Redact the whole page on low-quality scans. When a page is too faint or smudged to review match-by-match, press R to redact the entire page.
- Use thumbnails to spot a weak page. The Thumbnails panel shows match counts at a glance, so a page with surprisingly few matches stands out.
Clearer scans detect better than faint or skewed ones. If you have access to a higher-quality source of the same document, scanning that source is usually faster than working around a poor copy.