How to Use OCR For a PDF

Optical Character Recognition, or OCR for short, is the process a computer uses to recognize letters, numbers, and special characters in pictures. Even though you may be able to read the content in a PDF file, a computer considers the writing to be a picture unless OCR has been applied. You can test if a PDF has been OCRed by trying to highlight a word in the file with your cursor; if you can't pick out a word, the file needs OCR. Luckily, many software tools can apply OCR to a PDF file in seconds.

Instructions

  1. Free Online OCR

    • 1

      Visit the Free Online OCR website.

    • 2

      Click the "Choose File" button.

    • 3

      Select the language in the original PDF (for example, English) from the drop-down menu.

    • 4

      Type the two words you see in the box. This is known as a CAPTCHA and prevents automated computer bots from using the service.

    • 5

      Click the "Send File" button.

    • 6

      All the text from the original PDF is displayed in a text box on the screen. To save the text, highlight it with your mouse, press "Ctrl+C" (for Copy), open a new document in your word processing program (for example, Word or Notepad), and press "Ctrl+V" (for Paste). Save the file by pressing "Ctrl+S."

    Adobe Acrobat

    • 7

      Download and install Adobe Acrobat.

    • 8

      Choose "Open" from the "File" menu in Adobe Acrobat. Double-click on your original PDF file to open it.

    • 9

      Click "Document" in the file menu. Choose "OCR Text Recognition" then "Recognize Text Using OCR."

    • 10

      Choose "Save" from the "File" menu to save your updated PDF.

    OmniPage Pro

    • 11

      Download and install OmniPage Pro.

    • 12

      Choose "1-2-3" from the first drop-down list.

    • 13

      Choose "Load Files" from the second drop-down list. Click the button right above it.

    • 14

      Locate your PDF and double-click on it.

    • 15

      Choose "Automatic" from the third drop-down list. Click the button right above it to perform OCR on your PDF.

    • 16

      Choose "Save to File" from the fourth drop-down list. Click the button right above it to save your newly-OCRed PDF.

Tips & Warnings

  • For best results, start with a PDF that has black text on a white background and uses a standard font (for example, Arial or Times New Roman) in size 12 or greater.

Related Searches:

References

Resources

Comments

You May Also Like

  • What Is OCR Software Used For?

    OCR, or optical character recognition, is a software package used with scanners to convert documents containing text to editable documents once they...

  • What Is OCR Scanning?

    OCR stands for "Optical Character Recognition" and is used by computers to translate text from typed physical page to a computer file....

  • How to Convert PDF to OCR

    When a hard-copy document is scanned and saved into PDF format, a computer does not know the difference between your scanned page...

  • How to OCR a Document

    Optical Character Recognition (OCR) software analyzes a scanned document saved to an image file and categorizes each character based on the font...

  • How to Use OCR Library in a PDF

    The OCR engine in Adobe Acrobat is a useful feature that converts image characters in a scanned PDF document to rendered text,...

  • How to Use MS Word

    Microsoft Word is a popular, powerful word processing program. While the most common use of Word is probably typing up documents on...

  • How to Convert TIF to PDF OCR

    Files with the .TIF extension, or TIFF files, are often used for storing digital photos or other graphics consisting of many colors....

  • How to OCR a PDF in Onenote

    OCR stands for optical character recognition. It is a system that identifies alpha and numeric characters in document hard copies, digital pictures...

  • How to Use Function Keys in Microsoft Word

    Microsoft Word 2010 has a rich set of predefined keyboard shortcuts. Each shortcut performs the same function of a sequence of mouse...

Related Ads

Featured