IJ Scan Utility (Windows) - Extracting Text from Scanned Images (OCR) (MB5120 / MB5420)
Article ID: ART176121 | Date published: 01/10/2020 | Date last updated: 01/13/2020
 

Description

This article provides basic steps for extracting text from scanned images in IJ Scan Utility for Windows.

 

Solution

Extracting Text from Scanned Images (OCR)

Click OCR in the IJ Scan Utility main screen to extract the text from scanned magazines and newspapers and display it in a specified application.


 Note

  • You can also extract text from Document, Custom, or ScanGear.
 
  1. Start IJ Scan Utility.

  2. Click Settings..., set the document type, resolution, and other options in the Settings (OCR) dialog box, and then select the application in which you want to display the result.

    After adjusting the settings as desired, click OK.


     Note

    • For Resolution, only 300 dpi or 400 dpi can be set.

    • If a compatible application is not installed, the text in the image is extracted and appears in your text editor.
      The text that is displayed depends on the Document Language setting in the Settings (General Settings) dialog box. In Document Language, select the language of the text you want to extract, and then scan.

    • You can add an application from the pull-down menu.
       

  3. Click OCR.


    figure: IJ Scan Utility
     

    Scanning starts.

    When scanning is completed, the scanned images are saved according to the settings, and the extracted text appears in the specified application.


     Note

    • Click Cancel to cancel the scan.
    • Text displayed in your text editor is for guidance only. Text in images of the following types of documents may not be detected correctly:

      • Documents containing text with font size outside the range of 8 points to 40 points (at 300 dpi)

      • Slanted documents

      • Documents placed upside down or documents with text in the wrong orientation (rotated characters)

      • Documents containing special fonts, effects, italics, or hand-written text

      • Documents with narrow line spacing

      • Documents with colors in the background of text

      • Documents containing multiple languages
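
The font size limit above can be easier to judge in pixel terms. The following is a minimal sketch, not part of IJ Scan Utility, that converts a point size to its nominal height in pixels at 300 dpi using the standard definition of 1 point = 1/72 inch, and checks the size against the 8 to 40 point range stated above. The function and constant names are hypothetical and chosen for illustration only. The range is given only for 300 dpi, so the check is not extended to other resolutions.

```python
# Illustrative sketch only: these names are hypothetical and are not part of
# IJ Scan Utility or any Canon software.

POINTS_PER_INCH = 72   # standard definition: 1 point = 1/72 inch
REFERENCE_DPI = 300    # resolution at which the article states the range
MIN_POINTS = 8         # smallest font size in the stated range
MAX_POINTS = 40        # largest font size in the stated range


def points_to_pixels(points: float, dpi: int = REFERENCE_DPI) -> float:
    """Return the nominal height in pixels of a font of the given point size."""
    return points * dpi / POINTS_PER_INCH


def font_size_in_stated_range(points: float) -> bool:
    """Return True if the point size is within the 8 to 40 point range."""
    return MIN_POINTS <= points <= MAX_POINTS


if __name__ == "__main__":
    for size in (6, 8, 12, 40, 48):
        pixels = points_to_pixels(size)
        status = "within" if font_size_in_stated_range(size) else "outside"
        print(f"{size} pt is about {pixels:.0f} px at {REFERENCE_DPI} dpi: "
              f"{status} the stated range")
```

Under these assumptions, the stated range corresponds to text roughly 33 to 167 pixels in nominal height in a 300 dpi scan. Actual glyph heights are smaller than the nominal point size, so treat the figures as rough guidance.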