18604347. INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND STORAGE MEDIUM simplified abstract (CANON KABUSHIKI KAISHA)

From WikiPatents

INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND STORAGE MEDIUM

Organization Name

CANON KABUSHIKI KAISHA

Inventor(s)

RYO Kosaka of Tokyo (JP)

INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND STORAGE MEDIUM - A simplified explanation of the abstract

This abstract first appeared for US patent application 18604347, titled 'INFORMATION PROCESSING APPARATUS, INFORMATION PROCESSING METHOD, AND STORAGE MEDIUM'.

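Simplified Explanation: The patent application describes an information processing apparatus that obtains a token string generated from the character strings in a document image. It feeds that token string into a trained model to determine the document type and the character strings for a first item, then applies those results to a rule-based algorithm to determine the character string for a second item.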

Key Features and Innovation:

  • Obtaining a token string generated from the character strings in a document image
  • Determining the document type and the character strings for a first item from the output of a trained model
  • Determining the character string for a second item by applying the document type and the first-item character strings to a rule-based algorithm
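
As a rough illustration of the first feature, the sketch below turns character strings read from a document image into a single token string. The abstract does not say how tokens are generated, so the function name build_token_string, the regular expression, and the "[SEP]" marker are all assumptions made for this example.

```python
import re

# Hypothetical tokenizer: the patent abstract does not specify a scheme.
# Decimal numbers stay whole, words stay whole, punctuation is split off.
TOKEN_PATTERN = re.compile(r"\d+(?:\.\d+)?|\w+|[^\w\s]")


def build_token_string(char_strings):
    """Flatten character strings from a document image into one token list.

    A "[SEP]" marker (an assumption, not from the source) is placed
    between consecutive character strings.
    """
    tokens = []
    for index, text in enumerate(char_strings):
        if index > 0:
            tokens.append("[SEP]")
        tokens.extend(TOKEN_PATTERN.findall(text))
    return tokens


# Example input standing in for OCR output; the values are made up.
ocr_strings = ["Invoice No. 123", "Total: 45.00"]
token_string = build_token_string(ocr_strings)
```

A real system would more likely use the tokenizer that matches its trained model; the regular expression here only shows the shape of the step.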

Potential Applications: This technology can be used in document processing, data extraction, and information retrieval systems.

Problems Solved: This technology streamlines the process of analyzing document images and extracting relevant information, improving efficiency and accuracy in data processing tasks.

Benefits:

  • Faster document analysis and data extraction
  • Improved accuracy in determining document types and character strings
  • Enhanced efficiency in information processing tasks

Commercial Applications: The technology can be applied in industries such as finance, legal, healthcare, and research for automating document analysis and data extraction processes, leading to increased productivity and accuracy.

Prior Art: Researchers can explore prior art related to document image processing, data extraction algorithms, and rule-based systems for information retrieval.

Frequently Updated Research: Stay updated on advancements in document image processing, machine learning models for text analysis, and rule-based algorithms for information extraction.

Questions about Document Image Processing:

  1. How does this technology improve document analysis processes?
  2. What are the potential applications of this information processing apparatus in different industries?


Original Abstract Submitted

Provided is an information processing apparatus including: an obtaining unit configured to obtain a token string generated based on character strings included in a document image; a first determination unit configured to determine a document type represented by the document image and character strings corresponding to a first item included in the document image by using a result obtained by inputting the token string into a trained model; and a second determination unit configured to determine a character string corresponding to a second item by applying the document type and the character strings corresponding to the first item to a rule-based algorithm.
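
The two determination stages in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the applicant's implementation: the trained model is replaced by a keyword stub, and the item names ("amount", a derived total), the RULES table, and every function name are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ModelResult:
    document_type: str
    first_item_strings: dict  # item name -> list of character strings


def run_trained_model(token_string):
    """Stand-in for the trained model (keyword stub, NOT a real model).

    Returns a document type and the character strings for a first item,
    here assumed to be decimal "amount" tokens.
    """
    doc_type = "invoice" if "Invoice" in token_string else "unknown"
    amounts = [
        t for t in token_string
        if "." in t and t.replace(".", "", 1).isdigit()
    ]
    return ModelResult(doc_type, {"amount": amounts})


# Hypothetical rule table: document type -> rule deriving the second item.
# The example rule treats the largest amount on an invoice as the total.
RULES = {
    "invoice": lambda items: max(items.get("amount", []), key=float, default=None),
}


def determine_second_item(result):
    """Apply the rule for the document type to the first-item strings."""
    rule = RULES.get(result.document_type)
    return rule(result.first_item_strings) if rule else None


# Made-up token string standing in for the obtaining unit's output.
token_string = ["Invoice", "Subtotal", "40.00", "Tax", "5.00", "Total", "45.00"]
result = run_trained_model(token_string)
total = determine_second_item(result)
```

Keying the rule table on document type mirrors the abstract's wording that both the document type and the first-item character strings are applied to the rule-based algorithm; which rules a real system would hold is not stated in the source.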