International business machines corporation (20240193978). DOCUMENT IMAGE TEMPLATE MATCHING simplified abstract

From WikiPatents
Jump to navigation Jump to search

DOCUMENT IMAGE TEMPLATE MATCHING

Organization Name

international business machines corporation

Inventor(s)

Ang Yi of Beijing (CN)

Jing Zhang of Beijing (CN)

Hai Cheng Wang of Beijing (CN)

Jun Hong Zhao of ShangDi (CN)

Rajesh M. Desai of San Jose CA (US)

Yang Zhong Li of Beijing (CN)

Ye Chen of Beijing (CN)

DOCUMENT IMAGE TEMPLATE MATCHING - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240193978 titled 'DOCUMENT IMAGE TEMPLATE MATCHING

The abstract of the patent application describes computer-implemented methods, systems, and computer program products that merge multiple pages of a document into a single document image. The program code then processes this single document image to identify structural elements and textual content. By comparing the structural elements of the document image to a group of document templates stored in a database, the program code can identify a subset of templates with a threshold number of similarities to the document image. A graph structure representing the document is generated from the single document image, which includes visual information and connections related to the structural elements and textual content. This structure is used to identify the document template that is the closest match to the document.

  • Key Features and Innovation:
 - Merging multiple pages of a document into a single document image
 - Identifying structural elements and textual content within the document image
 - Comparing document image to a group of document templates to find the closest match
 - Generating a graph structure representing the document for analysis
  • Potential Applications:
 - Document management systems
 - Automated document processing
 - Content analysis and comparison tools
  • Problems Solved:
 - Streamlining document merging and analysis processes
 - Improving accuracy in identifying document templates
 - Enhancing document organization and retrieval
  • Benefits:
 - Increased efficiency in document handling
 - Improved accuracy in document analysis
 - Simplified document template matching
  • Commercial Applications:
 - Document management software
 - Data extraction and analysis tools
 - Content management systems
  • Prior Art:
 - Prior research on document merging and template matching algorithms
 - Existing technologies for document analysis and comparison
  • Frequently Updated Research:
 - Ongoing advancements in document processing and analysis technologies
 - Research on improving accuracy and efficiency in document management systems

Questions about the technology: 1. How does this technology improve document processing efficiency? 2. What are the potential implications of this technology for content management systems?


Original Abstract Submitted

computer implemented methods, systems, and computer program products include program code executing on a processor(s) that merges a document comprising multiple pages into a single document image. the program code processes the single document image to identify structural elements and textual content. the program code compares the structural elements of the single document image to other structural elements of a group of document templates stored in a database to identify a subset of the group of documents templates with a threshold number of similarities to the single document image. the program code generates, from the single document image, a graph structure representing the document, where the graph structure comprises visual information and connections related to the structural elements and concepts comprising the textual content. the program code uses the structure to identify a document template that is a closest match to the document.