17961069. ENHANCED DOCUMENT INGESTION USING NATURAL LANGUAGE PROCESSING simplified abstract (International Business Machines Corporation)

From WikiPatents
Jump to navigation Jump to search

ENHANCED DOCUMENT INGESTION USING NATURAL LANGUAGE PROCESSING

Organization Name

International Business Machines Corporation

Inventor(s)

Susan Hallen of Chicago IL (US)

Rose Fleischman of Poughkeepsie NY (US)

[[:Category:Alan Daet Mej�a Villase�or of Guadalajara (MX)|Alan Daet Mej�a Villase�or of Guadalajara (MX)]][[Category:Alan Daet Mej�a Villase�or of Guadalajara (MX)]]

Diane Helen Wasserstrom of Poughkeepsie NY (US)

ENHANCED DOCUMENT INGESTION USING NATURAL LANGUAGE PROCESSING - A simplified explanation of the abstract

This abstract first appeared for US patent application 17961069 titled 'ENHANCED DOCUMENT INGESTION USING NATURAL LANGUAGE PROCESSING

Simplified Explanation

The abstract describes a method for enhanced document ingestion using natural language processing, where terms unknown to one model are identified and compared to terms known to another model for improved processing.

  • Identification of unknown terms in a document using a natural language processing model
  • Comparison of unknown terms to known terms from another natural language processing model
  • Reprocessing of the document using the second model if matching terms are found

Potential Applications

This technology could be applied in various industries such as information retrieval, data analysis, and content management systems.

Problems Solved

This innovation helps improve the accuracy and efficiency of document processing by leveraging multiple natural language processing models.

Benefits

The use of multiple natural language processing models enhances the understanding and processing of documents, leading to better insights and decision-making.

Potential Commercial Applications

One potential commercial application of this technology could be in the development of advanced document management systems for businesses.

Possible Prior Art

Prior art in this field may include research on natural language processing, document analysis, and machine learning techniques for text processing.

Unanswered Questions

How does this method handle documents in multiple languages?

The abstract does not specify how the method deals with documents containing terms in different languages and if language translation is involved in the processing.

What are the computational requirements for reprocessing documents using a different natural language processing model?

The abstract does not provide information on the computational resources needed to reprocess documents with a different natural language processing model and if there are any limitations in scalability.


Original Abstract Submitted

Methods, systems, and computer program products for enhanced document ingestion using natural language processing are provided herein. A computer-implemented method includes identifying, within a first document, terms unknown to a first natural language processing model by processing the first document using the first natural language processing model; identifying, within a second document, terms known to a second natural language processing model by processing the second document using the second natural language processing model; comparing the terms unknown to the first natural language processing model to the terms known to the second natural language processing model; and upon determining that at least one of the terms unknown to the first natural language processing model matches at least one of the terms known to the second natural language processing model, reprocessing the first document using the second natural language processing model.