International business machines corporation (20240160607). KEYPHRASE GENERATION LEVERAGING PUBLIC REPOSITORY CATEGORIES simplified abstract

From WikiPatents
Jump to navigation Jump to search

KEYPHRASE GENERATION LEVERAGING PUBLIC REPOSITORY CATEGORIES

Organization Name

international business machines corporation

Inventor(s)

Gaetano Rossiello of Brooklyn NY (US)

Md Faisal Mahbub Chowdhury of Woodside NY (US)

Alfio Massimiliano Gliozzo of Brooklyn NY (US)

Nandana Mihindukulasooriya of Cambridge MA (US)

Michael Robert Glass of Bayonne NJ (US)

KEYPHRASE GENERATION LEVERAGING PUBLIC REPOSITORY CATEGORIES - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240160607 titled 'KEYPHRASE GENERATION LEVERAGING PUBLIC REPOSITORY CATEGORIES

Simplified Explanation

The patent application relates to a process for generating file classifications based on keyphrases derived from evaluating input files, allowing for file system organization and query augmentation.

  • The system includes a memory storing computer executable components and a processor executing these components.
  • A generating component creates keyphrases based on context from evaluating input files using a public repository of annotated files.
  • An execution component classifies input files based on the keyphrase, which can augment queries.

Potential Applications

This technology could be applied in:

  • Document management systems
  • Information retrieval systems

Problems Solved

This technology helps in:

  • Organizing files efficiently
  • Enhancing search capabilities

Benefits

The benefits of this technology include:

  • Improved file organization
  • Enhanced search accuracy

Potential Commercial Applications

A potential commercial application could be in:

  • Enterprise content management systems

Possible Prior Art

One possible prior art could be:

  • Existing file classification systems

Unanswered Questions

How does this technology handle multi-word keyphrases?

The system likely uses algorithms to process and generate keyphrases from input files, but the specific method is not detailed in the abstract.

What is the scalability of this technology for large file repositories?

While the abstract mentions a public repository of annotated files, it does not specify how the system handles scalability issues when dealing with a large number of files.


Original Abstract Submitted

one or more systems, devices, computer program products and/or computer-implemented methods of use provided herein relate to a process for generating the classification of files to allow for file system organization and/or query augmentation. a system can comprise a memory that stores computer executable components, and a processor that executes the computer executable components stored in the memory, wherein the computer executable components can comprise a generating component that generates a keyphrase based on a context derived from evaluation of an input file, wherein the generating component employs a public repository of files annotated with a plurality of keyphrases, including the keyphrase, to generate the keyphrase based on the context, and an execution component that classifies the input file based on the keyphrase. in one or more embodiments, the input file can comprise a query, and classification of the input file can comprise augmenting the query based on the keyphrase.