17936874. GENERATING A PERSONAL CORPUS simplified abstract (International Business Machines Corporation)

From WikiPatents
Jump to navigation Jump to search

GENERATING A PERSONAL CORPUS

Organization Name

International Business Machines Corporation

Inventor(s)

KENTA Watanabe of Soka-shi (JP)

Takahito Tashiro of Mitaka-shi (JP)

Takashi Fukuda of TOKYO (JP)

TAIHEI Miyamoto of Nakano (JP)

GENERATING A PERSONAL CORPUS - A simplified explanation of the abstract

This abstract first appeared for US patent application 17936874 titled 'GENERATING A PERSONAL CORPUS

Simplified Explanation

The abstract describes a method for generating a user-specific personal corpus by creating a basic corpus for a first user and updating it with text extracted from data sources associated with the user.

  • A basic corpus is created for a first user using a set of data sources.
  • Text is extracted from a second set of data sources associated with the first user.
  • If an unknown word is found in the extracted text, the basic corpus is updated by replacing the vector of the unknown word with an average vector of the basic words in the corpus and registering the unknown word in a personal corpus.

Potential Applications

This technology could be applied in:

  • Personalized content recommendations
  • Customized language learning platforms

Problems Solved

This technology solves:

  • Lack of personalized content for users
  • Difficulty in adapting language learning materials to individual users

Benefits

The benefits of this technology include:

  • Improved user experience
  • Enhanced personalization in content delivery

Potential Commercial Applications

A potential commercial application of this technology could be:

  • Personalized e-learning platforms for language education

Possible Prior Art

One possible prior art for this technology could be:

  • Systems for personalized content recommendations based on user preferences

Unanswered Questions

How does the system handle user privacy and data security?

The article does not provide information on the measures taken to ensure user data privacy and security in the process of generating and updating the personal corpus.

What is the scalability of the system for a large number of users?

The scalability of the system for a large user base is not addressed in the article.


Original Abstract Submitted

In an approach for generating a user-specific personal corpus, a processor creates a basic corpus for a first user using a first set of data sources, wherein the basic corpus includes one or more basic words and one or more vectors of the one or more basic words. A processor extracts a set of text from a second set of data sources associated with the first user. Responsive to finding an unknown word included in the set of text extracted, a processor updates the basic corpus, wherein the basic corpus is updated by replacing a vector of the unknown word with an average vector of the one or more basic words in the basic corpus created and registering the unknown word in a first personal corpus.