17936874. GENERATING A PERSONAL CORPUS simplified abstract (International Business Machines Corporation)
Contents
- 1 GENERATING A PERSONAL CORPUS
- 1.1 Organization Name
- 1.2 Inventor(s)
- 1.3 GENERATING A PERSONAL CORPUS - A simplified explanation of the abstract
- 1.4 Simplified Explanation
- 1.5 Potential Applications
- 1.6 Problems Solved
- 1.7 Benefits
- 1.8 Potential Commercial Applications
- 1.9 Possible Prior Art
- 1.10 Unanswered Questions
- 1.11 Original Abstract Submitted
GENERATING A PERSONAL CORPUS
Organization Name
International Business Machines Corporation
Inventor(s)
KENTA Watanabe of Soka-shi (JP)
Takahito Tashiro of Mitaka-shi (JP)
TAIHEI Miyamoto of Nakano (JP)
GENERATING A PERSONAL CORPUS - A simplified explanation of the abstract
This abstract first appeared for US patent application 17936874 titled 'GENERATING A PERSONAL CORPUS
Simplified Explanation
The abstract describes a method for generating a user-specific personal corpus by creating a basic corpus for a first user and updating it with text extracted from data sources associated with the user.
- A basic corpus is created for a first user using a set of data sources.
- Text is extracted from a second set of data sources associated with the first user.
- If an unknown word is found in the extracted text, the basic corpus is updated by replacing the vector of the unknown word with an average vector of the basic words in the corpus and registering the unknown word in a personal corpus.
Potential Applications
This technology could be applied in:
- Personalized content recommendations
- Customized language learning platforms
Problems Solved
This technology solves:
- Lack of personalized content for users
- Difficulty in adapting language learning materials to individual users
Benefits
The benefits of this technology include:
- Improved user experience
- Enhanced personalization in content delivery
Potential Commercial Applications
A potential commercial application of this technology could be:
- Personalized e-learning platforms for language education
Possible Prior Art
One possible prior art for this technology could be:
- Systems for personalized content recommendations based on user preferences
Unanswered Questions
How does the system handle user privacy and data security?
The article does not provide information on the measures taken to ensure user data privacy and security in the process of generating and updating the personal corpus.
What is the scalability of the system for a large number of users?
The scalability of the system for a large user base is not addressed in the article.
Original Abstract Submitted
In an approach for generating a user-specific personal corpus, a processor creates a basic corpus for a first user using a first set of data sources, wherein the basic corpus includes one or more basic words and one or more vectors of the one or more basic words. A processor extracts a set of text from a second set of data sources associated with the first user. Responsive to finding an unknown word included in the set of text extracted, a processor updates the basic corpus, wherein the basic corpus is updated by replacing a vector of the unknown word with an average vector of the one or more basic words in the basic corpus created and registering the unknown word in a first personal corpus.