18671039. STORING AND QUERYING KNOWLEDGE GRAPHS IN COLUMN STORES USING A GLOBAL DICTIONARY simplified abstract (SAP SE)
STORING AND QUERYING KNOWLEDGE GRAPHS IN COLUMN STORES USING A GLOBAL DICTIONARY
Organization Name
SAP SE
Inventor(s)
Christian Bensberg of Heidelberg (DE)
Jonathan Dees of Karlsruhe (DE)
STORING AND QUERYING KNOWLEDGE GRAPHS IN COLUMN STORES USING A GLOBAL DICTIONARY - A simplified explanation of the abstract
This abstract first appeared for US patent application 18671039 titled 'STORING AND QUERYING KNOWLEDGE GRAPHS IN COLUMN STORES USING A GLOBAL DICTIONARY'.
Simplified Explanation:
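This patent application describes a system for integrating and querying triple-formatted data in a database using dictionary compression. The system identifies a table that stores triples in four or more columns (one subject column, one predicate column, and two or more object columns), generates a single master dictionary shared by the subject and predicate columns when both use the same datatype, and replaces the values in those two columns based on the unique values in the master dictionary.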
- The system identifies a database table with four or more columns of triple-formatted data: one subject column, one predicate column, and two or more object columns.
- It determines that a master dictionary should be generated for both the subject column and the predicate column because both use an identical datatype.
- The master dictionary includes the unique values from both the subject data dictionary and the predicate data dictionary.
- Values in the subject and predicate columns are replaced based on the unique values in the master dictionary.
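The steps above can be sketched in Python. This is a minimal illustration, not the implementation described in the application: the function names, the sample `ex:` values, the sorted dictionary order, and the use of list positions as integer value IDs are all assumptions made for the example.

```python
def build_dictionary(column):
    """Per-column dictionary: the sorted unique values of one column."""
    return sorted(set(column))


def build_master_dictionary(subject_dict, predicate_dict):
    """Union of both per-column dictionaries (sorted order is an assumption)."""
    return sorted(set(subject_dict) | set(predicate_dict))


def encode(column, dictionary):
    """Replace each value with its position (value ID) in the dictionary."""
    value_to_id = {value: idx for idx, value in enumerate(dictionary)}
    return [value_to_id[value] for value in column]


def decode(encoded_column, dictionary):
    """Map value IDs back to the original values."""
    return [dictionary[idx] for idx in encoded_column]


# Hypothetical sample data; "ex:knows" occurs as a subject and as a predicate.
subjects = ["ex:alice", "ex:alice", "ex:knows", "ex:bob"]
predicates = ["ex:knows", "ex:age", "rdf:type", "ex:knows"]

subject_dict = build_dictionary(subjects)
predicate_dict = build_dictionary(predicates)
master = build_master_dictionary(subject_dict, predicate_dict)

encoded_subjects = encode(subjects, master)
encoded_predicates = encode(predicates, master)
```

In this sketch `ex:knows` maps to the same value ID in both encoded columns, so a value can be compared across the subject and predicate columns by ID alone, without decoding it first.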
Key Features and Innovation:
- Integration and querying system with dictionary compression
- Identification of tables with triple-formatted data
- Generation of master dictionary for subject and predicate columns
- Replacement of values in columns with unique values from master dictionary
Potential Applications:
- Database management systems
- Data analytics platforms
- Information retrieval systems
Problems Solved:
- Efficient data storage and retrieval
- Improved query performance
- Reduced storage space requirements
Benefits:
- Faster query processing
- Reduced storage costs
- Enhanced data compression
Commercial Applications:
Questions about the Technology:
1. How does the system handle tables with triple-formatted data?
2. What are the advantages of using dictionary compression in database systems?
Original Abstract Submitted
Various embodiments for a triple integration and querying system with dictionary compression are described herein. An embodiment operates by identifying a table of a database with four or more columns with triple formatted data including one subject column, one predicate column, and two or more object columns. It is determined that a master dictionary is to be generated for both the subject column and the predicate column based on an identical datatype being used for both columns. A master dictionary including both the unique values from the subject data dictionary and the predicate data dictionary is generated. Values in the subject column and the predicate column are replaced based on the unique values from the master dictionary.
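The eligibility check described in the abstract could look roughly like the following Python sketch. The `Column` class, the role labels, and the datatype strings are hypothetical; the abstract does not say how a real system identifies column roles or represents datatypes.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Column:
    name: str
    role: str      # assumed labels: "subject", "predicate", or "object"
    datatype: str  # assumed to be a plain type name such as "NVARCHAR"


def qualifies_for_master_dictionary(columns):
    """Return True if the table has exactly one subject column, exactly one
    predicate column, two or more object columns, and the subject and
    predicate columns share an identical datatype."""
    subjects = [c for c in columns if c.role == "subject"]
    predicates = [c for c in columns if c.role == "predicate"]
    objects = [c for c in columns if c.role == "object"]

    is_triple_table = (
        len(subjects) == 1 and len(predicates) == 1 and len(objects) >= 2
    )
    if not is_triple_table:
        return False
    return subjects[0].datatype == predicates[0].datatype


# Hypothetical table layout: one object column per object datatype.
triple_table = [
    Column("subject", "subject", "NVARCHAR"),
    Column("predicate", "predicate", "NVARCHAR"),
    Column("object_string", "object", "NVARCHAR"),
    Column("object_number", "object", "DOUBLE"),
]
eligible = qualifies_for_master_dictionary(triple_table)
```

The one-subject, one-predicate, and two-or-more-object conditions already imply at least four columns, so the sketch does not count columns separately.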