Databricks, Inc. (20240256549). Evaluating Expressions Over Dictionary Data simplified abstract

From WikiPatents

Evaluating Expressions Over Dictionary Data

Organization Name

Databricks, Inc.

Inventor(s)

Utkarsh Agarwal of San Francisco CA (US)

Shoumik Palkar of San Francisco CA (US)

Alexander Behm of San Francisco CA (US)

Sriram Krishnamurthy of San Francisco CA (US)

Evaluating Expressions Over Dictionary Data - A simplified explanation of the abstract

This abstract first appeared for US patent application 20240256549, titled 'Evaluating Expressions Over Dictionary Data'.

The abstract describes a method for evaluating a query on a columnar dataset with dictionaries associated with columns, stored on cloud storage.

  • The method involves receiving a query request with an operator for the dataset.
  • The dataset contains columns based on dictionaries mapping values to identifiers.
  • The operator is evaluated on dictionary values to generate an updated dictionary with new values.
  • The updated dictionary can be decoded into an updated column with new data values.
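The steps above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method claimed in the application: the dictionary is held as an identifier-to-value mapping (the decoding direction), the column is a list of identifiers, and the names `evaluate_on_dictionary`, `decode`, and the sample data are hypothetical.

```python
from typing import Callable, Dict, List


def evaluate_on_dictionary(
    dictionary: Dict[int, str], op: Callable[[str], str]
) -> Dict[int, str]:
    """Apply op once per dictionary entry; identifiers stay unchanged."""
    return {ident: op(value) for ident, value in dictionary.items()}


def decode(ids: List[int], dictionary: Dict[int, str]) -> List[str]:
    """Expand a column of identifiers back into data values."""
    return [dictionary[i] for i in ids]


# Hypothetical dictionary-encoded column: three distinct values, six rows.
dictionary = {0: "red", 1: "green", 2: "blue"}
ids = [0, 1, 0, 2, 1, 0]

# Evaluate the operator (here, upper-casing) on the dictionary values only.
updated_dictionary = evaluate_on_dictionary(dictionary, str.upper)

# Optionally decode the updated dictionary into an updated column.
updated_column = decode(ids, updated_dictionary)
```

In this sketch the operator runs three times, once per dictionary entry, even though the column has six rows.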

Potential Applications:

  • Data analysis and querying in cloud-based storage systems.
  • Database management and optimization for large datasets.

Problems Solved:

  • Efficient evaluation of queries on columnar datasets with dictionaries.
  • Improved data processing and storage techniques for cloud environments.

Benefits:

  • Faster query performance on large datasets.
  • Enhanced data organization and retrieval capabilities.
  • Reduced storage space and processing requirements.

Commercial Applications: Cloud-Based Data Query and Optimization System. This technology can be used in cloud computing services, data analytics platforms, and enterprise database systems to enhance query performance and optimize data storage.

Prior Art: Readers can explore prior research on columnar data storage, dictionary encoding techniques, and cloud-based query optimization for background on related technologies.

Frequently Updated Research: Stay updated on advancements in cloud computing, data management, and query optimization techniques to leverage the latest innovations in this field.

Questions about Columnar Dataset Query Evaluation:

1. How does the method improve query performance on columnar datasets?
   The method evaluates the operator on the values in the dictionary, producing an updated dictionary that can then be decoded into an updated column.

2. What are the key benefits of using dictionaries in columnar datasets?
   Dictionaries map column values to identifiers, which reduces storage space and improves query efficiency.
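To illustrate how a dictionary maps values to identifiers, here is a small Python sketch of dictionary-encoding a column. It is an assumption-laden example rather than the encoding used by any particular system: identifiers are assigned in order of first appearance, and the function name `dictionary_encode` and the sample column are hypothetical.

```python
from typing import Dict, List, Tuple


def dictionary_encode(column: List[str]) -> Tuple[Dict[str, int], List[int]]:
    """Map each distinct value to an identifier and rewrite the column as identifiers."""
    value_to_id: Dict[str, int] = {}
    ids: List[int] = []
    for value in column:
        if value not in value_to_id:
            # Assumption: the next unused integer becomes the identifier.
            value_to_id[value] = len(value_to_id)
        ids.append(value_to_id[value])
    return value_to_id, ids


# Hypothetical column with repeated values.
column = ["US", "DE", "US", "US", "FR", "DE"]
value_to_id, ids = dictionary_encode(column)

# The dictionary stores each distinct value once; the column stores small integers.
distinct_count = len(value_to_id)
row_count = len(ids)
```

Here six rows are represented by three dictionary entries plus six integer identifiers, so each repeated string is stored only once.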


Original Abstract Submitted

Disclosed herein is a method, system, or non-transitory computer readable medium for evaluating a query on a columnar dataset comprising one or more dictionaries associated with columns in the dataset. The method includes receiving a request to perform a query comprising at least an operator for a columnar dataset on cloud storage. At least one column in the dataset is based on a dictionary, and the dictionary maps one or more values for a column to one or more respective identifiers. The method evaluates the operator on one or more values of the dictionary to generate an updated dictionary comprising updated values. The method may decode the updated dictionary into an updated column comprising updated data values.