Patent Applications by Databricks, Inc. on March 27th, 2025

Databricks, Inc. has applied for patents in the areas of G06F16/23 (1), G06F21/62 (1) G06F16/2315 (1), G06F21/6281 (1)

With keywords such as: data, records, task, subset, table, version, processing, configuration, transaction, and datasets in patent application abstracts.

Patent Applications by Databricks, Inc.

20250103580. CONCURRENT OPTIMISTIC TRANSACTIONS FOR TABLES WITH DELETION VECTORS_simplified_abstract_(databricks, inc.)

Inventor(s): Bart Samwel of Oegstgeest NL for databricks, inc., Christos Stavrakakis of Berlin DE for databricks, inc.

IPC Code(s): G06F16/23

CPC Code(s): G06F16/2315

Abstract: a disclosed configuration receives a first indication that a first transaction is committed to update a first subset of records in a data table at a first version to generate a second version of the data table and receiving a second indication to commit a second transaction to update a second subset of records in a data file of the data table at the first version. the configuration determines a logical prerequisite based on whether the first subset of records changes content of one or more records in the second subset of records and determining a physical prerequisite on whether the second subset of records corresponds to respective data records in data files of the second version of the data table. the configuration commits the second transaction to generate a third version of the data table by updating elements of the deletion vector if the prerequisites are satisfied.

20250103753. CLEAN ROOM GENERATION FOR DATA COLLABORATION AND EXECUTING CLEAN ROOM TASK IN DATA PROCESSING PIPELINE_simplified_abstract_(databricks, inc.)

Inventor(s): William Chau of Redwood City CA US for databricks, inc., Abhijit Chakankar of San Jose CA US for databricks, inc., Stephen Michael Mahoney of Portland OR US for databricks, inc., Daniel Seth Morris of Cranford NJ US for databricks, inc., Itai Shlomo Weiss of River Vale NJ US for databricks, inc.

IPC Code(s): G06F21/62

CPC Code(s): G06F21/6281

Abstract: a data processing service facilitates the creation and processing of data processing pipelines that process data processing jobs defined with respect to a set of tasks in a sequence and with data dependencies associated with each separate task such that the output from one task is used as input for a subsequent task. in various embodiments, the set of tasks include at least one cleanroom task that is executed in a cleanroom station and at least one non-cleanroom task executed in an execution environment of a user where each task is configured to read one or more input datasets and transform the one or more input datasets into one or more output datasets.

Databricks, Inc. patent applications on March 27th, 2025