20250217379. Incremental Execution Extr (Databricks, .)
INCREMENTAL EXECUTION OF EXTRACT, TRANSFORM, LOAD PROCESS USING MICROTECHNIQUES ARCHITECTURE
Abstract: a system receives etl specification for processing stream data, including a transform operation represented using a database query specification. the system generates a dataflow graph of a sequence of database queries by decomposing the database query into a first database query that generates an intermediate results table, and a second database query that receives as input the intermediate results table and outputs data used for performing the transform operation. the system executes the sequence of database queries for performing the transform operation on stream data received from the source. when receiving an incremental data set, the system determines an output change set based on the received incremental data set by traversing an execution plan and processing each operator in the execution plan, and computing a change set of a particular operator from the change sets output by the one or more other operators based on the incremental data set.
Inventor(s): Michael Paul Armbrust, Vuk Ercegovac, Paul Lappas, Xi Liang, Mukul Murthy, Yannis Papakonstantinou, Nitin Sharma, John Sismanis, Joseph Torres, Min Yang
CPC Classification: G06F16/254 ({Extract, transform and load [ETL] procedures, e.g. ETL data flows in data warehouses})
Search for rejections for patent application number 20250217379
- Patent Applications
- Databricks, Inc.
- CPC G06F16/254
- Michael Paul Armbrust of Berkeley CA US
- Vuk Ercegovac of Campbell CA US
- Paul Lappas of Seattle WA US
- Xi Liang of Santa Clara CA US
- Mukul Murthy of Berkeley CA US
- Yannis Papakonstantinou of La Jolla CA US
- Nitin Sharma of Sammamish WA US
- John Sismanis of Campbell CA US
- Joseph Torres of San Francisco CA US
- Min Yang of Mountain View CA US