Snowflake Inc. (20250131017). MACHINE TIME ESTIMATION FOR CONTINUOUS MAINTENANCE OF CLUSTERED DATA
Organization Name
Snowflake Inc.
Inventor(s)
Varun Ganesh of San Bruno CA US
Kevin Ali Li of Burlingame CA US
Ryan Michael Thomas Shelly of San Francisco CA US
This abstract first appeared for US patent application 20250131017 titled 'MACHINE TIME ESTIMATION FOR CONTINUOUS MAINTENANCE OF CLUSTERED DATA'.
Original Abstract Submitted
A method includes sampling, by at least one hardware processor, a table using a clustering key to obtain a set of batches. Each batch of the set of batches includes a set of partitions of the table. A clustering job is performed for at least one batch of the set of batches. A machine processing cost associated with the clustering job is determined on a per-row basis. A total clustering cost associated with clustering data in the table is determined based on the machine processing cost on the per-row basis.
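The flow the abstract describes (sample batches, cluster a sample, derive a per-row cost, extrapolate to the table) can be illustrated with a minimal Python sketch. This is not the method claimed in the application or any Snowflake API. The row layout, the function names `sample_batches`, `cluster_batch`, and `estimate_total_cost`, the stride-based sampling, the sort-based stand-in for a clustering job, and the use of CPU time as the "machine processing cost" are all assumptions made for illustration.

```python
import time
from typing import Callable, List, Sequence, Tuple

# Hypothetical data layout: a row is (clustering key value, payload),
# a partition is a list of rows, and a batch is a list of partitions.
Row = Tuple[int, str]
Partition = List[Row]
Batch = List[Partition]


def sample_batches(table: Sequence[Partition], batch_size: int, stride: int) -> List[Batch]:
    """Order non-empty partitions by their minimum clustering-key value,
    group neighbours into batches of `batch_size` partitions, and keep
    every `stride`-th batch as the sample (an assumed sampling scheme)."""
    ordered = sorted((p for p in table if p), key=lambda p: min(r[0] for r in p))
    groups = [ordered[i:i + batch_size] for i in range(0, len(ordered), batch_size)]
    return groups[::stride]


def cluster_batch(batch: Batch) -> Batch:
    """Stand-in clustering job: sort every row in the batch by key and
    re-split the rows into partitions of the original sizes."""
    rows = sorted((r for p in batch for r in p), key=lambda r: r[0])
    out, start = [], 0
    for p in batch:
        out.append(rows[start:start + len(p)])
        start += len(p)
    return out


def estimate_total_cost(
    table: Sequence[Partition],
    batch_size: int,
    stride: int,
    clock: Callable[[], float] = time.process_time,
) -> float:
    """Time the clustering job on the sampled batches, convert the result
    to a per-row cost, and scale it by the table's total row count."""
    elapsed = 0.0
    sampled_rows = 0
    for batch in sample_batches(table, batch_size, stride):
        t0 = clock()
        cluster_batch(batch)
        elapsed += clock() - t0
        sampled_rows += sum(len(p) for p in batch)
    if sampled_rows == 0:
        return 0.0  # nothing sampled, so no estimate can be formed
    cost_per_row = elapsed / sampled_rows
    total_rows = sum(len(p) for p in table)
    return cost_per_row * total_rows
```

The `clock` parameter is injectable so the estimate can be checked deterministically. With the default `time.process_time`, the result is in CPU seconds and will vary from run to run.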