18455775. DATA-PRIVACY-PRESERVING SYNTHESIS OF REALISTIC SEMI-STRUCTURED TABULAR DATA (SAP SE)
Organization Name
SAP SE
Inventor(s)
Matthias Frank of Heidelberg (DE)
Sundeep Gullapudi of Singapore (SG)
Rajesh Vellore Arumugam of Singapore (SG)
Anantharaman Ravi of Singapore (SG)
Prawira Putra Fadjar of Singapore (SG)
Yi Quan Zhou of Singapore (SG)
This abstract first appeared for US patent application 18455775, titled 'DATA-PRIVACY-PRESERVING SYNTHESIS OF REALISTIC SEMI-STRUCTURED TABULAR DATA'.
Original Abstract Submitted
Methods, systems, and computer-readable storage media for receiving a real data table, providing a synthetic structured data table based on the real data table, providing a sampled data table comprising a sub-set of real data of the real data table, transmitting a prompt to an LLM system, the prompt being generated based on the real data table and the synthetic structured data table, receiving synthetic unstructured data from the LLM system, providing an aggregate synthetic table that includes at least a portion of the synthetic unstructured data, and training an ML model using the aggregate synthetic table.
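The sequence of steps in the abstract can be illustrated with a minimal Python sketch. It is not the method claimed in the application; every name in it is hypothetical. It assumes that the structured columns are synthesized by drawing each column independently from its observed values, that the prompt shows the sampled real rows as examples, and that the LLM system is replaced by a stub (fake_llm) returning placeholder text. The final step, training an ML model on the aggregate table, is omitted.

```python
import random

def synthesize_structured(real_rows, structured_cols, n, rng):
    # Assumption: each structured column is drawn independently from the
    # values observed in the real table. The abstract does not say how
    # the synthetic structured table is actually produced.
    return [
        {col: rng.choice([row[col] for row in real_rows]) for col in structured_cols}
        for _ in range(n)
    ]

def sample_rows(real_rows, k, rng):
    # Sampled data table: a subset of the real rows.
    return rng.sample(real_rows, min(k, len(real_rows)))

def build_prompt(examples, synthetic_rows, text_col):
    # Hypothetical prompt layout: example rows first, then rows to complete.
    lines = ["Example rows with free text:"]
    lines += [repr(row) for row in examples]
    lines.append(f"Write a '{text_col}' value for each row below, one per line:")
    lines += [repr(row) for row in synthetic_rows]
    return "\n".join(lines)

def fake_llm(prompt, n):
    # Stand-in for the LLM system; a real call would send `prompt` to a model.
    return [f"synthetic text {i}" for i in range(n)]

def aggregate(synthetic_rows, texts, text_col):
    # Aggregate synthetic table: structured values plus generated text.
    return [{**row, text_col: text} for row, text in zip(synthetic_rows, texts)]

rng = random.Random(0)
real = [
    {"category": "travel", "amount": 120, "note": "Taxi to the airport"},
    {"category": "meals", "amount": 35, "note": "Team lunch"},
    {"category": "travel", "amount": 480, "note": "Return flight"},
]
synthetic = synthesize_structured(real, ["category", "amount"], 4, rng)
examples = sample_rows(real, 2, rng)
prompt = build_prompt(examples, synthetic, "note")
texts = fake_llm(prompt, len(synthetic))
table = aggregate(synthetic, texts, "note")
```

In this sketch the free-text column of the aggregate table never copies a real value directly, but the prompt still contains sampled real rows, so any privacy guarantee would depend on details the abstract does not state.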