18657308. LANGUAGE MODEL SUMMARIZATION USING SEMANTICAL CLUSTERING (Oracle International Corporation)
LANGUAGE MODEL SUMMARIZATION USING SEMANTICAL CLUSTERING
Organization Name
Oracle International Corporation
Inventor(s)
Vinod M. Mamtani of Bellevue WA US
LANGUAGE MODEL SUMMARIZATION USING SEMANTICAL CLUSTERING
This abstract first appeared for US patent application 18657308 titled 'LANGUAGE MODEL SUMMARIZATION USING SEMANTICAL CLUSTERING
Original Abstract Submitted
Techniques for language model (LM) summarization using semantical clustering are provided. In one technique, a plurality of concepts reflected in text data is identified. A plurality of concept clusters is generated based on similarity among the plurality of concepts. Thus, some concept clusters may include multiple concepts. For each concept cluster of the plurality of concept clusters, an LM generates a summary of the text corresponding to that concept cluster. A summary response of the text data is generated by aggregating the summary of each concept cluster of the plurality of concept clusters. In another technique, an LM generates a summary based on text data. A first set of concepts reflected in the summary is identified and a second set of concepts reflected in the text data is identified. A difference between the two sets may indicate that the summary is missing one or more concepts.