Oracle international corporation (20250094687). GENERATING SEMANTICALLY REPETITION-FREE LLM TEXT
GENERATING SEMANTICALLY REPETITION-FREE LLM TEXT
Organization Name
oracle international corporation
Inventor(s)
Vinod Murli Mamtani of Bellevue WA US
GENERATING SEMANTICALLY REPETITION-FREE LLM TEXT
This abstract first appeared for US patent application 20250094687 titled 'GENERATING SEMANTICALLY REPETITION-FREE LLM TEXT
Original Abstract Submitted
techniques for generating repetition-free text using a large language model (llm) are provided. in one technique, textual content that was generated by an llm is accessed, where the textual content comprises a plurality of sub-components including a first sub-component and a second sub-component. a first embedding that represents the first sub-component is generated and a second embedding that represents the second sub-component is generated. based on a similarity between the first embedding and the second embedding, it is determined whether the second sub-component is repetitious with respect to the first sub-component. in response to determining that the second sub-component is repetitious with respect to the first sub-component, at least a portion of the second sub-component is removed from the textual content.