Microsoft technology licensing, llc (20250140349). CONDITIONAL GENERATION OF PROTEIN SEQUENCES
CONDITIONAL GENERATION OF PROTEIN SEQUENCES
Organization Name
microsoft technology licensing, llc
Inventor(s)
Bruce James Wittmann of Redmond WA US
Eric J. Horvitz of Freeland WA US
Rohan Vishesh Koodli of Saratoga CA US
CONDITIONAL GENERATION OF PROTEIN SEQUENCES
This abstract first appeared for US patent application 20250140349 titled 'CONDITIONAL GENERATION OF PROTEIN SEQUENCES
Original Abstract Submitted
a computing system for conditional generation of protein sequences includes processing circuitry that implements a denoising diffusion probabilistic model. in an inference phase, the processing circuitry receives an instruction to generate a predicted protein sequence having a target functionality, the instruction including first conditional information and second conditional information. the processing circuitry concatenates a first conditional information embedding generated by a first encoder and a second conditional information embedding generated by a second encoder to produce a concatenated conditional information embedding. the processing circuitry samples noise from a distribution function and combines the concatenated conditional information embedding with the sampled noise to produce a noisy concatenated input. the processor inputs the noisy concatenated input to a denoising neural network to generate a predicted sequence embedding, inputs the predicted sequence embedding to a decoding neural network to generate the predicted protein sequence, and outputs the predicted protein sequence.