DeepMind Technologies Limited patent applications on September 19th, 2024

From WikiPatents
Jump to navigation Jump to search

Patent Applications by DeepMind Technologies Limited on September 19th, 2024

DeepMind Technologies Limited: 3 patent applications

DeepMind Technologies Limited has applied for patents in the areas of G06N3/0455 (1), G06N3/092 (1), G06N3/045 (1), G21D3/00 (1), G21B1/05 (1) G06N3/0455 (1), G06N3/092 (1), G21D3/001 (1)

With keywords such as: magnetic, neural, network, confinement, control, computer, device, observation, controlling, and time in patent application abstracts.



Patent Applications by DeepMind Technologies Limited

20240311617. CONTROLLING AGENTS USING SUB-GOALS GENERATED BY LANGUAGE MODEL NEURAL NETWORKS_simplified_abstract_(deepmind technologies limited)

Inventor(s): Norman Di Palo of London (GB) for deepmind technologies limited, Arunkumar Byravan of London (GB) for deepmind technologies limited, Nicolas Manfred Otto Heess of London (GB) for deepmind technologies limited, Martin Riedmiller of Balgheim (DE) for deepmind technologies limited, Leonard Hasenclever of London (GB) for deepmind technologies limited, Markus Wulfmeier of London (GB) for deepmind technologies limited

IPC Code(s): G06N3/0455

CPC Code(s): G06N3/0455



Abstract: methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for controlling agents using a language model neural network and a vision-language model (vlm) neural network.


20240311639. REINFORCEMENT LEARNING USING AN ENSEMBLE OF DISCRIMINATOR MODELS_simplified_abstract_(deepmind technologies limited)

Inventor(s): Steven Stenberg Hansen of London (GB) for deepmind technologies limited, Daniel Joseph Strouse of London (GB) for deepmind technologies limited

IPC Code(s): G06N3/092, G06N3/045

CPC Code(s): G06N3/092



Abstract: this specification describes a method performed by one or more data processing apparatus that includes: sampling a latent from a set of possible latents, selecting actions to be performed by an agent to interact with an environment over a sequence of time steps using an action selection neural network that is conditioned on the sampled latent, determining a respective reward received for each time step in the sequence of time steps using an ensemble of discriminator models, and training the action selection neural network based on the rewards using a reinforcement learning technique. each discriminator model can process an observation to generate a respective prediction output that predicts which latent the action selection neural network was conditioned on to cause the environment to enter the state characterized by the observation.


20240312657. CONTROLLING A MAGNETIC FIELD OF A MAGNETIC CONFINEMENT DEVICE USING A NEURAL NETWORK_simplified_abstract_(deepmind technologies limited)

Inventor(s): Jonas Degrave of London (GB) for deepmind technologies limited, Federico Alberto Alfredo Felici of Lausanne (CH) for deepmind technologies limited, Jonas Buchli of London (GB) for deepmind technologies limited, Michael Peter Neunert of London (GB) for deepmind technologies limited, Brendan Daniel Tracey of London (GB) for deepmind technologies limited, Francesco Carpanese of Lausanne (CH) for deepmind technologies limited, Timo Victor Ewalds of London (GB) for deepmind technologies limited, Roland Hafner of Balgheim (DE) for deepmind technologies limited, Martin Riedmiller of Balgheim (DE) for deepmind technologies limited

IPC Code(s): G21D3/00, G21B1/05

CPC Code(s): G21D3/001



Abstract: methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for generating control signals for controlling a magnetic field for confining plasma in a chamber of a magnetic confinement device. one of the methods includes, for each of a plurality of time steps, obtaining an observation characterizing a current state of the plasma in the chamber of the magnetic confinement device, processing an input including the observation using a plasma confinement neural network to generate a magnetic control output that characterizes control signals for controlling the magnetic field of the magnetic confinement device, and generating the control signals for controlling the magnetic field of the magnetic confinement device based on the magnetic control output.


DeepMind Technologies Limited patent applications on September 19th, 2024