ALEXANDRIA, Va., March 24 -- United States Patent no. 12,585,917, issued on March 24, was assigned to Google LLC (Mountain View, Calif.).

"Reinforcement learning using advantage estimates" was invented by Shixiang Gu (Cambridge, Great Britain), Timothy Paul Lillicrap (London), Ilya Sutskever (San Francisco) and Sergey Vladimir Levine (Berkeley, Calif.).

According to the abstract* released by the U.S. Patent & Trademark Office: "Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for computing Q values for actions to be performed by an agent interacting with an environment from a continuous action space of actions. In one aspect, a system includes a value subnetwork configured to receive an ob...