ALEXANDRIA, Va., March 24 -- United States Patent no. 12,586,350, issued on March 24, was assigned to Adobe Inc. (San Jose, Calif.).
"Determining audio and video representations using self-supervised learning" was invented by Simon Jenni (Hagendorf, Switzerland) and John Collomosse (Woking, Great Britain).
According to the abstract* released by the U.S. Patent & Trademark Office: "Embodiments are disclosed for training a system to generate audio and video representations using self-supervised learning. The method may include receiving a video signal including an audio component and a video component. A first machine learning model is trained to determine a representation of the audio component using a contrastive learning task and a tempo...