ALEXANDRIA, Va., Feb. 11 -- United States Patent no. 12,548,562, issued on Feb. 10, was assigned to GOOGLE LLC (Mountain View, Calif.).
"Speaker diarization using speaker embedding(s) and trained generative model" was invented by Ignacio Lopez Moreno (New York) and Luis Carlos Cobo Rus (San Francisco).
According to the abstract* released by the U.S. Patent & Trademark Office: "Speaker diarization techniques that enable processing of audio data to generate one or more refined versions of the audio data, where each of the refined versions of the audio data isolates one or more utterances of a single respective human speaker. Various implementations generate a refined version of audio data that isolates utterance(s) of a single human speaker...