ALEXANDRIA, Va., May 19 -- United States Patent no. 12,633,305, issued on May 19, was assigned to Google LLC (Mountain View, Calif.).
"End-to-end speech diarization via iterative speaker embedding" was invented by David Grangier (Mountain View, Calif.), Neil Zeghidour (Mountain View, Calif.) and Oliver Teboul (Mountain View, Calif.).
According to the abstract* released by the U.S. Patent & Trademark Office: "A method includes receiving an input audio signal corresponding to utterances spoken by multiple speakers. The method also includes encoding the input audio signal into a sequence of T temporal embeddings. During each of a plurality of iterations each corresponding to a respective speaker of the multiple speakers, the method includes ...