ALEXANDRIA, Va., Dec. 2 -- United States Patent no. 12,489,956, issued on Dec. 2, was assigned to Snap Inc. (Santa Monica, Calif.).
"Captioning videos with multiple cross-modality teachers" was invented by Tsai-Shien Chen (Merced, Calif.), Yuwei Fang (Redmond, Wash.), Hsin-Ying Lee (San Jose, Calif.), Jian Ren (Hermosa Beach, Calif.), Aliaksandr Siarohin (Los Angeles) and Sergey Tulyakov (Santa Monica, Calif.).
According to the abstract* released by the U.S. Patent & Trademark Office: "Automatic captioning pipelines and methods for automatically annotating video data with subtitles, which can be obtained using automatic speech recognition (ASR). An automatic captioning pipeline with inputs of multimodal data scales up the dataset of high-...