Abstract: Fully immersive and interactive audio-visual scenes are dynamic such that the listeners and the sound emitters move and interact with each other. Reconstruction of an immersive sound ...
People can discriminate the synchrony between audio-visual scenes. However, the sensitivity of audio-visual synchrony perception can be affected by many factors. Using a simultaneity judgment task, ...
Abstract. In recent years, DeepFake technology has achieved unprecedented success in high-quality video synthesis, but these methods also pose potential and severe security threats to humanity.
Humans naturally learn by making connections between sight and sound. For instance, we can watch someone playing the cello and recognize that the cellist's movements are generating the music we hear.
How to run AV-CIL-FFIR? Note: The code is borrowed from Weiguo Pian (AC-CIL), ICCV 2023; the baseline model and code can be found at (https://github.com/weiguoPian/AV ...