Self-Supervised Vision-Based Detection of the Active Speaker as Support for Socially Aware Language Acquisition | IEEE Journals & Magazine | IEEE Xplore