Google plans new method to match face with voices
04 July 2018 14:00 GMT

Google has filed patent for a new method to match faces and voices that includes new ability to recognize and match multiple faces and voices in a video.

The patent application describes that the method focuses on determining when somebody is talking and then matching their voice to their face. This approach gives the machine ability to hear and understand individual voices even in noisy environments by simply looking for a face that matches the sound pattern corresponding to a pre-defined voice profile.

The patent details Google using a voice diarization system that starts by finding faces then watches those faces in the video to determine when somebody’s talking. The goal is catch someone talking alone and then isolate their voice by confirming any audio matches with the movement of their mouth. Google then positively profiles that voice and files it together with the face to create a hard match.

In crowded situations or environments with multiple speakers, Google’s algorithm repeats the procedure for every person who speaks in the video. Once it creates a profile for everybody, the system becomes intelligent enough to tell who is speaking and when, as well as understand what they are saying by reading their lips. Android Headlines notes that the machine also transcribes the audio to check that mouth movement matches what the machine’s audio processing thinks the person is saying.

Industry Events



Smart Security Week 24-26 Sep 18


BIOSIG 2018 26-28 Sep 18
ADAS 2018 Event supported by Planet Biometrics 26-28 Sep 18