New Google AI Tech Can Differentiate Between Voices in a Crowd

• Apr 12, 2018 at 07:31pm EDT

There is no doubt that we can differentiate between multiple voices when they're spoken simultaneously. Under right circumstances, we're easily able to tune things out to focus on a single speaker, but a microphone picking up sound from multiple sources can't do the same thing. Speech recognition has been a thing only for the past decade, and it's capabilities are still limited. Researchers at Google have been working on isolating sources of audio like speech in videos, they've made great progress at it, judging by some of the results they've posted.

The researchers at Google have built a machine learning-powered system that can pick out specific sounds like speech in a video. Not only can it isolate spoken words from background audio but also entirely separate the speech of two people talking simultaneously. The concept seems easy enough when the two speakers have drastically different voices. If it's isolating audio based on frequency, the more significant the pitch difference between the speakers' voices, the better the results. The problem gets trickier when there are multiple sounds involved, along with background noise.

For that, the researchers at Google used "fake cocktail parties," composed of manually spliced "clean" sources of audio and video, overlaid with similarly clean background noise. The data is then fed to the network, training it with facial movements from the video and spectrograms of the merged audio track. The system is then able to determine which frequencies at which times are most likely to correspond to a given speaker and that data is then extracted into a new isolated audio track which is almost better than how a human would have gone about it.

While it sounds good from a scientific breakthrough point of view, the technology can spell doom for individual privacy rights. With a little work, it will be possible for anyone with the right equipment to isolate an individual voice from a crowd. But, then again, when has personal privacy ever been a hindrance to technology?

News Source: Google

About the author: Anil has been a lifelong tech enthusiast and has worked a variety of jobs before joining the Wccftech team in 2018. His primary responsibilities include reporting on all things in the Android and mobile gaming sphere. He is also passionate about PC hardware, obscure music and internet culture. He also has a thing for addressing himself in third person as an exercise in self-awareness.

Follow Wccftech on Google to get more of our news coverage in your feeds.

Read all comments on New Google AI Tech Can Differentiate Between Voices in a Crowd

New Google AI Tech Can Differentiate Between Voices in a Crowd

Trending Stories

Marvell’s Structera CXL Accelerators Compress Data By Up To 3.64x To Make Every Gigabyte Count As Memory Shortages Intensify

PlayStation Will Continue To Push Live Service Games Despite Evident Challenges, While Teasing A PlayStation 6 Handheld

CXMT’s ‘Cheap’ DDR5 Is a Myth, Memory Vendors Tell us at Computex — Prices Match Samsung, SK Hynix & Micron

Elon Musk Calls IBM’s 0.7-nanometer Chip Naming Misleading, Says Atoms Should Decide Process Node Labels

PlayStation 6 Frame Generation Research May Help Deliver 4K@120 Gameplay While Keeping Costs Down Amidst Crippling Price Increases

Popular Discussions

Intel Nova Lake Dual-Tile CPUs Reportedly Feature Up To 474W PL2 Power Limit

AMD Rolls Out FSR 4.1 For RX 7000 GPUs, Builds a Lightweight ML Model for RDNA 3.5 and RDNA 3 iGPUs

AMD’s FSR 4.1 Doubles RX 7900 XTX frame Rates In Cyberpunk 2077, Jumping From 24 To 50 FPS At 4K

YouTuber Daniel Owen And Club386 Got Their RTX 5090 Connectors Cooked; Club386 Calls It A “Flawed Design”

Valve’s $1049 Steam Machine Either Hides a Fat Margin or Got Rinsed by Suppliers, Says AMD Leaker

New Google AI Tech Can Differentiate Between Voices in a Crowd

Related Story Google’s TPUs May Deliver ‘Impressive’ Performance, But One Overlooked Bottleneck Could Bring External Scaling to a Halt Before It Even Begins

Further Reading

Trending Stories

Popular Discussions