New AI noise-canceling headphone technology lets wearers pick which sounds they hear

Most anyone who’s used noise-canceling headphones knows that hearing the right noise at the right time can be vital. Someone might want to erase car horns when working indoors, but not when walking along busy streets. Yet people can’t choose what sounds their headphones cancel. Credit: University of Washington Most anyone who’s used noise-canceling headphones […]

Nov 9, 2023 - 18:00

New AI noise-canceling headphone technology lets wearers pick which sounds they hear

Most anyone who’s used noise-canceling headphones knows that hearing the right noise at the right time can be vital. Someone might want to erase car horns when working indoors, but not when walking along busy streets. Yet people can’t choose what sounds their headphones cancel.

SemanticHearing

Credit: University of Washington

Now, a team led by researchers at the University of Washington has developed deep-learning algorithms that let users pick which sounds filter through their headphones in real time. The team is calling the system “semantic hearing.” Headphones stream captured audio to a connected smartphone, which cancels all environmental sounds. Either through voice commands or a smartphone app, headphone wearers can select which sounds they want to include from 20 classes, such as sirens, baby cries, speech, vacuum cleaners and bird chirps. Only the selected sounds will be played through the headphones.

The team presented its findings Nov. 1 at UIST ’23 in San Francisco. In the future, the researchers plan to release a commercial version of the system.

“Understanding what a bird sounds like and extracting it from all other sounds in an environment requires real-time intelligence that today’s noise canceling headphones haven’t achieved,” said senior author Shyam Gollakota, a UW professor in the Paul G. Allen School of Computer Science & Engineering. “The challenge is that the sounds headphone wearers hear need to sync with their visual senses. You can’t be hearing someone’s voice two seconds after they talk to you. This means the neural algorithms must process sounds in under a hundredth of a second.”

Because of this time crunch, the semantic hearing system must process sounds on a device such as a connected smartphone, instead of on more robust cloud servers. Additionally, because sounds from different directions arrive in people’s ears at different times, the system must preserve these delays and other spatial cues so people can still meaningfully perceive sounds in their environment.

Tested in environments such as offices, streets and parks, the system was able to extract sirens, bird chirps, alarms and other target sounds, while removing all other real-world noise. When 22 participants rated the system’s audio output for the target sound, they said that on average the quality improved compared to the original recording.

In some cases, the system struggled to distinguish between sounds that share many properties, such as vocal music and human speech. The researchers note that training the models on more real-world data might improve these outcomes.

Additional co-authors on the paper were Bandhav Veluri and Malek Itani, both UW doctoral students in the Allen School; Justin Chan, who completed this research as a doctoral student in the Allen School and is now at Carnegie Mellon University; and Takuya Yoshioka, director of research at AssemblyAI.

For more information, contact semantichearing@cs.washington.edu.

DOI

10.1145/3586183.3606779

Article Title

Semantic Hearing: Programming Acoustic Scenes with Binaural Hearables

Article Publication Date

29-Oct-2023

Read the original article

Navigating uncertainty: the future of vaccine...

Gilead eyes Kymera’s ‘adhesive’ cancer drug i...

Novartis concludes Regulus Therapeutics acqui...

How AI and machine learning are transforming ...

Eli Lilly receives FDA approval for Amyvid la...

New Study Uncovers How Common Mutation Drives...

Shifting Focus: Understanding How Our Minds N...

Neopred: AI-Driven Dual-Phase CT Tool Enhance...

Mount Sinai Researchers Investigate the Conne...

New ASU Study Targets Drug-Resistant Microbes

CHMP Recommends Conditional EU Approval of Re...

Zanzalintinib-Tecentriq Combo Demonstrates Ov...

Sword Health Ventures Into Mental Health Mark...

Immuneering Reports 94% Six-Month Survival Ra...

Ventyx Releases Phase 2a Data for VTX3232 in ...

8 Reasons Your Hair Follicles Have “Stalled” ...

Stop Skin Aging This Summer With Zinc: Here’s...

Do You NEED To Wash Your Hair? Settling The D...

6 Reasons Your Immune System Needs Vitamin D

Everything You Need To Know About Vitamin D a...

New AI noise-canceling headphone technology lets wearers pick which sounds they hear

DOI

Article Title

Article Publication Date

Tags:

Bacteria-virus arms race provides rare window into rapid and complex evolution

Developing next-gen traffic signal control systems with air quality in mind

What's Your Reaction?

$2.7 million grant to explore hypoxia’s impact on blood...

Study from TU Graz Reveals Front Brake Lights Could Dra...

ASCO23: ‘Better sexual health for female patients on en...

Recommended Posts

STAT+: Illumina says it will divest Grail, startin...

Is it time to worry about Moderna?

Blood testing identifies biomarkers of suicidal th...

Monoblock Mastery: Capmatic's Innovative Approach ...

Follow Us

The FDA cautions against adverse reactions linked ...

Datopotamab deruxtecan by Daiichi Sankyo for Trans...

Datopotamab deruxtecan by Daiichi Sankyo for Perit...

New AI noise-canceling headphone technology lets wearers pick which sounds they hear

DOI

Article Title

Article Publication Date

Tags:

What's Your Reaction?

Related Posts

Recommended Posts

Follow Us