This work will be presented at the 184th Meeting of the Acoustical Society of America, May 2023, in Chicago, Illinois.
Conventional microphones can be referred to as air-conduction mics (ACMs), because they capture sound that propagates through the air. ACMs can record wideband audio, but will capture sounds from undesired sources in noisy scenarios.
In contrast, bone-conduction mics (BCMs) are worn directly on a person to detect sounds propagating through the body. While this can isolate the wearer’s speech, it also severely degrades the quality. We can model this degradation as a low-pass filter.
However, the BCM only provides good estimates of the lower frequencies. Therefore, we need to estimate the missing upper frequencies. This task is called bandwidth extension (BWE), and can be solved in a variety of ways.
We found that ensemble factorization approaches can significantly outperform other low-compute BWE methods.
We provide listening examples of our proposed ensemble system. The audio data was generated from a simulated indoor, multi-talker scene.
Female | Male | |
---|---|---|
BCM | ||
ACM | ||
Baseline | ||
Proposed |