| | |
Summary: ROBUST VARIATIONAL SPEECH SEPARATION USING FEWER MICROPHONES THAN
SPEAKERS
Steven Rennie, Parham Aarabi, Trausti Kristjansson, Brendan J. Frey, Kannan Achan
Department of Electrical and Computer Engineering
University of Toronto
ABSTRACT
A variational inference algorithm for robust speech separa
tion, capable of recovering the underlying speech sources
even in the case of more sources than microphone obser
vations, is presented. The algorithm is based upon an gen
erative probabilistic model that fuses timedelay of arrival
(TDOA) information with prior information about the speak
ers and application, to produce an optimal estimate of the
underlying speech sources. Simulation results are presented
for the case of two, three and four underlying sources and
two microphones observations corrupted by noise. The re
sulting SNR gains (24dB with two sources, 15dB with three
sources, and 9dB with four sources) are significantly higher
than previous speech separation techniques.
1. INTRODUCTION
|