Ron M Hecht, Elad Noor, Naftali Tishby. Speaker Recognition by Gaussian Information Bottleneck


Natural Sciences / Computer Science / Analysis of algorithms

Submitted on: Aug 24, 2012, 19:17:47

Description: This paper explores a novel approach for the extraction of relevant information in speaker recognition tasks. This approach uses a principled information theoretic framework - the Information Bottleneck method (IB). In our application, the method compresses the acoustic data while preserving mostly the relevant information for speaker identification. This paper focuses on a continuous version of the IB method known as the Gaussian Information Bottleneck (GIB). This version assumes that both the source and target variables are high dimensional multivariate Gaussian variables. The GIB was applied in our work to the Super Vector (SV) dimension reduction conundrum. Experiments were conducted on the male part of the NIST SRE 2005 corpora. The GIB representation was compared to other dimension reduction techniques and to a baseline system. In our experiments, the GIB outperformed the baseline system; achieving a 6.1% Equal Error Rate (EER) compared to the 15.1% EER of a baseline system.

The Library of Congress (USA) reference page : http://lccn.loc.gov/cn2013300046.

To read the article posted on Intellectual Archive web site please click the link below.

Naftali_Tishby__Speaker_Recognition.pdf



© Shiny World Corp., 2011-2024. All rights reserved. To reach us please send an e-mail to support@IntellectualArchive.com