Adapted Extended Baum-Welch transformations

Kanevsky, Dr Dimitri and Povey , Dr Daniel and Ramabhadran, Dr Bhuvana and Sainath, Dr Tara (2007) Adapted Extended Baum-Welch transformations. [Preprint]

Full text available as:

PDF (In the paper we develop a concept of adapted Extended Baum-Welch transformations for estimating of parameters of Gaussian mixtures in speech recognition tasks.)


The discrimination technique for estimating parameters of Gaussian mixtures that is based on the Extended Baum-Welch transformations (EBW) has had significant impact on the speech recognition community. In this paper we introduce a general definition of a family of EBW transformations that can be associated with a weighted sum of updated and initial models. We compute a gradient steepness measurement for a family of EBW transformations that are applied to functions of Gaussian mixtures and demonstrate the growth property of these transformations. We consider EBW transformations of discriminative functions in which EBW controlled parameters are adapted to a gradient steepness measurement or to the likelihood of the data given the model. We present experimental results that show that adapted EBW transformations can significantly speed up estimating parameters of Gaussian mixtures and give better decoding results.

Item Type:Preprint
Keywords:MMIE training, EBW transformations, gradient steepness
Subjects:Computer Science > Speech
ID Code:5902
Deposited By:Kanevsky, Dr Dimitri
Deposited On:15 Jan 2008 23:56
Last Modified:11 Mar 2011 08:57

References in Article

Select the SEEK icon to attempt to find the referenced article. If it does not appear to be in cogprints you will be forwarded to the paracite service. Poorly formated references will probably not work.

S. Axelrod, V. Goel, R. Gopinath, P. Olsen, and K. Visweswariah, "Discriminative Training of Subspace Constrained GMMs for Speech Recognition," to be submitted to IEEE Transactions on Speech and Audio Processing.

L.E.Baum and J.A. Eagon, "An inequality with applications to statistical prediction for functions of Markov processes and to a model of ecology," {\em Bull. Amer. Math. Soc.}, vol. 73, pp.360-363, 1967.

A. Gunawardana and W. Byrne, ``Discriminative Speaker Adaptation with Conditional Maximum Likelihood Linear Regression,'' ICASSP, 2002.

P.S. Gopalakrishnan, D. Kanevsky, D. Nahamoo and A. Nadas, "An inequality for rational functions with applications to some statistical estimation problems", IEEE Trans. Information Theory, Vol. 37, No.1 January 1991 .

D. Kanevsky, "Growth Transformations for General Functions", RC22919 (W0309-163), September 25, 2003.

D. Kanevsky, "Extended Baum transformations for general functions", in Proc. ICASSP, 2004.

D. Kanevsky, ``Extended Baum Transformations for General Functions, II", tech. Rep. RC23645(W0506-120), Human Language technologies, IBM , 2005, .

D. Kanevsky, "Constrained corrective training for continuous parameter system", US patent 6,044,344, March 28, 2000.

Cong Liu Peng Liu Hui Jiang Soong, F. Ren-Hua Wang, "A Constrained Line Search Optimization for Discriminative Training in Speech Recognition",

in Proc. ICASSP, 2007.

Y. Normandin, "An improved MMIE Training Algorithm for Speaker Independent, Small Vocabulary, Continuous Speech Recognition", Proc. ICASSP'91, pp. 537-540, 1991.

Daniel Povey, "Discriminative Training for Large Vocabulary Speech Recognition", PhD Thesis March 1, 2003.

Daniel Povey, Dimitri Kanevsky, Brian Kingsbury, Bhuvana Ramabhadran, George Saon and Karthik Visweswariah "Boosted MMI for model and feature-space discriminative training", submitted for ICASSP'08.

Tara N. Sainath, Dimitri Kanevsky, Giridharan Iyengar,``Unsupervised Audio Segmentation Using Extended Baum-Welch Transformations, in Proc. ICASSP, 2007.

Tara N. Sainath, Victor Zue, Dimitri Kanevsky, ``Audio-Classification using Extended Baum-Welch Transformations" , in Proc. Interspeech 2007.

Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabhadran ``Broad Phoentic Recognition in a Hidden Markov Model Framework Using Extended Baum-Welch Transformations " , to appear in Proc. ASRU 2007.

Tara N. Sainath, Dimitri Kanevsky, Bhuvana Ramabhadran ``Gradient Steepness Metrics Using Extended Baum-Welch Transformations for Universal Pattern Recognition Tasks" , submitted for ICASSP 2008.

R. Schluter, W. Macherey, B. Muler and H. Ney, "Comparison of discriminative training criteria and optimization methods for speech recognition", Speech Communication, Vol. 34, pp.287-310, 2001.

V. Valtchev , P.C. Woodland and S. J. Young, "Lattice-based Discriminative Training for Large Vocabulary Speech Recognition Systems", Speech Communication, Vol. 22, pp. 303-314, 1996.


Repository Staff Only: item control page