Scispace (Formerly Typeset)
  1. Home
  2. Journals
  3. Computer Speech & Language
  4. 2002
  1. Home
  2. Journals
  3. Computer Speech & Language
  4. 2002
Showing papers in "Computer Speech & Language in 2002"
Journal Article•10.1006/CSLA.2001.0184•
Weighted finite-state transducers in speech recognition

[...]

Mehryar Mohri1, Fernando Pereira1, Michael Riley2•
AT&T Labs1, University of Pennsylvania2
01 Jan 2002-Computer Speech & Language
TL;DR: WFSTs provide a common and natural representation for hidden Markov models (HMMs), context-dependency, pronunciation dictionaries, grammars, and alternative recognition outputs, and general transducer operations combine these representations flexibly and efficiently.

1,199 citations

Journal Article•10.1006/CSLA.2001.0182•
Large scale discriminative training of hidden Markov models for speech recognition

[...]

Philip C. Woodland1, Daniel Povey1•
University of Cambridge1
01 Jan 2002-Computer Speech & Language
TL;DR: It is shown that HMMs trained with MMIE benefit as much as MLE-trained HMMs from applying model adaptation using maximum likelihood linear regression (MLLR), which has allowed the straightforward integration of MMIe- trained HMMs into complex multi-pass systems for transcription of conversational telephone speech.

396 citations

Journal Article•10.1006/CSLA.2001.0183•
Recognition confidence scoring and its use in speech understanding systems

[...]

Timothy J. Hazen1, Stephanie Seneff1, Joseph Polifroni1•
Massachusetts Institute of Technology1
01 Jan 2002-Computer Speech & Language
TL;DR: This paper presents an approach to recognition confidence scoring and a set of techniques for integrating confidence scores into the understanding and dialogue components of a speech understanding system and demonstrates a relative reduction in concept error rate.

198 citations

Journal Article•10.1016/S0885-2308(02)00027-X•
Training a sentence planner for spoken dialogue using boosting

[...]

Marilyn A. Walker1, Owen Rambow2, Monica Rogati3•
AT&T Labs1, University of Pennsylvania2, Carnegie Mellon University3
01 Jul 2002-Computer Speech & Language
TL;DR: SPoT, a trainable sentence planner, and a new methodology for automatically training SPoT on the basis of feedback provided by human judges, which shows that SPiT performs better than the rule-based systems and the baselines, and as well as the hand-crafted system.

99 citations

Journal Article•10.1016/S0885-2308(02)00011-6•
Response planning and generation in the mercury flight reservation system

[...]

Stephanie Seneff1•
Massachusetts Institute of Technology1
01 Jul 2002-Computer Speech & Language
TL;DR: There is a direct meaning-to-speech mapping that eliminates the need to analyze linguistic structure for synthesis in the mercury flight reservation system, a mixed-initiative spoken dialogue system that supports both voice-only interaction and multi-modal interaction augmenting spoken inputs with typing or clicking at a displayed Web page.

90 citations

Journal Article•10.1016/S0885-2308(02)00025-6•
Trainable approaches to surface natural language generation and their application to conversational dialog systems

[...]

Adwait Ratnaparkhi
01 Jul 2002-Computer Speech & Language
TL;DR: How decisions for word ordering and word choice in surface natural language generation can be automatically learned from annotated data is studied to find the highest probability word sequence that is consistent with the rules and conditions of the grammar.

67 citations

Journal Article•10.1006/CSLA.2001.0188•
Theory and practice of acoustic confusability

[...]

Harry Printz, Peder A. Olsen1•
IBM1
01 Jan 2002-Computer Speech & Language
TL;DR: This paper defines two alternatives to the familiar perplexity statistic, respectively acoustic perplexity and the synthetic acoustic word error rate, and shows how to compute these statistics by effectively synthesizing a large acoustic corpus.

62 citations

Journal Article•10.1006/CSLA.2002.0192•
From within-word model search to across-word model search in large vocabulary continuous speech recognition

[...]

Achim Sixtus1, Hermann Ney1•
RWTH Aachen University1
01 Apr 2002-Computer Speech & Language
TL;DR: This paper reports on the application of across-word context dependent acoustic phoneme models in a single-pass large vocabulary continuous speech recognizer and derives a formal specification ofAcross-word word graphs, which are a good representation of the active search space.

36 citations

Journal Article•10.1016/S0885-2308(02)00009-8•
A conversation acts model for generating spoken dialogue contributions

[...]

Amanda Stent1•
Stony Brook University1
01 Jul 2002-Computer Speech & Language
TL;DR: A generation system for spoken dialogue that not only produces coherent, informative and responsive dialogue contributions, but also explicitly models human styles of interaction is described.

35 citations

Journal Article•10.1016/S0885-2308(02)00023-2•
Efficient integrated response generation from multiple targets using weighted finite state transducers

[...]

Ivan Bulyko1, Mari Ostendorf1•
University of Washington1
01 Jul 2002-Computer Speech & Language
TL;DR: This paper describes how language generation and speech synthesis for spoken dialog systems can be efficiently integrated under a weighted finite state transducer architecture and shows that introducing flexible targets in generation leads to more natural sounding synthesis.

34 citations

Journal Article•10.1016/S0885-2308(02)00029-3•
Spoken language generation

[...]

Marilyn A. Walker1, Owen Rambow2•
AT&T Labs1, Columbia University2
01 Jul 2002-Computer Speech & Language
TL;DR: This research is motivated by several goals: improving the quality of synthesis by using the generator to provide information about the purpose, meaning, and linguistic structure of the utterance to the synthesis process, and making it possible to customize systems that generate spoken language to individual or sets of users or new domains very quickly.
Journal Article•10.1016/S0885-2308(02)00022-0•
Exploring features from natural language generation for prosody modeling

[...]

Shimei Pan1, Kathleen R. McKeown2, Julia Hirschberg3•
IBM1, Columbia University2, AT&T Labs3
01 Jul 2002-Computer Speech & Language
TL;DR: Three groups of features are investigated: semantic, syntactic, and surface features produced by SURGE, a general-purpose surface natural language generator for English, deep semantic, and discourse features that are available during the domain modeling and content planning phases of generation, and information-based measures statistically derived from text.
Journal Article•10.1006/CSLA.2001.0187•
Using phonologically-constrained morphological analysis in continuous speech recognition

[...]

Mark Huckvale1, Alex Chengyu Fang1•
University College London1
01 Apr 2002-Computer Speech & Language
TL;DR: Investigation into the use of phonologically-constrained morphological analysis (PCMA) in language modelling for continuous speech recognition shows that PCMA leads to smaller but more generative pronunciation lexicons, and that it does not weaken the quality of the acoustic decoding measured in terms of recognition lattices.
Journal Article•10.1006/CSLA.2002.0190•
Selection of the most significant parameters for duration modelling in a Spanish text-to-speech system using neural networks

[...]

Ricardo de Córdoba1, Juan Manuel Montero1, Juana M. Gutiérrez1, José A. Vallejo1, Emilia Enríquez1, José Manuel Pardo1 •
Technical University of Madrid1
01 Apr 2002-Computer Speech & Language
TL;DR: A neural network system that predicts duration with very good results (19 ms in RMS) and that clearly improves the previous rule-based system.
Journal Article•10.1016/S0885-2308(02)00010-4•
Contrast in concept-to-speech generation

[...]

Mariët Theune1•
University of Twente1
01 Jul 2002-Computer Speech & Language
TL;DR: How information from natural language generation can be used to compute prosody in a concept-to-speech system, focusing on the automatic marking of contrastive accents on the basis of information about the preceding discourse, is discussed and compared.
Journal Article•10.1006/CSLA.2002.0193•
Transformation streams and the HMM error model

[...]

Mark J. F. Gales1•
University of Cambridge1
01 Apr 2002-Computer Speech & Language
TL;DR: A new form of factorial HMM which makes use of transformation streams is introduced which is a generalization of the standard factorialHMM and other related schemes in speech processing.
Journal Article•10.1006/CSLA.2001.0189•
Advances in Large Vocabulary Speech Recognition

[...]

Jean-Luc Gauvain, Renato De Mori, Lori Lamel
01 Jan 2002-Computer Speech & Language
Journal Article•10.1016/S0885-2308(02)00012-8•
Stochastic natural language generation for spoken dialog systems

[...]

Alice Oh1, Alexander I. Rudnicky1•
Carnegie Mellon University1
01 Jul 2002-Computer Speech & Language
TL;DR: It is shown that a simple statistical model alone can generate appropriate language for a spoken dialog system, and a promising avenue for using a statistical approach in future NLG systems is described.
Journal Article•10.1006/CSLA.2001.0185•
An overview of decoding techniques for large vocabulary continuous speech recognition

[...]

Xavier L. Aubert1•
Philips1
01 Jan 2002-Computer Speech & Language
TL;DR: A number of decoding strategies for large vocabulary continuous speech recognition (LVCSR) are examined from the viewpoint of their search space representation, and the main approaches are compared and some prospective views are formulated regarding possible future avenues.
Journal Article•10.1006/CSLA.2002.0191•
Hidden Markov model training with contaminated speech material for distant-talking speech recognition

[...]

Marco Matassoni, Maurizio Omologo, Diego Giuliani, Piergiorgio Svaizer
01 Apr 2002-Computer Speech & Language
TL;DR: Improvements in recognition accuracy due to multiple microphones, HMM training on contaminated speech and incremental adaptation are additive on a connected digits task and the results show that unsupervised incremental adaptation receives the benefits of starting from models trained using contaminated speech.
Journal Article•10.1016/S0885-2308(02)00024-4•
Learning visually grounded words and syntax for a scene description task

[...]

Deb Roy1•
Massachusetts Institute of Technology1
01 Jul 2002-Computer Speech & Language
TL;DR: A spoken language generation system that learns to describe objects in computer-generated visual scenes and generates syntactically well-formed compound adjective noun phrases, as well as relative spatial clauses was comparable to human-generated descriptions.
Journal Article•10.1006/CSLA.2001.0186•
Lightly supervised and unsupervised acoustic model training

[...]

Lori Lamel1, Jean-Luc Gauvain1, Gilles Adda1•
Centre national de la recherche scientifique1
01 Jan 2002-Computer Speech & Language
TL;DR: Experiments providing supervision only via the language model training materials show that including texts which are contemporaneous with the audio data is not crucial for success of the approach, and that the acoustic models can be initialized with as little as 10 min of manually annotated data.
Journal Article•10.1006/CSLA.2001.0181•
Structural maximum a posteriori linear regression for fast HMM adaptation

[...]

Olivier Siohan1, Tor André Myrvoll1, Chin-Hui Lee1•
Alcatel-Lucent1
01 Jan 2002-Computer Speech & Language
TL;DR: The proposed algorithm, called structural MAPLR (SMAPLR), has been evaluated on the Spoke3 1993 test set of the WSJ task and it is shown that SMAPLR reduces the risk of overtraining and exploits the adaptation data much more efficiently than MLLR, leading to a significant reduction of the word error rate for any amount of adaptation data.

Tools

SciSpace AgentBiomedical AgentSciSpace RecruitSciSpace for EnterpriseAgent GalleryChat with PDFLiterature ReviewAI WriterFind TopicsParaphraserCitation GeneratorExtract DataAI DetectorCitation Booster

Learn

ResourcesLive Workshops

SciSpace

CareersSupportBrowse PapersPricingSciSpace Affiliate ProgramCancellation & Refund PolicyTermsPrivacyData Sources

Directories

PapersTopicsJournalsAuthorsConferencesInstitutionsCitation StylesWriting templates

Extension & Apps

SciSpace Chrome ExtensionSciSpace Mobile App

Contact

support@scispace.com
SciSpace

© 2026 | PubGenius Inc. | Suite # 217 691 S Milpitas Blvd Milpitas CA 95035, USA

soc2
Secured by Delve