Optimal Utterance Selection for Unit Selection Speech Synthesis Databases

doi:10.1023/A:1025704800086

Journal Article10.1023/A:1025704800086

Optimal Utterance Selection for Unit Selection Speech Synthesis Databases

Alan W. Black, +1 more

- 01 Oct 2003

- International Journal of Speech Technolo...

- Vol. 6, Iss: 4, pp 357-363

11

TL;DR: This paper describes techniques to find an optimal data set for building high quality unit-selection speech synthesis inventories and a more complex acoustic modeling technique based on the database speaker's acoustic characteristics.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book

不思議の国のアリス = Alice's adventures in woderland

Lewis Carroll, +1 more

- 01 Jan 1955

TL;DR: In this article, the authors argue that what Alice is exposed to and reacts to in Wonderland generally reflects the genre of a Bildungsroman and also specifically a feminist BildungSroman, and demonstrate how the novel also has a coming of age aspect based on feminism.

...read moreread less

546

Journal Article•10.1109/TASL.2007.903933

On Using Multiple Models for Automatic Speech Segmentation

Seung Seop Park, +1 more

- 01 Nov 2007

- IEEE Transactions on Audio, Speech, and ...

TL;DR: A novel approach to automatic speech segmentation for unit-selection based text-to-speech systems that makes use of multiple independent ASMs to produce a final boundary time-mark, and improves the percentage of boundaries that deviate less than 20 ms with respect to the reference boundary.

...read moreread less

35

•Proceedings Article•10.21437/SPEECHPROSODY.2018-121

Duration modeling using DNN for Arabic speech synthesis

Imene Zangar, +4 more

- 13 Jun 2018

- Speech prosody

TL;DR: This paper compares several modeling of phoneme durations, and proposes a new approach which relies on using a set of models, each one being optimal for a given phoneme class (e.g., simple consonants, geminated consonant, short vowels, and long vowels).

...read moreread less

19

•Journal Article•10.1016/J.SPECOM.2006.02.002

Tone-Group F0 selection for modeling focus prominence in small-footprint speech synthesis

Gerasimos Xydas, +1 more

- 01 Sep 2006

- Speech Communication

TL;DR: This work presents a robust unit-selection methodology for generating realistic F0 curves in cases where focus prominence is required, based on selecting Tone-Group units from commonly used prosodic corpora that are automatically transcribed as patterns of syllables.

...read moreread less

13

Book Chapter•10.1007/978-3-642-20095-3_4

Corpus design for a unit selection TtS system with application to Bulgarian

Aimilios Chalamandaris, +3 more

- 06 Nov 2009

TL;DR: This paper presents the process of designing an efficient speech corpus for the first unit selection speech synthesis system for Bulgarian, along with some significant preliminary results regarding the quality of the resulted system.

...read moreread less

9

References

•Book

Classification and regression trees

Leo Breiman

- 01 Jan 1983

TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.

...read moreread less

22.7K

Journal Article•10.2307/2288003

Classification and Regression Trees.

John Van Ryzin, +4 more

- 01 Mar 1986

- Journal of the American Statistical Asso...

21.8K

•Proceedings Article•10.1109/ICASSP.1996.541110

Unit selection in a concatenative speech synthesis system using a large speech database

Andrew Hunt, +1 more

- 07 May 1996

TL;DR: In this paper, a state transition network is proposed to select and concatenate phonemes from a large speech database to produce a natural realisation of a target phoneme sequence predicted from text which is annotated with prosodic and phonetic context information.

...read moreread less

1.4K