Frequency Modulation Technique for Prosodic Modification

doi:10.1109/CHINSL.2008.ECP.41

Open AccessProceedings Article10.1109/CHINSL.2008.ECP.41

Frequency Modulation Technique for Prosodic Modification

Jinfu Ni, +3 more

- 30 Dec 2008

- pp 1-4

5

TL;DR: This technique provides a mathematical formulation for representing speaking tone and manipulating FM in a unified framework for communicative speech synthesis and results indicated that the native speakers identified 90% of samples with emphases and 78% of "good news" as well as 94% of bad news samples.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Figure 4: Illustration of enhancing emphasis in words with frequency modulation technique for rising and lowering tones.

Figure 3: Schematic diagram of the basic patterns defined by tags baseline (line AB), cap (CDF/CDEF) and toend (line GH).

Figure 2: Schematic diagram of performing prosodic modification within the framework of TTS system XIMERA.

Figure 5: Mean opinion scores (the crosses) and standard deviations (the boxes) on a 7-point scale, – 3 (very good “bad news”), 0 (neutral), and +3 (very good “good news”).

Figure 1: Resonance curve A(λ, ζ) (the left panel) and the warping functions between normalized logF0 ∈ [0, 1] and λ ∈ [1, 2] at several values of ζ (the right panel).

Citations

Conversational Speech Synthesis (and the need for some laughter)

Nick Campbell

- 12 May 2005

TL;DR: This article reported progress in the synthesis of conversational speech from the viewpoint of work carried out on the analysis of a very large corpus of expressive speech in normal everyday situations, and suggested that this problem may be solved by the use of phrase-sized utterance units taken intact from a large corpus.

...read moreread less

6

Proceedings Article•10.1109/ISUC.2008.37

Prosody Modeling from Tone to Intonation in Chinese using a Functional F0 Model

Jinfu Ni, +3 more

- 15 Dec 2008

TL;DR: This paper analyzes tonal patterns as sparse target points (tonal F0 peaks and valleys) and model them using classification and regression trees (CART) with contextual linguistic features to form the final F0 contours based on a functional F0 model.

...read moreread less

4

Proceedings Article•10.1145/3439231.3440617

Towards a Prosodic Model for Synthesized Speech of Mathematical Expressions in MathML

Adriana Silva Souza, +1 more

- 02 Dec 2020

TL;DR: In this article, a model to improve prosody in the synthesized speech of mathematical expressions based on MathML is presented, where the Fujisaki intonation model is adopted for intonATION control, accent and phrase commands have been extracted from the corpus, and some adjustments have been made to manipulate prosodic parameters in the speech in correlation with the MathML tree; additionally, a pattern of pauses control is being created.

...read moreread less

3

Proceedings Article•10.1109/ICASSP.2009.4960568

CART-based modeling of Chinese tonal patterns with a functional model tracing the fundamental frequency trajectories

Jinfu Ni, +3 more

- 19 Apr 2009

TL;DR: The most important roles in characterizing tonal patterns were played by a few linguistic features such as lexical tone context and the distinction between voiced from unvoiced initials.

...read moreread less

1

Proceedings Article•10.1145/1667780.1667860

Hyperbolic structure of fundamental frequency contour

Jinfu Ni, +3 more

- 03 Dec 2009

TL;DR: This paper achieves a generalized hyperbolic structure so as to aggressively manipulate F0 contours and proves an equivalent expression of the resonance mechanism capable for dealing with the interaction of tone and intonation.

...read moreread less

References

Journal Article•10.1109/TSA.2005.860774

The ATR Multilingual Speech-to-Speech Translation System

Satoshi Nakamura, +9 more

- 01 Dec 2006

- IEEE Transactions on Audio, Speech, and ...

TL;DR: The ATR multilingual speech-to-speech translation (S2ST) system, which is mainly focused on translation between English and Asian languages, uses a parallel multilingual database consisting of over 600 000 sentences that cover a broad range of travel-related conversations.

...read moreread less

182

Journal Article•10.1109/TASL.2006.876131

Conversational speech synthesis and the need for some laughter

Nick Campbell

- 01 Jul 2006

- IEEE Transactions on Audio, Speech, and ...

TL;DR: The problem of expressing paralinguistic information in conversational speech may be solved by the use of phrase-sized utterance units taken intact from a large corpus, the complexity of which may be beyond the capabilities of many current synthesis methods.

...read moreread less

55

Journal Article•10.1016/J.SPECOM.2006.01.002

Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin

Jinfu Ni, +1 more

- 01 Aug 2006

- Speech Communication

TL;DR: Analysis of 1044 utterances of various sentences read by eight native speakers revealed that the model could closely approximate the observed F 0 contours with a small number of parameters, which are localized and suited to a data-driven fitting process.

...read moreread less

28

Journal Article•10.1121/1.2165071

Constrained tone transformation technique for separation and combination of Mandarin tone and intonation.

Jinfu Ni, +2 more

- 28 Feb 2006

- Journal of the Acoustical Society of Ame...

TL;DR: The underlying scientific and linguistic principles are explained and the method's capability of separating and combining tone and intonation is evaluated through analysis and re-synthesis of several hundred observed F0 contours.

...read moreread less

24

Journal Article•10.1016/J.SPECOM.2005.03.017

Generation and perception of F0 markedness for communicative speech synthesis

Yoshinori Sagisaka, +2 more

- 01 Jul 2005

- Speech Communication

TL;DR: A computational model of conversational F 0 control is proposed using lexical information of adjectives showing positiveness or negativeness and adverbs expressing markedness, which shows strong positive or negative correlation between the markedness of adverbs and F 0 height.

...read moreread less

23

Frequency Modulation Technique for Prosodic Modification

Chat with Paper

AI Agents for this Paper

Figures

Citations

Conversational Speech Synthesis (and the need for some laughter)

Prosody Modeling from Tone to Intonation in Chinese using a Functional F0 Model

Towards a Prosodic Model for Synthesized Speech of Mathematical Expressions in MathML

CART-based modeling of Chinese tonal patterns with a functional model tracing the fundamental frequency trajectories

Hyperbolic structure of fundamental frequency contour

References

The ATR Multilingual Speech-to-Speech Translation System

Conversational speech synthesis and the need for some laughter

Quantitative and structural modeling of voice fundamental frequency contours of speech in Mandarin

Constrained tone transformation technique for separation and combination of Mandarin tone and intonation.

Generation and perception of F0 markedness for communicative speech synthesis

Related Papers (5)

The speech signal

Analytical Study on Fundamental Frequency Contours of Thai Expressive Speech Using Fujisaki's Model

Disfluent Speech Analysis and Synthesis: a preliminary approach.

The role of focus words in natural and in synthetic continuous speech: acoustic aspects

Role of prosody in cognitive process of spoken language