


default search action
5th SSW 2004: Pittsburgh, PA, USA
- Alan W. Black, Kevin A. Lenzo:

Fifth ISCA ITRW on Speech Synthesis, SSW 2004, Pittsburgh, PA, USA, June 14-16, 2004. ISCA 2004
Oral Sessions
- Antje Schweitzer, Norbert Braunschweiler, Grzegorz Dogil, Bernd Möbius:

Assessing the acceptability of the Smartkom speech synthesis voices. 1-6 - Jithendra Vepa, Simon King:

Subjective evaluation of join cost & smoothing methods. 7-12 - Erwin Marsi:

Optionality in evaluating prosody prediction. 13-18 - Yoshinori Shiga, Simon King:

Accurate spectral envelope estimation for articulation-to-speech synthesis. 19-24 - Alexander Kain, Xiaochuan Niu, John-Paul Hosom, Qi Miao, Jan P. H. van Santen:

Formant re-synthesis of dysarthric speech. 25-30 - Tomoki Toda, Alan W. Black, Keiichi Tokuda:

Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis. 31-36 - Toshio Hirai, Seiichi Tenpaku:

Using 5 ms segments in concatenative speech synthesis. 37-42 - Nobuo Nukaga, Ryota Kamoshida, Kenji Nagamatsu:

Unit selection using pitch synchronous cross correlation for Japanese concatenative speech synthesis. 43-48 - Ann K. Syrdal, Alistair Conkie:

Data-driven perceptually based join costs. 49-54 - Matthew P. Aylett:

Merging data driven and rule based prosodic models for unit selection TTS. 55-60 - Jan P. H. van Santen, Taniya Mishra, Esther Klabbers:

Estimating phrase curves in the general superpositional intonation model. 61-66 - Pablo Daniel Agüero, Antonio Bonafonte:

Intonation modeling for TTS using a joint extraction and prediction approach. 67-72 - Esther Klabbers, Jan P. H. van Santen:

Clustering of foot-based pitch contours in expressive speech. 73-78 - Ellen Eide, Andrew Aaron, Raimo Bakis, Wael Hamza, Michael A. Picheny, John F. Pitrelli:

A corpus-based approach to expressive speech synthesis. 79-84 - Guillaume Gibert, Gérard Bailly, Frédéric Elisei:

Audiovisual text-to-cued speech synthesis. 85-90
Poster Sessions
- Rachel Baker, Robert A. J. Clark, Michael White:

Synthesising contextually appropriate intonation in limited domains. 91-96 - Jelske Dijkstra, Louis C. W. Pols, R. J. J. H. van Son:

Frisian TTS, an example of bootstrapping TTS for minority languages. 97-102 - Sebsibe H. Mariam, S. Prahallad Kishore, Alan W. Black, Rohit Kumar, Rajeev Sangal:

Unit selection voice for Amharic using Festvox. 103-108 - Kalika Bali, Partha Pratim Talukdar, Sridhar Krishna Nemala, A. G. Ramakrishnan:

Tools for the development of a Hindi speech synthesis system. 109-114 - Hiroyuki Segi, Tohru Takagi, Takayuki Ito:

A concatenative speech synthesis method using context dependent phoneme sequences with variable length as search units. 115-120 - Justin Fackrell, Wojciech Skut:

Improving pronunciation dictionary coverage of names by modelling spelling variation. 121-126 - Yeon-Jun Kim, Ann K. Syrdal, Matthias Jilka:

Improving TTS by higher agreement between predicted versus observed pronunciations. 127-132 - Jerome R. Bellegarda:

A novel discontinuity metric for unit selection text-to-speech synthesis. 133-138 - Jordi Adell, Antonio Bonafonte:

Towards phone segmentation for concatenative speech synthesis. 139-144 - Joakim Gustafson, Kåre Sjölander:

Voice creation for conversational fairy-tale characters. 145-150 - Shinsuke Sakai:

F0 modeling with multi-layer additive modeling based on a statistical learning technique. 151-154 - John Kominek, Alan W. Black:

Impact of durational outlier removal from unit selection catalogs. 155-160 - Keikichi Hirose, Kentaro Sato, Nobuaki Minematsu:

Corpus-based synthesis of fundamental frequency contours with various speaking styles from text using F0 contour generation process model. 161-166 - Jianhua Tao, Yongguo Kang:

Multi-source based acoustic model for speech synthesis. 167-172 - Robert A. J. Clark, Korin Richmond, Simon King:

Festival 2 - build your own general purpose unit selection speech synthesiser. 173-178 - Hisashi Kawai, Tomoki Toda, Jinfu Ni, Minoru Tsuzaki, Keiichi Tokuda:

XIMERA: a new TTS from ATR based on corpus-based technologies. 179-184 - Fabio Tesser, Piero Cosi, Carlo Drioli, Graziano Tisato:

Prosodic data driven modelling of a narrative style in Festival TTS. 185-190 - Heiga Zen, Keiichi Tokuda, Tadashi Kitamura:

An introduction of trajectory model into HMM-based speech synthesis. 191-196 - Nemala Sridhar Krishna, Hema A. Murthy:

Duration modeling of Indian languages Hindi and Telugu. 197-202 - Jason Y. Zhang, Arthur R. Toth, Kevyn Collins-Thompson, Alan W. Black:

Prominence prediction for supersentential prosodic modeling based on a new database. 203-208 - Robert I. Damper, Yannick Marchand, John-David Marseters, Alex I. Bazin:

Aligning letters and phonemes for speech synthesis. 209-214
Short Contributions
- Peter Rutten, David Talkin:

rvoice studio and activeprompts. 215-216 - Leonardo Badino, Claudia Barolo, Silvia Quazza:

Language independent phoneme mapping for foreign TTS. 217-218 - Enrico Zovato, Alberto Pacchiotti, Silvia Quazza, Stefano Sandri:

Towards emotional speech synthesis: a rule based approach. 219-220 - Alejandro Renato, José A. Alvarez:

Corpora of latin american Spanish for research in prosody and synthesis. 221-222 - John Kominek, Alan W. Black:

The CMU Arctic speech databases. 223-224 - Arthur R. Toth:

Forced alignment for speech synthesis databases using duration and prosodic phrase breaks. 225-226 - Wentao Gu, Hiroya Fujisaki, Keikichi Hirose:

Analysis of fundamental frequency contours of Cantonese based on a command-response model. 227-228 - Brian Langner, Alan W. Black:

Creating a database of speech in noise for unit selection synthesis. 229-230

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID














