default search action
5th SSW 2004: Pittsburgh, PA, USA
- Alan W. Black, Kevin A. Lenzo:
Fifth ISCA ITRW on Speech Synthesis, SSW 2004, Pittsburgh, PA, USA, June 14-16, 2004. ISCA 2004
Oral Sessions
- Antje Schweitzer, Norbert Braunschweiler, Grzegorz Dogil, Bernd Möbius:
Assessing the acceptability of the Smartkom speech synthesis voices. 1-6 - Jithendra Vepa, Simon King:
Subjective evaluation of join cost & smoothing methods. 7-12 - Erwin Marsi:
Optionality in evaluating prosody prediction. 13-18 - Yoshinori Shiga, Simon King:
Accurate spectral envelope estimation for articulation-to-speech synthesis. 19-24 - Alexander Kain, Xiaochuan Niu, John-Paul Hosom, Qi Miao, Jan P. H. van Santen:
Formant re-synthesis of dysarthric speech. 25-30 - Tomoki Toda, Alan W. Black, Keiichi Tokuda:
Mapping from articulatory movements to vocal tract spectrum with Gaussian mixture model for articulatory speech synthesis. 31-36 - Toshio Hirai, Seiichi Tenpaku:
Using 5 ms segments in concatenative speech synthesis. 37-42 - Nobuo Nukaga, Ryota Kamoshida, Kenji Nagamatsu:
Unit selection using pitch synchronous cross correlation for Japanese concatenative speech synthesis. 43-48 - Ann K. Syrdal, Alistair Conkie:
Data-driven perceptually based join costs. 49-54 - Matthew P. Aylett:
Merging data driven and rule based prosodic models for unit selection TTS. 55-60 - Jan P. H. van Santen, Taniya Mishra, Esther Klabbers:
Estimating phrase curves in the general superpositional intonation model. 61-66 - Pablo Daniel Agüero, Antonio Bonafonte:
Intonation modeling for TTS using a joint extraction and prediction approach. 67-72 - Esther Klabbers, Jan P. H. van Santen:
Clustering of foot-based pitch contours in expressive speech. 73-78 - Ellen Eide, Andrew Aaron, Raimo Bakis, Wael Hamza, Michael A. Picheny, John F. Pitrelli:
A corpus-based approach to expressive speech synthesis. 79-84 - Guillaume Gibert, Gérard Bailly, Frédéric Elisei:
Audiovisual text-to-cued speech synthesis. 85-90
Poster Sessions
- Rachel Baker, Robert A. J. Clark, Michael White:
Synthesising contextually appropriate intonation in limited domains. 91-96 - Jelske Dijkstra, Louis C. W. Pols, R. J. J. H. van Son:
Frisian TTS, an example of bootstrapping TTS for minority languages. 97-102 - Sebsibe H. Mariam, S. Prahallad Kishore, Alan W. Black, Rohit Kumar, Rajeev Sangal:
Unit selection voice for Amharic using Festvox. 103-108 - Kalika Bali, Partha Pratim Talukdar, Sridhar Krishna Nemala, A. G. Ramakrishnan:
Tools for the development of a Hindi speech synthesis system. 109-114 - Hiroyuki Segi, Tohru Takagi, Takayuki Ito:
A concatenative speech synthesis method using context dependent phoneme sequences with variable length as search units. 115-120 - Justin Fackrell, Wojciech Skut:
Improving pronunciation dictionary coverage of names by modelling spelling variation. 121-126 - Yeon-Jun Kim, Ann K. Syrdal, Matthias Jilka:
Improving TTS by higher agreement between predicted versus observed pronunciations. 127-132 - Jerome R. Bellegarda:
A novel discontinuity metric for unit selection text-to-speech synthesis. 133-138 - Jordi Adell, Antonio Bonafonte:
Towards phone segmentation for concatenative speech synthesis. 139-144 - Joakim Gustafson, Kåre Sjölander:
Voice creation for conversational fairy-tale characters. 145-150 - Shinsuke Sakai:
F0 modeling with multi-layer additive modeling based on a statistical learning technique. 151-154 - John Kominek, Alan W. Black:
Impact of durational outlier removal from unit selection catalogs. 155-160 - Keikichi Hirose, Kentaro Sato, Nobuaki Minematsu:
Corpus-based synthesis of fundamental frequency contours with various speaking styles from text using F0 contour generation process model. 161-166 - Jianhua Tao, Yongguo Kang:
Multi-source based acoustic model for speech synthesis. 167-172 - Robert A. J. Clark, Korin Richmond, Simon King:
Festival 2 - build your own general purpose unit selection speech synthesiser. 173-178 - Hisashi Kawai, Tomoki Toda, Jinfu Ni, Minoru Tsuzaki, Keiichi Tokuda:
XIMERA: a new TTS from ATR based on corpus-based technologies. 179-184 - Fabio Tesser, Piero Cosi, Carlo Drioli, Graziano Tisato:
Prosodic data driven modelling of a narrative style in Festival TTS. 185-190 - Heiga Zen, Keiichi Tokuda, Tadashi Kitamura:
An introduction of trajectory model into HMM-based speech synthesis. 191-196 - Nemala Sridhar Krishna, Hema A. Murthy:
Duration modeling of Indian languages Hindi and Telugu. 197-202 - Jason Y. Zhang, Arthur R. Toth, Kevyn Collins-Thompson, Alan W. Black:
Prominence prediction for supersentential prosodic modeling based on a new database. 203-208 - Robert I. Damper, Yannick Marchand, John-David Marseters, Alex I. Bazin:
Aligning letters and phonemes for speech synthesis. 209-214
Short Contributions
- Peter Rutten, David Talkin:
rvoice studio and activeprompts. 215-216 - Leonardo Badino, Claudia Barolo, Silvia Quazza:
Language independent phoneme mapping for foreign TTS. 217-218 - Enrico Zovato, Alberto Pacchiotti, Silvia Quazza, Stefano Sandri:
Towards emotional speech synthesis: a rule based approach. 219-220 - Alejandro Renato, José A. Alvarez:
Corpora of latin american Spanish for research in prosody and synthesis. 221-222 - John Kominek, Alan W. Black:
The CMU Arctic speech databases. 223-224 - Arthur R. Toth:
Forced alignment for speech synthesis databases using duration and prosodic phrase breaks. 225-226 - Wentao Gu, Hiroya Fujisaki, Keikichi Hirose:
Analysis of fundamental frequency contours of Cantonese based on a command-response model. 227-228 - Brian Langner, Alan W. Black:
Creating a database of speech in noise for unit selection synthesis. 229-230
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.