default search action
EUROSPEECH 1991: Genova, Italy
- Second European Conference on Speech Communication and Technology, EUROSPEECH 1991, Genova, Italy, September 24-26, 1991. ISCA 1991
Plenary
- Sadaoki Furui:
Recent advances in speech recognition. 3-12 - Frank Fallside:
On the acquisition of speech by machines, ASM. 13-14
Continuous Speech Recognition
- Padma Ramesh, Jay G. Wilpon, Maureen A. McGee, David B. Roe, Chin-Hui Lee, Lawrence R. Rabiner:
Speaker independent recognition of spontaneously spoken connected digits. 17-20 - P. S. Gopalakrishnan, David Nahamoo:
Immediate recognition of embedded command words. 21-24 - Lynn Wilcox, Marcia A. Bush:
HMM-based wordspotting for voice editing and indexing. 25-28 - Janet M. Baker:
Large vocabulary speaker-adaptive continuous speech recognition research overview at dragon systems. 29-32 - Victoria Sgardoni, Dimitrios A. Gaganelis, Eleftherios D. Frangoulis:
Continuous density HMM context dependent phones for speech recognition over the telephone. 33-36
Segmental Speech Synthesis
- Katsuhiko Shirai, Kazuo Hashimoto, Tetsunori Kobayashi:
Text-to-speech synthesizer using superposition of sinusoidal waves generated by synchronized oscillators. 39-42 - M. Guerti, Gérard Bailly:
Synthesis-by-rule using compost: modelling resonance trajectories. 43-46 - Yasushi Ishikawa, Kunio Nakajima:
Neural network based spectral interpolation method for speech synthesis by rule. 47-50 - Martine Garnier-Rizet:
A rule-based segmental synthesis module for French. 51-54
Human Factors
- Norman M. Fraser, G. Nigel Gilbert:
Effects of system voice quality on user utterances in speech dialogue systems. 57-60 - P. Day, Andreas Grünupp, Klaus-Peter Muthig:
A human factors study of speech-to-text technology: consequences of discrete speech. 61-64 - Iain R. Murray, John L. Arnott, Alan F. Newell:
A comparison of document composition using a listening typewriter and conventional office systems. 65-68 - Paulus H. Vossen:
Evaluating speech input and output in a CAD-system using the hidden-operator method. 69-72 - Mary Zajicek, Jill Hewitt:
Mixed mode input for a standard wordprocessor. investigating links between input mode, speech and keyboard, and specific task areas. 73-76
Robust Isolated Word Recognition
- Philip Lockwood, Jérôme Boudy:
Experiments with a non-linear spectral subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars. 79-82 - Philip Lockwood, C. Baillargeat, J. M. Gillot, Jérôme Boudy, Gérard Faucon:
Noise reduction for speech enhancement in cars: non-linear spectral subtraction / kalman filtering. 83-86 - Klaus Fellbaum, Dieter Becker:
Isolated word recognition with integrated noise reduction. 87-90 - Javier Hernando, Climent Nadeu:
A comparative study of parameters and distances for noisy speech recognition. 91-94
Neural Nets: Phonetic Features, Phoneme Recognition, and Time Alignment
- Jorma Laaksonen:
A new reliability-based phoneme segmentation method for the "neural" phonetic typewriter. 97-100 - Bruno Apolloni, Francesco Pazienti, Vincenzo Trotta:
Isolated word adaptive recognizer based on neural networks. 101-104 - Nobuo Hataoka, Alex Waibel:
Evaluation of speaker-independent phoneme recognition on TIMIT database using TDNNs. 105-108 - Nelson Morgan, Hervé Bourlard, Chuck Wooters, Phil Kohn, Michael Cohen:
Phonetic context in hybrid HMM/MLP continuous speech recognition. 109-112 - E. C. Andrews, John S. Mason:
Neural network classification of complex-valued speech features. 113-116 - Dennis Norris:
Rewiring lexical networks on the fly. 117-120 - Kjell Elenius, G. Takacs:
Phoneme recognition with an artificial neural network. 121-124 - Jianxin Jiang, Kechu Yi, Zheng Hu:
A new self-organization algorithm of forming a phoneme map. 125-128 - Shuping Ran, J. Bruce Millar:
Phoneme classification using neural networks based on acoustic-phonetic structure. 129-132 - Nigel Dodd, Donald MacFarlane, Chris Marland:
Networks for speech recognition structurally optimised by genetic techniques implemented on parallel hardware. 133-136
Phonetics I, II
- Jeff Pittam, John Ingram:
Influence of vietnamese tone and prosody on the acquisition of English stress patterns. 139-142 - Walter F. Sendlmeier:
The voiced/unvoiced distinction of initial stops by normal and hearing impaired listeners. 143-146 - Krishna S. Nathan:
Comparison of formant transition based stop classifiers: time-varying and time-invariant signal models. 147-150 - Christian Benoît, Christian Abry, L. J. Roe:
The effect of context on labiality in French. 151-156 - A. K. Datta, N. R. Ganguli, B. Mukherjee:
Nasalisation in bengali speech sounds acoustic-phonetic study. 157-160 - N. R. Ganguli:
Vowel formant frequency distribution of a major indian language. 161-164 - Bernard Harmegnies, Marielle Bruyninckx, Joaquim Llisterri, Dolors Poch:
Effects of language change on voice quality in bilingual speakers, corpus content effect. 165-168 - T. I. Shevchenko, T. S. Skopintseva:
Effects of social and regional backgrounds on LTAS in british English. 169-172 - Henk van den Heuvel, Bert Cranen, Toni C. M. Rietveld:
Speaker related variability in the durations of dutch speech segments. 251-254 - Johan Liljencrants:
Numerical simulations of glottal flow. 255-258 - Joop Jansen, Bert Cranen, Louis Boves:
Modelling of source characteristics of speech sounds by means of the LF-model. 259-262 - Hanspeter Herzel, J. Wendler:
Evidence of chaos in phonatory samples. 263-266 - Van Loan Trinh, Bernard Guérin, Eric Castelli:
Source-tract coupling and the subglottal system in an articulatory synthesizer. 267-270
Multilingual Speech Recognition Systems (Special Session)
- Paul G. Bamberg, Anne Demedts, John Elder, Caroline B. Huang, Charles Ingold, Mark A. Mandel, Linda Manganaro, Stijn Van Even:
Phoneme-based training for large-vocabulary recognition in six european languages. 175-182 - Helene Cerf-Danon, Steven DeGennaro, Marco Ferretti, Jorge Gonzalez, Eric Keppel:
1.0 TANGORA - a large vocabulary speech recognition system for five languages. 183-192 - Hermann Ney, Roberto Billi:
Prototype systems for large-vocabulary speech recognition: polyglot and spicos. 193-200
Spoken Language Parsing
- J. H. Wright:
Adaptation of grammar-based language models for continuous speech recognition. 203-206 - Keh-Yih Su, Tung-Hui Chiang, Yi-Chung Lin:
A robustness and discrimination oriented score function for integrating speech and language processing. 207-210 - Paolo Baggia, Lorenzo Fissore, Elisabetta Gerbino, Egidio P. Giachin, Claudio Rullent:
Improving speech understanding performance through feedback verification. 211-214 - Anna Corazza, Renato de Mori, Roberto Gretter, Giorgio Satta:
Computation of upper-bounds for island-driven stochastic parsers. 215-218 - François Andry, J. H. Simon Thornton:
A parser for speech lattices using a UCG grammar. 219-222 - Sheryl Young, Michael Matessa:
Using pragmatic and semantic knowledge to correct parsing of spoken language utterances. 223-227
Speech Coding I-IV
- Arnaldo J. Abrantes, Jorge S. Marques, Isabel Trancoso:
Hybrid sinusoidal modeling of speech without voicing decision. 231-234 - Jorge S. Marques, Isabel Trancoso, Arnaldo J. Abrantes:
Harmonic coding of speech: an experimental study. 235-238 - David Rowe, William G. Cowley, Andrew Perkis:
A multiband excitation linear predictive speech coder. 239-242 - Shu Hung Leung, K. L. Lai, O. Y. Wong, Andrew Luk:
A new coded excitation model using multifrequency decomposition. 245-248 - Daniele Sereno:
Frame substitution and adaptive post-filtering in speech coding. 595-598 - S. A. Atungsiri, R. Soheili, Ahmet M. Kondoz, Barry G. Evans:
Effective lost speech frame reconstruction for CELP coders. 599-602 - Hiromi Nagabuchi, Nobuhiko Kitawaki:
Evaluation and improvement of coded speech quality degraded by cell loss in ATM networks. 603-606 - Alain J. Vigier:
Combined source-channel coding for a very noisy channed. 607-610 - G. Rosina, Marcello Sant' Agostino, E. Turco, Luigi Vetrano:
Testing and quality enhancement of the GSM full rate voice channel. 611-614 - U. Kipper, Herbert Reininger, Dietrich Wolf:
Low bit rate speech coding using CELP with adaptive excitation codebook. 893-896 - Arild Fuldseth, Erik Harborg, Finn Tore Johansen, Jan E. Knudsen:
A real-time implementable 7 khz speech coder at 16 kbit/s. 897-900 - D. J. Zarkadis:
Adaptive spectral weighting for vector predictive coding of the LPC-spectra. 901-904 - Samir Saoudi, Jean-Marc Boucher, Alain Le Guyader:
Medium band speech coding using optimal scalar quantization of LSP. 905-908 - Philip Secker, Andrew Perkis:
Joint source and channel coding of line spectrum pairs. 909-912 - C. F. Chan, K. W. Law:
An algorithm for computing LSP frequencies directly from the reflection coefficients. 913-916 - Peter Meyer, W. Peters, Jürgen Paulus:
Variable rate speech coding using perceptive thresholds and adaptive VUS detection. 809-812 - M. R. Suddle, S. A. Atungsiri, Ahmet M. Kondoz, Barry G. Evans:
A secure and robust CELP coder for land and satellite mobile systems. 813-816 - Carlos M. Ribeiro, Isabel Trancoso:
A 4.8 kbps celp coder with post-processing. 817-820 - K. W. Law, O. Y. Wong, C. F. Chan:
A real-time high quality joint-excitation linear predictive coder at 8 kbps. 821-824 - Rosario Drogo de Iacovo, Roberto Montagna:
Some experiments in perceptual masking of quantizing noise in analysis-by-synthesis speech coders. 825-828 - Gao Yang, Henri Leich, René Boite:
A very high-quality CELP coder at the rate of 2400 bps. 829-832 - Z. Yong Liu:
An effective pulse adaptive code-excited linear predictive coder at 4kb/S. 835-838 - C. F. Chan, S. H. Leung:
A vocoder using high-order LPC filter with very few non-zero coefficients. 839-842
Assessment, Intelligibility and Aids for Disabled
- Mario Rossi, Robert Espesser, Chaslav Pavlovic:
The effects of in internal reference system and cross-modality matching on the subjective rating of speech synthesisers. 273-276 - H. A. Sydeserff, R. J. Caley, Stephen D. Isard, Mervyn A. Jack, Alex I. C. Monaghan, Jo Verhoeven:
Evaluation of speech synthesis techniques in a comprehension task. 277-280 - P. A. Howard-Jones:
'SOAP' - a speech output assessment package for controlled multilingual evaluation of synthetic speech. 281-284 - Tammo Houtgast, Jan A. Verhave:
A physical approach to speech quality assessment: correlation patterns in the speech spectrogram. 285-288 - Hiroyuki Miyata, Tammo Houtgast:
Weighted MTF for predicting speech intelligibility in reverberant sound fields. 289-292 - Ute Jekosch:
Speech intelligibility studies for the european hermes spaceplane. 293-296 - Jianing Wei, Andrew Faulkner, Adrian Fourcin:
An application of speech processing and encoding scheme for Chinese lexical tone and consonant perception by hearing impaired listeners. 299-302 - Dimitri Kanevsky, P. Gopalakrishan, Catalina Danis, Gregg Daggett, Edward A. Epstein, David Nahamoo:
On the development of a phone communication aid for the hearing impaired. 303-306 - Yolande Anglade, Jean-Marie Pierrel, Jean-Claude Junqua:
A spoken language interface for a telephone switchboard operator center. 307-310 - Iain R. Murray, John L. Arnott, Norman Alm, Alan F. Newell:
A communication system for the disabled with emotional synthetic speech produced by rule. 311-314
Speech Synthesis: Techniques and Applications
- Thomas Portele, Birgit Steffan, Rainer Preuß, Wolfgang Hess:
German speech synthesis by concatenation of non-parametric units. 317-320 - Giuseppe Abbattista, Antonello Riccio, Enzo Mumolo:
Automatic document reader with speech output capabilities. 321-324 - Robin W. King:
Tools and processes for developing low-cost and high-quality text-to-speech synthesis for communication aids. 325-329 - Hynek Hermansky, Louis Anthony Cox Jr.:
Perceptual linear predictive (PLP) analysis-resynthesis technique. 329-332 - Reinhold Greisbach, Bernd J. Kröger, O. Esser, G. Plaßmann:
A display technique for measurements of natural and synthetic articulatory dynamics. 333-336 - Yueh-Chin Chang, Yi-Fan Lee, Bang-Er Shia, Hsiao-Chuan Wang:
Statistical models for the Chinese text-to-speech system. 337-340 - P. A. Taylor, I. A. Nairn, Andrew M. Sutherland, Mervyn A. Jack:
A realtime speech synthesis system. 341-344 - Hélène Valbret, Eric Moulines, Jean-Pierre Tubach:
Voice tranformation using PSOLA technique. 345-348 - Massimo Giustiniani, Piero Pierucci:
Phonetic ergodic HMM for speech synthesis. 349-352 - Cristina Delogu, P. Paoloni, Paolo Pocci, Ciro Sementina:
Quality evaluation of text-to-speech synthesizers using magnitude estimation, categorical estimation, pair comparison and reaction time methods. 353-356 - H. Zingte, Cl. Hennebois:
Helping young children to associate sounds and letters through speech synthesis. 357-360 - Hervé Bourlard:
Neural nets and hidden Markov models: review and generalizations. 363-369 - Nikil S. Jayant, James D. Johnston, Yair Shoham:
Coding of wideband speech. 373-379
Probabilistic Language Models for Speech Recognition
- Roberto Pieraccini, Esther Levin:
Stochastic representation of semantic structure for speech understanding. 383-386 - Colin Matheson, Fergus R. McInnes:
Incorporating probabilities into the dualgram language model. 387-390 - Egidio P. Giachin:
A dynamic programming based framework for stochastic spoken language understanding. 391-394 - Natividad Prieto, Enrique Vidal:
Learning language models through the ECGI method. 395-398 - Roberto Cremonini, Marco Ferretti, M. C. Galimberti, Giulio Maltese, Federico Mancini:
Using a generative grammar to train a probabilistic language model for speaker-independent speech recognition. 399-402
Speech Recognition and Phonetic Modelling
- Katsuhiko Shirai, Eiichiro Kitagawa, T. Endo:
Optimal construction of context sensitive quantizer for phoneme recognition in continuous speech. 405-408 - Mary O'Kane, P. E. Kenne, D. Landy, S. Atkins:
Generalising from single-speaker recognition in a feature-based recogniser. 409-412 - Hans-Günter Hirsch, Peter Meyer, Hans-Wilhelm Rühl:
Improved speech recognition using high-pass filtering of subband envelopes. 413-416 - Yifan Gong, Jean Paul Haton:
Comparing two phoneme identification methods using a continuous speech recognizer. 417-420 - D. Ederveen, Louis Boves:
Knowledge-based phoneme recognition. 421-424
Speaker Identification and Verification
- J. Kraayeveld, A. C. M. Rietveld, Vincent J. van Heuven:
Speaker characterization in dutch using prosodic parameters. 427-430 - Alan K. Hunt:
New commercial applications of telephone-network-based speech recognition and speaker verification. 431-434 - Jean-François Bonastre, Henri Meloni, Philippe Langlais:
Analytical strategy for speaker identification. 435-438 - L. Xu, John S. Mason:
Optimization of perceptually-based spectral transforms in speaker identification. 439-442
Pitch Determination and Voice Separation
- Alain de Cheveigné:
A mixed speech F0 estimation algorithm. 445-448 - Edward Jones, Eliathamby Ambikairajah:
A perceptually-based pitch extractor for band-limited speech. 449-452 - Yu-Hua Gu:
A robust pseudo perceptual pitch estimator. 453-456 - Neviano Dal Degan, Marco Fratti:
Pitch estimation based on a "narrowed" autocorrelation function. 457-460
Speech Recognition: Understanding Systems
- Seiichi Nakagawa, Yoshimitsu Hirata, Isao Murase:
The syntax-oriented spoken Japanese understanding system SPOJOS-SYNO II. 463-466 - Henning Bergmann, Hans-Hermann Hamer, Andreas Noll, Annedore Paeseler, Horst Tomaschewski:
An adaptable man-machine interface using connected-word recognition. 467-470 - M. J. Poza, Celinda de la Torre, Daniel Tapias, Luis Villarrubia:
An approach to automatic recognition of keywords in unconstrained speech using parametric models. 471-474 - I. Lee Hetherington, Hong C. Leung, Victor W. Zue:
Toward vocabulary-independent recognition of telephone speech. 475-478 - Ronald A. Cole, Krist Roginski, Mark A. Fanty:
English alphabet recognition with telephone speech. 479-482 - Jean-Yves Fiset, Jean-Marc Robert, Raymond Descout:
Evolutionary language models in air traffic control training. 483-486 - Gareth J. F. Jones, Jeremy H. Wright, E. N. Wrigley, Michael J. Carey, Eluned S. Parris:
Isolated-word sentence recognition using probabilistic context-free grammar. 487-489