


ICASSP 2003: Hong Kong
- 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP '03, Hong Kong, April 6-10, 2003. IEEE 2003, ISBN 0-7803-7663-3
Volume 1
Keynotes
- Nikil Jayant:
Pervasive broadband: opportunities for signal processing. 1 - Ya-Qin Zhang:
Advances in networked media - theory and practice. 2 - Georgios B. Giannakis:
Ultra-wideband communications: an idea whose time has come. 3
Acoustic Modeling for Robust ASR
- Bryan L. Pellom, Kadri Hacioglu:
Recent improvements in the CU Sonic ASR system for noisy speech: the SPINE task. 4-7 - Wei-Tyng Hong:
A discriminative and robust training algorithm for noisy speech recognition. 8-11 - Xiaodong Cui, Yifan Gong:
Variable parameter Gaussian mixture hidden Markov modeling for speech recognition. 12-15 - Takehito Utsuro, Yasuhiro Kodama, Tomohiro Watanabe, Hiromitsu Nishizaki, Seiichi Nakagawa:
Confidence of agreement among multiple LVCSR models and model combination by SVM. 16-19 - Todd A. Stephenson, Mathew Magimai-Doss, Hervé Bourlard:
Speech recognition of spontaneous, noisy speech using auxiliary information in Bayesian networks. 20-23 - Ashutosh Garg, Gerasimos Potamianos, Chalapathy Neti, Thomas S. Huang:
Frame-dependent multi-stream reliability indicators for audio-visual speech recognition. 24-27
Language ID
- Sonia Parandekar, Katrin Kirchhoff:
Multi-stream language identification using data-driven dependency selection. 28-31 - A. K. V. Sai Jayram, V. Ramasubramanian, Thippur V. Sreenivas:
Language identification using parallel sub-word recognition. 32-35 - Qian-Rong Gu, Tadashi Shibata:
Speaker and text independent language identification using predictive error histogram vectors. 36-39 - Jean-Luc Rouas, Jérôme Farinas, François Pellegrino, Régine André-Obrecht:
Modeling prosody for language identification on read and spontaneous speech. 40-43 - Eddie Wong, Sridha Sridharan:
Three approaches to multilingual phone recognition. 44-47 - Jilei Tian, Janne Suontausta:
Scalable neural network based language identification from written text. 48-51
Novel Feature Extraction and Processing
- Panu Somervuo:
Experiments with linear and nonlinear feature transformations in HMM based phone recognition. 52-55 - Sunil Sivadas, Hynek Hermansky:
Generalized tandem feature extraction. 56-59 - Andrew C. Lindgren, Michael T. Johnson, Richard J. Povinelli:
Speech recognition using reconstructed phase space features. 60-63 - Bojana Gajic, Kuldip K. Paliwal:
Robust speech recognition using features based on zero crossings with peak amplitudes. 64-67 - Hema A. Murthy, Venkata Gadde:
The modified group delay function and its application to phoneme recognition. 68-71 - Jinfu Ni, Hisashi Kawai:
Tone feature extraction through parametric modeling and analysis-by-synthesis-based pattern matching. 72-75
Speech Enhancement I
- Jong Uk Kim, Sang-Gyun Kim, Chang D. Yoo:
The incorporation of masking threshold to subspace speech enhancement. 76-79 - Lee Lin, W. Harvey Holmes, Eliathamby Ambikairajah:
Subband noise estimation for speech enhancement using a perceptual Wiener filter. 80-83 - Justinian Rosca, Radu V. Balan, Christophe Beaugeant:
Multi-channel psychoacoustically motivated speech enhancement. 84-87 - Steven J. Rennie, Parham Aarabi, Trausti T. Kristjansson, Brendan J. Frey, Kannan Achan:
Robust variational speech separation using fewer microphones than speakers. 88-91 - Tomohiro Nakatani, Masato Miyoshi:
Blind dereverberation of single channel speech signal based on harmonic structure. 92-95 - Marcin Kuropatwinski, W. Bastiaan Kleijn:
Minimum mean square error estimation of speech short-term predictor parameters under noisy conditions. 96-99
Packet Loss and Channel Coding
- Jonas Lindblom, Per Hedelin:
Error protection and packet loss concealment based on a signal matched sinusoidal vocoder. 100-103 - Christoffer Asgaard Rødbro, Mads Græsbøll Christensen, Søren Vang Andersen, Søren Holdt Jensen:
Compressed domain packet loss concealment of sinusoidally coded speech. 104-107 - Philippe Gournay, François Rousseau, Roch Lefebvre:
Improved packet loss recovery using late frames for prediction-based speech coders. 108-111 - Costas S. Xydeas, Fotis Zafeiropoulos:
Model-based packet loss concealment for AMR coders. 112-115 - Moon-Keun Lee, Sung-Kyo Jung, Hong-Goo Kang, Young-Cheol Park, Dae Hee Youn:
A packet loss concealment algorithm based on time-scale modification for CELP-type speech coders. 116-119 - Anand D. Subramaniam, William R. Gardner, Bhaskar D. Rao:
Joint source-channel decoding of speech spectrum parameters over erasure channels using Gaussian mixture models. 120-123
Acoustic Modeling: Survey of New Techniques
- Yasuhiro Minami, Erik McDermott, Atsushi Nakamura, Shigeru Katagiri:
Recognition method with parametric trajectory generated from mixture distribution HMMs. 124-127 - John W. McDonough, Alex Waibel:
Maximum mutual information speaker adapted training with semi-tied covariance matrices. 128-131 - Xunying Liu, Mark J. F. Gales, Philip C. Woodland:
Automatic complexity control for HLDA systems. 132-135 - Vlasios Doumpiotis, Stavros Tsakalidis, William Byrne:
Discriminative training for segmental minimum Bayes risk decoding. 136-139 - Tetsuji Ogawa, Tetsunori Kobayashi:
Hybrid modeling of PHMM and HMM for speech recognition. 140-143 - Sebastian Stüker, Tanja Schultz, Florian Metze, Alex Waibel:
Multilingual articulatory features. 144-147
Speech Modeling and Analysis
- Ashraf Alkhairy:
Mathematical models of vocal tract with distributed sources. 148-151 - Paavo Alku, Tom Bäckström:
All-pole modeling of wide-band speech with symmetric linear prediction. 152-155 - Karl Schnell, Arild Lacroix:
Generation of nasalized speech sounds based on branched tube models obtained from separate mouth and nose outputs. 156-159 - Mark Thomson, Simon Boland, Mike Wu, Julien Epps, Michael Smithers:
Decomposition of speech into voiced and unvoiced components based on a state-space signal model. 160-163 - Ramon Prieto, Sora Kim:
Time delay estimation and adaptive frame length iterations for noise robust pitch extraction. 164-167 - Yu Shi, Eric Chang:
Spectrogram-based formant tracking via particle filters. 168-171
New Methods for Speaker Recognition, Segmentation, and Implementation
- Masafumi Nishida, Tatsuya Kawahara:
Unsupervised speaker indexing using speaker model selection based on Bayesian information criterion. 172-175 - Guillaume Lathoud, Iain A. McCowan:
Location based speaker segmentation. 176-179 - Yassine Mami, Delphine Charlet:
Speaker identification by anchor models with PCA/LDA post-processing. 180-183 - Phu Chien Nguyen, Masato Akagi, Tu Bao Ho:
Temporal decomposition: a promising approach to VQ-based speaker identification. 184-187 - LiFeng Sang, Zhaohui Wu, Yingchun Yang, Wanfeng Zhang:
Automatic speaker recognition using dynamic Bayesian network. 188-191 - Chengyuan Ma, Eric Chang:
Comparison of discriminative training methods for speaker verification. 192-195
Large Vocabulary Speech Recognition
- Gustavo Hernández Ábrego, Xavier Menéndez-Pidal, Thomas Kemp, Katsuki Minamino, Helmut Lucke:
Automatic set-up for speech recognition engines based on merit optimization. 196-199 - Miroslav Novak, Radek Hampl, Pavel Krbec, Vladimír Bergl, Jan Sedivý:
Two-pass search strategy for large list recognition on embedded speech recognition platforms. 200-203 - Sabine Deligne, Lidia Mangu:
On the use of lattices for the automatic generation of pronunciations. 204-207 - Dimitra Vergyri, Andreas Stolcke, Venkata Ramana Rao Gadde, Luciana Ferrer, Elizabeth Shriberg:
Prosodic knowledge sources for automatic speech recognition. 208-211 - Jean-Luc Gauvain, Lori Lamel, Holger Schwenk, Gilles Adda, Langzhou Chen, Fabrice Lefèvre:
Conversational telephone speech recognition. 212-215 - Bhuvana Ramabhadran, Jing Huang, Michael Picheny:
Towards automatic transcription of large spoken archives - English ASR for the MALACH project. 216-219
Unsupervised Language Model Adaption
- Langzhou Chen, Jean-Luc Gauvain, Lori Lamel, Gilles Adda:
Unsupervised language model adaptation for broadcast news. 220-223 - Michiel Bacchiani, Brian Roark:
Unsupervised language model adaptation. 224-227 - Takaaki Hori, Daniel Willett, Yasuhiro Minami:
Language model adaptation using WFST-based speaking-style translation. 228-231 - Erwin Leeuwis, Marcello Federico, Mauro Cettolo:
Language modeling and transcription of the TED corpus lectures. 232-235 - Tadasuke Yokoyama, Takahiro Shinozaki, Koji Iwano, Sadaoki Furui:
Unsupervised class-based language model adaptation for spontaneous speech recognition. 236-239 - Wen Wang, Mary P. Harper, Andreas Stolcke:
The robustness of an almost-parsing language model given errorful training data. 240-243
Speech Synthesis Overview
- Jerome R. Bellegarda:
Unsupervised, language-independent grapheme-to-phoneme conversion by latent analogy. 244-247 - Matthias Eichner, Steffen Werner, Matthias Wolff, Rüdiger Hoffmann:
Towards spontaneous speech synthesis - LM based selection of pronunciation variants. 248-251 - Ki-Seung Lee, Jeongsu Kim:
Context-adaptive phone boundary refining for a TTS database. 252-255 - Hideki Kawahara, Hisami Matsui:
Auditory morphing based on an elastic perceptual distance metric in an interference-free time-frequency representation. 256-259 - Matthew Lee, Mark J. T. Smith:
Spectral modification for digital singing voice synthesis using asymmetric generalized Gaussians. 260-263 - Min Chu, Hu Peng, Yong Zhao, Zhengyu Niu, Eric Chang:
Microsoft Mulan - a bilingual TTS system. 264-267
Spoken Language Understanding
- Yulan He, Steve J. Young:
Hidden vector state model for hierarchical semantic parsing. 268-271 - Anand Venkataraman, Luciana Ferrer, Andreas Stolcke, Elizabeth Shriberg:
Training a prosody-based dialog act tagger from unlabeled data. 272-275 - Gökhan Tür, Robert E. Schapire, Dilek Hakkani-Tür:
Active learning for spoken language understanding. 276-279 - Ciprian Chelba, Milind Mahajan, Alex Acero:
Speech utterance classification. 280-283 - Ye-Yi Wang, Alex Acero:
Concept acquisition in example-based grammar authoring. 284-287 - Juan M. Huerta, David M. Lubensky:
Graph-based representation and techniques for NLU application development. 288-291
Speaker Adaption
- Daniel Willett, Thomas Niesler, Erik McDermott, Yasuhiro Minami, Shigeru Katagiri:
Pervasive unsupervised adaptation for lecture speech transcription. 292-295 - Kyung-Tak Lee, Lynette Melnar, Jim Talley, Christian Wellekens:
Symbolic speaker adaptation with phone inventory expansion. 296-299 - Guo-Hong Ding, Bo Xu, Juha Iso-Sipilä, Yang Cao:
Fast speaker adaptation using triple diagonal and shared block diagonal transform matrices. 300-303 - Dong Kook Kim, Young Joon Kim, Woohyung Lim, Nam Soo Kim:
Online adaptation using transformation space model evolution. 304-307 - Bowen Zhou, John H. L. Hansen:
Discriminative acoustic model using eigenspace mapping for rapid speaker adaptation. 308-311 - Daniel Povey, Philip C. Woodland, Mark J. F. Gales:
Discriminative map for acoustic model adaptation. 312-315
Robust ASR in Mobile and Distributed Environments
- Richard C. Rose, Iker Arizmendi, Sarangarajan Parthasarathy:
An efficient framework for robust mobile speech recognition services. 316-319 - Luca Cristoforetti, Marco Matassoni, Maurizio Omologo, Piergiorgio Svaizer:
Use of parallel recognizers for robust in-car speech interaction. 320-323 - Hideki Banno, Tetsuya Shinde, Kazuya Takeda, Fumitada Itakura:
In-car speech recognition using distributed microphones-adapting to automatically detected driving conditions. 324-327 - Kadri Hacioglu, Bryan L. Pellom:
A distributed architecture for robust automatic speech recognition. 328-331 - Jan Stadermann, Gerhard Rigoll:
Flexible feature extraction and HMM design for a hybrid distributed speech recognition system in noisy environments. 332-335 - Zheng-Hua Tan, Paul Dalsgaard, Børge Lindberg:
OOV-detection and channel error protection for distributed speech recognition over wireless networks. 336-339
Language Modelling and Large Vocabulary Recognition
- Shoichi Matsunaga, Atsunori Ogawa, Yoshikazu Yamaguchi, Akihiro Imamura:
Non-native English speech recognition using bilingual English lexicon and acoustic models. 340-343 - Katrin Kirchhoff, Jeff A. Bilmes, Sourin Das, Nicolae Duta, Melissa Egan, Gang Ji, Feng He, John Henderson, Daben Liu, Mohammed Noamany, Patrick Schone, Richard M. Schwartz, Dimitra Vergyri:
Novel approaches to Arabic speech recognition: report from the 2002 Johns-Hopkins Summer Workshop. 344-347 - Renato De Mori, Frédéric Béchet, Gérard Subsol, Dominique Massonié:
Dynamic scheduling of decoding processes for directory assistance. 348-351 - Cyril Allauzen, Mehryar Mohri:
Generalized optimization algorithm for speech recognition transducers. 352-355 - Diamantino Caseiro, Isabel Trancoso:
A tail-sharing WFST composition algorithm for large vocabulary speech recognition. 356-359 - Fabio Brugnara:
Context-dependent search in a context-independent network. 360-363 - Adam Janin, Don Baron, Jane Edwards, Dan Ellis, David Gelbart, Nelson Morgan, Barbara Peskin, Thilo Pfau, Elizabeth Shriberg, Andreas Stolcke, Chuck Wooters:
The ICSI Meeting Corpus. 364-367 - Máté Szarvas, Sadaoki Furui:
Finite-state transducer based modeling of morphosyntax with applications to Hungarian LVCSR. 368-371 - Ahmad Emami, Peng Xu, Frederick Jelinek:
Using a connectionist model in a syntactical based language model. 372-375 - Shaojun Wang, Dale Schuurmans, Fuchun Peng, Yunxin Zhao:
Semantic n-gram language modeling with the latent maximum entropy principle. 376-379 - Hong-Kwang Jeff Kuo, Chin-Hui Lee, Imed Zitouni, Eric Fosler-Lussier:
Minimum verification error training for topic verification. 380-383 - Tomonori Kikuchi, Sadaoki Furui, Chiori Hori:
Automatic speech summarization based on sentence extraction and compaction. 384-387 - Bhiksha Raj, Edward W. D. Whittaker:
Lossless compression of language model structure and word identifiers. 388-391
Feature Processing for Robust ASR
- Shingo Kuroiwa, Satoru Tsuge:
Blind equalization techniques for ETSI standard DSR front-end. 392-395 - Rita Singh, Bhiksha Raj:
Tracking noise via dynamical systems with a continuum of states. 396-399 - Ni-Chun Wang, Jeih-Weih Hung, Lin-Shan Lee:
Data-driven temporal filters based on multi-eigenvectors for robust features in speech recognition. 400-403 - Kam-keung Chu, Shu-hung Leung, Chun-Shing Yip:
Perceptually non-uniform spectral compression for noisy speech recognition. 404-407 - Michael L. Seltzer, Richard M. Stern:
Subband parameter optimization of microphone arrays for speech recognition in reverberant environments. 408-411 - Chuan Jia, Peng Ding, Bo Xu:
Sequential MAP estimation based speech feature enhancement for noise robust speech recognition. 412-415 - Peter Jancovic, Münevver Köküer, Fionn Murtagh:
Reliability-based estimation of the number of noisy features: application to model-order selection in the union models. 416-419 - Ji Ming, Francis Jack Smith:
A posterior union model for improved robust speech recognition in nonstationary noise. 420-423 - Françoise Beaufays, Daniel Boies, Mitch Weintraub, Qifeng Zhu:
Using speech/non-speech detection to bias recognition search on noisy data. 424-427 - Lingyun Gu, Jianbo Gao, A. G. Harris:
Endpoint detection in noisy environment using a Poincare recurrence metric. 428-431 - Izhak Shafran, Richard Rose:
Robust speech detection and segmentation for real-time ASR applications. 432-435 - Oh-Wook Kwon, Te-Won Lee:
Optimizing speech/non-speech classifier design using AdaBoost. 436-439
Speech Analysis
- Etan Fisher, Joseph Tabrikian, Shlomo Dubnov:
Generalized likelihood ratio test for voiced/unvoiced decision using the harmonic plus noise model. 440-443 - Ye Tian, Ji Wu, Zuoying Wang, Dajin Lu:
Fuzzy clustering and Bayesian information criterion based threshold estimation for robust voice activity detection. 444-447 - Om Deshmukh, Carol Y. Espy-Wilson:
A measure of aperiodicity and periodicity in speech. 448-451 - Pusadee Seresangtakul, Tomio Takara:
A generative model of fundamental frequency contours for polysyllabic words of Thai tones. 452-455 - Ching X. Xu, Yi Xu:
F0 perturbations by consonants and their implications on tone recognition. 456-459 - Wai C. Chu:
Gradient-descent based window optimization for linear prediction analysis. 460-463 - Issam Bazzi, Alex Acero, Li Deng:
An expectation maximization approach for formant tracking using a parameter-free non-linear predictor. 464-467 - Dong Wang, Lie Lu, Hong-Jiang Zhang:
Speech segmentation without speech recognition. 468-471 - Akemi Hoshino, Akio Yasuda:
The evaluation of Chinese aspiration sounds uttered by Japanese students using VOT and power. 472-475 - Dorel Picovici, Abdulhussain E. Mahdi:
Output-based objective speech quality measure using self-organizing map. 476-479 - Serdar Yildirim, Shrikanth S. Narayanan:
An information-theoretic analysis of developmental changes in speech. 480-483 - Patrick J. Clemins, Michael T. Johnson:
Application of speech recognition to African elephant (Loxodonta africana) vocalizations. 484-487
Speech Synthesis: Prosody
- Shaw-Hwa Hwang, Cheng-Yu Yeh:
An efficient text analyzer with prosody generator-driven approach for Mandarin text-to-speech. 488-491 - Sheng Zhao, Jianhua Tao, DanLing Jiang:
Chinese prosodic phrasing with extended features. 492-495 - Neng-Huang Pan, Ming-Shing Yu, Ming-Jer Wu:
A Mandarin intonation prediction model that can output real pitch patterns. 496-499 - Jianhua Tao, Xing Ni:
Auditive learning based Chinese F0 prediction. 500-503 - Tu Trong Do, Tomio Takara:
Precise tone generation for Vietnamese text-to-speech system. 504-507 - Haiping Li, Fangxin Chen, Li Qin Shen, Xijun Ma:
Trainable Cantonese/English dual language speech synthesis system. 508-511 - Wei-Chih Kuo, Xiang-Rui Zhong, Yih-Ru Wang, Sin-Horng Chen:
A high-performance Min-Nan/Taiwanese TTS system. 512-515 - Xijun Ma, Wei Zhang, Qin Shi, Weibin Zhu, Liqin Shen:
Automatic prosody labeling using both text and acoustic information. 516-519 - Pierluigi Salvo Rossi, Francesco Palmieri, Francesco Cutugno:
Inversion of F0 model for natural-sounding speech synthesis. 520-523 - Hans Kruschke, Andreas Koch:
Parameter extraction of a quantitative intonation model with wavelet analysis and evolutionary optimization. 524-527 - K. Sreenivasa Rao, B. Yegnanarayana:
Prosodic manipulation using instants of significant excitation. 528-531 - Rüdiger Hoffmann, Oliver Jokisch, Diane Hirschfeld, Guntram Strecha, Hans Kruschke, Ulrich Kordon, Uwe Koloska:
A multilingual TTS system with less than 1 Mbyte footprint for embedded applications. 532-535
Acoustic Adaption Techniques
- Mark J. F. Gales, Yuan Dong, Daniel Povey, Philip C. Woodland:
Porting: SwitchBoard to the VoiceMail task. 536-539 - Zhirong Wang, Tanja Schultz, Alex Waibel:
Comparison of acoustic model adaptation techniques on non-native speech. 540-543 - Denis Jouvet, Katarina Bartkova, Lionel Delphin-Poulat, Alexandre Ferrieux, Xavier Lamming, Jean Monné, Christophe Raix:
About improving recognition of spontaneously uttered French city-names. 544-547 - Gyucheol Jang, Sooyoung Woo, Minho Jin, Chang D. Yoo:
Improvements in speaker adaptation using weighted training. 548-551 - Tor André Myrvoll, Frank K. Soong:
Optimal clustering of multivariate normal distributions using divergence and its application to HMM adaptation. 552-555 - Xiaodong He, Wu Chou:
Minimum classification error linear regression for acoustic model adaptation of continuous density HMMs. 556-559 - Rohit Sinha, Srinivasan Umesh:
A method for compensation of Jacobian in speaker normalization. 560-563 - Eric H. C. Choi, Trym Holter, Julien Epps, Arun Gopalakrishnan:
Temporal structure constrained transformation for speaker adaptation. 564-567 - Shinji Watanabe, Yasuhiro Minami, Atsushi Nakamura, Naonori Ueda:
Application of variational Bayesian estimation and clustering to acoustic model adaptation. 568-571 - Daben Liu, Francis Kubala:
Online speaker clustering. 572-575 - Yoshifumi Onishi, Ken-ichi Iso:
Speaker adaptation by hierarchical EigenVoice. 576-579 - Fabrice Lauri, Irina Illina, Dominique Fohr:
Combining EigenVoices and structural MLLR for speaker adaptation. 580-583
Spoken Language Systems and Confidence Measures
- Ananth Sankar, Su-Lin Wu:
Utterance verification based on statistics of phone-level confidence scores. 584-587 - Yassine Benayed, Dominique Fohr, Jean Paul Haton, Gérard Chollet:
Confidence measures for keyword spotting using support vector machines. 588-591 - Alberto Sanchís, Alfons Juan, Enrique Vidal:
Improving utterance verification using a smoothed naive Bayes model. 592-595 - Dilek Hakkani-Tür, Giuseppe Riccardi:
A general algorithm for word graph matrix decomposition. 596-599 - Ka-Yee Leung, Man-Hung Siu:
Phone level confidence measure using articulatory features. 600-603 - Ruhi Sarikaya, Yuqing Gao, Michael Picheny:
Word level confidence measurement using semantic features. 604-607 - Luciana Ferrer, Elizabeth Shriberg, Andreas Stolcke:
A prosody-based approach to end-of-utterance detection that does not require speech recognition. 608-611 - Mike Lincoln, Stephen Cox:
A comparison of language processing techniques for a constrained speech translation system. 612-615 - Ian R. Lane, Tatsuya Kawahara, Tomoko Matsui:
Language model switching based on topic detection for dialog speech recognition. 616-619 - Stephen J. Cox:
Discriminative techniques in call routing. 620-623 - Chiori Hori, Takaaki Hori, Hideki Isozaki, Eisaku Maeda, Shigeru Katagiri, Sadaoki Furui:
Deriving disambiguous queries in a spoken interactive ODQA system. 624-627 - Corinna Cortes, Patrick Haffner, Mehryar Mohri:
Lattice kernels for spoken-dialog classification. 628-631 - Patrick Haffner, Gökhan Tür, Jerry H. Wright:
Optimizing SVMs for complex call classification. 632-635 - Fu-Hua Liu, Liang Gu, Yuqing Gao, Michael Picheny:
Use of statistical N-gram models in natural language generation for machine translation. 636-639
Speech Enhancement Including Applications to Robust ASR
- Florian Hilger, Hermann Ney, Olivier Siohan, Frank K. Soong:
Combining neighboring filter channels to improve quantile based histogram equalization. 640-643 - Umit H. Yapanel, Satya Dharanipragada:
Perceptual MVDR-based cepstral coefficients (PMCCs) for robust speech recognition. 644-647 - Jounghoon Beh, Hanseok Ko:
A novel spectral subtraction scheme for robust speech recognition: spectral subtraction using spectral harmonics of speech. 648-651 - Vincent Barreaud, Irina Illina, Dominique Fohr:
On-line frame-synchronous compensation of non-stationary noise. 652-655 - Sirko Molau, Florian Hilger, Hermann Ney:
Feature space normalization in adverse acoustic conditions. 656-659 - Yifan Gong:
Model-space compensation of microphone and noise for speaker-independent speech recognition. 660-663 - Manuel J. Reyes Gomez, Bhiksha Raj, Dan Ellis:
Multi-channel source separation by factorial HMMs. 664-667 - Takanobu Nishiura, Masato Nakayama, Satoshi Nakamura:
An evaluation of adaptive beamformer based on average speech spectrum for noisy speech recognition. 668-671 - Li Deng, Jasha Droppo, Alex Acero:
Incremental Bayes learning with prior evolution for tracking nonstationary noise statistics from noisy speech data. 672-675 - Bradford W. Gillespie, Les E. Atlas:
Strategies for improving audible quality and speech recognition accuracy of reverberant speech. 676-679 - Peter Jax, Peter Vary:
Artificial bandwidth extension of speech signals using MMSE estimation based on a hidden Markov model. 680-683 - Guangji Shi, Parham Aarabi:
Robust digit recognition using phase-dependent time-frequency masking. 684-687
Speech Synthesis: Segmental Modelling and Processing
- Fangxin Chen:
Syllable clustering and spectral discontinuity in syllable-based TTS systems. 688-691 - Christophe Blouin, Paul C. Bagshaw, Olivier Rosec:
A method of unit preselection for speech synthesis based on acoustic clustering and decision trees. 692-695 - Tomoki Toda, Hisashi Kawai, Minoru Tsuzaki, Kiyohiro Shikano:
Segment selection considering local degradation of naturalness in concatenative speech synthesis. 696-699 - David Dorran, Robert Lawlor, Eugene Coyle:
High quality time-scale modification of speech using a peak alignment overlap-add algorithm (PAOLA). 700-703 - Xu Shao, Ben Milner:
Clean speech reconstruction from noisy mel-frequency cepstral coefficients using a sinusoidal model. 704-707 - Ellen Eide, Andrew Aaron, Raimo Bakis, Paul S. Cohen, Robert E. Donovan, Wael Hamza, T. Mathes, Michael Picheny, M. Polkosky, M. Smith, Mahesh Viswanathan:
Recent improvements to the IBM trainable speech synthesis system. 708-711 - Qin Yan, Saeed Vaseghi:
Analysis, modelling and synthesis of formants of British, American and Australian accents. 712-715 - Junichi Yamagishi, Takashi Masuko, Keiichi Tokuda, Takao Kobayashi:
A training method for average voice model based on shared decision tree context clustering and speaker adaptive training. 716-719 - Arun Kumar, Ashish Verma:
Using phone and diphone based acoustic models for voice conversion: a step towards creating voice fonts. 720-723 - Emir Turajlic, Dimitrios Rentzos, Saeed Vaseghi, Ching-Hsiang Ho:
Evaluation of methods for parameteric formant transformation in voice conversion. 724-727 - Yingying Xu, Hao Tang, Peiren Zhang:
An advanced text-to-speech server system based on SOAP protocol. 728-731 - Hao Tang, Bo Yin, Ren-Hua Wang:
Study on distributed speech synthesis system. 732-735
Acoustic Modeling of Coarticulation, Lexical and Task Information
- Georg Stemmer, Viktor Zeißler, Christian Hacker, Elmar Nöth, Heinrich Niemann:
A phone recognizer helps to recognize words better. 736-739 - Hiroyuki Suzuki, Heiga Zen, Yoshihiko Nankaku, Chiyomi Miyajima, Keiichi Tokuda, Tadashi Kitamura:
Speech recognition using voice-characteristic-dependent acoustic models. 740-743 - Jian-Lai Zhou, Frank Seide, Li Deng:
Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM - model and training. 744-747 - Frank Seide, Jian-Lai Zhou, Li Deng:
Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM - MAP decoding and evaluation. 748-751 - Yanli Zheng, Mark Hasegawa-Johnson:
Acoustic segmentation using switching state Kalman filter. 752-755 - Chak-Fai Li, Man-Hung Siu:
An efficient incremental likelihood evaluation for polynomial trajectory model using with application to model training and recognition. 756-759 - Pascale Fung, Yi Liu:
Triphone model reconstruction for Mandarin pronunciation variations. 760-763 - Supphanat Kanokphara, Virongrong Tesprasit, Rachod Thongprasirt:
Pronunciation variation speech recognition without dictionary modification on sparse database. 764-767 - Pieter Nel, Johan A. du Preez:
Automatic syllabification using hierarchical hidden Markov models. 768-771 - Abhinav Sethy, Shrikanth S. Narayanan:
Split-lexicon based hierarchical recognition of speech using syllable and word level acoustic units. 772-775 - Jinsong Zhang, Keikichi Hirose, Satoshi Nakamura:
A multilevel framework to model the inherently confounding nature of sentential F0 contours for recognizing Chinese lexical tones. 776-779 - Andrej Ljolje:
Multiple task-domain acoustic models. 780-783
Speech Coding and Speech Analysis
- Changchun Bao:
Harmonic excitation LPC (HE-LPC) speech coding at 2.3 kb/s. 784-787 - Mu-Liang Wang, Jar-Ferr Yang:
Complexity reduced shape VQ of spectral envelope with perception consideration. 788-791 - Geneviève Baudoin, Fadi El Chami:
Corpus based very low bit rate speech coding. 792-795 - Ranniery Maia, Ricardo J. da R. Cirigliano, Daniel Rojtenberg, Fernando Gil Vianna Resende Jr.:
Mixed-excited phonetic vocoding at 265 bps. 796-799 - Takahiro Hoshiya, Shinji Sako, Heiga Zen, Keiichi Tokuda, Takashi Masuko, Takao Kobayashi, Tadashi Kitamura:
Improving the performance of HMM-based very low bit rate speech coding. 800-803 - Christian H. Ritz, Ian S. Burnett, Jason Lukasiak:
Low bit rate wideband WI speech coding. 804-807 - Su Yang, Zongge Li, Yan-Qiu Chen:
A fractal based voice activity detector for Internet telephone. 808-811 - Dhany Arifianto, Takao Kobayashi:
IFAS-based voiced/unvoiced classification of speech signal. 812-815 - Sumit Basu:
A linked-HMM model for robust voicing and speech detection. 816-819 - Arthur P. Lobo, Philipos C. Loizou:
Voiced/unvoiced speech discrimination in noise using Gabor atomic decomposition. 820-823 - Peter Kabal:
Ill-conditioning and bandwidth expansion in linear prediction of speech. 824-827 - Davor Petrinovic:
Discrete weighted mean square all-pole modeling. 828-831
Feature-Oriented Acoustic Modeling
- Xiang Li, Richard M. Stern:
Training of stream weights for the decoding of speech using parallel feature streams. 832-835 - Yimin Zhang, Qian Diao, Shan Huang, Wei Hu, Chris D. Bartels, Jeff A. Bilmes:
DBN based multi-stream models for speech. 836-839 - Konstantin Markov, Satoshi Nakamura:
Hybrid HMM/BN LVCSR system integrating multiple acoustic features. 840-843 - S. S. Airey, Mark J. F. Gales:
Product of Gaussians and multiple stream systems. 844-847 - Karthik Visweswariah, Peder A. Olsen, Ramesh Gopinath, Scott Axelrod:
Maximum likelihood training of subspaces for inverse covariance modeling. 848-851 - Vincent Vanhoucke, Ananth Sankar:
Mixtures of inverse covariances. 852-855 - Satya Dharanipragada, Karthik Visweswariah:
Covariance and precision modeling in shared multiple subspaces. 856-859 - Peng Ding, Shuwu Zhang, Bo Xu:
Comparison and study of some variants of partially tied covariance modeling. 860-863 - Scott Axelrod, Ramesh Gopinath, Peder A. Olsen, Karthik Visweswariah:
Dimensional reduction, covariance modeling, and computational complexity in ASR systems. 864-867 - Alain Biem:
Optimizing features and models using the minimum classification error criterion. 868-871 - Leo J. Lee, Hagai Attias, Li Deng:
Variational inference and learning for segmental switching state space models of hidden speech dynamics. 872-875 - Rong Zhang, Alexander I. Rudnicky:
Improving the performance of an LVCSR system through ensembles of acoustic models. 876-879
Speech Enhancement II
- Thomas Lotter, Christian Benien, Peter Vary:
Multichannel speech enhancement using Bayesian spectral amplitude estimation. 880-883 - Erik M. Visser, Te-Won Lee:
Speech enhancement using blind source separation and two-channel energy based speaker detection. 884-887 - Masashi Unoki, Masashi Furukawa, Keigo Sakata, Masato Akagi:
A method based on the MTF concept for dereverberating the power envelope from the reverberant signal. 888-891 - Mingyang Wu, DeLiang Wang:
A one-microphone algorithm for reverberant speech enhancement. 892-895 - Colin Breithaupt, Rainer Martin:
MMSE estimation of magnitude-squared DFT coefficients with superGaussian priors. 896-899 - Chang Huai You, SooNgee Koh, Susanto Rahardja:
Adaptive β-order MMSE estimation for speech enhancement. 900-903 - Marcel Gabrea:
Double affine projection algorithm-based speech enhancement algorithm. 904-907 - Sharon Gannot, Israel Cohen:
Speech enhancement based on the general transfer function GSC and postfiltering. 908-911 - Hong Cai, Éric Grivel, Mohamed Najim:
A dual Kalman filter-based smoother for speech enhancement. 912-915 - Masanori Kato, Akihiko Sugiyama, Masahiro Serizawa:
A family of 3GPP-standard noise suppressors for the AMR codec and the evaluation results. 916-919 - Michael T. Johnson, Andrew C. Lindgren, Richard J. Povinelli, Xiaolong Yuan:
Performance of nonlinear speech enhancement using phase space reconstruction. 920-923 - John-Paul Hosom, Alexander Kain, Taniya Mishra, Jan P. H. van Santen, Melanie Fried-Oken, Janice Staehely:
Intelligibility of modifications to dysarthric speech. 924-928
Volume 2
Feature Extraction Techniques and Applications
- Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Hidden Markov model-based speech emotion recognition. 1-4 - Hugo Meinedo, João Paulo Neto:
Audio segmentation, classification and clustering in a broadcast news task. 5-8 - Tin Lay Nwe, Say Wei Foo, Liyanage C. De Silva:
Classification of stress in speech using linear and nonlinear features. 9-12 - Aldebaro Klautau:
Mining speech: automatic selection of heterogeneous features using boosting. 13-16 - Julien Pinquier, Jean-Luc Rouas, Régine André-Obrecht:
A fusion study in speech/music classification. 17-20 - Tarek Abu-Amer, Julie Carson-Berndsen:
Multi-linear HMM based system for articulatory feature extraction. 21-24 - Takashi Fukuda, Wataru Yamamoto, Tsuneo Nitta:
Distinctive phonetic feature extraction for robust speech recognition. 25-28 - Kim Foong Chow, Shiang Chen Liew, Kim-Teng Lua:
Thin client front-end processor for distributed speech recognition. 29-32 - Mohamed Chetouani, Bruno Gas, Jean-Luc Zarader:
Modular neural predictive coding for discriminative feature extraction. 33-36 - Changxue Ma:
Novel robust feature extraction based on spectrally masked channel energy ratio (SMaChER) for speech recognition. 37-40 - Qin Li, Les E. Atlas:
Time-variant least squares harmonic modeling. 41-44 - Brian Mak, Yik-Cheung Tam, Roger Hsiao:
Discriminative training of auditory filters of different shapes for robust speech recognition. 45-48
Speaker Verification and Identification Systems
- Claude Barras, Jean-Luc Gauvain:
Feature and score normalization for speaker verification of cellular data. 49-52 - Douglas A. Reynolds:
Channel robust speaker verification via feature mapping. 53-56 - Tsuneo Kato, Tohru Shimizu:
Improved speaker verification over the cellular phone network using phoneme-balanced and digit-sequence-preserving connected digit patterns. 57-60 - Ganesh N. Ramaswamy, Jirí Navrátil, Upendra V. Chaudhari, Ran D. Zilca:
The IBM system for the NIST-2002 cellular speaker verification evaluation. 61-64 - Gurmeet Singh, Ashish Panda, Saurav Bhattacharyya, Thambipillai Srikanthan:
Vector quantization techniques for GMM based speaker verification. 65-68 - Mathieu Ben, Frédéric Bimbot:
D-MAP: a distance-normalized MAP estimation of speaker models for automatic speaker verification. 69-72 - Jiuqing Deng, Qixiu Hu:
Open set text-independent speaker recognition based on set-score pattern classification. 73-76 - Jean-François Bonastre, Sylvain Meignier, Téva Merlin:
Speaker detection using multi-speaker audio files for both enrollment and test. 77-80 - Ran D. Zilca, Jirí Navrátil, Ganesh N. Ramaswamy:
Depitch and the role of fundamental frequency in speaker recognition. 81-84 - Yvonne Moh, Patrick Nguyen, Jean-Claude Junqua:
Towards domain independent speaker clustering. 85-88 - Daniel Moraru, Sylvain Meignier, Laurent Besacier, Jean-François Bonastre, Ivan Magrin-Chagnolleau:
The ELISA consortium approaches in speaker segmentation during the NIST 2002 speaker recognition evaluation. 89-92 - Joaquín González-Rodríguez, Julian Fiérrez-Aguilar, Javier Ortega-Garcia:
Forensic identification reporting using automatic speaker recognition systems. 93-96
General Topics in Robust ASR
- Jian Wu, Qiang Huo:
Modelling uncertainty in stochastic vector mapping with minimum classification error training for robust speech recognition. 97-100 - Yuan-Fu Liao, Jeng-Shien Lin, Sin-Horng Chen:
A mismatch-aware stochastic matching algorithm for robust speech recognition. 101-104 - Febe de Wet, Johan de Veth, Bert Cranen, Louis Boves:
The impact of spectral and energy mismatch on the Aurora2 digit recognition task. 105-108 - Dusan Macho, Yan Ming Cheng:
On the use of wideband signal for noise robust ASR. 109-112 - Murat Akbacak, John H. L. Hansen:
Environmental sniffing: noise knowledge estimation for robust speech systems. 113-116 - Zhaobing Han, Shuwu Zhang, Huayun Zhang, Bo Xu:
A vector statistical piecewise polynomial approximation algorithm for environment compensation in telephone LVCSR. 117-120 - Olivier Bellot, Driss Matrouf, Pascal Nocera, Georges Linarès, Jean-François Bonastre:
Structural speaker adaptation using maximum a posteriori approach and a Gaussian distributions merging technique. 121-124 - Xianxian Zhang, John H. L. Hansen:
CSA-BF: novel constrained switched adaptive beamforming for speech enhancement & recognition in real car environments. 125-128 - Ben Milner, Xu Shao:
Low bit-rate feature vector compression using transform coding and non-uniform bit allocation. 129-132 - Shajith Ikbal, Hemant Misra, Hervé Bourlard:
Phase autocorrelation (PAC) derived robust speech features. 133-136 - Diego Giuliani, Matteo Gerosa:
Investigating recognition of children's speech. 137-140 - Fernando Díaz-de-María, Jesús Vicente-Peña, Ascensión Gallardo-Antolín, Carmen Peláez-Moreno:
Linear equalization of the modulation spectra: a novel approach for noisy speech recognition. 141-144
Speech Coding
- Fang-Chu Chen, I-Hsien Lee:
CELP based speech coding with fine granularity scalability. 145-148 - Gang Zhang, Keming Xie, Xueying Zhang, Liying Huangfu:
Optimizing gain codebook of LD-CELP. 149-152 - Jacek Stachurski, Alan McCree, Vishu Viswanathan, Ari Heikkinen, Anssi Rämö, Sakari Himanen, Peter Blöcher:
Hybrid MELP/CELP coding at bit rates from 6.4 to 2.4 kb/s. 153-156 - Houman Zarrinkoub, Paul Mermelstein:
Joint optimization of short-term and long-term predictors in CELP speech coders. 157-160 - Fredrik Nordén, Turaj Zakizadeh Shabestary, Per Hedelin:
Rate adjustable speech coding by lattice quantization. 161-164 - Christian Sturt, Stephane Villette, Ahmet M. Kondoz:
LSF quantisation for pitch synchronous speech coders. 165-168 - Miguel Arjona Ramírez:
A waveform extractor for scalable speech coding. 169-172 - Sung-Kyo Jung, Kyoung-Tae Kim, Hong-Goo Kang, Dae Hee Youn:
A cascaded algebraic codebook structure to improve the performance of speech coder. 173-176 - Sunil Lee, Seongho Seo, Dalwon Jang, Chang D. Yoo:
A novel transcoding algorithm for AMR and EVRC speech codecs via direct parameter transformation. 177-180 - Marcos Faúndez-Zanuy:
Wide band sub-band speech coding using nonlinear prediction. 181-184 - Kei Kikuiri, Nobuhiko Naka, Tomoyuki Ohya:
Super-frame based source controlled variable rate coding using approximated trellis diagram. 185-188 - José L. Pérez-Córdoba, Antonio M. Peinado, Victoria E. Sánchez, Antonio J. Rubio:
A study of joint source-channel coding of LSP parameters for wideband speech coding. 189-192
Speaker ID/Verification: Discriminative Methods and Multiple Speakers
- Ting-Yao Wu, Lie Lu, Ke Chen, Hong-Jiang Zhang:
UBM-based real-time speaker segmentation for broadcasting news. 193-196 - Takayuki Arai:
Estimating number of speakers by the modulation characteristics of speech. 197-200 - S. Krishnakumar, K. R. Prasanna Kumar, N. Balakrishnan:
Pitch maxima for robust speaker recognition. 201-204 - Yang Shao, DeLiang Wang:
Co-channel speaker identification using usable speech extraction based on multi-pitch tracking. 205-208 - William M. Campbell:
A SVM/HMM system for speaker recognition. 209-212 - Fabio Valente, Christian Wellekens:
Minimum classification error/eigenvoices training for speaker identification. 213-216 - Qi Li, Biing-Hwang Juang:
Fast discriminative training for sequential observations with application to speaker identification. 217-220 - Vincent Wan, Steve Renals:
SVMSVM: support vector machine speaker verification methodology. 221-224 - Mohamed Faouzi BenZeghiba, Hervé Bourlard:
Hybrid HMM/ANN and GMM combination for user-customized password speaker verification. 225-228 - Daniel Garcia-Romero, Julian Fiérrez-Aguilar, Joaquín González-Rodríguez, Javier Ortega-Garcia:
Support vector machine fusion of idiolectal and acoustic speaker information in Spanish conversational speech. 229-232 - Chun-Nan Hsu, Hau-Chung Yu, Bo-Hou Yang:
Speaker verification without background speaker models. 233-236
Emerging Industrial Applications
- Bogong Su, Jian Wang, Erh-Wen Hu, Joseph B. Manzano:
De-pipeline a software-pipelined loop. 237-240 - Ajay Kumar:
Inspection of surface defects using optimal FIR filters. 241-244 - Xinhao Tian, Jing Lin, Ken R. Fyfe, Ming Jian Zuo:
Gearbox fault diagnosis using independent component analysis in the frequency domain and wavelet filtering. 245-248 - Jonathon C. Ralston, David W. Hainsworth, Ronald J. McPhee, David C. Reid, Chad O. Hargrave:
Application of signal processing technology for automatic underground coal mining machinery. 249-252
Biomedical and Biometric Technology I
- Aziz Umit Batur, Bruce E. Flinchbaugh, Monson H. Hayes III:
A DSP-based approach for the implementation of face recognition algorithms. 253-256 - Kumari L. Fernando, V. John Mathews, Michael W. Varner, Edward B. Clark:
Robust estimation of fetal heart rate variability using Doppler ultrasound. 257-260 - Ayman El-Baz, Aly A. Farag, Robert Falk, Renato La Rocca:
Automatic identification of lung abnormalities in chest spiral CT scans. 261-264 - Pega Zarjam, Mostefa Mesbah, Boualem Boashash:
Detection of newborn EEG seizure using optimal features based on discrete wavelet transform. 265-268 - Philip de Chazal, Richard B. Reilly:
Automatic classification of ECG beats using waveform shape and heart beat interval features. 269-272
Speech Recognition
- Yong-Beom Lee, John R. Deller Jr.:
Heuristic structural modifications to the HMM for efficient resource utilization. 273-276 - Astrid Hagen, João Paulo Neto:
Multi-stream processing using context-independent and context-dependent hybrid systems. 277-280 - Sergey Astrov, Josef G. Bauer, Sorel Stan:
High performance speaker and vocabulary independent ASR technology for mobile phones. 281-284 - Say Wei Foo, Liang Dong:
A boosted multi-HMM classifier for recognition of visual speech elements. 285-288 - Claudio Eccher, Lorenzo Eccher, Daniele Falavigna, Luca Nardelli, Marco Orlandi, Andrea Sboner:
On the usage of automatic voice recognition in an interactive Web based medical application. 289-292 - Xuan Zhu, Yining Chen, Jia Liu, Runsheng Liu:
A novel efficient decoding algorithm for CDHMM-based speech recognizer on chip. 293-296
DSP Architectures
- Toshiyuki Yamane, Yasunao Katayama:
An ultra-fast Reed-Solomon decoder soft-IP with 8-error correcting capability. 297-300 - Jeff H. Derby, Jaime H. Moreno:
A high-performance embedded DSP core with novel SIMD features. 301-304 - Nigel C. Paver, Bradley C. Aldrich, Moinul H. Khan:
Intel® wireless MMX™ technology: a 64-bit SIMD architecture for mobile multimedia. 305-308 - Bipul Das, Swapna Banerjee:
A low complexity architecture for complex discrete wavelet transform. 309-312 - Kar-Lik Wong, Nigel P. Topham:
High performance IDCT realization using complex arithmetic. 313-316 - Mark Rygh, Jeff Fratus, Kevin Lee, Syed Husaini, Vidya Premkumar, Konstantinos Konstantinides:
A DVD processor with dual CPUs and integrated digital front-end for advanced DVD-based consumer appliances. 317-320
Communication Technologies
- Koushik Maharatna, Eckhard Grass, Ulrich Jagdhold:
A novel 64-point FFT/IFFT processor for IEEE 802.11(a) standard. 321-324 - Heping Ding:
Sub-channel below the perceptual threshold in audio. 325-328 - Alexander R. Wright, Patrick A. Naylor:
I/Q mismatch compensation in zero-IF OFDM receivers with application to DAB. 329-332 - Milos Krstic, Alfonso Troya, Koushik Maharatna, Eckhard Grass:
Optimized low-power synchronizer design for the IEEE 802.11a standard. 333-336 - Jim Chou, Kannan Ramchandran, Daniel Grobe Sachs, Douglas L. Jones:
Audio data hiding with application to surround sound. 337-340 - Andrew Fort, Jan-Willem Weijers, Veerle Derudder, Wolfgang Eberle, André Bourdoux:
A performance and complexity comparison of auto-correlation and cross-correlation for OFDM burst synchronization. 341-344
Biomedical and Biometric Technology II
- Heng-Da Cheng, Jingli Wang:
Fuzzy logic and scale space approach to microcalcification detection. 345-348 - Mitsuru Kondo, Daigo Muramatsu, Masahiro Sasaki, Takashi Matsumoto:
Nonlinear separation of signature trajectories for on-line personal authentication. 349-352 - Do-Hyung Kim, Jaeyeon Lee, Jung Soh, YunKoo Chung:
Real-time face verification using multiple feature combination and a support vector machine supervisor. 353-356 - John K. Mell, Donald A. Jordan, Yuping Xiao, Yibin Zheng, Joseph G. Akar, David E. Haines:
Wavelet analysis of atrial fibrillation electrograms. 357-360 - Gail L. Rosen, Jeffrey D. Moore:
Investigation of coding structure in DNA. 361-364 - Tanveer Fathima Syeda-Mahmood:
Detecting salient changes in genomic signals. 365-368 - Mukund Devarajan, Fansheng Meng, Penny Hix, Stephen A. Zahorian:
HMM-neural network monophone models for computer-based articulation training for the hearing impaired. 369-372 - Abed Elhamid Lawabni, Ahmed H. Tewfik:
Detection and screening of sleep apnea using spectral and time domain analysis of heart rate variability. 373-376 - Alper Kanak, Engin Erzin, Yücel Yemez, A. Murat Tekalp:
Joint audio-video processing for biometric speaker identification. 377-380 - Guoqin Cui, Wen Gao:
SVMs for few examples-based face recognition. 381-384 - Wan Mimi Diyana, Julie Larcher, Rosli Besar:
A comparison of clustered microcalcifications automated detection methods in digital mammogram. 385-388 - Hamid Hassanpour, Mostefa Mesbah, Boualem Boashash:
Comparative performance of time-frequency based newborn EEG seizure detection using spike signatures. 389-392
Defense, Tracking and Security Applications
- Jung-Chieh Chen, Ching-Shyang Maa, Jiunn-Tsair Chen:
Factor graphs for mobile position location. 393-396 - Anindya Sao Paul, Arnab K. Shaw, Koel Das, Atindra Mitra:
Improved HRR-ATR using hybridization of HMM and eigen-template-matched filtering. 397-400 - LipChen Alex Chan, Sandor Z. Der, Nasser M. Nasrabadi:
Improved target detector for FLIR imagery. 401-404 - Kun Lu, Jiong Wang, Xingzhao Liu:
A piecewise parametric method based on polynomial phase model to compensate ionospheric phase contamination. 405-408 - Mukesh A. Zaveri, Uday B. Desai, S. N. Merchant:
Tracking multiple maneuvering point targets using multiple filter bank in infrared image sequence. 409-412 - Anand Krishnamurthy, Yiyan Tang, Cathy Xu, Yuke Wang:
An efficient implementation of multi-prime RSA on DSP processor. 413-416 - Heather Yu:
Scalable encryption for multimedia content access control. 417-420 - Kaliappan Gopalan:
Audio steganography using bit modification. 421-424 - Pei Jung Chung, William J. J. Roberts:
Recursive estimation of K-distribution parameters. 425-428 - Ping Han, Renbiao Wu, Yunhong Wang, Zhaohua Wang:
An efficient SAR ATR approach. 429-432
Radio, Telephony and Television
- Nicolas Ventroux, Jean-François Nezan, Mickaël Raulet, Olivier Déforges:
Rapid prototyping for an optimized MPEG-4 decoder implementation over a parallel heterogenous architecture. 433-436 - Azzédine Touzni, Haosong Fu, Mark Fimoff, Wayne Bretl:
Enhanced 8-VSB transmission for North-American HDTV terrestrial broadcast. 437-440 - Ligang Lu, Vadim Sheinin:
Real-time MPEG video coding with information look-ahead. 441-444 - Jon Arnold, Adrian Caldow, Kevin Harman:
A reconfigurable 100 Mchip/s spread spectrum receiver. 445-448 - Wen Xu, Matthias Marke:
On determining soft output of the cellular text telephone modem (CTM) demodulator. 449-452 - Van-Tam Nguyen, Patrick Loumeau, Jean-François Naviner:
Temporal and spectral analysis of time interleaved high pass sigma delta converter. 453-456 - Vasyl Semenov, Alexander Kalyuzhny, Alexander Kovtonyuk:
Efficient calculation of line spectral frequencies based on new method for solution of transcendental equations. 457-460 - Yeqing Qian, Qi Li, Tianren Yao:
Analysis of different predistortion structures and efficient least-square adaptive algorithms. 461-464 - Jung-Min Choi, Jung Su Kim, Jae Hong Park, Jong-Wha Chong:
Fast Kalman/LMS algorithms on the strong multipath channel. 465-468 - Jianfeng Chen, Louis Shue, Hanwu Sun:
A pseudo adaptive microphone array. 469-472 - Bharath Siravara, Mohamed M. Mansour, Randy Cole, Neeraj Magotra:
Comparative study of wideband single reference active noise cancellation algorithms on a fixed-point DSP. 473-476
High Performance Video and Image Processing Architectures
- Magnus Nilsson, Chaminda Weerasinghe, Serge Lichman, Yu Shi, Igor Kharitonenko:
Design and implementation of a CMOS sensor based video camera incorporating a combined AWB/AEC module. 477-480 - Yijun Li, Ramy E. Aly, Magdy A. Bayoumi, Samia A. Mashali:
Parallel high-speed architecture for EBCOT in JPEG2000. 481-484 - Shinsuke Kobayashi, Kentaro Mita, Yoshinori Takeuchi, Masaharu Imai:
Rapid prototyping of JPEG encoder using the ASIP development system: PEAS-III. 485-488 - Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Liang-Gee Chen:
Hardware oriented rate control algorithm and implementation for realtime video coding. 489-492 - Tu-Chih Wang, Yu-Wen Huang, Hung-Chi Fang, Liang-Gee Chen:
Performance analysis of hardware oriented algorithm modifications in H.264. 493-496 - Joseph R. Cavallaro, Mani Vaya:
Viturbo: a reconfigurable architecture for Viterbi and turbo decoding. 497-500
High Performance DSP Computational Kernels
- Markus Püschel:
Cooley-Tukey FFT like algorithms for the DCT. 501-504 - Tapio Saramäki, Mrinmoy Bhattacharya:
Multiplierless realization of recursive digital filters using allpass structures. 505-508 - Ji-Suk Park, Byeong-Kuk Kim, Jin-Gyun Chung, Keshab K. Parhi:
An asynchronous sample-rate converter from CD to DAT. 509-512 - Jinxin Hao, Gang Li:
An improved stability measure for digital filter implementation. 513-516 - Mrinmoy Bhattacharya, Tapio Saramäki:
Some observations leading to multiplierless implementation of linear phase FIR filters. 517-520 - Seonil Choi, Gokul Govindu, Ju-wook Jang, Viktor K. Prasanna:
Energy-efficient and parameterized designs for fast Fourier transform on FPGAs. 521-524
Design Methods for Optimized DSP Architectures
- Adel Baganne, Imed Bennour, Mehrez Elmarzougui, Eric Martin:
A simulation based approach for incorporating virtual components IP cores into multimedia systems design. 525-528 - Changchun Shi, Robert W. Brodersen:
An automated floating-point to fixed-point conversion methodology. 529-532 - Atsushi Hatabu, Takashi Miyazaki, Ichiro Kuroda:
Optimization of decision-timing for early termination of SSDA-based block matching. 533-536 - Franz Franchetti, Markus Püschel:
Short vector code generation and adaptation for DSP algorithms. 537-540 - Aca Gacic, Markus Püschel, José M. F. Moura:
Fast automatic software implementations of FIR filters. 541-544 - Joseph Yeh, John Wawrzynek:
Quality based compute-resource allocation in real-time signal processing. 545-548
Performance Evaluation and Design Methods for DSP Systems
- Jia Wang, Jun Sun, Songyu Yu:
1-D and 2-D transforms from integers to integers. 549-552 - Finbarr O'Regan, Conor Heneghan:
Algorithmic analysis and implementation of a novel natural gradient adaptive filter for sparse systems. 553-556 - Zixue Zhao, Gang Li:
Comparative study of the generalized DFIIt structure and its equivalent state-space realization. 557-560 - Claire Fang Fang, Tsuhan Chen, Rob A. Rutenbar:
Floating-point error analysis based on affine arithmetic. 561-564 - Sang Yoon Park, Nam Ik Cho:
Fixed point error analysis of CORDIC processor based on the variance propagation. 565-568 - Xiaojuan Hu, Linda DeBrunner, Victor E. DeBrunner:
Design of space-efficient, wide- and narrow transition-band, FIR filters. 569-572 - Duy Cuong Nguyen, Parham Aarabi, Ali Sheikholeslami:
Real-time sound localization using field-programmable gate arrays. 573-576 - Victor E. DeBrunner, Ewa Matusiak:
An algorithm to reduce the complexity required to convolve finite length sequences using the Hirschman optimal transform (HOT). 577-580 - Abdsamad Benkrid, Khaled Benkrid, Danny Crookes:
A novel approach for diminishing and predicting the error dynamic range in finite wordlength FIR based architectures. 581-584 - Etienne Cornu, Hamid Sheikhzadeh, Robert L. Brennan, Hamid Reza Abutalebi, Edmund C. Y. Tam, Peter Iles, Kar Wai Wong:
ETSI AMR-2 VAD: evaluation and ultra low-resource implementation. 585-588 - Miodrag Bolic, Petar M. Djuric, Sangjin Hong:
New resampling algorithms for particle filters. 589-592 - Justin J. Song, Jian Li, Yen-Kuang Chen:
Quality-delay-and-computation trade-off analysis of acoustic echo cancellation on general-purpose CPU. 593-596
Innovative DSP Systems and Applications
- Joe C. Chen, Len Yip, Hanbiao Wang, Daniela Maniezzo, Ralph E. Hudson, Jeremy Elson, Kung Yao, Deborah Estrin:
DSP implementation of a distributed acoustical beamformer on a wireless sensor platform. 597-600 - Scott Morrison, Jeremy S. Parks, Karl S. Gugel:
A high-performance multi-purpose DSP architecture for signal processing research. 601-604 - Margarita Cabrera, Xavier Castell, Rafael Montoliu:
Crack detection system based on spectral analysis of a ultrasonic resonance signals. 605-608 - Zhaohui Liu, John V. McCanny:
Implementation of adaptive beamforming based on QR decomposition for CDMA. 609-612 - Michael J. Thul, Frank Gilbert, Norbert Wehn:
Concurrent interleaving architectures for high-throughput channel coding. 613-616 - Amine Bermak, Dominique Martinez:
A very high density VLSI implementation of threshold network ensembles (TNE). 617-620 - Taeksang Hwang, Wonyong Sung:
Implementation of a digital copier using TMS320C6414 VLIW DSP processor. 621-624 - Adnan Abdul-Aziz Gutub, Mohammad K. Ibrahim:
High radix parallel architecture for GF(p) elliptic curve processor. 625-628 - Zhongfeng Wang, Keshab K. Parhi:
Efficient interleaver memory architectures for serial turbo decoding. 629-632 - Frank Kienle, Gerd Kreiselmaier, Norbert Wehn:
VLSI-implementation issues of turbo trellis-coded modulation. 633-636 - Marco Liem, Otto Manck:
Architecture of a single chip acoustic echo and noise canceller using cross spectral estimation. 637-640 - Chiman Kwan, Zhubing Ren, Roger Xu, Leonard Haynes, Vernon Lenz:
High performance VOX prototype development and experimental results. 641-644
IPS and Architectures for DSP Applications
- Oscal T.-C. Chen, Nan-Ying Shen, Chih-Chien Shen:
A low-power multiplication accumulation calculation unit for multimedia applications. 645-648 - Jeremy Johnson, Xu Xu:
A recursive implementation of the dimensionless FFT. 649-652 - Zhi-Xiu Lin, An-Yeu Wu:
Mixed-scaling-rotation CORDIC (MSR-CORDIC) algorithm and architecture for scaling-free high-performance rotational operations. 653-656 - Eric Tell, Mikael Olausson, Dake Liu:
A general DSP processor at the cost of 23K gates and 1/2 a man-year design time. 657-660 - Jiangmin Gu, Chip-Hong Chang:
Low voltage, low power (5: 2) compressor cell for fast arithmetic circuits. 661-664 - Daisuke Takahashi:
A radix-16 FFT algorithm suitable for multiply-add instruction based on Goedecker method. 665-668 - Bogdan J. Falkowski, Cheng Fu:
Fastest linearly independent arithmetic transforms over GF(3). 669-672 - An-Yeu Wu, I-Hsien Lee, Cheng-Shing Wu:
Angle quantization approach for lattice IIR filter implementation and its trellis de-allocation algorithm. 673-676 - Hyugjin Kwon, Jihong Kim:
A low-power image convolution algorithm for variable voltage processors. 677-680 - John Dunlop, Albert Simpson, Shahid Masud, Moira Wylie, Jonathan Cochrane, Roger Kinkead:
Semiconductor IP core for ultra low power MPEG-4 video decode in system-on-silicon. 681-684 - Sung-Won Lee, In-Cheol Park:
Low-power hybrid structure of digital matched filters for direct sequence spread spectrum systems. 685-688 - Donglai Xu, Rui Gao, Hadj Batatia:
An improved parallel architecture for MPEG-4 motion estimation in 3G mobile applications. 689-692
Neural Models and Systems
- Gen Hori:
A general framework for SVD flows and joint SVD flows. 693-696 - Deniz Erdogmus, Yadunandana N. Rao, M. Can Ozturk, Luis Vielva, José C. Príncipe:
On the convergence of SIPEX: a simultaneous principal components extraction algorithm. 697-700 - Joaquin Quiñonero Candela, Agathe Girard, Jan Larsen, Carl Edward Rasmussen:
Propagation of uncertainty in Bayesian kernel models - application to multiple-step ahead forecasting. 701-704 - Andrew I. Hanna, Ian Yates, Danilo P. Mandic:
Analysis of the class of complex-valued error adaptive normalised nonlinear gradient descent algorithms. 705-708 - Arthur Gretton, Frédéric Desobry:
On-line one-class support vector machines. An application to signal segmentation. 709-712 - Erik McDermott, Shigeru Katagiri:
A new formalization of minimum classification error using a Parzen estimate of classification chance. 713-716
Blind Source Separation and Independent Component Analysis
- Vince D. Calhoun, Tülay Adali:
Complex ICA for fMRI analysis: performance of several approaches. 717-720 - Seungjin Choi:
Differential learning and random walk model. 721-724 - Mirko Knaak, Shoko Araki, Shoji Makino:
Geometrically constraint ICA for convolutive mixtures of sound. 725-728 - Scott C. Douglas, Sun-Yuan Kung:
A nonlinear recursive least-squares algorithm for the blind separation of finite-alphabet sources. 729-732 - Konstantinos I. Diamantaras, Theophilos Papadimitriou:
Blind signal separation using oriented PCA neural models. 733-736 - Ignacio Santamaría, Jesús Ibáñez, Luis Vielva, Carlos Pantaleón:
Blind equalization of constant modulus signals via support vector regression. 737-740
Neural Networks for Speech Processing
- Hemant Misra, Hervé Bourlard, Vivek Tyagi:
New entropy based combination rules in HMM/ANN multi-stream ASR. 741-744 - Man-Wai Mak, Ming-Cheung Cheung, Sun-Yuan Kung:
Robust speaker verification from GSM-transcoded speech based on decision fusion and feature transformation. 745-748 - Guoning Hu, DeLiang Wang:
Separation of stop consonants. 749-752 - Suryakanth V. Gangashetty, C. Chandra Sekhar, B. Yegnanarayana:
Constraint satisfaction model for enhancement of evidence in recognition of consonant-vowel utterances. 753-756 - Francis F. Li, Trevor J. Cox:
A neural network for blind identification of speech transmission index. 757-760 - Eros Pasero, Alfonso Montuori:
Neural network based arithmetic coding for real-time audio transmission on the TMS320C6000 DSP platform. 761-764
Architectures and Applications of Neural Networks
- Kai-Pui Lam, Sui-Tung Mak:
An FPGA-based eigenfilter using fast Hebbian learning. 765-768 - Stefan Winter, Hiroshi Sawada, Shoji Makino:
Geometrical understanding of the PCA subspace method for overdetermined blind source separation. 769-772 - Rajai El Dajani, Maryvonne Miquel, Pierre Maison-Blanche, Paul Rubel:
Time series prediction using parametric models and multilayer perceptrons: case study on heart signals. 773-776 - Takaya Soma, Kuniaki Yosui, Takashi Matsumoto:
Reconstructions and predictions of nonlinear dynamical systems by Rao-Blackwellised sequential Monte Carlo. 777-780 - Jerónimo Arenas-García, Fernando Pérez-Cruz:
Multi-class support vector machines: a new approach. 781-784 - Anne-Sophie Capelle, Christine Fernandez-Maloigne, Olivier Colot:
Introduction of spatial information within the context of evidence theory. 785-788 - Stefano Squartini, Amir Hussain, Francesco Piazza:
A recurrent multiscale architecture for long-term memory prediction task. 789-792 - Ali A. Hasan, Mohammed A. Hasan:
Constrained gradient descent and line search for solving optimization problem with elliptic constraints. 793-796 - De-Shuang Huang, Horace H. S. Ip:
Finding the maximum modulus roots of polynomials based on constrained neural networks. 797-800 - Richard Kuehnel, Yuke Wang:
A method of generating uniformly distributed sequences over [0, K], where K+1 is not a power of two. 801-804 - Artur Wróblewski, Thomas Erl, Josef A. Nossek:
Bireciprocal lattice wave digital filters with almost linear phase response. 805-808
Neural Networks for Pattern Recognition and Image Processing
- David J. Miller, John Browning:
A mixture model and EM algorithm for robust classification, outlier rejection, and class discovery. 809-812 - Sung-Jung Cho, Michael Perrone, Eugene H. Ratzlaff:
EM mixture model probability table compression. 813-816 - Roongroj Nopsuwanchai, Alain Biem:
Discriminative training of tied mixture density HMMs for online handwritten digit recognition. 817-820 - Songfeng Zheng, Xiaofeng Lu, Nanning Zheng, Weipu Xu:
Unsupervised clustering based reduced support vector machines. 821-824 - Shantanu Chakrabartty, Masakazu Yagi, Tadashi Shibata, Gert Cauwenberghs:
Robust cephalometric landmark identification using support vector machines. 825-828 - Xiaofeng Lu, Songfeng Zheng, Nanning Zheng, Weixiang Liu:
Learning features from examples for face detection. 829-832 - S. Palanivel, B. S. Venkatesh, B. Yegnanarayana:
Real time face recognition system using autoassociative neural network models. 833-836 - Ho-Man Tang, Michael R. Lyu, Irwin King:
Face recognition committee machine. 837-840 - Chunrong Yuan, Heinrich Niemann:
Appearance-based neural image processing for 3-D object recognition and localization. 841-844 - Tat-Seng Chua, HuaMin Feng, A. Chandrashekhara:
An unified framework for shot boundary detection via active learning. 845-848 - Heng-Da Cheng, Muyi Cui:
Mass lesion detection with a fuzzy neural network. 849-852
Volume 3
Image and Video Indexing and Retrieval
- Paisarn Muneesawang, Ling Guan:
Automatic relevance feedback for video retrieval. 1-4 - Wing Ho Leung, Tsuhan Chen:
Retrieval of hand-drawn sketches with partial matching. 5-8 - Wai-Pak Choi, Kin-Man Lam, Wan-Chi Siu:
Maximal disk based histogram for shape retrieval. 9-12 - Bin Luo, Richard C. Wilson, Edwin R. Hancock:
Spectral method for learning structural variations in graphs. 13-16 - Fariborz Mahmoudi, Jamshid Shanbehzadeh, Amir-Masoud Eftekhari-Moghadam, Hamid Soltanian-Zadeh:
A new non-segmentation shape-based image indexing method. 17-20 - Rong Yan, Yan Liu, Rong Jin, Alexander G. Hauptmann:
On predicting rare classes with SVM ensembles in scene classification. 21-24
Human Movement Analysis and Tracking
- Richard D. Green, Ling Guan:
Tracking human movement patterns using particle filtering. 25-28 - Jose Juarez Gonzalez, Ik Soo Lim, Pascal Fua, Daniel Thalmann:
Robust tracking and segmentation of human motion in an image sequence. 29-32 - Naresh P. Cuntoor, Amit A. Kale, Rama Chellappa:
Combining multiple evidences for gait recognition. 33-36 - Yang Ran, Qinfen Zheng:
Multi moving people detection from binocular sequences. 37-40 - R. Venkatesh Babu, K. R. Ramakrishnan:
Compressed domain human motion recognition using motion history information. 41-44 - Henry C. Tan, Ruwan Janapriya, Liyanage C. De Silva:
An automatic system for multiple human tracking and actions recognition in office environment. 45-48
Watermarking I
- Qiang Cheng, Yingge Wang, Thomas S. Huang:
How to design efficient watermarks? 49-52 - Alexia Briassouli, Pierre Moulin:
Detection-theoretic analysis of warping attacks in spread-spectrum watermarking. 53-56 - Micheal Mullarkey, Neil J. Hurley, Guenole C. M. Silvestre, Teddy Furon:
Application of side-informed embedding and polynomial detection to audio watermarking. 57-60 - Jin S. Seo, Jaap Haitsma, Ton Kalker, Chang D. Yoo:
Affine transform resilient image fingerprinting. 61-64 - Tie Liu, Pierre Moulin:
Error exponents for one-bit watermarking. 65-68 - John Barr, Brett Bradley, Brett T. Hannigan:
Using digital watermarks with image signatures to mitigate the threat of the copy attack. 69-72
Video Coding I
- Claudia Mayer:
Motion compensated in-band prediction for wavelet-based spatially scalable video coding. 73-76 - Randa Atta, Mohammed Ghanbari:
A layered video coding scheme with its optimum bit allocation. 77-80 - Mihaela van der Schaar, Deepak S. Turaga:
Unconstrained motion compensated temporal filtering (UMCTF) framework for wavelet video coding. 81-84 - Zhenzhong Chen, King Ngi Ngan:
Improved single video object rate control for MPEG-4. 85-88 - Lifeng Zhao, C.-C. Jay Kuo:
Buffer-constrained R-D optimized rate control for video coding. 89-92 - Zhen Li, Feng Wu, Shipeng Li, Edward J. Delp:
Wavelet video coding via a spatially adaptive lifting structure. 93-96
Image and Video Interpolation
- Xiqun Lu, Paul S. Hong, Mark J. T. Smith:
An efficient directional image interpolation method. 97-100 - Hussein A. Aly, Eric Dubois:
Crafting the observation model for regularized image up-sampling. 101-104 - Takuma Ishida, Shogo Muramatsu, Hisakazu Kikuchi, Tetsuro Kuge:
Invertible deinterlacing with variable coefficients and its lifting implementation. 105-108 - Hasan F. Ates, Michael T. Orchard:
Image interpolation using wavelet-based contour estimation. 109-112 - Tien-Ying Kuo, Lin-Ying Chuang:
Fast global motion-compensated frame interpolator for very low-bit-rate video quality enhancement. 113-116 - Hezerul Abdul Karim, Michel Bister, Mohammad Umar Siddiqi:
Low rate video frame interpolation - challenges and solution. 117-120
Face Recognition
- Jian Li, Shaohua Kevin Zhou, Chandra Shekhar:
A comparison of subspace analysis for face recognition. 121-124 - Juwei Lu, Konstantinos N. Plataniotis, Anastasios N. Venetsanopoulos:
Regularized D-LDA for face recognition. 125-128 - Xiaogang Wang, Xiaoou Tang:
An improved Bayesian face recognition algorithm in PCA subspace. 129-132 - Juhua Zhu, Bede Liu, Stuart C. Schwartz:
General illumination correction and its application to face normalization. 133-136 - Alberto Albiol, Luis Torres, Edward J. Delp:
The indexing of persons in news sequences using audio-visual data. 137-140 - Haitao Wang, Yangsheng Wang:
Recognizing face images under different lighting conditions. 141-144
Motion Estimation
- Yu-Wen Huang, Bing-Yu Hsieh, Tu-Chih Wang, Shao-Yi Chien, Shyh-Yih Ma, Chun-Fu Shen, Liang-Gee Chen:
Analysis and reduction of reference frames for motion estimation in MPEG-4 AVC/JVT/H.264. 145-148 - Shiloh L. Dockstader, Nikita S. Imennov, A. Murat Tekalp:
Stochastic modeling of motion tracking failures. 149-152 - Yui-Lam Chan, Wan-Chi Siu:
An adaptive partial distortion search for block motion estimation. 153-156 - Ingo Stuke, Til Aach, Cicero Mota, Erhardt Barth:
Linear and regularized solutions for multiple motions. 157-160 - Mingren Shi, Victor Solo:
Empirical choice of smoothing parameters in optical flow with correlated errors. 161-164 - Jesús Chamorro-Martínez, Joaquín Fernández-Valdivia, Jose A. García, Javier Martinez-Baena:
A frequency-domain approach for the extraction of motion patterns. 165-168
Video Summarization
- Baoxin Li, Hao Pan, M. Ibrahim Sezan:
A general framework for sports video summarization with its application to soccer. 169-172 - Ahmet Ekin, A. Murat Tekalp:
Shot type classification by dominant color for sports video segmentation and summarization. 173-176 - Hsuan-Wei Chen, Jin-Hau Kuo, Jen-Hao Yeh, Ja-Ling Wu:
A multi-modal-feature based algorithm for parsing news program videos. 177-180 - Zuzana Cernekova, Constantine Kotropoulos, Ioannis Pitas:
Video shot segmentation using singular value decomposition. 181-184 - Kongwah Wan, Joo-Hwee Lim, Changsheng Xu, Xinguo Yu:
Real-time camera field-view tracking in soccer video. 185-188 - Min Xu, Ling-Yu Duan, Changsheng Xu, Qi Tian:
A fusion scheme of visual and auditory modalities for event detection in sports video. 189-192
Face Analysis
- Gang Pan, Zhaohui Wu, Yunhe Pan:
Automatic 3D face verification from range data. 193-196 - José Luis Landabaso, Montse Pardàs, Antonio Bonafonte:
HMM recognition of expressions in unrestrained video intervals. 197-200 - Wen-Shiung Chen, Shang-Yuan Yuan:
A novel personal biometric authentication technique using human iris based on fractal dimension features. 201-204 - Jianyu Wang, Wen Gao, Shiguang Shan, XiaoPeng Hu:
Facial feature tracking combining model-based and model-free method. 205-208 - Jun Wang, Radhakrishna S. V. Achanta, Mohan S. Kankanhalli, Philippe Mulhem:
A hierarchical framework for face tracking using state vector fusion for compressed video. 209-212 - Heng Liu, Shengye Yan, Xilin Chen, Wen Gao:
Rotated face detection in color images using radial template (RT). 213-216 - Shi-Lin Wang, Wing Hong Lau, Shu-hung Leung:
A new real-time lip contour extraction algorithm. 217-220 - Ying Guo, Geoff Poulton, Jiaming Li, Mark Hedley, Rong-yu Qiao:
Soft margin AdaBoost for face pose classification. 221-224 - Shaohua Kevin Zhou, Rama Chellappa:
Simultaneous tracking and recognition of human faces from video. 225-228 - Xiujuan Chai, Shiguang Shan, Wen Gao, Bo Cao:
Novel example-based shape learning for fast face alignment. 229-232 - Norman Poh Hoon Thian, Sébastien Marcel, Samy Bengio:
Improving face authentication using virtual samples. 233-236
Lossless and Lossy Image Coding
- Guang Deng, Hua Ye:
A general framework for the second-level adaptive prediction. 237-240 - Giovanni Motta, Francesco Rizzo, James A. Storer:
Partitioned vector quantization: application to lossless compression of hyperspectral images. 241-244 - Mehmet Utku Celik, A. Murat Tekalp, Gaurav Sharma:
Level-embedded lossless image compression. 245-248 - Marie Babel, Olivier Déforges:
Lossless and lossy minimal redundancy pyramidal decomposition for scalable image compression technique. 249-252 - Ahmed Abu-Hajar, Ravi Sankar:
Enhanced partial-SPIHT for lossless and lossy image compression. 253-256 - Aysegül Çuhadar, Sinan Tasdoken:
Multiple, arbitrary shape ROI coding with zerotree based wavelet coders. 257-260 - Yick Ming Yeung, Oscar C. Au, Andy Chang:
Successive bit-plane rate allocation technique for JPEG2000 image coding. 261-264 - Chi-Keung Fong, Wai-kuen Cham:
An improved edge-model based representation and its application in image post-processing. 265-268 - Wenhuan Xu, Asoke K. Nandi, Jihong Zhang:
A new fuzzy reinforcement learning vector quantization algorithm for image compression. 269-272 - Chengjie Tu, Trac D. Tran, Jie Liang:
Error resilient pre-/post-filtering for DCT-based block coding systems. 273-276 - Xingsong Hou, Guizhong Liu, Yiyang Zou:
Embedded quadtree-based image compression in DCT domain. 277-280 - Yu Hen Hu, Rajas A. Sambhare:
Constrained texture synthesis for image post processing. 281-284
Multidimensional Signal Processing Theory and Methods
- Mats T. Andersson, Hans Knutsson:
Transformation of local spatio-temporal structure tensor fields. 285-288 - Steven M. Kay, Christopher P. Carbone:
Vector space solution to the multidimensional Yule-Walker equations. 289-292 - Weixiang Liu, Nanning Zheng, Xiaofeng Lu:
Non-negative matrix factorization for visual coding. 293-296 - Erik G. Miller:
A new class of entropy estimators for multi-dimensional densities. 297-300 - Dimitri Van De Ville, Thierry Blu, Michael Unser:
Recursive filtering for splines on hexagonal lattices. 301-304 - Ilya Pollak, Jeffrey Mark Siskind, Mary P. Harper, Charles A. Bouman:
Modeling and estimation of spatial random trees with application to image classification. 305-308 - Ngai-Fong Law, Wan-Chi Siu:
A fast and efficient computational structure for the 2D over-complete wavelet transform. 309-312 - Arlene A. Cole-Rhodes, Abake Adenle:
Automatic image registration by stochastic optimization of mutual information. 313-316 - Subrata Rakshit, Malay Kumar Nema:
Symmetric residue pyramids - an extension of Burt Laplacian pyramids. 317-320 - Takao Hinamoto, Keisuke Higashi, Wu-Sheng Lu:
Jointly optimized error feedback and realization for roundoff noise minimization in two-dimensional state-space digital filters. 321-324 - Eva Dejnozková, Petr Dokládal:
A parallel algorithm for solving the Eikonal equation. 325-328 - Florent Perronnin, Jean-Luc Dugelay, Kenneth Rose:
Iterative decoding of two-dimensional hidden Markov models. 329-332
Image and Video Segmentation
- Yuan Been Chen, Oscal T.-C. Chen:
Robust fully-automatic segmentation based on modified edge-following technique. 333-336 - Mathias Ortner, Xavier Descombes, Josiane Zerubia:
Building extraction from digital elevation models. 337-340 - Gouchol Pok, Jyh-Charn Liu, Keun Ho Ryu:
Fast estimation of the number of texture segments using cooccurrence statistics. 341-344 - Qixiang Ye, Wen Gao, Wei Zeng:
Color image segmentation using density-based clustering. 345-348 - Darren E. Butler, Sridha Sridharan, V. Michael Bove Jr.:
Real-time adaptive background segmentation. 349-352 - Son Lam Phung, Douglas Chai, Abdesselam Bouzerdoum:
Adaptive skin segmentation in color images. 353-356 - Soo-Chang Pei, Jian-Jiun Ding:
The generalized radial Hilbert transform and its applications to 2D edge detection (any direction or specified directions). 357-360 - Shunsuke Kamijo, Masao Sakauchi:
Segmentation of vehicles and pedestrians in traffic scene by spatio-temporal Markov random field model. 361-364 - Hanfeng Chen, Feihu Qi, Su Zhang:
Supervised video object segmentation using a small number of interactions. 365-368 - Su Zhang, Hanfeng Chen, Zheru Chi, Pengfei Shi:
An algorithm for segmenting moving vehicles. 369-372 - Eliza Yingzi Du, Chein-I Chang:
An unsupervised approach to color video thresholding. 373-376 - Day-Fann Shen, Ming-Tsong Huang:
A watershed-based image segmentation using JND property. 377-380
Video Coding II
- Lorenzo Granai, Fulvio Moschetti, Pierre Vandergheynst:
Ridgelet transform applied to motion compensated images. 381-384 - Huipin Zhang, Frank Bossen:
A heuristic search method of adaptive interpolation filters in motion compensated predictive video coding. 385-388 - Bojun Meng, Oscar C. Au:
Fast intra-prediction mode selection for 4x4 blocks in H.264. 389-392 - Habibollah Danyali, Alfred Mertins:
Fully scalable texture coding of arbitrarily shaped video objects. 393-396 - Manoranjan Paul, M. Manzur Murshed, Laurence Dooley:
A new real-time pattern selection algorithm for very low bit-rate video coding focusing on moving regions. 397-400 - Shunan Lin, Anthony Vetro, Yao Wang:
Rate-distortion analysis of the multiple description motion compensation video coding scheme. 401-404 - Sadaatsu Kato, Kazuo Sugimoto, Satoru Adachi, Minoru Etoh:
Structured "truncated Golomb code" for context-based adaptive VLC. 405-408 - Ee Ping Ong, Hua Wang, Ping Xue:
Video coding based on true motion estimation. 409-412 - Andy Chang, Oscar C. Au, Yick Ming Yeung:
A novel approach to fast multi-frame selection for H.264 video coding. 413-416 - Yiannis Andreopoulos, Mihaela van der Schaar, Adrian Munteanu, Joeri Barbarien, Peter Schelkens, Jan Cornelis:
Fully-scalable wavelet video coding using in-band motion compensated temporal filtering. 417-420 - Lap-Pui Chau, Xuan Jing:
Efficient three-step search algorithm for block motion estimation in video coding. 421-424 - Hsi-Tzeng Chan, Chung-Lin Huang:
Multiple description and matching pursuit coding for video transmission over the Internet. 425-428
Image Processing: Applications
- Antoine Roueff, Jérôme I. Mars, Jocelyn Chanussot, Helle Pederson:
Simultaneous group and phase correction for the estimation of dispersive propagating waves in the time-frequency plane. 429-432 - Benayad Nsiri, Thierry Chonavel, Jean-Marc Boucher:
Blind estimation of long impulse response and non-minimum phase wavelets application to seismic data. 433-436 - Qian Du, Sumit Chakrarvarty:
Unsupervised hyperspectral image classification using blind source separation. 437-440 - Yibin Zheng:
A new algorithm for retrieval of 2D exponentials. 441-444 - Anthony Sourice, Guy Plantier, Jean-Louis Saumet:
Two-dimensional frequency estimation using autocorrelation phase fitting. 445-448 - Jingxin Zhang, Jim Schroeder, Nicholas J. Redding:
SAR image enhancement for small target detection. 449-452 - Zhiping Lin, Qiyue Zou, Raimund J. Ober:
The Fisher information matrix for two-dimensional data sets. 453-456 - Damien Muti, Salah Bourennane:
Multidimensional signal processing using lower-rank tensor approximation. 457-460 - Wen-Hung Liao, Dai-Yun Li:
Homomorphic processing techniques for near-infrared images. 461-464 - Huafeng Liu, Lung Ngong Wong, Pengcheng Shi:
Cardiac motion and material properties analysis using data confidence weighted extended Kalman filter framework. 465-468 - Cha Zhang, Tsuhan Chen:
On generalized sampling for image-based rendering data. 469-472 - Chaminda Weerasinghe, Wanqing Li, Philip Ogunbona:
Stereoscopic panoramic video generation using centro-circular projection technique. 473-476
Image and Video Analysis I
- Chip-Hong Chang, Rui Xiao, Thambipillai Srikanthan:
An adaptive initialization technique for color quantization by self organizing feature map. 477-480 - Steve Mann, Corey Manders, James Fung:
The lightspace change constraint equation (LCCE) with practical application to estimation of the projectivity+gain transformation between multiple pictures of the same subject matter. 481-484 - Nilanjan Dasgupta, Lawrence Carin:
Context-based graphical modeling for wavelet domain signal processing. 485-488 - Hui Cheng:
Temporal registration of video sequences. 489-492 - Namrata Vaswani, Amit K. Roy-Chowdhury, Rama Chellappa:
Statistical shape theory for activity modeling. 493-496 - Amit K. Roy-Chowdhury, Amit A. Kale, Rama Chellappa:
Video synthesis of arbitrary views for approximately planar scenes. 497-500 - John N. Carter, Pelopidas Lappas, Robert I. Damper:
Evidence-based object tracking via global energy maximization. 501-504 - Fenghui Yao, Guifeng Shao:
Detection of 3D symmetry axis from fragments of a broken pottery bowl. 505-508 - Minghui Xia, Bede Liu:
"Super-resolution curve" and image registration. 509-512
Watermarking II
- Adnan M. Alattar, Eugene T. Lin, Mehmet Utku Celik:
Watermarking low bit-rate Advanced Simple Profile MPEG-4 bitstreams. 513-516 - Jun Tian:
High capacity reversible data embedding and content authentication. 517-520 - Patrick Bas, Nicolas Le Bihan, Jean-Marc Chassery:
Color image watermarking using quaternion Fourier transform. 521-524 - Shih-Hsuan Yang:
Wavelet filter evaluation for image watermarking. 525-528 - Ming Sun Fu, Oscar C. Au:
A novel method to embed watermark in different halftone images: data hiding by conjugate error diffusion (DHCED). 529-532 - Yanjiang Yang, Feng Bao:
An invertible watermarking scheme for authentication of Electronic Clinical Brain Atlas. 533-536 - Dajun He, Qibin Sun, Qi Tian:
An object based watermarking solution for MPEG4 video authentication. 537-540 - Quan He, Guangchuan Su:
A semi-blind robust watermarking for digital images. 541-544 - Tao Zhang, Xijian Ping:
Reliable detection of LSB steganography based on the difference image histogram. 545-548 - Guorui Feng, Ling-ge Jiang, Chen He, Dong-Jian Wang:
A novel algorithm for embedding and detecting digital watermarks. 549-552 - Slaven Marusic, David B. H. Tay, Guang Deng:
A parametric family of wavelet filters for diversity in watermarking application. 553-556 - Oscar C. Au, Ming Sun Fu:
A symmetric key watermark for halftone images. 557-560
Image and Video Indexing and Retrieval II
- Rozenn Dahyot, Anil C. Kokaram, Niall Rea, Hugh Denman:
Joint audio visual retrieval for tennis broadcasts. 561-564 - Chi-Man Pun:
Invariant content-based image retrieval by wavelet energy signatures. 565-568 - Haim H. Permuter, Joseph M. Francos, Ian H. Jermyn:
Gaussian mixture models of texture and colour for image database retrieval. 569-572 - Akisato Kimura, Kunio Kashino, Takayuki Kurozumi, Hiroshi Murase:
Dynamic-segmentation-based feature dimension reduction for quick audio/video searching. 573-576 - Seong-O Shim, Tae-Sun Choi:
Image indexing by modified color cooccurrence matrix. 577-580 - Jiqiang Song, Min Cai, Michael R. Lyu:
A robust statistic method for classifying color polarity of video text. 581-584 - Ming Hong Pi, Mrinal Mandal, Anup Basu:
Image retrieval based on histogram of new fractal parameters. 585-588 - Anxiang Hong, Zheru Chi, Gang Chen, Zhiyong Wang:
Region-of-interest based flower images retrieval. 589-592 - Michael Hoeynck, Jens-Rainer Ohm:
Shape retrieval with robustness against partial occlusion. 593-596 - Guoping Qiu:
Appearance indexing. 597-600 - Chun-Ho Cheung, Lai-Man Po:
A novel histogram-biasing factor for fast sorted histogram-based measurement in large image database retrieval system. 601-604 - Miki Haseyama, Isao Kondo:
2-D functional AR model for image identification. 605-608 - Xiaokang Yang, Weisi Lin, Zhongkang Lu, Ee Ping Ong, Susu Yao:
Just-noticeable-distortion profile with nonlinear additivity model for perceptual masking in color images. 609-612 - Zhizhong Zhe, Hong Ren Wu, Zhenghua Yu, Tim Ferguson, Damian M. Tan:
Performance evaluation of a perceptual ringing distortion metric for digital video. 613-616 - Zhongkang Lu, Weisi Lin, Ee Ping Ong, Susu Yao, Xiaokang Yang:
Perceptual-quality significance map (PQSM) and its application on video quality distortion metrics. 617-620 - Deepak S. Turaga, Mihaela van der Schaar:
Content-adaptive filtering in the UMCTF framework. 621-624 - Toshiyuki Uto, Masaaki Ikehara:
A smooth extension for the nonexpansive orthogonal wavelet decomposition of finite length signals. 625-628 - Roger Pique, Luis Torres:
Efficient face coding in video sequences combining adaptive principal component analysis and a hybrid codec approach. 629-632 - Hideaki Kimata, Masaki Kitahara, Yoshiyuki Yashima:
3D motion vector coding with block base adaptive interpolation filter on H.264. 633-636 - Jong Chul Ye, Yingwei Chen:
Rate-distortion optimized data partitioning for video using backward adaptation. 637-640 - Gabriella Olmo, Cristiano Cucco, Marco Grangetto, Enrico Magli:
Few decoders in the encoder: a low complexity encoding strategy for H.26L. 641-644 - Òscar Divorra Escoda, Pierre Vandergheynst:
A locally temporal adaptive transform scheme for sub-band video coding. 645-648 - Golam Sorwar, M. Manzur Murshed, Laurence Dooley:
A fully adaptive performance-scalable distance-dependent thresholding search algorithm for video coding. 649-652 - Shing-Chow Chan, King To Ng, Zhi-Feng Gan, Kin-Lok Chan, Heung-Yeung Shum:
The compression of simplified dynamic light fields. 653-656
Image and Video Analysis II
- Lingyan Bi, Kwok Ping Chan, Yinglin Yu:
Modified CLT-domain motion estimation. 657-660 - Xuan Jing, Ce Zhu, Lap-Pui Chau:
Smooth constrained block matching criterion for motion estimation. 661-664 - Hai-Yun Wang, Kai-Kuang Ma:
Motion field discontinuity classification for tensor-based optical flow estimation. 665-668 - Yu-Chan Lim, Kyeong-Yuk Min, Jong-Wha Chong:
A pentagonal fast block matching algorithm for motion estimation using adaptive search range. 669-672 - Miki Haseyama, Atsushi Matsumura:
A trainable retrieval system for cartoon character images. 673-676 - Sangoh Jeong, Chee Sun Won, Robert M. Gray:
Histogram-based image retrieval using Gauss mixture vector quantization. 677-680 - Shan Suthaharan:
A perceptually significant block-edge impairment metric for digital video coding. 681-684 - Li Cheng, Terry Caelli:
Doubly-MRF stereo matching. 685-688 - Yong Yu, Isabelle Bloch, Alain Trouvé:
A unified unsupervised clustering algorithm and its first application to landcover classification. 689-692 - Peng Wang, Yufei Ma, Hong-Jiang Zhang, Shiqiang Yang:
A people similarity based approach to video indexing. 693-696 - Shunren Xia, Weidong Xu, Yutang Shen:
Two intelligent algorithms applied to automatic chromosome incision. 697-700 - Zhonghua Liang, Ping Wang, Zheng Tan:
Moving object detection from MPEG bit stream. 701-704
Image and Video Restoration
- Javier Mateos, Rafael Molina, Aggelos K. Katsaggelos:
Bayesian high resolution image reconstruction with incomplete multisensor low resolution systems. 705-708 - Javier Abad, Miguel Vega, Rafael Molina, Aggelos K. Katsaggelos:
Parameter estimation in super-resolution image reconstruction problems. 709-712 - Ying Fai Ho:
Peer region determination based impulsive noise detection. 713-716 - Euncheol Choi, Moon Gi Kang:
Deblocking algorithm for DCT-based compressed images using anisotropic diffusion. 717-720 - Bogdan Smolka, Konstantinos N. Plataniotis, Rastislav Lukac, Anastasios N. Venetsanopoulos:
New class of impulsive noise reduction filters based on kernel density estimation. 721-724 - Ryo Nakagaki, Aggelos K. Katsaggelos:
A VQ-based blur identification algorithm. 725-728 - Pascal Bourdon, Bertrand Augereau, Christian Olivier, Christian Chatellier:
A PDE-based method for ringing artifact removal on grayscale and color JPEG2000 images. 729-732 - Alexander M. Bronstein, Michael M. Bronstein, Michael Zibulevsky, Yehoshua Y. Zeevi:
Separation of semireflective layers using sparse ICA. 733-736 - Lorenzo Cappellari, Truong Q. Nguyen:
Deblocking of video sequences with lapped embedded IDCT. 737-740 - Ju Jia Zou, Hong Yan:
Model-based smoothing for reducing artifacts in compressed images. 741-744 - Rastislav Lukac, Bogdan Smolka, Konstantinos N. Plataniotis, Anastasios N. Venetsanopoulos, Pavol Zavarsky:
Angular multichannel sigma filter. 745-748
Signal Processing Education
- Roxana Saint-Nom, Daniel Jacoby:
Switched capacitors: a bridge between analog and digital SP. 749-752 - Yong Lian:
The joy of learning DSP in a large class. 753-756 - Eliathamby Ambikairajah, Julien Epps, Ming Sheng, Branko G. Celler:
Evaluation of a virtual teaching laboratory for signal processing education. 757-760 - Jeng-Kuang Hwang:
Innovative communication design lab based on PC sound card and Matlab: a software-defined-radio OFDM modem example. 761-764 - John Håkon Husøy:
Making a case for iterative linear equation solvers in DSP education. 765-768 - Thad B. Welch, Robert W. Ives, Michael G. Morrow, Cameron H. G. Wright:
Using DSP hardware to teach modem design and analysis techniques. 769-772