


ICME 2003: Baltimore, MD, USA
- Proceedings of the 2003 IEEE International Conference on Multimedia and Expo, ICME 2003, 6-9 July 2003, Baltimore, MD, USA. IEEE Computer Society 2003, ISBN 0-7803-7965-9
Volume 1
Networked Video I
- Thinh P. Q. Nguyen, Puneet Mehra, Avideh Zakhor:
Path diversity and bandwidth allocation for multimedia streaming. 1-4
- Susie J. Wee, John G. Apostolopoulos, Wai-tian Tan, Sumit Roy:
Research and design of a mobile streaming media content delivery network. 5-8
- Jacob Chakareski, Eric Setton, Yi J. Liang, Bernd Girod:
Video streaming with diversity. 9-12
- Marco Fumagalli, Phoom Sagetong, Antonio Ortega:
Estimation of erased data in a H.263 coded stream by using unbalanced multiple description coding. 13-16
- Amy R. Reibman, Vinay A. Vaishampayan:
Quality monitoring for compressed video subjected to packet loss. 17-20
Automatic Indexing
- Rémi Ronfard, Tien Tran-Thuong:
A framework for aligning and indexing movies with their script. 21-24
- Xiaofei He, Wei-Ying Ma, Hong-Jiang Zhang:
Imagerank: spectral techniques for structural analysis of image database. 25-28
- Adam Berenzweig, Daniel P. W. Ellis, Steve Lawrence:
Anchor space for classification and similarity measurement of music. 29-32
- Tong Zhang:
Automatic singer identification. 33-36
- Matthew R. Boutell, Jiebo Luo, Robert T. Gray:
Sunset scene classification using simulated image recomposition. 37-40
Multimodal Interfaces
- Yeow Kee Tan, Nasser Sherkat, Tony Allen:
Eye gaze and speech for data entry: a comparison of different data entry methods. 41-44
- Yasuhito Sawahata, Kiyoharu Aizawa:
Wearable imaging system for summarizing personal experiences. 45-48
- Timothy T. H. Chen, Sidney S. Fels, Saehee Sarah Min:
FlowField and beyond: applying pressure-sensitive multi-point touchpad interaction. 49-52
- Xin Fan, Xing Xie, Wei-Ying Ma, Hong-Jiang Zhang, He-Qin Zhou:
Visual attention based image browsing on mobile devices. 53-56
- Björn W. Schuller, Martin Zobl, Gerhard Rigoll, Manfred K. Lang:
A hybrid music retrieval system using belief networks to integrate multimodal queries and contextual knowledge. 57-60
Speech and Audio Processing I
- Hsuan-Huei Shih, Shrikanth S. Narayanan, C.-C. Jay Kuo:
A statistical multidimensional humming transcription using phone level hidden Markov models for query by humming systems. 61-64
- Rongshan Yu, Xiao Lin, Susanto Rahardja, Chi Chung Ko:
A fine granular scalable perceptually lossy and lossless audio coder. 65-68
- Simon Lucey, Tsuhan Chen:
An investigation into subspace rapid speaker adaptation for verification. 69-72
- Manuel J. Reyes Gomez, Daniel P. W. Ellis:
Selection, parameter estimation, and discriminative training of hidden Markov models for general audio modeling. 73-76
- Chih-Kai Yang, Sou-Gee Chen:
New static and dynamic search algorithms for fast MP3 bit allocations. 77-80
Image Processing I
- Yongmin Li, Li-Qun Xu, Geoff Morrison, Charles Nightingale, Jason Morphett:
Robust panorama from MPEG video. 81-84
- Jun-Wei Hsieh:
Fast stitching algorithm for moving object detection and mosaic construction. 85-88
- Zhang John Chen, Jagath Samarabandu:
Planar region depth filling using edge detection with embedded confidence technique and Hough transform. 89-92
- S. H. Srinivasan, Mohan S. Kankanhalli:
Wide baseline spectral matching. 93-96
- Wei-Qi Yan, Mohan S. Kankanhalli:
Colorizing infrared home videos. 97-100
- Hasan F. Ates, Michael T. Orchard:
Image interpolation using wavelet-based contour estimation. 101-104
- Andy Chang, Oscar C. Au, Yick Ming Yeung:
A novel approach to fast multi-block motion estimation for H.264 video coding. 105-108
- Gulcin Caner, A. Murat Tekalp, Wendi B. Heinzelman:
Super resolution recovery for multi-camera surveillance imaging. 109-112
- Yu Hen Hu, Rajas A. Sambhare:
Constrained texture synthesis for image post processing. 113-116
Multimedia Architectures and Implementation
- Nikolaos Bellas, Malcolm Dwyer:
A programmable, high performance vector array unit used for real-time motion estimation. 117-120
- Tay-Jyi Lin, Chin-Chi Chang, Tsung-Hsun Yang, Yu-Ming Chang, Chien-Hung Lin, Chen-Chia Lee, Hung-Yueh Lin, Chein-Wei Jen:
Performance evaluation of ring-structure register file in multimedia applications. 121-124
- Tay-Jyi Lin, Tsung-Hsun Yang, Chein-Wei Jen:
Coefficient optimization for area-effective multiplier-less FIR filters. 125-128
- Satoshi Nishiguchi, Kazuhide Higashi, Yoshinari Kameda, Michihiko Minoh:
A sensor-fusion method for detecting a speaking student. 129-132
- Tsung-Han Tsai, Wen-Cheng Chen, Chun-Nan Liu:
A low power VLSI implementation for variable length decoder in MPEG-1 layer III. 133-136
- Hung-Chi Fang, Tu-Chih Wang, Yu-Wei Chang, Ya-Yun Shih, Liang-Gee Chen:
Novel word-level algorithm of embedded block coding in JPEG 2000. 137-140
- Jongmyon Kim, D. Scott Wills:
Quantized color instruction set for media-on-demand applications. 141-144
- Michelle Yan, James Shaw, Vahid Khamsi, Shih-Ping Liou:
Tracking and presenting user attention for collaborative browsing using heterogeneous devices. 145-148
- Shinsuke Kobayashi, Kentaro Mita, Yoshinori Takeuchi, Masaharu Imai:
Rapid prototyping of JPEG encoder using the ASIP development system: PEAS-III. 149-152
Text, Graphics, Face, Scene, and Song Recognition
- Ioannis Andreou, Nikitas M. Sgouros:
Sketch creation utilizing shape matching techniques. 153-156
- Michael H. Lee, Surya Nepal, Uma Srinivasan:
Edge-based semantic classification of sports video sequences. 157-160
- Gees C. Stein, Jens Rittscher, Anthony Hoogs:
Enabling video annotation using a semantic database extended with visual knowledge. 161-164
- Hidehisa Nagano, Kunio Kashino, Hiroshi Murase:
A fast search algorithm for background music signals based on the search for numerous small signal components. 165-168
- Ahmet Ekin, A. Murat Tekalp:
Generic play-break event detection for summarization and hierarchical sports video analysis. 169-172
- Amit Chakraborty, Peiya Liu, Liang H. Hsu:
Extracting anchorable information units from PDF files. 173-176
- Lijun Yin, Sergey Royt, Matt T. Yourst, Anup Basu:
Recognizing facial expressions using active textures with wrinkles. 177-180
- Francis K. H. Quek, Yingen Xiong:
Oscillatory gestures and discourse. 181-184
Networked Video II
- Haitao Zheng:
Optimizing wireless multimedia transmissions through cross layer design. 185-188
- Jacco R. Taal, Ivaylo Haratcherev, Koen Langendoen, Inald Lagendijk:
Quality of service controlled adaptive video-coding over IEEE 802.11 wireless links. 189-192
- Thomas Stockhammer:
Is fine-granular scalable video coding beneficial for wireless video applications? 193-196
- Jie Chen, S. Hsia:
Joint cross-layer design for wireless QoS video delivery. 197-200
- Trista Pei-Chun Chen, Tsuhan Chen:
Shaping for video with frame dependency. 201-204
Multimedia Security and Content Protection I
- H. Vicky Zhao, Min Wu, Z. Jane Wang, K. J. Ray Liu:
Performance of detection statistics under collusion attacks on independent multimedia fingerprints. 205-208
- Alexia Giannoula, Anastasios Tefas, Nikos Nikolaidis, Ioannis Pitas:
Improving the detection reliability of correlation-based watermarking techniques. 209-212
- Ming Sun Fu, Oscar C. Au:
A multi-bit robust watermark for halftone images. 213-216
- Nedeljko Cvejic, Djordje Tujkovic, Tapio Seppänen:
Increasing robustness of an audio watermark using turbo codes. 217-220
- Jonathan Foote, John Adcock, Andreas Girgensohn:
Time base modulation: a new approach to watermarking audio. 221-224
Virtual Reality and Imaging I
- Satya P. Mallick, Mohan M. Trivedi:
Parametric face modeling and affect synthesis. 225-228
- Inmaculada Rodríguez Santiago, Manuel Peinado, Ronan Boulic, Daniel Meziat:
Bringing the human arm reachable space to a virtual environment for its analysis. 229-232
- Cha Zhang, Tsuhan Chen:
A system for active image-based rendering. 233-236
- Yuzhong Shen, Kenneth E. Barner:
Surface denoising with directional fuzzy vector median filtering. 237-240
- Yong-In Yoon, Jang-Hwan Im, Dae-Hyun Kim, Jong-Soo Choi:
Reconstruction of linearly parameterized models using the vanishing points from a single image. 241-244
Authentication and Recognition
- Wende Zhang, Tsuhan Chen:
Personal authentication based on generalized symmetric max minimal distance in subspace. 245-248
- Thang Viet Nguyen, Jagdish Chandra Patra, Ee-Luang Ang:
Blind image extraction from nonlinear mixtures using MLP-based ICA. 249-252
- Wei Wang, Aidong Zhang, Yuqing Song:
Identification of objects from image regions. 253-256
- S. Palanivel, B. S. Venkatesh, B. Yegnanarayana:
Real time face authentication system using autoassociative neural network models. 257-260
- Dong-Wan Kang, Jun Ohya:
Postures of a human wearing a multiple-colored suit based on color information processing. 261-264
Wireless Multimedia Techniques
- Wei Wang, Michael R. Lyu:
Automatic generation of dubbing video slides for mobile wireless environment. 265-268
- Surya Nepal, Uma Srinivasan:
Adaptive video highlights for wired and wireless platforms. 269-272
- Dirk Trossen, Hemant H. Chaskar:
Enabling user-tailored MMS delivery in heterogeneous access scenarios. 273-276
- Shengjie Zhao, Zixiang Xiong, Xiaodong Wang:
Optimal resource allocation for wireless video over CDMA networks. 277-280
- Amol Bhatkar, Rajarathnam Chandramouli, Narayanan Vijaykrishnan, Mary Jane Irwin:
Computation and transmission energy modeling through profiling for MPEG4 video transmission. 281-284
- Wen Xu, Sheila S. Hemami:
Delay-optimized robust transmission of images over multiple channels. 285-288
- Wanghong Yuan, Klara Nahrstedt:
Buffering approach for energy saving in video sensors. 289-292
- Jiancong Chen, S.-H. Gary Chan, Qian Zhang, Wenwu Zhu, Jin Chen:
A distributed power adaptation algorithm for multimedia delivery over ad hoc networks. 293-296
Content-based Retrieval
- Jieh Hsiang, Wen-Jun Liu, Bee-Chung Chen, Hsieh-Chang Tu:
Multidimensional interactive fine-grained image retrieval. 297-300
- Jürgen Assfalg, Alberto Del Bimbo, Pietro Pala:
Curvature maps for 3D CBR. 301-304
- Xiangdong Zhou, Qi Zhang, Lan Lin, Ailin Deng, Gang Wu:
Image retrieval by fuzzy clustering of relevance feedback records. 305-308
- Jun Gao, George Tzanetakis, Peter Steenkiste:
Content-based retrieval of music in scalable peer-to-peer networks. 309-312
- Lei Zhang, Fang Qian, Mingjing Li, Hong-Jiang Zhang:
An efficient memorization scheme for relevance feedback in image retrieval. 313-316
- Yuxin Peng, Chong-Wah Ngo, Qing-Jie Dong, Zongming Guo, Jianguo Xiao:
Video clip retrieval by maximal matching and optimal matching in graph theory. 317-320
- Xin Huang, Shu-Ching Chen, Mei-Ling Shyu:
Incorporating real-valued multiple instance learning into relevance feedback for image retrieval. 321-324
- Ming Hong Pi, Mrinal Mandal, Anup Basu:
Image retrieval based on 2-D histogram of fractal parameters. 325-328
- Giridharan Iyengar, Harriet J. Nock, Chalapathy Neti:
Audio-visual synchrony for detection of monologues in video archives. 329-332
- Min Xu, Ling-Yu Duan, Changsheng Xu, Qi Tian:
A fusion scheme of visual and auditory modalities for event detection in sports video. 333-336
Image Processing II
- Ching-Yeh Chen, Shao-Yi Chien, Yi-Hau Chen, Yu-Wen Huang, Liang-Gee Chen:
Unsupervised object-based sprite coding system for tennis sport. 337-340
- Armando J. Pinho, António J. R. Neves:
Block-based histogram packing of color-quantized images. 341-344
- Nejat Kamaci, Yucel Altunbasak:
Performance comparison of the emerging H.264 video coding standard with the existing standards. 345-348
- Xiaodong Gu, Hong-Jiang Zhang:
Implementing dynamic GOP in video encoding. 349-352
- Yung-Gi Wu, Ming-Zhi Huang, Yu-Ling Wen:
Fractal image compression with variance and mean. 353-356
- Martin P. Boliek, Gene K. Wu:
JPEG 2000-like access using the JPM compound document file format. 357-360
- Shou-Yi Tseng:
Efficient motion estimation algorithm using run-time and distortion optimization approach. 361-364
- Liang Zhang:
Statistical model for intensity differences of corresponding points between stereo image pairs. 365-368
- Yuhua Ding, George J. Vachtsevanos, Anthony J. Yezzi Jr., Wayne Daley, Bonnie S. Heck-Ferri:
A real-time curve evolution-based image fusion algorithm for multisensory image segmentation. 369-372
- Bernd Girod, Chuo-Ling Chang, Prashant Ramanathan, Xiaoqing Zhu:
Light field compression using disparity-compensated lifting. 373-376
Speech Coding, Analysis, and Synthesis
- Christian H. Ritz, Ian S. Burnett, Jason Lukasiak:
Low bit rate wideband WI speech coding. 377-380
- Houman Zarrinkoub, Paul Mermelstein:
Joint optimization of short-term and long-term predictors in CELP speech coders. 381-384
- Om Deshmukh, Carol Y. Espy-Wilson:
A measure of aperiodicity and periodicity in speech. 385-388
- K. Sreenivasa Rao, B. Yegnanarayana:
Prosodic manipulation using instants of significant excitation. 389-392
- Arun Kumar, Ashish Verma:
Using phone and diphone based acoustic models for voice conversion: a step towards creating voice fonts. 393-396
- Xiaodong He, Wu Chou:
Minimum classification error linear regression for acoustic model adaptation of continuous density HMMs. 397-400
- Björn W. Schuller, Gerhard Rigoll, Manfred K. Lang:
Hidden Markov model-based speech emotion recognition. 401-404
- Dong Wang, Lie Lu, Hong-Jiang Zhang:
Speech segmentation without speech recognition. 405-408
- Julien Pinquier, Jean-Luc Rouas, Régine André-Obrecht:
A fusion study in speech / music classification. 409-412
Multimedia Technology for Gaming
- Mohammed Chalil, K. P. Sreekumar, Manoj Sankar:
MPEG-4 based framework for game engines to handle virtual advertisements in game. 413-416
- Amaryllis Raouzaiou, Kostas Karpouzis, Stefanos D. Kollias:
Emotion representation for online gaming. 417-420
- Ghassan Al-Regib, Yucel Altunbasak:
3TP: an application-layer protocol for streaming 3-D graphics. 421-424
- Magy Seif El-Nasr, Ian Horswill:
Expressive lighting for interactive entertainment. 425-428
- Son Minh Tran, Marius Preda, Françoise J. Prêteux, Kalman Fazekas:
Exploring MPEG-4 BIFS features for creating multimedia games. 429-432
Multimedia Learning
- Raghavendra Singh, Ravi Kothari:
Relevance feedback algorithm based on learning from labeled and unlabeled data. 433-436
- Milind R. Naphade, Ching-Yung Lin, Apostol Natsev, Belle L. Tseng, John R. Smith:
A framework for moderate vocabulary semantic visual concept detection. 437-440
- Shinsuke Nakajima, Shinichi Kinoshita, Katsumi Tanaka:
Amplifying the differences between your positive samples and neighbors in image retrieval. 441-444
- Apostol Natsev, John R. Smith:
Active selection for multi-example querying by content. 445-448
- Tzvetanka I. Ianeva, Arjen P. de Vries, Hein Röhrig:
Detecting cartoons: a case study in automatic video-genre classification. 449-452
QoS
- Wuttipong Kumwilaisak, Qian Zhang, Wenwu Zhu, C.-C. Jay Kuo, Ya-Qin Zhang:
On the rate constraint of transmitting multiple priority classes with QoS. 453-456
- Bo Shen:
Meta-caching and meta-transcoding for server-side service proxy. 457-460
- Sheau-Ru Tong, Chun-Cheng Chang:
Harmonic DiffServ: provisioning scalable heterogeneous-QoS multicast in DiffServ networks. 461-464
- Rajeev Kumar:
A protocol with transcoding to support QoS over Internet for multimedia traffic. 465-468
- Nam Pham Ngoc, Gauthier Lafruit, Jean-Yves Mignolet, Serge Vernalde, Geert Deconinck, Rudy Lauwereins:
A framework for mapping scalable networked applications on run-time reconfigurable platforms. 469-472