


default search action
APSIPA 2021: Tokyo, Japan
- Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021, Tokyo, Japan, December 14-17, 2021. IEEE 2021, ISBN 978-988-14768-9-0 
- Jiahong Zhao, Christian Ritz: 
 Coprime Microphone Arrays for Estimating Speech Direction of Arrival Using Deep Learning. 1-8
- Takayuki Sasaki, Ryuichi Tanida, Masaki Kitahara, Hideaki Kimata: 
 Fast-Parallel Singular Value Thresholding for Many Small Matrices based on Geometric Feature of Singular Values. 1-8
- Tomomi Hatano, Tomomi Takezawa, Masashi Sugimoto, Kuangzhe Xu, Takashi Morikawa, Yasuhiro Azuma, Kazuo Shibuta, Noriko Nagata: 
 Measuring Attractiveness of Tourism Resources by Focusing on Kansei Value Structure: Possibility of Inviting Visitors Using the Japanese Heritage "Ako Salt.". 1-7
- Manav Kaushik, Van Tung Pham, Tran The Anh, Eng Siong Chng: 
 End-to-End Speaker Age and Height Estimation using Attention Mechanism and Triplet Loss. 1-8
- Yuto Ueda, Hidetoshi Nakashima, Yuuki Yuno, Nobuhiko Hiruma: 
 Binaural Adaptive Feedback Cancellation Based on Prediction Error Method Using Interaural Level Differences in Hearing Device. 9-16
- Cheng-Yu Cai, Yu-Hui Su, Li Su: 
 Dual-channel Drum Separation for Low-cost Drum Recording Using Non-negative Matrix Factorization. 17-22
- Daichi Hayakawa, Takehiko Kagoshima, Hiroshi Fujimura: 
 Mask-based Beamforming Using Complex-valued Neural Network for Recognition of Spatial Target Speech. 23-29
- Toru Takahashi, Takuma Ekawa, Masato Nakayama: 
 Moving Sound Source Tracking in Wide Space by Multiple Microphone Arrays. 30-35
- Kai Li, Masashi Unoki, Yongwei Li, Jianwu Dang, Masato Akagi: 
 Study on Simultaneous Estimation of Glottal Source and Vocal Tract Parameters by ARMAX-LF Model for Speech Analysis/Synthesis. 36-43
- Oguz Meteer, Marco Jan Gerrit Bekooij: 
 Low-Power Booth Multiplication without Dynamic Range Detection in FFTs for FMCW Radar Signal Processing. 44-48
- Xuehan Wang, Gongping Huang, Israel Cohen, Jacob Benesty, Jingdong Chen: 
 Kronecker Product Adaptive Beamforming for Microphone Arrays. 49-54
- Oguz Meteer, Marco Jan Gerrit Bekooij: 
 An Optimal Variable-Latency Architecture for Deterministic Approaches to Stochastic Computing with Unary Bit Stream Preserving Properties. 55-62
- Hiroyasu Takagi, Norishige Fukushima: 
 Domain Specific Description in Halide for Randomized Image Convolution. 63-69
- Kei Kawamura, Kyohei Unno, Yoshitaka Kidani: 
 Fast Still Picture Coding for VVC. 70-73
- Takumi Kondo, Yoshihiro Maeda, Norishige Fukushima: 
 Accelerating Finite Impulse Response Filtering Using Tensor Cores. 74-79
- Ippei Okuda, Masahiro Takaoka, Tomoaki Tsumura: 
 Hisui: an Image and Video Processing Framework with Auto-optimizer. 80-87
- Yoshihiro Maeda, Norishige Fukushima, Takayuki Hamamoto: 
 Color Transformation for Compressive Computing in Image Filtering. 88-92
- Xumin Yu, Yan Feng, Yanlong Gao: 
 Imbalanced sample feature enhancement of hyperspectral imagery classification. 93-99
- Jin Wu, Wei Dai, Yu Wang, Bo Zhao: 
 Improved Fruit Fly Optimization Algorithm Based on Simulated Annealing in Neural Network. 100-105
- Yun Zhu, Chuanzhan Hu, Lin Jiang, Xubang Shen: 
 An Implementation Method of HEVC Dataflow Graph Based on Reconfigurable Processer. 106-112
- Binghong Jiang: 
 An improved naive bayes model for air temperature prediction. 113-120
- Rong Yang, Xiaoyan Xie, Miaomiao Chai, Lin Fang, Wanqi He, Jingtao Sun: 
 An IDE for Reconfigurable Video Array Processor. 121-126
- Xiaoyan Xie, Miaomiao Chai, Zhuolin Du, Kun Yang, Shaorun Yin: 
 A Reconfigurable Parallelization of Generative Adversarial Networks based on Array Processor. 127-132
- Junyong Deng, Qingqing Ma, Zekun Ye: 
 Performance Characterization of Rasterization Algorithms for Reconfigurable Graphics Processor. 133-140
- Tse Wei Chiu, You Sheng Guo, Pao-Chi Chang: 
 Non-parallel Voice Conversion with Generative Attentional Networks. 141-145
- Hyunkook Park, Vien Gia An, Yeong Jun Koh, Chul Lee: 
 Unpaired Image Demoiréing Based on Cyclic Moiré Learning. 146-150
- Youngjin Oh, Gu Yong Park, Haesoo Chung, Sunwoo Cho, Nam Ik Cho: 
 Residual Dilated U-Net with Spatially Adaptive Normalization for the Restoration of Under Display Camera Images. 151-157
- Jae Hoon Shim, Hochang Rhee, Yeong Il Jang, Geonsu Lee, Seyun Kim, Nam Ik Cho: 
 Lossless Image Compression Based on Image Decomposition and Progressive Prediction Using Convolutional Neural Networks. 158-163
- Jintae Kim, Junheum Park, Whan Choi, Chang-Su Kim: 
 Facial Video Frame Interpolation Combining Symmetric and Asymmetric Motions. 164-169
- Cong Tin Nguyen, Bach-Tung Pham, Thi Phuong Le, Tzu-Chiang Tai, Jia-Ching Wang: 
 Face Anti-Spoofing Using Multi-Branch CNN. 170-173
- Bungo Konishi, Akira Hirose, Ryo Natsuaki: 
 Generalization characteristics of complex-valued reservoir computing for interferometric synthetic aperture radar applications. 174-178
- Takehiko Mizoguchi, Isao Yamada: 
 A Hypercomplex Tensor-SVD and Its Application. 179-186
- Yuto Okawa, Tohru Nitta: 
 Learning Properties of Feedforward Neural Networks Using Dual Numbers. 187-192
- Akira Hirose, Soshi Shimomura: 
 Adaptive Subsurface Imaging based on Peak Phase-Profile: The Significance in Separation of Scattering Phase from Propagation Phase. 193-199
- Yicheng Song, Akira Hirose: 
 Discussion on the Origin of the Strength of Phasor Quaternion Self-Organizing Map. 200-204
- Hiroki Tanji, Takahiro Murakami: 
 Learning the Statistical Model of the NMF Using the Deep Multiplicative Update Algorithm with Applications. 205-211
- Ryota Kato, Kenji Suyama: 
 An Improved Parameter Free Genetic Algorithm for CSD-FIR Filter design. 212-217
- Yuta Harigae, Kazuki Matumoto, Kenji Suyama: 
 A Proposal toward Standardization of Design Examples for IIR Filter Design Methods. 218-221
- Shunsuke Koshita: 
 On Optimal Realizations for All-Pass Fractional Delay Digital Filters. 222-225
- Takashi Yoshida: 
 Low-pass maximally flat IIR digital differentiator design with arbitrary flatness degree. 226-231
- Jitendra K. Tugnait: 
 On Sparse Graph Estimation Under Statistical and Laplacian Constraints. 232-239
- Marisa Mohr, Ralf Möller: 
 Ordering Principal Components of Multivariate Fractional Brownian Motion for Solving Inverse Problems. 240-247
- Ryu Takeda, Kazuhiro Nakadai, Kazunori Komatani: 
 Spatial Normalization to Reduce Positional Complexity in Direction-aided Supervised Binaural Sound Source Separation. 248-253
- Tomoro Tanaka, Kohei Yatabe, Yasuhiro Oikawa: 
 Phase-aware Audio Inpainting Based on Instantaneous Frequency. 254-258
- Koyo Kugiyama, Kimiko Motonaka, Yoshinobu Kajikawa, Seiji Miyoshi: 
 Statistical-Mechanical Analysis of Adaptive Volterra Filter for Time-Varying Unknown System. 259-263
- Dailys Arronde Pérez, Hubert Zangl: 
 High-accuracy reconstruction of periodic signals based on compressive sensing. 264-268
- Yih-Wen Wang, Chia-Ping Chen, Chung-Li Lu, Bo-Cheng Chan: 
 Semi-Supervised Sound Event Detection Using Self-Attention and Multiple Techniques of Consistency Training. 269-274
- Kouki Hori, Nari Tanabe, Masaya Fujisawa: 
 Nonlinear SVM-Type Automatic Dicision Algorithm in Noisy Environment for Hammering Test System. 275-281
- Yucheng Chen, Mingyi He, Yuchao Dai: 
 Nearby-person Occlusion Data Augmentation for Human Pose Estimation with Non-extra Annotations. 282-287
- Koki Yasui, Fumihiko Sakaue, Jun Sato, Yu Koyama, Mitsuyasu Matsuura: 
 Dense Depthmap Prediction from Ultrasonic Sensors. 288-294
- Kazuya Hanamoto, Shuichi Ohno: 
 Feedback Quantization and Bit Allocation for Networked Control Systems with Rate Limited Channels. 295-298
- Arvid B. Van Den Brink, Marco Jan Gerrit Bekooij: 
 Enhanced Loop-weakened Belief Propagation Algorithm for Performance Enhanced Polar Code Decoders. 299-304
- Jiyao Liu, Yanxi Zhao, Hao Wu, Dongmei Jiang: 
 Positional-Spectral-Temporal Attention in 3D Convolutional Neural Networks for EEG Emotion Recognition. 305-312
- Arvid Trapp, Peter Wolfsteiner: 
 Integrated spectral kurtosis analysis. 313-317
- Arvid B. Van Den Brink, Marco Jan Gerrit Bekooij: 
 Computational Complexity Reduced Belief Propagation Algorithm for Polar Code Decoders. 318-323
- Katsuki Fukumoto, Koki Yamada, Yuichi Tanaka: 
 Node Clustering of Time-Varying Graphs Based on Temporal Label Smoothness. 324-329
- Eisuke Yamagata, Shunsuke Ono: 
 Recovery of Time Series of Graph Signals Over Dynamic Topology. 330-336
- Arjun Ashok Rao, Hoi-To Wai: 
 An Empirical Study on Compressed Decentralized Stochastic Gradient Algorithms with Overparameterized Models. 337-343
- Cheng Yang, Fen Wang, Minxiang Ye, Guangtao Zhai, Xiao-Ping Zhang, Vladimir Stankovic, Lina Stankovic: 
 Model Selection-inspired Coefficients Optimization for Polynomial-Kernel Graph Learning. 344-350
- David Bonet, Antonio Ortega, Javier Ruiz Hidalgo, Sarath Shekkizhar: 
 Channel-Wise Early Stopping without a Validation Set via NNK Polytope Interpolation. 351-358
- Kuangzhe Xu, Noriko Nagata, Toshihiko Matsuka: 
 Modeling the dynamics of observational behaviors base on observers' personality traits using hidden Markov Models. 359-365
- Kuangzhe Xu, Kenji Katahira, Yoichi Yamazaki, Fan Zhang, Naoki Nishida, Yuichiro Tamai, Naoyuki Matsuzaki, Noriko Nagata: 
 Estimating Beverage Preference Based on Subjective Emotional Reactions and EEG Activity. 366-372
- Yoshiko Kawabata, Toshihiko Matsuka: 
 Aizuchi as a sign of internal information processing and its interpretations by listeners. 380-385
- Yuta Watanabe, Yoshitsugu Manabe, Noriko Yata: 
 Internal state estimation by thermal image and identification of face and nose position. 386-391
- Kei Irie, Yicheng Qiu, Kiyoshi Nishikawa: 
 On Improving the Accuracy of Object Detection for High Resolution Images Based on SSD. 392-399
- Yuiko Kumagai, Toshihisa Tanaka: 
 Detection of Note Onsets From EEG While Listening to Music. 400-405
- Yosuke Sugiura, Shunta Nagamori, Tetsuya Shimamura: 
 Speech Enhancement Network with Unsupervised Attention using Invariant Information Clustering. 406-409
- Ayana Mussabayeva, Zangar Ermaganbet, Prashant Kumar Jamwal, Muhammad Tahir Akhtar: 
 Event-Related Spectrogram Representation of EEG for CNN-Based P300 Speller. 410-415
- Timur Okhassov, Prashant Kumar Jamwal, Muhammad Tahir Akhtar: 
 Cost-Effective Proportionate Affine Projection Algorithm with Variable Parameters for Acoustic Feedback Cancellation. 416-422
- Nurbek Saidnassim, Beibit Abdikenov, Rauan Kelesbekov, Muhammad Tahir Akhtar, Prashant Kumar Jamwal: 
 Self-supervised Visual Transformers for Breast Cancer Diagnosis. 423-427
- Keiko Ochi, Masaki Kojima, Keiho Owada, Nobutaka Ono, Shigeki Sagayama, Hidenori Yamasue: 
 Pitch and Volume Stability in the Communicative Response of Adults with Autism. 428-432
- Soky Kak, Sheng Li, Masato Mimura, Chenhui Chu, Tatsuya Kawahara: 
 On the Use of Speaker Information for Automatic Speech Recognition in Speaker-imbalanced Corpora. 433-437
- Hao Shi, Longbiao Wang, Sheng Li, Cunhang Fan, Jianwu Dang, Tatsuya Kawahara: 
 Spectrograms Fusion-based End-to-end Robust Automatic Speech Recognition. 438-442
- Shengqiang Li, Menglong Xu, Xiao-Lei Zhang: 
 Conformer-based End-to-end Speech Recognition With Rotary Position Embedding. 443-447
- Shengqiang Li, Menglong Xu, Xiao-Lei Zhang: 
 Efficient conformer-based speech recognition with linear attention. 448-453
- Zhengkun Tian, Jiangyan Yi, Ye Bai, Jianhua Tao, Shuai Zhang, Zhengqi Wen: 
 One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition. 454-459
- Atsushi Kojima: 
 Large-Context Automatic Speech Recognition Based on RNN Transducer. 460-464
- Masato Mimura, Shinsuke Sakai, Tatsuya Kawahara: 
 An End-To-End Model from Speech to Clean Transcript for Parliamentary Meetings. 465-470
- Kento Fujiwara, Ryoichi Takashima, Chihiro Sugiyama, Nobukazu Tanaka, Kanji Nohara, Kazunori Nozaki, Tetsuya Takiguchi: 
 Data Augmentation Based on Frequency Warping for Recognition of Cleft Palate Speech. 471-476
- Huaibo Zhao, Yosuke Higuchi, Tetsuji Ogawa, Tetsunori Kobayashi: 
 An Investigation of Enhancing CTC Model for Triggered Attention-based Streaming ASR. 477-483
- Protima Nomo Sudro, Rohan Kumar Das, Rohit Sinha, S. R. Mahadeva Prasanna: 
 Significance of Data Augmentation for Improving Cleft Lip and Palate Speech Recognition. 484-490
- Madhu R. Kamble, Shekhar Nayak, M. Ali Basha Shaik, Shakti P. Rath, Vikram Vij, Hemant A. Patil: 
 Teager Energy Subband Filtered Features for Near and Far-Field Automatic Speech Recognition. 491-496
- Duo Ma, Nana Hou, Van Tung Pham, Haihua Xu, Eng Siong Chng: 
 Multitask-based joint learning approach to robust ASR for radio communication speech. 497-502
- Daiki Mori, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka: 
 Advanced language model fusion method for encoder-decoder model in Japanese speech recognition. 503-510
- Mirishkar Sai Ganesh, Vishnu Vidyadhara Raju Vegesna, Meher Dinesh Naroju, Sudhamay Maity, Prakash Yalla, Anil Kumar Vuppala: 
 CSTD-Telugu Corpus: Crowd-Sourced Approach for Large-Scale Speech data collection. 511-517
- Shi-Yan Weng, Hsuan-Sheng Chiu, Berlin Chen: 
 An Empirical Study on Transformer-Based End-to-End Speech Recognition with Novel Decoder Masking. 518-522
- Guochen Yu, Yutian Wang, Chengshi Zheng, Hui Wang, Qin Zhang: 
 CycleGAN-based Non-parallel Speech Enhancement with an Adaptive Attention-in-attention Mechanism. 523-529
- Weixin Meng, Chengshi Zheng, Xiaodong Li: 
 A Robust Maximum Likelihood Distortionless Response Beamformer based on a Complex Generalized Gaussian Distribution. 530-535
- Shih-Chuan Chu, Chung-Hsien Wu, Yun-Wen Lin: 
 Speech Enhancement Based on Masking Approach Considering Speech Quality and Acoustic Confidence for Noisy Speech Recognition. 536-540
- Xinyang Feng, Nuo Li, Zunwen He, Yan Zhang, Wancheng Zhang: 
 DNN-Based Linear Prediction Residual Enhancement for Speech Dereverberation. 541-545
- Zhaopeng Qian, Haijun Niu, Li Wang, Kazuhiro Kobayashi, Shaochuan Zhang, Tomoki Toda: 
 Mandarin Electro-Laryngeal Speech Enhancement based on Statistical Voice Conversion and Manual Tone Control. 546-552
- Lu Zhang, Mingjiang Wang, Andong Li, Zehua Zhang, Xuyi Zhuang: 
 Incorporating Multi-Target in Multi-Stage Speech Enhancement Model for Better Generalization. 553-558
- Fei Gao, Haixin Guan: 
 Low-Power Convolutional Recurrent Neural Network For Monaural Speech Enhancement. 559-563
- Quandong Wang, Junnan Wu, Zhao Yan, Sichong Qian, Liyong Guo, Lichun Fan, Weiji Zhuang, Peng Gao, Yujun Wang: 
 Multi-Channel Speech Enhancement with 2-D Convolutional Time-Frequency Domain Features and a Pre-Trained Acoustic Model. 564-570
- Protima Nomo Sudro, Rohit Sinha, S. R. Mahadeva Prasanna: 
 Processing Phoneme Specific Segments for Cleft Lip and Palate Speech Enhancement. 571-577
- Sota Misawa, Norihiro Takamune, Tomohiko Nakamura, Daichi Kitamura, Hiroshi Saruwatari, Masakazu Une, Shoji Makino: 
 Speech Enhancement by Noise Self-Supervised Rank-Constrained Spatial Covariance Matrix Estimation via Independent Deeply Learned Matrix Analysis. 578-584
- Yoshiki Masuyama, Kouei Yamaoka, Yuma Kinoshita, Nobutaka Ono: 
 Causal Distortionless Response Beamforming by Alternating Direction Method of Multipliers. 585-590
- Jinyoung Lee, Hong-Goo Kang: 
 Stacked U-Net with High-Level Feature Transfer for Parameter Efficient Speech Enhancement. 591-595
- Hanako Segawa, Li Li, Shoji Makino, Takeshi Yamada: 
 Extension of virtual microphone technique to multiple real microphones and investigation of the impact of phase and amplitude interpolation on speech enhancement. 597-602
- Kohei Saijo, Kazuhiro Katagiri, Masaru Fujieda, Tetsunori Kobayashi, Tetsuji Ogawa: 
 Comparative Study on DNN-based Minimum Variance Beamforming Robust to Small Movements of Sound Sources. 603-607
- Kazushi Nakazawa, Kazuhiro Kondo: 
 Improvements to Non-Intrusive Intelligibility Prediction for Reverberant Speech. 608-613
- Wenjing Yang, Jing Wang, Hongfeng Li, Na Xu, Fei Xiang, Kai Qian, Shenghua Hu: 
 A Target Speaker Separation Neural Network with Joint-Training. 614-618
- Qian-Bei Hong, Chung-Hsien Wu, Thanh Binh Nguyen, Hsin-Min Wang: 
 Improvement of Spatial Ambiguity in Multi-Channel Speech Separation Using Channel Attention. 619-623
- Kohei Ozamoto, Kuniaki Uto, Koji Iwano, Koichi Shinoda: 
 Noise-Tolerant Time-Domain Speech Separation with Noise Bases. 624-629
- Jianyu Wang, Shanzheng Guan, Xiao-Lei Zhang: 
 Minimum-volume regularized ILRMA for blind audio source separation. 630-634
- Wenbo Zhu, Mou Wang, Xiao-Lei Zhang, Susanto Rahardja: 
 A comparison of handcrafted, parameterized, and learnable features for speech separation. 635-639
- Masahito Togami, Robin Scheibler: 
 Over-Determined Semi-Blind Speech Source Separation. 640-645
- Juntao Yu, Ting Jiang, JiaCheng Yu: 
 Group Multi-Scale convolutional Network for Monaural Speech Enhancement in Time-domain. 646-650
- Yusaku Mizobuchi, Daichi Kitamura, Tomohiko Nakamura, Hiroshi Saruwatari, Yu Takahashi, Kazunobu Kondo: 
 Prior Distribution Design for Music Bleeding-Sound Reduction Based on Nonnegative Matrix Factorization. 651-658
- Yen-Ju Lu, Yu Tsao, Shinji Watanabe: 
 A Study on Speech Enhancement Based on Diffusion Probabilistic Model. 659-666
- Xin Fang, Zhen-Hua Ling, Lei Sun, Shutong Niu, Jun Du, Cong Liu, Zhi-Chao Sheng: 
 A Deep Analysis of Speech Separation Guided Diarization Under Realistic Conditions. 667-671
- Qijie Shao, Jingyong Hou, Yanxin Hu, Qing Wang, Lei Xie, Xin Lei: 
 Target Speaker Extraction for Customizable Query-by-Example Keyword Spotting. 672-678
- Chen Chen, Nana Hou, Duo Ma, Eng Siong Chng: 
 Time Domain Speech Enhancement With Attentive Multi-scale Approach. 679-683
- Adrien Llave, Simon Leglaive: 
 On Speech Sparsity for Computational Efficiency and Noise Reduction in Hearing Aids. 684-688
- Qingjian Lin, Lin Yang, Xuyang Wang, Luyuan Xie, Chen Jia, Junjie Wang: 
 Sparsely Overlapped Speech Training in the Time Domain: Joint Learning of Target Speech Separation and Personal VAD Benefits. 689-693
- Yuuki Tachioka: 
 Integration of Annotator-wise Estimations for Emotion Recognition by Using Group Softmax. 694-699
- Xingfeng Li, Taiyang Guo, Xinhui Hu, Xinkang Xu, Jianwu Dang, Masato Akagi: 
 Hierarchical Prosody Analysis Improves Categorical and Dimensional Emotion Recognition. 700-704
- Simon W. McKnight, Aidan O. T. Hogg, Vincent W. Neo, Patrick A. Naylor: 
 A Study of Salient Modulation Domain Features for Speaker Identification. 705-712
- Di Wang, Lantian Li, Hongzhi Yu, Dong Wang: 
 A Study on Decoupled Probabilistic Linear Discriminant Analysis. 713-718
- Yu-Huai Peng, Hung-Shin Lee, Pin-Tuan Huang, Hsin-Min Wang: 
 Generation of Speaker Representations Using Heterogeneous Training Batch Assembly. 719-724
- Ryotaro Nagase, Takahiro Fukumori, Yoichi Yamashita: 
 Speech Emotion Recognition with Fusion of Acoustic- and Linguistic-Feature-Based Decisions. 725-730
- Bagus Tris Atmaja, Akira Sasou, Masato Akagi: 
 Automatic Naturalness Recognition from Acted Speech Using Neural Networks. 731-736
- Purva Barche, Krishna Gurugubelli, Anil Kumar Vuppala: 
 Comparative Study of Filter Banks to Improve the Performance of Voice Disorder Assessment Systems using LTAS Features. 737-742
- Xiaoquan Ke, Man-Wai Mak, Jinchao Li, Helen M. Meng: 
 Dual Dropout Ranking of Linguistic Features for Alzheimer's Disease Recognition. 743-749
- Zhaohang Zhang, Xiaohui Zhang, Min Guo, Wei-Qiang Zhang, Ke Li, Yukai Huang: 
 A Multilingual Framework Based on Pre-training Model for Speech Emotion Recognition. 750-755
- Anubhav Anand, Shubham Negi, N. Narendra: 
 Filters Know How You Feel: Explaining Intermediate Speech Emotion Classification Representations. 756-761
- Utkarsh Mehrotra, Sparsh Garg, Krishna Gurugubelli, Anil Kumar Vuppala: 
 Detecting Multiple Disfluencies from Speech using Pre-linguistic Automatic Syllabification with Acoustic and Prosody Features. 761-768
- Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai: 
 Siamese Neural Network with Joint Bayesian Model Structure for Speaker Verification. 769-774
- Siddhant Gupta, Kuldeep Khoria, Ankur T. Patil, Hemant A. Patil: 
 Deep Convolutional Neural Network for Voice Liveness Detection. 775-779
- Haoran Sun, Lantian Li, Thomas Fang Zheng, Dong Wang: 
 How Speech is Recognized to Be Emotional - A Study Based on Information Decomposition. 780-785
- Xuan Luo, Shinnosuke Takamichi, Tomoki Koriyama, Yuki Saito, Hiroshi Saruwatari: 
 Emotion-Controllable Speech Synthesis Using Emotion Soft Labels and Fine-Grained Prosody Factors. 794-799
- Ruitong Xiao, Xiaofen Xing, Jichen Yang, Xiangmin Xu: 
 CA-VC: A Novel Zero-Shot Voice Conversion Method With Channel Attention. 800-807
- Kei Akuzawa, Kotaro Onishi, Keisuke Takiguchi, Kohki Mametani, Koichiro Mori: 
 Conditional Deep Hierarchical Variational Autoencoder for Voice Conversion. 808-813
- Chao Xie, Yi-Chiao Wu, Patrick Lumban Tobing, Wen-Chin Huang, Tomoki Toda: 
 Noisy-to-Noisy Voice Conversion Framework with Denoising Model. 814-820
- Ruiyan Chen, Tazuko Nishimura, Nobuaki Minematsu, Daisuke Saito: 
 Acoustic Simulation of Body-conducted Speech and Its Use to Convert One's Recorded Voices to One's Own Voices. 821-828
- Yi-Chieh Lin, Ji-Yan Han, Yu-Min Lin, Wei-Zhong Zheng, Shuenn-Tsong Young, Ying-Hui Lai: 
 Speech Reconstruction from The Larynx Vibration Feature Captured by Laser-Doppler Vibrometer Sensor. 829-835
- Asuka Moritani, Shoki Sakamoto, Ryo Ozaki, Hirokazu Kameoka, Tadahiro Taniguchi: 
 StarGAN-based Emotional Voice Conversion for Japanese Phrases. 836-840
- Peter Wu, Paul Pu Liang, Jiatong Shi, Ruslan Salakhutdinov, Shinji Watanabe, Louis-Philippe Morency: 
 Understanding the Tradeoffs in Client-side Privacy for Downstream Speech Tasks. 841-848
- Zolzaya Byambadorj, Ryota Nishimura, Altangerel Ayush, Kengo Ohta, Norihide Kitaoka: 
 Multi-speaker TTS system for low-resource language using cross-lingual transfer learning and data augmentation. 849-853
- Weirui Lu, Xiaofen Xing, Xiangmin Xu, Weibin Zhang: 
 Towards Unseen Speakers Zero-Shot Voice Conversion with Generative Adversarial Networks. 854-858
- Xingrui Wang, Bowen Zhang, Takahiro Shinozaki: 
 Low-Resource Mandarin Prosodic Structure Prediction Using Self-Training. 859-863
- Zeqing Zhao, Xi Chen, Hui Liu, Xuyang Wang, Lin Yang, Junjie Wang: 
 SPTTS: Parallel Speech Synthesis without Extra Aligner Model. 864-869
- Ding Ma, Wen-Chin Huang, Tomoki Toda: 
 Investigation of Text-to-Speech-based Synthetic Parallel Data for Sequence-to-Sequence Non-Parallel Voice Conversion. 870-877
- Jiyang Tang, Ming Li: 
 End-to-End Mandarin Tone Classification with Short Term Context Information. 878-883
- Shuai Yu  , Chenxing Li, Feng Deng, Xiaorui Wang: , Chenxing Li, Feng Deng, Xiaorui Wang:
 Rethinking Singing Voice Separation With Spectral- Temporal Transformer. 884-889
- Yuya Yamamoto, Juhan Nam, Hiroko Terasawa, Yuzuru Hiraga: 
 Investigating Time-Frequency Representations for Audio Feature Extraction in Singing Technique Classification. 890-896
- Hideki Kawahara, Toshie Matsui, Kohei Yatabe, Ken-Ichi Sakakibara, Minoru Tsuzaki, Masanori Morise, Toshio Irino: 
 Implementation of Interactive Tools for Investigating Fundamental Frequency Response of Voiced Sounds to Auditory Stimulation. 897-903
- Jinhu Li, Chitralekha Gupta, Haizhou Li: 
 Training Explainable Singing Quality Assessment Network with Augmented Data. 904-911
- Chitralekha Gupta, Jinhu Li, Haizhou Li: 
 Towards Reference-Independent Rhythm Assessment of Solo Singing. 912-919
- Yuya Hosoda, Arata Kawamura, Youji Iiguni: 
 Pitch Estimation Algorithm for Narrowband Speech Signal using Phase Differences between Harmonics. 920-925
- Juqiang Chen, Tianyi Ni, Benjawan Kasisopa, Mark Antoniou, Catherine T. Best: 
 SVM-based evaluation of Thai tone imitations by Thai-naïve Mandarin and Vietnamese speakers. 926-931
- Keiichi Funaki: 
 On an Improved F0 Estimation Based on ℓ2-Norm Regularized TV-CAR Speech Analysis. 932-938
- Tiantian Tang, Xinyuan Zhou, Yanhua Long, Yijie Li, Jiaen Liang: 
 CNN-based Discriminative Training for Domain Compensation in Acoustic Event Detection with Frame-wise Classifier. 939-944
- Miao Liu, Jing Wang, Yujun Wang, Lidong Yang: 
 Frequency Axis Pooling Method for Weakly Labeled Sound Event Detection and Classification. 945-949
- Shang Gao, Maoshen Jia, Changchun Bao: 
 A multi-source localization method based on clustering and outlier removal. 950-955
- Sakiko Mishima, Reishi Kondo: 
 Impulsive Timing Detection Based on Multi-Frame Phase Voting for Acoustic Event Detection. 956-960
- Hokuto Munakata, Ryu Takeda, Kazunori Komatani: 
 Multiple-Embedding Separation Networks: Sound Class-Specific Feature Extraction for Universal Sound Separation. 961-967
- Yuting Geng, Haonan Wang, Masato Nakayama, Takanobu Nishiura: 
 Narrow-edged Beamforming Using Masked Parametric Array Loudspeakers. 968-973
- Kenneth Ooi, Karn N. Watcharasupat, Santi Peksi, Furi Andi Karnapi, Zhen-Ting Ong, Danny Chua, Hui-Wen Leow, Li-Long Kwok, Xin-Lei Ng, Zhen-Ann Loh, Woon-Seng Gan: 
 A Strongly-Labelled Polyphonic Dataset of Urban Sounds with Spatiotemporal Context. 982-988
- Kenta Iwai, Yoshinobu Kajikawa, Takanobu Nishiura: 
 Formulation of Multidimensional Frequency Characteristics of Second-Order Nonlinear IIR Filter. 989-994
- Nguyen Binh Thien, Yukoh Wakabayashi, Kenta Iwai, Takanobu Nishiura: 
 Two-stage phase reconstruction using DNN and von Mises distribution-based maximum likelihood. 995-999
- Yuna Harada, Naoto Shimada, Haonan Wang, Kenta Iwai, Masato Nakayama, Takanobu Nishiura: 
 Sharp-sound-image Construction Method Using Multichannel Sound System with Optimal Parametric Loudspeaker Arrangement. 1000-1007
- Takuma Ekawa, Masato Nakayama, Toru Takahashi: 
 Virtual Sound Source Rendering Based on Distance Control to Penetrate Listeners Using Surround Parametric-array and Electrodynamic Loudspeakers. 1008-1015
- Guansan Lian, Yukoh Wakabayashi, Taishi Nakashima, Nobutaka Ono: 
 Self-rotation angle estimation of circular microphone array based on sound field interpolation. 1016-1020
- Tingzhi Mao, Yerbolat Khassanov, Van Tung Pham, Haihua Xu, Hao Huang, Aishan Wumaier, Eng Siong Chng: 
 Enriching Under-Represented Named Entities for Improved Speech Recognition. 1021-1025
- Andrew Liaw, Jia-Hao Hsu, Chung-Hsien Wu: 
 Ensemble of One Model: Creating Model Variations for Transformer with Layer Permutation. 1026-1030
- Binghuai Lin, Liyuan Wang: 
 Uncertainty estimation in automatic pronunciation assessment with pseudo samples based on deep kernel learning. 1031-1036
- Takumi Kurokawa, Atsuhiko Kai: 
 Retrieval-oriented E2E ASR Modeling for Improved Query-by-example Spoken Term Detection. 1037-1042
- Yizhou Peng, Jicheng Zhang, Haobo Zhang, Haihua Xu, Hao Huang, Sheng Li, Eng Siong Chng: 
 Multilingual Approach to Joint Speech and Accent Recognition with DNN-HMM Framework. 1043-1048
- Tien-Hong Lo, Yao-Ting Sung, Berlin Chen: 
 Improving End-To-End Modeling for Mispronunciation Detection with Effective Augmentation Mechanisms. 1049-1055
- Sixia Li, Jianwu Dang: 
 Zero-shot Domain Adaptation with Inference Relation Paths for Spoken Language Understanding. 1056-1061
- Tan Liu, Wu Guo: 
 End to End Spoken Language Understanding Using Partial Disentangled Slot Embedding. 1062-1066
- Kazuki Hatakeyama, Masahiro Nishino, Kazunori Kojima, Shi-wook Lee, Yoshiaki Itoh: 
 Multiple Deep Learning Models and Architectures with Different Numbers of States Used to Improve Retrieval Accuracy of Query-by-Example. 1067-1071
- Shenghua Hu, Jing Wang, Yujun Wang, Wenjing Yang: 
 Separable Temporal Convolution plus Temporally Pooled Attention for Lightweight High-Performance Keyword Spotting. 1072-1076
- Koharu Horii, Meiko Fukuda, Kengo Ohta, Ryota Nishimura, Atsunori Ogawa, Norihide Kitaoka: 
 End-to-End Spontaneous Speech Recognition Using Hesitation Labeling. 1077-1081
- Yu Iwamoto, Takahiro Shinozaki: 
 Unsupervised Spoken Term Discovery Using wav2vec 2.0. 1082-1086
- Jian Gong, Yameng Yu, William Bellamy, Feng Wang, Xiaoli Ji: 
 Effect of Perceptual Training with Noise on Chinese Learners' English Consonant Reception Thresholds. 1087-1091
- Tsubasa Maeda, Satoshi Tamura: 
 Multi-view Convolution for Lipreading. 1092-1096
- Binling Wang, Wenxuan Hu, Jing Li, Yiming Zhi, Zheng Li, Qingyang Hong, Lin Li, Dong Wang, Liming Song, Cheng Yang: 
 OLR 2021 Challenge: Datasets, Rules and Baselines. 1097-1103
- Shih-Hsuan Chiu, Tien-Hong Lo, Fu-An Chao, Berlin Chen: 
 Cross-utterance Reranking Models with BERT and Graph Convolutional Networks for Conversational Speech Recognition. 1104-1110
- Chengdong Liang, Junqi Chen, Shanzheng Guan, Xiao-Lei Zhang: 
 Attention-based multi-channel speaker verification with ad-hoc microphone arrays. 1111-1115
- Shanzheng Guan, Shupei Liu, Junqi Chen, Wenbo Zhu, Shengqiang Li, Xu Tan, Ziye Yang, Menglong Xu, Yijiang Chen, Chengdong Liang, Jianyu Wang, Xiao-Lei Zhang: 
 Libri-adhoc40: A dataset collected from synchronized ad-hoc microphone arrays. 1116-1120
- Jiao Han, Yunqi Cai, Lantian Li, Guanyu Li, Dong Wang: 
 An MAP Estimation for Between-Class Variance. 1121-1126
- Yuxin Zhang, Yatong Xiao, Wei-Qiang Zhang, Xu Tan, Ling Lei, Shengjin Wang: 
 Mixing or Extracting? Further Exploring Necessity of Music Separation for Singer Identification. 1127-1132
- Weicheng Cai, Ming Li: 
 A Unified Deep Speaker Embedding Framework for Mixed-Bandwidth Speech Data. 1133-1138
- Tatsuya Komatsu, Robin Scheibler: 
 Comparison of Low Complexity Self-Attention Mechanisms for Acoustic Event Detection. 1139-1143
- Jisheng Bai, Mou Wang, Jianfeng Chen: 
 Dual-Path Transformer For Machine Condition Monitoring. 1144-1148
- Thanh Thi Hien Duong, Phi-Le Nguyen, Hong-Son Nguyen, Duc-Chien Nguyen, Huy Phan, Ngoc Q. K. Duong: 
 Speaker count: A new building block for speaker diarization. 1149-1155
- Kayo Nada, Keisuke Imoto, Reina Iwamae, Takao Tsuchiya: 
 Multitask Learning of Acoustic Scenes and Events Using Dynamic Weight Adaptation Based on Multi-focal Loss. 1156-1160
- Yuki Shiroma, Keisuke Imoto, Sayaka Shiota, Nobutaka Ono, Hitoshi Kiya: 
 Investigation on Spatial and Frequency-Based Features for Asynchronous Acoustic Scene Analysis. 1161-1166
- Yuma Kinoshita, Nobutaka Ono: 
 Analysis on Roles of DNNs in End-to-End Acoustic Scene Analysis Framework with Distributed Sound-to-Light Conversion Devices. 1167-1172
- Kenta Iwai, Takanobu Nishiura: 
 A Study on Optimal Filter of Feedforward Active Noise Control System Based on Analysis of Frequency Response. 1173-1179
- Shulin Wen, Nguyen Duy Hai, Miqing Wang, Woon-Seng Gan: 
 Design and Evaluation of Active Noise Control on Machinery Noise. 1180-1186
- Satoshi Yamanouchi, Yoshinobu Kajikawa: 
 A Subband Active Noise Control System with Automatic Tap Assignment in Consideration of Psychoacoustic Properties. 1187-1191
- Mingzhe Li, Chuang Shi, Yue Wang: 
 A True Digital Feedforward Active Noise Control System with no Analog-to-Digital and Digital-to-Analog Converters. 1192-1196
- Chong-Rui Huang, Cheng-Yuan Chang, Sen M. Kuo: 
 Development of Active Hear-Through Equalization Algorithm for Earphones. 1197-1201
- Abigail Copiaco, Christian Ritz, Stefano Fasciani, Nidhal Abdulaziz: 
 Development of a Synthetic Database for Compact Neural Network Classification of Acoustic Scenes in Dementia Care Environments. 1202-1209
- Sotaro Nakaoka, Li Li, Shoji Makino, Takeshi Yamada: 
 Reducing algorithmic delay using low-overlap window for online Wave-U-Net. 1210-1214
- Chiho Haruta, Nobutaka Ono, Yuma Kinoshita: 
 Framewise Finite Impulse Response Filtering Based on Time-Frequency Mask for Low-Latency Speech Enhancement. 1215-1220
- Xueqin Luo, Jilu Jin, Gongping Huang, Jingdong Chen, Jacob Benesty, Israel Cohen, Wen Zhang: 
 Constrained Maximum Directivity Beamformers Based on Uniform Linear Acoustic Vector Sensor Arrays. 1221-1225
- Takuya Hasumi, Tomohiko Nakamura, Norihiro Takamune, Hiroshi Saruwatari, Daichi Kitamura, Yu Takahashi, Kazunobu Kondo: 
 Multichannel Audio Source Separation with Independent Deeply Learned Matrix Analysis Using Product of Source Models. 1226-1233
- Yi-Syuan Liou, Wen-Chin Huang, Ming-Chi Yen, Shu-Wei Tsai, Yu-Huai Peng, Tomoki Toda, Yu Tsao, Hsin-Min Wang: 
 Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion. 1234-1238
- Zicheng Feng, Yu Tsao, Fei Chen: 
 Estimation and Correction of Relative Transfer Function for Binaural Speech Separation Networks to Preserve Spatial Cues. 1239-1244
- You-Jin Li, Syu-Siang Wang, Yu Tsao, Borching Su: 
 MIMO Speech Compression and Enhancement Based on Convolutional Denoising Autoencoder. 1245-1250
- Lichin Chen, Ji-Tian Sheu, Yuh-Jue Chuang: 
 Predicting Patient's Choices of Hospital Levels Using Deep Learning and Representation Improvements. 1251-1257
- Yu-Chieh Lin, Chia-Tai Chan, Kuan-Chung Ting, Kai-Chun Liu, Chia-Yeh Hsieh: 
 Instrumented Romberg Test of Postural Stability in Patients with Vestibular Disorders using Inertial Measurement Units. 1258-1261
- Chih-En Kuo, Po-Yu Liao, Yu-Syuan Lin: 
 A Self-attention-based Ensemble Convolution Neural Network Approach for Sleep Stage Classification with Merged Spectrogram. 1262-1268
- Niamh McCallan, Scot Davidson, Kok Yew Ng, Pardis Biglarbeigi, Dewar D. Finlay, Boon Leong Lan, James McLaughlin: 
 Seizure Classification of EEG based on Wavelet Signal Denoising Using a Novel Channel Selection Algorithm. 1269-1276
- Mario Banuelos, Marissa Hernandez: 
 A Recommendation Systems Approach for Detecting Epistasis in Genomic Signals. 1277-1280
- Shefali Gupta, Tapan Kumar Gandhi, Pawan Sinha: 
 Understanding Structure Induced Functional Connectivity in Brain using EEG. 1281-1288
- Kota Yamamoto, Sou Nobukawa, Nobuhiko Wagatsuma, Keiichiro Inagaki: 
 Effect of Visual Attention and Driving Experiences on the Event-Related Potential P300 in the Perception of Traffic Scenes. 1289-1293
- Erika Sekiguchi, Ken Kubota, Shun Nakamura, Kenichi Makita, Toshihisa Tanaka: 
 Toward Estimation of Abnormal Brake in Autonomous Vehicles from Electroencephalogram and Heart Rate Interval. 1294-1298
- Sean Shensheng Xu, Man-Wai Mak, Ka Ho Wong, Helen Meng, Timothy C. Y. Kwok: 
 Speaker Turn Aware Similarity Scoring for Diarization of Speech-Based Cognitive Assessments. 1299-1304
- Chaoyan Wu, Lin Zhou, Xijin Chen, Liyuan Chen: 
 Microphone Array Speech Separation Algorithm based on DNN. 1305-1310
- Hongmei Hu, Stephan Dieter Ewert: 
 Exploring Artifact Rejection for High-pulse Rate Electrically Evoked Auditory Steady State Responses in Cochlear Implants Users. 1311-1316
- Yang Liu, Xiaoyong Lu, Daimin Shi, Jingyi Yuan: 
 Depression Severity Level Classification Using Multitask Learning of Gender Recognition. 1317-1322
- Xuyang Zhao, Jordi Solé-Casals, Qibin Zhao, Jianting Cao, Toshihisa Tanaka: 
 Multi-feature Fusion for Epileptic Focus Localization Based on Tensor Representation. 1323-1327
- Yibin Tang, Junping Jiang, Min Li, Ying Chen, Xiaojin Meng: 
 ADHD classification via auto-encoding network with non-imaging data fusion. 1328-1332
- Mengnan Liang, Aimin Jiang, Xiaofeng Liu, Hon Keung Kwan, Yanping Zhu: 
 Arrhythmia Classification Algorithm based on Sparse Autoencoder. 1333-1337
- Kazuki Hisatsune, Aoi Noguchi, Toshitaka Yamakawa: 
 Real-Time Monitoring System to Evaluate Exercise Load, Hypoxic Load, and Safety in a Normobaric Hypoxic room. 1338-1342
- Manami Wakuya, Takao Inoue, Hirochika Imoto, Sadahiro Nomura, Michiyasu Suzuki, Toshitaka Yamakawa: 
 Preoperative Monitoring Using Implantable, Multimodal, Multichannel Probe. 1343-1347
- Nao Inatsu, Aoi Noguchi, Koshi Ota, Koichi Fujiwara, Takatomi Kubo, Toshitaka Yamakawa: 
 Preliminary Study Using Autoencoder for Early Detection of Heat Illness from Heart Rate Variability Obtained with Wearable Device. 1348-1352
- Asahi Tsuruo, Monamie Ringhofer, Shinya Yamamoto, Kazushi Ikeda: 
 Mathematical Model of a Horse and the Rider during a Jump. 1353-1356
- Riza Rae Pineda, Takatomi Kubo, Masaki Shimada, Kazushi Ikeda: 
 Evaluation of the Effect of Transfer Learning to Multi-Instance Detection of Monkeys. 1357-1362
- Takuma Kuroki, Osamu Shouno, Junichiro Yoshimoto: 
 Semi-Supervised Estimation of Driving Behaviors Using Robust Time-Contrastive Learning. 1363-1366
- Keisuke Ozawa, Shinichi Sumiyoshi, Yuki Tachioka: 
 Snapshot Multispectral Image Completion and Unmixing with Total Variation Regularization on Abundance Maps. 1367-1374
- Yan Liu, Qingwu Li, Guanying Huo, Yan Zhou, Dabin Yu: 
 Underwater Image Dehazing Based on Disparity Estimation and Color Constraint. 1375-1380
- Isana Funahashi, Naoki Yamashita, Taichi Yoshida, Masaaki Ikehara: 
 High Reflection Removal Using CNN with Detection and Estimation. 1381-1385
- Tong Tang, Shun Hu, Linfeng Cui, Zhiyang Yin: 
 Intra Coding Tool Pruning for Reducing Complexity of VVC Screen Content Coding. 1386-1390
- Tien-Ying Kuo, Yu-Jen Wei, Jhih-Jhou Lin: 
 Image Compression Architecture with Built-in Lightweight Model. 1391-1394
- Shuhei Takehisa, Masahiro Okuda: 
 Denoising Hyperspectral Images Using Interband Correlation. 1395-1399
- Mizuki Takanashi, Yoshimitsu Kuroki: 
 A Consensus Framework for Convolutional Dictionary Learning based on L1 Norm Error. 1400-1404
- Shunki Anami, Ryo Matsuoka: 
 Noise Removal for Dynamic Mode Decomposition Based on Plug-and-Play ADMM. 1405-1409
- Lifei Zhong, Jiantao Zhou: 
 New End-to-end Network for Stereo High Dynamic Range Imaging. 1410-1415
- LieLin Pang, KokSheik Wong: 
 Moving Object Detection in HEVC Video. 1416-1421
- Chengyi Zou, Shuai Wan, Tiannan Ji, Marta Mrak, Marc Górriz Blanch, Luis Herranz: 
 Spatial Information Refinement for Chroma Intra Prediction in Video Coding. 1422-1427
- Suwoong Heo, Hyewon Song, Jiwoo Kang, Sanghoon Lee: 
 High-Quality Single Image 3D Facial Shape Reconstruction via Robust Albedo Estimation. 1428-1432
- Huirong Huang, Zhiyong Wu, Shiyin Kang, Dongyang Dai, Jia Jia, Tianxiao Fu, Deyi Tuo, Guangzhi Lei, Peng Liu, Dan Su, Dong Yu, Helen Meng: 
 Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams. 1433-1437
- Qifeng Zeng, Jun Du, Zirui Wang: 
 HMM-based Lip Reading with Stingy Residual 3D Convolution. 1438-1443
- Shun-Cheung Lai, Kin-Man Lam: 
 Deep Siamese network for low-resolution face recognition. 1444-1449
- Zhi-Song Liu, Wan-Chi Siu, H. Anthony Chan: 
 Learn to Sketch: A fast approach for universal photo sketch. 1450-1457
- Rabia Shafi, Wan Shuai, Hao Gong, Muhammad Usman Younus: 
 Head Movement Prediction using FCNN. 1458-1464
- Jeonghaeng Lee, Woojae Kim, Jinwoo Kim, Sanghoon Lee: 
 A Study on Virtual Reality Sickness and Visual Attention. 1465-1469
- Seongjean Kim, Jinwoo Kim, Sanghoon Lee: 
 Quality of Interaction Arising from Augmented Reality Content: A Comprehensive Study. 1470-1474
- Yijing Yang, Vasileios Magoulianitis, C.-C. Jay Kuo: 
 E-PixelHop: An Enhanced PixelHop Method for Object Classification. 1475-1482
- Yen-Yu Pu, Ching-Te Chiu, Shu-Yun Wu: 
 Real-Time Edge Attention-Based Learning for Low-Light One-Stage Object Detection. 1483-1487
- Jiwoo Kang, Hyunse Yoon, Seongmin Lee, Sanghoon Lee: 
 Checkerboard Corner Localization Accelerated with Deep False Detection for Multi-camera Calibration. 1488-1493
- Sin-Wun Syu, Po-Chyi Su: 
 Strategies of Traditional Chinese Character Recognition in Streetscape Based on Deep Learning Networks. 1494-1498
- Izbaila Imtiaz, Imran Ahmed, Gwanggil Jeon, Shogo Muramatsu: 
 An Efficient Image Processing and Machine Learning based Technique for Skin Lesion Segmentation and Classification. 1499-1505
- Yan Zhang, Nan Yang, Yong Fang: 
 Distributed Arithmetic Coding for Sources with Hidden Markov Correlation. 1506-1510
- Jiayi Qin, Zheng He, Binyu Yan, Gwanggil Jeon, Xiaomin Yang: 
 Multi-Residual Feature Fusion Network for lightweight Single Image Super-Resolution. 1511-1518
- Michael Abebe Berwo, Yong Fang, Jabar Mahmood, Ephrem Afele Retta: 
 Automotive Engine Cylinder Head Crack Detection: Canny Edge Detection With Morphological Dilation. 1519-1527
- Gai Yamamoto, Yuya Kodama, Shogo Muramatsu, Samuel Choi, Gwanggil Jeon: 
 Acceleration of PDS-Based High-Dimensional Signal Restoration. 1528-1535
- Fuga Nakamura, Ryosuke Harakawa, Masahiro Iwahashi: 
 Product Quantization to Reduce Entropy of Labels for Fast and Accurate Image Retrieval. 1536-1540
- Jun Wu, Tianliang Zhu, Chengtian Yu, Chunzhi Wang, Xianjing Zhou, Hu Liu: 
 Deep Learning Analysis Models for Speech and Emotional Recognition. 1541-1545
- Xuyang Zhao, Shogo Takata, Kosuke Fukumori, Toshihisa Tanaka: 
 Infant Posture Assessment Based on Rotational Keypoint Detection. 1546-1550
- Lin Li, Kaixi Hu: 
 Text Description Generation from Videos via Deep Semantic Models. 1551-1555
- Noboru Yoshida, Jianquan Liu: 
 View-invariant Feature using Pose Information and Flexible Matching Algorithm for Action Retrieval. 1556-1562
- Feyisayo Olalere, Vincent Brouwers, Metehan Doyran, Ronald Poppe, Albert Ali Salah: 
 Video-Based Sports Activity Recognition for Children. 1563-1570
- Teruaki Akazawa, Yuma Kinoshita, Hitoshi Kiya: 
 Spatially varying white balancing for mixed and non-uniform illuminants. 1571-1575
- Dipanita Chakraborty, Werapon Chiracharit, Kosin Chamnongthai: 
 Semantically Relevant Scene Detection Using Deep Learning. 1576-1579
- Jing-Ming Guo, Sankarasrinivasan S: 
 Digital Halftone Classification using Simplified CNN and Stochastic Statistics. 1580-1584
- Lingfeng Fang, Chunhao Li, Songlin Sun: 
 Implementation of AVS3 Multicast System Based on eMBMS. 1585-1589
- Rong Zhang, Pao-Chi Chang: 
 Robustness against adversary models on MNIST by Deep-Q Reinforcement Learning based Parallel-GANs. 1590-1597
- Jung Kyung Lee, Na-Young Kim, Je-Won Kang: 
 Rate-Distortion Optimized Temporal Segmentation Using Reinforcement Learning for Video Coding. 1598-1601
- Farchan Hakim Raswa, Agus Harjoko, Chrisantonius, Jia-Ching Wang: 
 A Fusion Methodology of AKAZE and Neural Network for Fingerprint Recognition. 1602-1606
- Byeong-Ju Han, Jae-Won Yang, Oggyu Lee, Jae-Young Sim: 
 Context-based Matching Refinement for Person Search. 1607-1610
- Chrisantonius, Tri Kuntoro Priyambodo, Farchan Hakim Raswa, Jia-Ching Wang: 
 Partial Fingerprint on Combined Evaluation using Deep Learning and Feature Descriptor. 1611-1614
- Yeseung Park, Kyoungoh Lee, Sanghoon Lee: 
 Environment Adaptive 3D Pose Estimation Model and Learning Strategy. 1615-1620
- Shengbei Wang, Weitao Yuan, Zhen Zhang, Jianming Wang, Masashi Unoki: 
 Tampering Detection for Speech Signals Using Synchronization Code and LSF-based Watermarks. 1621-1626
- Candy Olivia Mawalim, Masashi Unoki: 
 Improving Security in McAdams Coefficient-Based Speaker Anonymization by Watermarking Method. 1627-1633
- Kasorn Galajit, Jessada Karnjana, Pakinee Aimmanee, Masashi Unoki: 
 Hybridization of speech information hiding and encryption for double-layer security in speech communication. 1634-1639
- Akane Yokota, Masaki Kawamura: 
 BSS-Based Extraction For Additive Video Watermarking. 1640-1646
- Rinka Kawano, Masaki Kawamura: 
 Detection of Periodic Pilot Signal in Image Watermarking. 1647-1652
- Tetsuya Kojima, Naoyuki Muraoka, Raito Matsuzaki: 
 An Acoustic Communication Technique Based on Audio Data Hiding Utilizing Artificial Flowing Water Sounds. 1653-1657
- Hao-Wen Chia, Jian-Jiun Ding: 
 Semi-Supervised Learning for Facial Landmarks with Confidence and Augmentation Sifting Mechanisms. 1658-1661
- Hsuan-Wei Hsu, Jian-Jiun Ding: 
 Deepfake Algorithm Using Multiple Noise Modalities with Two-Branch Prediction Network. 1662-1669
- Jing-Ming Guo, S. Sankarasrinivasan: 
 Digital Multitone Image Reconstruction using Deep Generative Adversarial Nets. 1670-1673
- Hung-Tse Chan, Ting-Yu Lin, Shih-Chun Deng, Chih-Hsien Hsia, Chin-Feng Lai: 
 Smart Facial Skincare Products Using Computer Vision Technologies. 1674-1677
- Sin-Ye Jhong, Po-Yen Yang, Chih-Hsien Hsia: 
 An Attention based Expert Inspection System for Smart Scalp. 1678-1681
- Minje Park, Ju-Han Lee, Sang-Ho Lee, Jong-Ok Kim: 
 Multi-Band NIR Colorization Using Structure-Aware Network. 1682-1686
- Ruiki Kobayashi, Shogo Muramatsu, Shunsuke Ono: 
 Proximal Gradient-Based Loop Unrolling with Interscale Thresholding. 1687-1692
- Sung-Jun Min, Suk-Ju Kang: 
 Edge Map-guided Scale-iterative Image Deblurring. 1693-1697
- Sung-Min Woo, Jeong-Won Ha, Jong-Ok Kim: 
 Super-Resolution Imaging Using a Focus Pixel Sensor. 1698-1702
- Daichi Nishikawa, Ryosuke Harakawa, Masahiro Iwahashi: 
 Multi-View Variational Autoencoder for Robust Classification against Irrelevant Data. 1703-1707
- Jiabin Yan, Changsheng Chen: 
 Cross-Domain Recaptured Document Detection with Texture and Reflectance Characteristics. 1708-1715
- Kun Yu, Rongsong Yang, Hui Zeng, Anjie Peng: 
 Joint estimation of image rotation angle and scaling factor. 1716-1721
- Yangguang Wang, Jinwei Li, Yuanzhi Yao, Nenghai Yu: 
 Undetectable JPEG Image Batch Reversible Data Hiding with Content-adaptive Payload Allocation. 1722-1728
- Takahiro Aoki: 
 Workload Based Model of Large Scale 1: N Biometrics Multi-Step Narrowing Down Process. 1729-1735
- Soichi Hama: 
 Evaluation on palm vein recognition of children in growing. 1736-1740
- Atsuya Hirayama, Kazunori Hayashi: 
 An Overloaded MU-MIMO Signal Detection Method Using Piecewise Continuous Nonconvex Sparse Regularizer. 1741-1747
- Hiroki Honda, Kazunori Hayashi, Gurusanthosh Pabbisetty, Hiroki Mori: 
 Received Signal Power based Sensor Zone Estimation with Maximum Likelihood Approach. 1748-1755
- Mahyar Nemati, Jihong Park, Moongu Jeon, Jinho Choi: 
 Anomaly Detection for Wireless Communication Links via Data Integrity Modeling. 1756-1761
- Koichi Ito, Hiroya Kawai, Takafumi Aoki: 
 A Comprehensive Study of Face Recognition Using Deep Learning. 1762-1768
- Yuka Watanabe, Yasushi Yamazaki: 
 Continuous biometric authentication for smartphones considering usage environments. 1769-1774
- Vo Ngoc Khoi Nguyen, Takamichi Terada, Masakatsu Nishigaki, Tetsushi Ohki: 
 Examining of Shallow Autoencoder on Black-box Attack against Face Recognition. 1775-1780
- Koki Kato, Hironobu Takano, Masahiro Saiko, Masahiro Kubo, Hitoshi Imaoka: 
 Comparative Study of Feature Extraction Method for Emotional Classification by Micro-expressions. 1781-1785
- Amna Qureshi, David Megías, Minoru Kuribayashi: 
 Detecting Deepfake Videos using Digital Watermarking. 1786-1793
- Ryota Motomura, Shoko Imaizumi, Hitoshi Kiya: 
 A Flexible Reversible Data Hiding Method in Compressible Encrypted Images. 1794-1799
- Shunsuke Yoshimura, Kazuaki Nakamura, Naoko Nitta, Noboru Babaguchi: 
 Model Inversion Attack against a Face Recognition System in a Black-Box Setting. 1800-1807
- Daichi Takeshita, Minoru Kuribayashi, Nobuo Funabiki: 
 Feature Extraction Suitable for Double JPEG Compression Analysis Based on Statistical Bias Observation of DCT Coefficients. 1808-1814
- Yuma Yamasaki, Minoru Kuribayashi, Nobuo Funabiki, Huy H. Nguyen, Isao Echizen: 
 Feature Extraction Based on Denoising Auto Encoder for Classification of Adversarial Examples. 1815-1820
- Minagi Ueda, Shoko Imaizumi, KokSheik Wong: 
 An Extended Reversible Data Hiding Method for HDR Images Using Edge Estimation. 1821-1827
- Ahmed Khan, KokSheik Wong: 
 Image Watermarking based on Non-Newtonian Effect and Interpolated SWT-DWT. 1828-1832
- Hiroki Ito, MaungMaung AprilPyone, Hitoshi Kiya: 
 Access Control Using Spatially Invariant Permutation of Feature Maps for Semantic Segmentation Models. 1833-1838
- Qihua Feng, Peiya Li, ZhiXun Lu, Guan Liu, Feiran Huang: 
 End-to-end Learning for Encrypted Image Retrieval. 1839-1845
- Kenta Iida, Hitoshi Kiya: 
 A Privacy-Preserving Image Retrieval Scheme Using A Codebook Generated from Independent Plain-Image Dataset. 1846-1850
- MaungMaung AprilPyone, Hitoshi Kiya: 
 A Protection Method of Trained CNN Model Using Feature Maps Transformed With Secret Key From Unauthorized Access. 1851-1857
- Zhenhua Qu, Ziqiang He, Xiangui Kang: 
 Deriving a Compact Analytical Model for Camera Response Functions with Application to Chartless Radiometric Calibration. 1858-1864
- Koki Nakai, Minoru Kuribayashi, Nobuo Funabiki: 
 A Study of Privacy Protection of Photos Taken by a Wide-angle Surveillance Camera. 1865-1871
- Yik Siang Pang, Yiqi Tew: 
 A Pilot Exploration of Industrial Video Scene Data Embedding using Real-Time MV-HEVC. 1872-1876
- Koi Yee Ng, Simying Ong, Yuen Peng Loh, Chee Seng Chan: 
 Relabel, Scramble, Synthesize: A Novel Coverless Steganography Approach via Collage Image. 1877-1882
- Ya-Ju Yu, Ching-Chih Chuang, Yu-Wei Cheng: 
 Deep Reinforcement Learning for NPDCCH Period Adjustment in NB-IoT Networks. 1883-1888
- Ting-Yu Yeh, Wei-Chen Pao, Wei-Hung Chou, Chun-Chia Tsai, Jen-Yi Pan: 
 A Threshold-based Scheduling and Power Control Design on IMT-2020 Evaluation. 1889-1894
- Takeru Misugi, Kouji Hirata, Takuji Tachibana: 
 Implementation of a fast failure recovery method considering load distribution for network slicing. 1895-1898
- Gen Tabei, Yusuke Ito, Tomotaka Kimura, Kouji Hirata: 
 Multi-Armed Bandit-based Routing Method for In-network Caching. 1899-1902
- Lionel F. Gonzalez Casanova, Po-Chiang Lin: 
 Generalized Classification of DNS over HTTPS Traffic with Deep Learning. 1903-1907
- Hideyoshi Miura, Tomotaka Kimura, Kouji Hirata: 
 Inhibition modeling of future malware diffusion with an evolutionary game theory. 1908-1911
- Wei-Hung Chou, Wei-Chen Pao, Chun-Chia Tsai, Ting-Yu Yeh, Jen-Yi Pan: 
 An Adaptive Rank Selection Method in 3GPP 5G NR Systems. 1912-1916
- Chun-Chia Tsai, Ting-Yu Yeh, Wei-Hung Chou, Wei-Chen Pao, Jen-Yi Pan: 
 A Low Complexity PMI Selection Scheme for 3GPP 5G NR FDD Systems. 1917-1922
- Kuan-Lin Lee, Chung-Nan Lee, Ming-Feng Lee: 
 Realizing 5G Network Slicing Provisioning with Open Source Software. 1923-1930
- Yao-Chiang Kan, Kuan-Tzu Chen, Hsueh-Chun Lin, Junghsi Lee: 
 A Parking Monitoring System Using FMCW Radars. 1931-1934
- Wen-Ping Lai, Ming-Jay Lai, Hong-Lun Lai: 
 A Semi-Empirical Data-Rate Estimation Method of 5G RAN Slicing. 1935-1941
- Manh Hung Nguyen, Yu-Kuen Lai, Kai-Po Chang: 
 An Entropy-based DDoS attack Detection and Classification with Hierarchical Temporal Memory. 1942-1948
- Koki Kitazumi, Ryoma Yasutani, Shusuke Narieda, Hiroshi Naruse: 
 Measurement of CO2 in Outdoor Environments Using LPWAN Based WSN and Its Time Correlation Characteristics. 1949-1952
- Tsukasa Chida, Suguru Kameda, Noriharu Suematsu: 
 Fundamental Investigation of Backoff Control Method for Fair Communication Opportunity of mmW WBAN in Overcrowded Environment. 1953-1957
- Mai Ohta, Takeo Fujii: 
 Intra-System Interference Avoidance for Packet-Level Index Modulation in Internet of Things. 1958-1962
- Naotaka Hirayama, Takuya Kobayashi, Koichi Adachi: 
 Offloading Selection with Unequal Timeslot in Mobile Edge Computing. 1963-1968
- Osamu Takyu, Ryota Sugimoto: 
 Highly Efficient Data Gathering with Tendency Prediction based on Position Information of Event in Wireless Sensor Networks. 1969-1974
- Fu-Rong Yang, Yin-Ping Cho, Yi-Hsuan Yang, Da-Yi Wu, Shan-Hung Wu, Yi-Wen Liu: 
 Mandarin Singing Voice Synthesis with a Phonology-based Duration Model. 1975-1981
- Jia-Hao Hsu, Chung-Hsien Wu, Tsung-Hsien Yang: 
 Task-Aware BERT-based Sentiment Analysis from Multiple Essences of the Text. 1982-1986
- S. R. Parvathy, Deepak P. Jayan, Nimmy Pathrose, K. R. Rajesh: 
 Convolutional Autoencoder based Deep Learning Model for Identification of Red Palm Weevil Signals. 1987-1992
- Nakamasa Inoue, Tsubasa Maruyama, Keita Goto: 
 Augmentation-Agnostic Regularization for Unsupervised Contrastive Learning with Its Application to Speaker Verification. 1993-1998
- Peter U. Eze, Udaya Parampalli: 
 Deep Learning Evaluation of a Steganographic Algorithm. 1999-2005
- Wan-Ting Tseng, Chin-Ying Wu, Yung-Chang Hsu, Berlin Chen: 
 FAQ Retrieval using Question-Aware Graph Convolutional Network and Contextualized Language Model. 2006-2012
- Yu-Chen Chou, Yen-Po Lin, Yang-Ming Yeh, Yi-Chang Lu: 
 3D-GFE: a Three-Dimensional Geometric-Feature Extractor for Point Cloud Data. 2013-2017
- Yen-Po Lin, Yang-Ming Yeh, Yu-Chen Chou, Yi-Chang Lu: 
 Attention EdgeConv For 3D Point Cloud Classification. 2018-2022
- Kaito Echizenya, Kazuhiro Kondo: 
 The Effect of Density and Placement of BLE Beacons on Indoor Location and Motion Direction Estimation Accuracy. 2023-2027
- Jen-Tzung Chien, Shu-Hsiang Yang: 
 Model-Based Soft Actor-Critic. 2028-2035
- Jen-Tzung Chien, Sixun Luo: 
 Self-Supervised Learning for Online Speaker Diarization. 2036-2042
- Jen-Tzung Chien, Yu-Min Huang: 
 Multi-Resolution Convolutional Recurrent Networks. 2043-2048
- Geonsu Lee, Hochang Rhee, Jae Hoon Shim, Hyung Il Koo, Nam Ik Cho: 
 Network Intrusion Detection with Improved Feature Representation. 2049-2054
- Ching-Tung Tang, Ching-Te Chiu, Wei-Jyun Chen: 
 3D Landmark-based Face Detection and Recognition System for Large Poses. 2055-2059
- Zeyuan Wang, Zhiyu Wei, Lihui Zhang, Ruifan Li, Zhanyu Ma: 
 Entailment Method Based on Template Selection for Chinese Text Few-shot Learning. 2060-2065
- Yazhou Li, Yihui Shi, Yun Liu, Ruifan Li, Zhanyu Ma: 
 Image Captioning Based on An Improved Transformer with IoU Position Encoding. 2066-2071
- Vinay Chakravarthi Gogineni, Valeriya Naumova, Stefan Werner, Yih-Fang Huang: 
 Graph Kernel Recursive Least-Squares Algorithms. 2072-2076
- Masa-aki Takizawa, Masahiro Yukawa: 
 A Hilbertian Projection Approach with Dictionary Dividing Strategy: Accelerating Nonlinear Estimation Algorithm with Multiscale Gaussians. 2077-2084
- Anthony Kuh, Shuai Huang, Cynthia Chen: 
 Personalized Learning using Multiple Kernel Models. 2085-2088
- Anthony Kuh: 
 Real Time Kernel Learning for Sensor Networks using Principles of Federated Learning. 2089-2093

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.


 Google
Google Google Scholar
Google Scholar Semantic Scholar
Semantic Scholar Internet Archive Scholar
Internet Archive Scholar CiteSeerX
CiteSeerX ORCID
ORCID














