Jia Jia 0001
Person information
- affiliation: Tsinghua University, Graduate School at Shenzhen, Tsinghua-CUHK Joint Research Center for Media Sciences, Technologies and Systems, China
- affiliation (PhD 2008): Tsinghua University, Department of Computer Science and Technology, TNList, Beijing, China
Other persons with the same name
- Jia Jia — disambiguation page
- Jia Jia 0002 — Shanghai Academy of Science and Technology, Shanghai Center for Bioinformation Technology, China
- Jia Jia 0003 — Shandong University of Science and Technology, College of Electrical Engineering and Automation, Qingdao, China
- Jia Jia 0004 — National University of Defense Technology, Changsha, China
- Jia Jia 0005 — Peking University, Beijing, China
- Jia Jia 0006 — Alibaba Group, Hangzhou, Shanghai, China
2020 – today
- 2024
- [c144]Xiaohan Li, Qixin Wang, Zishan Wang, Zeyu Jin, Jia Jia:
SoulSkipper: A Voice-Controlled Emotional Adaptive Game to Complement Therapy for Social Anxiety Disorder. CHI Extended Abstracts 2024: 298:1-298:7 - [c143]Zixuan Wang, Jia Jia, Shikun Sun, Haozhe Wu, Rong Han, Zhenyu Li, Di Tang, Jiaqing Zhou, Jiebo Luo:
DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance. CVPR 2024: 7892-7901 - [c142]Shikun Sun, Longhui Wei, Zhicai Wang, Zixuan Wang, Junliang Xing, Jia Jia, Qi Tian:
Inner Classifier-Free Guidance and Its Taylor Expansion for Diffusion Models. ICLR 2024 - [c141]Yixuan Zhou, Xiaoyu Qin, Zeyu Jin, Shuoyi Zhou, Shun Lei, Songtao Zhou, Zhiyong Wu, Jia Jia:
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling. ACM Multimedia 2024: 554-563 - [c140]Zeyu Jin, Jia Jia, Qixin Wang, Kehan Li, Shuoyi Zhou, Songtao Zhou, Xiaoyu Qin, Zhiyong Wu:
SpeechCraft: A Fine-Grained Expressive Speech Dataset with Natural Language Description. ACM Multimedia 2024: 1255-1264 - [c139]Xingqi Wang, Xiaoyuan Yi, Xing Xie, Jia Jia:
Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization. ACM Multimedia 2024: 3558-3567 - [c138]Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan Zhang, Pengfei Wan, Di Zhang, Jia Jia:
PlacidDreamer: Advancing Harmony in Text-to-3D Generation. ACM Multimedia 2024: 6880-6889 - [c137]Zixuan Wang, Jiayi Li, Xiaoyu Qin, Shikun Sun, Songtao Zhou, Jia Jia, Jiebo Luo:
DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis. ACM Multimedia 2024: 10200-10209 - [i31]Zixuan Wang, Jia Jia, Shikun Sun, Haozhe Wu, Rong Han, Zhenyu Li, Di Tang, Jiaqing Zhou, Jiebo Luo:
DanceCamera3D: 3D Camera Movement Synthesis with Music and Dance. CoRR abs/2403.13667 (2024) - [i30]Shuo Huang, Shikun Sun, Zixuan Wang, Xiaoyu Qin, Yanmin Xiong, Yuan Zhang, Pengfei Wan, Di Zhang, Jia Jia:
PlacidDreamer: Advancing Harmony in Text-to-3D Generation. CoRR abs/2407.13976 (2024) - [i29]Zeyu Jin, Jia Jia, Qixin Wang, Kehan Li, Shuoyi Zhou, Songtao Zhou, Xiaoyu Qin, Zhiyong Wu:
SpeechCraft: A Fine-grained Expressive Speech Dataset with Natural Language Description. CoRR abs/2408.13608 (2024) - [i28]Yixuan Zhou, Xiaoyu Qin, Zeyu Jin, Shuoyi Zhou, Shun Lei, Songtao Zhou, Zhiyong Wu, Jia Jia:
VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling. CoRR abs/2408.15676 (2024) - [i27]Zixuan Wang, Jiayi Li, Xiaoyu Qin, Shikun Sun, Songtao Zhou, Jia Jia, Jiebo Luo:
DanceCamAnimator: Keyframe-Based Controllable 3D Dance Camera Synthesis. CoRR abs/2409.14925 (2024) - [i26]Houlun Chen, Xin Wang, Hong Chen, Zeyang Zhang, Wei Feng, Bin Huang, Jia Jia, Wenwu Zhu:
VERIFIED: A Video Corpus Moment Retrieval Benchmark for Fine-Grained Video Understanding. CoRR abs/2410.08593 (2024) - [i25]Xingqi Wang, Xiaoyuan Yi, Xing Xie, Jia Jia:
Embedding an Ethical Mind: Aligning Text-to-Image Synthesis via Lightweight Value Optimization. CoRR abs/2410.12700 (2024)
- 2023
- [c136]Zhihan Yang, Zhiyong Wu, Ying Shan, Jia Jia:
What Does Your Face Sound Like? 3D Face Shape towards Voice. AAAI 2023: 13905-13913 - [c135]Jinghe Cai, Xiaohan Li, Bohan Chen, Zhigang Wang, Jia Jia:
CatHill: Emotion-Based Interactive Storytelling Game as a Digital Mental Health Intervention. CHI Extended Abstracts 2023: 64:1-64:7 - [c134]Shuo Huang, Jia Jia, Zongxin Yang, Wei Wang, Haozhe Wu, Yi Yang, Junliang Xing:
Shuffled Autoregression for Motion Interpolation. ICASSP 2023: 1-5 - [c133]Shikun Sun, Jia Jia, Haozhe Wu, Zijie Ye, Junliang Xing:
MSNet: A Deep Architecture Using Multi-Sentiment Semantics for Sentiment-Aware Image Style Transfer. ICASSP 2023: 1-5 - [c132]Zijie Ye, Jia Jia, Haozhe Wu, Shuo Huang, Shikun Sun, Junliang Xing:
Salient Co-Speech Gesture Synthesizing with Discrete Motion Representation. ICASSP 2023: 1-5 - [c131]Shikun Sun, Longhui Wei, Junliang Xing, Jia Jia, Qi Tian:
SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation. ICML 2023: 33115-33134 - [c130]Zhihan Yang, Shansong Liu, Xu Li, Haozhe Wu, Zhiyong Wu, Ying Shan, Jia Jia:
Prosody Modeling with 3D Visual Information for Expressive Video Dubbing. INTERSPEECH 2023: 4863-4867 - [c129]Houlun Chen, Xin Wang, Xiaohan Lan, Hong Chen, Xuguang Duan, Jia Jia, Wenwu Zhu:
Curriculum-Listener: Consistency- and Complementarity-Aware Audio-Enhanced Temporal Sentence Grounding. ACM Multimedia 2023: 3117-3128 - [c128]Shuo Huang, Zongxin Yang, Liangting Li, Yi Yang, Jia Jia:
AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion. ACM Multimedia 2023: 5734-5745 - [c127]Haozhe Wu, Songtao Zhou, Jia Jia, Junliang Xing, Qi Wen, Xiang Wen:
Speech-Driven 3D Face Animation with Composite and Regional Facial Movements. ACM Multimedia 2023: 6822-6830 - [c126]Haoyu Wang, Haozhe Wu, Junliang Xing, Jia Jia:
Versatile Face Animator: Driving Arbitrary 3D Facial Avatar in RGBD Space. ACM Multimedia 2023: 7776-7784 - [c125]Zijie Ye, Jia Jia, Junliang Xing:
Semantics2Hands: Transferring Hand Motion Semantics between Avatars. ACM Multimedia 2023: 9282-9290 - [c124]Zeyu Jin, Zixuan Wang, Qixin Wang, Jia Jia, Ye Bai, Yi Zhao, Hao Li, Xiaorui Wang:
HoloSinger: Semantics and Music Driven Motion Generation with Octahedral Holographic Projection. ACM Multimedia 2023: 9393-9395 - [i24]Haozhe Wu, Jia Jia, Junliang Xing, Hongwei Xu, Xiangyuan Wang, Jelo Wang:
MMFace4D: A Large-Scale Multi-Modal 4D Face Dataset for Audio-Driven 3D Face Animation. CoRR abs/2303.09797 (2023) - [i23]Shuo Huang, Jia Jia, Zongxin Yang, Wei Wang, Haozhe Wu, Yi Yang, Junliang Xing:
Shuffled Autoregression For Motion Interpolation. CoRR abs/2306.06367 (2023) - [i22]Shuo Huang, Zongxin Yang, Liangting Li, Yi Yang, Jia Jia:
AvatarFusion: Zero-shot Generation of Clothing-Decoupled 3D Avatars Using 2D Diffusion. CoRR abs/2307.06526 (2023) - [i21]Shikun Sun, Longhui Wei, Junliang Xing, Jia Jia, Qi Tian:
SDDM: Score-Decomposed Diffusion Models on Manifolds for Unpaired Image-to-Image Translation. CoRR abs/2308.02154 (2023) - [i20]Haozhe Wu, Songtao Zhou, Jia Jia, Junliang Xing, Qi Wen, Xiang Wen:
Speech-Driven 3D Face Animation with Composite and Regional Facial Movements. CoRR abs/2308.05428 (2023) - [i19]Zijie Ye, Jia Jia, Junliang Xing:
Semantics2Hands: Transferring Hand Motion Semantics between Avatars. CoRR abs/2308.05920 (2023) - [i18]Haoyu Wang, Haozhe Wu, Junliang Xing, Jia Jia:
Versatile Face Animator: Driving Arbitrary 3D Facial Avatar in RGBD Space. CoRR abs/2308.06076 (2023) - [i17]Xianhao Wei, Jia Jia, Xiang Li, Zhiyong Wu, Ziyi Wang:
A Discourse-level Multi-scale Prosodic Model for Fine-grained Emotion Analysis. CoRR abs/2309.11849 (2023) - [i16]Houlun Chen, Xin Wang, Hong Chen, Zihan Song, Jia Jia, Wenwu Zhu:
Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos. CoRR abs/2312.17117 (2023)
- 2022
- [j20]Zijie Ye, Haozhe Wu, Jia Jia:
Human motion modeling with deep learning: A survey. AI Open 3: 35-39 (2022) - [c123]Yulan Chen, Zhiyong Wu, Zheyan Shen, Jia Jia:
Learning from Designers: Fashion Compatibility Analysis Via Dataset Distillation. ICIP 2022: 856-860 - [c122]Zhihan Yang, Zhiyong Wu, Jia Jia:
Speaker Characteristics Guided Speech Synthesis. IJCNN 2022: 1-8 - [c121]Xiang Li, Changhe Song, Xianhao Wei, Zhiyong Wu, Jia Jia, Helen Meng:
Towards Cross-speaker Reading Style Transfer on Audiobook Dataset. INTERSPEECH 2022: 5528-5532 - [c120]Zixuan Wang, Jia Jia, Haozhe Wu, Junliang Xing, Jinghe Cai, Fanbo Meng, Guowen Chen, Yanfeng Wang:
GroupDancer: Music to Multi-People Dance Synthesis with Style Collaboration. ACM Multimedia 2022: 1138-1146 - [c119]Jingbei Li, Yi Meng, Xixin Wu, Zhiyong Wu, Jia Jia, Helen Meng, Qiao Tian, Yuping Wang, Yuxuan Wang:
Inferring Speaking Styles from Multi-modal Conversational Context by Multi-scale Relational Graph Convolutional Networks. ACM Multimedia 2022: 5811-5820 - [c118]Ziyi Wang, Xingqi Wang, Zeyu Jin, Xiaohan Li, Shikun Sun, Jia Jia:
AI Carpet: Automatic Generation of Aesthetic Carpet Pattern. ACM Multimedia 2022: 6958-6960 - [i15]Xiang Li, Changhe Song, Xianhao Wei, Zhiyong Wu, Jia Jia, Helen Meng:
Towards Cross-speaker Reading Style Transfer on Audiobook Dataset. CoRR abs/2208.05359 (2022)
- 2021
- [c117]Suping Zhou, Jia Jia, Zhiyong Wu, Zhihan Yang, Yanfeng Wang, Wei Chen, Fanbo Meng, Shuo Huang, Jialie Shen, Xiaochuan Wang:
Inferring Emotion from Large-scale Internet Voice Data: A Semi-supervised Curriculum Augmentation based Deep Learning Approach. AAAI 2021: 6039-6047 - [c116]Huirong Huang, Zhiyong Wu, Shiyin Kang, Dongyang Dai, Jia Jia, Tianxiao Fu, Deyi Tuo, Guangzhi Lei, Peng Liu, Dan Su, Dong Yu, Helen Meng:
Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams. APSIPA ASC 2021: 1433-1437 - [c115]Yaohua Bu, Tianyi Ma, Weijun Li, Hang Zhou, Jia Jia, Shengqi Chen, Kaiyuan Xu, Dachuan Shi, Haozhe Wu, Zhihan Yang, Kun Li, Zhiyong Wu, Yuanchun Shi, Xiaobo Lu, Ziwei Liu:
PTeacher: a Computer-Aided Personalized Pronunciation Training System with Exaggerated Audio-Visual Corrective Feedback. CHI 2021: 676:1-676:14 - [c114]Jinghe Cai, Bohan Chen, Chen Wang, Jia Jia:
Wander: A breath-control Audio Game to Support Sound Sleep. CHI PLAY 2021: 17-23 - [c113]Xiang Li, Changhe Song, Jingbei Li, Zhiyong Wu, Jia Jia, Helen Meng:
Towards Multi-Scale Style Control for Expressive Speech Synthesis. Interspeech 2021: 4673-4677 - [c112]Haozhe Wu, Jia Jia, Haoyu Wang, Yishun Dou, Chao Duan, Qingshan Deng:
Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis. ACM Multimedia 2021: 1478-1486 - [c111]Liangqi Liu, Jiankun Hu, Zhiyong Wu, Song Yang, Songfan Yang, Jia Jia, Helen Meng:
Controllable Emphatic Speech Synthesis based on Forward Attention for Expressive Speech Synthesis. SLT 2021: 410-414 - [e1]Tat-Seng Chua, Jingdong Wang, Qi Tian, Cathal Gurrin, Jia Jia, Hanwang Zhang, Qianru Sun:
MMAsia 2020: ACM Multimedia Asia, Virtual Event / Singapore, 7-9 March, 2021. ACM 2021, ISBN 978-1-4503-8308-0 - [i14]Xiang Li, Changhe Song, Jingbei Li, Zhiyong Wu, Jia Jia, Helen M. Meng:
Towards Multi-Scale Style Control for Expressive Speech Synthesis. CoRR abs/2104.03521 (2021) - [i13]Yaohua Bu, Tianyi Ma, Weijun Li, Hang Zhou, Jia Jia, Shengqi Chen, Kaiyuan Xu, Dachuan Shi, Haozhe Wu, Zhihan Yang, Kun Li, Zhiyong Wu, Yuanchun Shi, Xiaobo Lu, Ziwei Liu:
PTeacher: a Computer-Aided Personalized Pronunciation Training System with Exaggerated Audio-Visual Corrective Feedback. CoRR abs/2105.05182 (2021) - [i12]Haozhe Wu, Jia Jia, Haoyu Wang, Yishun Dou, Chao Duan, Qingshan Deng:
Imitating Arbitrary Talking Style for Realistic Audio-Driven Talking Face Synthesis. CoRR abs/2111.00203 (2021)
- 2020
- [c110]Tiancheng Shen, Jia Jia, Yan Li, Yihui Ma, Yaohua Bu, Hanjie Wang, Bo Chen, Tat-Seng Chua, Wendy Hall:
PEIA: Personality and Emotion Integrated Attentive Model for Music Recommendation on Social Media Platforms. AAAI 2020: 206-213 - [c109]Haozhe Wu, Zhiyuan Hu, Jia Jia, Yaohua Bu, Xiangnan He, Tat-Seng Chua:
Mining Unfollow Behavior in Large-Scale Online Social Networks via Spatial-Temporal Interaction. AAAI 2020: 254-261 - [c108]Jialie Shen, Karen Rafferty, Jia Jia:
Online Intelligent Music Recommendation: The Opportunity and Challenge for People Well-Being Improvement. CogMI 2020: 27-31 - [c107]Haozhe Wu, Jia Jia, Lingxi Xie, Guojun Qi, Yuanchun Shi, Qi Tian:
Cross-VAE: Towards Disentangling Expression from Identity For Human Faces. ICASSP 2020: 4087-4091 - [c106]Tiancheng Shen, Jia Jia, Yan Li, Hanjie Wang, Bo Chen:
Enhancing Music Recommendation with Social Media Content: an Attentive Multimodal Autoencoder Approach. IJCNN 2020: 1-8 - [c105]Kun Zhang, Zhiyong Wu, Daode Yuan, Jian Luan, Jia Jia, Helen Meng, Binheng Song:
Re-Weighted Interval Loss for Handling Data Imbalance Problem of End-to-End Keyword Spotting. INTERSPEECH 2020: 2567-2571 - [c104]Zijie Ye, Haozhe Wu, Jia Jia, Yaohua Bu, Wei Chen, Fanbo Meng, Yanfeng Wang:
ChoreoNet: Towards Music to Dance Synthesis with Choreographic Action Unit. ACM Multimedia 2020: 744-752 - [c103]Zhiyuan Hu, Jia Jia, Bei Liu, Yaohua Bu, Jianlong Fu:
Aesthetic-Aware Image Style Transfer. ACM Multimedia 2020: 3320-3329 - [c102]Yaohua Bu, Weijun Li, Tianyi Ma, Shengqi Chen, Jia Jia, Kun Li, Xiaobo Lu:
Visual-speech Synthesis of Exaggerated Corrective Feedback. ACM Multimedia 2020: 4521-4523 - [c101]Suping Zhou, Jia Jia, Long Zhang, Yanfeng Wang, Wei Chen, Fanbo Meng, Fei Yu, Jialie Shen:
Inferring Emphasis for Real Voice Data: An Attentive Multimodal Neural Network Approach. MMM (2) 2020: 52-62 - [i11]Huirong Huang, Zhiyong Wu, Shiyin Kang, Dongyang Dai, Jia Jia, Tianxiao Fu, Deyi Tuo, Guangzhi Lei, Peng Liu, Dan Su, Dong Yu, Helen Meng:
Speaker Independent and Multilingual/Mixlingual Speech-Driven Talking Head Generation Using Phonetic Posteriorgrams. CoRR abs/2006.11610 (2020) - [i10]Yaohua Bu, Weijun Li, Tianyi Ma, Shengqi Chen, Jia Jia, Kun Li, Xiaobo Lu:
Corrective feedback, emphatic speech synthesis, visual-speech exaggeration, pronunciation learning. CoRR abs/2009.05748 (2020) - [i9]Zijie Ye, Haozhe Wu, Jia Jia, Yaohua Bu, Wei Chen, Fanbo Meng, Yanfeng Wang:
ChoreoNet: Towards Music to Dance Synthesis with Choreographic Action Unit. CoRR abs/2009.07637 (2020)
2010 – 2019
- 2019
- [j19]Jia Jia, Suping Zhou, Yufeng Yin, Boya Wu, Wei Chen, Fanbo Meng, Yanfeng Wang:
Inferring Emotions From Large-Scale Internet Voice Data. IEEE Trans. Multim. 21(7): 1853-1866 (2019) - [c100]Liangqi Liu, Zhiyong Wu, Runnan Li, Jia Jia, Helen Meng:
Learning Contextual Representation with Convolution Bank and Multi-head Self-attention for Speech Emphasis Detection. APSIPA 2019: 922-926 - [c99]Kun Zhang, Zhiyong Wu, Jia Jia, Helen M. Meng, Binheng Song:
Query-by-Example Spoken Term Detection using Attentive Pooling Networks. APSIPA 2019: 1267-1272 - [c98]Senmao Wang, Pan Zhou, Wei Chen, Jia Jia, Lei Xie:
Exploring RNN-Transducer for Chinese speech recognition. APSIPA 2019: 1364-1369 - [c97]Yaohua Bu, Jia Jia, Xiang Li, Xiaobo Lu:
Emotional Design for Children's Electronic Picture Book. HCI (1) 2019: 392-403 - [c96]Pan Zhou, Wenwen Yang, Wei Chen, Yanfeng Wang, Jia Jia:
Modality Attention for End-to-end Audio-visual Speech Recognition. ICASSP 2019: 6565-6569 - [c95]Runnan Li, Zhiyong Wu, Jia Jia, Sheng Zhao, Helen Meng:
Dilated Residual Network with Multi-head Self-attention for Speech Emotion Recognition. ICASSP 2019: 6675-6679 - [c94]Hui Lu, Zhiyong Wu, Runnan Li, Shiyin Kang, Jia Jia, Helen Meng:
A Compact Framework for Voice Conversion Using Wavenet Conditioned on Phonetic Posteriorgrams. ICASSP 2019: 6810-6814 - [c93]Dongyang Dai, Zhiyong Wu, Runnan Li, Xixin Wu, Jia Jia, Helen Meng:
Learning Discriminative Features from Spectrograms Using Center Loss for Speech Emotion Recognition. ICASSP 2019: 7405-7409 - [c92]Yulan Chen, Jia Jia, Zhiyong Wu:
Modeling Emotion Influence Using Attention-based Graph Convolutional Recurrent Network. ICMI 2019: 302-309 - [c91]Runnan Li, Zhiyong Wu, Jia Jia, Yaohua Bu, Sheng Zhao, Helen Meng:
Towards Discriminative Representation Learning for Speech Emotion Recognition. IJCAI 2019: 5060-5066 - [c90]Kehua Lei, Tianyi Ma, Jia Jia, Cunjun Zhang, Zhihan Yang:
Design and Implementation of a Disambiguity Framework for Smart Voice Controlled Devices. IJCAI 2019: 6536-6538 - [c89]Hui Lu, Zhiyong Wu, Dongyang Dai, Runnan Li, Shiyin Kang, Jia Jia, Helen Meng:
One-Shot Voice Conversion with Global Speaker Embeddings. INTERSPEECH 2019: 669-673 - [c88]Dongyang Dai, Zhiyong Wu, Shiyin Kang, Xixin Wu, Jia Jia, Dan Su, Dong Yu, Helen Meng:
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT. INTERSPEECH 2019: 2090-2094 - [c87]Ruchao Fan, Pan Zhou, Wei Chen, Jia Jia, Gang Liu:
An Online Attention-Based Model for Speech Recognition. INTERSPEECH 2019: 4390-4394 - [c86]Suping Zhou, Jia Jia, Yufeng Yin, Xiang Li, Yang Yao, Ying Zhang, Zeyang Ye, Kehua Lei, Yan Huang, Jialie Shen:
Understanding the Teaching Styles by an Attention based Multi-task Cross-media Dimensional Modeling. ACM Multimedia 2019: 1322-1330 - [i8]Pan Zhou, Ruchao Fan, Wei Chen, Jia Jia:
Improving Generalization of Transformer for Speech Recognition with Parallel Schedule Sampling and Relative Positional Embedding. CoRR abs/1911.00203 (2019) - [i7]Haozhe Wu, Zhiyuan Hu, Jia Jia, Yaohua Bu, Xiangnan He, Tat-Seng Chua:
Mining Unfollow Behavior in Large-Scale Online Social Networks via Spatial-Temporal Interaction. CoRR abs/1911.07156 (2019) - [i6]Suping Zhou, Jia Jia, Yufeng Yin, Xiang Li, Yang Yao, Ying Zhang, Zeyang Ye, Kehua Lei, Yan Huang, Jialie Shen:
Understanding the Teaching Styles by an Attention based Multi-task Cross-media Dimensional modelling. CoRR abs/1911.07253 (2019)
- 2018
- [c85]Suping Zhou, Jia Jia, Qi Wang, Yufei Dong, Yufeng Yin, Kehua Lei:
Inferring Emotion from Conversational Voice Data: A Semi-Supervised Multi-Path Generative Neural Network Approach. AAAI 2018: 579-587 - [c84]Yaohua Bu, Jia Jia, Yuhan Tang, Xuan Zang, Tianyu Gao:
Lookine: Let the Blind Hear a Smile. AAAI 2018: 8196-8197 - [c83]Yihui Ma, Jia Jia, Yufan Hou, Yaohua Bu, Wentao Han:
Understanding The Aesthetic Styles of Social Images. ICASSP 2018: 3056-3060 - [c82]Runnan Li, Zhiyong Wu, Yuchen Huang, Jia Jia, Helen Meng, Lianhong Cai:
Emphatic Speech Generation with Conditioned Input Layer and Bidirectional LSTMS for Expressive Speech Synthesis. ICASSP 2018: 5129-5133 - [c81]Wenjing Cai, Jia Jia, Wentao Han:
Inferring Emotions from Image Social Networks Using Group-Based Factor Graph Model. ICME 2018: 1-6 - [c80]Tiancheng Shen, Jia Jia, Guangyao Shen, Fuli Feng, Xiangnan He, Huanbo Luan, Jie Tang, Thanassis Tiropanis, Tat-Seng Chua, Wendy Hall:
Cross-Domain Depression Detection via Harvesting Social Media. IJCAI 2018: 1611-1617 - [c79]Jia Jia:
Mental Health Computing via Harvesting Social Media Data. IJCAI 2018: 5677-5681 - [c78]Xi Ma, Zhiyong Wu, Jia Jia, Mingxing Xu, Helen Meng, Lianhong Cai:
Emotion Recognition from Variable-Length Speech Segments Using Deep Learning on Spectrograms. INTERSPEECH 2018: 3683-3687 - [c77]Long Zhang, Jia Jia, Fanbo Meng, Suping Zhou, Wei Chen, Cunjun Zhang, Runnan Li:
Emphasis Detection for Voice Dialogue Applications Using Multi-channel Convolutional Bidirectional Long Short-Term Memory Network. ISCSLP 2018: 210-214 - [c76]Mu Wang, Zhiyong Wu, Shiyin Kang, Xixin Wu, Jia Jia, Dan Su, Dong Yu, Helen Meng:
Speech Super-Resolution Using Parallel WaveNet. ISCSLP 2018: 260-264 - [c75]Yuguang Wang, Liangliang Shi, Linyu Wei, Weifeng Zhu, Jinkun Chen, Zhichao Wang, Shixue Wen, Wei Chen, Yanfeng Wang, Jia Jia:
The Sogou-TIIC Speech Translation System for IWSLT 2018. IWSLT 2018: 112-117 - [c74]Runnan Li, Zhiyong Wu, Jia Jia, Jingbei Li, Wei Chen, Helen Meng:
Inferring User Emotive State Changes in Realistic Human-Computer Conversational Dialogs. ACM Multimedia 2018: 136-144 - [c73]Cunjun Zhang, Kehua Lei, Jia Jia, Yihui Ma, Zhiyuan Hu:
AI Painting: An Aesthetic Painting Generation System. ACM Multimedia 2018: 1231-1233 - [c72]Taoran Tang, Hanyang Mao, Jia Jia:
AniDance: Real-Time Dance Motion Synthesize to the Song. ACM Multimedia 2018: 1237-1239 - [c71]Yaohua Bu, Jia Jia, Xiang Li, Suping Zhou, Xiaobo Lu:
IcooBook: When the Picture Book for Children Encounters Aesthetics of Interaction. ACM Multimedia 2018: 1260-1262 - [c70]Taoran Tang, Jia Jia, Hanyang Mao:
Dance with Melody: An LSTM-autoencoder Approach to Music-oriented Dance Synthesis. ACM Multimedia 2018: 1598-1606 - [c69]Xueliang Liu, Rui Min, Benoit Huet, Jia Jia:
MAHCI 2018: The 1st Workshop on Multimedia for Accessible Human Computer Interface. ACM Multimedia 2018: 2118-2119 - [c68]Peijun Zhao, Jia Jia, Yongsheng An, Jie Liang, Lexing Xie, Jiebo Luo:
Analyzing and Predicting Emoji Usages in Social Media. WWW (Companion Volume) 2018: 327-334 - [i5]Senmao Wang, Pan Zhou, Wei Chen, Jia Jia, Lei Xie:
Exploring RNN-Transducer for Chinese Speech Recognition. CoRR abs/1811.05097 (2018) - [i4]Ruchao Fan, Pan Zhou, Wei Chen, Jia Jia, Gang Liu:
An Online Attention-based Model for Speech Recognition. CoRR abs/1811.05247 (2018) - [i3]Pan Zhou, Wenwen Yang, Wei Chen, Yanfeng Wang, Jia Jia:
Modality Attention for End-to-End Audio-visual Speech Recognition.