


Остановите войну!
for scientists:


default search action
Bryan Catanzaro
Bryan Christopher Catanzaro
Person information

- affiliation: Baidu Inc., Sunnyvale, USA
- affiliation: University of California, Berkeley, Department of Electrical Engineering and Computer Sciences
- affiliation: Brigham Young University, Electrical and Computer Engineering Department
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [j6]Guilin Liu
, Aysegul Dundar
, Kevin J. Shih, Ting-Chun Wang, Fitsum A. Reda, Karan Sapra, Zhiding Yu, Xiaodong Yang, Andrew Tao, Bryan Catanzaro:
Partial Convolution for Padding, Inpainting, and Image Synthesis. IEEE Trans. Pattern Anal. Mach. Intell. 45(5): 6096-6110 (2023) - [c52]Bryan Catanzaro:
Language Models: The Most Important Compute Challenge of Our Time (Keynote). ASPLOS (3) 2023: 2 - [c51]Dan Su, Mostofa Patwary, Shrimai Prabhumoye, Peng Xu, Ryan Prenger, Mohammad Shoeybi, Pascale Fung, Anima Anandkumar, Bryan Catanzaro:
Context Generation Improves Open Domain Question Answering. EACL (Findings) 2023: 781-796 - [c50]Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Adding Instructions during Pretraining: Effective way of Controlling Toxicity in Language Models. EACL 2023: 2628-2643 - [i68]Rohan Badlani, Rafael Valle, Kevin J. Shih, João Felipe Santos, Siddharth Gururani, Bryan Catanzaro:
Multilingual Multiaccented Multispeaker TTS with RADTTS. CoRR abs/2301.10335 (2023) - [i67]Zhuolin Yang, Wei Ping, Zihan Liu, Vijay Korthikanti, Weili Nie, De-An Huang, Linxi Fan, Zhiding Yu, Shiyi Lan, Bo Li, Ming-Yu Liu, Yuke Zhu, Mohammad Shoeybi, Bryan Catanzaro, Chaowei Xiao, Anima Anandkumar:
Re-ViLM: Retrieval-Augmented Visual Language Model for Zero and Few-Shot Image Captioning. CoRR abs/2302.04858 (2023) - [i66]Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Adding Instructions during Pretraining: Effective Way of Controlling Toxicity in Language Models. CoRR abs/2302.07388 (2023) - [i65]Rohan Badlani, Akshit Arora, Subhankar Ghosh, Rafael Valle, Kevin J. Shih, João Felipe Santos, Boris Ginsburg, Bryan Catanzaro:
VANI: Very-lightweight Accent-controllable TTS for Native and Non-native speakers with Identity Preservation. CoRR abs/2303.07578 (2023) - [i64]Boxin Wang, Wei Ping, Peng Xu, Lawrence McAfee, Zihan Liu, Mohammad Shoeybi, Yi Dong, Oleksii Kuchaiev, Bo Li, Chaowei Xiao, Anima Anandkumar, Bryan Catanzaro:
Shall We Pretrain Autoregressive Language Models with Retrieval? A Comprehensive Study. CoRR abs/2304.06762 (2023) - [i63]Songwei Ge, Seungjun Nah, Guilin Liu, Tyler Poon, Andrew Tao, Bryan Catanzaro, David Jacobs, Jia-Bin Huang, Ming-Yu Liu, Yogesh Balaji:
Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models. CoRR abs/2305.10474 (2023) - [i62]Aysegul Dundar, Jun Gao, Andrew Tao, Bryan Catanzaro:
Progressive Learning of 3D Reconstruction Network from 2D GAN Data. CoRR abs/2305.11102 (2023) - 2022
- [j5]Aysegul Dundar
, Kevin J. Shih, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro:
Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos. IEEE Trans. Pattern Anal. Mach. Intell. 44(7): 3883-3894 (2022) - [c49]Zihan Liu, Mostofa Patwary, Ryan Prenger, Shrimai Prabhumoye, Wei Ping, Mohammad Shoeybi, Bryan Catanzaro:
Multi-Stage Prompting for Knowledgeable Dialogue Generation. ACL (Findings) 2022: 1317-1337 - [c48]Peng Xu, Mostofa Patwary, Shrimai Prabhumoye, Virginia Adams, Ryan Prenger, Wei Ping, Nayeon Lee, Mohammad Shoeybi, Bryan Catanzaro:
Evaluating Parameter Efficient Learning for Generation. EMNLP 2022: 4824-4833 - [c47]Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro:
One TTS Alignment to Rule Them All. ICASSP 2022: 6092-6096 - [c46]Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro:
Speech Denoising in the Waveform Domain With Self-Attention. ICASSP 2022: 7867-7871 - [c45]John Guibas, Morteza Mardani, Zongyi Li, Andrew Tao, Anima Anandkumar, Bryan Catanzaro:
Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators. ICLR 2022 - [c44]Nayeon Lee, Wei Ping, Peng Xu, Mostofa Patwary, Pascale Fung, Mohammad Shoeybi, Bryan Catanzaro:
Factuality Enhanced Language Models for Open-Ended Text Generation. NeurIPS 2022 - [c43]Boxin Wang, Wei Ping, Chaowei Xiao, Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Bo Li, Anima Anandkumar, Bryan Catanzaro:
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models. NeurIPS 2022 - [i61]Shaden Smith, Mostofa Patwary, Brandon Norick, Patrick LeGresley, Samyam Rajbhandari, Jared Casper, Zhun Liu, Shrimai Prabhumoye, George Zerveas, Vijay Korthikanti, Elton Zheng, Rewon Child, Reza Yazdani Aminabadi, Julie Bernauer, Xia Song, Mohammad Shoeybi, Yuxiong He, Michael Houston, Saurabh Tiwary, Bryan Catanzaro:
Using DeepSpeed and Megatron to Train Megatron-Turing NLG 530B, A Large-Scale Generative Language Model. CoRR abs/2201.11990 (2022) - [i60]Max Ehrlich, Jon Barker, Namitha Padmanabhan, Larry Davis, Andrew Tao, Bryan Catanzaro, Abhinav Shrivastava:
Leveraging Bitstream Metadata for Fast and Accurate Video Compression Correction. CoRR abs/2202.00011 (2022) - [i59]Boxin Wang, Wei Ping, Chaowei Xiao, Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Bo Li, Anima Anandkumar, Bryan Catanzaro:
Exploring the Limits of Domain-Adaptive Training for Detoxifying Large-Scale Language Models. CoRR abs/2202.04173 (2022) - [i58]Zhifeng Kong, Wei Ping, Ambrish Dantrey, Bryan Catanzaro:
Speech Denoising in the Waveform Domain with Self-Attention. CoRR abs/2202.07790 (2022) - [i57]Kevin J. Shih, Rafael Valle, Rohan Badlani, João Felipe Santos, Bryan Catanzaro:
Generative Modeling for Low Dimensional Speech Attributes with Neural Spline Flows. CoRR abs/2203.01786 (2022) - [i56]Zihan Liu, Mostofa Patwary, Ryan Prenger, Shrimai Prabhumoye, Wei Ping, Mohammad Shoeybi, Bryan Catanzaro:
Multi-Stage Prompting for Knowledgeable Dialogue Generation. CoRR abs/2203.08745 (2022) - [i55]Aysegul Dundar, Jun Gao, Andrew Tao, Bryan Catanzaro:
Fine Detailed Texture Learning for 3D Meshes with Generative Models. CoRR abs/2203.09362 (2022) - [i54]Vijay Korthikanti, Jared Casper, Sangkug Lym, Lawrence McAfee, Michael Andersch, Mohammad Shoeybi, Bryan Catanzaro:
Reducing Activation Recomputation in Large Transformer Models. CoRR abs/2205.05198 (2022) - [i53]Rajarshi Roy, Jonathan Raiman, Neel Kant, Ilyas Elkin, Robert Kirby, Michael Y. Siu, Stuart F. Oberman, Saad Godil, Bryan Catanzaro:
PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning. CoRR abs/2205.07000 (2022) - [i52]Nayeon Lee, Wei Ping, Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Factuality Enhanced Language Models for Open-Ended Text Generation. CoRR abs/2206.04624 (2022) - [i51]Sang-gil Lee, Wei Ping, Boris Ginsburg, Bryan Catanzaro, Sungroh Yoon:
BigVGAN: A Universal Neural Vocoder with Large-Scale Training. CoRR abs/2206.04658 (2022) - [i50]Dan Su, Mostofa Patwary, Shrimai Prabhumoye, Peng Xu, Ryan Prenger, Mohammad Shoeybi, Pascale Fung, Anima Anandkumar, Bryan Catanzaro:
Context Generation Improves Open Domain Question Answering. CoRR abs/2210.06349 (2022) - [i49]Peng Xu, Mostofa Patwary, Shrimai Prabhumoye, Virginia Adams, Ryan J. Prenger, Wei Ping, Nayeon Lee, Mohammad Shoeybi, Bryan Catanzaro:
Evaluating Parameter Efficient Learning for Generation. CoRR abs/2210.13673 (2022) - [i48]Yogesh Balaji, Seungjun Nah, Xun Huang, Arash Vahdat, Jiaming Song, Karsten Kreis, Miika Aittala, Timo Aila, Samuli Laine, Bryan Catanzaro, Tero Karras, Ming-Yu Liu:
eDiff-I: Text-to-Image Diffusion Models with an Ensemble of Expert Denoisers. CoRR abs/2211.01324 (2022) - 2021
- [c42]Devendra Singh Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei Ping, William L. Hamilton, Bryan Catanzaro:
End-to-End Training of Neural Retrievers for Open-Domain Question Answering. ACL/IJCNLP (1) 2021: 6648-6662 - [c41]Anand Bhattad, Aysegul Dundar, Guilin Liu, Andrew Tao, Bryan Catanzaro:
View Generalization for Single Image Textured 3D Models. CVPR 2021: 6081-6090 - [c40]Rajarshi Roy
, Jonathan Raiman, Neel Kant, Ilyas Elkin, Robert Kirby, Michael Y. Siu, Stuart F. Oberman, Saad Godil, Bryan Catanzaro:
PrefixRL: Optimization of Parallel Prefix Circuits using Deep Reinforcement Learning. DAC 2021: 853-858 - [c39]Ning Yu, Guilin Liu, Aysegul Dundar, Andrew Tao, Bryan Catanzaro, Larry Davis, Mario Fritz:
Dual Contrastive Loss and Attention for GANs. ICCV 2021: 6711-6722 - [c38]Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro:
DiffWave: A Versatile Diffusion Model for Audio Synthesis. ICLR 2021 - [c37]Rafael Valle, Kevin J. Shih, Ryan Prenger, Bryan Catanzaro:
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis. ICLR 2021 - [c36]Chen Zhu, Wei Ping, Chaowei Xiao, Mohammad Shoeybi, Tom Goldstein, Anima Anandkumar, Bryan Catanzaro:
Long-Short Transformer: Efficient Transformers for Language and Vision. NeurIPS 2021: 17723-17736 - [c35]Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia:
Efficient large-scale language model training on GPU clusters using megatron-LM. SC 2021: 58 - [i47]Devendra Singh Sachan, Mostofa Patwary, Mohammad Shoeybi, Neel Kant, Wei Ping, William L. Hamilton, Bryan Catanzaro:
End-to-End Training of Neural Retrievers for Open-Domain Question Answering. CoRR abs/2101.00408 (2021) - [i46]Ning Yu, Guilin Liu, Aysegul Dundar, Andrew Tao, Bryan Catanzaro, Larry Davis, Mario Fritz:
Dual Contrastive Loss and Attention for GANs. CoRR abs/2103.16748 (2021) - [i45]Deepak Narayanan, Mohammad Shoeybi, Jared Casper, Patrick LeGresley, Mostofa Patwary, Vijay Korthikanti, Dmitri Vainbrand, Prethvi Kashinkunti, Julie Bernauer, Bryan Catanzaro, Amar Phanishayee, Matei Zaharia:
Efficient Large-Scale Language Model Training on GPU Clusters. CoRR abs/2104.04473 (2021) - [i44]Anand Bhattad, Aysegul Dundar, Guilin Liu, Andrew Tao, Bryan Catanzaro:
View Generalization for Single Image Textured 3D Models. CoRR abs/2106.06533 (2021) - [i43]Chen Zhu, Wei Ping, Chaowei Xiao, Mohammad Shoeybi, Tom Goldstein, Anima Anandkumar, Bryan Catanzaro:
Long-Short Transformer: Efficient Transformers for Language and Vision. CoRR abs/2107.02192 (2021) - [i42]Rohan Badlani, Adrian Lancucki, Kevin J. Shih, Rafael Valle, Wei Ping, Bryan Catanzaro:
One TTS Alignment To Rule Them All. CoRR abs/2108.10447 (2021) - [i41]Robert Kirby, Kolby Nottingham, Rajarshi Roy, Saad Godil, Bryan Catanzaro:
Guiding Global Placement With Reinforcement Learning. CoRR abs/2109.02631 (2021) - [i40]John Guibas, Morteza Mardani, Zongyi Li, Andrew Tao, Anima Anandkumar, Bryan Catanzaro:
Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers. CoRR abs/2111.13587 (2021) - [i39]Shrimai Prabhumoye, Rafal Kocielnik, Mohammad Shoeybi, Anima Anandkumar, Bryan Catanzaro:
Few-shot Instruction Prompts for Pretrained Language Models to Detect Social Biases. CoRR abs/2112.07868 (2021) - 2020
- [j4]Brucek Khailany, Haoxing Ren, Steve Dai, Saad Godil, Ben Keller, Robert Kirby, Alicia Klinefelter, Rangharajan Venkatesan, Yanqing Zhang, Bryan Catanzaro, William J. Dally:
Accelerating Chip Design With Machine Learning. IEEE Micro 40(6): 23-32 (2020) - [c34]Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro:
Large Scale Multi-Actor Generative Dialog Modeling. ACL 2020: 66-84 - [c33]Aysegul Dundar, Karan Sapra, Guilin Liu, Andrew Tao, Bryan Catanzaro:
Panoptic-Based Image Synthesis. CVPR 2020: 8067-8076 - [c32]Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Raul Puri, Pascale Fung, Anima Anandkumar, Bryan Catanzaro:
MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models. EMNLP (1) 2020: 2831-2845 - [c31]Raul Puri, Ryan Spring, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro:
Training Question Answering Models From Synthetic Data. EMNLP (1) 2020: 5811-5826 - [c30]Rafael Valle, Jason Li, Ryan Prenger, Bryan Catanzaro:
Mellotron: Multispeaker Expressive Voice Synthesis by Conditioning on Rhythm, Pitch and Global Style Tokens. ICASSP 2020: 6189-6193 - [c29]Vitaly Kurin, Saad Godil, Shimon Whiteson, Bryan Catanzaro:
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver? NeurIPS 2020 - [c28]Morteza Mardani, Guilin Liu, Aysegul Dundar, Shiqiu Liu, Andrew Tao, Bryan Catanzaro:
Neural FFTs for Universal Texture Image Synthesis. NeurIPS 2020 - [i38]Aysegul Dundar, Kevin J. Shih, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro:
Unsupervised Disentanglement of Pose, Appearance and Background from Images and Videos. CoRR abs/2001.09518 (2020) - [i37]Raul Puri, Ryan Spring
, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro:
Training Question Answering Models From Synthetic Data. CoRR abs/2002.09599 (2020) - [i36]Aysegul Dundar, Karan Sapra, Guilin Liu, Andrew Tao, Bryan Catanzaro:
Panoptic-based Image Synthesis. CoRR abs/2004.10289 (2020) - [i35]Rafael Valle, Kevin J. Shih, Ryan Prenger, Bryan Catanzaro:
Flowtron: an Autoregressive Flow-based Generative Network for Text-to-Speech Synthesis. CoRR abs/2005.05957 (2020) - [i34]Alex Boyd, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro:
Large Scale Multi-Actor Generative Dialog Modeling. CoRR abs/2005.06114 (2020) - [i33]Andrew Tao, Karan Sapra, Bryan Catanzaro:
Hierarchical Multi-Scale Attention for Semantic Segmentation. CoRR abs/2005.10821 (2020) - [i32]Guilin Liu, Rohan Taori, Ting-Chun Wang, Zhiding Yu, Shiqiu Liu, Fitsum A. Reda, Karan Sapra, Andrew Tao, Bryan Catanzaro:
Transposer: Universal Texture Synthesis Using Feature Maps as Transposed Convolution Filter. CoRR abs/2007.07243 (2020) - [i31]Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, Bryan Catanzaro:
DiffWave: A Versatile Diffusion Model for Audio Synthesis. CoRR abs/2009.09761 (2020) - [i30]Peng Xu, Mostofa Patwary, Mohammad Shoeybi, Raul Puri, Pascale Fung, Anima Anandkumar, Bryan Catanzaro:
MEGATRON-CNTRL: Controllable Story Generation with External Knowledge Using Large-Scale Language Models. CoRR abs/2010.00840 (2020) - [i29]Sashank Santhanam, Wei Ping, Raul Puri, Mohammad Shoeybi, Mostofa Patwary, Bryan Catanzaro:
Local Knowledge Powered Conversational Agents. CoRR abs/2010.10150 (2020)
2010 – 2019
- 2019
- [c27]Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn D. Newsam
, Andrew Tao, Bryan Catanzaro:
Improving Semantic Segmentation via Video Propagation and Label Relaxation. CVPR 2019: 8856-8865 - [c26]Ji Zhang, Kevin J. Shih, Ahmed Elgammal
, Andrew Tao, Bryan Catanzaro:
Graphical Contrastive Losses for Scene Graph Parsing. CVPR 2019: 11535-11543 - [c25]Ryan Prenger, Rafael Valle, Bryan Catanzaro:
Waveglow: A Flow-based Generative Network for Speech Synthesis. ICASSP 2019: 3617-3621 - [c24]Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Unsupervised Video Interpolation Using Cycle Consistency. ICCV 2019: 892-900 - [c23]Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Bryan Catanzaro, Jan Kautz:
Few-shot Video-to-Video Synthesis. NeurIPS 2019: 5014-5025 - [c22]Robert Kirby, Saad Godil, Rajarshi Roy
, Bryan Catanzaro:
CongestionNet: Routing Congestion Prediction Using Deep Graph Neural Networks. VLSI-SoC 2019: 217-222 - [i28]Ji Zhang, Kevin J. Shih, Ahmed Elgammal, Andrew Tao, Bryan Catanzaro:
Graphical Contrastive Losses for Scene Graph Generation. CoRR abs/1903.02728 (2019) - [i27]Alexander Ratner, Dan Alistarh, Gustavo Alonso, David G. Andersen, Peter Bailis, Sarah Bird, Nicholas Carlini, Bryan Catanzaro, Eric Chung, Bill Dally, Jeff Dean, Inderjit S. Dhillon, Alexandros G. Dimakis, Pradeep Dubey, Charles Elkan, Grigori Fursin, Gregory R. Ganger, Lise Getoor, Phillip B. Gibbons, Garth A. Gibson, Joseph E. Gonzalez, Justin Gottschlich, Song Han, Kim M. Hazelwood, Furong Huang, Martin Jaggi, Kevin G. Jamieson, Michael I. Jordan, Gauri Joshi, Rania Khalaf, Jason Knight, Jakub Konecný, Tim Kraska, Arun Kumar, Anastasios Kyrillidis, Jing Li
, Samuel Madden, H. Brendan McMahan, Erik Meijer, Ioannis Mitliagkas, Rajat Monga, Derek Gordon Murray, Dimitris S. Papailiopoulos, Gennady Pekhimenko, Theodoros Rekatsinas, Afshin Rostamizadeh, Christopher Ré, Christopher De Sa, Hanie Sedghi, Siddhartha Sen, Virginia Smith, Alex Smola, Dawn Song, Evan R. Sparks, Ion Stoica, Vivienne Sze, Madeleine Udell, Joaquin Vanschoren, Shivaram Venkataraman, Rashmi Vinayak, Markus Weimer, Andrew Gordon Wilson, Eric P. Xing, Matei Zaharia, Ce Zhang, Ameet Talwalkar:
SysML: The New Frontier of Machine Learning Systems. CoRR abs/1904.03257 (2019) - [i26]Fitsum A. Reda, Deqing Sun, Aysegul Dundar, Mohammad Shoeybi, Guilin Liu, Kevin J. Shih, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Unsupervised Video Interpolation Using Cycle Consistency. CoRR abs/1906.05928 (2019) - [i25]Kevin J. Shih, Aysegul Dundar, Animesh Garg, Robert Pottorf, Andrew Tao, Bryan Catanzaro:
Video Interpolation and Prediction with Unsupervised Landmarks. CoRR abs/1909.02749 (2019) - [i24]Mohammad Shoeybi, Mostofa Patwary, Raul Puri, Patrick LeGresley, Jared Casper, Bryan Catanzaro:
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism. CoRR abs/1909.08053 (2019) - [i23]Vitaly Kurin, Saad Godil, Shimon Whiteson, Bryan Catanzaro:
Improving SAT Solver Heuristics with Graph Networks and Reinforcement Learning. CoRR abs/1909.11830 (2019) - [i22]Rafael Valle, Jason Li, Ryan Prenger, Bryan Catanzaro:
Mellotron: Multispeaker expressive voice synthesis by conditioning on rhythm, pitch and global style tokens. CoRR abs/1910.11997 (2019) - [i21]Ting-Chun Wang, Ming-Yu Liu, Andrew Tao, Guilin Liu, Jan Kautz, Bryan Catanzaro:
Few-shot Video-to-Video Synthesis. CoRR abs/1910.12713 (2019) - [i20]Raul Puri, Bryan Catanzaro:
Zero-shot Text Classification With Generative Language Models. CoRR abs/1912.10165 (2019) - [i19]Rafael Valle, Fitsum A. Reda, Mohammad Shoeybi, Patrick LeGresley, Andrew Tao, Bryan Catanzaro:
Neural ODEs for Image Segmentation with Level Sets. CoRR abs/1912.11683 (2019) - 2018
- [c21]Edward Raff, Jon Barker, Jared Sylvester, Robert Brandon, Bryan Catanzaro, Charles K. Nicholas:
Malware Detection by Eating a Whole EXE. AAAI Workshops 2018: 268-276 - [c20]Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro:
High-Resolution Image Synthesis and Semantic Manipulation With Conditional GANs. CVPR 2018: 8798-8807 - [c19]Guilin Liu, Fitsum A. Reda, Kevin J. Shih, Ting-Chun Wang, Andrew Tao, Bryan Catanzaro:
Image Inpainting for Irregular Holes Using Partial Convolutions. ECCV (11) 2018: 89-105 - [c18]Fitsum A. Reda, Guilin Liu, Kevin J. Shih, Robert Kirby, Jon Barker, David Tarjan, Andrew Tao, Bryan Catanzaro:
SDC-Net: Video Prediction Using Spatially-Displaced Convolution. ECCV (7) 2018: 747-763 - [c17]Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Nikolai Yakovenko, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Video-to-Video Synthesis. NeurIPS 2018: 1152-1164 - [c16]Raul Puri, Robert Kirby, Nikolai Yakovenko, Bryan Catanzaro:
Large Scale Language Modeling: Converging on 40GB of Text in Four Hours. SBAC-PAD 2018: 290-297 - [i18]Guilin Liu, Fitsum A. Reda, Kevin J. Shih, Ting-Chun Wang, Andrew Tao, Bryan Catanzaro:
Image Inpainting for Irregular Holes Using Partial Convolutions. CoRR abs/1804.07723 (2018) - [i17]Raul Puri, Robert Kirby, Nikolai Yakovenko, Bryan Catanzaro:
Large Scale Language Modeling: Converging on 40GB of Text in Four Hours. CoRR abs/1808.01371 (2018) - [i16]Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Guilin Liu, Andrew Tao, Jan Kautz, Bryan Catanzaro:
Video-to-Video Synthesis. CoRR abs/1808.06601 (2018) - [i15]Ryan Prenger, Rafael Valle, Bryan Catanzaro:
WaveGlow: A Flow-based Generative Network for Speech Synthesis. CoRR abs/1811.00002 (2018) - [i14]Ji Zhang, Kevin J. Shih, Andrew Tao, Bryan Catanzaro, Ahmed Elgammal:
Introduction to the 1st Place Winning Model of OpenImages Relationship Detection Challenge. CoRR abs/1811.00662 (2018) - [i13]Fitsum A. Reda, Guilin Liu, Kevin J. Shih, Robert Kirby, Jon Barker, David Tarjan, Andrew Tao, Bryan Catanzaro:
SDCNet: Video Prediction Using Spatially-Displaced Convolution. CoRR abs/1811.00684 (2018) - [i12]Ji Zhang, Kevin J. Shih, Andrew Tao, Bryan Catanzaro, Ahmed Elgammal:
An Interpretable Model for Scene Graph Generation. CoRR abs/1811.09543 (2018) - [i11]Guilin Liu, Kevin J. Shih, Ting-Chun Wang, Fitsum A. Reda, Karan Sapra, Zhiding Yu, Andrew Tao, Bryan Catanzaro:
Partial Convolution based Padding. CoRR abs/1811.11718 (2018) - [i10]Neel Kant, Raul Puri, Nikolai Yakovenko, Bryan Catanzaro:
Practical Text Classification With Large Pre-Trained Language Models. CoRR abs/1812.01207 (2018) - [i9]Yi Zhu, Karan Sapra, Fitsum A. Reda, Kevin J. Shih, Shawn D. Newsam, Andrew Tao, Bryan Catanzaro:
Improving Semantic Segmentation via Video Propagation and Label Relaxation. CoRR abs/1812.01593 (2018) - 2017
- [c15]Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Enhao Gong, Shijian Tang, Erich Elsen, Peter Vajda, Manohar Paluri, John Tran, Bryan Catanzaro, William J. Dally:
DSD: Dense-Sparse-Dense Training for Deep Neural Networks. ICLR (Poster) 2017 - [i8]Edward Raff, Jon Barker, Jared Sylvester, Robert Brandon, Bryan Catanzaro, Charles K. Nicholas:
Malware Detection by Eating a Whole EXE. CoRR abs/1710.09435 (2017) - [i7]Ting-Chun Wang, Ming-Yu Liu, Jun-Yan Zhu, Andrew Tao, Jan Kautz, Bryan Catanzaro:
High-Resolution Image Synthesis and Semantic Manipulation with Conditional GANs. CoRR abs/1711.11585 (2017) - 2016
- [c14]Dario Amodei, Sundaram Ananthanarayanan, Rishita Anubhai, Jingliang Bai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse H. Engel, Linxi Fan, Christopher Fougner, Awni Y. Hannun, Billy Jun, Tony Han, Patrick LeGresley, Xiangang Li, Libby Lin, Sharan Narang, Andrew Y. Ng, Sherjil Ozair, Ryan Prenger, Sheng Qian, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Chong Wang, Yi Wang, Zhiqian Wang, Bo Xiao, Yan Xie, Dani Yogatama, Jun Zhan, Zhenyao Zhu:
Deep Speech 2 : End-to-End Speech Recognition in English and Mandarin. ICML 2016: 173-182 - [c13]Greg Diamos, Shubho Sengupta, Bryan Catanzaro, Mike Chrzanowski, Adam Coates, Erich Elsen, Jesse H. Engel, Awni Y. Hannun, Sanjeev Satheesh:
Persistent RNNs: Stashing Recurrent Weights On-Chip. ICML 2016: 2024-2033 - [i6]Song Han, Jeff Pool, Sharan Narang, Huizi Mao, Shijian Tang, Erich Elsen, Bryan Catanzaro, John Tran, William J. Dally:
DSD: Regularizing Deep Neural Networks with Dense-Sparse-Dense Training Flow. CoRR abs/1607.04381 (2016) - 2015
- [c12]Saurav Muralidharan, Michael Garland, Bryan Catanzaro, Albert Sidelnik, Mary W. Hall
:
A collection-oriented programming model for performance portability. PPoPP 2015: 263-264 - [i5]Dario Amodei, Rishita Anubhai, Eric Battenberg, Carl Case, Jared Casper, Bryan Catanzaro, Jingdong Chen, Mike Chrzanowski, Adam Coates, Greg Diamos, Erich Elsen, Jesse H. Engel, Linxi Fan, Christopher Fougner, Tony Han, Awni Y. Hannun, Billy Jun, Patrick LeGresley, Libby Lin, Sharan Narang, Andrew Y. Ng, Sherjil Ozair, Ryan Prenger, Jonathan Raiman, Sanjeev Satheesh, David Seetapun, Shubho Sengupta, Yi Wang, Zhiqian Wang, Chong Wang, Bo Xiao, Dani Yogatama, Jun Zhan, Zhenyao Zhu:
Deep Speech 2: End-to-End Speech Recognition in English and Mandarin. CoRR abs/1512.02595 (2015) - 2014
- [c11]Saurav Muralidharan, Manu Shantharam, Mary W. Hall
, Michael Garland, Bryan Catanzaro:
Nitro: A Framework for Adaptive Code Variant Tuning. IPDPS 2014: 501-512 - [c10]Bryan Catanzaro, Alexander Keller, Michael Garland:
A decomposition for in-place matrix transposition. PPoPP 2014: 193-206 - [i4]Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, Evan Shelhamer:
cuDNN: Efficient Primitives for Deep Learning. CoRR abs/1410.0759 (2014) - [i3]