


Остановите войну!
for scientists:


default search action
Yikang Shen
Person information

Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2023
- [c28]Zitian Chen, Yikang Shen, Mingyu Ding, Zhenfang Chen, Hengshuang Zhao, Erik G. Learned-Miller, Chuang Gan:
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners. CVPR 2023: 11828-11837 - [c27]Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention. CVPR 2023: 14528-14539 - [c26]Zeyu Huang, Yikang Shen, Xiaofeng Zhang, Jie Zhou, Wenge Rong, Zhang Xiong:
Transformer-Patcher: One Mistake Worth One Neuron. ICLR 2023 - [c25]Mengdi Xu, Yuchen Lu, Yikang Shen, Shun Zhang, Ding Zhao, Chuang Gan:
Hyper-Decision Transformer for Efficient Online Policy Adaptation. ICLR 2023 - [c24]Shun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, Chuang Gan:
Planning with Large Language Models for Code Generation. ICLR 2023 - [i28]Zhenfang Chen, Qinhong Zhou, Yikang Shen, Yining Hong, Hao Zhang, Chuang Gan:
See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning. CoRR abs/2301.05226 (2023) - [i27]Zeyu Huang, Yikang Shen, Xiaofeng Zhang, Jie Zhou, Wenge Rong, Zhang Xiong:
Transformer-Patcher: One Mistake worth One Neuron. CoRR abs/2301.09785 (2023) - [i26]Shun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, Chuang Gan:
Planning with Large Language Models for Code Generation. CoRR abs/2303.05510 (2023) - [i25]Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention. CoRR abs/2304.03282 (2023) - [i24]Mengdi Xu, Yuchen Lu, Yikang Shen, Shun Zhang, Ding Zhao, Chuang Gan:
Hyper-Decision Transformer for Efficient Online Policy Adaptation. CoRR abs/2304.08487 (2023) - [i23]Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David D. Cox, Yiming Yang, Chuang Gan:
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision. CoRR abs/2305.03047 (2023) - [i22]Yikang Shen, Zheyu Zhang, Tianyou Cao, Shawn Tan, Zhenfang Chen, Chuang Gan:
ModuleFormer: Learning Modular Large Language Models From Uncurated Data. CoRR abs/2306.04640 (2023) - [i21]Zitian Chen, Mingyu Ding, Yikang Shen, Wei Zhan, Masayoshi Tomizuka, Erik G. Learned-Miller, Chuang Gan:
An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training. CoRR abs/2306.17165 (2023) - [i20]Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-Yan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell:
Aligning Large Multimodal Models with Factually Augmented RLHF. CoRR abs/2309.14525 (2023) - 2022
- [c23]Yikang Shen, Shawn Tan, Alessandro Sordoni, Peng Li, Jie Zhou, Aaron C. Courville:
Unsupervised Dependency Graph Network. ACL (1) 2022: 4767-4784 - [c22]Xiaotao Gu, Yikang Shen, Jiaming Shen, Jingbo Shang, Jiawei Han:
Phrase-aware Unsupervised Constituency Parsing. ACL (1) 2022: 6406-6415 - [c21]Xiaofeng Zhang, Yikang Shen, Zeyu Huang, Jie Zhou, Wenge Rong, Zhang Xiong:
Mixture of Attention Heads: Selecting Attention Heads Per Token. EMNLP 2022: 4150-4162 - [c20]Mengdi Xu, Yikang Shen, Shun Zhang, Yuchen Lu, Ding Zhao, Joshua B. Tenenbaum, Chuang Gan:
Prompting Decision Transformer for Few-Shot Policy Generalization. ICML 2022: 24631-24645 - [i19]Yikang Shen:
Syntactic Inductive Biases for Deep Learning Methods. CoRR abs/2206.04806 (2022) - [i18]Mengdi Xu, Yikang Shen, Shun Zhang, Yuchen Lu, Ding Zhao, Joshua B. Tenenbaum, Chuang Gan:
Prompting Decision Transformer for Few-Shot Policy Generalization. CoRR abs/2206.13499 (2022) - [i17]Xiaofeng Zhang, Yikang Shen, Zeyu Huang, Jie Zhou, Wenge Rong, Zhang Xiong:
Mixture of Attention Heads: Selecting Attention Heads Per Token. CoRR abs/2210.05144 (2022) - [i16]Zitian Chen, Yikang Shen, Mingyu Ding, Zhenfang Chen, Hengshuang Zhao, Erik G. Learned-Miller, Chuang Gan:
Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners. CoRR abs/2212.08066 (2022) - 2021
- [c19]Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler, Aaron C. Courville:
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling. ACL/IJCNLP (1) 2021: 7196-7209 - [c18]Yuchen Lu, Yikang Shen, Siyuan Zhou, Aaron C. Courville, Joshua B. Tenenbaum, Chuang Gan:
Learning Task Decomposition with Ordered Memory Policy Network. ICLR 2021 - [c17]Yi Tay, Mostafa Dehghani, Samira Abnar, Yikang Shen, Dara Bahri, Philip Pham, Jinfeng Rao, Liu Yang, Sebastian Ruder, Donald Metzler:
Long Range Arena : A Benchmark for Efficient Transformers. ICLR 2021 - [c16]Yikang Shen, Shawn Tan, Alessandro Sordoni, Siva Reddy, Aaron C. Courville:
Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle. NAACL-HLT 2021: 1660-1672 - [c15]Aston Zhang, Yi Tay, Yikang Shen, Alvin Chan, Shuai Zhang:
Self-Instantiated Recurrent Units with Dynamic Soft Recursion. NeurIPS 2021: 6503-6514 - [i15]Yuchen Lu, Yikang Shen, Siyuan Zhou, Aaron C. Courville, Joshua B. Tenenbaum, Chuang Gan:
Learning Task Decomposition with Ordered Memory Policy Network. CoRR abs/2103.10972 (2021) - 2020
- [c14]Wenyu Du, Zhouhan Lin, Yikang Shen, Timothy J. O'Donnell, Yoshua Bengio, Yue Zhang:
Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach. ACL 2020: 6611-6628 - [c13]Shawn Tan, Yikang Shen, Alessandro Sordoni, Aaron C. Courville, Timothy J. O'Donnell:
Recursive Top-Down Production for Sentence Generation with Latent Trees. EMNLP (Findings) 2020: 2291-2307 - [i14]Wenyu Du, Zhouhan Lin, Yikang Shen, Timothy J. O'Donnell, Yoshua Bengio, Yue Zhang:
Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach. CoRR abs/2005.05864 (2020) - [i13]Shawn Tan, Yikang Shen, Timothy J. O'Donnell, Alessandro Sordoni, Aaron C. Courville:
Recursive Top-Down Production for Sentence Generation with Latent Trees. CoRR abs/2010.04704 (2020) - [i12]Yi Tay, Mostafa Dehghani, Samira Abnar, Yikang Shen, Dara Bahri, Philip Pham, Jinfeng Rao, Liu Yang, Sebastian Ruder, Donald Metzler:
Long Range Arena: A Benchmark for Efficient Transformers. CoRR abs/2011.04006 (2020) - [i11]Yikang Shen, Shawn Tan, Alessandro Sordoni, Siva Reddy, Aaron C. Courville:
Explicitly Modeling Syntax in Language Model improves Generalization. CoRR abs/2011.07960 (2020) - [i10]Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler, Aaron C. Courville:
StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling. CoRR abs/2012.00857 (2020)
2010 – 2019
- 2019
- [c12]Yikang Shen, Shawn Tan, Alessandro Sordoni, Aaron C. Courville:
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks. ICLR 2019 - [c11]Yikang Shen, Shawn Tan, Seyed Arian Hosseini, Zhouhan Lin, Alessandro Sordoni, Aaron C. Courville:
Ordered Memory. NeurIPS 2019: 5038-5049 - [i9]Shawn Tan, Yikang Shen, Chin-Wei Huang, Aaron C. Courville:
Investigating Biases in Textual Entailment Datasets. CoRR abs/1906.09635 (2019) - [i8]Yikang Shen, Shawn Tan, Seyedarian Hosseini, Zhouhan Lin, Alessandro Sordoni, Aaron C. Courville:
Ordered Memory. CoRR abs/1910.13466 (2019) - 2018
- [j1]Nan Jiang
, Wenge Rong, Yifan Nie, Yikang Shen, Zhang Xiong:
Biological Event Trigger Identification with Noise Contrastive Estimation. IEEE ACM Trans. Comput. Biol. Bioinform. 15(5): 1549-1559 (2018) - [c10]Yikang Shen, Zhouhan Lin, Athul Paul Jacob, Alessandro Sordoni, Aaron C. Courville, Yoshua Bengio:
Straight to the Tree: Constituency Parsing with Neural Syntactic Distance. ACL (1) 2018: 1171-1180 - [c9]Yue Dong
, Yikang Shen, Eric Crawford, Herke van Hoof, Jackie Chi Kit Cheung:
BanditSum: Extractive Summarization as a Contextual Bandit. EMNLP 2018: 3739-3748 - [c8]Yikang Shen, Zhouhan Lin, Chin-Wei Huang, Aaron C. Courville:
Neural Language Modeling by Jointly Learning Syntax and Lexicon. ICLR (Poster) 2018 - [i7]Yikang Shen, Shawn Tan, Chin-Wei Huang, Aaron C. Courville:
Generating Contradictory, Neutral, and Entailing Sentences. CoRR abs/1803.02710 (2018) - [i6]Yikang Shen, Zhouhan Lin, Athul Paul Jacob, Alessandro Sordoni, Aaron C. Courville, Yoshua Bengio:
Straight to the Tree: Constituency Parsing with Neural Syntactic Distance. CoRR abs/1806.04168 (2018) - [i5]Yue Dong, Yikang Shen, Eric Crawford, Herke van Hoof, Jackie Chi Kit Cheung:
BanditSum: Extractive Summarization as a Contextual Bandit. CoRR abs/1809.09672 (2018) - [i4]Yikang Shen, Shawn Tan, Alessandro Sordoni, Aaron C. Courville:
Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks. CoRR abs/1810.09536 (2018) - 2017
- [c7]Yikang Shen, Wenge Rong, Nan Jiang, Baolin Peng, Jie Tang, Zhang Xiong:
Word Embedding Based Correlation Model for Question/Answer Matching. AAAI 2017: 3511-3517 - [c6]Nan Jiang, Wenge Rong, Min Gao
, Yikang Shen, Zhang Xiong:
Exploration of Tree-based Hierarchical Softmax for Recurrent Language Models. IJCAI 2017: 1951-1957 - [i3]Yikang Shen, Shawn Tan, Christopher Joseph Pal, Aaron C. Courville:
Self-organized Hierarchical Softmax. CoRR abs/1707.08588 (2017) - [i2]Yikang Shen, Zhouhan Lin, Chin-Wei Huang, Aaron C. Courville:
Neural Language Modeling by Jointly Learning Syntax and Lexicon. CoRR abs/1711.02013 (2017) - 2016
- [c5]Siqi Xiang, Wenge Rong, Yikang Shen, Yuanxin Ouyang, Zhang Xiong:
Multidimensional scaling based knowledge provision for new questions in community Question Answering systems. IJCNN 2016: 115-122 - [c4]Yazhi Gao, Wenge Rong, Yikang Shen, Zhang Xiong:
Convolutional Neural Network based sentiment analysis using Adaboost combination. IJCNN 2016: 1333-1338 - 2015
- [c3]Yikang Shen, Wenge Rong, Zhiwei Sun, Yuanxin Ouyang, Zhang Xiong:
Question/Answer Matching for CQA System via Combining Lexical and Sequential Information. AAAI 2015: 275-281 - [i1]Yikang Shen, Wenge Rong, Nan Jiang, Baolin Peng, Jie Tang, Zhang Xiong:
Word Embedding based Correlation Model for Question/Answer Matching. CoRR abs/1511.04646 (2015) - 2014
- [c2]Yifan Nie, Wenge Rong, Yikang Shen, Chao Li, Zhang Xiong:
Choosing the Best Auto-Encoder-Based Bagging Classifier: An Empirical Study. ICONIP (1) 2014: 413-420 - [c1]Zhiwei Sun, Wenge Rong, Yikang Shen, Yuanxin Ouyang, Chao Li, Zhang Xiong:
Influencing Factors Analysis of People's Answering Behaviours on Social Network Based Questions. UIC/ATC/ScalCom 2014: 196-203
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from ,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2023-09-28 02:52 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint