default search action
Philipp Koehn
Person information
- affiliation: University of Edinburgh, UK
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [c222]Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi:
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts. ACL (Findings) 2024: 2668-2680 - [c221]Rachel Wicks, Matt Post, Philipp Koehn:
Recovering document annotations for sentence-level bitext. ACL (Findings) 2024: 9876-9890 - [c220]Tianjian Li, Haoran Xu, Philipp Koehn, Daniel Khashabi, Kenton Murray:
Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models. ICLR 2024 - [c219]Weiting Tan, Haoran Xu, Lingfeng Shen, Shuyue Stella Li, Kenton Murray, Philipp Koehn, Benjamin Van Durme, Yunmo Chen:
Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles. NAACL-HLT (Findings) 2024: 490-502 - [c218]Patrick Foley, Matthew Wiesner, Bismarck Odoom, Leibny Paola García-Perera, Kenton Murray, Philipp Koehn:
Where are you from? Geolocating Speech and Applications to Language Identification. NAACL-HLT 2024: 5114-5126 - [i58]Lingfeng Shen, Weiting Tan, Sihao Chen, Yunmo Chen, Jingyu Zhang, Haoran Xu, Boyuan Zheng, Philipp Koehn, Daniel Khashabi:
The Language Barrier: Dissecting Safety Challenges of LLMs in Multilingual Contexts. CoRR abs/2401.13136 (2024) - [i57]Weiting Tan, Yunmo Chen, Tongfei Chen, Guanghui Qin, Haoran Xu, Heidi C. Zhang, Benjamin Van Durme, Philipp Koehn:
Streaming Sequence Transduction through Dynamic Compression. CoRR abs/2402.01172 (2024) - [i56]Niyati Bafna, Philipp Koehn, David Yarowsky:
Pointer-Generator Networks for Low-Resource Machine Translation: Don't Copy That! CoRR abs/2403.10963 (2024) - [i55]Weiting Tan, Jingyu Zhang, Lingfeng Shen, Daniel Khashabi, Philipp Koehn:
DiffNorm: Self-Supervised Normalization for Non-autoregressive Speech-to-speech Translation. CoRR abs/2405.13274 (2024) - [i54]John F. Wu, Alina Hyk, Kiera McCormick, Christine Ye, Simone Astarita, Elina Baral, Jo Ciuca, Jesse Cranney, Anjalie Field, Kartheik Iyer, Philipp Koehn, Jenn Kotler, Sandor Kruk, Michelle Ntampaka, Charles O'Neill, Joshua E. G. Peek, Sanjib Sharma, Mikaeel Yunus:
Designing an Evaluation Framework for Large Language Models in Astronomy Research. CoRR abs/2405.20389 (2024) - [i53]Rachel Wicks, Matt Post, Philipp Koehn:
Recovering document annotations for sentence-level bitext. CoRR abs/2406.03869 (2024) - [i52]Taiming Lu, Philipp Koehn:
Every Language Counts: Learn and Unlearn in Multilingual LLMs. CoRR abs/2406.13748 (2024) - [i51]Tom Kocmi, Eleftherios Avramidis, Rachel Bawden, Ondrej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Markus Freitag, Thamme Gowda, Roman Grundkiewicz, Barry Haddow, Marzena Karpinska, Philipp Koehn, Benjamin Marie, Kenton Murray, Masaaki Nagata, Martin Popel, Maja Popovic, Mariya Shmatova, Steinþór Steingrímsson, Vilém Zouhar:
Preliminary WMT24 Ranking of General MT Systems and LLMs. CoRR abs/2407.19884 (2024) - 2023
- [c217]Jean Maillard, Cynthia Gao, Elahe Kalbassi, Kaushik Ram Sadagopan, Vedanuj Goswami, Philipp Koehn, Angela Fan, Francisco Guzmán:
Small Data, Big Impact: Leveraging Minimal Data for Effective Machine Translation. ACL (1) 2023: 2740-2756 - [c216]Weiting Tan, Kevin Heffernan, Holger Schwenk, Philipp Koehn:
Multilingual Representation Distillation with Contrastive Learning. EACL 2023: 1469-1482 - [c215]Haoran Xu, Weiting Tan, Shuyue Stella Li, Yunmo Chen, Benjamin Van Durme, Philipp Koehn, Kenton Murray:
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules. EMNLP 2023: 1575-1587 - [c214]Elizabeth Salesky, Neha Verma, Philipp Koehn, Matt Post:
Multilingual Pixel Representations for Translation and Effective Cross-lingual Transfer. EMNLP 2023: 13845-13861 - [c213]Tom Kocmi, Eleftherios Avramidis, Rachel Bawden, Ondrej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Markus Freitag, Thamme Gowda, Roman Grundkiewicz, Barry Haddow, Philipp Koehn, Benjamin Marie, Christof Monz, Makoto Morishita, Kenton Murray, Makoto Nagata, Toshiaki Nakazawa, Martin Popel, Maja Popovic, Mariya Shmatova:
Findings of the 2023 Conference on Machine Translation (WMT23): LLMs Are Here but Not Quite There Yet. WMT 2023: 1-42 - [c212]Longyue Wang, Zhaopeng Tu, Yan Gu, Siyou Liu, Dian Yu, Qingsong Ma, Chenyang Lyu, Liting Zhou, Chao-Hong Liu, Yufeng Ma, Weiyu Chen, Yvette Graham, Bonnie Webber, Philipp Koehn, Andy Way, Yulin Yuan, Shuming Shi:
Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs. WMT 2023: 55-67 - [c211]Steve Sloto, Brian Thompson, Huda Khayrallah, Tobias Domhan, Thamme Gowda, Philipp Koehn:
Findings of the WMT 2023 Shared Task on Parallel Data Curation. WMT 2023: 95-102 - [c210]Xuan Zhang, Navid Rajabi, Kevin Duh, Philipp Koehn:
Machine Translation with Large Language Models: Prompting, Few-shot Learning, and Fine-tuning with QLoRA. WMT 2023: 468-481 - [c209]Lemao Liu, Francisco Casacuberta, George Foster, Guoping Huang, Philipp Koehn, Geza Kovacs, Shuming Shi, Taro Watanabe, Chengqing Zong:
Findings of the Word-Level AutoCompletion Shared Task in WMT 2023. WMT 2023: 654-662 - [e20]Philipp Koehn, Barry Haddon, Tom Kocmi, Christof Monz:
Proceedings of the Eighth Conference on Machine Translation, WMT 2023, Singapore, December 6-7, 2023. Association for Computational Linguistics 2023, ISBN 979-8-89176-041-7 [contents] - [i50]Haoran Xu, Weiting Tan, Shuyue Stella Li, Yunmo Chen, Benjamin Van Durme, Philipp Koehn, Kenton Murray:
Condensing Multilingual Knowledge with Lightweight Language-Specific Modules. CoRR abs/2305.13993 (2023) - [i49]Elizabeth Salesky, Neha Verma, Philipp Koehn, Matt Post:
Pixel Representations for Multilingual Translation and Data-efficient Cross-lingual Transfer. CoRR abs/2305.14280 (2023) - [i48]Tianjian Li, Haoran Xu, Philipp Koehn, Daniel Khashabi, Kenton Murray:
Error Norm Truncation: Robust Training in the Presence of Data Noise for Text Generation Models. CoRR abs/2310.00840 (2023) - [i47]Weiting Tan, Haoran Xu, Lingfeng Shen, Shuyue Stella Li, Kenton Murray, Philipp Koehn, Benjamin Van Durme, Yunmo Chen:
Narrowing the Gap between Zero- and Few-shot Machine Translation by Matching Styles. CoRR abs/2311.02310 (2023) - [i46]Longyue Wang, Zhaopeng Tu, Yan Gu, Siyou Liu, Dian Yu, Qingsong Ma, Chenyang Lyu, Liting Zhou, Chao-Hong Liu, Yufeng Ma, Weiyu Chen, Yvette Graham, Bonnie Webber, Philipp Koehn, Andy Way, Yulin Yuan, Shuming Shi:
Findings of the WMT 2023 Shared Task on Discourse-Level Literary Translation: A Fresh Orb in the Cosmos of LLMs. CoRR abs/2311.03127 (2023) - 2022
- [c208]Simeng Sun, Angela Fan, James Cross, Vishrav Chaudhary, Chau Tran, Philipp Koehn, Francisco Guzmán:
Alternative Input Signals Ease Transfer in Multilingual Machine Translation. ACL (1) 2022: 5291-5305 - [c207]Weiting Tan, Shuoyang Ding, Huda Khayrallah, Philipp Koehn:
Doubly-Trained Adversarial Data Augmentation for Neural Machine Translation. AMTA 2022: 157-174 - [c206]Kelly Marchisio, Conghao Xiong, Philipp Koehn:
Embedding-Enhanced GIZA++: Improving Low-Resource Word Alignment Using Embeddings. AMTA 2022: 264-273 - [c205]Daniel Licht, Cynthia Gao, Janice Lam, Francisco Guzmán, Mona T. Diab, Philipp Koehn:
Consistent Human Evaluation of Machine Translation across Language Pairs. AMTA 2022: 309-321 - [c204]Haoran Xu, Philipp Koehn, Kenton Murray:
The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains. EMNLP 2022: 170-183 - [c203]Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq R. Joty:
Data Selection Curriculum for Neural Machine Translation. EMNLP (Findings) 2022: 1569-1582 - [c202]Kelly Marchisio, Ali Saad-Eldin, Kevin Duh, Carey E. Priebe, Philipp Koehn:
Bilingual Lexicon Induction for Low-Resource Languages using Graph Matching via Optimal Transport. EMNLP 2022: 2545-2561 - [c201]Yukun Feng, Feng Li, Philipp Koehn:
Toward the Limitation of Code-Switching in Cross-Lingual Transfer. EMNLP 2022: 5966-5971 - [c200]Kelly Marchisio, Neha Verma, Kevin Duh, Philipp Koehn:
IsoVec: Controlling the Relative Isomorphism of Word Embedding Spaces. EMNLP 2022: 6019-6033 - [c199]Xuan-Phi Nguyen, Hongyu Gong, Yun Tang, Changhan Wang, Philipp Koehn, Shafiq R. Joty:
Contrastive Clustering to Mine Pseudo Parallel Data for Unsupervised Translation. ICLR 2022 - [c198]Yukun Feng, Feng Li, Ziang Song, Boyuan Zheng, Philipp Koehn:
Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation. NAACL-HLT (Findings) 2022: 1409-1420 - [c197]Tom Kocmi, Rachel Bawden, Ondrej Bojar, Anton Dvorkovich, Christian Federmann, Mark Fishel, Thamme Gowda, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Rebecca Knowles, Philipp Koehn, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Michal Novák, Martin Popel, Maja Popovic:
Findings of the 2022 Conference on Machine Translation (WMT22). WMT 2022: 1-45 - [c196]Francisco Casacuberta, George Foster, Guoping Huang, Philipp Koehn, Geza Kovacs, Lemao Liu, Shuming Shi, Taro Watanabe, Chengqing Zong:
Findings of the Word-Level AutoCompletion Shared Task in WMT 2022. WMT 2022: 812-820 - [e19]Philipp Koehn, Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Tom Kocmi, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri, Aurélie Névéol, Mariana Neves, Martin Popel, Marco Turchi, Marcos Zampieri:
Proceedings of the Seventh Conference on Machine Translation, WMT 2022, Abu Dhabi, United Arab Emirates (Hybrid), December 7-8, 2022. Association for Computational Linguistics 2022, ISBN 978-1-959429-29-6 [contents] - [i45]Tasnim Mohiuddin, Philipp Koehn, Vishrav Chaudhary, James Cross, Shruti Bhosale, Shafiq R. Joty:
Data Selection Curriculum for Neural Machine Translation. CoRR abs/2203.13867 (2022) - [i44]Yukun Feng, Feng Li, Ziang Song, Boyuan Zheng, Philipp Koehn:
Learn To Remember: Transformer with Recurrent Memory for Document-Level Machine Translation. CoRR abs/2205.01546 (2022) - [i43]Daniel Licht, Cynthia Gao, Janice Lam, Francisco Guzmán, Mona T. Diab, Philipp Koehn:
Consistent Human Evaluation of Machine Translation across Language Pairs. CoRR abs/2205.08533 (2022) - [i42]Haoran Xu, Philipp Koehn, Kenton Murray:
The Importance of Being Parameters: An Intra-Distillation Method for Serious Gains. CoRR abs/2205.11416 (2022) - [i41]Marta R. Costa-jussà, James Cross, Onur Çelebi, Maha Elbayad, Kenneth Heafield, Kevin Heffernan, Elahe Kalbassi, Janice Lam, Daniel Licht, Jean Maillard, Anna Y. Sun, Skyler Wang, Guillaume Wenzek, Al Youngblood, Bapi Akula, Loïc Barrault, Gabriel Mejia Gonzalez, Prangthip Hansanti, John Hoffman, Semarley Jarrett, Kaushik Ram Sadagopan, Dirk Rowe, Shannon Spruit, Chau Tran, Pierre Andrews, Necip Fazil Ayan, Shruti Bhosale, Sergey Edunov, Angela Fan, Cynthia Gao, Vedanuj Goswami, Francisco Guzmán, Philipp Koehn, Alexandre Mourachko, Christophe Ropers, Safiyyah Saleem, Holger Schwenk, Jeff Wang:
No Language Left Behind: Scaling Human-Centered Machine Translation. CoRR abs/2207.04672 (2022) - [i40]Weiting Tan, Philipp Koehn:
Bitext Mining for Low-Resource Languages via Contrastive Learning. CoRR abs/2208.11194 (2022) - [i39]Weiting Tan, Kevin Heffernan, Holger Schwenk, Philipp Koehn:
Multilingual Representation Distillation with Contrastive Learning. CoRR abs/2210.05033 (2022) - [i38]Kelly Marchisio, Neha Verma, Kevin Duh, Philipp Koehn:
IsoVec: Controlling the Relative Isomorphism of Word Embedding Spaces. CoRR abs/2210.05098 (2022) - [i37]Kelly Marchisio, Ali Saad-Eldin, Kevin Duh, Carey E. Priebe, Philipp Koehn:
Bilingual Lexicon Induction for Low-Resource Languages using Graph Matching via Optimal Transport. CoRR abs/2210.14378 (2022) - 2021
- [c195]Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona T. Diab:
Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data. ACL/IJCNLP (1) 2021: 802-812 - [c194]Kelly Marchisio, Youngser Park, Ali Saad-Eldin, Anton Alyakin, Kevin Duh, Carey E. Priebe, Philipp Koehn:
An Analysis of Euclidean vs. Graph-Based Framing for Bilingual Lexicon Induction from Word Embedding Spaces. EMNLP (Findings) 2021: 738-749 - [c193]Shuoyang Ding, Marcin Junczys-Dowmunt, Matt Post, Philipp Koehn:
Levenshtein Training for Word-level Quality Estimation. EMNLP (1) 2021: 6724-6733 - [c192]Ahmed El-Kishky, Adithya Renduchintala, James Cross, Francisco Guzmán, Philipp Koehn:
XLEnt: Mining a Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment. EMNLP (1) 2021: 10424-10430 - [c191]Xutai Ma, Yongqiang Wang, Mohammad Javad Dousti, Philipp Koehn, Juan Miguel Pino:
Streaming Simultaneous Speech Translation with Augmented Memory Transformer. ICASSP 2021: 7523-7527 - [c190]Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur:
Learning Curricula for Multilingual Neural Machine Translation Training. MTSummit (1) 2021: 1-9 - [c189]Kelly Marchisio, Philipp Koehn, Conghao Xiong:
An Alignment-Based Approach to Semi-Supervised Bilingual Lexicon Induction with Small Parallel Corpora. MTSummit (1) 2021: 293-304 - [c188]Shuoyang Ding, Philipp Koehn:
Evaluating Saliency Methods for Neural Language Models. NAACL-HLT 2021: 5034-5052 - [c187]Farhad Akhbardeh, Arkady Arkhangorodsky, Magdalena Biesialska, Ondrej Bojar, Rajen Chatterjee, Vishrav Chaudhary, Marta R. Costa-jussà, Cristina España-Bonet, Angela Fan, Christian Federmann, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Leonie Harter, Kenneth Heafield, Christopher Homan, Matthias Huck, Kwabena Amponsah-Kaakyire, Jungo Kasai, Daniel Khashabi, Kevin Knight, Tom Kocmi, Philipp Koehn, Nicholas Lourie, Christof Monz, Makoto Morishita, Masaaki Nagata, Ajay Nagesh, Toshiaki Nakazawa, Matteo Negri, Santanu Pal, Allahsera Auguste Tapo, Marco Turchi, Valentin Vydrin, Marcos Zampieri:
Findings of the 2021 Conference on Machine Translation (WMT21). WMT@EMNLP 2021: 1-88 - [c186]Chau Tran, Shruti Bhosale, James Cross, Philipp Koehn, Sergey Edunov, Angela Fan:
Facebook AI's WMT21 News Translation Task Submission. WMT@EMNLP 2021: 205-215 - [c185]Md Mahfuz Ibn Alam, Ivana Kvapilíková, Antonios Anastasopoulos, Laurent Besacier, Georgiana Dinu, Marcello Federico, Matthias Gallé, Kweon Woo Jung, Philipp Koehn, Vassilina Nikoulina:
Findings of the WMT Shared Task on Machine Translation Using Terminologies. WMT@EMNLP 2021: 652-663 - [c184]Shuoyang Ding, Marcin Junczys-Dowmunt, Matt Post, Christian Federmann, Philipp Koehn:
The JHU-Microsoft Submission for WMT21 Quality Estimation Shared Task. WMT@EMNLP 2021: 904-910 - [c183]Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur:
Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora. WMT@EMNLP 2021: 1100-1109 - [e18]Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Markus Freitag, Yvette Graham, Roman Grundkiewicz, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, Tom Kocmi, André Martins, Makoto Morishita, Christof Monz:
Proceedings of the Sixth Conference on Machine Translation, WMT@EMNLP 2021, Online Event, November 10-11, 2021. Association for Computational Linguistics 2021, ISBN 978-1-954085-94-7 [contents] - [i36]Haoran Xu, Philipp Koehn:
Zero-Shot Cross-Lingual Dependency Parsing through Contextual Embedding Transformation. CoRR abs/2103.02212 (2021) - [i35]Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur:
Learning Policies for Multilingual Training of Neural Machine Translation Systems. CoRR abs/2103.06964 (2021) - [i34]Gaurav Kumar, Philipp Koehn, Sanjeev Khudanpur:
Learning Feature Weights using Reward Modeling for Denoising Parallel Corpora. CoRR abs/2103.06968 (2021) - [i33]Shuoyang Ding, Philipp Koehn:
Evaluating Saliency Methods for Neural Language Models. CoRR abs/2104.05824 (2021) - [i32]Ahmed El-Kishky, Adi Renduchintala, James Cross, Francisco Guzmán, Philipp Koehn:
XLEnt: Mining a Large Cross-lingual Entity Dataset with Lexical-Semantic-Phonetic Word Alignment. CoRR abs/2104.08597 (2021) - [i31]Kelly Marchisio, Conghao Xiong, Philipp Koehn:
Embedding-Enhanced Giza++: Improving Alignment in Low- and High- Resource Scenarios Using Embedding Space Geometry. CoRR abs/2104.08721 (2021) - [i30]Wei-Jen Ko, Ahmed El-Kishky, Adithya Renduchintala, Vishrav Chaudhary, Naman Goyal, Francisco Guzmán, Pascale Fung, Philipp Koehn, Mona T. Diab:
Adapting High-resource NMT Models to Translate Low-resource Related Languages without Parallel Data. CoRR abs/2105.15071 (2021) - [i29]Md Mahfuz Ibn Alam, Antonios Anastasopoulos, Laurent Besacier, James Cross, Matthias Gallé, Philipp Koehn, Vassilina Nikoulina:
On the Evaluation of Machine Translation for Terminology Consistency. CoRR abs/2106.11891 (2021) - [i28]Haoran Xu, Philipp Koehn:
Cross-Lingual BERT Contextual Embedding Space Mapping with Isotropic and Isometric Conditions. CoRR abs/2107.09186 (2021) - [i27]Chau Tran, Shruti Bhosale, James Cross, Philipp Koehn, Sergey Edunov, Angela Fan:
Facebook AI WMT21 News Translation Task Submission. CoRR abs/2108.03265 (2021) - [i26]Shuoyang Ding, Marcin Junczys-Dowmunt, Matt Post, Philipp Koehn:
Levenshtein Training for Word-level Quality Estimation. CoRR abs/2109.05611 (2021) - [i25]Shuoyang Ding, Marcin Junczys-Dowmunt, Matt Post, Christian Federmann, Philipp Koehn:
The JHU-Microsoft Submission for WMT21 Quality Estimation Shared Task. CoRR abs/2109.08724 (2021) - [i24]Kelly Marchisio, Youngser Park, Ali Saad-Eldin, Anton Alyakin, Kevin Duh, Carey E. Priebe, Philipp Koehn:
An Analysis of Euclidean vs. Graph-Based Framing for Bilingual Lexicon Induction from Word Embedding Spaces. CoRR abs/2109.12640 (2021) - [i23]Weiting Tan, Shuoyang Ding, Huda Khayrallah, Philipp Koehn:
Doubly-Trained Adversarial Data Augmentation for Neural Machine Translation. CoRR abs/2110.05691 (2021) - [i22]Simeng Sun, Angela Fan, James Cross, Vishrav Chaudhary, Chau Tran, Philipp Koehn, Francisco Guzmán:
Alternative Input Signals Ease Transfer in Multilingual Machine Translation. CoRR abs/2110.07804 (2021) - 2020
- [c182]Marta Bañón, Pinzhen Chen, Barry Haddow, Kenneth Heafield, Hieu Hoang, Miquel Esplà-Gomis, Mikel L. Forcada, Amir Kamran, Faheem Kirefu, Philipp Koehn, Sergio Ortiz-Rojas, Leopoldo Pla Sempere, Gema Ramírez-Sánchez, Elsa Sarrías, Marek Strelec, Brian Thompson, William Waites, Dion Wiggins, Jaume Zaragoza:
ParaCrawl: Web-Scale Acquisition of Parallel Corpora. ACL 2020: 4555-4567 - [c181]Denise Díaz, James Cross, Vishrav Chaudhary, Ahmed El-Kishky, Philipp Koehn:
A Survey of Qualitative Error Analysis for Neural Machine Translation Systems. AMTA (2) 2020: 48-77 - [c180]Philipp Koehn, Barry Haddow:
Interpolated Backoff for Factored Translation Models. AMTA 2020 - [c179]Antonios Anastasopoulos, Alessandro Cattelan, Zi-Yi Dou, Marcello Federico, Christian Federmann, Dmitriy Genzel, Francisco Guzmán, Junjie Hu, Macduff Hughes, Philipp Koehn, Rosie Lazar, William Lewis, Graham Neubig, Mengmeng Niu, Alp Öktem, Eric Paquin, Grace Tang, Sylwia Tur:
TICO-19: the Translation Initiative for COvid-19. NLP4COVID@EMNLP 2020 - [c178]Yvette Graham, Barry Haddow, Philipp Koehn:
Statistical Power and Translationese in Machine Translation Evaluation. EMNLP (1) 2020: 72-81 - [c177]Huda Khayrallah, Brian Thompson, Matt Post, Philipp Koehn:
Simulated multiple reference training improves low-resource machine translation. EMNLP (1) 2020: 82-89 - [c176]Ahmed El-Kishky, Vishrav Chaudhary, Francisco Guzmán, Philipp Koehn:
CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs. EMNLP (1) 2020: 5960-5969 - [c175]Brian Thompson, Philipp Koehn:
Exploiting Sentence Order in Document Alignment. EMNLP (1) 2020: 5997-6007 - [c174]Xutai Ma, Juan Miguel Pino, Philipp Koehn:
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation. AACL/IJCNLP 2020: 582-587 - [c173]Ahmed El-Kishky, Philipp Koehn, Holger Schwenk:
Searching the Web for Cross-lingual Parallel Data. SIGIR 2020: 2417-2420 - [c172]Loïc Barrault, Magdalena Biesialska, Ondrej Bojar, Marta R. Costa-jussà, Christian Federmann, Yvette Graham, Roman Grundkiewicz, Barry Haddow, Matthias Huck, Eric Joanis, Tom Kocmi, Philipp Koehn, Chi-kiu Lo, Nikola Ljubesic, Christof Monz, Makoto Morishita, Masaaki Nagata, Toshiaki Nakazawa, Santanu Pal, Matt Post, Marcos Zampieri:
Findings of the 2020 Conference on Machine Translation (WMT20). WMT@EMNLP 2020: 1-55 - [c171]Lucia Specia, Zhenhao Li, Juan Miguel Pino, Vishrav Chaudhary, Francisco Guzmán, Graham Neubig, Nadir Durrani, Yonatan Belinkov, Philipp Koehn, Hassan Sajjad, Paul Michel, Xian Li:
Findings of the WMT 2020 Shared Task on Machine Translation Robustness. WMT@EMNLP 2020: 76-91 - [c170]Kelly Marchisio, Kevin Duh, Philipp Koehn:
When Does Unsupervised Machine Translation Work? WMT@EMNLP 2020: 571-583 - [c169]Philipp Koehn, Vishrav Chaudhary, Ahmed El-Kishky, Naman Goyal, Peng-Jen Chen, Francisco Guzmán:
Findings of the WMT 2020 Shared Task on Parallel Corpus Filtering and Alignment. WMT@EMNLP 2020: 726-742 - [c168]Ankur Kejriwal, Philipp Koehn:
An exploratory approach to the Parallel Corpus Filtering shared task WMT20. WMT@EMNLP 2020: 959-965 - [c167]Felicia Koerner, Philipp Koehn:
Dual Conditional Cross Entropy Scores and LASER Similarity Scores for the WMT20 Parallel Corpus Filtering Shared Task. WMT@EMNLP 2020: 966-971 - [e17]Loïc Barrault, Ondrej Bojar, Fethi Bougares, Rajen Chatterjee, Marta R. Costa-jussà, Christian Federmann, Mark Fishel, Alexander Fraser, Yvette Graham, Paco Guzman, Barry Haddow, Matthias Huck, Antonio Jimeno-Yepes, Philipp Koehn, André Martins, Makoto Morishita, Christof Monz, Masaaki Nagata, Toshiaki Nakazawa, Matteo Negri:
Proceedings of the Fifth Conference on Machine Translation, WMT@EMNLP 2020, Online, November 19-20, 2020. Association for Computational Linguistics 2020, ISBN 978-1-948087-81-0 [contents] - [i21]Kelly Marchisio, Kevin Duh, Philipp Koehn:
When Does Unsupervised Machine Translation Work? CoRR abs/2004.05516 (2020) - [i20]Brian Thompson, Philipp Koehn:
Exploiting Sentence Order in Document Alignment. CoRR abs/2004.14523 (2020) - [i19]Huda Khayrallah, Brian Thompson, Matt Post, Philipp Koehn:
Simulated Multiple Reference Training Improves Low-Resource Machine Translation. CoRR abs/2004.14524 (2020) - [i18]Antonios Anastasopoulos, Alessandro Cattelan, Zi-Yi Dou, Marcello Federico, Christian Federmann, Dmitriy Genzel, Francisco Guzmán, Junjie Hu, Macduff Hughes, Philipp Koehn, Rosie Lazar, William Lewis, Graham Neubig, Mengmeng Niu, Alp Öktem, Eric Paquin, Grace Tang, Sylwia Tur:
TICO-19: the Translation Initiative for Covid-19. CoRR abs/2007.01788 (2020) - [i17]Xutai Ma, Yongqiang Wang, Mohammad Javad Dousti, Philipp Koehn, Juan Miguel Pino:
Streaming Simultaneous Speech Translation with Augmented Memory Transformer. CoRR abs/2011.00033 (2020) - [i16]Xutai Ma, Juan Miguel Pino, Philipp Koehn:
SimulMT to SimulST: Adapting Simultaneous Text Translation to End-to-End Simultaneous Speech Translation. CoRR abs/2011.02048 (2020)
2010 – 2019
- 2019
- [j13]Rebecca Knowles, Marina Sanchez-Torron, Philipp Koehn:
A user study of neural interactive translation prediction. Mach. Transl. 33(1-2): 135-154 (2019) - [c166]Yash Kumar Lal, Vaibhav Kumar, Mrinal Dhar, Manish Shrivastava, Philipp Koehn:
De-Mixing Sentiment from Code-Mixed Text. ACL (2) 2019: 371-377 - [c165]Shuoyang Ding, Philipp Koehn:
Parallelizable Stack Long Short-Term Memory. SPNLP@NAACL-HLT 2019: 1-6 - [c164]Adithya Renduchintala, Philipp Koehn, Jason Eisner:
Simple Construction of Mixed-Language Texts for Vocabulary Learning. BEA@ACL 2019: 369-379 - [c163]Brian Thompson, Philipp Koehn:
Vecalign: Improved Sentence Alignment in Linear Time and Space. EMNLP/IJCNLP (1) 2019: 1342-1348 - [c162]Brian Thompson, Rebecca Knowles, Xuan Zhang, Huda Khayrallah, Kevin Duh, Philipp Koehn:
HABLex: Human Annotated Bilingual Lexicons for Experiments in Machine Translation. EMNLP/IJCNLP (1) 2019: 1382-1387 - [c161]