


Остановите войну!
for scientists:


default search action
Jimmy Lin
Person information

- affiliation: University of Waterloo, David R. Cheriton School of Computer Science
- affiliation: Twitter Inc., San Francisco, USA
- affiliation: University of Maryland, College Park, Institute for Advanced Computer Studies (UMIACS)
- affiliation: Massachusetts Institute of Technology (MIT), Artificial Intelligence Laboratory
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2023
- [c320]Ronak Pradeep, Haonan Chen, Lingwei Gu, Manveer Singh Tamber, Jimmy Lin:
PyGaggle: A Gaggle of Resources for Open-Domain Question Answering. ECIR (3) 2023: 148-162 - [c319]Manveer Singh Tamber, Ronak Pradeep, Jimmy Lin:
Pre-processing Matters! Improved Wikipedia Corpora for Open-Domain Question Answering. ECIR (3) 2023: 163-176 - [i142]Shi Zong, Josh Seltzer, Jiahua Pan, Kathy Cheng, Jimmy Lin:
Which Model Shall I Choose? Cost/Quality Trade-offs for Text Classification Tasks. CoRR abs/2301.07006 (2023) - [i141]Minghan Li, Sheng-Chieh Lin, Xueguang Ma, Jimmy Lin:
SLIM: Sparsified Late Interaction for Multi-Vector Retrieval with Inverted Indexes. CoRR abs/2302.06587 (2023) - [i140]Xinyu Zhang, Minghan Li, Jimmy Lin:
Improving Out-of-Distribution Generalization of Neural Rerankers with Contextualized Late Interaction. CoRR abs/2302.06589 (2023) - [i139]Sheng-Chieh Lin, Akari Asai, Minghan Li, Barlas Oguz, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
How to Train Your DRAGON: Diverse Augmentation Towards Generalizable Dense Retrieval. CoRR abs/2302.07452 (2023) - [i138]Christopher Akiki, Odunayo Ogundepo, Aleksandra Piktus, Xinyu Zhang, Akintunde Oladipo, Jimmy Lin, Martin Potthast:
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face. CoRR abs/2302.14534 (2023) - [i137]Jimmy Lin, David Alfonso-Hermelo, Vitor Jeronymo, Ehsan Kamalloo, Carlos Lassance, Rodrigo Frassetto Nogueira, Odunayo Ogundepo, Mehdi Rezagholizadeh, Nandan Thakur, Jheng-Hong Yang, Xinyu Zhang:
Simple Yet Effective Neural Ranking and Reranking Baselines for Cross-Lingual Information Retrieval. CoRR abs/2304.01019 (2023) - [i136]Jheng-Hong Yang, Carlos Lassance, Rafael Sampaio de Rezende, Krishna Srinivasan, Miriam Redi, Stéphane Clinchant, Jimmy Lin:
AToMiC: An Image/Text Retrieval Test Collection to Support Multimedia Content Creation. CoRR abs/2304.01961 (2023) - [i135]Xueguang Ma, Tommaso Teofili, Jimmy Lin:
Anserini Gets Dense Retrieval: Integration of Lucene's HNSW Indexes. CoRR abs/2304.12139 (2023) - [i134]Xueguang Ma, Xinyu Zhang, Ronak Pradeep, Jimmy Lin:
Zero-Shot Listwise Document Reranking with a Large Language Model. CoRR abs/2305.02156 (2023) - [i133]Ehsan Kamalloo, Xinyu Zhang, Odunayo Ogundepo, Nandan Thakur, David Alfonso-Hermelo, Mehdi Rezagholizadeh, Jimmy Lin:
Evaluating Embedding APIs for Information Retrieval. CoRR abs/2305.06300 (2023) - [i132]Josh Seltzer, Jiahua Pan, Kathy Cheng, Yuxiao Sun, Santosh Kolagati, Jimmy Lin, Shi Zong:
SmartProbe: A Virtual Moderator for Market Research Surveys. CoRR abs/2305.08271 (2023) - [i131]Ronak Pradeep, Kai Hui, Jai Gupta, Ádám Dániel Lelkes, Honglei Zhuang, Jimmy Lin, Donald Metzler, Vinh Q. Tran:
How Does Generative Retrieval Scale to Millions of Passages? CoRR abs/2305.11841 (2023) - [i130]Vanessa Liao, Syed Shariyar Murtaza, Yifan Nie, Jimmy Lin:
Regex-augmented Domain Transfer Topic Classification based on a Pre-trained Language Model: An application in Financial Domain. CoRR abs/2305.18324 (2023) - 2022
- [c318]Sankeerth Durvasula, Raymond Kiguru, Samarth Mathur, Jenny Xu, Jimmy Lin, Nandita Vijaykumar:
VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks. PACT 2022: 239-251 - [c317]Hang Li
, Shengyao Zhuang
, Xueguang Ma
, Jimmy Lin
, Guido Zuccon
:
Pseudo-Relevance Feedback with Dense Retrievers in Pyserini. ADCS 2022: 1:1-1:6 - [c316]Wei Zhong, Yuqing Xie, Jimmy Lin:
Applying Structural and Dense Semantic Matching for the ARQMath Lab 2022, CLEF. CLEF (Working Notes) 2022: 147-170 - [c315]Hang Li
, Shengyao Zhuang
, Ahmed Mourad
, Xueguang Ma, Jimmy Lin
, Guido Zuccon
:
Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study. ECIR (1) 2022: 599-612 - [c314]Xueguang Ma, Kai Sun, Ronak Pradeep, Minghan Li, Jimmy Lin:
Another Look at DPR: Reproduction of Training and Replication of Retrieval. ECIR (1) 2022: 613-626 - [c313]Ronak Pradeep, Yuqi Liu, Xinyu Zhang, Yilin Li, Andrew Yates, Jimmy Lin:
Squeezing Water from a Stone: A Bag of Tricks for Further Improving Cross-Encoder Effectiveness for Reranking. ECIR (1) 2022: 655-670 - [c312]Raphael Tang, Karun Kumar, Gefei Yang, Akshat Pandey, Yajie Mao, Vladislav Belyaev, Madhuri Emmadi, G. Craig Murray, Ferhan Ture, Jimmy Lin:
SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale. EMNLP (Industry Track) 2022: 285-293 - [c311]Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin:
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. EMNLP 2022: 333-345 - [c310]Yizhen Zhong, Jiajie Xiao, Thomas Vetterli, Mahan Matin, Ellen Loo, Jimmy Lin, Richard Bourgon, Ofer Shapira:
Improving Precancerous Case Characterization via Transformer-based Ensemble Learning. EMNLP (Industry Track) 2022: 379-389 - [c309]Wei Zhong, Jheng-Hong Yang, Yuqing Xie, Jimmy Lin:
Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval. EMNLP (Findings) 2022: 1092-1102 - [c308]Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing. EMNLP (Findings) 2022: 5248-5259 - [c307]Peng Shi, Linfeng Song, Lifeng Jin, Haitao Mi, He Bai, Jimmy Lin, Dong Yu:
Cross-lingual Text-to-SQL Semantic Parsing with Representation Mixup. EMNLP (Findings) 2022: 5296-5306 - [c306]Odunayo Ogundepo, Xinyu Zhang, Shuo Sun, Kevin Duh, Jimmy Lin:
AfriCLIRMatrix: Enabling Cross-Lingual Information Retrieval for African Languages. EMNLP 2022: 8721-8728 - [c305]Raphael Tang, Karun Kumar, Ji Xin, Piyush Vyas, Wenyan Li, Gefei Yang, Yajie Mao, G. Craig Murray, Jimmy Lin:
Temporal Early Exiting for Streaming Speech Commands Recognition. ICASSP 2022: 7567-7571 - [c304]Matthew Y. R. Yang, Siwen Yang, Jimmy Lin:
Integration of text and geospatial search for hydrographic datasets using the lucene search library. JCDL 2022: 36 - [c303]Zhiying Jiang, Yiqin Dai, Ji Xin, Ming Li, Jimmy Lin:
Few-Shot Non-Parametric Learning with Deep Latent Variable Model. NeurIPS 2022 - [c302]Ronak Pradeep, Yilin Li, Yuetong Wang, Jimmy Lin:
Neural Query Synthesis and Domain-Specific Ranking Templates for Multi-Stage Clinical Trial Matching. SIGIR 2022: 2325-2330 - [c301]Hang Li, Shuai Wang, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
To Interpolate or not to Interpolate: PRF, Dense and Sparse Retrievers. SIGIR 2022: 2495-2500 - [c300]Yuqi Liu, Chengcheng Hu, Jimmy Lin:
Another Look at Information Retrieval as Statistical Translation. SIGIR 2022: 2749-2754 - [c299]Jimmy Lin, Daniel Campos, Nick Craswell, Bhaskar Mitra, Emine Yilmaz:
Fostering Coopetition While Plugging Leaks: The Design and Implementation of the MS MARCO Leaderboards. SIGIR 2022: 2939-2948 - [c298]Ellen M. Voorhees, Nick Craswell, Jimmy Lin:
Too Many Relevants: Whither Cranfield Test Collections? SIGIR 2022: 2970-2980 - [c297]Xueguang Ma, Ronak Pradeep, Rodrigo Frassetto Nogueira, Jimmy Lin:
Document Expansion Baselines and Learned Sparse Lexical Representations for MS MARCO V1 and V2. SIGIR 2022: 3187-3197 - [c296]Andrew Trotman, Joel Mackenzie, Pradeesh Parameswaran, Jimmy Lin:
A Common Framework for Exploring Document-at-a-Time and Score-at-a-Time Retrieval Methods. SIGIR 2022: 3229-3234 - [c295]Josh Seltzer, Kathy Cheng, Shi Zong, Jimmy Lin:
Flipping the Script: Inverse Information Seeking Dialogues for Market Research. SIGIR 2022: 3380-3383 - [c294]Josh Devins, Julie Tibshirani, Jimmy Lin:
Aligning the Research and Practice of Building Search Applications: Elasticsearch and Pyserini. WSDM 2022: 1573-1576 - [i129]Ellen M. Voorhees, Ian Soboroff, Jimmy Lin:
Can Old TREC Collections Reliably Evaluate Modern Neural Retrieval Models? CoRR abs/2201.11086 (2022) - [i128]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Tevatron: An Efficient and Flexible Toolkit for Dense Retrieval. CoRR abs/2203.05765 (2022) - [i127]Wei Zhong, Jheng-Hong Yang, Jimmy Lin:
Evaluating Token-Level and Passage-Level Dense Retrieval Models for Math Information Retrieval. CoRR abs/2203.11163 (2022) - [i126]Xinyu Zhang, Kelechi Ogueji, Xueguang Ma, Jimmy Lin:
Towards Best Practices for Training Multilingual Dense Retrieval Models. CoRR abs/2204.02363 (2022) - [i125]Hang Li, Shuai Wang, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
To Interpolate or not to Interpolate: PRF, Dense and Sparse Retrievers. CoRR abs/2205.00235 (2022) - [i124]Minghan Li, Xinyu Zhang, Ji Xin, Hongyang Zhang, Jimmy Lin:
Certified Error Control of Candidate Set Pruning for Two-Stage Relevance Ranking. CoRR abs/2205.09638 (2022) - [i123]Nandan Thakur, Nils Reimers, Jimmy Lin:
Domain Adaptation for Memory-Efficient Dense Retrieval. CoRR abs/2205.11498 (2022) - [i122]Sheng-Chieh Lin, Jimmy Lin:
A Dense Representation Framework for Lexical and Semantic Matching. CoRR abs/2206.09912 (2022) - [i121]Zhiying Jiang, Yiqin Dai, Ji Xin, Ming Li, Jimmy Lin:
Few-Shot Non-Parametric Learning with Deep Latent Variable Model. CoRR abs/2206.11573 (2022) - [i120]Ji Xin, Raphael Tang, Zhiying Jiang, Yaoliang Yu, Jimmy Lin:
Building an Efficiency Pipeline: Commutativity and Cumulativeness of Efficiency Operators for Transformers. CoRR abs/2208.00483 (2022) - [i119]Sheng-Chieh Lin, Minghan Li, Jimmy Lin:
Aggretriever: A Simple Approach to Aggregate Textual Representation for Robust Dense Passage Retrieval. CoRR abs/2208.00511 (2022) - [i118]Raphael Tang, Akshat Pandey, Zhiying Jiang, Gefei Yang, Karun Kumar, Jimmy Lin, Ferhan Ture:
What the DAAM: Interpreting Stable Diffusion Using Cross Attention. CoRR abs/2210.04885 (2022) - [i117]Odunayo Ogundepo, Xinyu Zhang, Jimmy Lin:
Better Than Whitespace: Information Retrieval for Languages without Custom Tokenizers. CoRR abs/2210.05481 (2022) - [i116]Linqing Liu, Minghan Li, Jimmy Lin, Sebastian Riedel, Pontus Stenetorp:
Query Expansion Using Contextual Clue Sampling with Language Models. CoRR abs/2210.07093 (2022) - [i115]Sankeerth Durvasula, Raymond Kiguru, Samarth Mathur, Jenny Xu, Jimmy Lin, Nandita Vijaykumar:
VoxelCache: Accelerating Online Mapping in Robotics and 3D Reconstruction Tasks. CoRR abs/2210.08729 (2022) - [i114]Xinyu Zhang, Nandan Thakur, Odunayo Ogundepo, Ehsan Kamalloo, David Alfonso-Hermelo, Xiaoguang Li, Qun Liu, Mehdi Rezagholizadeh, Jimmy Lin:
Making a MIRACL: Multilingual Information Retrieval Across a Continuum of Languages. CoRR abs/2210.09984 (2022) - [i113]Peng Shi, Rui Zhang, He Bai
, Jimmy Lin:
XRICL: Cross-lingual Retrieval-Augmented In-Context Learning for Cross-lingual Text-to-SQL Semantic Parsing. CoRR abs/2210.13693 (2022) - [i112]Jimmy Lin:
On the Interaction Between Differential Privacy and Gradient Compression in Deep Learning. CoRR abs/2211.00734 (2022) - [i111]Minghan Li, Sheng-Chieh Lin, Barlas Oguz, Asish Ghoshal, Jimmy Lin, Yashar Mehdad, Wen-tau Yih, Xilun Chen:
CITADEL: Conditional Token Interaction via Dynamic Lexical Routing for Efficient and Effective Multi-Vector Retrieval. CoRR abs/2211.10411 (2022) - [i110]Raphael Tang, Karun Kumar, Gefei Yang, Akshat Pandey, Yajie Mao, Vladislav Belyaev, Madhuri Emmadi, G. Craig Murray, Ferhan Ture, Jimmy Lin:
SpeechNet: Weakly Supervised, End-to-End Speech Recognition at Industrial Scale. CoRR abs/2211.11740 (2022) - [i109]Yizhen Zhong, Jiajie Xiao, Thomas Vetterli, Mahan Matin, Ellen Loo, Jimmy Lin, Richard Bourgon, Ofer Shapira:
Improving Precancerous Case Characterization via Transformer-based Ensemble Learning. CoRR abs/2212.05150 (2022) - [i108]Zhiying Jiang, Matthew Y. R. Yang, Mikhail Tsirlin, Raphael Tang, Jimmy Lin:
Less is More: Parameter-Free Text Classification with Gzip. CoRR abs/2212.09410 (2022) - [i107]Luyu Gao, Xueguang Ma, Jimmy Lin, Jamie Callan:
Precise Zero-Shot Dense Retrieval without Relevance Labels. CoRR abs/2212.10496 (2022) - [i106]Jimmy Lin:
Building a Culture of Reproducibility in Academic Research. CoRR abs/2212.13534 (2022) - 2021
- [b3]Jimmy Lin, Rodrigo Frassetto Nogueira
, Andrew Yates:
Pretrained Transformers for Text Ranking: BERT and Beyond. Synthesis Lectures on Human Language Technologies, Morgan & Claypool Publishers 2021, pp. 1-325 - [j61]Samantha Fritz, Ian Milligan, Nick Ruest, Jimmy Lin:
Fostering Community Engagement through Datathon Events: The Archives Unleashed Experience. Digit. Humanit. Q. 15(1) (2021) - [j60]Martin Gauch
, Juliane Mai
, Jimmy Lin:
The proper care and feeding of CAMELS: How limited training data affects streamflow prediction. Environ. Model. Softw. 135: 104926 (2021) - [j59]Jimmy Lin:
A proposed conceptual framework for a representational approach to information retrieval. SIGIR Forum 55(2): 4:1-4:29 (2021) - [j58]Sheng-Chieh Lin, Jheng-Hong Yang, Rodrigo Frassetto Nogueira
, Ming-Feng Tsai, Chuan-Ju Wang, Jimmy Lin:
Multi-Stage Conversational Passage Retrieval: An Approach to Fusing Term Importance Estimation and Neural Query Rewriting. ACM Trans. Inf. Syst. 39(4): 48:1-48:29 (2021) - [c293]He Bai, Peng Shi, Jimmy Lin, Yuqing Xie, Luchen Tan, Kun Xiong, Wen Gao, Ming Li:
Segatron: Segment-Aware Transformer for Language Modeling and Understanding. AAAI 2021: 12526-12534 - [c292]He Bai
, Peng Shi, Jimmy Lin, Luchen Tan, Kun Xiong, Wen Gao, Jie Liu, Ming Li:
Semantics of the Unwritten: The Effect of End of Paragraph and Sequence Tokens on Text Generation with GPT2. ACL (student) 2021: 148-162 - [c291]Kelvin Jiang, Ronak Pradeep, Jimmy Lin:
Exploring Listwise Evidence Reasoning with T5 for Fact Verification. ACL/IJCNLP (2) 2021: 402-410 - [c290]Ji Xin, Raphael Tang, Yaoliang Yu, Jimmy Lin:
The Art of Abstention: Selective Prediction and Error Regularization for Natural Language Processing. ACL/IJCNLP (1) 2021: 1040-1051 - [c289]Ronak Pradeep, Xueguang Ma, Rodrigo Frassetto Nogueira, Jimmy Lin:
Scientific Claim Verification with VerT5erini. LOUHI@EACL 2021: 94-103 - [c288]Zhiying Jiang, Raphael Tang, Ji Xin, Jimmy Lin:
How Does BERT Rerank Passages? An Attribution Analysis with Information Bottlenecks. BlackboxNLP@EMNLP 2021: 496-509 - [c287]Wei Zhong, Xinyu Zhang, Ji Xin, Richard Zanibbi, Jimmy Lin:
Approach Zero and Anserini at the CLEF-2021 ARQMath Track: Applying Substructure Search and BM25 on Operator Tree Path Tokens. CLEF (Working Notes) 2021: 133-156 - [c286]Mayank Anand, Jiarui Zhang, Shane Ding, Ji Xin, Jimmy Lin:
Serverless BM25 Search and BERT Reranking. DESIRES 2021: 3-9 - [c285]Jimmy Lin, Xueguang Ma, Joel Mackenzie, Antonio Mallia:
On the Separation of Logical and Physical Ranking Models for Text Retrieval Applications. DESIRES 2021: 176-178 - [c284]Ogundepo Odunayo, Naveela N. Sookoo, Gautam Bathla, Anthony Cavallin, Bhaleka D. Persaud, Kathy Szigeti
, Philippe Van Cappellen, Jimmy Lin:
Rescuing historical climate observations to support hydrological research: a case study of solar radiation data. DocEng 2021: 19:1-19:4 - [c283]Ji Xin, Raphael Tang, Yaoliang Yu, Jimmy Lin:
BERxiT: Early Exiting for BERT with Better Fine-Tuning and Extension to Regression. EACL 2021: 91-104 - [c282]Mohan Zhang, Luchen Tan, Zihang Fu, Kun Xiong, Jimmy Lin, Ming Li, Zhengkai Tu:
Don't Change Me! User-Controllable Selective Paraphrase Generation. EACL 2021: 3522-3527 - [c281]Xinyu Zhang, Andrew Yates, Jimmy Lin:
Comparing Score Aggregation Approaches for Document Retrieval with Pretrained Transformers. ECIR (2) 2021: 150-163 - [c280]Yue Zhang, Chengcheng Hu, Yuqi Liu, Hui Fang, Jimmy Lin:
Learning to Rank in the Age of Muppets: Effectiveness-Efficiency Tradeoffs in Multi-Stage Ranking. SustaiNLP@EMNLP 2021: 64-73 - [c279]Minghan Li, Ming Li, Kun Xiong, Jimmy Lin:
Multi-Task Dense Retrieval via Model Uncertainty Fusion for Open-Domain Question Answering. EMNLP (Findings) 2021: 274-287 - [c278]Raphael Tang, Karun Kumar, Kendra Chalkley, Ji Xin, Liming Zhang, Wenyan Li, Gefei Yang, Yajie Mao, Junho Shin, Geoffrey Craig Murray, Jimmy Lin:
Voice Query Auto Completion. EMNLP (1) 2021: 900-906 - [c277]Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
Contextualized Query Embeddings for Conversational Search. EMNLP (1) 2021: 1004-1015 - [c276]Xueguang Ma, Minghan Li, Kai Sun, Ji Xin, Jimmy Lin:
Simple and Effective Unsupervised Redundancy Elimination to Compress Dense Vectors for Passage Retrieval. EMNLP (1) 2021: 2854-2859 - [c275]Anup Anand Deshmukh, Qianqiu Zhang, Ming Li, Jimmy Lin, Lili Mou:
Unsupervised Chunking as Syntactic Structure Induction with a Knowledge-Transfer Approach. EMNLP (Findings) 2021: 3626-3634 - [c274]Xiao Han, Yuqi Liu, Jimmy Lin:
The Simplest Thing That Can Possibly Work: (Pseudo-)Relevance Feedback via Text Classification. ICTIR 2021: 123-129 - [c273]Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
In-Batch Negatives for Knowledge Distillation with Tightly-Coupled Teachers for Dense Retrieval. RepL4NLP@ACL-IJCNLP 2021: 163-173 - [c272]Sebastian Hofstätter, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin, Allan Hanbury:
Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. SIGIR 2021: 113-122 - [c271]Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos, Jimmy Lin:
MS MARCO: Benchmarking Ranking Models in the Large-Data Regime. SIGIR 2021: 1566-1576 - [c270]Ronak Pradeep, Xueguang Ma, Rodrigo Frassetto Nogueira, Jimmy Lin:
Vera: Prediction Techniques for Reducing Harmful Misinformation in Consumer Health Search. SIGIR 2021: 2066-2070 - [c269]Jimmy Lin, Daniel Campos, Nick Craswell, Bhaskar Mitra, Emine Yilmaz:
Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard. SIGIR 2021: 2283-2287 - [c268]Jimmy Lin, Xueguang Ma, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, Rodrigo Frassetto Nogueira:
Pyserini: A Python Toolkit for Reproducible Information Retrieval Research with Sparse and Dense Representations. SIGIR 2021: 2356-2362 - [c267]Edwin Zhang, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, Rodrigo Frassetto Nogueira, Jimmy Lin:
Chatty Goose: A Python Framework for Conversational Search. SIGIR 2021: 2521-2525 - [c266]Wei Zhong, Jimmy Lin:
PYA0: A Python Toolkit for Accessible Math-Aware Search. SIGIR 2021: 2541-2545 - [c265]Andrew Yates, Rodrigo Frassetto Nogueira, Jimmy Lin:
Pretrained Transformers for Text Ranking: BERT and Beyond. SIGIR 2021: 2666-2668 - [c264]Andrew Yates, Rodrigo Frassetto Nogueira, Jimmy Lin:
Pretrained Transformers for Text Ranking: BERT and Beyond. WSDM 2021: 1154-1156 - [i105]Ronak Pradeep, Rodrigo Frassetto Nogueira, Jimmy Lin:
The Expando-Mono-Duo Design Pattern for Text Ranking with Pretrained Sequence-to-Sequence Models. CoRR abs/2101.05667 (2021) - [i104]Jimmy Lin, Xueguang Ma, Sheng-Chieh Lin, Jheng-Hong Yang, Ronak Pradeep, Rodrigo Frassetto Nogueira:
Pyserini: An Easy-to-Use Python Toolkit to Support Replicable IR Research with Sparse and Dense Representations. CoRR abs/2102.10073 (2021) - [i103]Jimmy Lin, Daniel Campos, Nick Craswell, Bhaskar Mitra, Emine Yilmaz:
Significant Improvements over the State of the Art? A Case Study of the MS MARCO Document Ranking Leaderboard. CoRR abs/2102.12887 (2021) - [i102]Rodrigo Frassetto Nogueira, Zhiying Jiang, Jimmy Lin:
Investigating the Limitations of the Transformers with Simple Arithmetic Tasks. CoRR abs/2102.13019 (2021) - [i101]Xueguang Ma, Kai Sun, Ronak Pradeep, Jimmy Lin:
A Replication Study of Dense Passage Retriever. CoRR abs/2104.05740 (2021) - [i100]Sebastian Hofstätter, Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin, Allan Hanbury:
Efficiently Teaching an Effective Dense Retriever with Balanced Topic Aware Sampling. CoRR abs/2104.06967 (2021) - [i99]Sheng-Chieh Lin, Jheng-Hong Yang, Jimmy Lin:
Contextualized Query Embeddings for Conversational Search. CoRR abs/2104.08707 (2021) - [i98]Nick Craswell, Bhaskar Mitra, Emine Yilmaz, Daniel Campos, Jimmy Lin:
MS MARCO: Benchmarking Ranking Models in the Large-Data Regime. CoRR abs/2105.04021 (2021) - [i97]Jimmy Lin, Xueguang Ma:
A Few Brief Notes on DeepImpact, COIL, and a Conceptual Framework for Information Retrieval Techniques. CoRR abs/2106.14807 (2021) - [i96]Xinyu Zhang, Xueguang Ma, Peng Shi, Jimmy Lin:
Mr. TyDi: A Multi-lingual Benchmark for Dense Retrieval. CoRR abs/2108.08787 (2021) - [i95]Peng Shi, Rui Zhang, He Bai, Jimmy Lin:
Cross-Lingual Training with Dense Retrieval for Document Retrieval. CoRR abs/2109.01628 (2021) - [i94]Jimmy Lin:
A Proposed Conceptual Framework for a Representational Approach to Information Retrieval. CoRR abs/2110.01529 (2021) - [i93]Minghan Li, Jimmy Lin:
Encoder Adaptation of Dense Passage Retrieval for Open-Domain Question Answering. CoRR abs/2110.01599 (2021) - [i92]Joel Mackenzie, Andrew Trotman, Jimmy Lin:
Wacky Weights in Learned Sparse Representations and the Revenge of Score-at-a-Time Query Evaluation. CoRR abs/2110.11540 (2021) - [i91]Sheng-Chieh Lin, Jimmy Lin:
Densifying Sparse Representations for Passage Retrieval by Representational Slicing. CoRR abs/2112.04666 (2021) - [i90]Hang Li, Shengyao Zhuang, Ahmed Mourad, Xueguang Ma, Jimmy Lin, Guido Zuccon:
Improving Query Representations for Dense Retrieval with Pseudo Relevance Feedback: A Reproducibility Study. CoRR abs/2112.06400 (2021)