


Shan Chen 0004
Person information
- affiliation: Harvard Medical School, Mass General Brigham, Artificial Intelligence in Medicine (AIM) Program, Boston, MA, USA
- affiliation: Boston Children's Hospital, Computational Health Informatics Program, Boston, MA, USA
Other persons with the same name
- Shan Chen — disambiguation page
- Shan Chen 0001 — University of Technology Sydney, Innovation and Enterprise Research Laboratory, Australia
- Shan Chen 0002 — Monash University, Australia
- Shan Chen 0003 — Hefei University of Technology, School of Mechanical Engineering, Hefei, China
2020 – today
- 2025
- [j3] Wonjin Yoon, Shan Chen, Yanjun Gao, Zhanzhan Zhao, Dmitriy Dligach, Danielle S. Bitterman, Majid Afshar, Timothy A. Miller: LCD benchmark: long clinical document benchmark on mortality prediction for language models. J. Am. Medical Informatics Assoc. 32(2): 285-295 (2025)
- [c5] João Matos, Shan Chen, Siena Placino, Yingya Li, Juan Carlos Climent Pardo, Daphna Idan, Takeshi Tohyama, David S. Restrepo, Luis Filipe Nakayama, Jose M. M. Pascual-Leone, Guergana K. Savova, Hugo J. W. L. Aerts, Leo Anthony Celi, An-Kwok Ian Wong, Danielle S. Bitterman, Jack Gallifant: WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation. NAACL (Findings) 2025: 7203-7216
- [i21] Jack Gallifant, Shan Chen, Kuleen Sasse, Hugo J. W. L. Aerts, Thomas Hartvigsen, Danielle S. Bitterman: Sparse Autoencoder Features for Classifications and Transferability. CoRR abs/2502.11367 (2025)
- [i20] Yubin Kim, Hyewon Jeong, Shan Chen, Shuyue Stella Li, Mingyu Lu, Kumail Alhamoud, Jimin Mun, Cristina Grau, Minseok Jung, Rodrigo Gameiro, Lizhou Fan, Eugene Park, Tristan Lin, Joonsik Yoon, Wonjin Yoon, Maarten Sap, Yulia Tsvetkov, Paul Liang, Xuhai Xu, Xin Liu, Daniel McDuff, Hyeonhoon Lee, Hae Won Park, Samir Tulebaev, Cynthia Breazeal: Medical Hallucinations in Foundation Models and Their Impact on Healthcare. CoRR abs/2503.05777 (2025)
- [i19] Shan Chen, Pedro Moreira, Yuxin Xiao, Sam Schmidgall, Jeremy L. Warner, Hugo J. W. L. Aerts, Thomas Hartvigsen, Jack Gallifant, Danielle S. Bitterman: MedBrowseComp: Benchmarking Medical Deep Research and Computer Use. CoRR abs/2505.14963 (2025)
- [i18] Yuxin Xiao, Shan Chen, Jack Gallifant, Danielle S. Bitterman, Thomas Hartvigsen, Marzyeh Ghassemi: KScope: A Framework for Characterizing the Knowledge Status of Language Models. CoRR abs/2506.07458 (2025)
- 2024
- [j2] Shan Chen, Yingya Li, Sheng Lu, Hoang Van, Hugo J. W. L. Aerts, Guergana K. Savova, Danielle S. Bitterman: Evaluating the ChatGPT family of models for biomedical reasoning and classification. J. Am. Medical Informatics Assoc. 31(4): 940-948 (2024)
- [j1] Marco Guevara, Shan Chen, Spencer Thomas, Tafadzwa L. Chaunzwa, Idalid Franco, Benjamin H. Kann, Shalini Moningi, Jack M. Qian, Madeleine Goldstein, Susan Harper, Hugo J. W. L. Aerts, Paul J. Catalano, Guergana K. Savova, Raymond H. Mak, Danielle S. Bitterman: Large language models to identify social determinants of health in electronic health records. npj Digit. Medicine 7(1) (2024)
- [c4] Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A. Miller, Danielle S. Bitterman, Matthew M. Churpek, Majid Afshar: When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications? EMNLP (Findings) 2024: 5414-5428
- [c3] Jack Gallifant, Shan Chen, Pedro Moreira, Nikolaj Munch, Mingye Gao, Jackson Pond, Leo Anthony Celi, Hugo J. W. L. Aerts, Thomas Hartvigsen, Danielle S. Bitterman: Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks. EMNLP (Findings) 2024: 12448-12465
- [c2] Shan Chen, Jack Gallifant, Mingye Gao, Pedro Moreira, Nikolaj Munch, Ajay Muthukkumar, Arvind Rajan, Jaya Kolluri, Amelia Fiske, Janna Hastings, Hugo J. W. L. Aerts, Brian Anthony, Leo Anthony Celi, William G. La Cava, Danielle S. Bitterman: Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias. NeurIPS 2024
- [i17] Shan Chen, Jack Gallifant, Marco Guevara, Yanjun Gao, Majid Afshar, Timothy Miller, Dmitriy Dligach, Danielle S. Bitterman: Improving Clinical NLP Performance through Language Model-Generated Synthetic Clinical Data. CoRR abs/2403.19511 (2024)
- [i16] Shan Chen, Jack Gallifant, Mingye Gao, Pedro Moreira, Nikolaj Munch, Ajay Muthukkumar, Arvind Rajan, Jaya Kolluri, Amelia Fiske, Janna Hastings, Hugo J. W. L. Aerts, Brian Anthony, Leo Anthony Celi, William G. La Cava, Danielle S. Bitterman: Cross-Care: Assessing the Healthcare Implications of Pre-training Data on Language Model Bias. CoRR abs/2405.05506 (2024)
- [i15] Jack Gallifant, Shan Chen, Pedro Moreira, Nikolaj Munch, Mingye Gao, Jackson Pond, Leo Anthony Celi, Hugo J. W. L. Aerts, Thomas Hartvigsen, Danielle S. Bitterman: Language Models are Surprisingly Fragile to Drug Names in Biomedical Benchmarks. CoRR abs/2406.12066 (2024)
- [i14] Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A. Miller, Danielle S. Bitterman, Matthew M. Churpek, Majid Afshar: When Raw Data Prevails: Are Large Language Model Embeddings Effective in Numerical Data Representation for Medical Machine Learning Applications? CoRR abs/2408.11854 (2024)
- [i13] Huizi Yu, Jiayan Zhou, Lingyao Li, Shan Chen, Jack Gallifant, Anye Shi, Xiang Li, Wenyue Hua, Mingyu Jin, Guang Chen, Yang Zhou, Zhao Li, Trisha Gupte, Ming-Li Chen, Zahra Azizi, Yongfeng Zhang, Themistocles L. Assimes, Xin Ma, Danielle S. Bitterman, Lin Lu, Lizhou Fan: AIPatient: Simulating Patients with EHRs and LLM Powered Agentic Workflow. CoRR abs/2409.18924 (2024)
- [i12] Shan Chen, Mingye Gao, Kuleen Sasse, Thomas Hartvigsen, Brian Anthony, Lizhou Fan, Hugo J. W. L. Aerts, Jack Gallifant, Danielle S. Bitterman: Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation. CoRR abs/2409.20385 (2024)
- [i11] João Matos, Shan Chen, Siena Placino, Yingya Li, Juan Carlos Climent Pardo, Daphna Idan, Takeshi Tohyama, David S. Restrepo, Luis Filipe Nakayama, Jose M. M. Pascual-Leone, Guergana Savova, Hugo J. W. L. Aerts, Leo A. Celi, An-Kwok Ian Wong, Danielle S. Bitterman, Jack Gallifant: WorldMedQA-V: a multilingual, multimodal medical examination dataset for multimodal language models evaluation. CoRR abs/2410.12722 (2024)
- [i10] Kuleen Sasse, Shan Chen, Jackson Pond, Danielle S. Bitterman, John D. Osborne: Mapping Bias in Vision Language Models: Signposts, Pitfalls, and the Road Ahead. CoRR abs/2410.13146 (2024)
- [i9] Yanjun Gao, Skatje Myers, Shan Chen, Dmitriy Dligach, Timothy A. Miller, Danielle S. Bitterman, Guanhua Chen, Anoop M. Mayampurath, Matthew M. Churpek, Majid Afshar: Position Paper On Diagnostic Uncertainty Estimation from Large Language Models: Next-Word Probability Is Not Pre-test Probability. CoRR abs/2411.04962 (2024)
- [i8] Canyu Chen, Jian Yu, Shan Chen, Che Liu, Zhongwei Wan, Danielle S. Bitterman, Fei Wang, Kai Shu: ClinicalBench: Can LLMs Beat Traditional ML Models in Clinical Prediction? CoRR abs/2411.06469 (2024)
- [i7] Mingye Gao, Aman Varshney, Shan Chen, Vikram Goddla, Jack Gallifant, Patrick Doyle, Claire Novack, Maeve Dillon-Martin, Teresia Perkins, Xinrong Correia, Erik P. Duhaime, Howard Isenstein, Elad Sharon, Lisa Soleymani Lehmann, David E. Kozono, Brian Anthony, Dmitriy Dligach, Danielle S. Bitterman: The use of large language models to enhance cancer clinical trial educational materials. CoRR abs/2412.01955 (2024)
- 2023
- [c1] Sheng Lu, Shan Chen, Yingya Li, Danielle S. Bitterman, Guergana Savova, Iryna Gurevych: Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly. EMNLP (Findings) 2023: 15739-15756
- [i6] Shan Chen, Marco Guevara, Nicolas Ramirez, Arpi Murray, Jeremy L. Warner, Hugo J. W. L. Aerts, Timothy A. Miller, Guergana K. Savova, Raymond H. Mak, Danielle S. Bitterman: Natural language processing to automatically extract the presence and severity of esophagitis in notes of patients undergoing radiotherapy. CoRR abs/2303.13722 (2023)
- [i5] Shan Chen, Yingya Li, Sheng Lu, Hoang Van, Hugo J. W. L. Aerts, Guergana K. Savova, Danielle S. Bitterman: Evaluation of ChatGPT Family of Models for Biomedical Reasoning and Classification. CoRR abs/2304.02496 (2023)
- [i4] Marco Guevara, Shan Chen, Spencer Thomas, Tafadzwa L. Chaunzwa, Idalid Franco, Benjamin H. Kann, Shalini Moningi, Jack M. Qian, Madeleine Goldstein, Susan Harper, Hugo J. W. L. Aerts, Guergana K. Savova, Raymond H. Mak, Danielle S. Bitterman: Large Language Models to Identify Social Determinants of Health in Electronic Health Records. CoRR abs/2308.06354 (2023)
- [i3] Sheng Lu, Shan Chen, Yingya Li, Danielle S. Bitterman, Guergana Savova, Iryna Gurevych: Measuring Pointwise $\mathcal{V}$-Usable Information In-Context-ly. CoRR abs/2310.12300 (2023)
- [i2] Shan Chen, Marco Guevara, Shalini Moningi, Frank Hoebers, Hesham Elhalawani, Benjamin H. Kann, Fallon E. Chipidza, Jonathan Leeman, Hugo J. W. L. Aerts, Timothy A. Miller, Guergana K. Savova, Raymond H. Mak, Maryam Lustberg, Majid Afshar, Danielle S. Bitterman: The impact of using an AI chatbot to respond to patient messages. CoRR abs/2310.17703 (2023)
- 2021
- [i1] Dongfang Xu, Shan Chen, Timothy Miller: BCH-NLP at BioCreative VII Track 3: medications detection in tweets using transformer networks and multi-task learning. CoRR abs/2111.13726 (2021)
last updated on 2025-07-08 21:49 CEST by the dblp team
all metadata released as open data under CC0 1.0 license