default search action

combined dblp search
author search
venue search
publication search

ask others

Gengyuan Zhang

> Home > Persons

Person information

Refine list

refinements active!

zoomed in on ?? of ?? records

view refined list in

export refined list as

showing all ?? records

2020 – today

see FAQ

What is the meaning of the colors in the publication lists?

2025
[c10]
- view
  - electronic edition @ aclanthology.org (open access)
  - details & citations
- export record
  dblp key:
  - conf/acl/0019LWZ00DTG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/acl/0019LWZ00DTG25
Tong Liu, Zhixin Lai, Jiawen Wang, Gengyuan Zhang, Shuo Chen, Philip Torr, Vera Demberg, Volker Tresp, Jindong Gu:
Multimodal Pragmatic Jailbreak on Text-to-image Models. ACL (1) 2025: 4681-4720
[c9]
- view
  - electronic edition @ thecvf.com (open access)
  - details & citations
- export record
  dblp key:
  - conf/cvpr/ZhangFM0C0TG25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/ZhangFM0C0TG25
Gengyuan Zhang, Mang Ling Ada Fok, Jialu Ma, Yan Xia, Daniel Cremers, Philip Torr, Volker Tresp, Jindong Gu:
Localizing Events in Videos with Multimodal Queries. CVPR 2025: 3339-3351
[c8]
- view
  - electronic edition @ thecvf.com (open access)
  - details & citations
- export record
  dblp key:
  - conf/cvpr/Chen0ZBZZ0GKT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/cvpr/Chen0ZBZZ0GKT25
Haokun Chen, Hang Li, Yao Zhang, Jinhe Bi, Gengyuan Zhang, Yueqi Zhang, Philip Torr, Jindong Gu, Denis Krompass, Volker Tresp:
FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models. CVPR 2025: 30440-30450
[c7]
- view
  authority control:
- export record
  dblp key:
  - conf/wacv/ZhangC0KZGT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wacv/ZhangC0KZGT25
Yao Zhang, Haokun Chen, Ahmed Frikha, Denis Krompass, Gengyuan Zhang, Jindong Gu, Volker Tresp:
CL-Cross VQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering. WACV 2025: 6269-6278
[c6]
- view
  authority control:
- export record
  dblp key:
  - conf/wacv/AmorosoZK0CT25
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wacv/AmorosoZK0CT25
Roberto Amoroso, Gengyuan Zhang, Rajat Koner, Lorenzo Baraldi, Rita Cucchiara, Volker Tresp:
Perceive. Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries. WACV 2025: 8853-8862
[i13]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2502-15457
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2502-15457
Gengyuan Zhang, Mingcong Ding, Tong Liu, Yao Zhang, Volker Tresp:
Memory Helps, but Confabulation Misleads: Understanding Streaming Events in Videos with MLLMs. CoRR abs/2502.15457 (2025)
[i12]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2505-23798
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2505-23798
Jian Lan, Yifei Fu, Udo Schlegel, Gengyuan Zhang, Tanveer Hannan, Haokun Chen, Thomas Seidl:
My Answer Is NOT 'Fair': Mitigating Social Bias in Vision-Language Models via Fair and Biased Residuals. CoRR abs/2505.23798 (2025)
[i11]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2506-18472
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2506-18472
Gengyuan Zhang, Tanveer Hannan, Hermine Kleiner, Beste Aydemir, Xinyu Xie, Jian Lan, Thomas Seidl, Volker Tresp, Jindong Gu:
AViLA: Asynchronous Vision-Language Agent for Streaming Multimodal Data Interaction. CoRR abs/2506.18472 (2025)
2024
[c5]
- view
  authority control:
- export record
  dblp key:
  - conf/bibm/WangZZLCXWZL24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/bibm/WangZZLCXWZL24
Haosen Wang, Gengyuan Zhang, Yingnan Zhao, Fang Lai, Wenwei Cui, Jiexiao Xue, Qihang Wang, Hao Zhang, Yi Lin:
RPF-ELD: Regional Prior Fusion using Early and Late Distillation for Breast Cancer Recognition in Ultrasound Images. BIBM 2024: 2605-2612
[c4]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/LiaoEWZZMT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/LiaoEWZZMT24
Ruotong Liao, Max Erler, Huiyu Wang, Guangyao Zhai, Gengyuan Zhang, Yunpu Ma, Volker Tresp:
VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs. EMNLP (Findings) 2024: 6577-6602
[c3]
- view
  authority control:
- export record
  dblp key:
  - conf/wacv/ZhangZZT24
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/wacv/ZhangZZT24
Gengyuan Zhang, Yurui Zhang, Kerui Zhang, Volker Tresp:
Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning. WACV 2024: 625-634
[i10]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2406-10079
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2406-10079
Gengyuan Zhang, Mang Ling Ada Fok, Yan Xia, Yansong Tang, Daniel Cremers, Philip Torr, Volker Tresp, Jindong Gu:
Localizing Events in Videos with Multimodal Queries. CoRR abs/2406.10079 (2024)
[i9]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-19149
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-19149
Tong Liu, Zhixin Lai, Gengyuan Zhang, Philip Torr, Vera Demberg, Volker Tresp, Jindong Gu:
Multimodal Pragmatic Jailbreak on Text-to-image Models. CoRR abs/2409.19149 (2024)
[i8]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2409-20365
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2409-20365
Ruotong Liao, Max Erler, Huiyu Wang, Guangyao Zhai, Gengyuan Zhang, Yunpu Ma, Volker Tresp:
VideoINSTA: Zero-shot Long Video Understanding via Informative Spatial-Temporal Reasoning with LLMs. CoRR abs/2409.20365 (2024)
[i7]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2410-04810
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2410-04810
Haokun Chen, Hang Li, Yao Zhang, Gengyuan Zhang, Jinhe Bi, Philip Torr, Jindong Gu, Denis Krompass, Volker Tresp:
FedBiP: Heterogeneous One-Shot Federated Learning with Personalized Latent Diffusion Models. CoRR abs/2410.04810 (2024)
[i6]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2412-19304
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2412-19304
Roberto Amoroso, Gengyuan Zhang, Rajat Koner, Lorenzo Baraldi, Rita Cucchiara, Volker Tresp:
Perceive, Query & Reason: Enhancing Video QA with Question-Guided Temporal Queries. CoRR abs/2412.19304 (2024)
2023
[c2]
- view
  authority control:
- export record
  dblp key:
  - conf/iccv/ZhangRGT23
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/iccv/ZhangRGT23
Gengyuan Zhang, Jisen Ren, Jindong Gu, Volker Tresp:
Multi-event Video-Text Retrieval. ICCV 2023: 22056-22066
[i5]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-06166
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-06166
Gengyuan Zhang, Yurui Zhang, Kerui Zhang, Volker Tresp:
Can Vision-Language Models be a Good Guesser? Exploring VLMs for Times and Location Reasoning. CoRR abs/2307.06166 (2023)
[i4]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2307-12980
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2307-12980
Jindong Gu, Zhen Han, Shuo Chen, Ahmad Beirami, Bailan He, Gengyuan Zhang, Ruotong Liao, Yao Qin, Volker Tresp, Philip H. S. Torr:
A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models. CoRR abs/2307.12980 (2023)
[i3]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2308-11551
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2308-11551
Gengyuan Zhang, Jisen Ren, Jindong Gu, Volker Tresp:
Multi-event Video-Text Retrieval. CoRR abs/2308.11551 (2023)
[i2]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2311-12919
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2311-12919
Gengyuan Zhang, Jinhe Bi, Jindong Gu, Volker Tresp:
SPOT! Revisiting Video-Language Models for Event Understanding. CoRR abs/2311.12919 (2023)
2022
[i1]
- view
  - electronic edition via DOI (open access)
  - details & citations
  authority control:
- export record
  dblp key:
  - journals/corr/abs-2211-10567
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/journals/corr/abs-2211-10567
Yao Zhang, Haokun Chen, Ahmed Frikha, Yezi Yang, Denis Krompass, Gengyuan Zhang, Jindong Gu, Volker Tresp:
CL-CrossVQA: A Continual Learning Benchmark for Cross-Domain Visual Question Answering. CoRR abs/2211.10567 (2022)
2021
[c1]
- view
  authority control:
- export record
  dblp key:
  - conf/emnlp/HanZMT21
- ask others
- share record
  persistent URL:
  - https://dblp.org/rec/conf/emnlp/HanZMT21
Zhen Han, Gengyuan Zhang, Yunpu Ma, Volker Tresp:
Time-dependent Entity Embedding is not All You Need: A Re-evaluation of Temporal Knowledge Graph Completion Models under a Unified Framework. EMNLP (1) 2021: 8104-8118

Coauthor Index

see FAQ

manage site settings

To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.