


default search action
Chenghao Xiao
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[j2]Ke Huang
, Chenghao Xiao
, Yao Xiao
, Ming Cai, Noura Al Moubayed
:
Reevaluating zero-shot information extraction: Sampling bias, prompting transferability and sensitivity in large language models. Inf. Process. Manag. 63(4): 104611 (2026)
[i27]Joseph James, Chenghao Xiao, Yucheng Li, Nafise Sadat Moosavi, Chenghua Lin:
RIGOURATE: Quantifying Scientific Exaggeration with Evidence-Aligned Claim Evaluation. CoRR abs/2601.04350 (2026)
[i26]Yang Wang, Yiqi Liu, Chenghao Xiao, Chenghua Lin:
The Achilles' Heel of Angular Margins: A Chebyshev Polynomial Fix for Speaker Verification. CoRR abs/2601.13198 (2026)
[i25]Adnan El Assadi, Isaac Chung, Chenghao Xiao, Roman Solomatin, Animesh Jha, Rahul Chand, Silky Singh, Kaitlyn Wang, Ali Sartaz Khan, Marc Moussa Nasser, Sufen Fong, Pengfei He, Alan Xiao, Ayush Sunil Munot, Aditya Shrivastava, Artem Gazizov, Niklas Muennighoff, Kenneth C. Enevoldsen:
MAEB: Massive Audio Embedding Benchmark. CoRR abs/2602.16008 (2026)
[i24]Danlu Chen, Ka Sing He, Jiahe Tian, Chenghao Xiao, Zhaofeng Wu, Taylor Berg-Kirkpatrick, Freda Shi:
Translation or Recitation? Calibrating Evaluation Scores for Machine Translation of Extremely Low-Resource Languages. CoRR abs/2603.25222 (2026)- 2025
[j1]Yang Wang, Chenghao Xiao, Yizhi Li, Stuart E. Middleton, Noura Al Moubayed
, Chenghua Lin:
Adversarial Defense without Adversarial Defense : Enhancing Language Model Robustness via Instance-level Principal Component Removal. Trans. Assoc. Comput. Linguistics 13: 1381-1409 (2025)
[c15]Chenghao Xiao, Hou Pong Chan, Hao Zhang, Mahani Aljunied, Lidong Bing, Noura Al Moubayed, Yu Rong:
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations. ACL (1) 2025: 24099-24115
[c14]Bohao Yang, Dong Liu, Chenghao Xiao, Kun Zhao, Chen Tang, Chao Li, Lin Yuan, Yang Guang, Chenghua Lin:
Crafting Customisable Characters with LLMs: A Persona-Driven Role-Playing Agent Framework. EMNLP (Findings) 2025: 20216-20240
[c13]Yang Wang, Chenghao Xiao, Chia-Yi Hsiao, Zi Yan Chang, Chi-Li Chen, Tyler Loakman, Chenghua Lin:
Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth. EMNLP 2025: 23074-23096
[c12]Yu Sun, Xingyu Qian, Weiwen Xu, Hao Zhang, Chenghao Xiao, Long Li, Deli Zhao, Wenbing Huang, Tingyang Xu, Qifeng Bai
, Yu Rong:
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning. EMNLP 2025: 26446-26467
[c11]Zhongtian Sun
, Chenghao Xiao
, Anoushka Harit
, Jongmin Yu
:
Quantifying Semantic Shift in Financial NLP: Robust Metrics for Market Prediction Stability. ICAIF 2025: 177-184
[c10]Yanan Ma, Chenghao Xiao, Chenhan Yuan, Sabine N. van der Veer, Lamiece Hassan, Chenghua Lin, Goran Nenadic:
CAST: Corpus-Aware Self-similarity Enhanced Topic modelling. NAACL (Long Papers) 2025: 7548-7561
[i23]Kenneth C. Enevoldsen, Isaac Chung, Imene Kerboua, Márton Kardos, Ashwin Mathur, David Stap, Jay Gala, Wissam Siblini, Dominik Krzeminski, Genta Indra Winata, Saba Sturua, Saiteja Utpala, Mathieu Ciancone, Marion Schaeffer, Gabriel Sequeira, Diganta Misra, Shreeya Dhakal, Jonathan Rystrøm, Roman Solomatin, Ömer Çagatan, Akash Kundu, Martin Bernstorff, Shitao Xiao, Akshita Sukhlecha, Bhavish Pahwa, Rafal Poswiata, Kranthi Kiran GV, Shawon Ashraf, Daniel Auras, Björn Plüster, Jan Philipp Harries, Loïc Magne, Isabelle Mohr, Mariya Hendriksen, Dawei Zhu, Hippolyte Gisserot-Boukhlef, Tom Aarsen, Jan Kostkan, Konrad Wojtasik, Taemin Lee, Marek Suppa
, Crystina Zhang, Roberta Rocca, Mohammed Hamdy, Andrianos Michail, John Yang, Manuel Faysse, Aleksei Vatolin, Nandan Thakur, Manan Dey, Dipam Vasani, Pranjal A. Chitale, Simone Tedeschi, Nguyen Tai, Artem Snegirev, Michael Günther, Mengzhou Xia, Weijia Shi, Xing Han Lù, Jordan Clive, Gayatri Krishnakumar, Anna Maksimova, Silvan Wehrli
, Maria Tikhonova, Henil Panchal, Aleksandr Abramov, Malte Ostendorff, Zheng Liu, Simon Clematide, Lester James V. Miranda, Alena Fenogenova, Guangyu Song, Ruqiya Bin Safi
, Wen-Ding Li, Alessia Borghini, Federico Cassano, Hongjin Su, Jimmy Lin, Howard Yen, Lasse Hansen, Sara Hooker, Chenghao Xiao, Vaibhav Adlakha, Orion Weller, Siva Reddy, Niklas Muennighoff:
MMTEB: Massive Multilingual Text Embedding Benchmark. CoRR abs/2502.13595 (2025)
[i22]Chenghao Xiao, Isaac Chung, Imene Kerboua, Jamie Stirling, Xin Zhang, Márton Kardos, Roman Solomatin, Noura Al Moubayed, Kenneth C. Enevoldsen, Niklas Muennighoff:
MIEB: Massive Image Embedding Benchmark. CoRR abs/2504.10471 (2025)
[i21]Chenghao Xiao, Hou Pong Chan, Hao Zhang, Mahani Aljunied, Lidong Bing, Noura Al Moubayed, Yu Rong:
Analyzing LLMs' Knowledge Boundary Cognition Across Languages Through the Lens of Internal Representations. CoRR abs/2504.13816 (2025)
[i20]Hanhua Hong, Chenghao Xiao, Yang Wang
, Yiqi Liu, Wenge Rong, Chenghua Lin:
Beyond One-Size-Fits-All: Inversion Learning for Highly Effective NLG Evaluation Prompts. CoRR abs/2504.21117 (2025)
[i19]LASA Team, Weiwen Xu, Hou Pong Chan, Long Li, Mahani Aljunied, Ruifeng Yuan, Jianyu Wang, Chenghao Xiao, Guizhen Chen, Chaoqun Liu
, Zhaodonghui Li, Yu Sun, Junao Shen, Chaojun Wang, Jie Tan, Deli Zhao, Tingyang Xu, Hao Zhang, Yu Rong:
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning. CoRR abs/2506.07044 (2025)
[i18]Yu Sun, Xingyu Qian, Weiwen Xu, Hao Zhang, Chenghao Xiao, Long Li, Yu Rong, Wenbing Huang, Qifeng Bai
, Tingyang Xu:
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning. CoRR abs/2506.09513 (2025)
[i17]Yang Wang
, Chenghao Xiao, Yizhi Li, Stuart E. Middleton, Noura Al Moubayed, Chenghua Lin:
Adversarial Defence without Adversarial Defence: Enhancing Language Model Robustness via Instance-level Principal Component Removal. CoRR abs/2507.21750 (2025)
[i16]Ruifeng Yuan, Chenghao Xiao, Sicong Leng, Jianyu Wang, Long Li, Weiwen Xu, Hou Pong Chan, Deli Zhao, Tingyang Xu, Zhongyu Wei, Hao Zhang, Yu Rong:
VL-Cogito: Progressive Curriculum Reinforcement Learning for Advanced Multimodal Reasoning. CoRR abs/2507.22607 (2025)
[i15]Yang Wang
, Chenghao Xiao, Chia-Yi Hsiao, Zi Yan Chang, Chi-Li Chen, Tyler Loakman, Chenghua Lin:
Drivel-ology: Challenging LLMs with Interpreting Nonsense with Depth. CoRR abs/2509.03867 (2025)
[i14]Chenghao Xiao, Hou Pong Chan, Hao Zhang, Weiwen Xu, Mahani Aljunied, Yu Rong:
Scaling Language-Centric Omnimodal Representation Learning. CoRR abs/2510.11693 (2025)- 2024
[c9]Siwei Wu, Yizhi Li, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang
, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed
, Jie Fu, Chenghua Lin:
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval. ACL (Findings) 2024: 12560-12574
[c8]Bohao Yang, Chen Tang, Kun Zhao, Chenghao Xiao, Chenghua Lin:
Effective Distillation of Table-based Reasoning Ability from LLMs. LREC/COLING 2024: 5538-5550
[c7]Joseph James
, Chenghao Xiao, Yucheng Li, Chenghua Lin:
On the Rigour of Scientific Writing: Criteria, Analysis, and Insights. EMNLP (Findings) 2024: 6523-6538
[c6]Yizhi Li, Ruibin Yuan, Ge Zhang, Yinghao Ma, Xingran Chen, Hanzhi Yin, Chenghao Xiao, Chenghua Lin, Anton Ragni, Emmanouil Benetos, Norbert Gyenge, Roger B. Dannenberg, Ruibo Liu, Wenhu Chen, Gus Xia, Yemin Shi, Wenhao Huang, Zili Wang, Yike Guo, Jie Fu:
MERT: Acoustic Music Understanding Model with Large-Scale Self-supervised Training. ICLR 2024
[i13]Siwei Wu, Yizhi Li
, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin:
SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval. CoRR abs/2401.13478 (2024)
[i12]Chenghao Xiao, Zhuoxu Huang, Danlu Chen, G. Thomas Hudson, Yizhi Li, Haoran Duan, Chenghua Lin, Jie Fu, Jungong Han, Noura Al Moubayed:
Pixel Sentence Representation Learning. CoRR abs/2402.08183 (2024)
[i11]Chenghao Xiao, G. Thomas Hudson, Noura Al Moubayed:
RAR-b: Reasoning as Retrieval Benchmark. CoRR abs/2404.06347 (2024)
[i10]Thomas Winterbottom, G. Thomas Hudson, Daniel Kluvanec, Dean L. Slack, Jamie Sterling, Junjie Shentu, Chenghao Xiao, Zheming Zhou, Noura Al Moubayed:
The Power of Next-Frame Prediction for Learning Physical Laws. CoRR abs/2405.17450 (2024)
[i9]Bohao Yang, Dong Liu, Chen Tang, Chenghao Xiao, Kun Zhao, Chao Li, Lin Yuan, Guang Yang, Lanxiao Huang, Chenghua Lin:
SimsChat: A Customisable Persona-Driven Role-Playing Agent. CoRR abs/2406.17962 (2024)
[i8]Chen Tang, Bohao Yang, Kun Zhao, Bo Lv, Chenghao Xiao, Frank Guerin, Chenghua Lin:
BioMNER: A Dataset for Biomedical Method Entity Recognition. CoRR abs/2406.20038 (2024)
[i7]Joseph James, Chenghao Xiao, Yucheng Li, Chenghua Lin:
On the Rigour of Scientific Writing: Criteria, Analysis, and Insights. CoRR abs/2410.04981 (2024)
[i6]Yanan Ma
, Chenghao Xiao, Chenhan Yuan, Sabine N. van der Veer
, Lamiece Hassan
, Chenghua Lin, Goran Nenadic:
CAST: Corpus-Aware Self-similarity Enhanced Topic modelling. CoRR abs/2410.15136 (2024)
[i5]G. Thomas Hudson, Dean L. Slack, Thomas Winterbottom, Jamie Sterling, Chenghao Xiao, Junjie Shentu, Noura Al Moubayed:
Everything is a Video: Unifying Modalities through Next-Frame Prediction. CoRR abs/2411.10503 (2024)- 2023
[c5]Chenghao Xiao, Yang Long, Noura Al Moubayed
:
On Isotropy, Contextualization and Learning Dynamics of Contrastive-based Sentence Representation Learning. ACL (Findings) 2023: 12266-12283
[c4]Chenghao Xiao, Yizhi Li, G. Thomas Hudson
, Chenghua Lin, Noura Al Moubayed
:
Length is a Curse and a Blessing for Document-level Semantics. EMNLP 2023: 1385-1396
[c3]Chenghao Xiao, Zihuiwen Ye, G. Thomas Hudson, Zhongtian Sun, Phil Blunsom, Noura Al Moubayed:
Can Text Encoders be Deceived by Length Attack? Tiny Papers @ ICLR 2023
[i4]Yang Wang
, Qibin Liang, Chenghao Xiao, Yizhi Li, Noura Al Moubayed, Chenghua Lin:
Audio Contrastive based Fine-tuning. CoRR abs/2309.11895 (2023)
[i3]Bohao Yang, Chen Tang, Kun Zhao, Chenghao Xiao, Chenghua Lin:
Effective Distillation of Table-based Reasoning Ability from LLMs. CoRR abs/2309.13182 (2023)
[i2]Chenghao Xiao, Yizhi Li, G. Thomas Hudson, Chenghua Lin, Noura Al Moubayed:
Length is a Curse and a Blessing for Document-level Semantics. CoRR abs/2310.16193 (2023)- 2022
[c2]Chenghao Xiao
, Lei Shi
, Alexandra I. Cristea
, Zhaoxing Li
, Ziqi Pan
:
Fine-grained Main Ideas Extraction and Clustering of Online Course Reviews. AIED (1) 2022: 294-306
[c1]Zhaoxing Li
, Lei Shi
, Alexandra I. Cristea
, Yunzhan Zhou
, Chenghao Xiao
, Ziqi Pan
:
SimStu-Transformer: A Transformer-Based Approach to Simulating Student Behaviour. AIED (2) 2022: 348-351
[i1]Chenghao Xiao, Yang Long, Noura Al Moubayed:
On Isotropy and Learning Dynamics of Contrastive-based Sentence Representation Learning. CoRR abs/2212.09170 (2022)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-04-17 00:28 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







