


Zefan Cai
2020 – today
- 2026
[j2]Zefan Cai, Haoyi Qiu, Haozhe Zhao, Ke Wan, Jiachen Li, Jiuxiang Gu, Wen Xiao, Nanyun Peng, Junjie Hu:
From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models. Trans. Mach. Learn. Res. 2026 (2026)
[i31]Liang Chen, Weichu Xie, Yiyan Liang, Hongfeng He, Hans Zhao, Zhibo Yang, Zhiqi Huang, Haoning Wu, Haoyu Lu, Y. Charles, Yiping Bao, Yuantao Fan, Guopeng Li, Haiyang Shen, Xuanzhong Chen, Wendong Xu, Shuzheng Si, Zefan Cai, Wenhao Chai, Ziqi Huang, Fangfu Liu, Tianyu Liu, Baobao Chang, Xiaobo Hu, Kaiyuan Chen, Yixin Ren, Yang Liu, Yuan Gong, Kuan Li:
BabyVision: Visual Reasoning Beyond Language. CoRR abs/2601.06521 (2026)
- 2025
[j1]Timothy Ossowski, Danyal Maqbool, Jixuan Chen, Zefan Cai, Tyler J. Bradshaw, Junjie Hu:
COMMA: A Communicative Multimodal Multi-Agent Benchmark. Trans. Mach. Learn. Res. 2025 (2025)
[c18]Bofei Gao, Zefan Cai, Runxin Xu, Peiyi Wang, Ce Zheng, Runji Lin, Keming Lu, Dayiheng Liu, Chang Zhou, Wen Xiao, Tianyu Liu, Baobao Chang:
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback. ACL (Findings) 2025: 14588-14604
[c17]Liang Chen, Sinan Tan, Zefan Cai, Weichu Xie, Haozhe Zhao, Yichi Zhang, Junyang Lin, Jinze Bai, Tianyu Liu, Baobao Chang:
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation. ICLR 2025
[c16]Yu Fu, Zefan Cai, Abedelkadir Asi, Wayne Xiong, Yue Dong, Wen Xiao:
Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning. ICLR 2025
[c15]Bofei Gao, Feifan Song, Zhe Yang, Zefan Cai, Yibo Miao, Qingxiu Dong, Lei Li, Chenghao Ma, Liang Chen, Runxin Xu, Zhengyang Tang, Benyou Wang, Daoguang Zan, Shanghaoran Quan, Ge Zhang, Lei Sha, Yichang Zhang, Xuancheng Ren, Tianyu Liu, Baobao Chang:
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark for Large Language Models. ICLR 2025
[c14]Binwei Yao, Zefan Cai, Yun-Shiuan Chuang, Shanglin Yang, Ming Jiang, Diyi Yang, Junjie Hu:
No Preference Left Behind: Group Distributional Preference Optimization. ICLR 2025
[c13]Yuliang Liu, Junjie Lu, Chaofeng Qu, Zhaoling Chen, Zefan Cai, Jason Klein Liu, Chonghan Liu, Yunhui Xia, Li Zhao, Jiang Bian, Chuheng Zhang, Wei Shen, Zhouhan Lin:
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence. ICML 2025
[i30]Cheng Luo, Zefan Cai, Hanshi Sun, Jinqi Xiao, Bo Yuan, Wen Xiao, Junjie Hu, Jiawei Zhao, Beidi Chen, Anima Anandkumar:
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading. CoRR abs/2502.12574 (2025)
[i29]Yuliang Liu, Junjie Lu, Zhaoling Chen, Chaofeng Qu, Jason Klein Liu, Chonghan Liu, Zefan Cai, Yunhui Xia, Li Zhao, Jiang Bian, Chuheng Zhang, Wei Shen, Zhouhan Lin:
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence. CoRR abs/2502.13943 (2025)
[i28]Zeyi Huang, Yuyang Ji, Anirudh Sundara Rajan, Zefan Cai, Wen Xiao, Junjie Hu, Yong Jae Lee:
VisualToolAgent (VisTA): A Reinforcement Learning Framework for Visual Tool Selection. CoRR abs/2505.20289 (2025)
[i27]Zefan Cai, Wen Xiao, Hanshi Sun, Cheng Luo, Yikai Zhang, Ke Wan, Yucheng Li, Yeyang Zhou, Li-Wen Chang, Jiuxiang Gu, Zhen Dong, Anima Anandkumar, Abedelkadir Asi, Junjie Hu:
R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models Acceleration. CoRR abs/2505.24133 (2025)
[i26]Ruijie Zhu, Tianhao Peng, Tianhao Cheng, Xingwei Qu, Jinfa Huang, Dawei Zhu, Hao Wang, Kaiwen Xue, Xuanliang Zhang, Yong Shan, Tianle Cai, Taylor Kergan, Assel Kembay, Andrew Smith, Chenghua Lin, Binh Nguyen, Yuqi Pan, Yuhong Chou, Zefan Cai, Zhenhe Wu, Yongchi Zhao, Tianyu Liu, Jian Yang, Wangchunshu Zhou, Chujie Zheng, Chongxuan Li, Yuyin Zhou, Zhoujun Li, Zhaoxiang Zhang, Jiaheng Liu, Ge Zhang, Wenhao Huang, Jason Eshraghian:
A Survey on Latent Reasoning. CoRR abs/2507.06203 (2025)
[i25]Haozhe Zhao, Zefan Cai, Shuzheng Si, Liang Chen, Jiuxiang Gu, Wen Xiao, Junjie Hu:
MENTOR: Efficient Multimodal-Conditioned Tuning for Autoregressive Vision Generation Models. CoRR abs/2507.09574 (2025)
[i24]Lin Zhang, Zefan Cai, Yufan Zhou, Shentong Mo, Jinhong Lin, Cheng-En Wu, Yibing Wei, Yijing Zhang, Ruiyi Zhang, Wen Xiao, Tong Sun, Junjie Hu, Pedro Morgado:
Scaling Up Audio-Synchronized Visual Animation: An Efficient Training Paradigm. CoRR abs/2508.03955 (2025)
[i23]Zefan Cai, Haoyi Qiu, Haozhe Zhao, Ke Wan, Jiachen Li, Jiuxiang Gu, Wen Xiao, Nanyun Peng, Junjie Hu:
From Preferences to Prejudice: The Role of Alignment Tuning in Shaping Social Bias in Video Diffusion Models. CoRR abs/2510.17247 (2025)
[i22]Zefan Cai, Haoyi Qiu, Tianyi Ma, Haozhe Zhao, Gengze Zhou, Kung-Hsiang Huang, Parisa Kordjamshidi, Minjia Zhang, Wen Xiao, Jiuxiang Gu, Nanyun Peng, Junjie Hu:
MMGR: Multi-Modal Generative Reasoning. CoRR abs/2512.14691 (2025)
- 2024
[c12]Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Xiangdi Meng, Tianyu Liu, Baobao Chang:
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain. ACL (Findings) 2024: 1086-1104
[c11]Zefan Cai, Po-Nien Kung, Ashima Suvarna, Mingyu Derek Ma, Hritik Bansal, Baobao Chang, P. Jeffrey Brantingham, Wei Wang, Nanyun Peng:
Improving Event Definition Following For Zero-Shot Event Detection. ACL (1) 2024: 2842-2863
[c10]Shuzheng Si, Helan Hu, Haozhe Zhao, Shuang Zeng, Kaikai An, Zefan Cai, Baobao Chang:
Improving the Robustness of Distantly-Supervised Named Entity Recognition via Uncertainty-Aware Teacher Learning and Student-Student Collaborative Learning. ACL (Findings) 2024: 5533-5546
[c9]Peiyi Wang, Lei Li, Liang Chen, Zefan Cai, Dawei Zhu, Binghuai Lin, Yunbo Cao, Lingpeng Kong, Qi Liu, Tianyu Liu, Zhifang Sui:
Large Language Models are not Fair Evaluators. ACL (1) 2024: 9440-9450
[c8]Haozhe Zhao, Zefan Cai, Shuzheng Si, Xiaojian Ma, Kaikai An, Liang Chen, Zixuan Liu, Sheng Wang, Wenjuan Han, Baobao Chang:
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning. ICLR 2024
[c7]Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, Li Du, Shanghang Zhang:
VeCAF: Vision-language Collaborative Active Finetuning with Training Objective Awareness. ACM Multimedia 2024: 5451-5459
[c6]Haozhe Zhao, Zefan Cai, Shuzheng Si, Liang Chen, Yufeng He, Kaikai An, Baobao Chang:
Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation. NAACL-HLT 2024: 2893-2907
[c5]Zefan Cai, Xin Zheng, Tianyu Liu, Haoran Meng, Jiaqi Han, Gang Yuan, Binghuai Lin, Baobao Chang, Yunbo Cao:
DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade. NAACL-HLT 2024: 5431-5452
[i21]Rongyu Zhang, Zefan Cai, Huanrui Yang, Zidong Liu, Denis A. Gudovskiy, Tomoyuki Okuno, Yohei Nakata, Kurt Keutzer, Baobao Chang, Yuan Du, Li Du, Shanghang Zhang:
VeCAF: VLM-empowered Collaborative Active Finetuning with Training Objective Awareness. CoRR abs/2401.07853 (2024)
[i20]Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Xiangdi Meng, Tianyu Liu, Baobao Chang:
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain. CoRR abs/2402.15527 (2024)
[i19]Zefan Cai, Po-Nien Kung, Ashima Suvarna, Mingyu Derek Ma, Hritik Bansal, Baobao Chang, P. Jeffrey Brantingham, Wei Wang, Nanyun Peng:
Improving Event Definition Following For Zero-Shot Event Detection. CoRR abs/2403.02586 (2024)
[i18]Haozhe Zhao, Zefan Cai, Shuzheng Si, Liang Chen, Yufeng He, Kaikai An, Baobao Chang:
Mitigating Language-Level Performance Disparity in mPLMs via Teacher Language Selection and Cross-lingual Self-Distillation. CoRR abs/2404.08491 (2024)
[i17]Zefan Cai, Yichi Zhang, Bofei Gao, Yuliang Liu, Tianyu Liu, Keming Lu, Wayne Xiong, Yue Dong, Baobao Chang, Junjie Hu, Wen Xiao:
PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling. CoRR abs/2406.02069 (2024)
[i16]Bofei Gao, Zefan Cai, Runxin Xu, Peiyi Wang, Ce Zheng, Runji Lin, Keming Lu, Junyang Lin, Chang Zhou, Wen Xiao, Junjie Hu, Tianyu Liu, Baobao Chang:
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback. CoRR abs/2406.14024 (2024)
[i15]Bofei Gao, Feifan Song, Yibo Miao, Zefan Cai, Zhe Yang, Liang Chen, Helan Hu, Runxin Xu, Qingxiu Dong, Ce Zheng, Wen Xiao, Ge Zhang, Daoguang Zan, Keming Lu, Bowen Yu, Dayiheng Liu, Zeyu Cui, Jian Yang, Lei Sha, Houfeng Wang, Zhifang Sui, Peiyi Wang, Tianyu Liu, Baobao Chang:
Towards a Unified View of Preference Learning for Large Language Models: A Survey. CoRR abs/2409.02795 (2024)
[i14]Liang Chen, Sinan Tan, Zefan Cai, Weichu Xie, Haozhe Zhao, Yichi Zhang, Junyang Lin, Jinze Bai, Tianyu Liu, Baobao Chang:
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation. CoRR abs/2410.01912 (2024)
[i13]Timothy Ossowski, Jixuan Chen, Danyal Maqbool, Zefan Cai, Tyler J. Bradshaw, Junjie Hu:
COMMA: A Communicative Multimodal Multi-Agent Benchmark. CoRR abs/2410.07553 (2024)
[i12]Bofei Gao, Feifan Song, Zhe Yang, Zefan Cai, Yibo Miao, Qingxiu Dong, Lei Li, Chenghao Ma, Liang Chen, Runxin Xu, Zhengyang Tang, Benyou Wang, Daoguang Zan, Shanghaoran Quan, Ge Zhang, Lei Sha, Yichang Zhang, Xuancheng Ren, Tianyu Liu, Baobao Chang:
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models. CoRR abs/2410.07985 (2024)
[i11]Yu Fu, Zefan Cai, Abedelkadir Asi, Wayne Xiong, Yue Dong, Wen Xiao:
Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning. CoRR abs/2410.19258 (2024)
[i10]Liang Chen, Zekun Wang, Shuhuai Ren, Lei Li, Haozhe Zhao, Yunshui Li, Zefan Cai, Hongcheng Guo, Lei Zhang, Yizhe Xiong, Yichi Zhang, Ruoyu Wu, Qingxiu Dong, Ge Zhang, Jian Yang, Lingwei Meng, Shujie Hu, Yulong Chen, Junyang Lin, Shuai Bai, Andreas Vlachos, Xu Tan, Minjia Zhang, Wen Xiao, Aaron Yee, Tianyu Liu, Baobao Chang:
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey. CoRR abs/2412.18619 (2024)
[i9]Binwei Yao, Zefan Cai, Yun-Shiuan Chuang, Shanglin Yang, Ming Jiang, Diyi Yang, Junjie Hu:
No Preference Left Behind: Group Distributional Preference Optimization. CoRR abs/2412.20299 (2024)
- 2023
[c4]Shuzheng Si, Zefan Cai, Shuang Zeng, Guoqiang Feng, Jiaxing Lin, Baobao Chang:
SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition. ACL (Findings) 2023: 3883-3896
[c3]Wenjuan Han, Haozhe Zhao, Zefan Cai:
Empowering MultiModal Models' In-Context Learning Ability through Large Language Models. ACM TUR-C 2023: 9-10
[c2]Nan Shao, Zefan Cai, Hanwei Xu, Chonghua Liao, Yanan Zheng, Zhilin Yang:
Compositional Task Representations for Large Language Models. ICLR 2023
[i8]Shuzheng Si, Zefan Cai, Shuang Zeng, Guoqiang Feng, Jiaxing Lin, Baobao Chang:
SANTA: Separate Strategies for Inaccurate and Incomplete Annotation Noise in Distantly-Supervised Named Entity Recognition. CoRR abs/2305.04076 (2023)
[i7]Yufeng He, Zefan Cai, Xu Gan, Baobao Chang:
DiffCap: Exploring Continuous Diffusion on Image Captioning. CoRR abs/2305.12144 (2023)
[i6]Zefan Cai, Xin Zheng, Tianyu Liu, Xu Wang, Haoran Meng, Jiaqi Han, Gang Yuan, Binghuai Lin, Baobao Chang, Yunbo Cao:
DialogVCS: Robust Natural Language Understanding in Dialogue System Upgrade. CoRR abs/2305.14751 (2023)
[i5]Zefan Cai, Baobao Chang, Wenjuan Han:
Human-in-the-Loop through Chain-of-Thought. CoRR abs/2306.07932 (2023)
[i4]Haozhe Zhao, Zefan Cai, Shuzheng Si, Xiaojian Ma, Kaikai An, Liang Chen, Zixuan Liu, Sheng Wang, Wenjuan Han, Baobao Chang:
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning. CoRR abs/2309.07915 (2023)
[i3]Liang Chen, Yichi Zhang, Shuhuai Ren, Haozhe Zhao, Zefan Cai, Yuchi Wang, Peiyi Wang, Tianyu Liu, Baobao Chang:
Towards End-to-End Embodied Decision Making via Multi-modal Large Language Model: Explorations with GPT4-Vision and Beyond. CoRR abs/2310.02071 (2023)
[i2]Helan Hu, Shuzheng Si, Haozhe Zhao, Shuang Zeng, Kaikai An, Zefan Cai, Baobao Chang:
Distantly-Supervised Named Entity Recognition with Uncertainty-aware Teacher Learning and Student-student Collaborative Learning. CoRR abs/2311.08010 (2023)
[i1]Yuliang Liu, Xiangru Tang, Zefan Cai, Junjie Lu, Yichi Zhang, Yanjun Shao, Zexuan Deng, Helan Hu, Zengxian Yang, Kaikai An, Ruijun Huang, Shuzheng Si, Sheng Chen, Haozhe Zhao, Zhengliang Li, Liang Chen, Yiming Zong, Yan Wang, Tianyu Liu, Zhiwei Jiang, Baobao Chang, Yujia Qin, Wangchunshu Zhou, Yilun Zhao, Arman Cohan, Mark Gerstein:
ML-Bench: Large Language Models Leverage Open-source Libraries for Machine Learning Tasks. CoRR abs/2311.09835 (2023)
2010 – 2019
- 2019
[c1]Zhipeng Yu, Zefan Cai:
Using Feature Tree Model to Track High Speed Flying Soccer in Complicated Background. ISKE 2019: 880-885
last updated on 2026-03-31 23:41 CEST by the dblp team
all metadata released as open data under CC0 1.0 license