


default search action
Tianyi Bai
Person information
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[i22]Jiajun Zhang, Zeyu Cui, Jiaxi Yang, Lei Zhang, Yuheng Jing, Zeyao Ma, Tianyi Bai, Zilei Wang, Qiang Liu, Liang Wang, Binyuan Hui, Junyang Lin:
From Completion to Editing: Unlocking Context-Aware Code Infilling via Search-and-Replace Instruction Tuning. CoRR abs/2601.13384 (2026)
[i21]Bohan Zeng, Kaixin Zhu, Daili Hua, Bozhou Li, Chengzhuo Tong, Yuran Wang, Xinyi Huang, Yifan Dai, Zixiang Zhang, Yifan Yang, Zhou Liu, Hao Liang, Xiaochen Ma, Ruichuan An, Tianyi Bai, Hongcheng Gao, Junbo Niu, Yang Shi, Xinlong Chen, Yue Ding, Minglei Shi, Kai Zeng, Yiwen Tang, Yuanxing Zhang, Pengfei Wan, Xintao Wang, Wentao Zhang:
Research on World Models Is Not Merely Injecting World Knowledge into Specific Tasks. CoRR abs/2602.01630 (2026)- 2025
[c6]Baichuan Zhou, Haote Yang
, Dairong Chen, Junyan Ye, Tianyi Bai, Jinhua Yu, Songyang Zhang, Dahua Lin, Conghui He, Weijia Li:
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios. AAAI 2025: 10707-10715
[c5]Tianyi Bai, Ling Yang, Zhen Hao Wong, Fupeng Sun, Xinlin Zhuang, Jiahui Peng, Chi Zhang, Lijun Wu, Jiantao Qiu, Wentao Zhang, Binhang Yuan, Conghui He:
Efficient Pretraining Data Selection for Language Models via Multi-Actor Collaboration. ACL (1) 2025: 9465-9491
[c4]Xinlin Zhuang, Jiahui Peng, Ren Ma, Yinfan Wang, Tianyi Bai, Xingjian Wei, Jiantao Qiu, Chi Zhang, Ying Qian, Conghui He:
Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models. ACL (1) 2025: 10856-10896
[c3]Junyan Ye, Baichuan Zhou, Zilong Huang, Junan Zhang, Tianyi Bai, Hengrui Kang, Jun He, Honglin Lin, Zihao Wang, Tong Wu, Zhizheng Wu, Yiping Chen, Dahua Lin, Conghui He, Weijia Li:
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models. ICLR 2025
[c2]Chi Zhang, Huaping Zhong, Kuan Zhang, Chengliang Chai, Rui Wang, Xinlin Zhuang, Tianyi Bai, Jiantao Qiu, Lei Cao, Ju Fan, Ye Yuan, Guoren Wang, Conghui He:
Harnessing Diversity for Important Data Selection in Pretraining Large Language Models. ICLR 2025
[i20]Gernot Heiser, Ivan Velickovic, Peter Chubb, Alwin Joshy, Anuraag Ganesh, Bill Nguyen, Cheng Li, Courtney Darville, Guangtao Zhu, James Archer, Jingyao Zhou, Krishnan Winter, Lucy Parker, Szymon Duchniewicz, Tianyi Bai:
Fast, Secure, Adaptable: LionsOS Design, Implementation and Performance. CoRR abs/2501.06234 (2025)
[i19]Jiahui Peng, Xinlin Zhuang, Jiantao Qiu, Ren Ma, Jing Yu, Tianyi Bai, Conghui He:
Unsupervised Topic Models are Data Mixers for Pre-training Language Models. CoRR abs/2502.16802 (2025)
[i18]Xinlin Zhuang, Jiahui Peng, Ren Ma, Yinfan Wang, Tianyi Bai, Xingjian Wei, Jiantao Qiu, Chi Zhang, Ying Qian, Conghui He:
Meta-rater: A Multi-dimensional Data Selection Method for Pre-training Language Models. CoRR abs/2504.14194 (2025)
[i17]Guangxin He, Yuan Cao, Yutong He, Tianyi Bai, Kun Yuan, Binhang Yuan:
TAH-QUANT: Effective Activation Quantization in Pipeline Parallelism over Slow Network. CoRR abs/2506.01352 (2025)
[i16]Tianyi Bai, Yuxuan Fan, Jiantao Qiu, Fupeng Sun, Jiayi Song, Junlin Han, Zichen Liu, Conghui He, Wentao Zhang, Binhang Yuan:
Hallucination at a Glance: Controlled Visual Edits and Fine-Grained Multimodal Learning. CoRR abs/2506.07227 (2025)
[i15]Tianyi Bai, Zengjie Hu, Fupeng Sun, Jiantao Qiu, Yizhen Jiang, Guangxin He, Bohan Zeng, Conghui He, Binhang Yuan, Wentao Zhang:
Multi-Step Visual Reasoning with Visual Tokens Scaling and Verification. CoRR abs/2506.07235 (2025)
[i14]Guangxin He, Shen Nie, Fengqi Zhu, Yuankang Zhao, Tianyi Bai, Ran Yan, Jie Fu, Chongxuan Li, Binhang Yuan:
UltraLLaDA: Scaling the Context Length to 128K for Diffusion Large Language Models. CoRR abs/2510.10481 (2025)
[i13]Zengjie Hu, Jiantao Qiu, Tianyi Bai, Haojin Yang, Binhang Yuan, Qi Jing, Conghui He, Wentao Zhang:
VADE: Variance-Aware Dynamic Sampling via Online Sample-Level Difficulty Estimation for Multimodal RL. CoRR abs/2511.18902 (2025)
[i12]Shuai Wang, Daoan Zhang, Tianyi Bai, Shitong Shao, Jiebo Luo
, Jiaheng Wei:
LAST: LeArning to Think in Space and Time for Generalist Vision-Language Models. CoRR abs/2511.19261 (2025)
[i11]Yiming Chen, Junlin Han, Tianyi Bai, Shengbang Tong, Filippos Kokkinos, Philip Torr:
From Pixels to Feelings: Aligning MLLMs with Human Cognitive Perception of Images. CoRR abs/2511.22805 (2025)
[i10]Yiwen Tang, Zoey Guo, Kaixin Zhu, Ray Zhang, Qizhi Chen, Dongzhi Jiang, Junli Liu, Bohan Zeng, Haoming Song, Delin Qu, Tianyi Bai, Dan Xu, Wentao Zhang, Bin Zhao:
Are We Ready for RL in Text-to-3D Generation? A Progressive Investigation. CoRR abs/2512.10949 (2025)
[i9]Hao Liang, Xiaochen Ma, Zhou Liu, Zhen Hao Wong, Zhengyang Zhao, Zimo Meng, Runming He, Chengyu Shen, Qifeng Cai, Zhaoyang Han, Meiyi Qiang, Yalin Feng
, Tianyi Bai, Zewei Pan, Ziyi Guo, Yizhen Jiang, Jingwen Deng, Qijie You, Peichao Lai, Tianyu Guo, Chi Hsu Tsai, Hengyi Feng, Rui Hu, Wenkai Yu, Junbo Niu, Bohan Zeng, Ruichuan An, Lu Ma, Jihao Huang, Yaowei Zheng, Conghui He, Linpeng Tang, Bin Cui, Weinan E, Wentao Zhang:
DataFlow: An LLM-Driven Framework for Unified Data Preparation and Workflow Automation in the Era of Data-Centric AI. CoRR abs/2512.16676 (2025)- 2024
[i8]Tianyi Bai, Hao Liang, Binwang Wan, Ling Yang, Bozhou Li, Yifan Wang, Bin Cui, Conghui He, Binhang Yuan, Wentao Zhang:
A Survey of Multimodal Large Language Model from A Data-centric Perspective. CoRR abs/2405.16640 (2024)
[i7]Hao Liang, Jiapeng Li, Tianyi Bai, Xijie Huang, Linzhuang Sun, Zhengren Wang, Conghui He, Bin Cui, Chong Chen, Wentao Zhang:
KeyVideoLLM: Towards Large-scale Video Keyframe Selection. CoRR abs/2407.03104 (2024)
[i6]Baichuan Zhou, Haote Yang, Dairong Chen, Junyan Ye, Tianyi Bai, Jinhua Yu, Songyang Zhang
, Dahua Lin, Conghui He, Weijia Li:
UrBench: A Comprehensive Benchmark for Evaluating Large Multimodal Models in Multi-View Urban Scenarios. CoRR abs/2408.17267 (2024)
[i5]Chi Zhang, Huaping Zhong, Kuan Zhang, Chengliang Chai, Rui Wang, Xinlin Zhuang, Tianyi Bai, Jiantao Qiu, Lei Cao, Ye Yuan, Guoren Wang, Conghui He:
Harnessing Diversity for Important Data Selection in Pretraining Large Language Models. CoRR abs/2409.16986 (2024)
[i4]Tianyi Bai, Ling Yang, Zhen Hao Wong, Jiahui Peng, Xinlin Zhuang, Chi Zhang, Lijun Wu, Jiantao Qiu, Wentao Zhang, Binhang Yuan, Conghui He:
Multi-Agent Collaborative Data Selection for Efficient LLM Pretraining. CoRR abs/2410.08102 (2024)
[i3]Junyan Ye, Baichuan Zhou, Zilong Huang, Junan Zhang, Tianyi Bai, Hengrui Kang, Jun He, Honglin Lin, Zihao Wang, Tong Wu, Zhizheng Wu, Yiping Chen, Dahua Lin, Conghui He, Weijia Li:
LOKI: A Comprehensive Synthetic Data Detection Benchmark using Large Multimodal Models. CoRR abs/2410.09732 (2024)- 2023
[i2]Tianyi Bai, Yang Li, Yu Shen, Xinyi Zhang, Wentao Zhang, Bin Cui:
Transfer Learning for Bayesian Optimization: A Survey. CoRR abs/2302.05927 (2023)- 2022
[c1]Yang Li, Yu Shen
, Huaijun Jiang, Tianyi Bai
, Wentao Zhang, Ce Zhang, Bin Cui:
Transfer Learning based Search Space Design for Hyperparameter Tuning. KDD 2022: 967-977
[i1]Yang Li, Yu Shen, Huaijun Jiang, Tianyi Bai, Wentao Zhang, Ce Zhang, Bin Cui:
Transfer Learning based Search Space Design for Hyperparameter Tuning. CoRR abs/2206.02511 (2022)
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-03-25 23:47 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







