Skip to main content
Cornell University
We gratefully acknowledge support from the Simons Foundation, member institutions, and all contributors. Donate
arxiv logo > cs.CL

Help | Advanced Search

arXiv logo
Cornell University Logo

quick links

  • Login
  • Help Pages
  • About

Computation and Language

Authors and titles for recent submissions

  • Fri, 30 Jan 2026
  • Thu, 29 Jan 2026
  • Wed, 28 Jan 2026
  • Tue, 27 Jan 2026
  • Mon, 26 Jan 2026

See today's new changes

Total of 519 entries : 1-50 51-100 101-150 151-200 ... 501-519
Showing up to 50 entries per page: fewer | more | all

Fri, 30 Jan 2026 (showing first 50 of 119 entries )

[1] arXiv:2601.22156 [pdf, html, other]
Title: Hybrid Linear Attention Done Right: Efficient Distillation and Effective Architectures for Extremely Long Contexts
Yingfa Chen, Zhen Leng Thai, Zihan Zhou, Zhu Zhang, Xingyu Shen, Shuo Wang, Chaojun Xiao, Xu Han, Zhiyuan Liu
Comments: 20 pages, 8 figures
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[2] arXiv:2601.22149 [pdf, html, other]
Title: DynaWeb: Model-Based Reinforcement Learning of Web Agents
Hang Ding, Peidong Liu, Junqiao Wang, Ziwei Ji, Meng Cao, Rongzhao Zhang, Lynn Ai, Eric Yang, Tianyu Shi, Lei Yu
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[3] arXiv:2601.22146 [pdf, html, other]
Title: FineInstructions: Scaling Synthetic Instructions to Pre-Training Scale
Ajay Patel, Colin Raffel, Chris Callison-Burch
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[4] arXiv:2601.22139 [pdf, html, other]
Title: Reasoning While Asking: Transforming Reasoning Large Language Models from Passive Solvers to Proactive Inquirers
Xin Chen, Feng Jiang, Yiqian Zhang, Hardy Chen, Shuo Yan, Wenya Xie, Min Yang, Shujian Huang
Comments: The manuscript is under review
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[5] arXiv:2601.22124 [pdf, other]
Title: A Federated and Parameter-Efficient Framework for Large Language Model Training in Medicine
Anran Li, Yuanyuan Chen, Wenjun Long, Yu Yin, Yan Hu, Hyunjae Kim, Weipeng Zhou, Yujia Zhou, Hongyi Peng, Yang Ren, Xuguang Ai, Zhenyue Qin, Ming Hu, Xiaoxiao Li, Han Yu, Yih-Chung Tham, Lucila Ohno-Machado, Hua Xu, Qingyu Chen
Comments: 38 pages, 9 tables, 3 figures
Subjects: Computation and Language (cs.CL); Distributed, Parallel, and Cluster Computing (cs.DC)
[6] arXiv:2601.22101 [pdf, html, other]
Title: ECO: Quantized Training without Full-Precision Master Weights
Mahdi Nikdan, Amir Zandieh, Dan Alistarh, Vahab Mirrokni
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[7] arXiv:2601.22069 [pdf, html, other]
Title: VTC-R1: Vision-Text Compression for Efficient Long-Context Reasoning
Yibo Wang, Yongcheng Jing, Shunyu Liu, Hao Guan, Rong-cheng Tu, Chengyu Wang, Jun Huang, Dacheng Tao
Comments: Code: this https URL
Subjects: Computation and Language (cs.CL)
[8] arXiv:2601.22055 [pdf, html, other]
Title: $G^2$-Reader: Dual Evolving Graphs for Multimodal Document QA
Yaxin Du, Junru Song, Yifan Zhou, Cheng Wang, Jiahao Gu, Zimeng Chen, Menglan Chen, Wen Yao, Yang Yang, Ying Wen, Siheng Chen
Subjects: Computation and Language (cs.CL)
[9] arXiv:2601.22050 [pdf, html, other]
Title: MasalBench: A Benchmark for Contextual and Cross-Cultural Understanding of Persian Proverbs in LLMs
Ghazal Kalhor, Behnam Bahrak
Subjects: Computation and Language (cs.CL)
[10] arXiv:2601.22047 [pdf, html, other]
Title: On the Paradoxical Interference between Instruction-Following and Task Solving
Yunjia Qi, Hao Peng, Xintong Shi, Amy Xin, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li
Subjects: Computation and Language (cs.CL)
[11] arXiv:2601.22040 [pdf, html, other]
Title: A Separable Architecture for Continuous Token Representation in Language Models
Reza T. Batley, Sourav Saha
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[12] arXiv:2601.22035 [pdf, html, other]
Title: Thinking Out of Order: When Output Order Stops Reflecting Reasoning Order in Diffusion Language Models
Longxuan Yu, Yu Fu, Shaorong Zhang, Hui Liu, Mukund Varma T, Greg Ver Steeg, Yue Dong
Comments: 18 pages, 13 figures, 5 tables
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[13] arXiv:2601.22031 [pdf, html, other]
Title: Causal Autoregressive Diffusion Language Model
Junhao Ruan, Bei Li, Yongjing Yin, Pengcheng Huang, Xin Chen, Jingang Wang, Xunliang Cai, Tong Xiao, JingBo Zhu
Subjects: Computation and Language (cs.CL)
[14] arXiv:2601.22025 [pdf, html, other]
Title: When "Better" Prompts Hurt: Evaluation-Driven Iteration for LLM Applications
Daniel Commey
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Software Engineering (cs.SE)
[15] arXiv:2601.21996 [pdf, html, other]
Title: Mechanistic Data Attribution: Tracing the Training Origins of Interpretable LLM Units
Jianhui Chen, Yuzhang Luo, Liangming Pan
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[16] arXiv:2601.21969 [pdf, html, other]
Title: Token-Guard: Towards Token-Level Hallucination Control via Self-Checking Decoding
Yifan Zhu, Huiqiang Rong, Haoran Luo
Comments: 26 pages and 11 figures,this work has been accepted for presentation at ICLR 2026
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[17] arXiv:2601.21968 [pdf, html, other]
Title: OVD: On-policy Verbal Distillation
Jing Xiong, Hui Shen, Shansan Gong, Yuxin Cheng, Jianghan Shen, Chaofan Tao, Haochen Tan, Haoli Bai, Lifeng Shang, Ngai Wong
Comments: Technical Report
Subjects: Computation and Language (cs.CL)
[18] arXiv:2601.21955 [pdf, html, other]
Title: From Generative Modeling to Clinical Classification: A GPT-Based Architecture for EHR Notes
Fariba Afrin Irany
Comments: This submission is a full-length research manuscript consisting of 37 pages and 15 figures. The paper presents a GPT-based architecture with selective fine-tuning for clinical text classification, including detailed architectural diagrams, learning curves, and evaluation figures such as ROC curves and confusion matrices
Subjects: Computation and Language (cs.CL)
[19] arXiv:2601.21927 [pdf, html, other]
Title: SONIC: Segmented Optimized Nexus for Information Compression in Key-Value Caching
Hong Chen, Xiang Liu, Bo Wang, Yuxuan Fan, Yuanlin Chu, Zongluo Li, Xiaowen Chu, Xuming Hu
Subjects: Computation and Language (cs.CL)
[20] arXiv:2601.21895 [pdf, html, other]
Title: Learn-to-Distance: Distance Learning for Detecting LLM-Generated Text
Hongyi Zhou, Jin Zhu, Erhan Xu, Kai Ye, Ying Yang, Chengchun Shi
Comments: Accepted by ICLR2026
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (stat.ML)
[21] arXiv:2601.21841 [pdf, other]
Title: Embodied Task Planning via Graph-Informed Action Generation with Large Lanaguage Model
Xiang Li, Ning Yan, Masood Mortazavi
Subjects: Computation and Language (cs.CL)
[22] arXiv:2601.21826 [pdf, other]
Title: Mil-SCORE: Benchmarking Long-Context Geospatial Reasoning and Planning in Large Language Models
Aadi Palnitkar, Mingyang Mao, Nicholas Waytowich, Vinicius G. Goecks, Tinoosh Mohsenin, Xiaomin Lin
Subjects: Computation and Language (cs.CL)
[23] arXiv:2601.21804 [pdf, html, other]
Title: Distribution-Aware Reward Estimation for Test-Time Reinforcement Learning
Bodong Du, Xuanqi Huang, Xiaomeng Li
Subjects: Computation and Language (cs.CL)
[24] arXiv:2601.21803 [pdf, other]
Title: RAG-E: Quantifying Retriever-Generator Alignment and Failure Modes
Korbinian Randl, Guido Rocchietti, Aron Henriksson, Ziawasch Abedjan, Tony Lindgren, John Pavlopoulos
Subjects: Computation and Language (cs.CL)
[25] arXiv:2601.21797 [pdf, html, other]
Title: Enhancing Conversational Agents via Task-Oriented Adversarial Memory Adaptation
Yimin Deng, Yuqing Fu, Derong Xu, Yejing Wang, Wei Ni, Jingtong Gao, Xiaopeng Li, Chengxu Liu, Xiao Han, Guoshuai Zhao, Xiangyu Zhao, Li Zhu, Xueming Qian
Comments: 11 pages, 4 figures
Subjects: Computation and Language (cs.CL)
[26] arXiv:2601.21796 [pdf, html, other]
Title: KID: Knowledge-Injected Dual-Head Learning for Knowledge-Grounded Harmful Meme Detection
Yaocong Li, Leihan Zhang, Le Zhang, Qiang Yan
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[27] arXiv:2601.21768 [pdf, html, other]
Title: Zonkey: A Hierarchical Diffusion Language Model with Differentiable Tokenization and Probabilistic Attention
Alon Rozental
Subjects: Computation and Language (cs.CL)
[28] arXiv:2601.21767 [pdf, html, other]
Title: Evaluating ChatGPT on Medical Information Extraction Tasks: Performance, Explainability and Beyond
Wei Zhu
Subjects: Computation and Language (cs.CL)
[29] arXiv:2601.21766 [pdf, html, other]
Title: CoFrGeNet: Continued Fraction Architectures for Language Generation
Amit Dhurandhar, Vijil Chenthamarakshan, Dennis Wei, Tejaswini Pedapati, Karthikeyan Natesan Ramamurthy, Rahul Nair
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[30] arXiv:2601.21744 [pdf, html, other]
Title: Temporal Guidance for Large Language Models
Hong-Kai Zheng, Piji Li
Subjects: Computation and Language (cs.CL)
[31] arXiv:2601.21733 [pdf, html, other]
Title: CE-GOCD: Central Entity-Guided Graph Optimization for Community Detection to Augment LLM Scientific Question Answering
Jiayin Lan, Jiaqi Li, Baoxin Wang, Ming Liu, Dayong Wu, Shijin Wang, Bing Qin, Guoping Hu
Comments: Accepted by IEEE ICASSP 2026
Subjects: Computation and Language (cs.CL)
[32] arXiv:2601.21725 [pdf, html, other]
Title: Procedural Pretraining: Warming Up Language Models with Abstract Data
Liangze Jiang, Zachary Shinnick, Anton van den Hengel, Hemanth Saratchandran, Damien Teney
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[33] arXiv:2601.21722 [pdf, html, other]
Title: Enhancing Language Models for Robust Greenwashing Detection
Neil Heinrich Braun, Keane Ong, Rui Mao, Erik Cambria, Gianmarco Mengaldo
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[34] arXiv:2601.21711 [pdf, html, other]
Title: TACLer: Tailored Curriculum Reinforcement Learning for Efficient Reasoning
Huiyuan Lai, Malvina Nissim
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
[35] arXiv:2601.21709 [pdf, html, other]
Title: Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis
Qingyue Yang, Jie Wang, Xing Li, Yinqi Bai, Xialiang Tong, Huiling Zhen, Jianye Hao, Mingxuan Yuan, Bin Li
Comments: ICLR 2026
Subjects: Computation and Language (cs.CL)
[36] arXiv:2601.21700 [pdf, html, other]
Title: Toward Culturally Aligned LLMs through Ontology-Guided Multi-Agent Reasoning
Wonduk Seo, Wonseok Choi, Junseo Koh, Juhyeon Lee, Hyunjin An, Minhyeong Yu, Jian Park, Qingshan Zhou, Seunghyun Lee, Yi Bu
Comments: 35 pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Information Retrieval (cs.IR); Multiagent Systems (cs.MA); Social and Information Networks (cs.SI)
[37] arXiv:2601.21699 [pdf, html, other]
Title: Can David Beat Goliath? On Multi-Hop Reasoning with Resource-Constrained Agents
Hojae Han, Heeyun Jung, Jongyoon Kim, Seung-won Hwang
Comments: Preprint
Subjects: Computation and Language (cs.CL)
[38] arXiv:2601.21684 [pdf, other]
Title: Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling
Xinglin Wang, Jiayi Shi, Shaoxiong Feng, Peiwen Yuan, Yiwei Li, Yueqi Zhang, Chuyi Tan, Ji Zhang, Boyuan Pan, Yao Hu, Kan Li
Comments: preprint
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[39] arXiv:2601.21682 [pdf, html, other]
Title: FIT: Defying Catastrophic Forgetting in Continual LLM Unlearning
Xiaoyu Xu, Minxin Du, Kun Fang, Zi Liang, Yaxin Xiao, Zhicong Huang, Cheng Hong, Qingqing Ye, Haibo Hu
Comments: 20 Pages
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Cryptography and Security (cs.CR); Machine Learning (cs.LG)
[40] arXiv:2601.21678 [pdf, other]
Title: Scale-Dependent Semantic Dynamics Revealed by Allan Deviation
Debayan Dasgupta
Subjects: Computation and Language (cs.CL); Data Analysis, Statistics and Probability (physics.data-an)
[41] arXiv:2601.21665 [pdf, other]
Title: AdaptBPE: From General Purpose to Specialized Tokenizers
Vijini Liyanage, François Yvon
Comments: EACL 2026
Subjects: Computation and Language (cs.CL)
[42] arXiv:2601.21647 [pdf, html, other]
Title: ILRR: Inference-Time Steering Method for Masked Diffusion Language Models
Eden Avrahami, Eliya Nachmani
Subjects: Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
[43] arXiv:2601.21587 [pdf, html, other]
Title: Language Models as Artificial Learners: Investigating Crosslinguistic Influence
Abderrahmane Issam, Yusuf Can Semerci, Jan Scholtes, Gerasimos Spanakis
Subjects: Computation and Language (cs.CL)
[44] arXiv:2601.21579 [pdf, html, other]
Title: KromHC: Manifold-Constrained Hyper-Connections with Kronecker-Product Residual Matrices
Wuyang Zhou, Yuxuan Gu, Giorgos Iacovides, Danilo Mandic
Subjects: Computation and Language (cs.CL); Machine Learning (cs.LG)
[45] arXiv:2601.21558 [pdf, html, other]
Title: ASTRA: Automated Synthesis of agentic Trajectories and Reinforcement Arenas
Xiaoyu Tian, Haotian Wang, Shuaiting Chen, Hao Zhou, Kaichi Yu, Yudian Zhang, Jade Ouyang, Junxi Yin, Jiong Chen, Baoyan Guo, Lei Zhang, Junjie Tao, Yuansheng Song, Ming Cui, Chengwei Liu
Subjects: Computation and Language (cs.CL)
[46] arXiv:2601.21551 [pdf, html, other]
Title: Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes
Yang Zhou, Zhenting Sheng, Mingrui Tan, Yuting Song, Jun Zhou, Yu Heng Kwan, Lian Leng Low, Yang Bai, Yong Liu
Comments: Accepted at AAAI-26
Subjects: Computation and Language (cs.CL)
[47] arXiv:2601.21543 [pdf, html, other]
Title: inversedMixup: Data Augmentation via Inverting Mixed Embeddings
Fanshuang Kong, Richong Zhang, Qiyu Sun, Zhijie Nie, Ting Deng, Chunming Hu
Subjects: Computation and Language (cs.CL)
[48] arXiv:2601.21525 [pdf, html, other]
Title: LMK > CLS: Landmark Pooling for Dense Embeddings
Meet Doshi, Aashka Trivedi, Vishwajeet Kumar, Parul Awasthy, Yulong Li, Jaydeep Sen, Radu Florian, Sachindra Joshi
Subjects: Computation and Language (cs.CL); Information Retrieval (cs.IR)
[49] arXiv:2601.21512 [pdf, html, other]
Title: MURAD: A Large-Scale Multi-Domain Unified Reverse Arabic Dictionary Dataset
Serry Sibaee, Yasser Alhabashi, Nadia Sibai, Yara Farouk, Adel Ammar, Sawsan AlHalawani, Wadii Boulila
Comments: 18 pages
Subjects: Computation and Language (cs.CL); Computers and Society (cs.CY); Databases (cs.DB); Information Retrieval (cs.IR)
[50] arXiv:2601.21483 [pdf, html, other]
Title: DimStance: Multilingual Datasets for Dimensional Stance Analysis
Jonas Becker, Liang-Chih Yu, Shamsuddeen Hassan Muhammad, Jan Philip Wahle, Terry Ruas, Idris Abdulmumin, Lung-Hao Lee, Wen-Ni Liu, Tzu-Mi Lin, Zhe-Yu Xu, Ying-Lung Lin, Jin Wang, Maryam Ibrahim Mukhtar, Bela Gipp, Saif M. Mohammed
Subjects: Computation and Language (cs.CL)
Total of 519 entries : 1-50 51-100 101-150 151-200 ... 501-519
Showing up to 50 entries per page: fewer | more | all
  • About
  • Help
  • contact arXivClick here to contact arXiv Contact
  • subscribe to arXiv mailingsClick here to subscribe Subscribe
  • Copyright
  • Privacy Policy
  • Web Accessibility Assistance
  • arXiv Operational Status