


default search action
Ting Cao 0003
Person information
- affiliation: Microsoft Research, Beijing, China
Other persons with the same name
- Ting Cao — disambiguation page
- Ting Cao 0001
— Xi'an University of Technology, Xi'an, China - Ting Cao 0002
— Xi'an University of Technology, Xi'an, China - Ting Cao 0004
— Xi'an University of Technology, Xi'an, Shaanxi, China - Ting Cao 0005
— Wuhan University, Wuhan, China - Ting Cao 0006
— Peng Cheng Laboratory, Shenzhen, Guangdong, China - Ting Cao 0007
— Microsoft Research Asia, Beijing, China - Ting Cao 0008
— University of Washington, Seattle, WA, USA
Refine list

refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2026
[c45]Dayou Du, Shijie Cao, Jianyi Cheng, Luo Mai, Ting Cao, Mao Yang:
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs with Low-Bit KV Cache. HPCA 2026: 1-13
[i44]Liang Mi, Weijun Wang, Jinghan Chen, Ting Cao, Haipeng Dai, Yunxin Liu:
Efficient Remote Prefix Fetching with GPU-native Media ASICs. CoRR abs/2602.09725 (2026)- 2025
[j6]Tatsuya Kubo
, Daichi Tokuda, Lei Qu
, Ting Cao
, Shinya Takamaeda-Yamazaki
:
PUDTune: Multi-Level Charging for High-Precision Calibration in Processing-Using-DRAM. IEEE Comput. Archit. Lett. 24(2): 245-248 (2025)
[j5]Qipeng Wang
, Shiqi Jiang
, Yifan Yang
, Ruiqi Liu, Yuanchun Li
, Ting Cao
, Xuanzhe Liu
:
Efficient and Adaptive Diffusion Model Inference Through Lookup Table on Mobile Devices. IEEE Trans. Mob. Comput. 24(9): 8729-8746 (2025)
[j4]Jiaxu Qian, Chendong Wang, Yifan Yang, Chaoyun Zhang, Huiqiang Jiang, Xufang Luo, Yu Kang, Qingwei Lin, Anlan Zhang, Shiqi Jiang, Ting Cao, Tianjun Mao, Suman Banerjee, Guyue Liu, Saravan Rajmohan, Dongmei Zhang, Yuqing Yang, Qi Zhang, Lili Qiu:
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM. Trans. Mach. Learn. Res. 2025 (2025)
[j3]Qipeng Wang
, Shiqi Jiang
, Zhenpeng Chen
, Xu Cao
, Yuanchun Li
, Aoyu Li
, Yun Ma
, Ting Cao
, Xuanzhe Liu
:
Anatomizing Deep Learning Inference in Web Browsers. ACM Trans. Softw. Eng. Methodol. 34(2): 47:1-47:43 (2025)
[c44]Tuowei Wang
, Ruwen Fan
, Minxing Huang
, Zixu Hao
, Kun Li
, Ting Cao
, Youyou Lu
, Yaoxue Zhang
, Ju Ren
:
Neuralink: Fast on-Device LLM Inference with Neuron Co-Activation Linking. ASPLOS (3) 2025: 147-162
[c43]Jianyu Wei
, Shijie Cao
, Ting Cao
, Lingxiao Ma
, Lei Wang
, Yanyong Zhang
, Mao Yang
:
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge. EuroSys 2025: 278-292
[c42]Guoyu Li, Shengyu Ye, Chunyun Chen, Yang Wang, Fan Yang, Ting Cao, Cheng Liu
, Mohamed M. Sabry Aly, Mao Yang:
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator. HPCA 2025: 671-684
[c41]Zhiwen Mo
, Lei Wang
, Jianyu Wei
, Zhichen Zeng
, Shijie Cao
, Lingxiao Ma
, Naifeng Jing
, Ting Cao
, Jilong Xue
, Fan Yang
, Mao Yang
:
LUT Tensor Core: A Software-Hardware Co-Design for LUT-Based Low-Bit LLM Inference. ISCA 2025: 514-528
[c40]Xin Ding
, Jianyu Wei
, Fucheng Jia
, Liang Mi
, Ruofei Ju
, Xianye Wang
, Yikai Zheng
, Ziming Zhang
, Weijun Wang
, Shiqi Jiang
, Yunxin Liu
, Ting Cao
:
Demo: EdgeMind-OS: A Plug-and-Play Embodied Intelligence System for Real-Time On-Device Deployment. MobiCom 2025: 1219-1221
[c39]Haozhi Han
, Kun Li
, Wei Cui
, Donglin Bai
, Yiwei Zhang
, Liang Yuan
, Yifeng Chen
, Yunquan Zhang
, Ting Cao
, Mao Yang
:
FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient Stencil Computations on Tensor Core Units. PPoPP 2025: 355-368
[c38]Yiwei Zhang
, Kun Li
, Liang Yuan
, Haozhi Han
, Yunquan Zhang
, Ting Cao
, Mao Yang
:
Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessellating Swizzled Registers. PPoPP 2025: 481-495
[c37]Qi Li
, Kun Li
, Haozhi Han
, Liang Yuan
, Yunquan Zhang
, Yifeng Chen
, Junshi Chen
, Hong An
, Ting Cao
, Mao Yang
:
SparStencil: Retargeting Sparse Tensor Cores to Scientific Stencil Computations via Structured Sparsity Transformation. SC 2025: 1495-1509
[c36]Haozhi Han
, Kun Li
, Fusong Ju
, Qi Li
, Hong An
, Yifeng Chen
, Yunquan Zhang
, Ting Cao
, Mao Yang
:
Matrix Is All You Need: Rearchitecting Quantum Chemistry to Scale on AI Accelerators. SC 2025: 2126-2142
[c35]Shenghong Dai
, Shiqi Jiang
, Yifan Yang
, Ting Cao
, Mo Li
, Suman Banerjee
, Lili Qiu
:
Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment. SenSys 2025: 240-253
[c34]Tuowei Wang, Xingyu Chen, Kun Li, Ting Cao, Ju Ren, Yaoxue Zhang:
JENGA: Enhancing LLM Long-Context Fine-tuning with Contextual Token Sparsity. USENIX ATC 2025: 123-141
[i43]Xin Ding, Shijie Cao, Ting Cao, Zhibo Chen:
Dissecting Bit-Level Scaling Laws in Quantizing Vision Generative Models. CoRR abs/2501.06218 (2025)
[i42]Tuowei Wang, Xingyu Chen, Kun Li, Ting Cao, Ju Ren, Yaoxue Zhang:
LeMo: Enabling LEss Token Involvement for MOre Context Fine-tuning. CoRR abs/2501.09767 (2025)
[i41]Guoyu Li, Shengyu Ye, Chunyun Chen, Yang Wang, Fan Yang, Ting Cao, Cheng Li
, Mohamed M. Sabry, Mao Yang:
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator. CoRR abs/2501.10658 (2025)
[i40]Xin Ding, Hao Wu, Yifan Yang, Shiqi Jiang, Donglin Bai, Zhibo Chen, Ting Cao:
StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition. CoRR abs/2503.06220 (2025)
[i39]Gaole Dai, Shiqi Jiang, Ting Cao, Yuanchun Li, Yuqing Yang, Rui Tan, Mo Li, Lili Qiu:
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment. CoRR abs/2503.15937 (2025)
[i38]Dayou Du, Shijie Cao, Jianyi Cheng, Ting Cao, Mao Yang:
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache. CoRR abs/2503.18773 (2025)
[i37]Tatsuya Kubo, Daichi Tokuda, Tomoya Nagatani, Masayuki Usui, Lei Qu, Ting Cao, Shinya Takamaeda-Yamazaki:
MVDRAM: Enabling GeMV Execution in Unmodified DRAM for Low-Bit LLM Acceleration. CoRR abs/2503.23817 (2025)
[i36]Fucheng Jia, Zewen Wu, Shiqi Jiang, Huiqiang Jiang, Qianxi Zhang, Yuqing Yang, Yunxin Liu, Ju Ren, Deyu Zhang, Ting Cao:
Scaling Up On-Device LLMs via Active-Weight Swapping Between DRAM and Flash. CoRR abs/2504.08378 (2025)
[i35]Yuxuan Yan, Shiqi Jiang, Ting Cao, Yifan Yang, Qianqian Yang, Yuanchao Shu, Yuqing Yang, Lili Qiu:
Empowering Agentic Video Analytics Systems with Video Language Models. CoRR abs/2505.00254 (2025)
[i34]Jiaxu Qian, Chendong Wang, Yifan Yang, Chaoyun Zhang, Huiqiang Jiang, Xufang Luo, Yu Kang
, Qingwei Lin, Anlan Zhang, Shiqi Jiang, Ting Cao, Tianjun Mao, Suman Banerjee, Guyue Liu, Saravan Rajmohan, Dongmei Zhang
, Yuqing Yang, Qi Zhang, Lili Qiu:
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM. CoRR abs/2505.00742 (2025)
[i33]Tatsuya Kubo, Daichi Tokuda, Lei Qu, Ting Cao, Shinya Takamaeda-Yamazaki:
PUDTune: Multi-Level Charging for High-Precision Calibration in Processing-Using-DRAM. CoRR abs/2505.05266 (2025)
[i32]Qi Li, Kun Li, Haozhi Han, Honghui Shang, Xinfu He, Yunquan Zhang, Hong An, Ting Cao, Mao Yang:
SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale. CoRR abs/2505.20094 (2025)
[i31]Yizhao Gao, Shuming Guo, Shijie Cao, Yuqing Xia, Yu Cheng, Lei Wang, Lingxiao Ma, Yutao Sun, Tianzhu Ye, Li Dong, Hayden Kwok-Hay So, Yu Hua, Ting Cao, Fan Yang, Mao Yang:
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning. CoRR abs/2506.08889 (2025)
[i30]Qi Li, Kun Li, Haozhi Han, Liang Yuan, Junshi Chen, Yunquan Zhang, Yifeng Chen, Hong An, Ting Cao, Mao Yang:
SparStencil: Retargeting Sparse Tensor Cores to Scientific Stencil Computations via Structured Sparsity Transformation. CoRR abs/2506.22969 (2025)
[i29]Gaole Dai, Shiqi Jiang, Ting Cao, Yuqing Yang, Yuanchun Li, Rui Tan, Mo Li, Lili Qiu:
ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration. CoRR abs/2509.21823 (2025)
[i28]Zixu Hao, Jianyu Wei, Tuowei Wang, Minxing Huang, Huiqiang Jiang, Shiqi Jiang, Ting Cao, Ju Ren:
Scaling LLM Test-Time Compute with Mobile NPU on Smartphones. CoRR abs/2509.23324 (2025)
[i27]Xin Ding, Jianyu Wei, Yifan Yang, Shiqi Jiang, Qianxi Zhang, Hao Wu, Fucheng Jia, Liang Mi, Yuxuan Yan, Weijun Wang, Yunxin Liu, Zhibo Chen, Ting Cao:
AdaNav: Adaptive Reasoning with Uncertainty for Vision-Language Navigation. CoRR abs/2509.24387 (2025)
[i26]Chendong Wang, Donglin Bai, Yifan Yang, Xiao Jin, Anlan Zhang, Rui Wang, Shiqi Jiang, Yuqing Yang, Hao Wu, Qi Dai, Chong Luo, Ting Cao, Lili Qiu, Suman Banerjee:
Video-in-the-Loop: Span-Grounded Long Video QA with Interleaved Reasoning. CoRR abs/2510.04022 (2025)
[i25]Tuowei Wang, Kun Li, Zixu Hao, Donglin Bai, Ju Ren, Yaoxue Zhang, Ting Cao, Mao Yang:
Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity. CoRR abs/2510.15964 (2025)
[i24]Jianyu Wei, Qingtao Li, Shijie Cao, Lingxiao Ma, Zixu Hao, Yanyong Zhang, Xiaoyan Hu, Ting Cao:
T-MAN: Enabling End-to-End Low-Bit LLM Inference on NPUs via Unified Table Lookup. CoRR abs/2511.11248 (2025)- 2024
[j2]Jinrui Zhang
, Huan Yang
, Ju Ren
, Deyu Zhang
, Bangwen He
, Youngki Lee
, Ting Cao
, Yuanchun Li
, Yaoxue Zhang
, Yunxin Liu
:
HiMoDepth: Efficient Training-Free High-Resolution On-Device Depth Perception. IEEE Trans. Mob. Comput. 23(5): 4648-4664 (2024)
[c33]Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu:
AFPQ: Asymmetric Floating Point Quantization for LLMs. ACL (Findings) 2024: 28-36
[c32]Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu:
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation. ACL (1) 2024: 102-116
[c31]Cong Li
, Zhe Zhou
, Yang Wang
, Fan Yang
, Ting Cao
, Mao Yang
, Yun Liang
, Guangyu Sun
:
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization. ASPLOS (2) 2024: 879-896
[c30]Zixu Hao
, Huiqiang Jiang
, Shiqi Jiang
, Ju Ren
, Ting Cao
:
Hybrid SLM and LLM for Edge-Cloud Collaborative Inference. EdgeFM@MobiSys 2024: 36-41
[c29]Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang:
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models. EMNLP 2024: 8181-8196
[c28]Yijia Zhang, Lingran Zhao, Shijie Cao, Sicheng Zhang, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. ICME 2024: 1-6
[c27]Ranggi Hwang
, Jianyu Wei, Shijie Cao, Changho Hwang
, Xiaohu Tang, Ting Cao, Mao Yang:
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference. ISCA 2024: 1018-1031
[c26]Xiangyu Li
, Yuanchun Li, Yuanzhe Li
, Ting Cao, Yunxin Liu:
FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices. MobiCom 2024: 709-723
[c25]Fucheng Jia
, Shiqi Jiang
, Ting Cao
, Wei Cui
, Tianrui Xia
, Xu Cao
, Yuanchun Li
, Qipeng Wang
, Deyu Zhang
, Ju Ren
, Yunxin Liu
, Lili Qiu
, Mao Yang
:
Empowering In-Browser Deep Learning Inference on Edge Through Just-In-Time Kernel Optimization. MobiSys 2024: 438-450
[c24]Jeongho Won
, Ting Cao
, Huiqiang Jiang
, Junehwa Song
:
Poster: Design of Elastic Deep Neural Network Candidate Spaces for Inference on Diverse Devices. MobiSys 2024: 734-735
[c23]Chengquan Feng, Li Lyna Zhang, Yuanchi Liu, Jiahang Xu, Chengruidong Zhang, Zhiyuan Wang, Ting Cao, Mao Yang, Haisheng Tan:
LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search. NSDI 2024
[c22]Lei Wang, Lingxiao Ma, Shijie Cao, Quanlu Zhang, Jilong Xue, Yining Shi, Ningxin Zheng, Ziming Miao, Fan Yang, Ting Cao, Yuqing Yang, Mao Yang:
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation. OSDI 2024: 307-323
[c21]Yuetao Chen
, Kun Li
, Yuhao Wang
, Donglin Bai
, Lei Wang
, Lingxiao Ma
, Liang Yuan
, Yunquan Zhang
, Ting Cao
, Mao Yang
:
ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores. PPoPP 2024: 333-347
[c20]Tuowei Wang, Kun Li, Zixu Hao, Donglin Bai, Ju Ren, Yaoxue Zhang, Ting Cao, Mao Yang:
Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity. SC 2024: Article 75
[c19]Yiwei Zhang
, Kun Li, Liang Yuan, Jiawen Cheng
, Yunquan Zhang, Ting Cao, Mao Yang:
LoRAStencil: Low-Rank Adaptation of Stencil Computation on Tensor Cores. SC 2024: Article 53
[i23]Qipeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao, Yuanchun Li, Aoyu Li, Ying Zhang, Yun Ma, Ting Cao, Xuanzhe Liu
:
Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance. CoRR abs/2402.05981 (2024)
[i22]Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu:
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation. CoRR abs/2402.10631 (2024)
[i21]Jianyu Wei, Shijie Cao, Ting Cao, Lingxiao Ma, Lei Wang, Yanyong Zhang, Mao Yang:
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge. CoRR abs/2407.00088 (2024)
[i20]Shenghong Dai, Shiqi Jiang, Yifan Yang, Ting Cao, Mo Li, Suman Banerjee, Lili Qiu:
Advancing Multi-Modal Sensing Through Expandable Modality Alignment. CoRR abs/2407.17777 (2024)
[i19]Zhiwen Mo, Lei Wang, Jianyu Wei, Zhichen Zeng, Shijie Cao, Lingxiao Ma, Naifeng Jing, Ting Cao, Jilong Xue, Fan Yang, Mao Yang:
LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration. CoRR abs/2408.06003 (2024)
[i18]Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang:
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models. CoRR abs/2409.17066 (2024)
[i17]Yizhao Gao, Zhichen Zeng, Dayou Du, Shijie Cao, Hayden Kwok-Hay So, Ting Cao, Fan Yang, Mao Yang:
SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs. CoRR abs/2410.13276 (2024)
[i16]Hao Wu, Donglin Bai, Shiqi Jiang, Qianxi Zhang, Yifan Yang, Ting Cao, Fengyuan Xu:
Making Every Frame Matter: Continuous Video Understanding for Large Models via Adaptive State Modeling. CoRR abs/2410.14993 (2024)
[i15]Tuowei Wang, Ruwen Fan, Minxing Huang, Zixu Hao, Kun Li, Ting Cao, Youyou Lu, Yaoxue Zhang, Ju Ren:
Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware Neuron Management. CoRR abs/2410.19274 (2024)
[i14]Tuowei Wang, Kun Li, Donglin Bai, Fusong Ju, Leo Xia, Ting Cao, Ju Ren, Yaoxue Zhang, Mao Yang:
Matryoshka: Optimization of Dynamic Diverse Quantum Chemistry Systems via Elastic Parallelism Transformation. CoRR abs/2412.13203 (2024)- 2023
[c18]Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu:
Adam Accumulation to Reduce Memory Footprints of Both Activations and Gradients for Large-Scale DNN Training. ECAI 2023: 3058-3065
[c17]Xudong Wang, Li Lyna Zhang, Jiahang Xu, Quanlu Zhang, Yujing Wang, Yuqing Yang, Ningxin Zheng, Ting Cao, Mao Yang:
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference. ICCV 2023: 5796-5805
[c16]Chen Tang, Li Lyna Zhang, Huiqiang Jiang, Jiahang Xu, Ting Cao, Quanlu Zhang, Yuqing Yang, Zhi Wang, Mao Yang:
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices. ICCV 2023: 5806-5817
[c15]Junyan Li
, Li Lyna Zhang
, Jiahang Xu
, Yujing Wang, Shaoguang Yan, Yunqing Xia, Yuqing Yang
, Ting Cao, Hao Sun, Weiwei Deng
, Qi Zhang, Mao Yang:
Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference. KDD 2023: 1280-1290
[c14]Bin Lin, Ningxin Zheng, Lei Wang, Shijie Cao, Lingxiao Ma, Quanlu Zhang, Yi Zhu, Ting Cao, Jilong Xue, Yuqing Yang, Fan Yang:
Efficient GPU Kernels for N: M-Sparse Weights in Deep Learning. MLSys 2023
[c13]Xiaohu Tang
, Yang Wang
, Ting Cao
, Li Lyna Zhang
, Qi Chen
, Deng Cai
, Yunxin Liu
, Mao Yang
:
LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup. MobiCom 2023: 70:1-70:15
[c12]Jianyu Wei
, Ting Cao
, Shijie Cao
, Shiqi Jiang
, Shaowei Fu
, Mao Yang
, Yanyong Zhang
, Yunxin Liu
:
NN-Stretch: Automatic Neural Network Branching for Parallel Inference on Heterogeneous Multi-Processors. MobiSys 2023: 70-83
[c11]Rongjie Yi
, Ting Cao
, Ao Zhou
, Xiao Ma
, Shangguang Wang
, Mengwei Xu
:
Boosting DNN Cold Inference on Edge Devices. MobiSys 2023: 516-529
[i13]Xiaohu Tang, Yang Wang, Ting Cao, Li Lyna Zhang, Qi Chen, Deng Cai, Yunxin Liu, Mao Yang:
LUT-NN: Towards Unified Neural Network Inference by Table Lookup. CoRR abs/2302.03213 (2023)
[i12]Li Lyna Zhang, Xudong Wang, Jiahang Xu, Quanlu Zhang, Yujing Wang, Yuqing Yang, Ningxin Zheng, Ting Cao, Mao Yang:
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference. CoRR abs/2303.08308 (2023)
[i11]Kun Li, Zhichun Li, Yuetao Chen, Zixuan Wang, Yiwei Zhang, Liang Yuan, Haipeng Jia, Yunquan Zhang, Ting Cao, Mao Yang:
Gamify Stencil Dwarf on Cloud for Democratizing Scientific Computing. CoRR abs/2303.08365 (2023)
[i10]Chen Tang, Li Lyna Zhang, Huiqiang Jiang, Jiahang Xu, Ting Cao, Quanlu Zhang, Yuqing Yang, Zhi Wang, Mao Yang:
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices. CoRR abs/2303.09730 (2023)
[i9]Yijia Zhang, Lingran Zhao, Shijie Cao, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. CoRR abs/2305.12356 (2023)
[i8]Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu:
Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training. CoRR abs/2305.19982 (2023)
[i7]Junyan Li, Li Lyna Zhang, Jiahang Xu, Yujing Wang, Shaoguang Yan, Yunqing Xia, Yuqing Yang, Ting Cao, Hao Sun, Weiwei Deng, Qi Zhang, Mao Yang:
Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference. CoRR abs/2306.14393 (2023)
[i6]Ranggi Hwang, Jianyu Wei, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, Mao Yang:
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference. CoRR abs/2308.12066 (2023)
[i5]Fucheng Jia, Shiqi Jiang, Ting Cao, Wei Cui, Tianrui Xia, Xu Cao, Yuanchun Li, Deyu Zhang, Ju Ren, Yunxin Liu, Lili Qiu, Mao Yang:
Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations. CoRR abs/2309.08978 (2023)
[i4]Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu:
AFPQ: Asymmetric Floating Point Quantization for LLMs. CoRR abs/2311.01792 (2023)- 2022
[c10]Li Lyna Zhang, Youkow Homma, Yujing Wang, Min Wu, Mao Yang, Ruofei Zhang, Ting Cao, Wei Shen:
SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance. CIKM 2022: 3654-3663
[c9]Rendong Liang
, Ting Cao, Jicheng Wen, Manni Wang, Yang Wang, Jianhua Zou, Yunxin Liu:
Romou: rapidly generate high-performance tensor kernels for mobile GPUs. MobiCom 2022: 487-500
[c8]Jinrui Zhang, Huan Yang
, Ju Ren, Deyu Zhang, Bangwen He, Ting Cao, Yuanchun Li, Yaoxue Zhang, Yunxin Liu:
MobiDepth: real-time depth estimation using on-device dual cameras. MobiCom 2022: 528-541
[c7]Fucheng Jia
, Deyu Zhang, Ting Cao, Shiqi Jiang, Yunxin Liu, Ju Ren, Yaoxue Zhang:
CoDL: efficient CPU-GPU co-execution for deep learning inference on mobile devices. MobiSys 2022: 209-221
[c6]Yan Lu, Shiqi Jiang, Ting Cao, Yuanchao Shu
:
Turbo: Opportunistic Enhancement for Edge Video Analytics. SenSys 2022: 263-276
[c5]Ziyan Fu
, Ju Ren, Yunxin Liu, Ting Cao, Deyu Zhang, Yuezhi Zhou, Yaoxue Zhang:
Hyperion: A Generic and Distributed Mobile Offloading Framework on OpenCL. SenSys 2022: 607-621
[i3]Rongjie Yi, Ting Cao, Ao Zhou, Xiao Ma, Shangguang Wang, Mengwei Xu:
Understanding and Optimizing Deep Learning Cold-Start Latency on Edge Devices. CoRR abs/2206.07446 (2022)
[i2]Yan Lu, Shiqi Jiang, Ting Cao, Yuanchao Shu
:
Turbo: Opportunistic Enhancement for Edge Video Analytics. CoRR abs/2207.00172 (2022)
[i1]Li Lyna Zhang, Youkow Homma, Yujing Wang, Min Wu, Mao Yang, Ruofei Zhang, Ting Cao, Wei Shen:
SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance. CoRR abs/2209.00625 (2022)- 2021
[j1]Li Lyna Zhang, Shihao Han, Jianyu Wei, Ningxin Zheng, Ting Cao, Yunxin Liu:
nn-METER: Towards Accurate Latency Prediction of DNN Inference on Diverse Edge Devices. GetMobile Mob. Comput. Commun. 25(4): 19-23 (2021)
[c4]Xiaohu Tang, Shihao Han, Li Lyna Zhang, Ting Cao, Yunxin Liu:
To Bridge Neural Network Design and Real-World Performance: A Behaviour Study for Neural Networks. MLSys 2021
[c3]Manni Wang, Shaohua Ding, Ting Cao, Yunxin Liu, Fengyuan Xu:
AsyMo: scalable and efficient deep-learning inference on asymmetric mobile CPUs. MobiCom 2021: 215-228
[c2]Li Lyna Zhang, Shihao Han, Jianyu Wei, Ningxin Zheng, Ting Cao, Yuqing Yang, Yunxin Liu:
nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices. MobiSys 2021: 81-93- 2020
[c1]Shiqi Jiang, Lihao Ran, Ting Cao, Yusen Xu, Yunxin Liu:
Profiling and optimizing deep learning inference on mobile GPUs. APSys 2020: 75-81
Coauthor Index

manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from
to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the
of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from
,
, and
to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from
and
to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from
.
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2026-03-28 22:37 CET by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint


Google
Google Scholar
Semantic Scholar
Internet Archive Scholar
CiteSeerX
ORCID







