Ting Cao 0003

Person information

affiliation: Microsoft Research, Beijing, China

2020 – today

2026
[c45]
Dayou Du, Shijie Cao, Jianyi Cheng, Luo Mai, Ting Cao, Mao Yang:
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs with Low-Bit KV Cache. HPCA 2026: 1-13
[i44]
Liang Mi, Weijun Wang, Jinghan Chen, Ting Cao, Haipeng Dai, Yunxin Liu:
Efficient Remote Prefix Fetching with GPU-native Media ASICs. CoRR abs/2602.09725 (2026)
2025
[j6]
Tatsuya Kubo, Daichi Tokuda, Lei Qu, Ting Cao, Shinya Takamaeda-Yamazaki:
PUDTune: Multi-Level Charging for High-Precision Calibration in Processing-Using-DRAM. IEEE Comput. Archit. Lett. 24(2): 245-248 (2025)
[j5]
Qipeng Wang, Shiqi Jiang, Yifan Yang, Ruiqi Liu, Yuanchun Li, Ting Cao, Xuanzhe Liu:
Efficient and Adaptive Diffusion Model Inference Through Lookup Table on Mobile Devices. IEEE Trans. Mob. Comput. 24(9): 8729-8746 (2025)
[j4]
Jiaxu Qian, Chendong Wang, Yifan Yang, Chaoyun Zhang, Huiqiang Jiang, Xufang Luo, Yu Kang, Qingwei Lin, Anlan Zhang, Shiqi Jiang, Ting Cao, Tianjun Mao, Suman Banerjee, Guyue Liu, Saravan Rajmohan, Dongmei Zhang, Yuqing Yang, Qi Zhang, Lili Qiu:
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM. Trans. Mach. Learn. Res. 2025 (2025)
[j3]
Qipeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao, Yuanchun Li, Aoyu Li, Yun Ma, Ting Cao, Xuanzhe Liu:
Anatomizing Deep Learning Inference in Web Browsers. ACM Trans. Softw. Eng. Methodol. 34(2): 47:1-47:43 (2025)
[c44]
Tuowei Wang, Ruwen Fan, Minxing Huang, Zixu Hao, Kun Li, Ting Cao, Youyou Lu, Yaoxue Zhang, Ju Ren:
Neuralink: Fast on-Device LLM Inference with Neuron Co-Activation Linking. ASPLOS (3) 2025: 147-162
[c43]
Jianyu Wei, Shijie Cao, Ting Cao, Lingxiao Ma, Lei Wang, Yanyong Zhang, Mao Yang:
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge. EuroSys 2025: 278-292
[c42]
Guoyu Li, Shengyu Ye, Chunyun Chen, Yang Wang, Fan Yang, Ting Cao, Cheng Liu, Mohamed M. Sabry Aly, Mao Yang:
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator. HPCA 2025: 671-684
[c41]
Zhiwen Mo, Lei Wang, Jianyu Wei, Zhichen Zeng, Shijie Cao, Lingxiao Ma, Naifeng Jing, Ting Cao, Jilong Xue, Fan Yang, Mao Yang:
LUT Tensor Core: A Software-Hardware Co-Design for LUT-Based Low-Bit LLM Inference. ISCA 2025: 514-528
[c40]
Xin Ding, Jianyu Wei, Fucheng Jia, Liang Mi, Ruofei Ju, Xianye Wang, Yikai Zheng, Ziming Zhang, Weijun Wang, Shiqi Jiang, Yunxin Liu, Ting Cao:
Demo: EdgeMind-OS: A Plug-and-Play Embodied Intelligence System for Real-Time On-Device Deployment. MobiCom 2025: 1219-1221
[c39]
Haozhi Han, Kun Li, Wei Cui, Donglin Bai, Yiwei Zhang, Liang Yuan, Yifeng Chen, Yunquan Zhang, Ting Cao, Mao Yang:
FlashFFTStencil: Bridging Fast Fourier Transforms to Memory-Efficient Stencil Computations on Tensor Core Units. PPoPP 2025: 355-368
[c38]
Yiwei Zhang, Kun Li, Liang Yuan, Haozhi Han, Yunquan Zhang, Ting Cao, Mao Yang:
Jigsaw: Toward Conflict-free Vectorized Stencil Computation by Tessellating Swizzled Registers. PPoPP 2025: 481-495
[c37]
Qi Li, Kun Li, Haozhi Han, Liang Yuan, Yunquan Zhang, Yifeng Chen, Junshi Chen, Hong An, Ting Cao, Mao Yang:
SparStencil: Retargeting Sparse Tensor Cores to Scientific Stencil Computations via Structured Sparsity Transformation. SC 2025: 1495-1509
[c36]
Haozhi Han, Kun Li, Fusong Ju, Qi Li, Hong An, Yifeng Chen, Yunquan Zhang, Ting Cao, Mao Yang:
Matrix Is All You Need: Rearchitecting Quantum Chemistry to Scale on AI Accelerators. SC 2025: 2126-2142
[c35]
Shenghong Dai, Shiqi Jiang, Yifan Yang, Ting Cao, Mo Li, Suman Banerjee, Lili Qiu:
Babel: A Scalable Pre-trained Model for Multi-Modal Sensing via Expandable Modality Alignment. SenSys 2025: 240-253
[c34]
Tuowei Wang, Xingyu Chen, Kun Li, Ting Cao, Ju Ren, Yaoxue Zhang:
JENGA: Enhancing LLM Long-Context Fine-tuning with Contextual Token Sparsity. USENIX ATC 2025: 123-141
[i43]
Xin Ding, Shijie Cao, Ting Cao, Zhibo Chen:
Dissecting Bit-Level Scaling Laws in Quantizing Vision Generative Models. CoRR abs/2501.06218 (2025)
[i42]
Tuowei Wang, Xingyu Chen, Kun Li, Ting Cao, Ju Ren, Yaoxue Zhang:
LeMo: Enabling LEss Token Involvement for MOre Context Fine-tuning. CoRR abs/2501.09767 (2025)
[i41]
Guoyu Li, Shengyu Ye, Chunyun Chen, Yang Wang, Fan Yang, Ting Cao, Cheng Li, Mohamed M. Sabry, Mao Yang:
LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator. CoRR abs/2501.10658 (2025)
[i40]
Xin Ding, Hao Wu, Yifan Yang, Shiqi Jiang, Donglin Bai, Zhibo Chen, Ting Cao:
StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition. CoRR abs/2503.06220 (2025)
[i39]
Gaole Dai, Shiqi Jiang, Ting Cao, Yuanchun Li, Yuqing Yang, Rui Tan, Mo Li, Lili Qiu:
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment. CoRR abs/2503.15937 (2025)
[i38]
Dayou Du, Shijie Cao, Jianyi Cheng, Ting Cao, Mao Yang:
BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache. CoRR abs/2503.18773 (2025)
[i37]
Tatsuya Kubo, Daichi Tokuda, Tomoya Nagatani, Masayuki Usui, Lei Qu, Ting Cao, Shinya Takamaeda-Yamazaki:
MVDRAM: Enabling GeMV Execution in Unmodified DRAM for Low-Bit LLM Acceleration. CoRR abs/2503.23817 (2025)
[i36]
Fucheng Jia, Zewen Wu, Shiqi Jiang, Huiqiang Jiang, Qianxi Zhang, Yuqing Yang, Yunxin Liu, Ju Ren, Deyu Zhang, Ting Cao:
Scaling Up On-Device LLMs via Active-Weight Swapping Between DRAM and Flash. CoRR abs/2504.08378 (2025)
[i35]
Yuxuan Yan, Shiqi Jiang, Ting Cao, Yifan Yang, Qianqian Yang, Yuanchao Shu, Yuqing Yang, Lili Qiu:
Empowering Agentic Video Analytics Systems with Video Language Models. CoRR abs/2505.00254 (2025)
[i34]
Jiaxu Qian, Chendong Wang, Yifan Yang, Chaoyun Zhang, Huiqiang Jiang, Xufang Luo, Yu Kang, Qingwei Lin, Anlan Zhang, Shiqi Jiang, Ting Cao, Tianjun Mao, Suman Banerjee, Guyue Liu, Saravan Rajmohan, Dongmei Zhang, Yuqing Yang, Qi Zhang, Lili Qiu:
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM. CoRR abs/2505.00742 (2025)
[i33]
Tatsuya Kubo, Daichi Tokuda, Lei Qu, Ting Cao, Shinya Takamaeda-Yamazaki:
PUDTune: Multi-Level Charging for High-Precision Calibration in Processing-Using-DRAM. CoRR abs/2505.05266 (2025)
[i32]
Qi Li, Kun Li, Haozhi Han, Honghui Shang, Xinfu He, Yunquan Zhang, Hong An, Ting Cao, Mao Yang:
SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale. CoRR abs/2505.20094 (2025)
[i31]
Yizhao Gao, Shuming Guo, Shijie Cao, Yuqing Xia, Yu Cheng, Lei Wang, Lingxiao Ma, Yutao Sun, Tianzhu Ye, Li Dong, Hayden Kwok-Hay So, Yu Hua, Ting Cao, Fan Yang, Mao Yang:
SeerAttention-R: Sparse Attention Adaptation for Long Reasoning. CoRR abs/2506.08889 (2025)
[i30]
Qi Li, Kun Li, Haozhi Han, Liang Yuan, Junshi Chen, Yunquan Zhang, Yifeng Chen, Hong An, Ting Cao, Mao Yang:
SparStencil: Retargeting Sparse Tensor Cores to Scientific Stencil Computations via Structured Sparsity Transformation. CoRR abs/2506.22969 (2025)
[i29]
Gaole Dai, Shiqi Jiang, Ting Cao, Yuqing Yang, Yuanchun Li, Rui Tan, Mo Li, Lili Qiu:
ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration. CoRR abs/2509.21823 (2025)
[i28]
Zixu Hao, Jianyu Wei, Tuowei Wang, Minxing Huang, Huiqiang Jiang, Shiqi Jiang, Ting Cao, Ju Ren:
Scaling LLM Test-Time Compute with Mobile NPU on Smartphones. CoRR abs/2509.23324 (2025)
[i27]
Xin Ding, Jianyu Wei, Yifan Yang, Shiqi Jiang, Qianxi Zhang, Hao Wu, Fucheng Jia, Liang Mi, Yuxuan Yan, Weijun Wang, Yunxin Liu, Zhibo Chen, Ting Cao:
AdaNav: Adaptive Reasoning with Uncertainty for Vision-Language Navigation. CoRR abs/2509.24387 (2025)
[i26]
Chendong Wang, Donglin Bai, Yifan Yang, Xiao Jin, Anlan Zhang, Rui Wang, Shiqi Jiang, Yuqing Yang, Hao Wu, Qi Dai, Chong Luo, Ting Cao, Lili Qiu, Suman Banerjee:
Video-in-the-Loop: Span-Grounded Long Video QA with Interleaved Reasoning. CoRR abs/2510.04022 (2025)
[i25]
Tuowei Wang, Kun Li, Zixu Hao, Donglin Bai, Ju Ren, Yaoxue Zhang, Ting Cao, Mao Yang:
Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity. CoRR abs/2510.15964 (2025)
[i24]
Jianyu Wei, Qingtao Li, Shijie Cao, Lingxiao Ma, Zixu Hao, Yanyong Zhang, Xiaoyan Hu, Ting Cao:
T-MAN: Enabling End-to-End Low-Bit LLM Inference on NPUs via Unified Table Lookup. CoRR abs/2511.11248 (2025)
2024
[j2]
Jinrui Zhang, Huan Yang, Ju Ren, Deyu Zhang, Bangwen He, Youngki Lee, Ting Cao, Yuanchun Li, Yaoxue Zhang, Yunxin Liu:
HiMoDepth: Efficient Training-Free High-Resolution On-Device Depth Perception. IEEE Trans. Mob. Comput. 23(5): 4648-4664 (2024)
[c33]
Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu:
AFPQ: Asymmetric Floating Point Quantization for LLMs. ACL (Findings) 2024: 28-36
[c32]
Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu:
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation. ACL (1) 2024: 102-116
[c31]
Cong Li, Zhe Zhou, Yang Wang, Fan Yang, Ting Cao, Mao Yang, Yun Liang, Guangyu Sun:
PIM-DL: Expanding the Applicability of Commodity DRAM-PIMs for Deep Learning via Algorithm-System Co-Optimization. ASPLOS (2) 2024: 879-896
[c30]
Zixu Hao, Huiqiang Jiang, Shiqi Jiang, Ju Ren, Ting Cao:
Hybrid SLM and LLM for Edge-Cloud Collaborative Inference. EdgeFM@MobiSys 2024: 36-41
[c29]
Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang:
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models. EMNLP 2024: 8181-8196
[c28]
Yijia Zhang, Lingran Zhao, Shijie Cao, Sicheng Zhang, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. ICME 2024: 1-6
[c27]
Ranggi Hwang, Jianyu Wei, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, Mao Yang:
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference. ISCA 2024: 1018-1031
[c26]
Xiangyu Li, Yuanchun Li, Yuanzhe Li, Ting Cao, Yunxin Liu:
FlexNN: Efficient and Adaptive DNN Inference on Memory-Constrained Edge Devices. MobiCom 2024: 709-723
[c25]
Fucheng Jia, Shiqi Jiang, Ting Cao, Wei Cui, Tianrui Xia, Xu Cao, Yuanchun Li, Qipeng Wang, Deyu Zhang, Ju Ren, Yunxin Liu, Lili Qiu, Mao Yang:
Empowering In-Browser Deep Learning Inference on Edge Through Just-In-Time Kernel Optimization. MobiSys 2024: 438-450
[c24]
Jeongho Won, Ting Cao, Huiqiang Jiang, Junehwa Song:
Poster: Design of Elastic Deep Neural Network Candidate Spaces for Inference on Diverse Devices. MobiSys 2024: 734-735
[c23]
Chengquan Feng, Li Lyna Zhang, Yuanchi Liu, Jiahang Xu, Chengruidong Zhang, Zhiyuan Wang, Ting Cao, Mao Yang, Haisheng Tan:
LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search. NSDI 2024
[c22]
Lei Wang, Lingxiao Ma, Shijie Cao, Quanlu Zhang, Jilong Xue, Yining Shi, Ningxin Zheng, Ziming Miao, Fan Yang, Ting Cao, Yuqing Yang, Mao Yang:
Ladder: Enabling Efficient Low-Precision Deep Learning Computing through Hardware-aware Tensor Transformation. OSDI 2024: 307-323
[c21]
Yuetao Chen, Kun Li, Yuhao Wang, Donglin Bai, Lei Wang, Lingxiao Ma, Liang Yuan, Yunquan Zhang, Ting Cao, Mao Yang:
ConvStencil: Transform Stencil Computation to Matrix Multiplication on Tensor Cores. PPoPP 2024: 333-347
[c20]
Tuowei Wang, Kun Li, Zixu Hao, Donglin Bai, Ju Ren, Yaoxue Zhang, Ting Cao, Mao Yang:
Long Exposure: Accelerating Parameter-Efficient Fine-Tuning for LLMs under Shadowy Sparsity. SC 2024: Article 75
[c19]
Yiwei Zhang, Kun Li, Liang Yuan, Jiawen Cheng, Yunquan Zhang, Ting Cao, Mao Yang:
LoRAStencil: Low-Rank Adaptation of Stencil Computation on Tensor Cores. SC 2024: Article 53
[i23]
Qipeng Wang, Shiqi Jiang, Zhenpeng Chen, Xu Cao, Yuanchun Li, Aoyu Li, Ying Zhang, Yun Ma, Ting Cao, Xuanzhe Liu:
Exploring the Impact of In-Browser Deep Learning Inference on Quality of User Experience and Performance. CoRR abs/2402.05981 (2024)
[i22]
Dayou Du, Yijia Zhang, Shijie Cao, Jiaqi Guo, Ting Cao, Xiaowen Chu, Ningyi Xu:
BitDistiller: Unleashing the Potential of Sub-4-Bit LLMs via Self-Distillation. CoRR abs/2402.10631 (2024)
[i21]
Jianyu Wei, Shijie Cao, Ting Cao, Lingxiao Ma, Lei Wang, Yanyong Zhang, Mao Yang:
T-MAC: CPU Renaissance via Table Lookup for Low-Bit LLM Deployment on Edge. CoRR abs/2407.00088 (2024)
[i20]
Shenghong Dai, Shiqi Jiang, Yifan Yang, Ting Cao, Mo Li, Suman Banerjee, Lili Qiu:
Advancing Multi-Modal Sensing Through Expandable Modality Alignment. CoRR abs/2407.17777 (2024)
[i19]
Zhiwen Mo, Lei Wang, Jianyu Wei, Zhichen Zeng, Shijie Cao, Lingxiao Ma, Naifeng Jing, Ting Cao, Jilong Xue, Fan Yang, Mao Yang:
LUT Tensor Core: Lookup Table Enables Efficient Low-Bit LLM Inference Acceleration. CoRR abs/2408.06003 (2024)
[i18]
Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang:
VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models. CoRR abs/2409.17066 (2024)
[i17]
Yizhao Gao, Zhichen Zeng, Dayou Du, Shijie Cao, Hayden Kwok-Hay So, Ting Cao, Fan Yang, Mao Yang:
SeerAttention: Learning Intrinsic Sparse Attention in Your LLMs. CoRR abs/2410.13276 (2024)
[i16]
Hao Wu, Donglin Bai, Shiqi Jiang, Qianxi Zhang, Yifan Yang, Ting Cao, Fengyuan Xu:
Making Every Frame Matter: Continuous Video Understanding for Large Models via Adaptive State Modeling. CoRR abs/2410.14993 (2024)
[i15]
Tuowei Wang, Ruwen Fan, Minxing Huang, Zixu Hao, Kun Li, Ting Cao, Youyou Lu, Yaoxue Zhang, Ju Ren:
Ripple: Accelerating LLM Inference on Smartphones with Correlation-Aware Neuron Management. CoRR abs/2410.19274 (2024)
[i14]
Tuowei Wang, Kun Li, Donglin Bai, Fusong Ju, Leo Xia, Ting Cao, Ju Ren, Yaoxue Zhang, Mao Yang:
Matryoshka: Optimization of Dynamic Diverse Quantum Chemistry Systems via Elastic Parallelism Transformation. CoRR abs/2412.13203 (2024)
2023
[c18]
Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu:
Adam Accumulation to Reduce Memory Footprints of Both Activations and Gradients for Large-Scale DNN Training. ECAI 2023: 3058-3065
[c17]
Xudong Wang, Li Lyna Zhang, Jiahang Xu, Quanlu Zhang, Yujing Wang, Yuqing Yang, Ningxin Zheng, Ting Cao, Mao Yang:
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference. ICCV 2023: 5796-5805
[c16]
Chen Tang, Li Lyna Zhang, Huiqiang Jiang, Jiahang Xu, Ting Cao, Quanlu Zhang, Yuqing Yang, Zhi Wang, Mao Yang:
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices. ICCV 2023: 5806-5817
[c15]
Junyan Li, Li Lyna Zhang, Jiahang Xu, Yujing Wang, Shaoguang Yan, Yunqing Xia, Yuqing Yang, Ting Cao, Hao Sun, Weiwei Deng, Qi Zhang, Mao Yang:
Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference. KDD 2023: 1280-1290
[c14]
Bin Lin, Ningxin Zheng, Lei Wang, Shijie Cao, Lingxiao Ma, Quanlu Zhang, Yi Zhu, Ting Cao, Jilong Xue, Yuqing Yang, Fan Yang:
Efficient GPU Kernels for N:M-Sparse Weights in Deep Learning. MLSys 2023
[c13]
Xiaohu Tang, Yang Wang, Ting Cao, Li Lyna Zhang, Qi Chen, Deng Cai, Yunxin Liu, Mao Yang:
LUT-NN: Empower Efficient Neural Network Inference with Centroid Learning and Table Lookup. MobiCom 2023: 70:1-70:15
[c12]
Jianyu Wei, Ting Cao, Shijie Cao, Shiqi Jiang, Shaowei Fu, Mao Yang, Yanyong Zhang, Yunxin Liu:
NN-Stretch: Automatic Neural Network Branching for Parallel Inference on Heterogeneous Multi-Processors. MobiSys 2023: 70-83
[c11]
Rongjie Yi, Ting Cao, Ao Zhou, Xiao Ma, Shangguang Wang, Mengwei Xu:
Boosting DNN Cold Inference on Edge Devices. MobiSys 2023: 516-529
[i13]
Xiaohu Tang, Yang Wang, Ting Cao, Li Lyna Zhang, Qi Chen, Deng Cai, Yunxin Liu, Mao Yang:
LUT-NN: Towards Unified Neural Network Inference by Table Lookup. CoRR abs/2302.03213 (2023)
[i12]
Li Lyna Zhang, Xudong Wang, Jiahang Xu, Quanlu Zhang, Yujing Wang, Yuqing Yang, Ningxin Zheng, Ting Cao, Mao Yang:
SpaceEvo: Hardware-Friendly Search Space Design for Efficient INT8 Inference. CoRR abs/2303.08308 (2023)
[i11]
Kun Li, Zhichun Li, Yuetao Chen, Zixuan Wang, Yiwei Zhang, Liang Yuan, Haipeng Jia, Yunquan Zhang, Ting Cao, Mao Yang:
Gamify Stencil Dwarf on Cloud for Democratizing Scientific Computing. CoRR abs/2303.08365 (2023)
[i10]
Chen Tang, Li Lyna Zhang, Huiqiang Jiang, Jiahang Xu, Ting Cao, Quanlu Zhang, Yuqing Yang, Zhi Wang, Mao Yang:
ElasticViT: Conflict-aware Supernet Training for Deploying Fast Vision Transformer on Diverse Mobile Devices. CoRR abs/2303.09730 (2023)
[i9]
Yijia Zhang, Lingran Zhao, Shijie Cao, Wenqiang Wang, Ting Cao, Fan Yang, Mao Yang, Shanghang Zhang, Ningyi Xu:
Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models. CoRR abs/2305.12356 (2023)
[i8]
Yijia Zhang, Yibo Han, Shijie Cao, Guohao Dai, Youshan Miao, Ting Cao, Fan Yang, Ningyi Xu:
Adam Accumulation to Reduce Memory Footprints of both Activations and Gradients for Large-scale DNN Training. CoRR abs/2305.19982 (2023)
[i7]
Junyan Li, Li Lyna Zhang, Jiahang Xu, Yujing Wang, Shaoguang Yan, Yunqing Xia, Yuqing Yang, Ting Cao, Hao Sun, Weiwei Deng, Qi Zhang, Mao Yang:
Constraint-aware and Ranking-distilled Token Pruning for Efficient Transformer Inference. CoRR abs/2306.14393 (2023)
[i6]
Ranggi Hwang, Jianyu Wei, Shijie Cao, Changho Hwang, Xiaohu Tang, Ting Cao, Mao Yang:
Pre-gated MoE: An Algorithm-System Co-Design for Fast and Scalable Mixture-of-Expert Inference. CoRR abs/2308.12066 (2023)
[i5]
Fucheng Jia, Shiqi Jiang, Ting Cao, Wei Cui, Tianrui Xia, Xu Cao, Yuanchun Li, Deyu Zhang, Ju Ren, Yunxin Liu, Lili Qiu, Mao Yang:
Accelerating In-Browser Deep Learning Inference on Diverse Edge Clients through Just-in-Time Kernel Optimizations. CoRR abs/2309.08978 (2023)
[i4]
Yijia Zhang, Sicheng Zhang, Shijie Cao, Dayou Du, Jianyu Wei, Ting Cao, Ningyi Xu:
AFPQ: Asymmetric Floating Point Quantization for LLMs. CoRR abs/2311.01792 (2023)
2022
[c10]
Li Lyna Zhang, Youkow Homma, Yujing Wang, Min Wu, Mao Yang, Ruofei Zhang, Ting Cao, Wei Shen:
SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance. CIKM 2022: 3654-3663
[c9]
Rendong Liang, Ting Cao, Jicheng Wen, Manni Wang, Yang Wang, Jianhua Zou, Yunxin Liu:
Romou: rapidly generate high-performance tensor kernels for mobile GPUs. MobiCom 2022: 487-500
[c8]
Jinrui Zhang, Huan Yang, Ju Ren, Deyu Zhang, Bangwen He, Ting Cao, Yuanchun Li, Yaoxue Zhang, Yunxin Liu:
MobiDepth: real-time depth estimation using on-device dual cameras. MobiCom 2022: 528-541
[c7]
Fucheng Jia, Deyu Zhang, Ting Cao, Shiqi Jiang, Yunxin Liu, Ju Ren, Yaoxue Zhang:
CoDL: efficient CPU-GPU co-execution for deep learning inference on mobile devices. MobiSys 2022: 209-221
[c6]
Yan Lu, Shiqi Jiang, Ting Cao, Yuanchao Shu:
Turbo: Opportunistic Enhancement for Edge Video Analytics. SenSys 2022: 263-276
[c5]
Ziyan Fu, Ju Ren, Yunxin Liu, Ting Cao, Deyu Zhang, Yuezhi Zhou, Yaoxue Zhang:
Hyperion: A Generic and Distributed Mobile Offloading Framework on OpenCL. SenSys 2022: 607-621
[i3]
Rongjie Yi, Ting Cao, Ao Zhou, Xiao Ma, Shangguang Wang, Mengwei Xu:
Understanding and Optimizing Deep Learning Cold-Start Latency on Edge Devices. CoRR abs/2206.07446 (2022)
[i2]
Yan Lu, Shiqi Jiang, Ting Cao, Yuanchao Shu:
Turbo: Opportunistic Enhancement for Edge Video Analytics. CoRR abs/2207.00172 (2022)
[i1]
Li Lyna Zhang, Youkow Homma, Yujing Wang, Min Wu, Mao Yang, Ruofei Zhang, Ting Cao, Wei Shen:
SwiftPruner: Reinforced Evolutionary Pruning for Efficient Ad Relevance. CoRR abs/2209.00625 (2022)
2021
[j1]
Li Lyna Zhang, Shihao Han, Jianyu Wei, Ningxin Zheng, Ting Cao, Yunxin Liu:
nn-METER: Towards Accurate Latency Prediction of DNN Inference on Diverse Edge Devices. GetMobile Mob. Comput. Commun. 25(4): 19-23 (2021)
[c4]
Xiaohu Tang, Shihao Han, Li Lyna Zhang, Ting Cao, Yunxin Liu:
To Bridge Neural Network Design and Real-World Performance: A Behaviour Study for Neural Networks. MLSys 2021
[c3]
Manni Wang, Shaohua Ding, Ting Cao, Yunxin Liu, Fengyuan Xu:
AsyMo: scalable and efficient deep-learning inference on asymmetric mobile CPUs. MobiCom 2021: 215-228
[c2]
Li Lyna Zhang, Shihao Han, Jianyu Wei, Ningxin Zheng, Ting Cao, Yuqing Yang, Yunxin Liu:
nn-Meter: towards accurate latency prediction of deep-learning model inference on diverse edge devices. MobiSys 2021: 81-93
2020
[c1]
Shiqi Jiang, Lihao Ran, Ting Cao, Yusen Xu, Yunxin Liu:
Profiling and optimizing deep learning inference on mobile GPUs. APSys 2020: 75-81
