Jifeng Dai
2020 – today
- 2024
- [j5]Heming Cheng, Dongfang Ding, Jifeng Dai, Gen Li, Ke Zhang, Jianyun Li, Liuchuang Wei, Xue Zhang, Jie Hou:
Effect of a reduced arterial axial pre-stretch ratio during aging on the cardiac output and cerebral blood flow in the healthy elders. Comput. Methods Programs Biomed. 257: 108468 (2024) - [j4]Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Jia Zeng, Zhiqi Li, Jiazhi Yang, Hanming Deng, Hao Tian, Enze Xie, Jiangwei Xie, Li Chen, Tianyu Li, Yang Li, Yulu Gao, Xiaosong Jia, Si Liu, Jianping Shi, Dahua Lin, Yu Qiao:
Delving Into the Devils of Bird's-Eye-View Perception: A Review, Evaluation and Recipe. IEEE Trans. Pattern Anal. Mach. Intell. 46(4): 2151-2170 (2024) - [j3]Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs With Feature Augmentation. IEEE Trans. Pattern Anal. Mach. Intell. 46(9): 6402-6415 (2024) - [c67]Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications. CVPR 2024: 5652-5661 - [c66]Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai:
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft. CVPR 2024: 16426-16435 - [c65]Yi Yu, Xue Yang, Qingyun Li, Feipeng Da, Jifeng Dai, Yu Qiao, Junchi Yan:
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-End Oriented Object Detection with Single Point Supervision. CVPR 2024: 16783-16793 - [c64]Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai:
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. CVPR 2024: 24185-24198 - [c63]Gang Li, Wenhai Wang, Xiang Li, Ziheng Li, Jian Yang, Jifeng Dai, Yu Qiao, Shanshan Zhang:
Distilling Knowledge from Large-Scale Image Models for Object Detection. ECCV (84) 2024: 142-160 - [c62]Weiyun Wang, Yiming Ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, Yu Qiao, Jifeng Dai:
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World. ECCV (33) 2024: 471-490 - [c61]Changyao Tian, Chenxin Tao, Jifeng Dai, Hao Li, Ziheng Li, Lewei Lu, Xiaogang Wang, Hongsheng Li, Gao Huang, Xizhou Zhu:
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process. ICLR 2024 - [c60]Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao:
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World. ICLR 2024 - [c59]Yang Yang, Wenhai Wang, Zhe Chen, Jifeng Dai, Liang Zheng:
Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments. ICLR 2024 - [c58]Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, Peize Sun, Haibao Yu, Chao Yang, Wenqi Shao, Wenhai Wang, Jifeng Dai, Yu Qiao, Mingyu Ding, Ping Luo:
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis. ICML 2024 - [c57]Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling. SIGGRAPH (Conference Paper Track) 2024: 111 - [i101]Yuwen Xiong, Zhiqi Li, Yuntao Chen, Feng Wang, Xizhou Zhu, Jiapeng Luo, Wenhai Wang, Tong Lu, Hongsheng Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
Efficient Deformable ConvNets: Rethinking Dynamic and Sparse Operator for Vision Applications. CoRR abs/2401.06197 (2024) - [i100]Changyao Tian, Xizhou Zhu, Yuwen Xiong, Weiyun Wang, Zhe Chen, Wenhai Wang, Yuntao Chen, Lewei Lu, Tong Lu, Jie Zhou, Hongsheng Li, Yu Qiao, Jifeng Dai:
MM-Interleaved: Interleaved Image-Text Generative Modeling via Multi-modal Feature Synchronizer. CoRR abs/2401.10208 (2024) - [i99]Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling. CoRR abs/2401.15977 (2024) - [i98]Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, Peize Sun, Haibao Yu, Chao Yang, Wenqi Shao, Wenhai Wang, Jifeng Dai, Yu Qiao, Mingyu Ding, Ping Luo:
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis. CoRR abs/2402.16117 (2024) - [i97]Weiyun Wang, Yiming Ren, Haowen Luo, Tiantong Li, Chenxiang Yan, Zhe Chen, Wenhai Wang, Qingyun Li, Lewei Lu, Xizhou Zhu, Yu Qiao, Jifeng Dai:
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World. CoRR abs/2402.19474 (2024) - [i96]Yuchen Duan, Weiyun Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Yu Qiao, Hongsheng Li, Jifeng Dai, Wenhai Wang:
Vision-RWKV: Efficient and Scalable Visual Perception with RWKV-Like Architectures. CoRR abs/2403.02308 (2024) - [i95]Yang Yang, Wenhai Wang, Zhe Chen, Jifeng Dai, Liang Zheng:
Bounding Box Stability against Feature Dropout Reflects Detector Generalization across Environments. CoRR abs/2403.13803 (2024) - [i94]Xiaoyi Dong, Pan Zhang, Yuhang Zang, Yuhang Cao, Bin Wang, Linke Ouyang, Songyang Zhang, Haodong Duan, Wenwei Zhang, Yining Li, Hang Yan, Yang Gao, Zhe Chen, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Kai Chen, Conghui He, Xingcheng Zhang, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD. CoRR abs/2404.06512 (2024) - [i93]Zhe Chen, Weiyun Wang, Hao Tian, Shenglong Ye, Zhangwei Gao, Erfei Cui, Wenwen Tong, Kongzhi Hu, Jiapeng Luo, Zheng Ma, Ji Ma, Jiaqi Wang, Xiaoyi Dong, Hang Yan, Hewei Guo, Conghui He, Botian Shi, Zhenjiang Jin, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Min Dou, Lewei Lu, Xizhou Zhu, Tong Lu, Dahua Lin, Yu Qiao, Jifeng Dai, Wenhai Wang:
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites. CoRR abs/2404.16821 (2024) - [i92]Chongjie Si, Xuehui Wang, Xue Yang, Zhengqin Xu, Qingyun Li, Jifeng Dai, Yu Qiao, Xiaokang Yang, Wei Shen:
FLoRA: Low-Rank Core Space for N-dimension. CoRR abs/2405.14739 (2024) - [i91]Yingqing He, Zhaoyang Liu, Jingye Chen, Zeyue Tian, Hongyu Liu, Xiaowei Chi, Runtao Liu, Ruibin Yuan, Yazhou Xing, Wenhai Wang, Jifeng Dai, Yong Zhang, Wei Xue, Qifeng Liu, Yike Guo, Qifeng Chen:
LLMs Meet Multimodal Generation and Editing: A Survey. CoRR abs/2405.19334 (2024) - [i90]Xizhou Zhu, Xue Yang, Zhaokai Wang, Hao Li, Wenhan Dou, Junqi Ge, Lewei Lu, Yu Qiao, Jifeng Dai:
Parameter-Inverted Image Pyramid Networks. CoRR abs/2406.04330 (2024) - [i89]Chenxin Tao, Xizhou Zhu, Shiqian Su, Lewei Lu, Changyao Tian, Xuan Luo, Gao Huang, Hongsheng Li, Yu Qiao, Jie Zhou, Jifeng Dai:
Learning 1D Causal Visual Representation with De-focus Attention Networks. CoRR abs/2406.04342 (2024) - [i88]Weiyun Wang, Shuibo Zhang, Yiming Ren, Yuchen Duan, Tiantong Li, Shuo Liu, Mengkang Hu, Zhe Chen, Kaipeng Zhang, Lewei Lu, Xizhou Zhu, Ping Luo, Yu Qiao, Jifeng Dai, Wenqi Shao, Wenhai Wang:
Needle In A Multimodal Haystack. CoRR abs/2406.07230 (2024) - [i87]Chenyu Yang, Xizhou Zhu, Jinguo Zhu, Weijie Su, Junjie Wang, Xuan Dong, Wenhai Wang, Lewei Lu, Bin Li, Jie Zhou, Yu Qiao, Jifeng Dai:
Vision Model Pre-training on Interleaved Image-Text Data via Latent Compression Learning. CoRR abs/2406.07543 (2024) - [i86]Haoji Zhang, Yiqin Wang, Yansong Tang, Yong Liu, Jiashi Feng, Jifeng Dai, Xiaojie Jin:
Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams. CoRR abs/2406.08085 (2024) - [i85]Jiannan Wu, Muyan Zhong, Sen Xing, Zeqiang Lai, Zhaoyang Liu, Wenhai Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Ping Luo, Yu Qiao, Jifeng Dai:
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks. CoRR abs/2406.08394 (2024) - [i84]Qingyun Li, Zhe Chen, Weiyun Wang, Wenhai Wang, Shenglong Ye, Zhenjiang Jin, Guanzhou Chen, Yinan He, Zhangwei Gao, Erfei Cui, Jiashuo Yu, Hao Tian, Jiasheng Zhou, Chao Xu, Bin Wang, Xingjian Wei, Wei Li, Wenjian Zhang, Bo Zhang, Pinlong Cai, Licheng Wen, Xiangchao Yan, Zhenxiang Li, Pei Chu, Yi Wang, Min Dou, Changyao Tian, Xizhou Zhu, Lewei Lu, Yushi Chen, Junjun He, Zhongying Tu, Tong Lu, Yali Wang, Limin Wang, Dahua Lin, Yu Qiao, Botian Shi, Conghui He, Jifeng Dai:
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text. CoRR abs/2406.08418 (2024) - [i83]Jiawei Gao, Ziqin Wang, Zeqi Xiao, Jingbo Wang, Tai Wang, Jinkun Cao, Xiaolin Hu, Si Liu, Jifeng Dai, Jiangmiao Pang:
CooHOI: Learning Cooperative Human-Object Interaction with Manipulated Object Dynamics. CoRR abs/2406.14558 (2024) - [i82]Yiqin Wang, Haoji Zhang, Yansong Tang, Yong Liu, Jiashi Feng, Jifeng Dai, Xiaojie Jin:
Hierarchical Memory for Long Video QA. CoRR abs/2407.00603 (2024) - [i81]Pan Zhang, Xiaoyi Dong, Yuhang Zang, Yuhang Cao, Rui Qian, Lin Chen, Qipeng Guo, Haodong Duan, Bin Wang, Linke Ouyang, Songyang Zhang, Wenwei Zhang, Yining Li, Yang Gao, Peng Sun, Xinyue Zhang, Wei Li, Jingwen Li, Wenhai Wang, Hang Yan, Conghui He, Xingcheng Zhang, Kai Chen, Jifeng Dai, Yu Qiao, Dahua Lin, Jiaqi Wang:
InternLM-XComposer-2.5: A Versatile Large Vision Language Model Supporting Long-Contextual Input and Output. CoRR abs/2407.03320 (2024) - [i80]Yangzhou Liu, Yue Cao, Zhangwei Gao, Weiyun Wang, Zhe Chen, Wenhai Wang, Hao Tian, Lewei Lu, Xizhou Zhu, Tong Lu, Yu Qiao, Jifeng Dai:
MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity. CoRR abs/2407.15838 (2024) - [i79]Fanqing Meng, Jin Wang, Chuanhao Li, Quanfeng Lu, Hao Tian, Jiaqi Liao, Xizhou Zhu, Jifeng Dai, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models. CoRR abs/2408.02718 (2024) - [i78]Gen Luo, Xue Yang, Wenhan Dou, Zhaokai Wang, Jifeng Dai, Yu Qiao, Xizhou Zhu:
Mono-InternVL: Pushing the Boundaries of Monolithic Multimodal Large Language Models with Endogenous Visual Pre-training. CoRR abs/2410.08202 (2024) - 2023
- [c56]Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation. CVPR 2023: 1599-1610 - [c55]Chenxin Tao, Xizhou Zhu, Weijie Su, Gao Huang, Bin Li, Jie Zhou, Yu Qiao, Xiaogang Wang, Jifeng Dai:
Siamese Image Modeling for Self-Supervised Vision Representation Learning. CVPR 2023: 2132-2141 - [c54]Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. CVPR 2023: 2691-2700 - [c53]Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao:
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. CVPR 2023: 14408-14419 - [c52]Weijie Su, Xizhou Zhu, Chenxin Tao, Lewei Lu, Bin Li, Gao Huang, Yu Qiao, Xiaogang Wang, Jie Zhou, Jifeng Dai:
Towards All-in-One Pre-Training via Maximizing Multi-Modal Mutual Information. CVPR 2023: 15888-15899 - [c51]Chenyu Yang, Yuntao Chen, Hao Tian, Chenxin Tao, Xizhou Zhu, Zhaoxiang Zhang, Gao Huang, Hongyang Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision. CVPR 2023: 17830-17839 - [c50]Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li:
Planning-oriented Autonomous Driving. CVPR 2023: 17853-17862 - [c49]Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, Pheng-Ann Heng:
Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior. CVPR 2023: 18053-18062 - [c48]Yurui Zhu, Tianyu Wang, Xueyang Fu, Xuanyu Yang, Xin Guo, Jifeng Dai, Yu Qiao, Xiaowei Hu:
Learning Weather-General and Weather-Specific Features for Image Restoration Under Multiple Adverse Weather Conditions. CVPR 2023: 21747-21758 - [c47]Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation. ICCV 2023: 12435-12446 - [c46]Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao:
Vision Transformer Adapter for Dense Predictions. ICLR 2023 - [c45]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. NeurIPS 2023 - [c44]Keqiang Sun, Junting Pan, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Limin Wang, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. NeurIPS 2023 - [c43]Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai:
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks. NeurIPS 2023 - [i77]Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation. CoRR abs/2303.01237 (2023) - [i76]Rongyao Fang, Peng Gao, Aojun Zhou, Yingjie Cai, Si Liu, Jifeng Dai, Hongsheng Li:
FeatAug-DETR: Enriching One-to-Many Matching for DETRs with Feature Augmentation. CoRR abs/2303.01503 (2023) - [i75]Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li:
VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation. CoRR abs/2303.08340 (2023) - [i74]Jiaqi Xu, Xiaowei Hu, Lei Zhu, Qi Dou, Jifeng Dai, Yu Qiao, Pheng-Ann Heng:
Video Dehazing via a Multi-Range Temporal Alignment Network with Physical Prior. CoRR abs/2303.09757 (2023) - [i73]Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao:
InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language. CoRR abs/2305.05662 (2023) - [i72]Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai:
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks. CoRR abs/2305.11175 (2023) - [i71]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. CoRR abs/2305.15021 (2023) - [i70]Xizhou Zhu, Yuntao Chen, Hao Tian, Chenxin Tao, Weijie Su, Chenyu Yang, Gao Huang, Bin Li, Lewei Lu, Xiaogang Wang, Yu Qiao, Zhaoxiang Zhang, Jifeng Dai:
Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory. CoRR abs/2305.17144 (2023) - [i69]Zeqiang Lai, Yuchen Duan, Jifeng Dai, Ziheng Li, Ying Fu, Hongsheng Li, Yu Qiao, Wenhai Wang:
Denoising Diffusion Semantic Segmentation with Mask Prior Modeling. CoRR abs/2306.01721 (2023) - [i68]Changyao Tian, Chenxin Tao, Jifeng Dai, Hao Li, Ziheng Li, Lewei Lu, Xiaogang Wang, Hongsheng Li, Gao Huang, Xizhou Zhu:
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process. CoRR abs/2306.05423 (2023) - [i67]Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Yijin Li, Hongwei Qin, Jifeng Dai, Xiaogang Wang, Hongsheng Li:
FlowFormer: A Transformer Architecture and Its Masked Cost Volume Autoencoding for Optical Flow. CoRR abs/2306.05442 (2023) - [i66]Junting Pan, Keqiang Sun, Yuying Ge, Hao Li, Haodong Duan, Xiaoshi Wu, Renrui Zhang, Aojun Zhou, Zipeng Qin, Yi Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
JourneyDB: A Benchmark for Generative Image Understanding. CoRR abs/2307.00716 (2023) - [i65]Weiyun Wang, Min Shi, Qingyun Li, Wenhai Wang, Zhenhang Huang, Linjie Xing, Zhe Chen, Hao Li, Xizhou Zhu, Zhiguo Cao, Yushi Chen, Tong Lu, Jifeng Dai, Yu Qiao:
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World. CoRR abs/2308.01907 (2023) - [i64]Zeqiang Lai, Xizhou Zhu, Jifeng Dai, Yu Qiao, Wenhai Wang:
Mini-DALLE3: Interactive Text to Image by Prompting Large Language Models. CoRR abs/2310.07653 (2023) - [i63]Zhaoyang Liu, Zeqiang Lai, Zhangwei Gao, Erfei Cui, Zhiheng Li, Xizhou Zhu, Lewei Lu, Qifeng Chen, Yu Qiao, Jifeng Dai, Wenhai Wang:
ControlLLM: Augment Language Models with Tools by Searching on Graphs. CoRR abs/2310.17796 (2023) - [i62]Yu Yi, Xue Yang, Qingyun Li, Feipeng Da, Junchi Yan, Jifeng Dai, Yu Qiao:
Point2RBox: Combine Knowledge from Synthetic Visual Patterns for End-to-end Oriented Object Detection with Single Point Supervision. CoRR abs/2311.14758 (2023) - [i61]Rongyao Fang, Shilin Yan, Zhaoyang Huang, Jingqiu Zhou, Hao Tian, Jifeng Dai, Hongsheng Li:
InstructSeq: Unifying Vision Tasks with Instruction-conditioned Multi-modal Sequence Generation. CoRR abs/2311.18835 (2023) - [i60]Hao Li, Xue Yang, Zhaokai Wang, Xizhou Zhu, Jie Zhou, Yu Qiao, Xiaogang Wang, Hongsheng Li, Lewei Lu, Jifeng Dai:
Auto MC-Reward: Automated Dense Reward Design with Large Language Models for Minecraft. CoRR abs/2312.09238 (2023) - [i59]Wenhai Wang, Jiangwei Xie, Chuanyang Hu, Haoming Zou, Jianan Fan, Wenwen Tong, Yang Wen, Silei Wu, Hanming Deng, Zhiqi Li, Hao Tian, Lewei Lu, Xizhou Zhu, Xiaogang Wang, Yu Qiao, Jifeng Dai:
DriveMLM: Aligning Multi-Modal Large Language Models with Behavioral Planning States for Autonomous Driving. CoRR abs/2312.09245 (2023) - [i58]Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Zhangyue Yin, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng-Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Ji-Rong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, Zhenguo Li:
A Survey of Reasoning with Foundation Models. CoRR abs/2312.11562 (2023) - [i57]Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai:
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. CoRR abs/2312.14238 (2023) - 2022
- [c42]Hao Li, Tianwen Fu, Jifeng Dai, Hongsheng Li, Gao Huang, Xizhou Zhu:
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks. CVPR 2022: 999-1008 - [c41]Chenxin Tao, Honghui Wang, Xizhou Zhu, Jiahua Dong, Shiji Song, Gao Huang, Jifeng Dai:
Exploring the Equivalence of Siamese Self-Supervised Learning via A Unified Gradient Framework. CVPR 2022: 14411-14420 - [c40]Xizhou Zhu, Jinguo Zhu, Hao Li, Xiaoshi Wu, Hongsheng Li, Xiaohua Wang, Jifeng Dai:
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks. CVPR 2022: 16783-16794 - [c39]Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Yu Qiao, Jifeng Dai:
BEVFormer: Learning Bird's-Eye-View Representation from Multi-camera Images via Spatiotemporal Transformers. ECCV (9) 2022: 1-18 - [c38]Changyao Tian, Wenhai Wang, Xizhou Zhu, Jifeng Dai, Yu Qiao:
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition. ECCV (25) 2022: 73-91 - [c37]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. ECCV (35) 2022: 388-404 - [c36]Renrui Zhang, Wei Zhang, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-Free Adaption of CLIP for Few-Shot Classification. ECCV (35) 2022: 493-510 - [c35]Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer: A Transformer Architecture for Optical Flow. ECCV (17) 2022: 668-685 - [c34]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
MCMAE: Masked Convolution Meets Masked Autoencoders. NeurIPS 2022 - [c33]Jinguo Zhu, Xizhou Zhu, Wenhai Wang, Xiaohua Wang, Hongsheng Li, Xiaogang Wang, Jifeng Dai:
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs. NeurIPS 2022 - [i56]Zhaoyang Huang, Xiaoyu Shi, Chao Zhang, Qiang Wang, Ka Chun Cheung, Hongwei Qin, Jifeng Dai, Hongsheng Li:
FlowFormer: A Transformer Architecture for Optical Flow. CoRR abs/2203.16194 (2022) - [i55]Zhiqi Li, Wenhai Wang, Hongyang Li, Enze Xie, Chonghao Sima, Tong Lu, Qiao Yu, Jifeng Dai:
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers. CoRR abs/2203.17270 (2022) - [i54]Peng Gao, Teli Ma, Hongsheng Li, Ziyi Lin, Jifeng Dai, Yu Qiao:
ConvMAE: Masked Convolution Meets Masked Autoencoders. CoRR abs/2205.03892 (2022) - [i53]Zhe Chen, Yuchen Duan, Wenhai Wang, Junjun He, Tong Lu, Jifeng Dai, Yu Qiao:
Vision Transformer Adapter for Dense Predictions. CoRR abs/2205.08534 (2022) - [i52]Chenxin Tao, Xizhou Zhu, Gao Huang, Yu Qiao, Xiaogang Wang, Jifeng Dai:
Siamese Image Modeling for Self-Supervised Vision Representation Learning. CoRR abs/2206.01204 (2022) - [i51]Jinguo Zhu, Xizhou Zhu, Wenhai Wang, Xiaohua Wang, Hongsheng Li, Xiaogang Wang, Jifeng Dai:
Uni-Perceiver-MoE: Learning Sparse Generalist Models with Conditional MoEs. CoRR abs/2206.04674 (2022) - [i50]Renrui Zhang, Zhang Wei, Rongyao Fang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free Adaption of CLIP for Few-shot Classification. CoRR abs/2207.09519 (2022) - [i49]Ziyi Lin, Shijie Geng, Renrui Zhang, Peng Gao, Gerard de Melo, Xiaogang Wang, Jifeng Dai, Yu Qiao, Hongsheng Li:
Frozen CLIP Models are Efficient Video Learners. CoRR abs/2208.03550 (2022) - [i48]Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Enze Xie, Zhiqi Li, Hanming Deng, Hao Tian, Xizhou Zhu, Li Chen, Yulu Gao, Xiangwei Geng, Jia Zeng, Yang Li, Jiazhi Yang, Xiaosong Jia, Bohan Yu, Yu Qiao, Dahua Lin, Si Liu, Junchi Yan, Jianping Shi, Ping Luo:
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe. CoRR abs/2209.05324 (2022) - [i47]Wenhai Wang, Jifeng Dai, Zhe Chen, Zhenhang Huang, Zhiqi Li, Xizhou Zhu, Xiaowei Hu, Tong Lu, Lewei Lu, Hongsheng Li, Xiaogang Wang, Yu Qiao:
InternImage: Exploring Large-Scale Vision Foundation Models with Deformable Convolutions. CoRR abs/2211.05778 (2022) - [i46]Jifeng Dai, Min Shi, Weiyun Wang, Sitong Wu, Linjie Xing, Wenhai Wang, Xizhou Zhu, Lewei Lu, Jie Zhou, Xiaogang Wang, Yu Qiao, Xiaowei Hu:
Demystify Transformers & Convolutions in Modern Image Deep Networks. CoRR abs/2211.05781 (2022) - [i45]Weijie Su, Xizhou Zhu, Chenxin Tao, Lewei Lu, Bin Li, Gao Huang, Yu Qiao, Xiaogang Wang, Jie Zhou, Jifeng Dai:
Towards All-in-one Pre-training via Maximizing Multi-modal Mutual Information. CoRR abs/2211.09807 (2022) - [i44]Hao Li, Jinguo Zhu, Xiaohu Jiang, Xizhou Zhu, Hongsheng Li, Chun Yuan, Xiaohua Wang, Yu Qiao, Xiaogang Wang, Wenhai Wang, Jifeng Dai:
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks. CoRR abs/2211.09808 (2022) - [i43]Chenyu Yang, Yuntao Chen, Hao Tian, Chenxin Tao, Xizhou Zhu, Zhaoxiang Zhang, Gao Huang, Hongyang Li, Yu Qiao, Lewei Lu, Jie Zhou, Jifeng Dai:
BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision. CoRR abs/2211.10439 (2022) - [i42]Yihan Hu, Jiazhi Yang, Li Chen, Keyu Li, Chonghao Sima, Xizhou Zhu, Siqi Chai, Senyao Du, Tianwei Lin, Wenhai Wang, Lewei Lu, Xiaosong Jia, Qiang Liu, Jifeng Dai, Yu Qiao, Hongyang Li:
Goal-oriented Autonomous Driving. CoRR abs/2212.10156 (2022) - 2021
- [c32]Hao Tian, Yuntao Chen, Jifeng Dai, Zhaoxiang Zhang, Xizhou Zhu:
Unsupervised Object Detection With LIDAR Clues. CVPR 2021: 5962-5972 - [c31]Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. ICCV 2021: 3601-3610 - [c30]Wenguan Wang, Tianfei Zhou, Fisher Yu, Jifeng Dai, Ender Konukoglu, Luc Van Gool:
Exploring Cross-Image Pixel Contrast for Semantic Segmentation. ICCV 2021: 7283-7293 - [c29]Zhuoming Liu, Hao Ding, Huaping Zhong, Weijia Li, Jifeng Dai, Conghui He:
Influence Selection for Active Learning. ICCV 2021: 9254-9263 - [c28]Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting. ICCV 2021: 14020-14029 - [c27]Hao Li, Chenxin Tao, Xizhou Zhu, Xiaogang Wang, Gao Huang, Jifeng Dai:
Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation. ICLR 2021 - [c26]Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, Jifeng Dai:
Deformable DETR: Deformable Transformers for End-to-End Object Detection. ICLR 2021 - [c25]Chenxin Tao, Zizhang Li, Xizhou Zhu, Gao Huang, Yong Liu, Jifeng Dai:
Searching Parameterized AP Loss for Object Detection. NeurIPS 2021: 22021-22033 - [i41]Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. CoRR abs/2101.07448 (2021) - [i40]Wenguan Wang, Tianfei Zhou, Fisher Yu, Jifeng Dai, Ender Konukoglu, Luc Van Gool:
Exploring Cross-Image Pixel Contrast for Semantic Segmentation. CoRR abs/2101.11939 (2021) - [i39]Hao Li, Tianwen Fu, Jifeng Dai, Hongsheng Li, Gao Huang, Xizhou Zhu:
AutoLoss-Zero: Searching Loss Functions from Scratch for Generic Tasks. CoRR abs/2103.14026 (2021) - [i38]Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Decoupled Spatial-Temporal Transformer for Video Inpainting. CoRR abs/2104.06637 (2021) - [i37]Peng Gao, Shijie Geng, Yu Qiao, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Scalable Transformers for Neural Machine Translation. CoRR abs/2106.02242 (2021) - [i36]Haiyang Wang, Wenguan Wang, Xizhou Zhu, Jifeng Dai, Liwei Wang:
Collaborative Visual Navigation. CoRR abs/2107.01151 (2021) - [i35]Peng Gao, Minghang Zheng, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
Fast Convergence of DETR with Spatially Modulated Co-Attention. CoRR abs/2108.02404 (2021) - [i34]Zhuoming Liu, Hao Ding, Huaping Zhong, Weijia Li, Jifeng Dai, Conghui He:
Influence Selection for Active Learning. CoRR abs/2108.09331 (2021) - [i33]Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li:
FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting. CoRR abs/2109.02974 (2021) - [i32]Renrui Zhang, Rongyao Fang, Wei Zhang, Peng Gao, Kunchang Li, Jifeng Dai, Yu Qiao, Hongsheng Li:
Tip-Adapter: Training-free CLIP-Adapter for Better Vision-Language Modeling. CoRR abs/2111.03930 (2021) - [i31]Changyao Tian, Wenhai Wang, Xizhou Zhu, Xiaogang Wang, Jifeng Dai, Yu Qiao:
VL-LTR: Learning Class-wise Visual-Linguistic Representation for Long-Tailed Visual Recognition. CoRR abs/2111.13579 (2021) - [i30]Xizhou Zhu, Jinguo Zhu, Hao Li, Xiaoshi Wu, Xiaogang Wang, Hongsheng Li, Xiaohua Wang, Jifeng Dai:
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks. CoRR abs/2112.01522 (2021) - [i29]Chenxin Tao, Zizhang Li, Xizhou Zhu, Gao Huang, Yong Liu, Jifeng Dai:
Searching Parameterized AP Loss for Object Detection. CoRR abs/2112.05138 (2021) - [i28]Chenxin Tao, Honghui Wang, Xizhou Zhu, Jiahua Dong, Shiji Song, Gao Huang, Jifeng Dai:
Exploring the Equivalence of Siamese Self-Supervised Learning via A Unified Gradient Framework. CoRR abs/2112.05141 (2021) - 2020
- [c24]Le Yang, Yizeng Han, Xi Chen, Shiji Song, Jifeng Dai, Gao Huang:
Resolution Adaptive Networks for Efficient Inference. CVPR 2020: 2366-2375 - [c23]Wenguan Wang, Hailong Zhu, Jifeng Dai, Yanwei Pang, Jianbing Shen, Ling Shao:
Hierarchical Human Parsing With Typed Part-Relation Reasoning. CVPR 2020: 8926-8936 - [c22]Guolei Sun, Wenguan Wang, Jifeng Dai, Luc Van Gool:
Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation. ECCV (2) 2020: 347-365 - [c21]Hang Gao, Xizhou Zhu, Stephen Lin, Jifeng Dai:
Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation. ICLR 2020 - [c20]Weijie Su, Xizhou Zhu, Yue Cao, Bin Li, Lewei Lu, Furu Wei, Jifeng Dai:
VL-BERT: Pre-training of Generic Visual-Linguistic Representations. ICLR 2020 - [i27]Wenguan Wang, Hailong Zhu, Jifeng Dai, Yanwei Pang, Jianbing Shen, Ling Shao:
Hierarchical Human Parsing with Typed Part-Relation Reasoning. CoRR abs/2003.04845 (2020) - [i26]Le Yang, Yizeng Han, Xi Chen, Shiji Song, Jifeng Dai, Gao Huang:
Resolution Adaptive Networks for Efficient Inference. CoRR abs/2003.07326 (2020) - [i25]Guolei Sun, Wenguan Wang, Jifeng Dai, Luc Van Gool:
Mining Cross-Image Semantics for Weakly Supervised Semantic Segmentation. CoRR abs/2007.01947 (2020) - [i24]Jingru Tan, Gang Zhang, Hanming Deng, Changbao Wang, Lewei Lu, Quanquan Li, Jifeng Dai:
1st Place Solution of LVIS Challenge 2020: A Good Box is not a Guarantee of a Good Mask. CoRR abs/2009.01559 (2020) - [i23]Xizhou Zhu, Weijie Su, Lewei Lu, Bin Li, Xiaogang Wang, Jifeng Dai:
Deformable DETR: Deformable Transformers for End-to-End Object Detection. CoRR abs/2010.04159 (2020) - [i22]Hao Li, Chenxin Tao, Xizhou Zhu, Xiaogang Wang, Gao Huang, Jifeng Dai:
Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation. CoRR abs/2010.07930 (2020) - [i21]Hao Tian, Yuntao Chen, Jifeng Dai, Zhaoxiang Zhang, Xizhou Zhu:
Unsupervised Object Detection with LiDAR Clues. CoRR abs/2011.12953 (2020)
2010 – 2019
- 2019
- [c19]Xizhou Zhu, Han Hu, Stephen Lin, Jifeng Dai:
Deformable ConvNets V2: More Deformable, Better Results. CVPR 2019: 9308-9316 - [c18]Xizhou Zhu, Dazhi Cheng, Zheng Zhang, Stephen Lin, Jifeng Dai:
An Empirical Study of Spatial Attention Mechanisms in Deep Networks. ICCV 2019: 6687-6696 - [i20]Xizhou Zhu, Dazhi Cheng, Zheng Zhang, Stephen Lin, Jifeng Dai:
An Empirical Study of Spatial Attention Mechanisms in Deep Networks. CoRR abs/1904.05873 (2019) - [i19]Kai Chen, Jiaqi Wang, Jiangmiao Pang, Yuhang Cao, Yu Xiong, Xiaoxiao Li, Shuyang Sun, Wansen Feng, Ziwei Liu, Jiarui Xu, Zheng Zhang, Dazhi Cheng, Chenchen Zhu, Tianheng Cheng, Qijie Zhao, Buyu Li, Xin Lu, Rui Zhu, Yue Wu, Jifeng Dai, Jingdong Wang, Jianping Shi, Wanli Ouyang, Chen Change Loy, Dahua Lin:
MMDetection: Open MMLab Detection Toolbox and Benchmark. CoRR abs/1906.07155 (2019) - [i18]Weijie Su, Xizhou Zhu, Yue Cao, Bin Li, Lewei Lu, Furu Wei, Jifeng Dai:
VL-BERT: Pre-training of Generic Visual-Linguistic Representations. CoRR abs/1908.08530 (2019) - [i17]Hang Gao, Xizhou Zhu, Steve Lin, Jifeng Dai:
Deformable Kernels: Adapting Effective Receptive Fields for Object Deformation. CoRR abs/1910.02940 (2019) - 2018
- [c17]Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei:
Relation Networks for Object Detection. CVPR 2018: 3588-3597
- [c16]Xizhou Zhu, Jifeng Dai, Lu Yuan, Yichen Wei:
Towards High Performance Video Object Detection. CVPR 2018: 7210-7218
- [c15]Jiayuan Gu, Han Hu, Liwei Wang, Yichen Wei, Jifeng Dai:
Learning Region Features for Object Detection. ECCV (12) 2018: 392-406
- [i16]Jiayuan Gu, Han Hu, Liwei Wang, Yichen Wei, Jifeng Dai:
Learning Region Features for Object Detection. CoRR abs/1803.07066 (2018)
- [i15]Xizhou Zhu, Jifeng Dai, Xingchi Zhu, Yichen Wei, Lu Yuan:
Towards High Performance Video Object Detection for Mobiles. CoRR abs/1804.05830 (2018)
- [i14]Zheng Zhang, Dazhi Cheng, Xizhou Zhu, Stephen Lin, Jifeng Dai:
Integrated Object Detection and Tracking with Tracklet-Conditioned Detection. CoRR abs/1811.11167 (2018)
- [i13]Xizhou Zhu, Han Hu, Stephen Lin, Jifeng Dai:
Deformable ConvNets v2: More Deformable, Better Results. CoRR abs/1811.11168 (2018)
- 2017
- [c14]Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, Yichen Wei:
Deep Feature Flow for Video Recognition. CVPR 2017: 4141-4150
- [c13]Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji, Yichen Wei:
Fully Convolutional Instance-Aware Semantic Segmentation. CVPR 2017: 4438-4446
- [c12]Xizhou Zhu, Yujie Wang, Jifeng Dai, Lu Yuan, Yichen Wei:
Flow-Guided Feature Aggregation for Video Object Detection. ICCV 2017: 408-417
- [c11]Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei:
Deformable Convolutional Networks. ICCV 2017: 764-773
- [i12]Jifeng Dai, Haozhi Qi, Yuwen Xiong, Yi Li, Guodong Zhang, Han Hu, Yichen Wei:
Deformable Convolutional Networks. CoRR abs/1703.06211 (2017)
- [i11]Xizhou Zhu, Yujie Wang, Jifeng Dai, Lu Yuan, Yichen Wei:
Flow-Guided Feature Aggregation for Video Object Detection. CoRR abs/1703.10025 (2017)
- [i10]Han Hu, Jiayuan Gu, Zheng Zhang, Jifeng Dai, Yichen Wei:
Relation Networks for Object Detection. CoRR abs/1711.11575 (2017)
- [i9]Xizhou Zhu, Jifeng Dai, Lu Yuan, Yichen Wei:
Towards High Performance Video Object Detection. CoRR abs/1711.11577 (2017)
- 2016
- [c10]Jifeng Dai, Kaiming He, Jian Sun:
Instance-Aware Semantic Segmentation via Multi-task Network Cascades. CVPR 2016: 3150-3158
- [c9]Di Lin, Jifeng Dai, Jiaya Jia, Kaiming He, Jian Sun:
ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation. CVPR 2016: 3159-3167
- [c8]Jifeng Dai, Kaiming He, Yi Li, Shaoqing Ren, Jian Sun:
Instance-Sensitive Fully Convolutional Networks. ECCV (6) 2016: 534-549
- [c7]Jifeng Dai, Yi Li, Kaiming He, Jian Sun:
R-FCN: Object Detection via Region-based Fully Convolutional Networks. NIPS 2016: 379-387
- [i8]Jifeng Dai, Kaiming He, Yi Li, Shaoqing Ren, Jian Sun:
Instance-sensitive Fully Convolutional Networks. CoRR abs/1603.08678 (2016)
- [i7]Di Lin, Jifeng Dai, Jiaya Jia, Kaiming He, Jian Sun:
ScribbleSup: Scribble-Supervised Convolutional Networks for Semantic Segmentation. CoRR abs/1604.05144 (2016)
- [i6]Jifeng Dai, Yi Li, Kaiming He, Jian Sun:
R-FCN: Object Detection via Region-based Fully Convolutional Networks. CoRR abs/1605.06409 (2016)
- [i5]Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji, Yichen Wei:
Fully Convolutional Instance-aware Semantic Segmentation. CoRR abs/1611.07709 (2016)
- [i4]Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, Yichen Wei:
Deep Feature Flow for Video Recognition. CoRR abs/1611.07715 (2016)
- 2015
- [c6]Jifeng Dai, Kaiming He, Jian Sun:
Convolutional feature masking for joint object and stuff segmentation. CVPR 2015: 3992-4000
- [c5]Jifeng Dai, Kaiming He, Jian Sun:
BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation. ICCV 2015: 1635-1643
- [c4]Jifeng Dai, Ying Nian Wu:
Generative Modeling of Convolutional Neural Networks. ICLR (Poster) 2015
- [i3]Jifeng Dai, Kaiming He, Jian Sun:
BoxSup: Exploiting Bounding Boxes to Supervise Convolutional Networks for Semantic Segmentation. CoRR abs/1503.01640 (2015)
- [i2]Jifeng Dai, Kaiming He, Jian Sun:
Instance-aware Semantic Segmentation via Multi-task Network Cascades. CoRR abs/1512.04412 (2015)
- 2014
- [c3]Jifeng Dai, Yi Hong, Wenze Hu, Song-Chun Zhu, Ying Nian Wu:
Unsupervised Learning of Dictionaries of Hierarchical Compositional Models. CVPR 2014: 2505-2512
- [i1]Jifeng Dai, Kaiming He, Jian Sun:
Convolutional Feature Masking for Joint Object and Stuff Segmentation. CoRR abs/1412.1283 (2014)
- 2013
- [c2]Jifeng Dai, Ying Nian Wu, Jie Zhou, Song-Chun Zhu:
Cosegmentation and Cosketch by Unsupervised Learning. ICCV 2013: 1305-1312
- 2012
- [j2]Jifeng Dai, Jianjiang Feng, Jie Zhou:
Robust and Efficient Ridge-Based Palmprint Matching. IEEE Trans. Pattern Anal. Mach. Intell. 34(8): 1618-1632 (2012)
- [c1]Jifeng Dai, Jianjiang Feng, Jie Zhou:
Mining sub-categories for object detection. ICPR 2012: 3260-3263
- 2011
- [j1]Jifeng Dai, Jie Zhou:
Multifeature-Based High-Resolution Palmprint Recognition. IEEE Trans. Pattern Anal. Mach. Intell. 33(5): 945-957 (2011)
last updated on 2024-11-21 21:23 CET by the dblp team
all metadata released as open data under CC0 1.0 license