Representative Literature
Transformers in computational visual media: A survey
Abstract:
Transformers, the dominant architecture for natural language processing, have also recently attracted much attention from computational visual media researchers due to their capacity for long-range representation and high performance. Transformers are sequence-to-sequence models, which use a self-attention mechanism rather than the RNN sequential structure. Thus, such models can be trained in parallel and can represent global information. This study comprehensively surveys recent visual transformer works. We categorize them according to task scenario: backbone design, high-level vision, low-level vision and generation, and multimodal learning. Their key ideas are also analyzed. Differing from previous surveys, we mainly focus on visual transformer methods in low-level vision and generation. The latest works on backbone design are also reviewed in detail. For ease of understanding, we precisely describe the main contributions of the latest works in the form of tables. As well as giving quantitative comparisons, we also present image results for low-level vision and generation tasks. Computational costs and source code links for various important works are also given in this survey to assist further development.
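The scaled dot-product self-attention the abstract contrasts with RNN recurrence can be sketched in a few lines. This is an illustrative sketch, not code from the surveyed paper; the names `self_attention`, `Wq`, `Wk`, and `Wv` are assumptions for the example:

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Scaled dot-product self-attention over a token sequence X of shape (n, d_model).

    Every token attends to every other token through one matrix product,
    which is why transformers can be trained in parallel across the sequence
    and represent global context, unlike an RNN's step-by-step recurrence.
    """
    Q, K, V = X @ Wq, X @ Wk, X @ Wv              # project tokens to queries/keys/values
    d_k = K.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)               # (n, n) pairwise token affinities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ V                            # (n, d_v) context-mixed representations

# Toy example: 4 tokens, model dimension 8, head dimension 4.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 8))
Wq, Wk, Wv = (rng.normal(size=(8, 4)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)
```

Because the `(n, n)` affinity matrix is computed in one shot, the whole sequence is processed simultaneously rather than token by token.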
Keywords:
Authors:
Yifan Xu; Huapeng Wei; Minxuan Lin; Yingying Deng; Kekai Sheng; Mengdan Zhang; Fan Tang; Weiming Dong; Feiyue Huang; Changsheng Xu
Affiliations:
NLPR, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China; School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100040, China; School of Artificial Intelligence, Jilin University, Changchun 130012, China; Youtu Lab, Tencent Inc., Shanghai 200233, China; CASIA-LLVISION Joint Lab, Beijing 100190, China
Citation:
[1] Yifan Xu; Huapeng Wei; Minxuan Lin; Yingying Deng; Kekai Sheng; Mengdan Zhang; Fan Tang; Weiming Dong; Feiyue Huang; Changsheng Xu. Transformers in computational visual media: A survey [J]. Computational Visual Media, 2022(01): 33-62.