Featured Article
                ResLNet: deep residual LSTM network with longer input for action recognition
            Abstract:
                    Action recognition is an important research topic in video analysis that remains very challenging. Effective recognition relies on learning a good representation of both spatial information (for appearance) and temporal information (for motion). These two kinds of information are highly correlated but have quite different properties, leading to unsatisfying results from both connecting independent models (e.g., CNN-LSTM) and direct unbiased co-modeling (e.g., 3D CNN). Besides, a long-lasting tradition on this task with deep learning models is to use just 8 or 16 consecutive frames as input, making it hard to extract discriminative motion features. In this work, we propose a novel network structure called ResLNet (deep residual LSTM network), which can take longer inputs (e.g., of 64 frames) and have convolutions collaborate with LSTM more effectively under the residual structure to learn better spatial-temporal representations than before, without the cost of extra computation, thanks to the proposed embedded variable-stride convolution. The superiority of this proposal and its ablation study are shown on the three most popular benchmark datasets: Kinetics, HMDB51, and UCF101. The proposed network can be adopted for various features, such as RGB and optical flow. Due to the limitation of the computation power of our experimental equipment and the real-time requirement, the proposed network is tested on RGB only and shows great performance.
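The abstract claims that longer inputs (64 frames instead of 8 or 16) come "without the cost of extra computations" thanks to the embedded variable-stride convolution. The arithmetic behind that claim can be sketched minimally: if the temporal stride grows in proportion to the input length, the number of convolution positions (and so the per-clip FLOPs of that layer) stays constant. The function and the specific 16-frame/64-frame stride pairing below are illustrative assumptions, not the paper's actual configuration.

```python
def temporal_positions(num_frames: int, stride: int) -> int:
    """Number of output positions of a strided temporal convolution
    (ignoring kernel-size edge effects)."""
    return num_frames // stride

# Conventional short input: 16 frames at stride 1.
short = temporal_positions(16, 1)

# Longer input, as in ResLNet's setting, with the stride scaled up to match:
# 64 frames at stride 4 yield the same number of positions.
longer = temporal_positions(64, 4)

print(short, longer)  # → 16 16
```

Under this reading, the longer clip widens the temporal receptive field of the motion features while the layer's workload stays roughly flat.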
                Keywords:
                    
                CLC Number:
                    
                Authors:
                    
                        Tian WANG; Jiakun LI; Huai-Ning WU; Ce LI; Hichem SNOUSSI; Yang WU
                    
                Author Affiliations:
                    Institute of Artificial Intelligence, Beihang University, Beijing 100191, China; School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, China; College of Electrical and Information Engineering, Lanzhou University of Technology, Lanzhou 730050, China; Institute Charles Delaunay-LM2S FRE CNRS 2019, University of Technology of Troyes, Troyes 10010, France; Institute for Research Initiatives, Nara Institute of Science and Technology, Nara 630-0192, Japan
                Source:
                    
                Citation:
                    
                        [1] Tian WANG, Jiakun LI, Huai-Ning WU, Ce LI, Hichem SNOUSSI, Yang WU. ResLNet: deep residual LSTM network with longer input for action recognition[J]. Frontiers of Computer Science, 2022(06): 39-47.
                    
            The machine-generated CLC number is produced automatically by 域田数据科技 from publicly available online data, and is provided for study and research reference only.