Collaborative Pushing and Grasping of Tightly Stacked Objects via Deep Reinforcement Learning|Yuxiang Yang|Zhejiang Provincial Key Laboratory of Equipment Electronics,Hangzhou 310018,China - 期刊导航|首站-论文投稿智能助手|论文发表|论文智能投稿|期刊自助发表推荐|杂志社快速发表|查同导刊-域田数据官方网站

典型文献

Collaborative Pushing and Grasping of Tightly Stacked Objects via Deep Reinforcement Learning

文献摘要：

Directly grasping the tightly stacked objects may cause collisions and result in failures, degenerating the functionality of robotic arms. Inspired by the observation that first pushing objects to a state of mutual separation and then grasping them individually can effectively increase the success rate, we devise a novel deep Q-learning framework to achieve collaborative pushing and grasping. Specifically, an efficient non-maximum suppression policy (PolicyNMS) is proposed to dynamically evaluate pushing and grasping actions by enforcing a suppression constraint on unreasonable actions. Moreover, a novel data-driven pushing reward network called PR-Net is designed to effectively assess the degree of separation or aggregation between objects. To benchmark the proposed method, we establish a dataset containing common household items dataset (CHID) in both simulation and real scenarios. Although trained using simulation data only, experiment results validate that our method generalizes well to real scenarios and achieves a 97％ grasp success rate at a fast speed for object separation in the real-world environment.

文献关键词：

中图分类号：

[1] 医药、卫生（R） / 基础医学（R3） / 病理学（R36） / 病理过程（R364）

[2] 自动化技术、计算机技术（TP） / 计算技术、计算机技术（TP3） / 计算机的应用（TP39） / 信息处理(信息加工)（TP391）

[3] 医药、卫生（R） / 药学（R9） / 药理学（R96） / 实验药理学（R965）

作者姓名：

Yuxiang Yang

作者机构：

School of Electronics and Information,Hangzhou Dianzi University,Hangzhou;Zhejiang Provincial Key Laboratory of Equipment Electronics,Hangzhou 310018,China;School of Computer Science,Faculty of Engineering,University of Sydney,Darlington,NSW 2006,Australia;JD Explore Academy,JD.com,Beijing 101111,China

文献出处：

自动化学报（英文版）

引用格式：

[1]Yuxiang Yang-.Collaborative Pushing and Grasping of Tightly Stacked Objects via Deep Reinforcement Learning)[J].自动化学报（英文版）,2022(01):135-145

A类：

degenerating,PolicyNMS,CHID

B类：

Collaborative,Pushing,Grasping,Tightly,Stacked,Objects,via,Deep,Reinforcement,Learning,Directly,grasping,tightly,stacked,objects,may,cause,collisions,failures,functionality,robotic,arms,Inspired,by,observation,that,first,pushing,state,mutual,separation,then,them,individually,can,effectively,increase,success,rate,devise,novel,deep,learning,framework,collaborative,Specifically,efficient,maximum,suppression,policy,proposed,dynamically,evaluate,actions,enforcing,constraint,unreasonable,Moreover,driven,reward,network,called,PR,Net,designed,assess,degree,aggregation,between,To,benchmark,method,establish,dataset,containing,common,household,items,both,simulation,real,scenarios,Although,trained,using,only,experiment,results,validate,our,generalizes,well,achieves,fast,speed,world,environment

AB值：

0.644434

相似文献

Joint Scheduling and Resource Allocation for Federated Learning in SWIPT-Enabled Micro UAV Swarm Networks

Wanli Wen;Yunjian Jia;Wenchao Xia-School of Microelectronics and Communication Engineering,Chongqing University,Chongqing 400044,China;National Mobile Communications Research Laboratory,Southeast University,Nanjing 210009,China;Jiangsu Key Laboratory of Wireless Communications,Nanjing University of Posts and Telecommunications,Nanjing 210003,China

Collaborative Clustering Parallel Reinforcement Learning for Edge-Cloud Digital Twins Manufacturing System

Fan Yang;Tao Feng;Fangmin Xu;Huiwen Jiang;Chenglin Zhao-School of Information and Communication Engineering,Beijing University of Posts and Telecommunications,Beijing 100876,China;China Tendering Center for Mechanical and Electrical Equipment,Ministry of Industry and Information Technology,Beijing 100142,China

Object Grasping Detection Based on Residual Convolutional Neural Network

WU Di;WU Nailong;SHI Hongrui-College of Information Science and Technology,Donghua University,Shanghai 201620,China

LSTM-Based Adaptive Modulation and Coding for Satellite-to-Ground Communications

Shiqi Zhang;Guoxin Yu;Shanping Yu;Yanjun Zhang;Yu Zhang-School of Information and Electronics, Beijing Institute of Technology, Beijing 100081, China;School of Cyberspace Science and Technology, Beijing Institute of Technology, Beijing 100081, China

Action Status Based Novel Relative Feature Representations for Interaction Recognition