Google's AI-Powered Robot Can Grasp Occluded Objects, Do Never-Seen-Before Tasks

  • The first study details X-Ray, an algorithm that when deployed on a robot can search through heaps of objects to grasp a target object, while the second lays out a policy adaptation technique that “teaches” robots skills without requiring from-scratch model training.

  • In physical experiments involving a real-world ABB YuMi robot with a suction cup and a parallel jaw gripper, the researchers tasked X-Ray with filling a bin with objects and then dumping the bin on top of the target object.

  • Sampling from an open source data set of 1,296 3D CAD models on Thingiverse, they selected 10 box targets of various dimensions with equal volume but small thickness, so that they were more likely to be occluded.


In a pair of papers published on the preprint server Arxiv.org this week, Google and University of California, Berkeley researchers describe new AI and machine learning techniques that enable robots to adapt to never-before-seen tasks and grasp occluded objects. The first study details X-Ray, an algorithm that when deployed on a robot can search through heaps of objects to grasp a target object, while the second lays out a policy adaptation technique that “teaches” robots skills without requiring from-scratch model training.
 

Robot grasping is a surprisingly difficult challenge. For example, robots struggle to perform what’s called “mechanical search,” which is when they have to identify and pick up an object from within a pile of other objects. Most robots aren’t especially adaptable, and there’s a lack of sufficiently capable AI models for guiding robot hands in mechanical search. X-Ray and the policy adaptation step could form the foundation of a product-packaging system that spots, picks up, and drops a range of objects without human oversight.
 

 

X-Ray

 

 

The coauthors of the study about X-Ray note that mechanical search — finding objects in a heap of objects — remains challenging due to a lack of appropriate models. X-Ray tackles the problem with a combination of occlusion inference and hypothesis predictions, which it uses to estimate an occupancy distribution for the bounding box (coordinates for a rectangular border around an object) most similar to an object while accounting for various translations and rotations.
 

X-Ray assumes that there’s at least one target object fully or partially occluded by unknown objects in a heap, and that a maximum of one object is grasped per timestep. Taking RGB images and target objects as inputs, it predicts the occupancy distribution and segmentation masks for the scene and computes several potential grasping actions, executing the one with the highest probability of succeeding.
 

To train and validate X-Ray, the researchers produced a corpus of 10,000 augmented depth images labeled with object occupancy distributions for a rectangular box target object. Sampling from an open source data set of 1,296 3D CAD models on Thingiverse, they selected 10 box targets of various dimensions with equal volume but small thickness, so that they were more likely to be occluded. This netted them a total of 100,000 images.

 


About 8,000 of those 10,000 images were reserved for training, and the rest were set aside for testing. One thousand additional images containing simulated objects — a lid, a domino, and a flute — were used to evaluate X-Ray’s generalization to unseen shapes, objects, aspect ratios, and scales.

 

In physical experiments involving a real-world ABB YuMi robot with a suction cup and a parallel jaw gripper, the researchers tasked X-Ray with filling a bin with objects and then dumping the bin on top of the target object. In heaps initially containing 25 objects, the system extracted the target object in a median of 5 actions over 20 trials with a 100% success rate.
 

Read More: MIT CSAIL’S COMMPLAN AI HELPS ROBOTS EFFICIENTLY COLLABORATE WITH HUMANS
 

The coauthors leave to future work increasing X-Ray’s training efficiency and analyzing the effect of data set size and the number of translations and rotations used to generate training distributions. They also plan to explore reinforcement learning policies based on the reward of target object visibility.
 

 

Policy adaptation


In the more recent of the two papers, the coauthors sought to develop a system that continuously adapts to new real-world environments, objects, and conditions. That’s in contrast to most robots, which are trained once and deployed without much in the way of adaptation capabilities.
 

The researchers pretrained a machine learning model to grasp a range of objects on a corpus of 608,000 grasp attempts, which they then tasked with grasping objects using a gripper moved 10 centimeters to the right of where it started. After the system practiced gripping for a while (over the course of 800 attempts) and logged those attempts into a new data set — a target data set — the new attempts were mixed in 50% of the time with the original data set to fine-tune the model.
 

These steps — pretraining, attempting a new task, and fine-tuning — were repeated for five different scenarios. In one, harsh lighting impeded the robot’s cameras; in another, a checkerboard-patterned background made it difficult for the model to identify objects. Lastly, the experimenters had the robot grasp transparent bottles not seen during training (transparent objects are notoriously hard for robots to grasp because they sometimes confuse depth sensors) and pick up objects sitting on a highly reflective sheet metal surface.
 

The researchers report that in experiments, the model successfully grasped objects 63% of the time in harsh lighting, 74% of the time with transparent bottles, 86% of the time with a checkerboard backing, 88% of the time with an extended gripper, and 91% of the time with an offset gripper. Moreover, they say that it only took 1 to 4 hours of practice for the robot to adapt to new situations (compared with roughly 6,000 hours learning how to grasp) and that performance didn’t degrade the more the model adapted.

 


In the future, the team plans to investigate whether the process can be made automatic.


Read More: DILIGENT ROBOTICS RAISES $10 MILLION FOR NURSE ASSISTANT ROBOT MOXI

Spotlight

Other News
AI Tech

AI and Big Data Expo North America announces leading Speaker Lineup

TechEx Events | March 07, 2024

AI and Big Data Expo North America announces new speakers! SANTA CLARA, CALIFORNIA, UNITED STATES, February 26, 2024 /EINPresswire.com/ -- TheAI and Big Expo North America, the leading event for Enterprise AI, Machine Learning, Security, Ethical AI, Deep Learning, Data Ecosystems, and NLP, has announced a fresh cohort of distinguishedspeakersfor its upcoming conference at the Santa Clara Convention Center on June 5-6, 2024. Some of the top industry speakers set to take the stage are: - Sam Hamilton - Head of Data & AI – Visa - Dr Astha Purohit - Director - Product (Tech) Ops – Walmart - Noorddin Taj - Head of Architecture and Design of Intelligent Operations - BP - Temi Odesanya - Director - AI Governance Automation - Thomson Reuters - Katie Sanders - Assistant Vice President – Tech - Union Pacific Railroad - Prasanth Nandanuru – SVP - Wells Fargo - Rodney Brooks - Professor Emeritus - MIT These esteemed speakers bring a wealth of knowledge and expertise to an already impressive lineup, promising attendees a truly enlightening experience. In addition to the speakers, theAI and Big Data Expo North Americawill feature a series of presentations covering a diverse range of topics in AI and Big Data exploring the latest innovations, implementations and strategies across a range of industries. Attendees can expect to gain valuable insights and practical strategies from presentations such as: How Gen AI Positively Augments Workforce Capabilities Trends in Computer Vision: Applications, Datasets, and Models Getting to Production-Ready: Challenges and Best Practices for Deploying AI Ensuring Your AI is Responsible and Ethical Mitigating Bias and Promoting Fairness in AI Systems Security Challenges in the Era of Gen AI and Data Science AI for Good: Social Impact and Ethics Selling Data Democratization to Executives Spreading Data Insights across the Business Barriers to Overcome: People, Processes, and Technology Optimizing the Customer Experience with AI Using AI to Drive Growth in a Regulated Industry Building an MLOps Foundation for AI at Scale The Expo offers a platform for exploration and discovery, showcasing how cutting-edge technologies are reshaping a myriad of industries, including manufacturing, transport, supply chain, government, legal sectors, financial services, energy, utilities, insurance, healthcare, retail, and more. Attendees will have the chance to witness firsthand the transformative power of AI and Big Data across various sectors, gaining insights that are crucial for staying ahead in today's rapidly evolving technological landscape. Anticipating a turnout of over 7000 attendees and featuring 200 speakers across various tracks, AI and Big Data Expo North America offers a unique opportunity for CTO’s, CDO’s, CIO’s , Heads of IOT, AI /ML, IT Directors and tech enthusiasts to stay abreast of the latest trends and innovations in AI, Big Data and related technologies. Organized by TechEx Events, the conference will also feature six co-located events, including the IoT Tech Expo, Intelligent Automation Conference, Cyber Security & Cloud Congress, Digital Transformation Week, and Edge Computing Expo, ensuring a comprehensive exploration of the technological landscape. Attendees can choose from various ticket options, providing access to engaging sessions, the bustling expo floor, premium tracks featuring industry leaders, a VIP networking party, and a sophisticated networking app facilitating connections ahead of the event. Secure your ticket with a 25% discount on tickets, available until March 31st, 2024. Save up to $300 on your ticket and be part of the conversation shaping the future of AI and Big Data technologies. For more information and to secure your place at AI and Big Data Expo North America, please visit https://www.ai-expo.net/northamerica/. About AI and Big Data Expo North America: The AI and Big Data Expo North America is a leading event in the AI and Big Data landscape, serving as a nexus for professionals, industry experts, and enthusiasts to explore and navigate the ever-evolving technological frontier. Through its focus on education, networking, and collaboration, the Expo continues to be a beacon for those eager to stay at the forefront of technological innovation. “AI and Big Data Expo North Americais a part ofTechEx. For more information regardingTechExplease see onlinehere.”

Read More