The goal of the project was to isolate the failure modes of controllers trained via reinforcement learning, in an effort to increase the transparency of machine learning models. Our focus was on improving the robustness of an already trained model from NVIDIA, namely the in-hand manipulation controller DeXtreme.
We applied adversarial RL models to 'learn' the failure cases of the DeXtreme model and, once those failure cases were known, improved the DeXtreme controller to be robust against them.
The adversaries added noise to the inputs and outputs of the controller network, as sketched below.
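A minimal sketch of this setup, assuming a PyTorch controller: an adversary network produces bounded noise that is added to the controller's observation (input) and action (output). All names and parameters here (`Adversary`, `perturbed_step`, `eps_obs`, `eps_act`) are illustrative assumptions, not DeXtreme's actual interfaces.

```python
import torch
import torch.nn as nn

class Adversary(nn.Module):
    """Maps the current observation to bounded noise for obs and action."""
    def __init__(self, obs_dim: int, act_dim: int, hidden: int = 64):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(obs_dim, hidden), nn.Tanh(),
            nn.Linear(hidden, obs_dim + act_dim), nn.Tanh(),  # output in [-1, 1]
        )

    def forward(self, obs: torch.Tensor):
        noise = self.net(obs)
        # Split the output into an observation perturbation and an action perturbation.
        return noise[..., :obs.shape[-1]], noise[..., obs.shape[-1]:]

def perturbed_step(controller, adversary, obs, eps_obs=0.05, eps_act=0.05):
    """Run one control step with adversarial noise on both input and output."""
    obs_noise, act_noise = adversary(obs)
    action = controller(obs + eps_obs * obs_noise)  # noise on the controller's input
    return action + eps_act * act_noise             # noise on the controller's output
```

In an adversarial RL setting of this kind, the adversary would be trained to minimize the controller's task reward, steering rollouts toward the failure cases.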
A residual network was attached to the end of the controller in order to learn robustness against the adversarial noise.
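A hedged sketch of that residual architecture, assuming the pretrained controller stays frozen while only the appended correction network is trained; `ResidualPolicy` and its layer sizes are hypothetical, not the project's exact implementation.

```python
import torch
import torch.nn as nn

class ResidualPolicy(nn.Module):
    """Frozen base controller plus a trainable additive correction."""
    def __init__(self, base_controller: nn.Module, obs_dim: int, act_dim: int):
        super().__init__()
        self.base = base_controller
        for p in self.base.parameters():  # keep the pretrained controller fixed
            p.requires_grad_(False)
        self.residual = nn.Sequential(
            nn.Linear(obs_dim + act_dim, 128), nn.ReLU(),
            nn.Linear(128, act_dim),
        )

    def forward(self, obs: torch.Tensor) -> torch.Tensor:
        base_action = self.base(obs)
        # The residual sees both the observation and the base action,
        # and learns a correction that counteracts the adversarial noise.
        correction = self.residual(torch.cat([obs, base_action], dim=-1))
        return base_action + correction
```

The design choice here is that the residual only needs to learn a small corrective term rather than the whole manipulation skill, which keeps the original controller's behavior intact away from the failure cases.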