Completed

Smart Practice: Learning to Practice Skills from Demonstrations

In order to solve complex tasks, intelligent agents learn to perform a variety of skills. These skills abstract key information necessary for agents to act in real-world environments with high-dimensional state and action spaces. The goal of the Smart Practice project is to develop learning algorithms that...

Autonomous Skill Discovery Through Self-Supervised Exploration

Unsupervised Skill Discovery

Abstract

We introduce Contrastive Intrinsic Control (CIC) - an algorithm for unsupervised skill discovery that maximizes the mutual information between skills and state transitions. In contrast to most prior approaches, CIC uses a decomposition of the mutual information that explicitly incentivizes diverse behaviors by...

Knowledge Transferable Bayesian Optimization

Abstract

The need for automated optimization has become very important in many domains including hyper-parameter tuning in ML or in manufacturing industry. In practice, one frequently has to solve similar optimization problems for a specific customized setting, e.g. manufacturing robots optimized for a new customer environment or hyper-parameter optimization for a new classification task....

Better Visual Representations through Language Supervision

Overview

CLIP [1] demonstrates the power of simple contrastive learning on a large-scale dataset of image and caption pairs collected directly from the internet, without expensive manual labeling. In our project we seek to improve the data efficiency of CLIP and performance when trained on uncurated datasets, as well as explore additional capabilities beyond classification.

Data Efficiency

CLIP achieves impressive results when trained on WIT400M, a dataset of 400M image and caption pairs collected by searching for a curated list of...

Automatic Curriculum Generation and Emergent Complexity via Inter-agent Competition

Reinforcement Learning (RL) has been most successful when agents can collect extensive training experience in a simulated environment [1-4]. However, building simulated environments requires a great deal of manual effort, is error prone, and is unlikely to cover the space of all real world tasks. Inter-agent competition has...

On Theoretical Foundations for Transfer Learning

We aim to provide statistical guarantees for transfer learning in sequential settings under dynamic feedback. This can enable...

Unsupervised Learning of Visual Context from Instance Segmentation

Unsupervised representation learning aims to extract latent information from data that reflects their semantic categories. Contrastive learning has emerged as a direct winning alternative to self-supervised learning. We take a step further and propose to extract latent information that also reflects their visual context.

Overview

Instead of...

Barcode Image Denoise - RL Approach

Barcode reading has various applications in warehouses and as such is critical to supply chain automation. In this project, we develop a pipeline for decoding 1D and 2D segmented barcode images. Our pipeline consists of two parts: denoising network and angle correction. The denoising network deals with removing noise so as to improve the decoding rate of barcodes using open source software such as Zerbra. Angle correction has to do with applying rotation algorithms to segmented barcode images without sacrificing their spatial resolution and introducing anti-aliasing. ...