-
Google Deepmind
- San Francisco, CA
- http://avisingh.org
Stars
A simple, performant and scalable Jax LLM!
Implementation of Perceiver AR, Deepmind's new long-context attention network based on Perceiver architecture, in Pytorch
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
Official implementation for the paper "Offline Meta RL - Identifiability Challenges and Effective Data Collection Strategies", NeurIPS 2021.
Train robotic agents to learn pick and place with deep learning for vision-based manipulation in PyBullet. Transporter Nets, CoRL 2020.
Simulator of vision-based tactile sensors.
ROS packages for the InterbotiX X-series family of robotic arms and turrets
Latex code for making neural networks diagrams
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
Author's PyTorch implementation of BCQ for continuous and discrete actions
Efficient Exploration via State Marginal Matching (2019)
Distributed machine learning infrastructure for large-scale robotics research
robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
Collection of quotes on notation design & how it affects thought.
Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.
Author's PyTorch implementation of TD3 for OpenAI gym tasks
Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.
A toolkit for reproducible reinforcement learning research.
Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
hill-a / stable-baselines
Forked from openai/baselinesA fork of OpenAI Baselines, implementations of reinforcement learning algorithms
MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.
Code for the paper "On First-Order Meta-Learning Algorithms"
Collection of reinforcement learning algorithms