OpenAI Scholars 2018: Final projects
Our first cohort of OpenAI Scholars has now completed the program....
OpenAI Five lost two games against top Dota 2 players at The International in Vancouver this week, maintaining a good chance of winning for ...
Yesterday, OpenAI Five won a best-of-three against a team of 99.95th percentile Dota players: Blitz, Cap, Fogged, Merlini, and MoonMeander—f...
We’ve trained a human-like robot hand to manipulate physical objects with unprecedented dexterity....
Our first class of OpenAI Scholars is underway, and you can now follow along as this group of experienced software developers becomes machin...
The OpenAI Five Benchmark match is now over!...
We introduce Glow, a reversible generative model which uses invertible 1x1 convolutions. It extends previous work on reversible generative m...
We’ve trained an agent to achieve a high score of 74,500 on Montezuma’s Revenge from a single human demonstration, better than any previousl...
Our team of five neural networks, OpenAI Five, has started to defeat amateur human teams at Dota 2....
The first run of our Retro Contest—exploring the development of algorithms that can generalize from previous experience—is now complete....
We’ve obtained state-of-the-art results on a suite of diverse language tasks with a scalable, task-agnostic system, which we’re also releasi...
We’re now accepting applications for the next cohort of OpenAI Fellows, a program which offers a compensated 6-month apprenticeship in AI re...
We’re releasing the full version of Gym Retro, a platform for reinforcement learning research on games. This brings our publicly-released ga...
We’re releasing an analysis showing that since 2012, the amount of compute used in the largest AI training runs has been increasing exponent...
We’re proposing an AI safety technique which trains agents to debate topics with one another, using a human to judge who wins....
We’re releasing an experimental metalearning approach called Evolved Policy Gradients, a method that evolves the loss function of learning a...