Learning to summarize with human feedback
We’ve applied reinforcement learning from human feedback to train language models that are better at summarization....
Our third class of OpenAI Scholars presented their final projects at virtual Demo Day, showcasing their research results from over the past ...
We’re excited to announce that OpenAI is co-organizing two NeurIPS 2020 competitions with AIcrowd, Carnegie Mellon University, and DeepMind,...
We find that, just as a large transformer model trained on language can generate coherent text, the same exact model trained on pixel sequen...
We’re releasing an API for accessing new AI models developed by OpenAI....
We’re releasing an analysis showing that since 2012 the amount of compute needed to train a neural net to the same performance on ImageNet c...
We’re introducing Jukebox, a neural net that generates music, including rudimentary singing, as raw audio in a variety of genres and artist ...
We’ve contributed to a multi-stakeholder report by 58 co-authors at 30 organizations, including the Centre for the Future of Intelligence, M...
We’re introducing OpenAI Microscope, a collection of visualizations of every significant layer and neuron of eight vision “model organisms” ...
We are standardizing OpenAI’s deep learning framework on PyTorch....
We show that the double descent phenomenon occurs in CNNs, ResNets, and transformers: performance first improves, then gets worse, and then ...
We’re releasing Procgen Benchmark, 16 simple-to-use procedurally-generated environments which provide a direct measure of how quickly a rein...
We’re releasing Safety Gym, a suite of environments and tools for measuring progress towards reinforcement learning agents that respect safe...
As the final model release of GPT-2’s staged release, we’re releasing the largest version (1.5B parameters) of GPT-2 along with code and mod...
We’ve trained a pair of neural networks to solve the Rubik’s Cube with a human-like robot hand. The neural networks are trained entirely in ...
We are now accepting applications for our third class of OpenAI Scholars....