The Computation Limits of Deep Learning

Deep learning's recent history has been one of achievement: from triumphing over humans in the game of Go to world-leading performance in image classification, voice recognition, translation, and other tasks. But this progress has come with a voracious appetite for computing power. This project catalogs the extent of this dependency, showing that progress across a wide variety of applications is strongly reliant on increases in computing power. Extrapolating this reliance forward reveals that progress along current lines is rapidly becoming economically, technically, and environmentally unsustainable. Thus, continued progress in these applications will require dramatically more computationally efficient methods, which will have to come either from changes to deep learning or from moving to other machine learning methods.


Image classification on ImageNet

Model soups (ViT-G/14)
ViT-MoE-15B (Every-2)
Meta Pseudo Labels (EfficientNet-L2)
TokenLearner L/8 (24+11)
ALIGN (EfficientNet-L2)
EfficientNet-L2-475 (SAM)

Want to contribute?

You have access to our database, where you can point out errors or suggest changes.
