Growth Paths for the Coming Superintelligence
... from the text corpus used in training and a given prompt, and does so quite fast, learning to adapt to new environments or operating within limited computing resources are out of the question. Training is a one-off process—it takes over a month to train ChatGPT on a high-performance GPU cluster and would take over 300 years on a single card. As a result, we cannot yet talk about minimizing power consumption either—running models like ChatGPT today requires as much energy as cryptocurrency mining....