33
/pt/
AIzaSyB4mHJ5NPEv-XzF7P6NDYXjlkCWaeKw5bc
November 1, 2025
8571175
809611
2
Public Timelines
FAQ Receber premium

9 abr 2022 ano - Chinchilla large language model released

Descrição:

DeepMind has developed a new language model called Chinchilla, which has 70 billion parameters. Chinchilla significantly outperforms Gopher (280B parameters) and GPT-3 (175B parameters) on a large range of downstream evaluation tasks. This is despite the fact that Chinchilla is trained on a dataset that is half the size of the dataset used to train Gopher and GPT-3. The researchers believe that this is due to the use of a new training method called PROJUNN, which allows for more efficient updates to unitary matrices. PROJUNN is a promising new method for training large language models, and it could lead to the development of even more powerful language models in the future.

Adicionado na linha do tempo:

Data:

9 abr 2022 ano
Agora
~ 3 years and 6 months ago

Imagens:

YouTube: