Transformers, Explained: Understand the Model Behind GPT-3, BERT, and T5 (Dale Markowitz)

Dale Markowitz: Transformers, Explained: Understand the Model Behind GPT-3, BERT, and T5. “Transformers are models that can be designed to translate text, write poems and op eds, and even generate computer code. In fact, lots of the amazing research I write about on daleonai.com is built on Transformers, like AlphaFold 2, the model that predicts the structures of proteins from their genetic sequences, as well as powerful natural language processing (NLP) models like GPT-3, BERT, T5, Switch, Meena, and others. You might say they’re more than meets the… ugh, forget it. If you want to stay hip in machine learning and especially NLP, you have to know at least a bit about Transformers. So in this post, we’ll talk about what they are, how they work, and why they’ve been so impactful.”
