Building an MLM Training Input Pipeline

jamescalam

James Briggs

Posted on July 5, 2021

The input pipeline is one of the more complex parts of the entire transformer build. It takes our raw OSCAR training data, transforms it, and prepares it for masked-language modeling (MLM). Finally, we load the data into a DataLoader, ready for training!
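
To make the MLM preparation step concrete, here is a minimal, library-free sketch of the masking and batching idea. Everything in it is an assumption made for illustration: the special-token ids, the 15% masking probability (the value commonly used for BERT-style models), and the helper names `mask_tokens` and `batches` are hypothetical and not taken from the original pipeline. In a real PyTorch setup the sequences would be tensors wrapped in a dataset and served by a DataLoader rather than plain lists.

```python
import random

# Hypothetical special-token ids, chosen only for this illustration.
CLS_ID, PAD_ID, SEP_ID, MASK_ID = 0, 1, 2, 3
SPECIAL_IDS = {CLS_ID, PAD_ID, SEP_ID}


def mask_tokens(input_ids, mask_prob=0.15, rng=None):
    """Return (masked_ids, labels) for one tokenized sequence.

    labels is an untouched copy of the original ids; masked_ids has
    roughly mask_prob of the non-special tokens replaced by MASK_ID.
    """
    if rng is None:
        rng = random.Random()
    labels = list(input_ids)
    masked = list(input_ids)
    for i, token_id in enumerate(input_ids):
        # Never mask special tokens such as [CLS], [SEP], or padding.
        if token_id not in SPECIAL_IDS and rng.random() < mask_prob:
            masked[i] = MASK_ID
    return masked, labels


def batches(samples, batch_size):
    """Yield consecutive slices of samples, each at most batch_size long."""
    for start in range(0, len(samples), batch_size):
        yield samples[start:start + batch_size]


if __name__ == "__main__":
    # A toy "tokenized" sequence: [CLS], ten word ids, [SEP], two pads.
    sequence = [CLS_ID] + list(range(10, 20)) + [SEP_ID, PAD_ID, PAD_ID]
    masked, labels = mask_tokens(sequence, rng=random.Random(0))
    print("input ids:", masked)
    print("labels:   ", labels)
```

The model is then trained to recover the ids in `labels` at the positions where `masked` holds `MASK_ID`. Passing a seeded `random.Random` keeps the masking reproducible while experimenting.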
