pytorch – Page 2

1.2 – Fine Tune a Transformer Model (2/2)

Posted on December 2, 2021 by Aritra Sen

In the last post we saw that a BERT model can be fine-tuned using either of two techniques: (1) update the weights of the pre-trained BERT model along with the classification layer, or (2) update only the weights of the classification layer and leave the pre-trained BERT model unchanged. This process amounts to using the pre-trained BERT model as…
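As a rough illustration of the two techniques, here is a minimal PyTorch sketch. It is not the code from the post: TinyClassifier, its encoder and classifier attributes, and the layer sizes are all hypothetical stand-ins, with a small feed-forward encoder taking the place of the pre-trained BERT model so that the example runs without downloading any weights. The only difference between the two setups is whether the encoder's parameters have requires_grad switched off.

```python
import torch
from torch import nn


class TinyClassifier(nn.Module):
    """Hypothetical stand-in: `encoder` plays the role of pre-trained BERT."""

    def __init__(self, in_features=8, hidden=16, num_labels=2):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_features, hidden), nn.Tanh())
        self.classifier = nn.Linear(hidden, num_labels)

    def forward(self, x):
        return self.classifier(self.encoder(x))


def count_trainable(model):
    """Number of parameters that will receive gradient updates."""
    return sum(p.numel() for p in model.parameters() if p.requires_grad)


# Technique 1: every weight (encoder and classification layer) is trainable.
full = TinyClassifier()

# Technique 2: freeze the encoder so only the classification layer is updated.
frozen = TinyClassifier()
for param in frozen.encoder.parameters():
    param.requires_grad = False

# Hand the optimizer only the parameters that are still trainable.
trainable_params = [p for p in frozen.parameters() if p.requires_grad]
optimizer = torch.optim.AdamW(trainable_params, lr=1e-3)

print("trainable parameters, technique 1:", count_trainable(full))
print("trainable parameters, technique 2:", count_trainable(frozen))
```

With the encoder frozen, only the classification layer's parameters are handed to the optimizer, so far fewer weights are updated at each step.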

1.0 – Getting started with Transformers for NLP

Posted on November 22, 2021 by Aritra Sen

In this post we will walk through a hands-on implementation with the Hugging Face transformers library to solve a few simple NLP tasks. We will focus mainly on the hands-on part; if you are interested in learning more about transformers and the attention mechanism, a few resources are listed below: Getting started with Google BERT; Neural Machine…

Deep Learning with Pytorch – Text Generation – LSTMs – 3.3

Posted on July 1, 2019 by Aritra Sen

So far in this Deep Learning with PyTorch series we have seen how to work with tabular data, images, and time series data; in this post we will see how to work with plain text data. Along with generating text with the help of LSTMs, we will also learn two other important…

Deep Learning with Pytorch – Sequence Modeling – LSTMs – 3.2

Posted on June 20, 2019 by Aritra Sen

In the previous article on RNNs we covered the RNN architecture. RNNs perform poorly when they need to maintain long-term dependencies (the earlier layers of an RNN suffer from vanishing gradients). This problem is largely addressed by the LSTM (Long Short-Term Memory) architecture…

Deep Learning with Pytorch – Custom Weight Initialization – 1.5

Posted on May 26, 2019 by Aritra Sen

From the images below of the sigmoid and tanh activation functions, we can see that for large positive (or large negative) values of z (plotted on the x-axis, where z = wx + b) the derivative is close to zero. So for large values of z we will have vanishing gradients…
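To make the claim concrete without the plots, the short sketch below evaluates the derivatives of sigmoid and tanh at a few values of z using only Python's math module. The helper names and the sample values of z are our own choices for illustration.

```python
import math


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def sigmoid_grad(z):
    # d/dz sigmoid(z) = sigmoid(z) * (1 - sigmoid(z))
    s = sigmoid(z)
    return s * (1.0 - s)


def tanh_grad(z):
    # d/dz tanh(z) = 1 - tanh(z)^2
    return 1.0 - math.tanh(z) ** 2


# Both derivatives peak at z = 0 and shrink towards zero as |z| grows.
for z in (-10.0, -5.0, 0.0, 5.0, 10.0):
    print(f"z = {z:6.1f}   sigmoid' = {sigmoid_grad(z):.8f}   tanh' = {tanh_grad(z):.8f}")
```

The sigmoid derivative never exceeds 0.25 and the tanh derivative never exceeds 1, and both fall off quickly on either side of z = 0, which is why large pre-activation values lead to very small gradient updates.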

Deep Learning with Pytorch – Sequence Modeling – Time Series Prediction – RNNs – 3.1

Posted on May 19, 2019 by Aritra Sen

In the previous post of this series we learnt the intuition behind RNNs and tried to understand how RNNs can be used for sequential data such as time series. In this post we will see a hands-on implementation of RNNs in PyTorch. In the case of neural networks, we feed a vector as predictors…

Deep Learning with Pytorch – Sequence Modeling – Getting Started – RNN – 3.0

Posted on May 18, 2019 (updated June 21, 2019) by Aritra Sen

In the CNN series we learnt about the limitations of MLPs and how CNNs address them. Here we get started with another type of neural network: the RNN (Recurrent Neural Network). Some of the tasks we can accomplish with RNNs are given below: 1. Time Series Prediction (Stock…

Deep Learning with Pytorch – CNN – Transfer Learning – 2.2

Posted on May 1, 2019 by Aritra Sen

Transfer learning is the process of applying knowledge gained from one task to another, newly assigned task. A simple example is carrying over the skills you learnt riding a bicycle when learning to ride a motorbike. Quoting the notes from the cs231 course: In practice, very…

Deep Learning with Pytorch – CNN from Scratch with Data Augmentation – 2.1

Posted on April 30, 2019 (updated May 1, 2019) by Aritra Sen

In the last post we went through all the building blocks of ConvNets. Now let’s put the concepts together and see how to implement them in PyTorch. Before going into the implementation, let’s see what data augmentation is. Augmentation: Using the operations in PyTorch’s torchvision.transforms, we can do data augmentation. With data augmentation we can flip/shift/crop…

Deep Learning with Pytorch – CNN – Getting Started – 2.0

Posted on April 29, 2019 (updated June 10, 2019) by Aritra Sen

In deep learning, we use Convolutional Neural Networks (ConvNets or CNNs) for image recognition or classification. Computer vision using ConvNets is one of the most exciting fields in current deep learning research. For a more detailed understanding of ConvNets, I encourage you to go through these amazing video tutorials by Andrew Ng. In this tutorial…