A. Project 1: Building a Movie Review Sentiment Classifier
The following are the key learnings:
- Building and training a LSTM Network (100 cell single layer) using PyTorch. This link explains the basics of making a model with PyTorch.
- AWS SageMaker.
- Deploying the ML Model.
- Building an AWS Lambda Function.
- Building an AWS API Gateway.
B. Project 2: Population Segmentation using SageMaker
The following are the key learnings:
- AWS SageMaker Ground Truth: Helps in going from raw data, to labelled data. It first sends the data to humans for initial labels. Then it trains an ML model based on these initial labels, to automate the labelling process.
- Kernel to use: co.nda_mxnet_p36 or conda_pytorch_p36 ; p36 denotes python version 3.6
- DeepAR is an inbuilt Supervised Learning model (a RNN) in Sagemaker for Time Series data.
- Calculated two different kind of features viz. containment and longest common subsequence (lcs) for the answer & source texts.
- Categorized text as plagiarized or not using a random forrest classifier.
C. Project 3 and 4: Dog Breed CNN (Convolutional Neural Network) Classifier
The following are the key learnings:
-
To identify the breed of a dog, built two different CNN algorithms viz.
- Built a CNN Algorithm from scratch to identify dog breed from 133 dog breeds.
- Used Transfer Learning to build the dog breed classifier. ResNet-18 model was used whose last fully connected layer was replaced by a 133 neuron fully connected layer. Only the last layer was trained.
-
Additionally:
- Used Haar feature based cascade classifier provided by OpenCV to detect human faces in an image.
- Used a pre-trained VGG-16 from torchvision.models to detect a dog in an image. VGG-16 has been trained on the ImageNet Dataset whose classes 151 to 268 correspond to dogs. Thus, an output within this range implies the image is of a dog.