NLP Assignment 2024
No two groups may do the same assignment. Assignments will be allotted on a
first-come, first-allotted basis. One member of each group should email me
the assignment that group wants to do, along with the email IDs of the other
members of that group. After getting my approval, please fill in the
following form:
https://docs.google.com/spreadsheets/d/145rzZE2yrKEOGQaUZsH-98JhWxF991m82xR6BpsVwxM/edit?usp=sharing
Objective:
Fine-tune a BERT-based model for Named Entity Recognition (NER)
using a publicly available dataset like CoNLL-2003.
Tasks:
1. Preprocessing: Load and preprocess the CoNLL-2003 dataset. This includes:
- Tokenizing the text using the BERT tokenizer.
- Structuring the data into a format compatible with the `transformers`
library for token classification (a label-alignment sketch follows the
deliverables).
2. Fine-Tuning:
- Fine-tune BERT on the NER task using Hugging Face’s `transformers` library.
- Implement appropriate hyperparameter tuning for model optimization.
3. Evaluation:
- Use precision, recall, and F1-score as evaluation metrics to
measure the model's performance.
- Compare the performance of your model with the benchmark
results from the CoNLL-2003 challenge.
4. Error Analysis:
- Perform detailed error analysis to understand the common mistakes
made by the model (e.g., confusion between similar entity types).
- Suggest potential improvements, such as using Conditional Random
Fields (CRF) or data augmentation.
Deliverables:
- A Jupyter notebook with the implementation of preprocessing,
model training, and evaluation.
- A written report detailing the model architecture, the
hyperparameters used, and an analysis of the model’s performance
along with the error analysis.
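A minimal sketch of the trickiest preprocessing step, aligning word-level NER tags with BERT’s sub-word tokens, assuming the Hugging Face `datasets` and `transformers` libraries; the function name and the choice to label only the first sub-token of each word are illustrative, not prescribed.

```python
from datasets import load_dataset
from transformers import AutoTokenizer

dataset = load_dataset("conll2003")
tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")

def tokenize_and_align_labels(examples):
    # The dataset is already split into words, so tell the tokenizer that.
    tokenized = tokenizer(examples["tokens"], truncation=True,
                          is_split_into_words=True)
    all_labels = []
    for i, labels in enumerate(examples["ner_tags"]):
        word_ids = tokenized.word_ids(batch_index=i)
        previous, aligned = None, []
        for word_id in word_ids:
            if word_id is None:           # special tokens ([CLS], [SEP])
                aligned.append(-100)      # -100 is ignored by the loss
            elif word_id != previous:     # first sub-token of a word
                aligned.append(labels[word_id])
            else:                         # remaining sub-tokens of the word
                aligned.append(-100)
            previous = word_id
        all_labels.append(aligned)
    tokenized["labels"] = all_labels
    return tokenized

tokenized_dataset = dataset.map(tokenize_and_align_labels, batched=True)
```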
---
Objective:
Fine-tune RoBERTa for sentiment analysis using the IMDB movie reviews
dataset for binary classification (positive/negative).
Tasks:
1. Preprocessing:
- Clean the IMDB dataset (e.g., removing special characters, handling
missing data).
- Tokenize the data using RoBERTa’s tokenizer.
2. Fine-Tuning:
- Fine-tune RoBERTa on the sentiment classification task.
- Experiment with different hyperparameters like learning rate, batch
size, and training epochs to optimize model performance.
3. Model Evaluation:
- Evaluate the fine-tuned model using metrics like accuracy,
precision, recall, and F1-score.
- Test the model on a custom set of movie reviews and analyze the model’s
performance.
4. Hyperparameter Experimentation:
- Conduct an experiment to study the effect of various
hyperparameters on model performance (learning rate, batch size,
etc.).
Deliverables:
- A notebook showing the fine-tuning process and hyperparameter
experiments (a minimal training skeleton is sketched below).
- A report analyzing the impact of hyperparameters on performance,
including any failure cases (reviews that were incorrectly classified).
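A minimal fine-tuning skeleton for the notebook, assuming the `datasets`/`transformers` stack and a recent 4.x `transformers` release; the hyperparameter values are starting points for the Task 4 experiments, not tuned results.

```python
from datasets import load_dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

dataset = load_dataset("imdb")
tokenizer = AutoTokenizer.from_pretrained("roberta-base")
dataset = dataset.map(lambda b: tokenizer(b["text"], truncation=True),
                      batched=True)

model = AutoModelForSequenceClassification.from_pretrained("roberta-base",
                                                           num_labels=2)
args = TrainingArguments(
    output_dir="roberta-imdb",
    learning_rate=2e-5,              # vary these three in Task 4
    per_device_train_batch_size=16,
    num_train_epochs=3,
    evaluation_strategy="epoch",
)
# Passing the tokenizer lets the Trainer pad each batch dynamically.
trainer = Trainer(model=model, args=args,
                  train_dataset=dataset["train"],
                  eval_dataset=dataset["test"],
                  tokenizer=tokenizer)
trainer.train()
```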
---
Objective:
Fine-tune BERT for document classification using the 20 Newsgroups
dataset.
Tasks:
1. Preprocessing:
- Clean and preprocess the text in the 20 Newsgroups dataset.
- Tokenize using BERT’s tokenizer, making sure to handle input length
properly by splitting long documents if necessary.
2. Fine-Tuning:
- Fine-tune BERT for multi-class classification on the dataset.
- Experiment with different pooling strategies (CLS token pooling,
mean pooling) to aggregate document representations; both strategies are
sketched after the deliverables.
3. Evaluation:
- Use accuracy, precision, recall, and F1-score to evaluate the
model’s performance.
- Perform cross-validation to get robust results and reduce the risk of
overfitting.
Deliverables:
- Code that shows the preprocessing, fine-tuning, and evaluation steps.
- A report comparing pooling strategies and analyzing their effect on the
model’s performance, supported by experimental results.
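A sketch of the two pooling strategies from Task 2 on a frozen encoder, to make the comparison concrete; masking out padding tokens in the mean is an implementation choice, not something the assignment fixes.

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased")

inputs = tokenizer(["First document.", "A longer second document."],
                   padding=True, return_tensors="pt")
with torch.no_grad():
    hidden = model(**inputs).last_hidden_state    # (batch, seq_len, 768)

# CLS pooling: the representation of the first ([CLS]) token.
cls_vec = hidden[:, 0]

# Mean pooling: average over real tokens only, using the attention mask.
mask = inputs["attention_mask"].unsqueeze(-1)      # (batch, seq_len, 1)
mean_vec = (hidden * mask).sum(dim=1) / mask.sum(dim=1)
```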
---
Objective:
Fine-tune mBERT for Named Entity Recognition (NER) using a
multilingual dataset like WikiAnn.
Tasks:
1. Preprocessing:
- Load and preprocess the WikiAnn dataset for multiple languages.
- Tokenize the text using the multilingual BERT tokenizer, and
structure the data accordingly.
2. Fine-Tuning:
- Fine-tune mBERT on NER for multiple languages (e.g., English,
German, French).
- Implement transfer learning: fine-tune the model on one language and
evaluate on another (sketched after the deliverables).
3. Evaluation:
- Compare performance across different languages using
standard NER metrics (precision, recall, F1-score).
- Perform error analysis on low-resource languages to understand
where the model struggles.
Deliverables:
- A Jupyter notebook/code that fine-tunes mBERT and evaluates
performance across multiple languages.
- A detailed report discussing the challenges of multilingual NER and the
impact of transfer learning on low-resource languages.
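A sketch of the transfer-learning experiment from Task 2, assuming the WikiAnn dataset as published on the Hugging Face Hub (config names are ISO language codes); tokenization, label alignment, and Trainer setup are the same as in the monolingual NER assignment.

```python
from datasets import load_dataset
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-multilingual-cased")
en = load_dataset("wikiann", "en")   # language used for fine-tuning
de = load_dataset("wikiann", "de")   # language used only for evaluation

# Tokenize and align labels for both languages with the same function as in
# the monolingual setup, then fine-tune a Trainer on en["train"] only.
# The zero-shot transfer result is a single evaluation call on data in a
# language the model never saw during fine-tuning, e.g.:
# metrics = trainer.evaluate(eval_dataset=tokenized_de["test"])
```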
---
Objective:
Fine-tune RoBERTa for zero-shot classification on a custom set of
documents, using Hugging Face’s `transformers` library.
Tasks:
1. Model Setup:
- Load a pre-trained RoBERTa model using the `transformers`
library for zero-shot classification.
2. Data Collection:
- Create or use a custom dataset containing various categories
(e.g., news, sports, technology).
- Use the zero-shot learning setup to classify the documents into
predefined categories (a pipeline sketch follows the deliverables).
3. Evaluation:
- Evaluate the model’s performance by comparing the assigned labels
with the ground truth.
- Perform error analysis to identify which categories are difficult for the
model to classify.
4. Improvements:
- Suggest improvements based on error analysis, such as better
prompt engineering or dataset augmentation.
Deliverables:
- Code demonstrating the setup, fine-tuning, and evaluation of
RoBERTa for zero-shot classification.
- A report analyzing the model’s performance, the challenges
encountered, and potential improvements.
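A minimal sketch of the zero-shot setup from Tasks 1-2 via the `zero-shot-classification` pipeline. Note that this pipeline is backed by an NLI-tuned checkpoint; roberta-large-mnli is one RoBERTa-based choice, and the category labels below are illustrative.

```python
from transformers import pipeline

classifier = pipeline("zero-shot-classification", model="roberta-large-mnli")
candidate_labels = ["news", "sports", "technology"]

result = classifier("The new GPU doubles training throughput.",
                    candidate_labels)
print(result["labels"][0], result["scores"][0])  # top label and its score
```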
---
Objective:
Fine-tune DistilBERT for token classification on a task like Part-of-Speech
(POS) tagging.
Tasks:
1. Preprocessing:
- Preprocess a POS tagging dataset, ensuring that the text is
tokenized properly using DistilBERT’s tokenizer.
2. Fine-Tuning:
- Fine-tune DistilBERT on the POS tagging task, experimenting with
different batch sizes, learning rates, and epochs.
3. Evaluation:
- Evaluate the model using token-level accuracy, F1-score, and other
relevant metrics.
- Compare the performance of DistilBERT to BERT and analyze the
trade-offs between model size and performance.
4. Efficiency Analysis:
- Analyze the performance of DistilBERT in terms of computational
efficiency (e.g., training time, memory usage) compared to BERT (a timing
sketch follows the deliverables).
Deliverables:
- A Jupyter notebook with the preprocessing, fine-tuning, and evaluation
of DistilBERT for token classification.
- A report discussing the performance trade-offs between DistilBERT
and BERT, and an analysis of computational efficiency.
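An illustrative harness for the Task 4 efficiency comparison; the checkpoints, the batch, and the label count (17, as in universal POS tag sets) are assumptions, and absolute numbers will vary with hardware.

```python
import time
import torch
from transformers import AutoModelForTokenClassification, AutoTokenizer

sentences = ["The quick brown fox jumps over the lazy dog."] * 32

for name in ["bert-base-cased", "distilbert-base-cased"]:
    tokenizer = AutoTokenizer.from_pretrained(name)
    model = AutoModelForTokenClassification.from_pretrained(name,
                                                            num_labels=17)
    batch = tokenizer(sentences, padding=True, return_tensors="pt")
    n_params = sum(p.numel() for p in model.parameters())
    start = time.perf_counter()
    with torch.no_grad():
        model(**batch)                 # one forward pass as a rough proxy
    elapsed = time.perf_counter() - start
    print(f"{name}: {n_params / 1e6:.0f}M params, {elapsed:.3f}s per batch")
```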
---
Objective:
Fine-tune a BART-based model (facebook/bart-base is recommended as it is
smaller and faster) for sequence classification on the CoLA dataset. Ref:
https://openreview.net/pdf?id=rJ4km2R5t7
Each example is a sequence of words annotated with whether it is a
grammatical English sentence.
Tasks:
1. Load and preprocess the CoLA dataset. This includes:
- Tokenizing the text using BARTTokenizer/AutoTokenizer.
2. Fine-Tuning:
- Fine-tune BART on the sequence classification task.
- Implement appropriate hyperparameter tuning for model optimization.
3. Evaluation:
- Use accuracy as the metric to measure the model's performance.
- Gold labels are not available for the CoLA test set, so use a small part
of the development data as the development set and the remaining data as
the test set (a split sketch follows this section).
- Compare the performance of your model with the benchmark result from
the BERT model.
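A sketch of the development/test split described in the Evaluation step, assuming the GLUE copy of CoLA on the Hugging Face Hub; the 50/50 ratio and the seed are arbitrary choices that make the split reproducible.

```python
from datasets import load_dataset

cola = load_dataset("glue", "cola")
# The official test split has no gold labels, so carve a labelled test set
# out of the validation data and keep the rest as the development set.
split = cola["validation"].train_test_split(test_size=0.5, seed=42)
dev_set, test_set = split["train"], split["test"]
print(len(cola["train"]), len(dev_set), len(test_set))
```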
---
Objective:
Fine-tune a BART-based model for bitext classification on RTE dataset.
Ref: https://openreview.net/pdf?id=rJ4km2R5t7
RTE stands for Recognizing Textual Entailment, i.e., deciding whether one
sentence entails (supports) another.
Tasks:
1. Load and preprocess the RTE dataset. This includes:
- Tokenizing the sentence pairs using BARTTokenizer/AutoTokenizer (a
pair-encoding sketch follows this section).
2. Fine-Tuning:
- Fine-tune BART on the bitext classification task.
- Implement appropriate hyperparameter tuning for model optimization.
3. Evaluation:
- Use accuracy as the metric to measure the model's performance.
- Compare the performance of your model with the benchmark result from
the BERT model.
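A sketch of the pair encoding for Task 1: the bitext nature of RTE is handled at the input level by letting the tokenizer pack both sentences into one sequence with separator tokens. The field names sentence1/sentence2 follow the GLUE release on the Hugging Face Hub.

```python
from datasets import load_dataset
from transformers import AutoTokenizer

rte = load_dataset("glue", "rte")
tokenizer = AutoTokenizer.from_pretrained("facebook/bart-base")

def tokenize_pairs(batch):
    # sentence1 is the premise, sentence2 the hypothesis; the pair is
    # truncated jointly so long premises do not crowd out the hypothesis.
    return tokenizer(batch["sentence1"], batch["sentence2"], truncation=True)

rte = rte.map(tokenize_pairs, batched=True)
```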
---
Objective:
Fine-tune a T5 model to classify whether two sentences are semantically
(meaningfully) equivalent. Use the MRPC dataset. Ref:
https://openreview.net/pdf?id=rJ4km2R5t7
Tasks:
1. Load and preprocess the MRPC dataset. This includes:
- Tokenizing the text using T5Tokenizer (a text-to-text formatting sketch
follows this section).
2. Fine-Tuning:
- Fine-tune T5 on the paraphrase classification task.
- Implement appropriate hyperparameter tuning for model optimization.
3. Evaluation:
- Use accuracy and F1 as the metrics to measure the model's performance.
- Compare the performance of your model with the benchmark result from
the BERT model.
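A sketch of casting MRPC into T5's text-to-text format. The prefix and the target strings follow the convention of the original T5 paper; any consistent scheme works, as long as training and decoding agree on the target vocabulary.

```python
from datasets import load_dataset
from transformers import T5Tokenizer

mrpc = load_dataset("glue", "mrpc")
tokenizer = T5Tokenizer.from_pretrained("t5-small")
target_text = {0: "not_equivalent", 1: "equivalent"}

def preprocess(batch):
    sources = [f"mrpc sentence1: {a} sentence2: {b}"
               for a, b in zip(batch["sentence1"], batch["sentence2"])]
    model_inputs = tokenizer(sources, truncation=True)
    # T5 predicts the label as text, so the targets are tokenized strings.
    targets = tokenizer([target_text[l] for l in batch["label"]])
    model_inputs["labels"] = targets["input_ids"]
    return model_inputs

mrpc = mrpc.map(preprocess, batched=True)
```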
---
Objective:
Fine-tune a T5 model on the WiC dataset. Ref: https://arxiv.org/pdf/1905.00537
Preprocess the dataset and perform appropriate hyperparameter tuning for
model optimization (one possible input serialization is sketched at the end
of this section).
Task description:
Input: a word w that appears in two sentences. The task is to classify
whether the word is used in the same sense in both sentences or not.
This is a binary classification problem.
Evaluation:
- Report accuracy.
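One possible serialization of a WiC example for T5, assuming the SuperGLUE field names (word, sentence1, sentence2, label) as released on the Hugging Face Hub; the template itself is an arbitrary but consistent choice.

```python
def wic_to_text(example):
    # Flatten the word and both sentences into a single source string and
    # map the binary label to a target word that T5 learns to generate.
    source = (f"wic word: {example['word']} "
              f"sentence1: {example['sentence1']} "
              f"sentence2: {example['sentence2']}")
    target = "true" if example["label"] == 1 else "false"
    return {"source": source, "target": target}
```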
---
Objective:
Fine-tune a T5 model on the BoolQ dataset. Ref:
https://arxiv.org/pdf/1905.00537
Preprocess the dataset and perform appropriate hyperparameter tuning for
model optimization (an input serialization sketch follows this section).
Task example:
Passage: Barq’s – Barq’s is an American soft drink. Its brand of root beer is
notable for having caffeine. Barq’s, created by Edward Barq and bottled
since the turn of the 20th century, is owned by the Barq family but bottled
by the Coca-Cola Company. It was known as Barq’s Famous Olde Tyme
Root Beer until 2012.
Question: is barq’s root beer a pepsi product
Answer: No
Evaluation:
- Report accuracy.
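The same text-to-text recipe applied to BoolQ, assuming the SuperGLUE release on the Hugging Face Hub (fields question, passage, and a 0/1 label); the template is again one choice among many.

```python
from datasets import load_dataset

boolq = load_dataset("super_glue", "boolq")

def boolq_to_text(example):
    # Question first, then the passage; T5 is trained to emit "yes" or "no".
    source = (f"boolq question: {example['question']} "
              f"passage: {example['passage']}")
    target = "yes" if example["label"] == 1 else "no"
    return {"source": source, "target": target}

boolq = boolq.map(boolq_to_text)
```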
---
Objective:
Fine-tune a T5 model on the WSC dataset. Ref: https://arxiv.org/pdf/1905.00537
Preprocess the dataset and perform appropriate hyperparameter tuning for
model optimization (a serialization sketch follows the evaluation step).
Task Description:
Coreference resolution is the task of determining which entity a pronoun
refers to.
Example:
Text: Mark told Pete many lies about himself, which Pete included in his
book. He should have been more truthful.
Coreference: False
Note: the output should be “False” because the pronoun “He” refers to Mark,
not Pete.
Evaluation:
- Report accuracy.
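A possible WSC serialization for T5, assuming the SuperGLUE fields (text, span1_text for the candidate entity, span2_text for the pronoun, and a 0/1 label); as with WiC, the template is a choice, not a fixed format.

```python
def wsc_to_text(example):
    # Expose the candidate entity and the pronoun alongside the full text;
    # the model generates "true" if the pronoun refers to the entity.
    source = (f"wsc text: {example['text']} "
              f"entity: {example['span1_text']} "
              f"pronoun: {example['span2_text']}")
    target = "true" if example["label"] == 1 else "false"
    return {"source": source, "target": target}
```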