Course Overview:
This repository contains my coursework (assignments and semester exams) for the Natural Language Processing course at IIIT Delhi in Winter 2025.
The course covers text processing, language modeling, word embeddings, sequence tagging (POS, HMM), attention, and transformers and their applications (BERT, GPT, T5, etc.). Tasks include sentiment analysis, fake news and hate speech detection, dialog systems, summarization, and syntactic parsing.
Folder Structure:
/Assignment 1: Implemented a WordPiece tokenizer, trained word2vec, and built a neural language model for next-word prediction.
/Assignment 2: Performed Aspect Term Extraction (BIO encoding) and Aspect-Based Sentiment Analysis using RNNs, pretrained embeddings, and fine-tuned transformer models; also fine-tuned SpanBERT and SpanBERT-CRF for question answering on the SQuAD v2 dataset.
/Assignment 3: Implemented a Transformer from scratch for language modeling, applied BART and T5 for claim normalization (CLAN dataset), and built a multimodal sarcasm explanation model.
/Quizzes: Contains materials related to the quizzes.
/Midsem: Contains materials related to the mid-semester examination.
/Endsem: Contains materials related to the end-semester examination.
/Project: Multimodal Classification for Mental Health Memes
Semester: 6th Semester (3rd Year)
Course Details:
Instructor's name - Md. Shad Akhtar ([email protected])
Course Code - CSE556
Other Contributors: My IIIT Delhi batchmates Manan Aggarwal and Souparno Ghose also contributed to the assignment submissions.
Important: Please follow the institution's guidelines and policies on the use of shared coursework materials. Use this repository responsibly and avoid any violation of academic integrity. The code is provided for reference only; you are encouraged to understand it and then write your own implementation independently.