Posts by Collection
cv
experience
Data Science Intern
- Wrote python scripts to scrape data using BeautifulSoup and Selenium to create database for offline use
- Cleaned and visualized client data using dashboards in Power BI to deliver actionable insights
- Built a people counter app using OpenCV to keep track of people inside the shop using video from CCTV
Natural Language Processing Intern
- Developed a Resume information extractor using NLTK and OpenCV and stored data in JSON format
- Built and deployed a RASA based chatbot for tasks like leave management and stress assesment
- Secured the chatbot with a custom HTTP input channel having JWT authentication to allow authorized access
Research Intern
- Built a document information extractor using open-source libraries in Python and Java
- Finetuned LayoutLM model for Document Understanding on FUNSD dataset
- Achieved Recall and Precision over 0.95 in extracting various fields from medical documents
Visiting Researcher
- Working under the guidance of Dr. Kokil Jaidka and Prof. Lyle Ungar (University of Pennsylvania)
- Researching Multimodal and Graph-based approaches for context aware email reply prediction
portfolio
Portfolio item number 1
Short description of portfolio item number 1
Portfolio item number 2
Short description of portfolio item number 2
projects
Cross Domain Recommendation System
Python, Pandas, NumPy, scikit-learn | GitHub Link
- Implemented a collaborative filtering model to recommend Books to users based on how they rated Movies and vice-versa with 0.84 RMSE
- Data was collected by surveying over 300 people and the model was implemented using pandas, numpy and scikit-learn
Fashion Intelligence System
Python, PyTorch, React | GitHub Link
- Developed a Fashion Intelligence System which scrapes and ranks products from e-commerce websites
- Products were ranked based on similarity between image embeddings of scraped and trending products
- Code was written in a modular format to facilitate scaling across new products and websites
YouTubeNLP
Python, Docker, Flask, Gensim, Transformers | GitHub Link
- Worked on a web application which provided insights about comments and transcript of YouTube videos
- Used different transformer based models like BERT and GPT for tasks like NER and Emotion Analysis
Attention based LSTM
Python, PyTorch, NumPy | GitHub Link
- Implemented the paper Attention-based LSTM for Aspect-level sentiment classification on SemEval 2014 dataset
- Reproduced the results from the paper and additionally visualized attention weights for different aspects
ML for Option Pricing
Python, Keras, Pandas, NumPy | GitHub Link
- Explore the effectiveness of different Neural Network architectures for the task of Option Pricing
- Compare the results of ML based techniques with traditional models like the Black-Scholes and the Heston model
Transfer Learning in NLP
Python, Transformers, PyTorch Lightning | GitHub Link
- Understand current concepts in NLP like Self-Attention, Pretraining and Language Models
- Perform literature review on State-of-the-Art models like T5, GPT and BERT
- FineTune t5-small on different tasks like Sentiment Analysis and Emotion Classification
publications
Paper Title Number 1
Published in Journal 1, 2009
This paper is about the number 1. The number 2 is left for future work.
Recommended citation: Your Name, You. (2009). "Paper Title Number 1." Journal 1. 1(1). http://academicpages.github.io/files/paper1.pdf
Paper Title Number 2
Published in Journal 1, 2010
This paper is about the number 2. The number 3 is left for future work.
Recommended citation: Your Name, You. (2010). "Paper Title Number 2." Journal 1. 1(2). http://academicpages.github.io/files/paper2.pdf
Paper Title Number 3
Published in Journal 1, 2015
This paper is about the number 3. The number 4 is left for future work.
Recommended citation: Your Name, You. (2015). "Paper Title Number 3." Journal 1. 1(3). http://academicpages.github.io/files/paper3.pdf
talks
Talk 1 on Relevant Topic in Your Field
Published:
This is a description of your talk, which is a markdown files that can be all markdown-ified like any other post. Yay markdown!
Conference Proceeding talk 3 on Relevant Topic in Your Field
Published:
This is a description of your conference proceedings talk, note the different field in type. You can put anything in this field.
teaching
Head Teaching Assistant
Undergraduate Course, Neural Networks and Fuzzy Logic, BITS Pilani, 2021
- Conducted workshops to familiarise students with NumPy, Pandas, PyTorch and TensorFlow
- Designed and evaluated Neural Networks and Machine Learning based Python assignments
- Supervised students as they implemented Research Papers for their final assesement