Hi!
I'm a Research Fellow at Microsoft Research India, advised by Dr. Sunayana Sitaram on Project VeLLM, where I work on benchmarking and improving Large Language Models in multilingual settings.
Currently, I am working on (a) multilingual parameter-efficient finetuning of LLMs, (b) evaluation and prevention of catastrophic forgetting when finetuning multilingual LLMs, and (c) language adaptation in Large Language Models. My primary focus is on expanding the capabilities of Large Language Models to new languages and domains through efficient and modular deep learning techniques during the fine-tuning phase.
Before this, I worked with Dr. Vivek Gupta from UPenn, Dr. Anoop Kunchukuttan from AI4Bharat, and Dr. Ashwini Vaidya from IIT Delhi on building benchmarking datasets and finetuning multilingual language models for Indic languages.
I also held positions at Amex AI Labs and Builder.ai as an AI Researcher and a Data Scientist. During my tenure, I focused on refining language models for credit and fraud risk, as well as enhancing customer service use cases. I graduated from Delhi Technological University (formerly DCE) in 2021.
Please feel free to reach out to me by email if you have any questions about my research. I am also happy to mentor students looking to start their research journey in NLP.
Publications
MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models
, Ashutosh Sathe, Ishaan Watts, Sunayana Sitaram
arXiv Preprint 2024
Preprint
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks
Sanchit Ahuja, , Varun Gumma, Ishaan Watts, Ashutosh Sathe, Millicent Ochieng, Rishav Hada, Prachi Jain, Maxamed Axmed, Kalika Bali, Sunayana Sitaram
arXiv Preprint 2024
Preprint
Evaluating Inter-Bilingual Semantic Parsing for Indian Languages
, Vivek Gupta, Anoop Kunchukuttan
Proceedings of the 5th Workshop on NLP for Conversational AI (NLP4ConvAI 2023)
PDF | Website
XInfoTabS: Evaluating Multilingual Tabular Natural Language Inference
Bhavnick Minhas, Anant Shankhdhar, Vivek Gupta, , Shuo Zhang
Proceedings of the Fifth Fact Extraction and VERification Workshop (FEVER)
PDF | Website
IndicXNLI: Evaluating Multilingual Inference for Indian Languages
, Vivek Gupta, Anoop Kunchukuttan
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
PDF | Website
A Review of Deep Learning Techniques for Protein Function Prediction
, Yasha Hasija
IEEE 2nd International Conference for Emerging Technology (INCET) 2021
PDF
Fine-tuning Distributional Semantic Models for Closely-Related Languages
Kushagra Bhatia, , Ashwini Vaidya
Proceedings of the Eighth Workshop on NLP for Similar Languages, Varieties and Dialects
PDF
Experience
-- Project VeLLM.
-- Working on finetuning multilingual LLMs, continual learning for multilingual LLMs, and language adaptation.
-- ScamBERT: finetuned a BERT model to classify scam call logs versus fraud call log complaints.
-- LLM Finetuning: finetuned LLaMA-2 to recommend the best response to customer support staff in a web chat.
-- Natasha: worked on text-based recommendation and conversation orchestration for Natasha Cockpit.
-- Intent Classification: built an in-house intent classification model for identifying the intent behind customer utterances in a video call.
Updates
Jan 2024 | Preprint for MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models is now available!
Nov 2023 | Preprint for MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks is now available!
Sep 2023 | I will be joining Microsoft Research India as a Research Fellow with Sunayana Sitaram!
May 2023 | Our work Evaluating Inter-Bilingual Semantic Parsing for Indian Languages has been accepted at the NLP4ConvAI Workshop, co-located with ACL 2023!
Oct 2022 | Our work IndicXNLI has been accepted to the EMNLP 2022 main conference!