Research Intern || May, 2021 - January, 2022 Advisor: <a href="https://mayank4490.github.io/" target=”_blank”>Dr. Mayank Singh</a>
Demystifying automated peer-review generators and evaluating their robustness to adversarial perturbations.
Formulated desiderata for an ideal review generator system and provided a public leaderboard along with a framework for unified & comprehensive measurement of their performance.
Implemented statistical methods: re-sampling, cost-sensitive learning, and SMOTE for data with class-imbalance.
Performed data mining to curated ∼1M tweets in low resource Hindi language & conducted emoji prediction using bi-LSTM, mBERT, XLM-R, etc. (Published: EMNLP 2022)
Research Student || January, 2022 - Present Advisor: <a href="https://unicode-research.netlify.app/people/" target=”_blank”>Swapneel Mehta, Dr. Akash Srivastava</a>
Served as TA for Google Research funded 9-week Machine Learning Course UMLSC 2021 with 100+ students.
Built data pipeline for mining Twitter data and managed workflows using Airflow, AWS EC2 & S3, and Docker.
Worked with the SimPPL team to build better civic integrity tools that support newsrooms to better understand their audiences on social networks. (supported by Wikimedia Foundation, Google, AWS, NYC Media Lab)