Research Collaboration
Independant Researcher, , 2022
- Implemented statistical methods: re-sampling, cost-sensitive learning, and SMOTE for data with class-imbalance.
- Performed data mining to curated ∼1M tweets in low resource Hindi language & conducted emoji prediction using bi-LSTM, mBERT, XLM-R, etc. (Published: EMNLP 2022)
- Standardized 9 hate-speech datasets & implemented LSTM, BERT, RoBERTa, etc. (Published: EACL 2023)
- Developed distributed FL architecture to obtain 14.52% improvement in F1-score while preserving privacy.