logo
Contact
Rishi
I'm Rishi!

I do Code & Chill 🍿

Passionate about the magic of machine learning, crafting predictive models, optimizing data processes, and exploring deep learning boundaries. Facinated by technology to make life easier and more efficient.

ABOUT ME

EXPLORE NOW

I enjoy crafting predictive models, crunching statistical numbers, and playing around with patterns and techniques. Jumping into the exciting world of data and algorithms. Isn't it amazing to watch data and algorithms come together to create something exciting?

My ride in this rollercoaster involves tweaking data engineering processes, ETL pipelines, and taking on the twists and turns of deep learning. With a vow to keep on learning, I'm on a quest through the ever-changing landscapes of machine learning. Turning ideas into real-deal answers and pushing the limits of what we can pull off! 🚀✨

EXPERIENCE

EXPLORE NOW

Perfios / Data Scientist

NOV 2024 – PRESENT, NEW DELHI

➥ Engineered a robust sequence tagging model (MTEC-CRF) using a multi-branch deep learning architecture combining Char-CNNs, phonetic encodings (Metaphone), keyword flagging, and BiLSTM-CRF, improving F1-score by +0.28 over the BiLSTM baseline.

➥ Designed and applied over 15 data augmentation techniques including structural reordering, synonym/phonetic replacements, pincode anomalies, and separator removal, addressing linguistic and regional variability across 2M+ Indian addresses.

➥ Achieved improvement on Address Match model for KYC in financial domain by 12% through targeted feature engineering and hyperparameter tuning using Bayesian optimization.

Deep LearningCRFNLPBayesian Optimization

Indian Institute of Technology, Delhi.

Research Assistant

OCT 2024 – PRESENT, NEW DELHI

➥ Pretrained and fine-tuned transformer-based LLMs , including IndicBERT, NLLB and multilingual BERT, for low-resource Indian tribal languages as part of a Govern- ment of India Ministry of Tribal Affairs initiative.

➥ Built custom tokenizers, created and preprocessed domain-specific corpora, and implemented translation pipelines with a focus on cross-lingual generalization and data augmentation.

➥ Conducted research on advanced Information Retrieval methods including Boolean retrieval, TF-IDF, and Okapi BM25 for optimized search algorithms.

TransformersIndicBERTMultilingual NLPInformation Retrieval

3BiGS / Machine Learning Engineer

SEP 2022 – NOV 2024, HYDERABAD

➥ Designed a feature extraction framework using NLP techniques to identify drug-disease relationships for frequency, lexical, morphological, syntactic, and semantic aspects. Utilized BERT and BiLSTM to enhance embeddings and sequence tagging accuracy, increasing relationship identification precision by 0.24.

➥ Developed a Language Model for answering research questions, covering end-to-end processes from Data Curation to Evaluation, achieving 79% accuracy in the Healthcare domain.

➥ Enhanced Semantic Search with RAG and cross-encoders, aachieving a 92% hit rate for relevant research papers. Improved precision and relevance with advanced ReRankers.

➥ Built MD-CNN and SD-CNN models to predict antibiotic resistance in Mycobacterium tuberculosis, using whole-genome sequencing data from 18 loci in 161 isolates. Applied convolutional layers and residual connections to boost prediction accuracy.

NLPRAGPyTorchCNNRecommendation Systems

PROJECTS

EXPLORE NOW

BLOGS

EXPLORE NOW

GithubMediumLinkedInSpotifyInstagram