M.Tech Data Science |
M.Tech Data Science student at MMCOE, Pune with hands-on experience in building ML pipelines, LLM-based systems, and deep learning models. I write clean Python, ask sharp questions about data, and focus on building AI systems that are practical, reliable, and effective in real-world applications.
About
I'm a M.Tech Data Science student at MMCOE, Pune (CGPA 8.61/10), with a unique blend of a B.Tech Mechanical Engineering foundation and advanced expertise in AI/ML engineering. This cross-domain background gives me an analytical edge in approaching complex problems.
My research sits at the intersection of AI Security and Medical AI. I investigate adversarial vulnerabilities in Large Language Models and build multi-layer defense frameworks, while simultaneously exploring explainable deep learning for cervical cancer detection.
I've had hands-on industry experience at DATASMITH AI Solutions, where I built end-to-end ML pipelines for real-world datasets. I believe in writing clean, well-documented code and in making AI not just powerful — but interpretable and trustworthy.
When not training models, I'm reading research papers, experimenting with RAG systems, or diving into federated learning for privacy-preserving ML.
Technical Skills
Experience
Projects
A two-phase research project on securing instruction-tuned LLMs. Phase 1 (ICAIS 2026): Evaluated FLAN-T5-base against 120 adversarial prompts across 4 attack types — instruction override, role-play manipulation, contextual injection, and logical confusion. Introduced a lightweight lexical sanitization mechanism reducing adversarial attack success from 23.33% → 15.83% (a relative drop of 32.14%) without model retraining. Phase 2 (ARML): Proposed a novel 4-layer adaptive framework with risk formula R = αK + βS + γC, achieving 93.3% defense accuracy and F1 = 0.94 with only 7.7% false positive rate across FLAN-T5 and Phi-3 Mini.
Building a CNN-based model for cervical cell classification (SIPaKMeD dataset) integrated with clinical risk features (demographics, HPV status). Applying Grad-CAM and LIME for Explainable AI to generate clinically interpretable saliency maps. Synthesised 20+ IEEE papers on multimodal DL and XAI for cervical cancer — accepted and presented at ETFI 2026, DESPU.
Built a domain-specific conversational AI using Retrieval-Augmented Generation (RAG). The system embeds and indexes philosophical texts, then retrieves contextually relevant passages before generating responses — dramatically reducing hallucination compared to vanilla LLM inference. Implemented chunking strategies, vector similarity search, and curated knowledge grounding to preserve authentic philosophical meaning across Gita's 18 chapters and Chanakya's Arthashastra excerpts.
Designed a distributed ML workflow for agricultural disease prediction — trained models locally on individual farm nodes without centralising raw data. Studied and compared convergence behaviour (FedAvg vs FedProx), communication overhead, and non-IID data challenges. Implemented differential privacy mechanisms and analysed the privacy-accuracy trade-off, achieving competitive accuracy while ensuring farm data never leaves local nodes.
Cleaned and transformed production datasets from a packaging company using Pandas — resolved inconsistent formats, imputed missing values, and standardised KPI definitions. Built interactive Power BI dashboards tracking production throughput, defect rates (%), yield loss, and machine downtime. Enabled data-driven decisions reducing reporting time from days to real-time visibility.
Research
Conducted a controlled empirical assessment of adversarial robustness in instruction-tuned LLMs using 120 balanced prompts across 4 attack categories. Introduced a lightweight lexical sanitization defense reducing attack success rate from 23.33% to 15.83% — a relative reduction of 32.14% — without requiring model retraining on FLAN-T5-base.
Proposed ARML-Defense — a 4-layer adaptive security framework using composite risk scoring R = αK + βS + γC with a novel Allow/Modify/Restrict policy. Achieved 93.3% defense accuracy and F1 = 0.94 on FLAN-T5 and Phi-3 Mini across 100 adversarial prompts from JailbreakBench and AdvBench. Outperforms Hard Restriction by 26.6 percentage points while maintaining 92.3% usability preservation.
Systematic review and analysis of 20+ IEEE research papers on CNN-based architectures, hybrid ML-DL models, Vision Transformers, and XAI methods (Grad-CAM, SHAP, LIME) for cervical cancer detection. Identified research gaps in multimodal integration and post-hoc explainability. Presented at ETFI 2026, Department of Electronics, Telecommunication and Future Innovations conference.
Publication Indexing Note
ICAIS 2026 (Springer LNCS): Springer Lecture Notes in Computer Science (LNCS) series is indexed in Web of Science + Scopus (Elsevier) and DBLP. | ETFI 2026 (IEEE): IEEE conference proceedings are published on IEEE Xplore, indexed in both Scopus and Web of Science (Clarivate / SCI/ESCI), subject to conference-specific inclusion. | ARML (Jamia Millia Islamia): Decision pending.
Education
Focus: Machine Learning, Deep Learning, LLM Security, Federated Learning, Research Methodology. Active researcher with 2 accepted international conference papers during this program.
Strong foundation in analytical thinking, systems design, and engineering mathematics. Transitioned to Data Science leveraging engineering problem-solving skills.
Credentials
2 International Conference Papers Accepted at ICAIS 2026 (Springer) and ETFI 2026 (IEEE) during M.Tech studies
98 / 100 in Cyber Security coursework — demonstrating strong foundation in security fundamentals aligned with LLM research
32.14% Adversarial Attack Reduction achieved via novel lightweight defense — published, peer-reviewed finding
Get In Touch
I'm currently open to AI/ML Engineering, Data Science, and Python Development roles. I'm also happy to discuss research collaborations, paper reviews, or just an interesting conversation about LLM security and explainable AI.
Reach out via email or LinkedIn — I typically respond within 24 hours.