About Me
I am an experienced Data Scientist with a passion for Deep Learning. I like to solve real-world problems with AI and have a proven research track record in Computer Vision and Natural Language Processing. Besides data science, I also like to watch anime and travel.
This website collects the projects I've passionately developed in my leisure hours.

Data Science & AI Research
I am a data scientist and AI researcher with 4 years of professional work experience and 3 years of academic research experience.
My journey in machine learning started with designing computer vision models that recognize Indian Sign Language gestures on a live webcam video feed, implemented using a Haar Cascade Classifier. Deploying this project at a school teaching Indian Sign Language in Mumbai felt rewarding and has since motivated me to tackle real-world problems using the power of AI. During my Master's research, I took on the challenge of designing an NLP model that would help people overcome social anxiety by suggesting sentences to talk about.
I love staying up to date in the ever-changing field of AI by reading research papers about new algorithms and techniques. I try to implement the papers by applying their model architectures to creative datasets to build something unique.
Currently, I apply my knowledge to problems in the healthcare industry, tackling complex tasks involving health insurance claims and audits, and widening the reach of medicines by promoting health insurance sales.
Projects
A collection of deep learning projects that I have created.
- NLP
- Vision
- Generative AI
- Classical ML
- Visualizations
Work Experience
Professional Experience
Lead Data Scientist
April 2023 - Present
Cardinal Health, Chicago, IL
- Led the development of agentic-AI-powered semi-autonomous customer service agents, boosting productivity by 60% and increasing customer satisfaction by 33% through personalized interactions
- Defined and delivered the enterprise AI roadmap for multimodal email classification using LLMs and RAG with vector databases, reducing operational costs by 40% and increasing customer service productivity by 20% across business units
- Established enterprise-wide prompt engineering best practices and ethical AI governance for HIPAA-compliant applications; standards adopted across multiple divisions
- Designed and optimized prompts for Multimodal Generative AI models, significantly enhancing auto-generated response quality and relevance, resulting in a 40% uplift in customer service agent productivity
- Architected and deployed a 70B+ parameter LLM-based email generation platform, reducing average case handling time by 40% for 200+ agents, fully integrated into CRM workflows
- Served as the Subject Matter Expert (SME) for Named Entity Recognition (NER) and extracted information using Document AI and Vision OCR. Designed NER models that achieved accuracy of over 94% while complying with HIPAA regulations
- Implemented Retrieval Augmented Generation (RAG) to efficiently generate customer service email replies, significantly improving agent productivity. Created dedicated knowledge bases using PGVector for generating emails for 50+ categories
- Conducted workshops and training sessions to foster a data-driven culture within the organization. Mentored junior data scientists and contributed to the team's skill development and a collaborative work environment
Data Scientist
August 2021 - April 2023
Healthcare.com, Chicago, IL
- Designed Recommender Systems using Q-Learning Reinforcement Learning to optimize the call buyer selection process and improve match rate by 11%, uplifting revenue by $1.5 million per year
- Increased sales conversion rates by 19% by deploying Multi-Criteria Decision Making (MCDM) Ranking Systems for call centers through performance-based call routing
- Researched & implemented Natural Language Understanding (NLU) Speech Recognition and Transcription using TensorFlow ASR and AWS Transcribe to understand speech characteristics for sales and customer buying intent
- Increased sales conversion rates by 8% using generative NLP Models, BERT, Transformers to produce sales scripts dynamically
- Examined Clustering (Machine Learning) Techniques in Python to identify characteristics of top-performing agents
- Designed recruitment models that support hiring top-performing sales agents, adding $1.2M per year in revenue
- Created Ensemble Learning Models using Gradient Boosting Machines (GBM) and Synthetic Data to verify customer lifetime value (LTV) and accurately determine customer duration for health insurance policies
Research Assistant
August 2020 - May 2021
Illinois Institute of Technology, Chicago, IL
- Researched & designed Generative AI virtual chat assistants that support humans with social anxiety to communicate efficiently
- Designed ensemble of open domain chatbots using GPT-2, Transformers and Encoders for context understanding and next sentence generation that helped to achieve fluent conversations
- Deployed Large Language Models into production on AWS EC2 using Flask, NodeJS, Express and VueJS for real-time predictions
- Improved perplexity to 3.61 on test data gathered during a human study, achieving better conversations
Data Scientist
June 2018 - August 2019
S & S InfoTech Services Pvt. Ltd., Mumbai, India
- Increased eCommerce sales by 12% by designing an inventory management platform for demand forecasting using Prophet and ARIMA time-series models
- Applied K-means clustering to group inventory based on sales velocity and stock movement, enabling accurate reorder point planning and stock prioritization, which led to a 43% reduction in overstocked items
- Designed anomaly detection pipelines to monitor email spoofing attempts, reducing detection time from 2.2 hours to under 3 minutes and preventing an estimated INR 7.3M in potential fraud losses annually
- Improved company outreach by 27% by developing a context-aware chatbot using Google Cloud Platform and DialogFlow API
Machine Learning Research Assistant
June 2017 - June 2018
Indian Institute of Technology, Mumbai, India
- Researched rapid object detection techniques for recognizing real-time video sign language gestures with Python and OpenCV
- Deployed gesture recognition models in a web application for practicing sign language gestures with PHP, Python and JavaScript
- Identified key parameters that enhance the probability of students obtaining jobs at career fairs by analyzing student profiles
- Obtained principal components by applying dimensionality reduction techniques such as PCA and t-SNE using scikit-learn, NumPy and Pandas
- Developed supervised and unsupervised ensemble learning strategies to predict pay range based on the candidate’s profile
Leadership Experience
Lead Data Scientist
April 2023 - Present
Cardinal Health, Chicago, IL
- Founded and led the company's AI CodeJam Hackathon program for 2 years, resulting in 5 production-ready prototypes, including an LLM-powered case summarizer adopted business-wide
- Established Prompt Engineering Guidelines and Ethical AI Governance Framework for HIPAA-compliant GenAI use; adopted across multiple business units
- Fostered cross-functional collaboration between engineering, operations, and analytics teams to accelerate AI adoption as a part of the AI Center of Excellence
- Upskilled 10+ data scientists via workshops on RAG, multimodal AI, and LLM fine-tuning
- Defined enterprise AI vision and aligned GenAI roadmap with cross-department OKRs, ensuring measurable ROI on AI initiatives
Data Scientist
August 2021 - April 2023
Healthcare.com, Chicago, IL
- Created the organization's first AI adoption framework, enabling ML-driven lead routing, customer retention modeling, and sales optimization
- Advocated for and implemented a centralized data quality and governance process, improving ML model reliability and adoption rates
- Mentored 6+ junior analysts and engineers, transitioning them into applied data science roles
- Introduced early NLP and text classification models for sales script generation, laying the foundation for later GenAI adoption
Educational Qualification
Master of Science, Computer Science
Illinois Institute of Technology, Chicago, IL
Thesis, Machine Learning, Natural Language Processing, Deep Learning, Combinatorial Optimization, Game Theory, Cloud Computing
Bachelor of Engineering, Computer Engineering
University of Mumbai, Mumbai, India
Artificial Intelligence, Human Machine Interaction, Distributed Databases, Analysis of Algorithms, Software Systems Architecture
Publications
Towards Assisting Human-Human Conversations
Diss. Illinois Institute of Technology, 2021 Thesis
DMARCBox - Corporate Email Security and Analytics using DMARC
5th International Conference for Convergence in Technology (I2CT). IEEE, 2019
FingerSpelling - Indian Sign Language Training Tool
18th International Conference on Advanced Learning Technologies (ICALT). IEEE, 2018
Exploratory Data Analysis using Dimension Reduction
IOSR J Eng (IOSRJEN) Best Paper Award
Improving Predictions Using Qualitative Parameters
JournalNX 3.08: 77-82