Swapnil Tiwari

Analytics Professional | Data Scientist | AI/ML Specialist
Gurugram, IN.

About

Highly accomplished Analytics Professional with over 4 years of experience specializing in Risk Management and data-driven insights. Proven expertise in Python, SQL, and advanced ML algorithms, excelling in data cleaning, exploratory data analysis, feature engineering, and cutting-edge NLP techniques. Adept at unearthing actionable insights to drive strategic decision-making and organizational growth. Seeking a dynamic role to apply a strong foundation in Applied Statistics and AI/ML, delivering significant business impact.

Work

Freelance Analytics Professional
|

Data Scientist & AI/ML Consultant

Summary

Provided expert data science and AI/ML consulting services to various organizations, focusing on leveraging advanced analytics for actionable business insights.

Highlights

Developed and deployed an AI-powered Insurance Policy Insights & Comparison System on Hugging Face using Gradio, implementing RAG with Mistral 7B and FAISS for efficient, semantic comparison and extraction of information from diverse policy documents.

Delivered actionable business insights for a global ride-sharing company by leveraging BigQuery, BigQueryML, and Looker, optimizing operational strategies.

Completed intensive, industry-aligned programs to advance expertise in Machine Learning, Natural Language Processing, and foundational Large Language Models (LLMs).

Currently developing an NLP system for scene-wise dialogue extraction and character development analysis, utilizing sentiment and emotion shifts from theatrical works (e.g., Tamasha).

PayU
|

Data Scientist - Digital Marketing Analytics

Summary

Spearheaded advanced analytics initiatives to optimize digital marketing performance and drive revenue growth for diverse clients.

Highlights

Spearheaded the design and implementation of a multiclass CatBoost model, integrating advanced embeddings to accurately classify clothing purchase categories, enhancing targeting precision for over 500M+ customers.

Achieved 91% AUC by building a robust model with iterative feature selection, effectively managing concept drift, and integrating an attention layer with embeddings, leading to precise targeting for 500M+ customers across class-wise ventiles.

Generated INR 300K in incremental monthly revenue for 8+ clients by deploying highly accurate model predictions, directly impacting business growth.

Pioneered data-driven digital marketing solutions tailored for FMCG, Real Estate, and Telecom sectors.

Collaborated cross-functionally with clients to develop and execute ML-based audience strategies, optimizing campaigns through A/B testing, which resulted in a significant INR 3M+ increase in monthly net revenue and improved Return on Ad Spend (ROAS).

American Express
|

Data Scientist - Acquisition Risk Analytics

Summary

Led critical data science initiatives focused on mitigating offer abuse and enhancing profitability within acquisition processes.

Highlights

Led a critical enterprise-wide initiative to combat offer abuse, significantly impacting 10% of 2023 acquisitions and enhancing overall profitability.

Engineered a novel, personalized offer data-driven definition to detect 1.3X more intentional offer exploiters, replacing a 4-year-old legacy system and accelerating identification.

Revitalized predictive gamer models by redefining offer exploitation flags and integrating clickstream signals, achieving an 8% Year-over-Year improvement in AUC.

Drove cross-functional collaboration with marketing, finance, and product teams, managing rapid iterations to implement ROI-driven model cutoffs across 100+ cohorts, which enhanced policy controls and generated $48.2M in savings in 2023.

Developed and deployed a comprehensive MIS dashboard for daily application and gaming impact tracking, resulting in 20+ hours of monthly time savings.

Successfully implemented digital signal analysis to detect offer abuse, leading to $4.7M in direct savings.

Designed and validated 13 behavioral indicators from diverse digital sources (pages, URLs, IPs, devices) to identify gaming tendencies through rigorous hypothesis testing.

Leveraged TF-IDF and word embeddings to identify high-risk pages, then developed a robust Random Forest classifier for detection.

Processed JSON data to generate over 600 variables, employing XGBoost to pinpoint the top 30 critical gaming features.

Constructed a neural network leveraging 43 digital signals to accurately predict digital gaming probability for new acquisitions.

Accenture AI
|

Analytics Engineer

Summary

Delivered end-to-end cloud-based analytics solutions to optimize supply chain and operational efficiency for a leading CPG company.

Highlights

Designed and delivered an end-to-end cloud-based analytics solution for a leading US/UK CPG company, leveraging Google Cloud Platform (GCP) and Looker to enhance operational efficiency.

Developed an ML model on GCP to accurately predict order lead times, providing quantifiable financial insights to inform strategic decision-making.

Automated ML predictions using Airflow DAGs to provide daily estimates of delivery probability, inventory levels, and stockout losses with 50% and 95% confidence intervals, improving supply chain foresight.

Identified and mitigated $5-6 million in 120-day risk-adjusted revenue losses attributed to stockouts.

Created dynamic Looker dashboards to visualize key performance indicators (KPIs), stockout alerts, and financial losses, significantly enhancing business visibility and accelerating data-driven decision-making.

Vodafone Idea
|

Data Scientist Intern

Summary

Contributed to the development and deployment of data science solutions for a large telecommunications subscriber base.

Highlights

Strategized and executed Data Science solutions impacting 270 million subscribers across diverse business use cases, driving significant value.

Developed and deployed Data Science solutions for critical business areas including prepaid-to-postpaid upselling, churn prediction, and sentiment analysis.

Enhanced model performance by engineering features, optimizing imputation, and fine-tuning, resulting in a 1.4X improvement in churn prediction lift and 1.5X in upselling lift.

Utilized Explainable AI (XAI) to pinpoint key attrition drivers for high-risk customers, enabling the development of targeted and effective retention strategies.

Education

National Institute of Technology, Kurukshetra, India

Bachelor of Technology

Electrical Engineering

Grade: 9.13/10

Atomic Energy Central School, Narora, India

Class XII

Science

Grade: 88.8%

Atomic Energy Central School, Narora, India

Class X

General Studies

Grade: 10/10

Awards

Dphi Data Sprint 22 Data Science Hackathon 9th Rank

Awarded By

Dphi

Achieved 9th rank in the Dphi Data Sprint 22 Data Science Hackathon.

Python and SQL 5-star Badges

Awarded By

HackerRank

Earned 5-star badges in Python and SQL on HackerRank, demonstrating expert proficiency.

Statistics & Probability Expert

Awarded By

Chegg

Delivered over 100 high-quality solutions as a Statistics & Probability Expert on Chegg with a 95%+ Content Feedback Score.

Institute Table Tennis Team Leader

Awarded By

National Institute of Technology, Kurukshetra

Led the Institute Table Tennis Team for two consecutive years, securing multiple awards.

JEE Mains- 2017 Top Rank

Awarded By

JEE Mains

Accomplished rank among the top 1.2% of all students (1.4 million) who appeared for JEE Mains- 2017.

Languages

English

Fluent

Hindi

Native

Certificates

Leading with Analytics and Artificial Intelligence

Issued By

Northwestern University

Econometrics and Causal Inference for Business in R and Python

Issued By

Udemy

Tableau 10 A-Z

Issued By

Udemy

Deeplearning.ai Specialization

Issued By

Coursera

Data Science-Probability and Inference using R

Issued By

DataCamp

Marketing Analytics in Python, Data Engineering and Big Data with Py Spark

Issued By

DataCamp

Skills

Programming Languages

Python, SQL.

Machine Learning

Sklearn, Tensorflow, Keras, CatBoost, Random Forest, XGBoost, Descriptive Analytics, Predictive Analytics, Machine Learning, Statistics, A/B Testing, Causal Analytics, Anomaly Detection, Clustering, Model Monitoring, Recommendation Systems, Neural Networks, Hierarchical Attention Models, Ensemble Learning.

Natural Language Processing (NLP)

Text Representation, Word Embeddings, Topic Modelling, Attention Models, Transformer, BERT, Spacy, RobBERT, Hugging Face, RAG, Mistral 7B, FAISS, LLMs.

Cloud Platforms & Tools

Google Cloud Platform (GCP), Looker, PySpark, Airflow, BigQuery, BigQueryML, FastAPI, Streamlit, PyWebIO, Gradio.

Data Management

Data Cleaning, Exploratory Data Analysis (EDA), Feature Engineering, Data Pipelines, JSON Parsing.

Statistical Analysis

Applied Statistics, Hypothesis Testing, Statistical Tests, Confidence Intervals.

Business Intelligence & Visualization

Looker Dashboards, MIS Dashboards, KPI Tracking, Business Visibility.

Project Management & Methodologies

Iterative Development, Agile, Cross-functional Collaboration.

Interests

Sports

Table Tennis.

Arts

Acting, Poetry.

Projects

Retail Product Performance Dashboards

Summary

Developed comprehensive dashboards and analytical models for retail product performance and causal inference.

Recommendation Engine

Summary

Designed and implemented a robust recommendation engine using various filtering techniques.

Classifying Dutch News Articles by NOS

Summary

Developed an NLP solution for classifying and summarizing Dutch news articles.

User Segmentation and Churn Prediction

Summary

Implemented advanced analytics for user segmentation and churn prediction using ride-share data.