I build resilient, scalable enterprise data platforms
Data Engineer specializing in AI and MLOps—building scalable data platforms, extensible warehouses, and end‑to‑end ML pipelines that ship value.
About Me
I am a seasoned engineer with over 8 years of experience in the Design, and Deployment of Data Engineering, Analytical Solutions and Machine Learning Models across the U.S, Middle East, and India.
Here are a few technologies I've been working with:
Developed and Deployed real-time Clustering machine learning models and algorithms for E-commerce customers on GCP using Python, Vertex AI, GCS, Flask, Airflow, Docker, MLflow and TensorFlow
StreamlitPythonAPI
YouTube Dashboard
Real-Time analytics dashboard generated on any input YouTube video with a Demo. Pictures sentiment analysis and 5 KPIs that can be used to drive up ad-revenue.
TableauSQLData Visualization
Age of Plastic
Data-driven dashboards showing the impact of global plastic pollution on the environment; Land, Ocean and the mitigation steps taken by different countries using Tableau.
Machine LearningPython
Appliance Energy Prediction
Predicted the energy consumed by appliances using custom-coded Machine Learning models and Algorithms like PCA, Neural Networks, Lasso, Ridge, and Linear Regression from scratch in Python with 80% confidence.
Machine LearningPythonArcGISAPI
Clustering Paris and London
Visualizing Geo spatial analysis to cluster similar neighborhoods using ArcGIS and Folium to reveal new insights using Machine Learning
Machine LearningTableauData Visualization
FBI Crime Reports
Forecasted and visualized FBI reported uniform major crimes with focus on Aggravated Assault, Homicides, Intimidation and Motor theft in every US state visualized on a Tableau dashboard and Machine Learning with 90% Accuracy
APIKafkaElasticsearch
Kafka Tweet Stream
Streamed & ingested real-time Tweets of current affairs between high-performance tuned Kafka 2.0.0 producer & consumer into Elasticsearch using safe, idempotent, and compression configurations
Data EngineeringSQLScala
Movie Analytics
Analyzed a million movies using Spark Scala to draw useful insights on viewer engagement
Data EngineeringData AnalysisPython
Diabetic Readmission Exploration
Exploring and drawing meaningful insights for patients readmitted with Diabetes with a Report
Data VisualizationData AnalysisTableau
Analysis And Visualizations Of Nursing Home Data
Computed and visualized a data driven story of the Center for Medicare & Medicaid Services (CMS) nursing facility data to generate visuals that highlight the nursing home’s resource limits using Flourish, Data Wrapper and Tableau hosted on Google sites.
Data VisualizationData Analysis
Investigating GDP Expenditure
Visualized the expenditure trends in various sectors like Education, Pharmaceuticals, Military, Infrastructure, Research and Development by different countries for the years 1960 - 2020 using Flourish, Data wrapper hosted on Google Sites
Machine LearningRegressionPython
Forecasting Healthcare costs
Predicted the cost of healthcare and insurance using Python and Linear Regression Machine Learning model with 80% accuracy
Machine LearningClassificationPython
Predicting hits on Spotify
Analyzed over 40000 songs of 6 different decades to predict hit songs on Spotify using various classification Machine learning models and Python.
Data AnalysisData VisualizationR
Olympic History Analytics
Discovering & visualizing various trends in 120 years of Olympic history using R
Data AnalysisData VisualizationR
Social Media Analysis
Taking a look at data of 1.6 million twitter users and drawing useful insights while exploring interesting patterns. The techniques used include text mining, sentimental analysis, probability, time series analysis and Hierarchical clustering on text/words using R