Welcome to Sahil's Portfolio

Turning Data Into Decisions...

Sahil Kumar - Data Analyst | Data Scientist | ML Engineer

About Me

I'm a passionate Data Analyst, Data Scientist, and ML / AI Enthusiast who loves diving deep into raw data, building predictive models, and engineering AI-driven solutions that turn complexity into clarity.

Whether you're a startup that needs customer insights from day one, a business ready to automate reporting and forecasting, or an enterprise deploying machine learning at scale — I bring the analytical depth and engineering mindset to make your data work for you.

Skills
Experience
Education

Programming Languages

Python
Core programming language for data analysis, ML modeling, and automation with extensive library ecosystem
SQL
Advanced SQL for data extraction, transformation, and analysis across large datasets
Java
Object-oriented programming, data structures, and 270+ algorithm problems solved
C++
Low-level programming, competitive programming, and algorithm optimization
HTML/CSS/JavaScript/React.js/Next.js
Web development fundamentals for creating interactive dashboards and web applications

Tools & Platforms

Power BI & Tableau
Interactive dashboards and data visualizations for business intelligence and reporting
Jupyter Notebook
Interactive computational environment for data analysis, visualization, and documentation
Google Colab
Cloud-based Jupyter notebook for ML experiments with GPU/TPU acceleration
Git & GitHub
Version control, collaborative repositories, and open-source project management
Visual Studio
Integrated development environments for coding, debugging, and project management
Excel & Google Sheets
Advanced Excel with pivot tables, slicers, and data dashboards for analysis

Algorithms & ML Models

Classification Models
Logistic Regression, Decision Trees, Random Forest, SVM, Naive Bayes, Gradient Boosting
Regression Models
Linear Regression, Ridge, Lasso, Polynomial Regression, Multiple Regression
Clustering Algorithms
K-Means, Hierarchical Clustering, DBSCAN, Gaussian Mixture Models for unsupervised learning
Model Evaluation
Accuracy, Precision, Recall, F1-Score, ROC-AUC, Confusion Matrix, Cross-Validation
EDA & Feature Engineering
Data cleaning, missing value imputation, outlier detection, correlation analysis, feature selection

Data Analyst (Freelance)

Fiverr Aug 2024 – Present

Delivered 20+ freelance analytics projects, automating data cleaning and analysis workflows with Python and SQL, reducing client reporting time by 40%. Designed interactive dashboards in Power BI/Tableau for clients across 5+ industries.

Python SQL Power BI Tableau

Data Analytics Virtual Intern

Deloitte Australia (Job Simulation) Oct 2025

Completed Deloitte's data analytics and forensic technology job simulation. Created a Tableau dashboard to visualize analytical findings. Used Excel to classify datasets and draw key business conclusions.

Tableau Excel Forensic Analytics

Active Member – CS Society

Computer Science Society, Sukkur IBA University Nov 2024 – Present

Organized 3+ hackathons and tech seminars, increasing student participation by 50%. Collaborated with 20+ team members to run coding competitions, fostering a culture of innovation and problem-solving.

Event Management Hackathons Community Building

Bachelor of Science in Computer Science

Sukkur IBA University

Aug 2023 – Aug 2027 | CGPA: 3.35
Programming Fundamentals OOP Data Structures Database Systems Data Science Web Engineering Machine Learning Artificial Intelligence Information Security

Key Achievements

Academic & Competitive

2023 – Present
Merit Scholarship – Govt. of Sindh (2023) 🏆 Hackathon Winner – Sukkur IBA (2025) 290+ LeetCode Problems Solved

About Me

I'm a passionate Data Analyst, Data Scientist, and ML / AI Enthusiast who loves turning complex data into simple, beautiful, and actionable insights.

Whether you're a startup wanting to understand your customers, a business needing interactive dashboards, or an enterprise requiring predictive models, I help you achieve your goals with cutting-edge data science and analytical storytelling.

20+

Projects

5+

Industries Served

290+

LeetCode Problems

3.35

University CGPA

What I Do Best

Data That Speaks

Turning raw data into interactive dashboards & actionable insights

Models That Predict

Building ML models that forecast outcomes and drive decisions

Queries That Deliver

Extracting meaningful patterns from complex SQL databases

Frameworks & Libraries

Data Manipulation
Pandas, NumPy, Polars for efficient data loading, cleaning, and transformation
Data Visualization
Matplotlib, Seaborn, Plotly, ggplot for creating beautiful and interactive visualizations
Machine Learning
Scikit-learn, TensorFlow, PyTorch, Keras, XGBoost for building and training ML models
Image Processing
OpenCV, Pillow for image manipulation, computer vision, and image enhancement tasks
Statistical Analysis
SciPy, StatsModels for statistical testing, hypothesis testing, and statistical modeling
Feature Engineering
Feature-engine, Category Encoders for advanced feature transformation and selection

What Drives Me

Curiosity

Always exploring new datasets, algorithms, and data stories to uncover hidden insights.

Impact

Creating analyses and models that create measurable, real-world business value.

Growth

Continuously learning new tools and techniques to stay ahead in data science.

How I Can Help You

Transforming your raw data into powerful insights that drive results

Data Analysis & Dashboards

Turn your data into clear decisions

I transform messy, raw data into clean, interactive dashboards and reports. From sales trends to customer behavior analysis, I deliver insights that drive smarter business decisions.

What You Get:

  • Data cleaning & preprocessing
  • Interactive Power BI / Tableau dashboards
  • Excel reports with pivot tables & slicers
  • KPI tracking & business metrics
  • Automated reporting workflows
  • Presentation-ready insights
Python Power BI Tableau Excel

Machine Learning & Predictive Models

Predict outcomes, automate decisions

I build end-to-end ML pipelines that predict customer churn, detect anomalies, classify data, and uncover patterns — giving your business a competitive edge powered by AI.

What You Get:

  • Classification & regression models
  • Feature engineering & EDA
  • Model evaluation (Accuracy, F1, ROC-AUC)
  • Scikit-learn pipelines
  • Jupyter Notebook documentation
  • Interpretable predictions for stakeholders
Scikit-Learn Pandas NumPy Jupyter

SQL & Data Engineering

Query, structure, and deliver your data

I design efficient SQL queries, data pipelines, and structured databases that help organizations manage, access, and analyze large datasets — fast and reliably.

What You Get:

  • Advanced SQL queries & stored procedures
  • Database schema design
  • ETL pipeline development
  • Data aggregation & reporting
  • Statistical hypothesis testing
  • Research-grade data analysis
SQL Python Pandas Statistics

AI-Enabled Web Applications

Integrate intelligent models into web experiences

I build and deploy AI-powered web applications: model-backed APIs, real-time inference, chatbots, recommendation engines, and interactive visualizations that connect ML with modern web stacks.

What You Get:

  • Model serving & API integration (FastAPI/Flask)
  • Frontend integration (React/Vue)
  • Real-time inference & WebSockets
  • Containerization & deployment (Docker)
  • Scalable hosting (Cloud/Heroku)
  • Conversational agents & embeddings
FastAPI React Docker HuggingFace

My Process

1

Understand

We discuss your data, goals, and business questions

2

Explore

I clean, explore, and understand your dataset deeply

3

Analyze

I build models or dashboards tailored to your needs

4

Deliver

Clear reports, visualizations, and actionable recommendations

Professional Certifications

Validated credentials across data science, programming, and professional development

My Projects

Real projects, real results — see how I turn data into decisions

All Projects
Machine Learning
Data Analysis
SQL
Titanic Survival Prediction
Machine Learning

Titanic Survival Prediction

Classification Model with 87% Accuracy

Cleaned and engineered features from 890 passenger records using EDA, missing value imputation, and label encoding. Built Logistic Regression and Random Forest models — Random Forest achieved 87% accuracy (+5% over LR). Developed probability-based survival scoring for interpretable predictions.

Python Pandas Scikit-Learn Random Forest Logistic Regression
87% Accuracy
890 Records
International Debt Statistics
SQL Analysis

International Debt Statistics

Large-Scale SQL Data Analysis

Processed 100K+ records from the World Bank to identify key debt indicators and economic patterns. Produced structured insights used in 3 university research projects. Applied advanced SQL aggregations, filtering, and statistical summaries.

SQL Data Aggregation World Bank Data Research
100K+ Records
3 Research Uses
Students Mental Health Analysis
SQL Analysis

Students' Mental Health

Analyzing Survey Data for Wellbeing Insights

Analyzed survey data from 500+ students to highlight top factors affecting mental well-being. Suggested interventions that were adopted by student committees. Applied hypothesis testing and correlation analysis to draw meaningful conclusions.

SQL Statistics Survey Analysis Hypothesis Testing
500+ Students
Adopted By Committees
Excel Interactive Dashboards
Data Analysis

Interactive Excel Dashboards

Business Intelligence for Non-Technical Users

Built 5+ interactive dashboards using Excel and Python (Pandas) with slicers, pivot tables, and dynamic charts. Improved data accessibility for non-technical stakeholders by 60%. Automated repetitive reporting tasks, saving hours of manual work each week.

Excel Python Pandas Pivot Tables
5+ Dashboards
60% Better Access
PhoneBook Management System
Java / File Handling

PhoneBook Management System

Robust Record Management with Java & File Handling

Developed a full-featured phonebook application managing 1K+ contact records using Java and file handling. Implemented robust data storage and retrieval features with search, add, update, and delete operations. Focused on clean code architecture, usability, and performance for large record sets.

Java File Handling OOP Data Structures
1K+ Records Managed
CRUD Full Operations

Ready to Start Your Data Project?

Let's discuss how I can help bring clarity to your data

Start Your Project

What Clients Say

Don't just take my word for it — hear from satisfied clients

"Sahil delivered an exceptional analytics dashboard for our e-commerce business. His insights helped us cut reporting time by 40% and we made better decisions within weeks. Highly recommended!"

Client

Fiverr Client

E-Commerce Business Owner

"Outstanding work on our sales data analysis! Sahil's Power BI dashboard was professional, insightful, and delivered on time. Our team now makes decisions in hours instead of days."

Client

Fiverr Client

Sales Manager, Retail Industry

"The ML model Sahil built for our customer churn prediction was a game-changer. His findings helped us optimize retention strategy and reduce churn significantly. Excellent technical work!"

Client

Fiverr Client

Operations Manager, SaaS Company

Let's Work Together

Ready to transform your data into digital reality? I'm here to help!

Get In Touch

Whether you have a data project in mind, need an analytics dashboard, or just want to say hello, I'd love to hear from you. Let's discuss how we can turn your data into competitive advantage!

Email Me

sagramanisahil@gmail.com

I'll respond within 24 hours

Call Me

+92 349 6846904

Available Mon-Fri, 9AM-6PM PKT

Location

Sukkur, Pakistan

Available for remote work worldwide

Send Me a Message

Fill out the form below and I'll get back to you as soon as possible.

Frequently Asked Questions

How long does a data project take?

Timelines vary by complexity. A data cleaning + dashboard project takes 3–7 days. An ML model project typically takes 1–3 weeks depending on dataset size and requirements.

Do you provide ongoing support?

Yes! I offer support after project delivery, plus ongoing maintenance for dashboards and models as your data evolves over time.

What's your payment structure?

I typically work with a 50% upfront payment and 50% upon completion. For larger projects, we can discuss milestone-based payments.

What data formats do you work with?

I work with CSV, Excel, SQL databases, JSON, and API data. I can handle messy, unstructured, or large-scale datasets and deliver clean, usable outputs.