top of page
Data Science Projects
1
Predicting Cancer Diagnosis in Women

The Problem
I had the Breast Cancer Wisconsin (Diagnostic) Data Set of several patients. I had to train my dataset of certain characteristics of patients and determine if the patient had breast cancer or not. I had to find out predicted classification answer for new unseen data of new patients.
The Solution
I built a Python based project and first figured out how to deal with missing data and cleaned a lot of data. Later I ran algorithms to predict cancer prognosis based on certain features.

2
Analyzing Users online behavior on a website

The Problem
I received data set about advertising data set, indicating whether or not a particular internet user clicked on an Advertisement. The goal is to create a model that will predict whether or not a given user will click on an ad based on the features characterizing him/her.
The Solution
I explored the data set and feature engineered new features about users behavior when seeing advertisement. I trained and tested data set on KNN model and logistical regression model to predict answers in Pandas Python.

3
Predicting Sale Prices of Used Houses

The Problem
I received data set to predict the sale price of a house based on different factors of the house. I had almost 90+ variables/features to select important features which would help me predict accurate prices of different types of houses.
The Solution
I explored data and cleaned a lot of data. Then I ran algorithms to find coefficients of different variables to see how much would they affect the outcome variable. Then after selecting important variables and doing feature engineering, I ran stacking, boosting, bagging Machine Learning algorithms to predict prices.


B. Yourself!
bottom of page