Learning & Building in Data Science
Sharing my journey as I pave my own path through the world of data science
You have found one of the best places to level up your data science skills and learn how to make better data-driven decisions.
My name is Joleen. I’m a data scientist and writer, and I use this blog to share my passion for data science and statistics.
Blog
Every week, I post new articles on data science, statistics and business intelligence. Here are my latest articles.
One of the most important aspects of machine learning classification models is evaluating how well they predict the target. For this, it’s essential to have a solid understanding of the confusion matrix and ROC curves. The confusion matrix breaks down a model’s predictions by showing true positives, true negatives, false positives, and false negatives. On […]
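To make the four cells of the confusion matrix concrete, here is a minimal sketch in plain Python. The function name `confusion_counts` and the toy labels are my own illustrative assumptions, not taken from the full article, and it assumes a binary target where 1 is the positive class.

```python
from collections import Counter


def confusion_counts(y_true, y_pred, positive=1):
    """Count TP, TN, FP and FN for a binary classifier.

    Hypothetical helper for illustration; assumes both sequences
    have the same length and contain only two distinct labels.
    """
    counts = Counter()
    for actual, predicted in zip(y_true, y_pred):
        if predicted == positive:
            # Predicted positive: correct if the actual label is positive.
            counts["TP" if actual == positive else "FP"] += 1
        else:
            # Predicted negative: correct if the actual label is negative.
            counts["FN" if actual == positive else "TN"] += 1
    return {key: counts[key] for key in ("TP", "TN", "FP", "FN")}


# Toy labels, made up for this example.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

result = confusion_counts(y_true, y_pred)
print(result)
```

In practice a library routine such as scikit-learn's `confusion_matrix` would usually do this job; the hand-rolled version is only meant to show what each cell counts.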
In data science, there’s an often-underestimated hero working behind the scenes: data preprocessing. Imagine analyzing a dataset filled with gaps, inconsistencies, and outliers. The results would be like deciphering a blurred photograph—it’s frustrating and usually just a total waste of time. That’s where data preprocessing comes in. It’s the process of cleaning, transforming, and structuring […]
This project is based on Season 3, Episode 2 of the Kaggle Playground Series. The title of this episode is: “Tabular Classification with a Stroke Prediction Dataset”. Our task is to predict the probability that a patient will have a stroke. The target, stroke, is a binary variable and so classification methods are needed to predict the […]
This project is based on Season 3, Episode 1 of the Kaggle Playground Series. The title of this episode is: “Tabular Regression with the California Housing Dataset”. Our task is to predict the median housing value of a block group of housing. In this project, my goal is to use the Julia programming language for […]