Learning & Building in Data Science Sharing my journey as I pave my own path through the world of data science
Hi there!
You have found one of the best places to level up your data science skills and learn how to make better data-driven decisions.
My name is Joleen, I’m a data scientist and writer and I use this blog to share my passion for data science and statistics.
Blog Every week, I post new articles on data science, statistics and business intelligence. Here are my latest articles.

Overview of Machine Learning Ensemble Methods
Voting Ensembles Voting ensembles combine diverse machine learning models using techniques like majority voting or average predictions. The individual models used in the ensemble could be regression or classification-based algorithms. Once the individual models have been trained, the ensemble can be constructed in a couple of different ways. In regression, ensembles are created by averaging […]

Decision Trees in Python: Predicting Diabetes
In this post, we’ll be learning about decision trees, how they work and what the benefits are for using them. We’ll also use this algorithm in a real-world data to predict diabetes. So, what are decision trees? Decision trees are a machine learning method for classification or regression. It works by segmenting the dataset through […]

The Goldilocks Rule for Data Science Learning
I first discovered the Goldilocks Rule in the book Atomic Habits by James Clear where he says: “The Goldilocks Rule states that humans experience peak motivation when working on tasks that are right on the edge of their current abilities. Not too hard. Not too easy. Just right.” James Clear When you take on projects […]

An Introduction to Synthetic Data
According to Gartner, “by 2024, 60% of the data used for the development of AI and analytics projects will be synthetically generated.” I don’t know about you but I was shocked by that statistic and it shows that synthetic data is definitely a rising trend. In this blog post I will be exploring synthetic data […]