Two weeks ago I claimed that women report higher job satisfaction when they work in countries where tech is more male-dominated. Then last week, instead of backing up that claim, I got sidetracked by questions of sample size and statistical power. In a previous blog post I introduced the Kaggle survey on women in tech and did some basic data cleaning on it. To save time and get to the point, I now pick up where I left off.

Just as the original Titanic VHS was released on two video cassettes, this Titanic analysis is being published in two posts. In this post, part 2, I explore random forests for the first time and compare the results to those of the logistic regression I ran last time.

Random forest vs. Logistic regression

Last time I explained how logistic regression uses a link function to transform non-linear relationships into linear ones.
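
As a quick refresher on the link function idea, here is a minimal Python sketch. It is not the code from the earlier post: the names `logit`, `inv_logit`, and `predicted_probability`, as well as the intercept and slope values, are made up purely for illustration. It shows that the model is a straight line on the log-odds scale even though the predicted probability follows an S-shaped curve.

```python
import math

def logit(p):
    """Link function: map a probability in (0, 1) to log-odds."""
    return math.log(p / (1.0 - p))

def inv_logit(z):
    """Inverse link: map log-odds back to a probability."""
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical coefficients, chosen only for illustration.
intercept, slope = -1.0, 0.5

def predicted_probability(x):
    # Linear on the log-odds scale, S-shaped on the probability scale.
    return inv_logit(intercept + slope * x)

for x in (0, 2, 4):
    p = predicted_probability(x)
    print(x, round(p, 3), round(logit(p), 3))
```

Each step of 2 in `x` changes the log-odds by the same amount (slope times 2), while the change in probability depends on where on the curve you start.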

