datasets

The datasets module provides auxiliary datasets used to support the package documentation

Transport dataset

The transport dataset was originally retrieved from the Kaggle datasets website. A description of it is available here.

The dataset contains around 50 individuals answers from a questionaire on their means of transport. For each answer we have the Age, Sex, Family Situation, If he has a car, If he has a bike and if he takes public transport

There is some information as well on the monthly income of the respondent


source

load_transport

 load_transport ()

Loads the transport dataset

To load the dataset simply call the function …

transport = load_transport()
transport.isna().sum()
age          0
sexe         7
sitfam       0
principal    0
voiture      4
velo         4
commun       4
revenu       3
dtype: int64

Predictions dataset


source

load_predictions

 load_predictions ()

Loads the transport dataset

predictions = load_predictions()
predictions.head()
y_true y_proba
0 0 0.283242
1 0 0.397620
2 0 0.417143
3 0 0.376334
4 0 0.299264