First Kaggle: Titanic

Preliminary
project
│   README.md
│   environment.yml
│   username.txt
│
└───data
│   │   train.csv
│   │   test.csv
│   │   gender_submission.csv
│
└───notebook
│   │   EDA.ipynb
│
└───scripts
│

Does the project structure look like the one above?
Does the README file introduce the project, show the username, describe the feature engineering, and show the best score on the leaderboard?
Does the environment file list all the libraries, with their versions, needed to run the code?
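
The structure check can be partly automated. The following is a minimal sketch, not part of the original audit: the helper name `missing_paths` and the assumption that the project root is a folder called `project` are illustrative only. It uses nothing beyond the Python standard library and reports which of the paths from the tree above are absent.

```python
from pathlib import Path

# Expected layout, copied from the tree above (paths relative to the project root).
EXPECTED_FILES = [
    "README.md",
    "environment.yml",
    "username.txt",
    "data/train.csv",
    "data/test.csv",
    "data/gender_submission.csv",
    "notebook/EDA.ipynb",
]
EXPECTED_DIRS = ["scripts"]


def missing_paths(root):
    """Return the expected files and directories that are absent under root.

    Hypothetical helper for this audit; the name is not prescribed by the project.
    """
    root = Path(root)
    missing = [p for p in EXPECTED_FILES if not (root / p).is_file()]
    missing += [d for d in EXPECTED_DIRS if not (root / d).is_dir()]
    return missing


if __name__ == "__main__":
    # Assumption: the root folder is named "project", as in the tree.
    for path in missing_paths("project"):
        print(f"missing: {path}")
```

An empty result only shows that the files exist. Whether the README and `environment.yml` contain what the questions ask for still has to be checked by reading them.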

Feature engineering

Can the notebook be executed without any error?
Does the notebook explain the feature engineering that contributed to improving the accuracy?

Scripts

Can you train the best model, with feature engineering, on the training data without any error?
Can you predict on the test set using the best model without any error?
Is the score you get on the test set with the best model close to what is expected?
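
Before comparing scores, it can help to confirm that the prediction file is well formed. The sketch below is an assumption-laden illustration rather than the project's own tooling: the function name `check_submission` is hypothetical, and the column names `PassengerId` and `Survived` are assumed to follow the format of `gender_submission.csv`. It checks that the submission covers exactly the passengers in `test.csv` and that every prediction is 0 or 1.

```python
import csv


def check_submission(submission_csv, test_csv):
    """Return a list of problems found in a Titanic submission file.

    Hypothetical helper; column names are assumed to match gender_submission.csv.
    """
    with open(test_csv, newline="") as f:
        expected_ids = [row["PassengerId"] for row in csv.DictReader(f)]
    with open(submission_csv, newline="") as f:
        reader = csv.DictReader(f)
        header = reader.fieldnames or []
        rows = list(reader)

    problems = []
    if set(header) != {"PassengerId", "Survived"}:
        problems.append(f"unexpected columns: {header}")
        return problems  # the remaining checks need both columns
    if sorted(r["PassengerId"] for r in rows) != sorted(expected_ids):
        problems.append("PassengerId values do not match test.csv")
    if any(r["Survived"] not in {"0", "1"} for r in rows):
        problems.append("Survived must be 0 or 1")
    return problems
```

A call such as `check_submission("submission.csv", "data/test.csv")` returns an empty list when the file passes these checks; `submission.csv` is a placeholder name here. This validates the format only. The score itself still comes from the leaderboard.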

Final score

Is the accuracy associated with the username in username.txt higher than 79%? The best submission score can be found on the user's Kaggle profile.

Examples

Here are two very good submissions explained and detailed: