#### First Kaggle: Titanic

##### Preliminary

```
project
│   README.md
│   environment.yml
│   username.txt
│
├───data
│   │   train.csv
│   │   test.csv
│   │   gender_submission.csv
│
├───notebook
│   │   EDA.ipynb
│
└───scripts
```

###### Is the project structured as shown above?

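The structure check can be partly automated. The following is a minimal sketch, assuming the audit is run from the project root and that `scripts` only needs to exist as a directory; the names `EXPECTED_PATHS` and `missing_paths` are hypothetical helpers, not part of the project.

```python
from pathlib import Path

# Paths taken from the project tree above, relative to the project root.
EXPECTED_PATHS = [
    "README.md",
    "environment.yml",
    "username.txt",
    "data/train.csv",
    "data/test.csv",
    "data/gender_submission.csv",
    "notebook/EDA.ipynb",
    "scripts",
]


def missing_paths(project_root):
    """Return the expected paths that do not exist under project_root."""
    root = Path(project_root)
    return [p for p in EXPECTED_PATHS if not (root / p).exists()]
```

Calling `missing_paths(".")` from the project root returns an empty list when every expected file and folder is present.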
###### Does the README file introduce the project, show the username, describe the feature engineering, and show the best score on the leaderboard?

###### Does the environment contain all the libraries used, with their versions, that are necessary to run the code?

##### Feature engineering

###### Can the notebook be executed without any error?

###### Does the notebook explain the feature engineering that helped improve the accuracy?

##### Scripts

###### Can you train the best model on the training data, with feature engineering, without any error?

###### Can you predict on the test set using the best model without any error?

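Predictions on the test set are typically saved as a submission file. The sketch below assumes the submission follows the two-column layout of `gender_submission.csv` (`PassengerId`, `Survived`); the function name `write_submission` and the sample ids and predictions are illustrative only.

```python
import csv
import io


def write_submission(passenger_ids, predictions, out_file):
    """Write predictions as a two-column CSV: PassengerId, Survived."""
    if len(passenger_ids) != len(predictions):
        raise ValueError("passenger_ids and predictions differ in length")
    writer = csv.writer(out_file, lineterminator="\n")
    writer.writerow(["PassengerId", "Survived"])
    for pid, pred in zip(passenger_ids, predictions):
        writer.writerow([int(pid), int(pred)])


# Usage with made-up ids and predictions, written to an in-memory buffer.
buffer = io.StringIO()
write_submission([892, 893, 894], [0, 1, 0], buffer)
print(buffer.getvalue())
```

Passing an open file handle instead of the in-memory buffer writes the same content to disk.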
###### Is the score you get **on the test set** with the best model close to what is expected?

##### Final score

###### Is the accuracy associated with the username in `username.txt` higher than 79%? The best submission score can be accessed from the user profile.

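For reference, accuracy is the fraction of passengers whose predicted label matches the true label. The sketch below shows that definition and the 79% check; the leaderboard score itself is computed by Kaggle, so the helper names `accuracy` and `passes_final_score` are hypothetical and only illustrate the comparison.

```python
def accuracy(y_true, y_pred):
    """Return the fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred differ in length")
    if not y_true:
        raise ValueError("cannot compute accuracy on empty input")
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def passes_final_score(score, threshold=0.79):
    """Return True only if the score is strictly higher than the threshold."""
    return score > threshold
```

The comparison is strict because the question asks for a score higher than 79%, so a score of exactly 0.79 does not pass.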
##### Examples

Here are two very good submissions, explained in detail:

- https://www.kaggle.com/konstantinmasich/titanic-0-82-0-83
- https://www.kaggle.com/sreevishnudamodaran/ultimate-eda-fe-neural-network-model-top-2