Browse Source

Chore(AI): Fix piscine structure

CON-3189-Update-structure-for-piscine
Oumaima Fisaoui 2 months ago
parent
commit
5272a6e0d0
  1. 3
      subjects/ai/classification/README.md
  2. 6
      subjects/ai/classification/audit/README.md
  3. 2
      subjects/ai/data-wrangling/audit/README.md
  4. 1
      subjects/ai/keras-2/README.md
  5. 1
      subjects/ai/keras-2/audit/README.md
  6. 2
      subjects/ai/keras/README.md
  7. 10
      subjects/ai/linear-regression/README.md
  8. 6
      subjects/ai/model-selection/README.md
  9. 4
      subjects/ai/neural-networks/README.md
  10. 2
      subjects/ai/nlp-spacy/README.md
  11. 2
      subjects/ai/nlp-spacy/audit/README.md
  12. 1
      subjects/ai/nlp/README.md
  13. 2
      subjects/ai/nlp/audit/README.md
  14. 2
      subjects/ai/pandas/README.md
  15. 2
      subjects/ai/training/README.md
  16. 5
      subjects/ai/visualizations/audit/README.md

3
subjects/ai/classification/README.md

@ -304,8 +304,8 @@ Preliminary:
- [Database](data/breast-cancer-wisconsin.data) and [database information](data/breast-cancer-wisconsin.names)
---
---
---
### Exercise 6: Multi-class (Optional)
@ -361,6 +361,7 @@ def predict_one_vs_all(X, clf0, clf1, clf2 ):
#TODO
return classes
```
Resources :
- https://www.kaggle.com/code/rahulrajpandey31/logistic-regression-from-scratch-iris-data-set

6
subjects/ai/classification/audit/README.md

@ -31,7 +31,6 @@ Score:
0.7142857142857143
```
---
---
@ -73,9 +72,9 @@ Coefficient: [[1.18866075]]
###### For question 4, does `predict_probability` output the same probabilities as `predict_proba`? Note that the values have to match one of the class probabilities, not both. To do so, compare the output with: `clf.predict_proba(X)[:,1]`. The shape of the arrays is not important.
###### Does `predict_class` output the same classes as `cfl.predict(X)` for question 5? The shape of the arrays is not important.
###### Does `predict_class` output the same classes as `cfl.predict(X)` for question 5? The shape of the arrays is not important.
###### Does the plot for question 6 look like the plot below? As mentioned, it is not required to shift the class prediction to make the plot easier to understand.
###### Does the plot for question 6 look like the plot below? As mentioned, it is not required to shift the class prediction to make the plot easier to understand.
![alt text][ex3q6]
@ -193,6 +192,7 @@ As said, for some reasons, the results may be slightly different from mine becau
---
#### Bonus
#### Exercise 6: Multi-class (Optional)
##### The exercise is validated if all questions of the exercise are validated

2
subjects/ai/data-wrangling/audit/README.md

@ -52,7 +52,7 @@
| 5 | 6 | nan | nan | O | P |
| 6 | 7 | nan | nan | Q | R |
| 7 | 8 | nan | nan | S | T |
Note: Check that the suffixes are set using the suffix parameters rather than manually changing the columns' name.
---

1
subjects/ai/keras-2/README.md

@ -4,7 +4,6 @@
This exercise set focuses on advanced applications of Keras for building and training neural networks. You'll work on both regression and multi-class classification problems, using real-world datasets like the Auto MPG and Iris datasets.
### Role Play
You're a data scientist at a biotech company developing AI-powered systems for various applications. Your current project involves creating neural networks for both regression and multi-class classification tasks. You'll be working on predicting car fuel efficiency and classifying flower species, showcasing the versatility of neural networks in different domains.

1
subjects/ai/keras-2/audit/README.md

@ -131,7 +131,6 @@ model.compile(loss='categorical_crossentropy',
---
#### Exercise 5: Multi classification example
##### The exercise is validated if all questions of the exercise are validated

2
subjects/ai/keras/README.md

@ -2,7 +2,7 @@
### Overview
This exercise focuses on using Keras to build and train neural networks. Keras is a high-level deep learning API that runs on top of TensorFlow, designed for fast experimentation with deep neural networks. You'll learn to create sequential models, add dense layers, design network architectures, and optimize your models.
This exercise focuses on using Keras to build and train neural networks. Keras is a high-level deep learning API that runs on top of TensorFlow, designed for fast experimentation with deep neural networks. You'll learn to create sequential models, add dense layers, design network architectures, and optimize your models.
### Role Play

10
subjects/ai/linear-regression/README.md

@ -126,7 +126,7 @@ X, y, coef = make_regression(n_samples=100,
![alt text][q1]
[q1]: ./w2_day1_ex2_q1.png 'Scatter plot'
[q1]: ./w2_day1_ex2_q1.png "Scatter plot"
2. Fit a LinearRegression from Scikit-learn on the generated data and give the equation of the fitted line. The expected output is: `y = coef * x + intercept`
@ -134,7 +134,7 @@ X, y, coef = make_regression(n_samples=100,
![alt text][q3]
[q3]: ./w2_day1_ex2_q3.png 'Scatter plot + fitted line'
[q3]: ./w2_day1_ex2_q3.png "Scatter plot + fitted line"
4. Predict on X.
@ -229,7 +229,7 @@ _Warning: The shape of X is not the same as the shape of y. You may need (for so
![alt text][ex5q1]
[ex5q1]: ./w2_day1_ex5_q1.png 'Scatter plot '
[ex5q1]: ./w2_day1_ex5_q1.png "Scatter plot "
As a reminder, fitting a Linear Regression on this data means finding (a, b) that fits well the data points.
@ -311,7 +311,7 @@ The expected output is:
![alt text][ex5q5]
[ex5q5]: ./w2_day1_ex5_q5.png 'MSE '
[ex5q5]: ./w2_day1_ex5_q5.png "MSE "
6. From the `losses` list, find the optimal value of a and b and plot the line in the scatter point of question 1.
@ -327,6 +327,6 @@ In a nutshell, Gradient descent is an optimization algorithm used to minimize so
![alt text][ex5q8]
[ex5q8]: ./w2_day1_ex5_q8.png 'MSE + Gradient descent'
[ex5q8]: ./w2_day1_ex5_q8.png "MSE + Gradient descent"
9. Use Linear Regression from Scikit-learn. Compare the results.

6
subjects/ai/model-selection/README.md

@ -2,7 +2,7 @@
### Overview
This exercise set focuses on advanced model selection techniques in machine learning. You'll work with cross-validation, grid search, and performance evaluation tools.
This exercise set focuses on advanced model selection techniques in machine learning. You'll work with cross-validation, grid search, and performance evaluation tools.
### Role Play
@ -245,7 +245,7 @@ The plot should look like this:
![alt text][logo_ex5q1]
[logo_ex5q1]: ./w2_day5_ex5_q1.png 'Validation curve '
[logo_ex5q1]: ./w2_day5_ex5_q1.png "Validation curve "
The interpretation is that from max_depth=10, the train score keeps increasing but the test score (or validation score) reaches a plateau. It means that choosing max_depth = 20 may lead to have an over fitted model.
@ -261,7 +261,7 @@ The interpretation is that from max_depth=10, the train score keeps increasing b
![alt text][logo_ex5q2]
[logo_ex5q2]: ./w2_day5_ex5_q2.png 'Learning curve '
[logo_ex5q2]: ./w2_day5_ex5_q2.png "Learning curve "
- **Note Plot Learning Curves**: The learning curves is detailed in the first resource.

4
subjects/ai/neural-networks/README.md

@ -147,7 +147,7 @@ Notice that the neuron **o1** in the output layer takes as input the output of t
In exercise 1, you implemented this neuron.
![alt text][neuron]
[neuron]: ./w3_day1_neuron.png 'Plot'
[neuron]: ./w3_day1_neuron.png "Plot"
Now, we add two more neurons:
@ -156,7 +156,7 @@ Now, we add two more neurons:
![alt text][nn]
[nn]: ./w3_day1_neural_network.png 'Plot'
[nn]: ./w3_day1_neural_network.png "Plot"
1. Implement the function `feedforward` of the class `OurNeuralNetwork` that takes as input the input data and returns the output y. Return the output for these neurons:

2
subjects/ai/nlp-spacy/README.md

@ -107,7 +107,7 @@ The goal of this exercise is to learn to use `spaCy` embedding on a document.
![alt text][logo]
[logo]: ./w3day05ex1_plot.png 'Plot'
[logo]: ./w3day05ex1_plot.png "Plot"
https://medium.com/datadriveninvestor/cosine-similarity-cosine-distance-6571387f9bf8

2
subjects/ai/nlp-spacy/audit/README.md

@ -58,7 +58,7 @@
![alt text][logo]
[logo]: ../w3day05ex1_plot.png 'Plot'
[logo]: ../w3day05ex1_plot.png "Plot"
---

1
subjects/ai/nlp/README.md

@ -5,6 +5,7 @@
“NLP makes it possible for humans to talk to machines:” This branch of AI enables computers to understand, interpret, and manipulate human language. This technology is one of the most broadly applied areas of machine learning and is critical in effectively analyzing massive quantities of unstructured, text-heavy data.
### Role Play
You're a Natural Language Processing (NLP) specialist at a tech startup developing a sentiment analysis tool for social media posts. Your task is to build the preprocessing pipeline and create a bag-of-words representation for tweet analysis.
### Learning Objectives

2
subjects/ai/nlp/audit/README.md

@ -40,7 +40,7 @@ Name: text, dtype: object
#### Exercise 2: Punctuation
###### For question 1, is validated if the ouptut doesn't contain punctuation `` !"#$%&'()*+,-./:;<=>?@[]^_`{|}~ ``. Is the previous statement true? Do not take into account the spaces in the output. The output should be as:
###### For question 1, is validated if the ouptut doesn't contain punctuation ``!"#$%&'()*+,-./:;<=>?@[]^_`{|}~``. Is the previous statement true? Do not take into account the spaces in the output. The output should be as:
```
Remove this from the sentence

2
subjects/ai/pandas/README.md

@ -6,7 +6,7 @@ This set of exercises focuses on using Pandas, a powerful library for data manip
### Role Play
You are a data analyst at a multinational energy company. Your team has been tasked with analyzing various datasets to improve operational efficiency and customer service.
You are a data analyst at a multinational energy company. Your team has been tasked with analyzing various datasets to improve operational efficiency and customer service.
Your manager emphasizes the importance of clean, efficient code and clear explanations of your methods and findings. You'll need to present your results to both technical team members and non-technical executives, so focus on creating clear visualizations and concise summaries of your insights.

2
subjects/ai/training/README.md

@ -306,4 +306,4 @@ Ressources:
- https://elutins.medium.com/grid-searching-in-machine-learning-quick-explanation-and-python-implementation-550552200596
- https://scikit-learn.org/stable/auto_examples/model_selection/plot_grid_search_digits.html
- https://scikit-learn.org/stable/auto_examples/model_selection/plot_grid_search_digits.html

5
subjects/ai/visualizations/audit/README.md

@ -20,11 +20,12 @@
##### The solution of question 1 is accepted if the plot reproduces the plot in the image and respect those criteria.
###### Does it have a title?
###### Does it have a title?
###### Does it have a name on x-axis?
###### Does it have a legend?
###### Does it have a legend?
![alt text][logo]
[logo]: ../w1day03_ex1_plot1.png "Bar plot ex1"

Loading…
Cancel
Save