
## CRUD Master Py

### Instructions

APIs are a common and convenient way to deploy services in a modular way.
In this exercise we will create a simple microservices infrastructure with an
API Gateway connected to two services. The first service, the inventory API,
reads and writes data in a PostgreSQL database. The second service, the billing
API, only receives messages through RabbitMQ and stores the resulting orders in
its own PostgreSQL database. The gateway communicates with the inventory API
over HTTP and with the billing API through a message queue. The gateway and
each of the two services will run in its own virtual machine, which keeps them
isolated from one another.

#### General overview

![CRUD Master architecture diagram](resources/crud-master-diagram.png)

We will set up a movie streaming platform, where one API (`inventory`) will
have information on the movies available and another one (`billing`) will
process the payments.

The API gateway will communicate over HTTP with the `inventory` service and
through RabbitMQ with the `billing` service.

In this exercise you will need to install Python3 (with Flask, SQLAlchemy and
other packages), PostgreSQL, RabbitMQ, Postman, VirtualBox and Vagrant.

While it may seem overwhelming at first, there are a lot of resources available
both on the official websites and on community blogs about setting up those
tools. Also, the specific configuration details may change from platform to
platform, so don't hesitate to play around with it and make sure everything is
installed correctly before moving on.

#### API 1: Inventory

##### Definition of the Inventory API

This API will be a CRUD (Create, Read, Update, Delete) RESTful API. It will use
a PostgreSQL database. It will provide information about the movies present in
the inventory and allow users to do basic operations on it.

A common way to do so is to use Flask which is a popular Python web framework.
We will couple it with SQLAlchemy, an ORM which will abstract and simplify the
interactions between our API and the database.

Here are the endpoints with the possible HTTP requests:

- `/api/movies`: GET, POST, DELETE
- `/api/movies/:id`: GET, PUT, DELETE

Some details about each one of them:

- `GET /api/movies` retrieves all the movies.
- `GET /api/movies?title=[name]` retrieves all the movies with `name` in the title.
- `POST /api/movies` creates a new movie entry.
- `DELETE /api/movies` deletes all movies in the database.

- `GET /api/movies/:id` retrieves a single movie by `id`.
- `PUT /api/movies/:id` updates a single movie by `id`.
- `DELETE /api/movies/:id` deletes a single movie by `id`.

The API should work on `http://localhost:8080/`.
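
The endpoints listed above could be wired up roughly as follows. This is only a
minimal sketch: `MovieStore` is a hypothetical in-memory stand-in for the
database layer, the `create_app` factory and the status codes are assumptions,
and a real solution would replace the store with SQLAlchemy models.

```python
from typing import Optional


class MovieStore:
    """Hypothetical in-memory stand-in for the `movies` table."""

    def __init__(self) -> None:
        self._movies = {}
        self._next_id = 1

    def find(self, title: Optional[str] = None) -> list:
        """Return all movies, or only those with `title` in their title."""
        movies = list(self._movies.values())
        if title:
            movies = [m for m in movies if title.lower() in m["title"].lower()]
        return movies

    def create(self, title: str, description: str) -> dict:
        movie = {"id": self._next_id, "title": title, "description": description}
        self._movies[movie["id"]] = movie
        self._next_id += 1
        return movie

    def get(self, movie_id: int) -> Optional[dict]:
        return self._movies.get(movie_id)

    def update(self, movie_id: int, fields: dict) -> Optional[dict]:
        movie = self._movies.get(movie_id)
        if movie is None:
            return None
        # Only the editable columns are copied; `id` stays untouched.
        movie.update({k: v for k, v in fields.items() if k in ("title", "description")})
        return movie

    def delete(self, movie_id: int) -> bool:
        return self._movies.pop(movie_id, None) is not None

    def clear(self) -> None:
        self._movies.clear()


def create_app(store: MovieStore):
    """Build a Flask app exposing the routes of the Inventory API."""
    # Imported here so the store above can be used without Flask installed.
    from flask import Flask, jsonify, request

    app = Flask(__name__)

    @app.route("/api/movies", methods=["GET", "POST", "DELETE"])
    def movies():
        if request.method == "GET":
            return jsonify(store.find(request.args.get("title")))
        if request.method == "POST":
            data = request.get_json()
            return jsonify(store.create(data["title"], data.get("description", ""))), 201
        store.clear()
        return "", 204

    @app.route("/api/movies/<int:movie_id>", methods=["GET", "PUT", "DELETE"])
    def movie(movie_id):
        if request.method == "DELETE":
            if store.delete(movie_id):
                return "", 204
            return jsonify(error="movie not found"), 404
        if request.method == "PUT":
            found = store.update(movie_id, request.get_json())
        else:
            found = store.get(movie_id)
        if found is None:
            return jsonify(error="movie not found"), 404
        return jsonify(found)

    return app
```

With Flask installed, `create_app(MovieStore()).run(port=8080)` would serve the
sketch on the port mentioned above.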

##### Defining the Database

For the database we will use PostgreSQL.
The database will be called `movies_db`.

The `movies` table will contain the following columns:

- `id`: auto-generated unique identifier.
- `title`: the title of the movie.
- `description`: the description of the movie.
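
To make the table layout and the `?title=` filter concrete, here is a small
sketch. It deliberately uses SQLite from the standard library as a stand-in so
that it runs without a PostgreSQL server. The `NOT NULL` constraint and the
sample rows are assumptions, and in the exercise itself the table would be
defined through SQLAlchemy against PostgreSQL.

```python
import sqlite3

# Stand-in schema: in PostgreSQL the `id` column would typically be declared
# as SERIAL or as an identity column instead of AUTOINCREMENT.
SCHEMA = """
CREATE TABLE movies (
    id INTEGER PRIMARY KEY AUTOINCREMENT,
    title TEXT NOT NULL,
    description TEXT
)
"""


def find_by_title(conn: sqlite3.Connection, name: str) -> list:
    """Return (id, title, description) rows whose title contains `name`."""
    # The value is passed as a bound parameter, never formatted into the SQL.
    return conn.execute(
        "SELECT id, title, description FROM movies WHERE title LIKE ? ORDER BY id",
        (f"%{name}%",),
    ).fetchall()


conn = sqlite3.connect(":memory:")
conn.execute(SCHEMA)
conn.executemany(
    "INSERT INTO movies (title, description) VALUES (?, ?)",
    [
        ("The Matrix", "A hacker learns the truth."),
        ("Heat", "Cops and robbers."),
    ],
)
rows = find_by_title(conn, "matrix")
```

SQLite's `LIKE` ignores case for ASCII characters. In PostgreSQL, `LIKE` is
case-sensitive, so `ILIKE` is the usual choice for this kind of search.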

##### Testing the Inventory API

In order to test the correctness of your API you should use Postman or a
similar tool. You have to create one or more tests for every endpoint and then
export the configuration, so you will be able to reproduce the tests on
different machines easily.

> The configuration will be checked during the audit.

#### API 2: Billing

##### Definition of the billing API

This API will only receive messages through RabbitMQ. Specifically, it will
consume messages from the queue `billing_queue`. The messages it receives are
going to be "stringified" JSON objects, as in this example:

```json
{
  "user_id": "3",
  "number_of_items": "5",
  "total_amount": "180"
}
```

It will parse the message and create a new entry in the `orders` table of the
`billing_db` database. It will also send an acknowledgement to RabbitMQ once
the message has been processed. When the API is started, it will take and
process all the messages already present in the queue.

> Take a look at the `pika` Python library for an easy way to interface with
> RabbitMQ.
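
One possible shape for the consumer is sketched below. The `parse_order` and
`consume` helpers, the `save_order` callback, the `localhost` default and the
durable queue are all assumptions made for illustration. Only the queue name
and the message fields come from the subject.

```python
import json

QUEUE_NAME = "billing_queue"
REQUIRED_FIELDS = ("user_id", "number_of_items", "total_amount")


def parse_order(body: bytes) -> dict:
    """Decode a stringified JSON message and keep only the order fields."""
    data = json.loads(body)
    missing = [field for field in REQUIRED_FIELDS if field not in data]
    if missing:
        raise ValueError(f"missing fields: {', '.join(missing)}")
    return {field: data[field] for field in REQUIRED_FIELDS}


def consume(save_order, host: str = "localhost") -> None:
    """Consume `billing_queue` forever, storing each order via `save_order`."""
    # Imported here so parse_order can be used without pika installed.
    import pika

    connection = pika.BlockingConnection(pika.ConnectionParameters(host=host))
    channel = connection.channel()
    # Declaring the queue is idempotent; `durable=True` is an assumption.
    channel.queue_declare(queue=QUEUE_NAME, durable=True)

    def on_message(ch, method, properties, body):
        save_order(parse_order(body))  # hypothetical: inserts a row in `orders`
        # Acknowledge only after the order is stored, so a crash before this
        # line leaves the message in the queue for redelivery.
        ch.basic_ack(delivery_tag=method.delivery_tag)

    channel.basic_consume(queue=QUEUE_NAME, on_message_callback=on_message)
    channel.start_consuming()
```

Because acknowledgements are sent manually, messages published while the
service is down stay in the queue and are delivered once `consume` runs again.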

##### Defining the Database

For the database we will use PostgreSQL here as well.
The database will be called `billing_db`.

The `orders` table will contain the following columns:

- `id`: auto-generated unique identifier.
- `user_id`: the id of the user making the order.
- `number_of_items`: the number of items included in the order.
- `total_amount`: the total cost of the order.

##### Testing the Billing API

To test this API here are some steps:

- Publish a message directly to the `billing_queue` in RabbitMQ using its UI or CLI.
- When the Billing API is running, the orders should appear almost immediately
  in the `orders` table in the `billing_db` database.
- When the Billing API is not running, the requests to the API Gateway should
  still return success but the `orders` table in the `billing_db` database won't be updated.
- When the Billing API is started again, the pending messages should be
  processed and the `orders` table in the `billing_db` database should be updated.

#### The API Gateway

The Gateway will take care of routing the requests to the appropriate service
using the right protocol (HTTP for the Inventory API, RabbitMQ for the Billing
API).

##### Interfacing with Inventory API

The gateway will forward all requests on `/api/movies` to API 1 (Inventory),
without any need to check the information passed through it. It will return the
exact response received from API 1.
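
One way to implement this forwarding is a small reverse proxy built with Flask
and the `requests` library. The sketch below is only a suggestion: the
`upstream_url` and `create_app` names, the 5-second timeout and the choice of
`requests` are assumptions, not requirements of the subject.

```python
def upstream_url(base_url: str, subpath: str = "") -> str:
    """Build the Inventory API URL matching a request received by the gateway."""
    url = f"{base_url.rstrip('/')}/api/movies"
    return f"{url}/{subpath}" if subpath else url


def create_app(inventory_url: str):
    """Build a gateway app that relays /api/movies requests unchanged."""
    # Imported here so upstream_url can be used without these packages.
    import requests
    from flask import Flask, Response, request

    app = Flask(__name__)
    methods = ["GET", "POST", "PUT", "DELETE"]

    @app.route("/api/movies", defaults={"subpath": ""}, methods=methods)
    @app.route("/api/movies/<path:subpath>", methods=methods)
    def proxy_movies(subpath):
        # Replay the incoming request against the Inventory API.
        upstream = requests.request(
            method=request.method,
            url=upstream_url(inventory_url, subpath),
            params=request.args.to_dict(flat=False),
            data=request.get_data(),
            headers={
                "Content-Type": request.headers.get("Content-Type", "application/json")
            },
            timeout=5,
        )
        # Hand the upstream body, status code and content type back as-is.
        return Response(
            upstream.content,
            status=upstream.status_code,
            content_type=upstream.headers.get("Content-Type"),
        )

    return app
```

The base URL passed to `create_app` would normally come from an environment
variable rather than being written in the source code.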

<!-- TO DO: Add a suggestion on how to implement this ???-->

##### Interfacing with Billing API

The gateway will receive POST requests on `/api/billing` and publish a message
to the RabbitMQ queue `billing_queue`. The content of the message will be the
POST request body serialized to a JSON string (for example with `json.dumps`).
The Gateway should be able to send messages to the queue even if API 2 is not
running. When API 2 is started again, it should process those messages and
send an acknowledgement back to RabbitMQ.

An example of a POST request to
`http://[API_GATEWAY_URL]:[API_GATEWAY_PORT]/api/billing/`:

```json
{
  "user_id": "3",
  "number_of_items": "5",
  "total_amount": "180"
}
```

Once the message has been queued successfully, the gateway should answer with a response such as "Message posted" or a similar acknowledgment.

> Remember to set the `Content-Type: application/json` header on the
> request.
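
The publishing side could look like the sketch below. The `encode_message` and
`publish` helpers, the `localhost` default, the durable queue and the
persistent delivery mode are assumptions. The queue name is the one given
above.

```python
import json

QUEUE_NAME = "billing_queue"


def encode_message(payload: dict) -> bytes:
    """Serialize the POST body to the stringified JSON sent on the queue."""
    return json.dumps(payload).encode("utf-8")


def publish(payload: dict, host: str = "localhost") -> None:
    """Send one order to `billing_queue`, whether or not a consumer is running."""
    # Imported here so encode_message can be used without pika installed.
    import pika

    connection = pika.BlockingConnection(pika.ConnectionParameters(host=host))
    try:
        channel = connection.channel()
        # A durable queue survives a RabbitMQ restart (assumed setting, which
        # must match the declaration used by the consumer).
        channel.queue_declare(queue=QUEUE_NAME, durable=True)
        channel.basic_publish(
            exchange="",  # default exchange: routes by queue name
            routing_key=QUEUE_NAME,
            body=encode_message(payload),
            properties=pika.BasicProperties(delivery_mode=2),  # persistent message
        )
    finally:
        connection.close()
```

Since the gateway only talks to RabbitMQ, `publish` succeeds even while the
Billing API is stopped. The message simply waits in the queue.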

##### Documenting the API

Good documentation is a critical feature of every API. By design, APIs are
meant for others to use, so a lot of effort has gone into creating standard and
easy-to-implement ways to document them.

As an introduction to the art of great documentation you must create an OpenAPI
documentation file for the API Gateway. There are many different ways to do so.
A good start could be using SwaggerHub with at least a meaningful description
for each endpoint. Feel free to implement any extra feature as you see fit.

> You must also create a `README.md` file at the root of your project with
> detailed instructions on how to build and run your infrastructure and which
> design choices you made to structure it.

#### Virtual Machines

##### General overview

You will use VirtualBox and Vagrant to set up three different VMs in order to
test the interactions and the correctness of the responses across your API
infrastructure.

Vagrant is open-source software that helps you create and manage virtual
machines. With Vagrant, you can create a development environment that closely
mirrors your production environment, which makes it easier to develop, test,
and deploy your applications.

Your VMs will be structured as follows:

- `gateway-vm`: This VM will only contain the `api-gateway`.
- `inventory-vm`: This VM will contain the `inventory-app` API and the database
  `movies_db`.
- `billing-vm`: This VM will contain the `billing-app` API, the database
  `billing_db` and RabbitMQ.

> Vagrant is designed for development and should not be used in production
> environments.

##### Environment variables

To simplify the building process, it's recommended to define essential variables in a `.env` file. This approach facilitates the modification or update of critical information such as URLs, passwords, usernames and so on.

For this exercise, consider listing all required environment variables in the README.md file. Once you have these variables identified, create a `.env` file with the necessary credentials.

These variables will be utilized by Vagrant and distributed across the various microservices to centralize the credentials.

Your `.env` file should contain all the necessary credentials and none of the
microservices should have any credential hard coded in the source code.
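
As an illustration of reading credentials from the environment instead of the
source code, here is a minimal sketch that uses only the standard library. The
`parse_env` and `require` helpers are hypothetical, and libraries such as
`python-dotenv` offer a more complete implementation of the same idea.

```python
import os


def parse_env(text: str) -> dict:
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Surrounding quotes are optional in most .env files.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def require(name: str, env=os.environ) -> str:
    """Return a required variable or fail early with a clear message."""
    try:
        return env[name]
    except KeyError:
        raise RuntimeError(f"missing required environment variable: {name}") from None
```

A service could then call, for example, `require("INVENTORY_DB_PASSWORD")` at
startup (the variable name is made up here), so that a missing credential is
reported immediately rather than at the first database query.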

> For the purpose of this exercise, the `.env` file must be included in your repository. In real-world scenarios, it's crucial to avoid including sensitive data in repositories to prevent potential leaks.

##### Configuration of the VMs

- You will have a `Vagrantfile` which will create and start the three VMs. It
  will import the environment variables and pass them to each API.
- You will have a `scripts/` directory which will store all the scripts you may
  want to run in order to install the necessary tools on each VM. Those scripts
  may also be very useful for setting up the databases.

Your configuration will work properly for the following commands (executed from
the root of the project):

- `vagrant up --provider virtualbox`: Starts all the VMs.
- `vagrant status`: Shows the status for all the VMs.
- `vagrant ssh <vm-name>`: Will let you access the VM through SSH.

#### Manage Your Python applications with PM2

PM2 is a process manager that was built for Node.js applications but can also
run programs written in other languages, including Python scripts. It is
designed to keep your application running continuously, even in the event of an
unexpected failure.

PM2 can be used to start, stop, and list applications, as well as monitor their
resource usage and log output.

Additionally, PM2 provides a number of features for managing multiple
applications, such as load balancing and automatic restarts.

In our situation we will use it mainly to test resilience for messages sent to
the Billing API when the API is not up and running.

After connecting to your VM via SSH you may run the following commands:

- `sudo pm2 list`: List all running applications.
- `sudo pm2 stop <app_name>`: Stop a specific application.
- `sudo pm2 start <app_name>`: Start a specific application.

#### Project organization

##### README.md

As a good exercise and a helpful tool, you are required to deliver a
`README.md` describing the project.

The idea of a `README.md` is to give, in a few lines, enough context about a
project to understand what it is about and how to run it.

This file should include instructions to run and test the project. It should
also give a brief and clear overview of the stack used to build it.

##### Overall file structure

You can organize your internal file structure as you prefer. That said, here is
a common way to structure this kind of project that may help you:

```console
.
├── README.md
├── config.yaml
├── .env
├── scripts
│   └── [...]
├── srcs
│   ├── api-gateway
│   │   ├── app
│   │   │   ├── __init__.py
│   │   │   └── ...             // Other python files
│   │   ├── requirements.txt
│   │   └── server.py
│   ├── billing-app
│   │   ├── app
│   │   │   ├── __init__.py
│   │   │   └── ...             // Other python files
│   │   ├── requirements.txt
│   │   └── server.py
│   └── inventory-app
│       ├── app
│       │   ├── __init__.py
│       │   └── ...             // Other python files
│       ├── requirements.txt
│       └── server.py
└── Vagrantfile
```

When testing, and before automating it through the VM build, you should be able
to start the API Gateway and the two APIs by using the command `python
server.py` inside their respective directories.

As a best practice, you should develop your APIs using separate Python virtual
environments to isolate the requirements needed for each API. You can use
`venv` or any equivalent tool.

If you decide to use a different structure for your project remember you should
be able to explain and justify your decision during the audit.

> As a best practice it is strongly advised to add `venv/` to your
> `.gitignore` in order not to upload useless files into your git repository
> (they will be auto-generated during the build process).