mirror of https://github.com/01-edu/public.git
You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
nprimo
700efcb57b
|
7 months ago | |
---|---|---|
.. | ||
README.md | 7 months ago |
README.md
NLP-enriched News Intelligence platform
Preliminary
Does the structure of the project look like the one described in the subject?
Does the environment contain all libraries used and their versions that are necessary to run the code?
Scraper
There are at least 300 news articles stored in the file system or the database.
Run the scraper with python scraper_news.py
and fetch 3 documents. The scraper is not expected to fetch 3 documents and stop by itself, you can stop it manually.
Does it run without any error and store the 3 files as expected?
Topic classifier
Are the learning curves provided?
Do the learning curves prove the topics classifier is trained correctly - without overfitting? Ask the student to explain what the term "overfitting" means and how he avoided this phenomenon.
Additionally, you can look for external resources. For example, Wikipedia has a good page on "overfitting".