mirror of https://github.com/01-edu/public.git
You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
2.1 KiB
2.1 KiB
NLP-enriched News Intelligence platform
Preliminary
Does the structure of the project look like the one described in the subject?
Does the environment contain all libraries used and their versions that are necessary to run the code?
Scraper
Run the scraper with python scraper_news.py
and fetch 300 articles. If needed, stop the program manually when enough data has been retrieved.
Does it run without any error and store the articles as described in the subject?
Topic classifier
Are the learning curves provided?
Do the learning curves prove the topics classifier is trained correctly - without overfitting? Ask the student to explain what the term "overfitting" means and how he avoided this phenomenon.
Additionally, you can look for external resources. For example, Wikipedia has a good page on "overfitting".