There are many different news websites across the web that serve similar news. The quality of articles varies greatly between different sources. There are also several applications that aggregate similar news. They often show the user the freshest article even though it may not necessarily be the most informative. The purpose of the thesis is to upgrade the basic news aggregator. This thesis covers the analysis of presentation and a content on news websites and the development of web application, which collects the news. Which are aggregated and sorted in a way that the better articles are exposed, based on algorithmic evaluation. The application consists of three components, all are made in programming language such as JavaScript, TypeScript and Python.
The first component collects content and serves as REST API to access collected content. It is implemented by using Node.js, Express and MongoDB. The second component grades and groups the collected texts by using machine learning libraries and is implemented in programming language Python. The third component is implemented using Angular to display the results of the analysis.
|