Archives around the world hold large amounts of data that we want to present in a useful way. Often this data needs to be searchable, so we build IT solutions that make this possible. For this thesis, we developed such a solution for searching the TV archive. We used Elasticsearch to store and search the data, as it is well suited to this type of problem. The first goal was to implement scripts that properly structure the data from the archive and write it to the Elasticsearch database. The second goal was to develop a web application that offers the user a search engine for shows. Despite a few obstacles, we ended up with a solution that met all our objectives. The result is a web application that searches the metadata of the shows as well as their subtitles and speech. Search hits are displayed on the screen so that the user can see various information about the broadcasts, and the user can also watch video clips of individual broadcasts.