As part of this thesis, a reporting system was developed based on data obtained from automatic passenger counters installed on Slovenian Railways trains.
The thesis is divided into two main parts: the preparation of data for analysis as part of building a data warehouse, and the development of the final report.
The data warehouse was built in Microsoft Fabric, where the collected data was integrated with supporting sources such as timetables, validation records, and vehicle information.
All acquired data was transformed into a standardized format, after which a data model in the form of a snowflake schema was designed.
The data warehouse architecture follows the medallion architecture principle.
Based on the processed data, an interactive report was created using Power BI.
This report enables drill-down analysis, allowing users to explore data in increasing levels of detail.
The report is divided into multiple categories (Train, Station, Vehicle), each containing multiple logically connected pages.
Each page contains various key performance indicators, graphs, and tables that support analysis ranging from high-level overviews of aggregated data to insights into near-raw data.
This structure enables data analysis from multiple perspectives, thereby contributing to greater analytical flexibility.
|