Avtomatizirano pridobivanje in relacijsko-grafna analiza podatkov omrežne infrastrukture National Grid

Uršič, Dominik

Repository of the University of Ljubljana

Details

Avtomatizirano pridobivanje in relacijsko-grafna analiza podatkov omrežne infrastrukture National Grid
ID Uršič, Dominik (Author), ID Kukar, Matjaž (Mentor) More about this mentor... This link opens in a new window

, ID Hajna, Tadej (Comentor)

PDF - Presentation file, Download (5,93 MB)
MD5: 1BE63F7E9C54E773815FF39B3D612F1D

Abstract

Cilj diplomske naloge je razvoj in implementacija avtomatiziranega ETL-sistema za pridobivanje, obdelavo in shranjevanje podatkov o javni električni infrastrukturi operaterja National Grid v Združenem kraljestvu, z implementacijo hibridnega relacijsko-grafovskega pristopa k analizi omrežne strukture. Sistem bo redno prenašal javno dostopne Excelove datoteke s spletne platforme National Grid, jih arhiviral v Google Cloud Storage za zagotavljanje zgodovinske sledljivosti ter jih procesiral s pythonovimi skriptami za čiščenje, validacijo in transformacijo podatkov. Posebna pozornost bo namenjena kazalniku razpoložljive kapacitete (angl. Demand Headroom), ki predstavlja razliko med zanesljivo nosilnostjo omrežnega elementa in njegovo pričakovano najvišjo obremenitvijo ter tako določa preostalo zmogljivost pred potrebnimi infrastrukturnimi nadgradnjami. Obdelani podatki bodo naloženi v PostgreSQL podatkovno bazo, razširjeno z Apache AGE grafovsko nadgradnjo, kar bo omogočalo izvajanje tako tradicionalnih SQL kot tudi grafovskih cypherskih poizvedb nad isto podatkovno strukturo. Ta hibridni pristop bo demonstriran z implementacijo analitičnih poizvedb za optimizacijo omrežnih povezav in izračun prenosnih izgub električne energije, izvedenih v obeh pristopih za neposredno primerjavo njihovih prednosti in omejitev. Celoten proces bo v celoti avtomatiziran z uporabo Google Cloud Scheduler, ki bo zagotavljal redno izvajanje, mehanizme nadzora kakovosti podatkov ter popolno sledljivost vseh operacij. Sistem je zasnovan skalabilno, kar omogoča enostavno razširitev na dodatne operaterje distribucijskih omrežij ter predstavlja osnovo za potencialni razvoj celovite nacionalne platforme spremljanja električne infrastrukture v podporo energetski tranziciji.

Language:	Slovenian
Keywords:	avtomatizacija pridobivanja podatkov, grafovske baze podatkov, Apache AGE, elektroenergetska infrastruktura, ETL sistem, hibridna analiza
Work type:	Bachelor thesis/paper
Organization:	FRI - Faculty of Computer and Information Science
Year:	2026
PID:	20.500.12556/RUL-181331
Publication date in RUL:	01.04.2026
Views:	27
Downloads:	1
Metadata:
:	Copy citation
Share:

Secondary language

Abstract:
Language:	English
Title:	Automated data acquisition and relational/graph-based analysis of National Grid network infrastructure
The objective of this thesis is the development and implementation of an automated ETL system for acquiring, processing, and storing data on public electrical infrastructure of the National Grid operator in the United Kingdom, with implementation of a hybrid relational-graph approach to network structure analysis. The system will regularly download publicly available Excel files from the National Grid platform, archive them in Google Cloud Storage to ensure historical traceability, and process them using Python scripts for data cleaning, validation, and transformation. Special attention will be given to the Demand Headroom indicator, which represents the difference between the reliable capacity of a network element and its expected peak load, thus determining the remaining capacity before infrastructural upgrades are required. The processed data will be loaded into a PostgreSQL database extended with the Apache AGE graph extension, enabling the execution of both traditional SQL and graph-based cypher queries on the same data structure. This hybrid approach will be demonstrated through the implementation of analytical queries for network connection optimization and electrical transmission loss calculations, executed in both approaches for direct comparison of their advantages and limitations. The entire process will be fully automated using Google Cloud Scheduler, which will ensure regular execution, data quality control mechanisms, and complete traceability of all operations. The system is designed to be scalable, enabling easy extension to additional distribution network operators and providing a foundation for potential development of a comprehensive national platform for monitoring electrical infrastructure in support of energy transition.
Keywords:	automated data acquisition, graph databases, Apache AGE, electrical grid infrastructure, ETL system, hybrid analysis

Similar works from RUL:
Similar works from other Slovenian collections:

Details

Secondary language

Similar documents