Podrobno

SyntheRela : a benchmark for synthetic relational database generation
ID Hudovernik, Valter (Avtor), ID Jurkovič, Martin (Avtor), ID Štrumbelj, Erik (Avtor)

.pdfPDF - Predstavitvena datoteka, prenos (13,13 MB)
MD5: 95EE2739509E0982590930C03B450B2F
URLURL - Izvorni URL, za dostop obiščite https://openreview.net/pdf?id=Mi8XioazWy Povezava se odpre v novem oknu

Izvleček
Synthesizing relational databases has started to receive more attention from researchers, practitioners, and industry. The task is more difficult than synthesizing a single table due to the added complexity of relationships between tables. For the same reasons, benchmarking methods for synthesizing relational databases introduces new challenges. Our work is motivated by a lack of an empirical evaluation of state-of-the-art methods and by gaps in the understanding of how such an evaluation should be done. We review related work on relational database synthesis, common benchmarking datasets, and approaches to measuring the fidelity and utility of synthetic data. We combine best practices, a novel robust detection metric, and a novel approach to evaluating utility with graph neural networks into a benchmarking tool. We use this benchmark to compare 6 open-source methods over 8 real-world databases, with a total of 39 tables. The open-source SyntheRela benchmark is available on GitHub with a public leaderboard.

Jezik:Angleški jezik
Ključne besede:relational databases, synthetic data, benchmarking, evaluation
Vrsta gradiva:Članek v reviji
Tipologija:1.01 - Izvirni znanstveni članek
Organizacija:FRI - Fakulteta za računalništvo in informatiko
Status publikacije:Objavljeno
Različica publikacije:Objavljena publikacija
Leto izida:2026
Št. strani:37 str.
PID:20.500.12556/RUL-182333 Povezava se odpre v novem oknu
UDK:004.65
ISSN pri članku:2835-8856
COBISS.SI-ID:276886531 Povezava se odpre v novem oknu
Datum objave v RUL:07.05.2026
Število ogledov:14
Število prenosov:1
Metapodatki:XML DC-XML DC-RDF
:
Kopiraj citat
Objavi na:Bookmark and Share

Gradivo je del revije

Naslov:Transactions on machine learning research
Skrajšan naslov:Transact. mach. learn. res.
Založnik:OpenReview.net
ISSN:2835-8856
COBISS.SI-ID:233590019 Povezava se odpre v novem oknu

Licence

Licenca:CC BY 4.0, Creative Commons Priznanje avtorstva 4.0 Mednarodna
Povezava:http://creativecommons.org/licenses/by/4.0/deed.sl
Opis:To je standardna licenca Creative Commons, ki daje uporabnikom največ možnosti za nadaljnjo uporabo dela, pri čemer morajo navesti avtorja.

Sekundarni jezik

Jezik:Slovenski jezik
Ključne besede:relacijske baze, sintetični podatki, primerjalna analiza, ocenjevanje

Podobna dela

Podobna dela v RUL:
Podobna dela v drugih slovenskih zbirkah:

Nazaj