Writing tests is a time-consuming part of software development to which
developers devote considerable effort every day. With rapid advances in
large language models, we investigated the effectiveness of Retrieval-Augmented
Generation (RAG) technology for automated Python test generation. Large
language models are successful at generating code, but producing high-quality
tests remains one of their major weaknesses: they often generate tests without
understanding the surrounding context. RAG mitigates this lack of domain
knowledge by retrieving tests similar to the analyzed function from a
knowledge base and using them as examples for generating new ones. Guided by
the retrieved examples, the language model produces tests with fewer
hallucinations and other shortcomings.
We designed and implemented a prototype system and evaluated it on a real
project using both objective and subjective metrics. The generated tests
achieved higher code coverage but were less successful at detecting actual
bugs than manually written tests. In their subjective assessments, developers
likewise rated the manually written tests higher.
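A minimal sketch of the retrieve-then-generate flow described above, purely for illustration: the knowledge-base layout, the token-overlap similarity that stands in for embedding-based retrieval, and all names are assumptions, not the prototype's actual implementation; the assembled prompt would be passed to a language model.

```python
# Illustrative sketch of retrieval-augmented test generation (assumed names).
from dataclasses import dataclass


@dataclass
class Example:
    function_src: str   # source of a previously tested function
    test_src: str       # its existing (e.g., manually written) test


def _tokens(code: str) -> set[str]:
    """Crude lexical fingerprint of a code snippet."""
    return set(code.replace("(", " ").replace(")", " ").split())


def retrieve(target_src: str, knowledge_base: list[Example], k: int = 2) -> list[Example]:
    """Return the k examples most similar to the target function.
    Jaccard overlap of tokens stands in for embedding similarity here."""
    target = _tokens(target_src)

    def score(ex: Example) -> float:
        other = _tokens(ex.function_src)
        return len(target & other) / max(len(target | other), 1)

    return sorted(knowledge_base, key=score, reverse=True)[:k]


def build_prompt(target_src: str, examples: list[Example]) -> str:
    """Assemble the augmented prompt: retrieved (function, test) pairs,
    followed by the function under test."""
    parts = ["Write pytest tests for the last function, following the examples.\n"]
    for ex in examples:
        parts.append(f"# Function:\n{ex.function_src}\n# Test:\n{ex.test_src}\n")
    parts.append(f"# Function:\n{target_src}\n# Test:")
    return "\n".join(parts)


if __name__ == "__main__":
    kb = [Example("def add(a, b):\n    return a + b",
                  "def test_add():\n    assert add(2, 3) == 5")]
    target = "def sub(a, b):\n    return a - b"
    prompt = build_prompt(target, retrieve(target, kb))
    print(prompt)  # this prompt would be sent to the language model
```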