Učenje igranja igre štiri v vrsto z uporabo globokega spodbujevalnega učenja (Learning to play the game of Connect Four with deep reinforcement learning): master's thesis

Lušina, Domen

Repository of the University of Ljubljana

Details

Lušina, Domen (Author), Guid, Matej (Mentor)

PDF - Presentation file (6.85 MB)
MD5: F06F0AC95716F610F5D6C136DB21FED6

Abstract

Learning to play games is one of the topics that artificial intelligence researchers have worked on since the field's beginnings. The aim is to create programs that enable a computer to play games intelligently. In recent years, deep learning methods have been used increasingly for this purpose. AlphaZero is a deep reinforcement learning algorithm that, with great success and without expert knowledge, learned to play chess, shogi and Go at a superhuman level. In this work we applied the AlphaZero algorithm to learning the game Connect Four, with an emphasis on investigating how introducing expert knowledge affects the program's performance. We presented several methods of introducing expert Connect Four heuristics into the learning phase of the AlphaZero algorithm. We used several expert heuristics and different methods of introducing expert knowledge. Evaluation was carried out on pre-prepared sets of positions from different stages of the game, by means of games with move correction, and by playing against opponents of different difficulty levels, including an optimal opponent. With the feature heuristic, which among other things encourages connecting tokens in a row, we achieved a slight improvement in results.

Language:	Slovenian
Keywords:	artificial intelligence, deep learning, reinforcement learning, Monte Carlo tree search, neural networks, expert heuristic
Work type:	Master's thesis/paper
Typology:	2.09 - Master's Thesis
Organization:	FMF - Faculty of Mathematics and Physics
Year:	2021
PID:	20.500.12556/RUL-124785
UDC:	004.42
COBISS.SI-ID:	200663299
Publication date in RUL:	18.02.2021
Views:	3154
Downloads:	240

Secondary language

Language:	English
Title:	Learning to play the game of Connect Four with deep reinforcement learning
Abstract:	Learning to play games has been a topic of interest to researchers since the early days of artificial intelligence. The goal is to create programs that enable computers to play games intelligently. In recent years, deep learning has been used increasingly for this purpose. AlphaZero is a deep reinforcement learning algorithm that has achieved a superhuman level of play in chess, shogi and Go without any domain knowledge. In this thesis, we used AlphaZero to learn to play Connect Four, with a focus on using expert knowledge to improve it. Several methods are presented that introduce expert heuristics into the learning phase of the AlphaZero algorithm. Using field and feature heuristics, we evaluated the different methods on sets of positions, in games with error correction, and against four different opponents, one of which plays optimally. With the feature heuristic, which encourages connecting game pieces, we were able to slightly improve the results on the position sets as measured by various metrics.
Keywords:	artificial intelligence, deep learning, reinforcement learning, Monte Carlo tree search, neural networks, expert heuristic
