Učenje igranja realno-časovne strateške igre z uporabo globokega spodbujevalnega učenja

HABJAN, JERNEJ

Učenje igranja realno-časovne strateške igre z uporabo globokega spodbujevalnega učenja
ID HABJAN, JERNEJ (Author), ID Guid, Matej (Mentor) More about this mentor... This link opens in a new window

, ID Šter, Branko (Comentor)

PDF - Presentation file, Download (9,14 MB)
MD5: 3EED427493675897A1DADCCECED11D92

Abstract

Z algoritmom AlphaZero smo implementirali učenje in priporočanje akcij v realno-časovni strateški igri. Pregledali smo krajšo zgodovino globokega spodbujevalnega učenja na igrah in povzeli, zakaj je pristop samostojnega učenja najprimernejši. Za strateško igro smo določili stanja igre in stanje s kodirnikom preoblikovali v format, primeren za učenje nevronske mreže. Določili smo ustavitveni pogoj z iztekom števila preostalih potez za posameznega igralca. Utemeljili smo različne konfiguracije učnih parametrov in izpostavili tisto, ki računalniškega agenta na naši igri najuspešnejše uči. Rezultate smo prikazali s Python knjižnico Pygame in v celostnem pogonu Unreal Engine 4. V obeh vizualizacijah lahko igramo proti naučenemu modelu ali opazujemo, kako se dva računalniška nasprotnika bojujeta med sabo.

Language:	Slovenian
Keywords:	AlphaZero, realno-časovna strateška igra, Unreal Engine
Work type:	Bachelor thesis/paper
Organization:	FRI - Faculty of Computer and Information Science
Year:	2019
PID:	20.500.12556/RUL-106470
Publication date in RUL:	26.02.2019
Views:	1989
Downloads:	269
Metadata:
:	Copy citation
Share:

Secondary language

Abstract:
Language:	English
Title:	Learning to play a real-time strategy game with deep reinforcement learning
With algorithm AlphaZero we have implemented the learning and recommendation of actions in a real-time strategy game. We examined a short history of deep reinforcement learning in games and summarized why the self-learning approach is best suited. For a strategic game, we determined the state of the game and transformed it with the encoder into a format suitable for learning a neural network. We determined a stopping condition with the expiry of the number of remaining moves for each player. We substantiated different configurations of learning parameters and exposed the most successful configuration for learning our game. The results were displayed with the Python Pygame module and the game engine Unreal Engine 4. In both visualizations we can play against the learned model, or we can observe two computer opponents fighting against each other.
Keywords:	AlphaZero, real-time strategy game, Unreal Engine

Similar works from RUL:
Similar works from other Slovenian collections:

Secondary language

Similar documents