
Ensembles of probabilistic classification rules
BAJUK, Maj (Author), ROBNIK ŠIKONJA, Marko (Mentor)

PDF - Presentation file (5.13 MB)
MD5: 12A8878FFD214786AE0EA0337885562E

Abstract
Ensembles combine several weak predictive models and build more accurate models from their individual predictions. The gains are not only in higher classification accuracy; bias and variance are reduced as well. The idea of this thesis is to combine probabilistic decision rules into ensembles. These differ from ordinary decision rules in their output: instead of a decision for a single class, they return its probability. We exploit this to build the Random Gaussian Rule Set (RGRS) model, which draws its inspiration from the random forest method. To build each rule, the model randomly selects examples and tries to construct a conjunction of elementary Gaussian rules from the distribution of their attribute values. We also describe an alternative idea in which the CN2SD rule-induction algorithm is used within an AdaBoost ensemble. Both approaches are tested with cross-validation and compared to existing approaches. While the CN2SD AdaBoost ensemble only improves the accuracy of the underlying model, the RGRS algorithm returns results comparable to the commonly used C4.5rules model, but lags behind random forests and fuzzy-rule classification with the FURIA algorithm.
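The RGRS model described above builds each rule from a random sample of examples and combines elementary Gaussian conditions on attribute values into a conjunction whose output is a class probability. The sketch below illustrates that idea only; the names (GaussianRule, build_rule, predict_proba), the single Gaussian per attribute, and the averaging of rule outputs are assumptions made for illustration, not the thesis implementation.

```python
# Minimal sketch of the Gaussian-rule idea from the abstract.
# All names and modelling choices here are illustrative assumptions.
import numpy as np

class GaussianRule:
    """Conjunction of elementary Gaussian conditions, tied to one class."""
    def __init__(self, target_class, means, stds):
        self.target_class = target_class
        self.means = means              # per-attribute mean of the sampled examples
        self.stds = stds                # per-attribute standard deviation

    def probability(self, x):
        # Each elementary condition scores how close an attribute value lies
        # to the sampled mean; the conjunction is the product of the scores.
        scores = np.exp(-0.5 * ((x - self.means) / self.stds) ** 2)
        return float(np.prod(scores))

def build_rule(X, y, target_class, rng, n_examples=20):
    """Fit one rule to a random sample of examples of target_class."""
    idx = rng.choice(np.where(y == target_class)[0], size=n_examples, replace=True)
    sample = X[idx]
    return GaussianRule(target_class, sample.mean(axis=0), sample.std(axis=0) + 1e-9)

def predict_proba(rules, x, classes):
    """Average the rule outputs per class and normalise to a distribution."""
    votes = {c: [] for c in classes}
    for rule in rules:
        votes[rule.target_class].append(rule.probability(x))
    raw = np.array([np.mean(votes[c]) if votes[c] else 0.0 for c in classes])
    return raw / raw.sum() if raw.sum() > 0 else np.full(len(classes), 1 / len(classes))

# Toy usage: a small random ensemble of ten rules per class on synthetic data.
rng = np.random.default_rng(0)
X = rng.normal(size=(100, 4))
y = rng.integers(0, 2, size=100)
rules = [build_rule(X, y, c, rng) for c in (0, 1) for _ in range(10)]
print(predict_proba(rules, X[0], classes=(0, 1)))
```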

Language: Slovenian
Keywords: ensemble learning, classification, probability, Gaussian rules
Work type: Bachelor thesis/paper
Organization: FRI - Faculty of Computer and Information Science
Year: 2019
PID: 20.500.12556/RUL-110080
COBISS.SI-ID: 1538349251
Publication date in RUL: 11.09.2019
Views: 1418
Downloads: 191
Metadata: XML RDF-CHPDL DC-XML DC-RDF

Secondary language

Language: English
Title: Ensembles of probabilistic classification rules
Abstract:
Ensemble methods combine a number of weak classifiers and aggregate their predictions into a more accurate model. The gain is not only an increase in classification accuracy but also a decrease in bias and variance. The idea of our work is to build probabilistic classification rule ensembles. Probabilistic rules differ from common classification rules by outputting a class probability estimate instead of a crisp class decision. We build an ensemble model named Random Gaussian Rule Set (RGRS), inspired by the Random Forest method. For each rule, our model draws a random sample of examples and tries to create a conjunction of elementary Gaussian rules based on the distribution of their attribute values. We also describe an alternative idea where we form an AdaBoost ensemble with the CN2SD rule-induction algorithm. We test both methods with cross-validation and compare them to existing approaches. While the CN2SD AdaBoost ensemble only improves the accuracy of the underlying model, RGRS achieves results comparable to established models such as C4.5rules, but trails behind Random Forest and the fuzzy-rule classifier FURIA.
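The comparison described above relies on boosting a rule learner and evaluating all models with cross-validation. A minimal sketch of that setup follows; since CN2SD is not available in scikit-learn, a depth-1 decision tree stands in for the weak rule learner, and the iris dataset and fold count are placeholders rather than the thesis's experimental design.

```python
# Sketch of the boosting + cross-validation comparison described in the abstract.
# The base learner, dataset and fold count are stand-in assumptions.
from sklearn.datasets import load_iris
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.tree import DecisionTreeClassifier

X, y = load_iris(return_X_y=True)

models = {
    # Depth-1 trees play the role of the weak rule learner (CN2SD in the thesis).
    "boosted weak rule-like learner": AdaBoostClassifier(
        DecisionTreeClassifier(max_depth=1), n_estimators=50, random_state=0),
    "random forest": RandomForestClassifier(n_estimators=100, random_state=0),
}

# Cross-validated accuracy, as used for the comparison of approaches.
for name, model in models.items():
    scores = cross_val_score(model, X, y, cv=10)
    print(f"{name}: mean accuracy {scores.mean():.3f}")
```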

Keywords: ensemble learning, classification, probability, Gaussian rules
