Primerjava statističnih modelov za ocenjevanje dejavnikov tveganja slovenskih družb

Sokolič, Andrej

Primerjava statističnih modelov za ocenjevanje dejavnikov tveganja slovenskih družb
ID Sokolič, Andrej (Author), ID Marinšek, Denis (Mentor) More about this mentor... This link opens in a new window

PDF - Presentation file, Download (6,08 MB)
MD5: 2C254369F3FE9D80CFC6FC060E69A1B4

Abstract

V magistrskem delu je obravnavan problem ocenjevanja dejavnikov tveganja družb na podlagi računovodskih in firmografskih podatkov, s poudarkom na statističnih metodah, ki se uporabljajo za njihovo ocenjevanje. Kreditno tveganje je bilo v zadnjih desetletjih deležno velikega zanimanja in so metode za njegovo ocenjevanje prešle iz subjektivnih modelov ekspertnega mnenja do modernejših kvantitativnih metod. S tem je tesno povezano naraščajoče število razpoložljivih informacij ter lažja dostopnost do informacij in programske opreme. Po začetnih klasifikacijskih modelih, ki so pogosto temeljili na podatkih iz daljših časovnih obdobij in delovali na principu uvrščanja družb v dve skupini glede na njihove lastnosti v nekem času, so zagon dobili modeli ogroženosti v diskretnem času, ki se jih ocenjuje z modelom logistične regresije. Ti med drugim s ponovnimi meritvami omogočajo spremljanje finančne strukture družb. S tem so lahko tako ocene dejavnikov tveganja kot tudi samega tveganja natančnejše in bolj opisne za proučevano populacijo, saj se zaradi uporabe vseh podatkov zmanjša vpliv pristranskosti zaradi izbire vzorca. Čeprav so omenjeni modeli pogosto priporočani in tudi uporabljeni, je njihovo delovanje najpogosteje predstavljeno le z AUC, mero razločevalne moči klasifikacijskih modelov, in brez vpogleda v nove možnosti, ki jih model omogoča. Z namenom proučitve razlik med priljubljenimi metodami sta bila na podatkih slovenskih družb poleg izhodiščnega modela ogroženosti v diskretnem času s časovno spreminjajočimi se spremenljivkami razvita še model ogroženosti v diskretnem času z začetnimi vrednostmi spremenljivk in model logistične regresije. Ugotovitve vseh modelov kažejo, da se zlasti donosnost, likvidnost in zadolženost družb s težavami v poslovanju razlikujejo od družb brez težav in da se finančne spremenljivke dopolnjujejo z nefinančnimi spremenljivkami. Najboljšo razločevalno moč ugotovimo pri modelu logistične regresije ocenjenem na zadnjih razpoložljivih podatkih vsake družbe, vendar zaradi pristranske izbire vzorca občutno precenjuje opazovano stopnjo tveganja. Prav tako je ugotovljen problem pri uporabi podatkov iz različnih časovnih obdobij/let, saj se v dotičnem primeru vplivi dejavnikov tveganja in število dogodkov razlikujejo v času, zaradi česar z logistično regresijo ocenjeni vplivi dejavnikov tveganja v nekaterih obdobjih odstopajo od dejanskega stanja. Iz primerjave modelov ogroženosti je očitno, da je za učinkovitost modela potrebno upoštevati čim bolj ažurne podatke, saj z oddaljevanjem od časa meritev vse slabše opisujejo stanje družbe. Vsi trije modeli so ustrezni in uporabni, a ugotavljamo, da ima model ogroženosti s časovno spreminjajočimi se spremenljivkami več informacij in veliko bolje zajame dogajanje v vseh obdobjih. Ugotovljeno je bilo, da je model mogoče dodatno izboljšati z vključitvijo slučajnih vplivov, vendar v tem delu vplivi vključitve le-teh niso bili podrobneje raziskani. Razlike so tudi v sami uporabnosti modelov, kjer modeli ogroženosti poleg same ocene tveganja in uvrščanja družb omogočajo še spremljanje profilov ogroženosti različnih družb. Z ocenjenimi ogroženostmi v posameznih obdobjih se lahko ustrezno oceni verjetnost "preživetja" nekega časovnega obdobja, kar je zelo koristno za oceno tveganja kreditov z daljšo ročnostjo. Nenazadnje ocenjene verjetnosti preživetja omogočajo alternativen pristop ovrednotenja razločevalne moči modelov v času z izračunom AUC(t) in Ctd. Meri se izkažeta kot bolj informativni, saj nudita vpogled v razločevalno moč modela tako v posameznem obdobju kot tudi v celotnem obdobju opazovanja. Poleg tega prav tako nakažeta na slabost uveljavljene mere razločevalne moči AUC, ki je lahko preveč odvisna od porazdelitve števila in stopnje dogodkov po obdobjih.

Language:	Slovenian
Keywords:	kreditno tveganje, analiza preživetja, model ogroženosti v diskretnem času, logistična regresija
Work type:	Master's thesis/paper
Organization:	FE - Faculty of Electrical Engineering
Year:	2022
PID:	20.500.12556/RUL-136982
COBISS.SI-ID:	119182339
Publication date in RUL:	27.05.2022
Views:	1993
Downloads:	263
Metadata:
:	Copy citation
Share:

Secondary language

Abstract:
Language:	English
Title:	Comparison of statistical models for estimating the risk factors of Slovenian companies
The master's thesis deals with the problem of estimating the risk factors of firms on the basis of accounting and firmographic data, with an emphasis on statistical methods used for their estimation. Credit risk has received a great deal of interest in recent decades, and methods for estimating it have shifted from subjective models of expert opinion to advanced quantitative methods. This is also due to the growing amount of information available and the ease of access to information and software. The initial models were often based on pooled data from long periods of time and worked on the principle of classifying companies into two distinct groups according to their characteristics at a single point in time. In recent years, they have been overshadowed by discrete-time hazard models, which have became state of the art in predicting risk. Discrete-time hazard models on panel data are estimated using logistic regression and they take into account the volatile nature of companies' financial structure and other characteristics. Furthermore, they are less subject to sample bias, which makes both estimates of risk and risk factors more accurate and descriptive for the study population. Although these models are often recommended and used, their performance is most often represented only by AUC, a measure of the discrimination power of classification models, and without looking into the new possibilities that the model allows. In order to study the differences between popular methods, a discrete-time hazard model without time varying-covariates and a logistic regression model were developed in addition to the main discrete-time hazard model with time-varying covariates. The findings of all models show that the profitability, liquidity and indebtedness of companies with operating difficulties differ from those without operating difficulties and that non-financial variables complement the information given by the financial variables. Using the last available data to estimate the logistic regression model leads to the best discrimination power, but due to sampling bias, the model significantly overestimates the actual probability of a firm having operating difficulties. Because the incidence of the events and the effects of risk factors differ on the period-by-period basis, pooling data from several years caused the logistic regression estimates of risk factors to deviate from the observed effects in some periods. From the comparison of discrete-time hazard models, it is obvious that in order for the model to be effective, it is necessary to take into account the most recent data of the firm's characteristics, as the information and relevance of the data diminishes with time. Although all three models are valid and relevant, the discrete-time hazard model with time-varying covariates uses more information and is better at describing the risk factors and at capturing firms with operating difficulties in all periods. The model can be further improved by including random effects, but in this thesis the effects of including them have not been investigated in detail. Another advantage of hazard models is that in addition to estimating the hazard and the risk factors, they also allow for the comparison of companies using their hazard profiles. The estimated hazards in individual periods can be used to adequately estimate the probability of "surviving" up to a certain period, which can be very useful in estimating the risk of loans with longer maturities. Last but not least, the estimated survival probabilities allow an alternative approach to estimating discrimination power with time-dependent discrimination indices AUC(t) and Ctd}. Since they provide insight into the discrimination power of the model both in each period and in the whole follow-up period, the time-dependent measures prove to be more informative of the model's ability to discriminate between firms based on operating difficulties. Apart from that they also point to the weakness of the established measure of discrimination power, which may depend too much on the distribution of the number and rate of events across time periods.
Keywords:	credit risk, survival analysis, discrete-time hazard model, logistic regression

Similar works from RUL:
Similar works from other Slovenian collections:

Secondary language

Similar documents