DELITEV PROSTORA PROBLEMSKE DOMENE PRI IDENTIFIKACIJI Z MEHKIMI MODELI

ČERNE, GREGOR

DELITEV PROSTORA PROBLEMSKE DOMENE PRI IDENTIFIKACIJI Z MEHKIMI MODELI
ID ČERNE, GREGOR (Author), ID Škrjanc, Igor (Mentor) More about this mentor... This link opens in a new window

PDF - Presentation file, Download (6,88 MB)
MD5: FDFB52BC83710780998F2888368D1317

Abstract

Pri načrtovanju vodenja kateregakoli procesa je kvaliteten matematični model ključni del za dobro delovanje zaprtozančnega sistema, predvsem pri uporabi regulatorjev, ki izhajajo iz modela procesa. Ker je načrtovanje modela iz fizikalnih zakonitosti v določenih primerih časovno zelo potratno, je že dolgo aktivno področje identifikacije, katerega cilj je izluščiti zakonitosti dinamičnega procesa neposredno iz podatkov. S tem lahko drastično skrajšamo čas načrtovanja modela in zahtevanega znanja za uporabo le-teh. Za identifikacijo nelinearnih dinamičnih in statičnih sistemov se uspešno uporabljajo mehki modeli Takagi-Sugeno, saj so kljub njihovi enostavni ter razumljivi strukturi sposobni modeliranja kompleksnih procesov. Ideja modelov Takagi-Sugeno je razdelitev problemske domene na manjša področja, na katerih lahko kompleksen proces opišemo z enostavnejšimi modeli, te modele pa nato mehko zlijemo skupaj. Tako pridobimo model, ki zadostno aproksimira proces preko celotnega območja delovanja. Navkljub eksponentnemu razvoju na področju globokih nevronskih mrež imajo modeli Takagi-Sugeno še vedno nekaj lastnosti, ki so ključne na področju vodenja in identifikacije: dokazljiva stabilnost, enostaven vpogled v delovanje sistema in možnost lokalne identifikacije. Predvsem zadnja omenjena prednost -- lokalna identifikacija -- je lastnost, zaradi katere bodo modeli Takagi-Sugeno po našem mnenju še dolgo ostali del raziskovalnega področja identifikacije. Ta omogoča mnogo višjo izkoriščenost vzorcev, saj z njo vsak vzorec vpliva na manjše število parametrov, kar prinese temu primerno hitrejšo konvergenco parametrov. Hitrejša konvergenca pa omogoča hitrejšo in natančnejšo adaptacijo modela na spremembe v sistemu. Dobra lokalna identifikacija posledično omogoči tudi bolj natančne samorazvijajoče se mehke modele. Še zadnja prednost lokalne identifikacije je sposobnost paralelnega izvajanja in s tem razširljivost, kar je kritičnega pomena za modeliranje kompleksnih sistemov. Eden glavnih problemov pri identifikaciji modelov Takagi-Sugeno je deljenje problemske domene oziroma kako razdeliti področje delovanja. Naloga načrtovalca je iskanje primernega razmerja med zmožnostjo modeliranja kompleksnih nelinearnosti (z več področji lahko opišemo širši spekter nelinearnosti) in natančnostjo parametrov lokalnih modelov (z manj področji dobimo področja z več vzorci, s čimer lahko zmanjšamo varianco ocenjenih parametrov). V disertaciji naprej predstavimo združeno pripadnostno funkcijo z upoštevanjem kvalitete modela, kjer v proces deljenja na inovativen način vključimo informacijo o kvaliteti lokalnih modelov. Razvita združena pripadnostna funkcija zahteva, da vzorec pripada določenemu lokalnemu modelu le, če je hkrati blizu v problemski domeni in blizu v izhodnem prostoru. To omogoči boljšo detekcijo linearnih področij. Rezultati z združeno pripadnostno funkcijo pa niso bili enolično boljši. Končni rezultat, da lahko model Takagi-Sugeno z enim lokalnim modelom boljlše opiše obliko porabe kot z večimi, ni intuitiven, saj smo pričakovali, da imamo zadosti vzorcev za vsaj dva lokalna modela. Po pregledu literature predpostavimo, da je to posledica znane lastnosti mehkih modelov: razhajanja med natančnostjo in transparentnostjo. V nadaljevanju disertacije je razhajanje med natančnostjo in transparentnostjo mehkih modelov podrobno analizirano. V disertaciji izpostavimo strukturo modela Takagi-Sugeno kot glavni razlog razhajanje med natančnostjo modela in njegovo transparentnostjo, kar tudi dokažemo. Iz dokaza izhaja, da je struktura glavni razlog, zakaj je deljenje prostora mehkih modelov pereče področje raziskovanja že pet desetletij in še do danes ni popolnoma rešeno. Ker se mehko modeliranje kljub obstoju razhajanja še vedno uspešno uporablja na mnogih področjih, smo v disertaciji izpostavili izjeme razhajanja, pri katerih so mehki modeli še vedno verodostojna izbira za modeliranje procesov. Kljub temu predvidevamo, da raziskovanje deljenja problemske domene modelov Takagi-Sugeno v klasičnem smislu ne bo drastično pripomoglo k izboljšani kvaliteti modelov, saj napaka zaradi omejitev strukture modela Tagaki-Sugeno prevlada nad napako zaradi slabega deljenja prostora. Kot rešitev omejitev modela Takagi-Sugeno predstavimo novo strukturo mehkih modelov, ki je osnovana na združevanju v parametrskem prostoru, poimenovano struktura mehkih parametrov. Cilji pri načrtovanju nove strukture so bili ohranitev glavnih principov mehkih modelov -- kot sta deljenje na manjša področja, na katerih lahko načrtamo enostavnejše modele -- in možnost uporabe metod lokalne identifikacije, vendar brez obstoja razhajanja med natančnostjo ter transparentnostjo. Glavna razlika med modeli Takagi-Sugeno in strukturo mehkih parametrov je v prostoru zlivanja lokalnih modelov, saj modeli Takagi-Sugeno zlivajo v izhodnem prostoru, medtem ko je zlivanje pri strukturi mehkih parametrov prestavljeno v parametrski prostor afinega modela. Struktura mehkih parametrov je definirana tako, da lokalne modele predstavimo z verjetnostno porazdelitvijo preko funkcijskega prostora, kjer je vpliv modela pri določenem vhodu posredno definiran preko verjetnostne porazdelitve. Ključni korak modela je preslikava porazdelitve v funkcijskem prostoru v parametrski prostor afinega modela s pomočjo linearizacije baznih funkcij funkcijskega prostora. Opis porazdelitve v parametrskem prostoru omogoči združevanje lokalnih modelov v enotnem in relativno nizko-dimenzionalnem prostoru, ki hkrati ne omeji načrtovalske svobode pri izbiri lokalnih modelov. Takšen pristop ima mnogo prednosti -- poljuben opis za vsak lokalni model posebej, lokalni vpliv lokalnih modelov in enostavna vključitev ekspertnega znanja o sistemu ter možnost lokalne identifikacije. Je pa ravno fleksibilnost strukture tudi njen največji problem, saj s poljubnimi baznimi funkcijami ter verjetnostnimi porazdelitvami ne moremo zagotoviti analitične rešitve izhoda. Zato reševanje problema v disertaciji nato predstavimo prvi model zgrajen na strukturi mehkih parametrov, imenovan model mehkih parametrov s pragovno linearno funkcijo. Glavna značilnost modela je enostavnost uporabe, saj mora načrtovalec nastaviti le parametre afinih modelov in centre, kjer veljajo, medtem ko so ostale lastnosti (npr. verjetnostne porazdelitve v funkcijskem prostoru) izpeljane na podlagi podanih informacij o ostalih lokalnih modelih. Model je primerjan z različnimi modeli Takagi-Sugeno pri eksperimentih, kjer so vnaprej znani transparentni parametri in centri lokalnih modelov. Model mehkih parametrov, poleg tega da se drži načela mehkih modelov in ohrani možnost lokalne identifikacije, močno izboljša rezultate vseh modelov TS pri enostavnih primerih. Pri kompleksnejših primerih pa opazimo negativen vpliv nekaterih predpostavk, za katere pa verjamemo, da jih bomo lahko v prihodnosti odpravili ali omilili. Med vnesenimi predpostavkami bi izpostavili opis področja modela le s centrom brez kovariančne matrike, s čimer omejimo oblike področij. Nova struktura potrebuje tudi nov način deljenja problemske domene. Cilja pri razvoju novega načina deljenja sta bila dva: ohranitev lokalnosti rojenja (možnost paralelizacije izvajanja in enostavna adaptacija rojenja na samorazvijajoče se sisteme) in natančnost lokalnih modelov. Končni rezultat razvoja je naslednji prispevek disertacije, rojenje od spodaj navzgor z vpeljavo kvalitete modela. Ohranitev lokalnosti dosežemo z deljenjem problemske domene od spodaj navzgor, medtem ko natančnost dosežemo s kriterijem združevanja, ki popolnoma temelji na minimizaciji matematičnega upanja kvadrata napake modela. Kriterij združevanja najde optimalno razmerje med napako zaradi variance identificiranih parametrov in napako zaradi strukturne pristranskosti afinih modelov. To smo dosegli preko uvedbe treh novosti: vpeljave \textit{eksterne strukturne pristranskosti}, na podlagi katere v izračun kvadrata modela vpeljemo lokalnost modelov med seboj in omogoči razvrščanje različnih združevanj med seboj, vpeljave \textit{področja uporabe in interne strukturne pristranskosti}, preko katere dosežemo boljšo oceno kvadrata napake modela predvsem pri lokalnih modelih z malo vzorci, ter \textit{novega načina upoštevanja vhodnega šuma}. Rezultati rojenja so sicer v primerjavi z obstoječimi metodami le primerljivi, vendar je v disertaciji to pripisano slabemu zlivanju modelov Takagi-Sugeno in modelov mehkih parametrov, saj zlivanje modela Takagi-Sugeno deluje nepričakovano zaradi razhajanja med natančnostjo ter transparentnostjo, medtem ko zlivanje modela mehkih parametrov še ne upošteva oblike pridobljenih področij, kar je predvsem pri dinamičnih sistemih kritičnega pomena. Ne glede na izbiro rojenja, model mehkih parametrov doseže mnogo boljše rezultate v primerjavi z modeli Takagi-Sugeno, pod pogojem, da so parametri identificirani z lokalno identifikacijo. Niti model mehkih parametrov niti rojenje z vpeljavo kvalitete modela nista specifično prilagojena za modeliranja časovno-spremenljivih procesov. Zato je v zadnjem delu disertacije predstavljeno inovativno deljenje prostora v vertikalni smeri, kjer enostavnejši model, pri katerem lahko hitreje adaptiramo parametre modela, krčimo s kompleksnejšimi modeli. Tako združimo sposobnost modeliranja kompleksnejših modelov in hitrejše učenje parametrov enostavnejših modelov. Zadnji prispevek disertacije je tako kaskadna struktura, kjer z modelom afine konstante popravimo rekurzivno ocenjevan afin model. Model afine konstante ocenjujemo na napakah afinega modela. Identifikacija afine konstante se izvaja lokalno v času, kar pomenit, da za oceno uporabimo le vzorce, ki so časovno blizu in tako najmanj pristranski. Kaskadni model smo testirali v kombinaciji s prediktivnim regulatorjem in primerjali z ostalimi rekurzivnimi metodami. Kaskadni model izboljša vodenje procesov pri nenadni spremembi dinamike procesa v primerjavi z obstoječimi načini modeliranja linearnih procesov.

Language:	Slovenian
Keywords:	Model mehkih parametrov, kaskadno ocenjevanje, rojenje od spodaj navzgor, stohastični modeli, lokalna optimizacija
Work type:	Doctoral dissertation
Organization:	FE - Faculty of Electrical Engineering
Year:	2023
PID:	20.500.12556/RUL-152631
COBISS.SI-ID:	184735235
Publication date in RUL:	01.12.2023
Views:	1129
Downloads:	83
Metadata:
:	Copy citation
Share:

Secondary language

Abstract:
Language:	English
Title:	PROBLEM DOMAIN PARTITIONING IN FUZZY MODEL IDENTIFICATION
In control design, the quality of the mathematical model of the process is the key to accurate closed-loop operation, especially when combined with model-based controllers. As the analytical procedure for deriving such a model proves to be very time-consuming, the blooming field of identification focuses on extracting dynamic process properties directly from the data. Identification can drastically reduce the time and required expertise in model development. In the identification of non-linear systems, whether static or dynamic, Takagi-Sugeno fuzzy models are successfully used, primarily due to their capabilities in modelling complex non-linear systems despite their simple and interpretable structure. The idea behind the Takagi-Sugeno models is to split the problem domain into smaller partitions, within which the otherwise complex process can be described with simpler models. These models are later fuzzily fused together into single model, which now describes the behaviour of the system across the entire problem domain. Despite the exponential development in the field of deep neural networks, Takagi-Sugeno models still offer some advantages that are critical to the field of control and identification. These advantages include demonstrable stability, simple insight into the system’s operation, and the possibility of local identification. In particular, the latter benefit -– local identification -– is a feature that we believe, in our opinion, ensure the enduring relevance of Takagi-Sugeno models in the field of identification research. Takagi-Sugeno models provides a much higher sample efficiency, as each sample affects a smaller number of parameters, resulting in a correspondingly faster convergence of the parameters. This faster convergence, in turn, allows the model to describe changes in the system more quickly and accurately, making them suitable for time-variant dynamical systems. Proficient local identification also enables the usage of evolving fuzzy models. The main challenge in the identification of Takagi-Sugeno models is the problem of domain partitioning. The main task of the model designer is to find a good balance between the ability to model complex systems (the model can describe more complex processes if more partitions are used) and the accuracy of local model parameters (the larger the partitions, the more samples can be used in parameter estimation, which decreases the parameter variance). In this thesis, the author presents a combined membership function in which information about the quality of local models is innovatively incorporated into the sharing process. The combined membership function alters the criteria for a sample to belong to a specific local model, as the sample needs to be close both in the problem domain and in the output space. This approach enhances the detection of linear areas. However, the results with the combined membership function were not uniquely better. The final result shows that a model with single local model is the best description of the shape of load consumption. The final result is not intuitive, since we expected that it has sufficient samples for at least two local models if not more. After reviewing the literature, the author concludes that this is a consequence of well-known property of Takagi-Sugeno models, namely the trade-off between accuracy and transparency. In the following part of the thesis, the trade-off between the accuracy and the transparency of the fuzzy models is analysed in detail. The author identifies and proves that the structure of the Takagi-Sugeno model itself is the primary source of the trade-off between model accuracy and transparency. It is concluded that the structure of Takagi-Sugeno model remains the foremost reason why the partitioning of the problem domain in fuzzy models has persisted as a significant research challenge for the past five decades, remaining unresolved to this day. Since fuzzy modelling is still successfully used in many fields despite the existence of the trade-off, this thesis pointed out the exceptions where fuzzy models are still a accurate choice for identification. Nevertheless, the author assumes that the research of the problem domain partitioning of Takagi-Sugeno models in the classical sense will not drastically contribute to the improved quality of these models, as the error caused by the limitations of the structure of the Tagaki-Sugeno model outweighs the error caused by poor partitioning of the problem domain. As a solution to the limitation of Takagi-Sugeno model, the author presents a new structure of fuzzy models based on the fusion in parameter space, called the fuzzy parameter structure. The objectives in designing this new structure were to preserve the main principles of fuzzy models -- partitioning into smaller domains in which simpler models can be designed and the possibility of using local identification methods -- but without the trade-off between accuracy and transparency. The main distinction between the Takagi-Sugeno models and the fuzzy parameter structure lies in the space of the local model fusion. The Takagi-Sugeno models fuse local models in the output space, while the fusion in the fuzzy parameter structure is relocated to the parameter space of the affine model. The fuzzy parameter structure is defined by representing the local models by a probability distribution over the function space, where the influence of the model at a given input is implicitly defined by means of the probability distribution. The key step of the fuzzy parameter structure is the mapping of the function space into a smaller parametric space of an affine model by linearising the basis functions of the function space. This allows local models to be combined in a single and relatively low-dimensional space, while at the same time not restricting the designer’s freedom in the choice of local models. This approach has many advantages: an arbitrary description for each local model separately, the local influence of local models and easy incorporation of expert knowledge about the system (for example, incorporation of uncertainties due to noise or bias present), and the possibility of local identification, as each parameter has a limited area of influence on the model output. However, the flexibility of the structure is also its biggest problem, as arbitrary basis functions and probability distributions cannot guarantee an analytical solution to the output. Therefore the first model built on a fuzzy parameter structure is introduced, which is called the fuzzy parameter model with a rectified linear unit. The main feature of the model is its simplicity, as the designer only needs to configure the parameters of the linear models and specify their valid centres. Other properties, such as probability distributions in the function space, are derived from the given parameters of the other local models. The fuzzy parameter model with the rectified linear unit is compared with different Takagi-Sugeno models in experiments, where the transparent parameters and the centres of the local models are known in advance. The fuzzy parameter model, while adhering to the principle of fuzzy models and preserving the possibility of local identification, significantly enhances the performance of all Takagi-Sugeno models in simple cases. However, in more complex scenarios, we observe a negative impact originating in some of the assumptions that can be relaxed with the future research. We would like to emphasize that one of these assumptions pertains to the description of the model domain using only the center without incorporating a covariance matrix, thus constraining the shapes of the local domain. The new structure also needs a new way of partitioning the problem domain in order to find the most linear parts of the local domain. The two goals in developing the new partitioning method were to preserve the locality of the partitioning (which allows parallelizable execution and easy adaptation of the partitioning to evolving systems) and the accuracy of the local affine models. The final result is the next contribution of the thesis: bottom-up clustering with the introduction of model quality. The preservation of locality is achieved by the bottom-up partitioning of the problem domain, while accuracy is achieved by a clustering criterion based entirely on minimising the expected value of the square of the model error. The clustering criterion finds the optimal ratio between the error due to the variance of the identified parameters, and the error due to the structural bias of the affine models. This is achieved through the introduction of three novelties. The first novelty is the introduction of external structural bias, based on which the model locality is incorporated into the computation of the expected squared error. This allows better ranking between different cluster merges. The second novelty is the introduction of the desired usage area and internal structural bias, through which a better estimation of the squared error of the model is achieved, especially for local models with few samples. The last novelty is a new way of accounting for the input noise. While the results of the developed method are only comparable to existing methods, worse than anticipated results are attributed to the poor fusion capabilities of Takagi-Sugeno and fuzzy parameter models. The Takagi-Sugeno fusion performs unexpectedly due to the trade-off between accuracy and transparency, while the fusion of the fuzzy parameter model does not take into account the shape of the extracted domains yet, which is critical especially for dynamical systems in this thesis. Regardless of the choice of clustering, the fuzzy parameter model achieves much better results compared to Takagi-Sugeno models, provided that the parameters are identified by local identification. Neither the fuzzy parameter model nor the partitioning with the introduction of model quality is specifically adapted for modelling time-varying processes. Therefore, in the last part of the thesis, an innovative partitioning of the space in the vertical direction is presented, where a simpler model, whose model parameters can be adapted more quickly, is shrunk by more complex models. This combines the ability of more complex models to model non-linear processes and faster parameter convergence of simpler models. The identification of the affine constant is performed based on the errors of the affine model on samples. In the identification of the affine model, the locality is in the time space, so only samples that are close to the current moment in time are used for estimation. The algorithm, how many samples are needed for estimation with the least squared error is also proposed. The cascade model is evaluated in combination with a predictive controller and compared to other recursive models. Using cascade model improve process control in the presence of abrupt changes in process dynamics compared to existing modelling approaches for linear processes.
Keywords:	Parameter-fuzzy models, cascade identification, fuzzy modelling, bottom-up clustering, local optimization

Similar works from RUL:
Similar works from other Slovenian collections:

Secondary language

Similar documents