Details

Avtomatsko povzemanje slovenskih besedil z globokimi nevronskimi mrežami
ID Zidarn, Rok (Author), ID Robnik Šikonja, Marko (Mentor) More about this mentor... This link opens in a new window

.pdfPDF - Presentation file, Download (861,84 KB)
MD5: 7AD43336FC194562E0B32098F0D4E688

Abstract
Povzemanje besedil naslavlja problem naraščujoče količine tekstovnih podatkov, v katerih želimo odkrivati pomembne informacije, npr. med raziskovanjem dokumentov želimo proces izbire poenostaviti in se odločati le na podlagi povzetkov. V delu smo se posvetili problemu povzemanja slovenskih besedil. Naš cilj je generiranje kvalitetnega in berljivega povzetka. Problema smo se lotili z uporabo globokih nevronskih mrež in arhitekture zaporedje v zaporedje. Razvili smo devet modelov, ki se razlikujejo po tipu rekurenčnih celic, številu rekurenčnih celic, številu nivojev in dodatnih mehanizmih, kot sta mehanizem pozornosti in mehanizem kopiranja. Uspešnost povzemanja smo evalvirali z metrikama ROUGE in BERTScore. Med trenutno obstoječimi slovenskimi povzemalniki naš najuspešnejši model dosega najboljše rezultate.

Language:Slovenian
Keywords:obdelava naravnega jezika, povzemanje besedil, nevronske mreže, globoko učenje, umetna inteligenca
Work type:Master's thesis/paper
Organization:FRI - Faculty of Computer and Information Science
Year:2020
PID:20.500.12556/RUL-113698 This link opens in a new window
COBISS.SI-ID:1538516419 This link opens in a new window
Publication date in RUL:27.01.2020
Views:1977
Downloads:340
Metadata:XML DC-XML DC-RDF
:
ZIDARN, Rok, 2020, Avtomatsko povzemanje slovenskih besedil z globokimi nevronskimi mrežami [online]. Master’s thesis. [Accessed 28 March 2025]. Retrieved from: https://repozitorij.uni-lj.si/IzpisGradiva.php?lang=eng&id=113698
Copy citation
Share:Bookmark and Share

Secondary language

Language:English
Title:Automatic text summarization of Slovene texts using deep neural networks
Abstract:
Text summarization allows us to extract useful information from a vast amount of textual documents. For example, during research we want to simplify the paper selection process by reading only abstracts instead of whole articles. In this thesis we focus on the problem of summarization of Slovene texts. Our goal is to generate an accurate and readable summary. We tackle the problem by applying a Sequence2Sequence architecture and deep neural networks. We developed nine models, which differ from one another by the type of recurrent cells, number of recurrent cells, number of levels and additional mechanisms, such as attention and copying. For evaluation we used ROUGE and BERTScore evaluation metrics. Our most succesful model produces the best results among Slovene text summarizers.

Keywords:natural language processing, text summarization, neural networks, deep learning, artificial intelligence

Similar documents

Similar works from RUL:
  1. Fizikalno-metalurški vidiki vpliva elementov na tvorbo kvazikristalov v zlitinah Al-Mn
  2. Karakterizacija zlitine AlSi7Mg0,3
  3. Controlled formation of quasicrystals in Al-Mn alloys
  4. Application of massive laser shock processing for improvement of mechanical and tribological properties
  5. Friction stir welding of aluminium alloy 2030
Similar works from other Slovenian collections:
  1. The influence of cooling rate on the microstructure of an Al-Mn-Be alloy
  2. Microstructure of Al-Mn-Be melt-spun ribbons
  3. The influence of the hot-rolling technique for EN AW-8021B aluminium alloy on the microstructural properties of a cold-rolled foil
  4. Design of construction for ambulance vehicle
  5. Simulation of low frequency electromagnetic DC casting

Back