Details

Measuring catastrophic forgetting in cross-lingual classification : transfer paradigms and tuning strategies
ID Koloski, Boshko (Author), ID Škrlj, Blaž (Author), ID Robnik Šikonja, Marko (Author), ID Pollak, Senja (Author)

.pdfPDF - Presentation file, Download (2,13 MB)
MD5: 45A7548CE216F471D14CF068A4431CA2
URLURL - Source URL, Visit https://ieeexplore.ieee.org/document/10892119 This link opens in a new window

Abstract
Cross-lingual transfer leverages knowledge from a resource-rich source language, commonly English, to enhance performance in less-resourced target languages. Two widely used strategies are: Cross-Lingual Validation (CLV), which involves training on the source language and validating on the target language, and Intermediate Training (IT), where models are first fine-tuned on the source language and then further trained on the target language. While both strategies have been studied, their effects on encoder-based models for classification tasks remain underexplored. In this paper, we systematically compare these strategies across six multilingual classification tasks, evaluating downstream performance, catastrophic forgetting, and both zero-shot and full-shot scenarios. Additionally, we contrast parameter-efficient adapter methods with full-parameter fine-tuning. Our results show that IT generally performs better in the target language, whereas CLV more effectively preserves source-language knowledge across multiple cross-lingual transfers. These findings underscore the trade-offs between optimizing target performance and mitigating catastrophic forgetting.

Language:English
Keywords:cross-lingual transfer, cross-lingual learning, catastrophic-forgetting, document classification
Work type:Article
Typology:1.01 - Original Scientific Article
Organization:FRI - Faculty of Computer and Information Science
Publication status:Published
Publication version:Version of Record
Year:2025
Number of pages:Str. 33509-33520
Numbering:Vol. 13
PID:20.500.12556/RUL-178163 This link opens in a new window
UDC:004.8
ISSN on article:2169-3536
DOI:10.1109/ACCESS.2025.3543608 This link opens in a new window
COBISS.SI-ID:227115523 This link opens in a new window
Publication date in RUL:20.01.2026
Views:123
Downloads:41
Metadata:XML DC-XML DC-RDF
:
Copy citation
Share:Bookmark and Share

Record is a part of a journal

Title:IEEE access
Publisher:Institute of Electrical and Electronics Engineers
ISSN:2169-3536
COBISS.SI-ID:519839513 This link opens in a new window

Licences

License:CC BY 4.0, Creative Commons Attribution 4.0 International
Link:http://creativecommons.org/licenses/by/4.0/
Description:This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.

Secondary language

Language:Slovenian
Keywords:medjezikovno učenje, medjezikovni prenos, globoko učenje, strojno učenje

Projects

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:P2-0103
Name:Tehnologije znanja

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:P6-0411
Name:Jezikovni viri in tehnologije za slovenski jezik

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:J6-2581
Name:Računalniško podprta večjezična analiza novičarskega diskurza s kontekstualnimi besednimi vložitvami

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:J5-3102
Name:Sovražni govor v sodobnih konceptualizacijah nacionalizma, rasizma, spola in migracij

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:L2-50070
Name:Tehnike vektorskih vložitev za medijske aplikacije

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:GC-0002
Name:Veliki jezikovni modeli za digitalno humanistiko

Funder:ARIS - Slovenian Research and Innovation Agency
Project number:J6-60109
Name:Čezjezikovna analiza za zaznavanje kognitivnih motenj v jezikih z manj viri

Funder:ARIS - Slovenian Research and Innovation Agency
Funding programme:Young researchers
Project number:PR-12394

Funder:EC - European Commission
Funding programme:HE
Project number:101186647
Name:Centre of Excellence in Artificial Intelligence for Digital Humanities
Acronym:AI4DH

Funder:EC - European Commission
Project number:C3.K8.IB
Acronym:PoVeJMo

Similar documents

Similar works from RUL:
Similar works from other Slovenian collections:

Back