Your browser does not allow JavaScript!
JavaScript is necessary for the proper functioning of this website. Please enable JavaScript or use a modern browser.
Repository of the University of Ljubljana
Open Science Slovenia
Open Science
DiKUL
slv
|
eng
Search
Advanced
New in RUL
About RUL
In numbers
Help
Sign in
Details
Measuring catastrophic forgetting in cross-lingual classification : transfer paradigms and tuning strategies
ID
Koloski, Boshko
(
Author
),
ID
Škrlj, Blaž
(
Author
),
ID
Robnik Šikonja, Marko
(
Author
),
ID
Pollak, Senja
(
Author
)
PDF - Presentation file,
Download
(2,13 MB)
MD5: 45A7548CE216F471D14CF068A4431CA2
URL - Source URL, Visit
https://ieeexplore.ieee.org/document/10892119
Image galllery
Abstract
Cross-lingual transfer leverages knowledge from a resource-rich source language, commonly English, to enhance performance in less-resourced target languages. Two widely used strategies are: Cross-Lingual Validation (CLV), which involves training on the source language and validating on the target language, and Intermediate Training (IT), where models are first fine-tuned on the source language and then further trained on the target language. While both strategies have been studied, their effects on encoder-based models for classification tasks remain underexplored. In this paper, we systematically compare these strategies across six multilingual classification tasks, evaluating downstream performance, catastrophic forgetting, and both zero-shot and full-shot scenarios. Additionally, we contrast parameter-efficient adapter methods with full-parameter fine-tuning. Our results show that IT generally performs better in the target language, whereas CLV more effectively preserves source-language knowledge across multiple cross-lingual transfers. These findings underscore the trade-offs between optimizing target performance and mitigating catastrophic forgetting.
Language:
English
Keywords:
cross-lingual transfer
,
cross-lingual learning
,
catastrophic-forgetting
,
document classification
Work type:
Article
Typology:
1.01 - Original Scientific Article
Organization:
FRI - Faculty of Computer and Information Science
Publication status:
Published
Publication version:
Version of Record
Year:
2025
Number of pages:
Str. 33509-33520
Numbering:
Vol. 13
PID:
20.500.12556/RUL-178163
UDC:
004.8
ISSN on article:
2169-3536
DOI:
10.1109/ACCESS.2025.3543608
COBISS.SI-ID:
227115523
Publication date in RUL:
20.01.2026
Views:
123
Downloads:
41
Metadata:
Cite this work
Plain text
BibTeX
EndNote XML
EndNote/Refer
RIS
ABNT
ACM Ref
AMA
APA
Chicago 17th Author-Date
Harvard
IEEE
ISO 690
MLA
Vancouver
:
Copy citation
Share:
Record is a part of a journal
Title:
IEEE access
Publisher:
Institute of Electrical and Electronics Engineers
ISSN:
2169-3536
COBISS.SI-ID:
519839513
Licences
License:
CC BY 4.0, Creative Commons Attribution 4.0 International
Link:
http://creativecommons.org/licenses/by/4.0/
Description:
This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.
Secondary language
Language:
Slovenian
Keywords:
medjezikovno učenje
,
medjezikovni prenos
,
globoko učenje
,
strojno učenje
Projects
Funder:
ARIS - Slovenian Research and Innovation Agency
Project number:
P2-0103
Name:
Tehnologije znanja
Funder:
ARIS - Slovenian Research and Innovation Agency
Project number:
P6-0411
Name:
Jezikovni viri in tehnologije za slovenski jezik
Funder:
ARIS - Slovenian Research and Innovation Agency
Project number:
J6-2581
Name:
Računalniško podprta večjezična analiza novičarskega diskurza s kontekstualnimi besednimi vložitvami
Funder:
ARIS - Slovenian Research and Innovation Agency
Project number:
J5-3102
Name:
Sovražni govor v sodobnih konceptualizacijah nacionalizma, rasizma, spola in migracij
Funder:
ARIS - Slovenian Research and Innovation Agency
Project number:
L2-50070
Name:
Tehnike vektorskih vložitev za medijske aplikacije
Funder:
ARIS - Slovenian Research and Innovation Agency
Project number:
GC-0002
Name:
Veliki jezikovni modeli za digitalno humanistiko
Funder:
ARIS - Slovenian Research and Innovation Agency
Project number:
J6-60109
Name:
Čezjezikovna analiza za zaznavanje kognitivnih motenj v jezikih z manj viri
Funder:
ARIS - Slovenian Research and Innovation Agency
Funding programme:
Young researchers
Project number:
PR-12394
Funder:
EC - European Commission
Funding programme:
HE
Project number:
101186647
Name:
Centre of Excellence in Artificial Intelligence for Digital Humanities
Acronym:
AI4DH
Funder:
EC - European Commission
Project number:
C3.K8.IB
Acronym:
PoVeJMo
Similar documents
Similar works from RUL:
Similar works from other Slovenian collections:
Back