Jezik slovenskih tvitov : korpusna raziskava

Erjavec, Tomaž; Fišer, Darja

Source URL:	https://centerslo.si/simpozij-obdobja/zborniki/obdobja-32/

Abstract

This paper presents a corpus and an analysis of non-standard Slovene from the social network Twitter. The corpus, which includes tweets from the first four years of the network's existence, contains 360,000 tweets, or five million words. The Slovene used in them is very rich and differs considerably from the Slovene of the balanced Slovene corpus ccKRES, chiefly in its colloquial, more phonetic orthography, its use of elements more typical of spoken language, and its frequent use of foreign-language words.

Language:	Slovenian
Keywords:	tweets, non-standard Slovene, corpus analysis, orthography, vocabulary
Work type:	Article
Typology:	1.16 - Independent Scientific Component Part or a Chapter in a Monograph
Organization:	FF - Faculty of Arts
Year:	2013
Pages:	109-116
PID:	20.500.12556/RUL-147644
UDC:	004.773:811.163.6'276'354
COBISS.SI-ID:	53226594
Publication date in RUL:	10.07.2023

Record is a part of a monograph

Title:	Družbena funkcijskost jezika : (vidiki, merila, opredelitve)
Editors:	Andreja Žele
Place of publishing:	Ljubljana
Publisher:	Znanstvena založba Filozofske fakultete
Year:	2013
ISBN:	978-961-237-609-3
COBISS.SI-ID:	269357568
Collection title:	Obdobja
Collection numbering:	32
Collection ISSN:	1408-211X

Secondary language

Abstract:
Language:	English
This paper presents a corpus of Slovene tweets and an analysis of non-standard Slovene as used on the Twitter social network. The corpus, which comprises tweets from the first four years of Twitter's existence, contains 360,000 tweets or 5 million tokens. The Slovene used in the analysed tweets is substantially different from the balanced corpus of standard Slovene ccKRES. The distinguishing features of "Twitter Slovene" are a more colloquial, phonetic orthography, frequent use of spoken language elements and an abundance of foreign words.
Keywords:	Tweets, non-standard Slovene, corpus analysis, orthography, vocabulary
