
Lexicon construction and corpus annotation of historical language with CoBaLT editor
Kentner, Tom (Author), Erjavec, Tomaž (Author), Žorga Dulmin, Maja (Author), Fišer, Darja (Author)

URL - Presentation file, available at http://aclweb.org/anthology-new/W/W12/W12-1001.pdf

Abstract
The paper presents an editor called CoBaLT (Corpus-Based Lexicon Tool), developed to construct corpus-based computational lexica and to correct word-level annotations and transcription errors in corpora. The paper describes the tool as well as our experience in using it to annotate a reference corpus and compile a large lexicon of historical Slovene. The annotations used in our project are the modern-day word form equivalent, lemma, part-of-speech tag and an optional gloss. The CoBaLT interface is word-form oriented and compact. It supports wildcard word searching and sorting by several criteria, which makes the editing process flexible and efficient. The tool accepts pre-annotated corpora in TEI P5 format and can export both the corpus and the lexicon in TEI P5 as well. The tool is implemented using the LAMP architecture and is freely available for research purposes.

Language: English
Keywords: corpus annotation, historical corpora, historical language
Type of material: Uncategorized work (r6)
Typology: 1.08 - Published Scientific Conference Contribution
Organization: FF - Faculty of Arts (Filozofska fakulteta)
Year of publication: 2012
UDC: 004.8:81'322
COBISS.SI-ID: 51011682