Your browser does not allow JavaScript!
JavaScript is necessary for the proper functioning of this website. Please enable JavaScript or use a modern browser.
Open Science Slovenia
Open Science
DiKUL
slv
|
eng
Search
Browse
New in RUL
About RUL
In numbers
Help
Sign in
Automatic segmentation of ethnomusicological field recordings
ID
Marolt, Matija
(
Author
),
ID
Bohak, Ciril
(
Author
),
ID
Kavčič, Alenka
(
Author
),
ID
Pesek, Matevž
(
Author
)
PDF - Presentation file,
Download
(1,12 MB)
MD5: 9C458853EC4E48692D39EFA1248E71FD
URL - Source URL, Visit
https://www.mdpi.com/2076-3417/9/3/439
Image galllery
Abstract
The article presents a method for segmentation of ethnomusicological field recordings. Field recordings are integral documents of folk music performances captured in the field, and typically contain performances, intertwined with interviews and commentaries. As these are live recordings, captured in non-ideal conditions, they usually contain significant background noise. We present a segmentation method that segments field recordings into individual units labelled as speech, solo singing, choir singing, and instrumentals. Classification is based on convolutional deep networks, and is augmented with a probabilistic approach for segmentation. We describe the dataset gathered for the task and the tools developed for gathering the reference annotations. We outline a deep network architecture based on residual modules for labelling short audio segments and compare it to the more standard feature based approaches, where an improvement in classification accuracy of over 10% was obtained. We also present the SeFiRe segmentation tool that incorporates the presented segmentation method.
Language:
English
Keywords:
audio segmentation
,
field recordings
,
deep learning
,
music information retrieval
Work type:
Article
Typology:
1.01 - Original Scientific Article
Organization:
FRI - Faculty of Computer and Information Science
Publication status:
Published
Publication version:
Version of Record
Year:
2019
Number of pages:
12 str.
Numbering:
Vol. 9, iss. 3, art. 439
PID:
20.500.12556/RUL-131958
UDC:
004:78
ISSN on article:
2076-3417
DOI:
10.3390/app9030439
COBISS.SI-ID:
1538109123
Publication date in RUL:
07.10.2021
Views:
1184
Downloads:
163
Metadata:
Cite this work
Plain text
BibTeX
EndNote XML
EndNote/Refer
RIS
ABNT
ACM Ref
AMA
APA
Chicago 17th Author-Date
Harvard
IEEE
ISO 690
MLA
Vancouver
:
Copy citation
Share:
Record is a part of a journal
Title:
Applied sciences
Shortened title:
Appl. sci.
Publisher:
MDPI
ISSN:
2076-3417
COBISS.SI-ID:
522979353
Licences
License:
CC BY 4.0, Creative Commons Attribution 4.0 International
Link:
http://creativecommons.org/licenses/by/4.0/
Description:
This is the standard Creative Commons license that gives others maximum freedom to do what they want with the work as long as they credit the author.
Licensing start date:
01.02.2019
Secondary language
Language:
Slovenian
Keywords:
segmentacija zvočnih posnetkov
,
terenski posnetki
,
globoko učenje
,
pridobivanje informacij iz glasbe
Projects
Funder:
ARRS - Slovenian Research Agency
Project number:
J7-9426
Name:
Misliti folkloro: folkloristične, etnološke in računske perspektive in pristopi k narečju
Similar documents
Similar works from RUL:
Similar works from other Slovenian collections:
Back