Implementacija paralelnega algoritma za analizo obogatenosti k-terk v genomskih zaporedjih : diplomsko delo

Pajnič, Matic

Pajnič, Matic (Author); Curk, Tomaž (Mentor); Lotrič, Uroš (Co-mentor)

PDF - Presentation file (1.36 MB)
MD5: 40F0933BE4699B9B5DD35B78192B7AD4
PID: 20.500.12556/rul/f92306d6-7af2-44a2-8fe1-34d6ed8684b2

Abstract

The goal of this thesis was to implement a sequential algorithm for finding short sequences in genomes, to speed it up through parallelization, and to implement the parallel version on a graphics card. The sequential algorithm had to find short character sequences within a given sequence representing a genome. For each short sequence separately, it had to compute the frequencies of sequence occurrences and the frequency of interactions at the given positions and at randomly shifted positions. Based on these occurrences, it then had to determine which short sequences are more characteristic of particular positions in the genome. Based on data about interactions between proteins and the genome at particular positions, and on the short character sequences found, it then had to compute and statistically evaluate the frequency of interactions. We implemented the sequential algorithm in the C programming language and parallelized it using the OpenCL architecture, which enables algorithms to be implemented on graphics cards.

Language:	Slovenian
Keywords:	parallel computing, GPU, synchronization, threads, bioinformatics, proteins, DNA, RNA, genes
Work type:	Bachelor thesis/paper
Typology:	2.11 - Undergraduate Thesis
Organization:	FRI - Faculty of Computer and Information Science
Publisher:	[M. Pajnič]
Year:	2015
Number of pages:	55
PID:	20.500.12556/RUL-72478
COBISS.SI-ID:	1536599491
Publication date in RUL:	21.09.2015
Views:	1462
Downloads:	418

Secondary language

Abstract:
Language:	English
Title:	Implementation of parallel algorithm for k-mer enrichment analysis of genomic sequences
The goal of this thesis was to implement a sequential algorithm that searches for subsequences in a genome, to reduce its execution time by designing a parallel version, and to implement that parallel version on a graphics card. The sequential algorithm had to search for predefined subsequences in a genome represented as a sequence of characters. For each subsequence, it had to calculate the frequencies of sequence occurrences and the frequencies of interactions at predefined positions and at randomly shifted positions in the genome. Based on these frequencies, it had to identify sequences that were more frequent at certain locations in the genome. Based on data about protein-RNA interactions at certain locations in the genome, and on the character sequences found, the algorithm had to calculate and statistically evaluate the frequencies of interactions. The sequential algorithm was implemented in the C programming language, and the parallelization was implemented using the OpenCL architecture.
Keywords:	parallel computing, GPU, synchronization, threads, bioinformatics, proteins, DNA, RNA, genes
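
The counting step described in the abstract can be illustrated with a minimal C sketch. It is not the thesis code: the function names (`kmer_index`, `count_kmers_at_sites`, `enrichment`), the 2-bit base encoding, the `flank` window around each position, the `MAX_K` limit, and the pseudocount ratio are all assumptions made for illustration. The caller supplies one shift per position (for example, drawn at random) to obtain the background counts; the statistical evaluation used in the thesis is not specified here, so the ratio is only a placeholder.

```c
#include <assert.h>
#include <stddef.h>

#define MAX_K 12 /* assumed upper bound on k; 4^12 counters fit in memory */

/* Map a nucleotide to a 2-bit code; -1 for any other character (e.g. N). */
static int base_code(char c) {
    switch (c) {
        case 'A': case 'a': return 0;
        case 'C': case 'c': return 1;
        case 'G': case 'g': return 2;
        case 'T': case 't': case 'U': case 'u': return 3;
        default: return -1;
    }
}

/* Encode the k-mer starting at seq[pos] as an integer in [0, 4^k).
 * Returns -1 if the k-mer runs past the end or contains a non-ACGT/U base. */
static long kmer_index(const char *seq, size_t len, size_t pos, int k) {
    assert(k > 0 && k <= MAX_K);
    if (pos + (size_t)k > len) return -1;
    long idx = 0;
    for (int i = 0; i < k; i++) {
        int b = base_code(seq[pos + i]);
        if (b < 0) return -1;
        idx = idx * 4 + b;
    }
    return idx;
}

/* Count every k-mer that starts within `flank` bases of each site.
 * `shifts` holds one offset per site (hypothetical: drawn at random by the
 * caller for the background run); pass NULL to count at the sites themselves.
 * `counts` must hold 4^k entries, zero-initialised by the caller. */
static void count_kmers_at_sites(const char *seq, size_t len,
                                 const long *sites, const long *shifts,
                                 size_t n_sites, long flank, int k,
                                 unsigned *counts) {
    for (size_t s = 0; s < n_sites; s++) {
        long center = sites[s] + (shifts ? shifts[s] : 0);
        for (long p = center - flank; p <= center + flank; p++) {
            if (p < 0) continue; /* window extends before the sequence start */
            long idx = kmer_index(seq, len, (size_t)p, k);
            if (idx >= 0) counts[idx]++;
        }
    }
}

/* Placeholder enrichment score: ratio of counts with a pseudocount of 1
 * (an assumption, not the statistic used in the thesis). */
static double enrichment(unsigned observed, unsigned background) {
    return (observed + 1.0) / (background + 1.0);
}
```

In this sketch, one call with `shifts` set to NULL gives the observed counts and a second call with per-site offsets gives the background counts; `enrichment` is then applied per k-mer index. Because each site is processed independently, the outer loop is the natural candidate for parallelization, although concurrent increments of `counts` would then need atomic operations or per-thread tables.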
