Implementation of a parallel algorithm for k-mer enrichment analysis of genomic sequences (original title: Implementacija paralelnega algoritma za analizo obogatenosti k-terk v genomskih zaporedjih)
Pajnič, Matic (Author), Curk, Tomaž (Mentor), Lotrič, Uroš (Co-mentor)

PDF - Presentation file (1.36 MB)
MD5: 40F0933BE4699B9B5DD35B78192B7AD4
PID: 20.500.12556/rul/f92306d6-7af2-44a2-8fe1-34d6ed8684b2

Abstract
The goal of this thesis was to implement a sequential algorithm for finding short subsequences in genomes, to speed it up through parallelization, and to implement the parallel version on a graphics card. The sequential algorithm had to find short character sequences within a given sequence that represents a genome. For each short sequence separately, it had to compute the frequency of its occurrences and the frequency of interactions at the given positions and at randomly shifted positions. Based on these occurrences, it then had to determine which short sequences are more characteristic of particular positions in the genome. Using data on interactions between proteins and the genome at particular positions, together with the short sequences found, it then had to compute and statistically evaluate the frequency of interactions. We implemented the sequential algorithm in the C programming language and parallelized it using OpenCL, which allows algorithms to be implemented on graphics cards.
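The counting step described in the abstract can be illustrated with a minimal sequential sketch in C. It is not taken from the thesis: the function names, the 2-bit base encoding, the fixed window (`flank`) around each position, and the pseudocount ratio used as an enrichment score are all assumptions made for illustration, and the statistical evaluation mentioned in the abstract is not covered.

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

/* Map a nucleotide to a 2-bit code; -1 for any other character (e.g. N). */
int base_code(char c) {
    switch (c) {
        case 'A': case 'a': return 0;
        case 'C': case 'c': return 1;
        case 'G': case 'g': return 2;
        case 'T': case 't': case 'U': case 'u': return 3;
        default: return -1;
    }
}

/* Encode the k-mer starting at seq[pos] as an index in [0, 4^k).
 * Returns -1 if the k-mer runs past the end or contains an unknown base. */
long kmer_index(const char *seq, size_t len, size_t pos, int k) {
    if (pos + (size_t)k > len) return -1;
    long idx = 0;
    for (int i = 0; i < k; i++) {
        int code = base_code(seq[pos + i]);
        if (code < 0) return -1;
        idx = idx * 4 + code;
    }
    return idx;
}

/* Count every k-mer that starts within [site - flank, site + flank]
 * for each site. counts must hold 4^k entries, zeroed by the caller.
 * The window definition is an assumption, not the thesis's. */
void count_kmers_near(const char *seq, size_t len, const long *sites,
                      size_t n_sites, int k, long flank,
                      unsigned long *counts) {
    for (size_t s = 0; s < n_sites; s++) {
        for (long p = sites[s] - flank; p <= sites[s] + flank; p++) {
            if (p < 0) continue;
            long idx = kmer_index(seq, len, (size_t)p, k);
            if (idx >= 0) counts[idx]++;
        }
    }
}

/* Build control positions: shift each site by a random offset in
 * [-max_shift, max_shift], clamped to the sequence bounds. */
void shift_sites(const long *sites, size_t n, long max_shift, long len,
                 long *out) {
    assert(max_shift >= 0 && len > 0);
    for (size_t i = 0; i < n; i++) {
        long p = sites[i] + rand() % (2 * max_shift + 1) - max_shift;
        if (p < 0) p = 0;
        if (p >= len) p = len - 1;
        out[i] = p;
    }
}

/* Hypothetical enrichment score: observed vs. control count, with a
 * pseudocount of 1 so that a zero control count does not divide by zero. */
double enrichment(unsigned long observed, unsigned long control) {
    return ((double)observed + 1.0) / ((double)control + 1.0);
}
```

A caller would count once with the given positions and once (or repeatedly) with the output of `shift_sites`, then compare the two count arrays per k-mer with `enrichment`. Each site is processed independently of the others, which is the kind of structure a GPU version could exploit, for example with one work-item per site and atomic increments on the shared count array; whether the thesis does it this way is not stated in the abstract.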

Language: Slovenian
Keywords: parallel computing, GPU, synchronization, threads, bioinformatics, proteins, DNA, RNA, genes
Work type: Bachelor thesis/paper
Organization: FRI - Faculty of Computer and Information Science
Year: 2015
PID: 20.500.12556/RUL-72478
Publication date in RUL: 21.09.2015
Views: 1208
Downloads: 396

Secondary language

Language: English
Title: Implementation of parallel algorithm for k-mer enrichment analysis of genomic sequences
Abstract:
The goal of this thesis was to implement a sequential algorithm that searches for short subsequences (k-mers) in a genome, to speed it up through parallelization, and to implement the parallel version on a graphics card. The sequential algorithm had to search for predefined subsequences in a genome represented as a sequence of characters. For each subsequence, it had to calculate the frequency of its occurrences and the frequency of interactions at predefined positions and at randomly shifted positions in the genome. Based on these frequencies, it had to identify the subsequences that occur more often at certain locations in the given genome. Using data on protein-RNA interactions at certain locations in the genome, together with the subsequences found, the algorithm had to calculate and statistically evaluate the frequencies of interactions. The sequential algorithm was implemented in the C programming language, and the parallel version was implemented with OpenCL.

Keywords: parallel computing, GPU, synchronization, threads, bioinformatics, proteins, DNA, RNA, genes
