CLUSTERING ALGORITHM FOR MASS SPECTROMETRY DATA USING GENERAL-PURPOSE COMPUTING ON GRAPHICS PROCESSING UNITS

CLUSTERING ALGORITHM FOR MASS SPECTROMETRY DATA USING GENERAL-PURPOSE COMPUTING ON GRAPHICS PROCESSING UNITS creator Ali, Ansab advisor Khokhar, Ashfaq A. Modern mass spectrometers can produce mass spectra data at a very high rate. Usually, this data has a signi cant percentage of redundant spectra that in- crease the database lookup time when searching for peptides. Therefore, there is a need for data-mining techniques (e.g. clustering) to reduce the complexity of these mass spectra datasets before database search. Multi-core architectures, speci cally Graphics Processing Units (GPUs) have evolved tremendously in the recent years and are an ideal option for clustering these large mass spectra datasets. In this thesis, we present an e cient and scalable parallel algorithm for clustering mass spectra using the well known 'F-set' similarity metric. We describe the algorithmic framework and the various optimizations that serve to vastly improve the algorithm's performance and accuracy. We test the algorithm on a variety of real as well as self-generated mass spectra datasets and show that the algorithm achieves highly accurate clustering with performance gain of around 50 to 100 times as compared to serial implementations in literature. Thus, by clustering mass spectra corresponding to unique peptides to- gether, the algorithm allows faster identi cation of peptides in a subsequent database search. Submitted by Erma Thomas (thomase@iit.edu) on 2016-07-20T22:17:23Z No. of bitstreams: 1 etdadmin_upload_428742.zip: 1038594 bytes, checksum: b89d2c61757132d8d6f4aeec32196810 (MD5) Made available in DSpace on 2016-07-20T22:17:23Z (GMT). No. of bitstreams: 1 etdadmin_upload_428742.zip: 1038594 bytes, checksum: b89d2c61757132d8d6f4aeec32196810 (MD5) Previous issue date: 2016-05 M.S. in Electrical Engineering, May 2016 2016 2016-05 http://hdl.handle.net/10560/3899 en Clustering Mass Spectra Thesis born digital application/pdf In Copyright http://rightsstatements.org/page/InC/1.0/ Restricted Access ECE / Electrical and Computer Engineering Illinois Institute of Technology Affiliated department