<?xml version='1.0' encoding='utf-8'?>
<mods xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" version="3.7" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-7.xsd">
  <titleInfo>
    <title>CLUSTERING ALGORITHM FOR MASS SPECTROMETRY DATA USING GENERAL-PURPOSE COMPUTING ON GRAPHICS PROCESSING UNITS</title>
  </titleInfo>
  <name>
    <role>
      <roleTerm type="text" authority="marcrelator" authorityURI="http://id.loc.gov/vocabulary/relators" valueURI="http://id.loc.gov/vocabulary/relators/cre">creator</roleTerm>
    </role>
    <namePart>Ali, Ansab</namePart>
  </name>
  <name authority="wikidata" authorityURI="https://www.wikidata.org" valueURI="https://www.wikidata.org/wiki/Q102345257">
    <role>
      <roleTerm type="text" authority="marcrelator" authorityURI="http://id.loc.gov/vocabulary/relators" valueURI="http://id.loc.gov/vocabulary/relators/ths">advisor</roleTerm>
    </role>
    <namePart>Khokhar, Ashfaq A.</namePart>
  </name>
  <abstract>Modern mass spectrometers can produce mass spectra data at a very high rate. Usually, this data has a signi cant percentage of redundant spectra that in- crease the database lookup time when searching for peptides. Therefore, there is a need for data-mining techniques (e.g. clustering) to reduce the complexity of these mass spectra datasets before database search. Multi-core architectures, speci cally Graphics Processing Units (GPUs) have evolved tremendously in the recent years and are an ideal option for clustering these large mass spectra datasets. In this thesis, we present an e cient and scalable parallel algorithm for clustering mass spectra using the well known 'F-set' similarity metric. We describe the algorithmic framework and the various optimizations that serve to vastly improve the algorithm's performance and accuracy. We test the algorithm on a variety of real as well as self-generated mass spectra datasets and show that the algorithm achieves highly accurate clustering with performance gain of around 50 to 100 times as compared to serial implementations in literature. Thus, by clustering mass spectra corresponding to unique peptides to- gether, the algorithm allows faster identi cation of peptides in a subsequent database search.</abstract>
  <note type="provenance">Submitted by Erma Thomas (thomase@iit.edu) on 2016-07-20T22:17:23Z No. of bitstreams: 1 etdadmin_upload_428742.zip: 1038594 bytes, checksum: b89d2c61757132d8d6f4aeec32196810 (MD5)</note>
  <note type="provenance">Made available in DSpace on 2016-07-20T22:17:23Z (GMT). No. of bitstreams: 1 etdadmin_upload_428742.zip: 1038594 bytes, checksum: b89d2c61757132d8d6f4aeec32196810 (MD5) Previous issue date: 2016-05</note>
  <note type="thesis">M.S. in Electrical Engineering, May 2016</note>
  <originInfo>
    <dateCaptured>2016</dateCaptured>
  </originInfo>
  <originInfo>
    <dateCreated keyDate="yes">2016-05</dateCreated>
  </originInfo>
  <identifier type="hdl">http://hdl.handle.net/10560/3899</identifier>
  <language>
    <languageTerm type="code" authority="rfc3066">en</languageTerm>
  </language>
  <subject>
    <topic>Clustering</topic>
  </subject>
  <subject>
    <topic>Mass Spectra</topic>
  </subject>
  <typeOfResource authority="coar" valueURI="http://purl.org/coar/resource_type/c_46ec">Thesis</typeOfResource>
  <physicalDescription>
    <digitalOrigin>born digital</digitalOrigin>
    <internetMediaType>application/pdf</internetMediaType>
  </physicalDescription>
  <accessCondition type="useAndReproduction" displayLabel="rightsstatements.org">In Copyright</accessCondition>
  <accessCondition type="useAndReproduction" displayLabel="rightsstatements.orgURI">http://rightsstatements.org/page/InC/1.0/</accessCondition>
  <accessCondition type="restrictionOnAccess">Restricted Access</accessCondition>
  <name type="corporate">
    <namePart>ECE / Electrical and Computer Engineering</namePart>
    <affiliation>Illinois Institute of Technology</affiliation>
    <role>
      <roleTerm type="text">Affiliated department</roleTerm>
    </role>
  </name>
</mods>