Search results
(9,701 - 9,720 of 9,886)
- Title
- Intraoperative Assessment of Surgical Margins in Head And Neck Cancer Resection Using Time-Domain Fluorescence Imaging
- Creator
- Cleary, Brandon M.
- Date
- 2023
- Description
- Rapid and accurate determination of surgical margin depth in fluorescence-guided surgery has been a difficult issue to overcome, leading to over- or under-resection of cancerous tissues and follow-up treatments such as ‘call-back’ surgery and chemotherapy. Current techniques utilizing direct measurement of tumor margins in frozen-section pathology are slow, which can prevent surgeons from acting on information before a patient is sent home. Other fluorescence techniques require the measurement of margins via captured images that are overlaid with fluorescence data. This method is flawed, as measuring depth from captured images loses spatial information. Intensity-based fluorescence techniques utilizing tumor-to-background ratios do not decouple the effects of concentration from the depth information acquired. Thus, it is necessary to perform an objective measurement to determine the depths of surgical margins. This thesis focuses on the theory, device design, simulation development, and overall viability of time-domain fluorescence imaging as an alternative method of determining surgical margin depths. Characteristic regressions were generated using a thresholding method on acquired time-domain fluorescence signals and used to convert time-domain data to a depth value. These were applied to an image space to generate a depth map of a modeled tissue sample. All modeling was performed on homogeneous media using Monte Carlo simulations, providing high accuracy at the cost of increased computational time. In practice, the imaging process should be completed in under 20 minutes for a full tissue sample, rather than 20 minutes for a single slice of the sample. This thesis also explores the effects of different thresholding levels on the accuracy of depth determination, as well as the precautions to be taken regarding hardware limitations and signal noise.
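As a rough illustration of the thresholding idea described above, the sketch below converts a simulated time-domain fluorescence decay into a depth estimate via a threshold-crossing time and a pre-fitted characteristic regression; the function names and regression coefficients are hypothetical placeholders, not values from the thesis.

```python
import numpy as np

def threshold_crossing_time(t, signal, level=0.5):
    """Time at which the normalized signal first reaches `level` of its peak."""
    s = signal / signal.max()
    return t[np.argmax(s >= level)]

# Characteristic regression assumed fitted from Monte Carlo simulations of a
# homogeneous medium; slope/intercept here are placeholders (mm per ns, mm).
SLOPE, INTERCEPT = 2.1, -0.3

def depth_from_signal(t, signal, level=0.5):
    """Map a time-domain fluorescence signal to an estimated margin depth."""
    return SLOPE * threshold_crossing_time(t, signal, level) + INTERCEPT

# Illustrative decay curve: a delayed exponential standing in for a measured signal.
t = np.linspace(0, 10, 1000)                      # ns
signal = np.exp(-np.clip(t - 2.0, 0, None)) * (t >= 2.0)
print(f"estimated depth: {depth_from_signal(t, signal):.2f} mm")
```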
- Title
- Investigation in the Uncertainty of Chassis Dynamometer Testing for the Energy Characterization of Conventional, Electric and Automated Vehicles
- Creator
- Di Russo, Miriam
- Date
- 2023
- Description
- For conventional and electric vehicles tested in a standard chassis dynamometer environment, precise regulations exist on the evaluation of their energy performance. However, the regulations do not include requirements on the confidence value to associate with the results. As vehicles become more and more efficient to meet stricter regulatory mandates on emissions, fuel, and energy consumption, traditional testing methods may become insufficient to validate these improvements and may need revision. Without information about the accuracy associated with the results of those procedures, however, adjustments and improvements are not possible, since no frame of reference exists. For connected and automated vehicles, there are no standard testing procedures, and researchers are still in the process of determining whether current evaluation methods can be extended to test intelligent technologies and which metrics best represent their performance. For these vehicles it is even more important to determine the uncertainty associated with these experimental methods and how it propagates to the final results. The work presented in this dissertation focuses on the development of a systematic framework for the evaluation of the uncertainty associated with the energy performance of conventional, electric, and automated vehicles. The framework is based on a known statistical method to determine the uncertainty associated with the different stages and processes involved in the experimental testing, and to evaluate how the accuracy of each parameter involved impacts the final results. The results demonstrate that the framework can be successfully applied to existing testing methods, provides a trustworthy value of accuracy to associate with the energy performance results, and can be easily extended to connected-automated vehicle testing to evaluate how novel experimental methods impact the accuracy and the confidence of the outputs. The framework can easily be implemented in an existing laboratory environment to incorporate the uncertainty evaluation among the results analyzed at the end of each test, and provides a reference for researchers to evaluate the actual benefits of new algorithms and optimization methods and understand margins for improvement, and for regulators to assess which parameters to enforce to ensure compliance and ensure projected benefits.
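The abstract does not name the statistical method, but a common choice for this kind of framework is first-order (root-sum-square) uncertainty propagation; the sketch below applies it to a toy per-distance energy figure. All quantities and uncertainty values are illustrative placeholders, not results from the dissertation.

```python
import numpy as np

def propagated_uncertainty(sensitivities, sigmas):
    """Root-sum-square combination of independent input uncertainties."""
    return np.sqrt(sum((c * s) ** 2 for c, s in zip(sensitivities, sigmas)))

# Toy result: energy per distance E = P * t / d, with measured power P [kW],
# test duration t [h], and driven distance d [km]; sigmas are instrument std devs.
P, t, d = 15.0, 0.5, 11.0
sig_P, sig_t, sig_d = 0.2, 0.005, 0.05

E = P * t / d
dE = propagated_uncertainty([t / d, P / d, -P * t / d**2], [sig_P, sig_t, sig_d])
print(f"E = {E:.3f} +/- {dE:.3f} kWh/km")
```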
- Title
- Using Niobium surface encapsulation and Rhenium to enhance the coherence of superconducting devices
- Creator
- Crisa, Francesco
- Date
- 2024
- Description
- In recent decades, the scientific community has grappled with escalating complexity, necessitating a more advanced tool capable of tackling increasingly intricate simulations beyond the capabilities of classical computers. This tool, known as a quantum computer, features processors composed of individual units termed qubits. While various methods exist for constructing qubits, superconducting circuits have emerged as a leading approach, owing to their parallels with semiconductor technology. In recent years, significant strides have been made in optimizing the geometry and design of qubits. However, the current bottleneck in the performance of superconducting qubits lies in the presence of defects and impurities within the materials used. Niobium, owing to its desirable properties, such as high critical temperature and low kinetic inductance, stands out as the most prevalent superconducting material. Nonetheless, it is encumbered by a relatively thick oxide layer (approximately 5 nm) exhibiting three distinct oxidation states: NbO, NbO$_2$, and Nb$_2$O$_5$. The primary challenge with niobium lies in the multitude of defects localized within the highly disordered Nb$_2$O$_5$ layer and at the interfaces between the different oxides. In this study, I present an encapsulation strategy aimed at restraining surface oxide growth by depositing a thin layer (5 to 10 nm) of another material in vacuum atop the Nb thin film. This approach exploits the superconducting proximity effect, and it was successfully employed in the development of Josephson junction devices on Nb during the 1980s. In the past two years, tantalum and titanium nitride have emerged as promising alternative materials, with breakthrough qubit publications showcasing coherence times five to ten times longer than those achieved in Nb. The focus will be on the fabrication and RF testing of Nb-based qubits with Ta and Au capping layers. With Ta capping, we have achieved a best (not average) T1 decay time of nearly 600 µs, more than a factor of 10 improvement over bare Nb. This establishes the unique capping layer approach as a significant new direction for the development of superconducting qubits. Concurrently with the exploration of materials for encapsulation strategies, identifying materials conducive to enhancing the performance of superconducting qubits is imperative. Ideal candidates should exhibit a thin, low-loss surface oxide and establish a clean interface with the substrate, thereby minimizing defects and potential sources of losses. Rhenium, characterized by an extremely thin surface oxide (less than 1 nm) and nearly perfect crystal structure alignment with commonly used substrates such as sapphire, emerges as a promising material platform poised to elevate the performance of superconducting qubits.
- Title
- The Double-edged Sword of Executive Pay: How the CEO-TMT Pay Gap Influences Firm Performance
- Creator
- Haddadian Nekah, Pouya
- Date
- 2024
- Description
- This study examines the relationship between the chief executive officer (CEO) and top management team (TMT) pay gap and consequent firm performance. Drawing on tournament theory and equity theory, I argue that the effect of the CEO-TMT pay gap on consequent firm performance is non-monotonic. Using data from 1995 to 2022 from S&P 1500 US firms, I explicate an inverted U-shaped relationship, such that an increase in the pay gap leads to an increase in firm performance up to a certain point, after which it declines. Additionally, multilevel analyses reveal that this curvilinear relationship is moderated by attributes of the TMT and the industry in which the firm competes. My findings show that firms with higher TMT gender diversity suffer lower performance loss due to wider pay gaps. Furthermore, when firm executives are paid more compared to the industry norms, or when the firm has a long-tenured CEO, firm performance becomes less sensitive to larger CEO-TMT pay gaps. Lastly, when the firm competes in a masculine industry, firm performance is more negatively affected by larger CEO-TMT pay gaps. Contrary to my expectations, firm gender-diversity friendly policies failed to influence the CEO-TMT pay gap-firm performance relationship.
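To make the inverted-U claim concrete, the sketch below fits a quadratic regression of performance on the pay gap; an inverted U corresponds to a positive linear and a negative squared term. This is a simplified single-level OLS illustration with made-up numbers, not the study's multilevel specification or data.

```python
import pandas as pd
import statsmodels.formula.api as smf

# Hypothetical data: performance rises with the pay gap and then falls.
df = pd.DataFrame({
    "performance": [0.8, 1.1, 1.4, 1.5, 1.3, 0.9],
    "pay_gap":     [0.5, 1.0, 1.5, 2.0, 2.5, 3.0],
})

# Quadratic term captures the curvilinear (inverted-U) relationship.
model = smf.ols("performance ~ pay_gap + I(pay_gap ** 2)", data=df).fit()
print(model.params)   # expect pay_gap > 0 and I(pay_gap ** 2) < 0
```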
- Title
- Improving Niobium Superconducting Radio-Frequency Cavities by Studying Tantalum
- Creator
- Helfrich, Halle
- Date
- 2023
- Description
- Niobium superconducting radio-frequency (SRF) cavities are widely used accelerating structures. Improvements in both quality factor, Q0, and maximum accelerating gradient, Eacc, have been made to SRF cavities by introducing new processing techniques. These breakthroughs include processes such as nitrogen doping (N-doping) and infusion, electrochemical polishing (EP), and high pressure rinsing (HPR). [1] There is still abundant opportunity to improve the cavities or, rather, the material they’re primarily composed of: niobium. A focus here is the role the native oxide of Nb plays in SRF cavity performance. The values of interest in a given cavity are its quality factor Q0, maximum accelerating gradient Eacc, and surface resistance Rs. This work characterizes Nb and Ta foils prepared under identical conditions using X-ray photoelectron spectroscopy (XPS) to compare surface oxides and better understand RF loss mechanisms in Nb SRF cavities and qubits. It is well established that Ta qubits exhibit much longer coherence times than Nb qubits, which is probably due to the larger RF losses in Nb oxide. By studying tantalum, an element similar to niobium, the mechanisms of the losses that originate in the oxide and suboxide layers present on the surface of Nb cavities might finally be unlocked. We find noticeable differences in the oxides of Nb and Ta formed by air exposure of clean foils. In particular, Ta does not display the TaO2 suboxide in XPS, while Nb commonly shows NbO2. This suggests that suboxides are an additional contributor to RF losses. We also suggest that thin Ta film coatings of Nb SRF cavities may be a way of increasing Q0. It is in the interest of the accelerator community to fully understand the surface impurities present in Nb SRF cavities so that strategies for mitigating their effects can be proposed.
- Title
- Improving Localization Safety for Landmark-Based LiDAR Localization System
- Creator
- Chen, Yihe
- Date
- 2024
- Description
- Autonomous ground robots have gained traction in various commercial applications, with established safety protocols covering subsystem reliability, control algorithm stability, path planning, and localization. This thesis specifically delves into the localizer, a critical component responsible for determining the vehicle’s state (e.g., position and orientation), assesses compliance with localization safety requirements, and proposes methods for enhancing localization safety. Within the robotics domain, diverse localizers are utilized, such as scan-matching techniques like normal distribution transformations (NDT), the iterative closest point (ICP) algorithm, probabilistic map methods, and semantic map-based localization. Notably, NDT stands out as a widely adopted standalone laser localization method, prevalent in autonomous driving software such as the Autoware and Apollo platforms. In addition to the mentioned localizers, common state estimators include variants of the Kalman filter, particle filter-based, and factor graph-based estimators. The evaluation of localization performance typically involves quantifying the estimated state variance for these state estimators. While various localizer options exist, this study focuses on those utilizing extended Kalman filters and factor graph methods. Unlike methods such as the NDT and ICP algorithms, extended Kalman filter and factor graph-based approaches guarantee bounding of the estimated state uncertainty and have been extensively researched for integrity monitoring. Common variance analysis, employed for sensor readings and state estimators, has limitations, primarily focusing on non-faulted scenarios under nominal conditions. This approach proves impractical for real-world scenarios and falls short for safety-critical applications like autonomous vehicles (AVs). To overcome these limitations, this thesis utilizes a dedicated safety metric: integrity risk. Integrity risk assesses the reliability of a robot’s sensory readings and localization algorithm performance under both faulted and non-faulted conditions. With a proven track record in aviation, integrity risk has recently been applied to robotics applications, particularly for evaluating the safety of lidar localization. Despite the significance of improving localization integrity risk through laser landmark manipulation, this remains an underexplored territory. Existing research on robot integrity risk primarily focuses on the vehicles themselves. To comprehensively understand the integrity risk of a lidar-based localization system, an exploration of lidar measurement fault modes is essential, a topic covered in this thesis. The primary contributions of this thesis include: (1) a realistic error estimation method for state estimators in autonomous vehicles navigating with pole-shaped lidar landmark maps, along with a compensatory method; (2) a method for quantifying the risk associated with unmapped associations in urban environments, enhancing the realism of values provided by the integrity risk estimator; and (3) a novel approach to improve the localization integrity of autonomous vehicles equipped with lidar feature extractors in urban environments through minimal environmental modifications, mitigating the impact of unmapped association faults. Simulation and experimental results are presented and discussed to illustrate the impact of each method, providing further insights into their contributions to localization safety.
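As a conceptual aid, the sketch below computes a simplified integrity risk for a scalar position estimate: the probability that the estimation error exceeds an alert limit, summed over a fault-free hypothesis and a set of faulted hypotheses. This is a generic textbook-style formulation under Gaussian assumptions, not the estimator or fault models developed in the thesis; all numbers are illustrative.

```python
from scipy.stats import norm

def integrity_risk(sigma, alert_limit, fault_biases, fault_priors):
    """P(|error| > alert_limit), accumulated over fault-free and faulted hypotheses."""
    # Fault-free hypothesis: zero-mean Gaussian error with standard deviation sigma.
    risk = 2.0 * norm.sf(alert_limit / sigma)
    # Each faulted hypothesis shifts the error by a bias b and occurs with prior p.
    for b, p in zip(fault_biases, fault_priors):
        exceed = norm.sf((alert_limit - b) / sigma) + norm.cdf((-alert_limit - b) / sigma)
        risk += p * exceed
    return risk

# Illustrative numbers: 0.2 m estimator sigma, 1 m alert limit,
# two fault modes (e.g. unmapped association) with small prior probabilities.
print(integrity_risk(sigma=0.2, alert_limit=1.0,
                     fault_biases=[0.8, 1.5], fault_priors=[1e-3, 1e-4]))
```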
- Title
- Independence and Graphical Models for Fitting Real Data
- Creator
- Cho, Jason Y.
- Date
- 2023
- Description
- Given some real-life dataset where the attributes take on categorical values, with a corresponding r(1) × r(2) × … × r(m) contingency table with nonzero rows or nonzero columns, we will be testing the goodness-of-fit of various independence models to the dataset using a variation of Metropolis-Hastings that uses Markov bases as a tool to get a Monte Carlo estimate of the p-value. This variation of Metropolis-Hastings can be found in Algorithm 3.1.1. Next we consider the problem: "out of all possible undirected graphical models, each associated to some graph with m vertices, that we test to fit on our dataset, which one best fits the dataset?" Here, the m attributes are labeled as vertices of the graph. We would have to conduct 2^(mC2) goodness-of-fit tests, since there are 2^(mC2) possible undirected graphs on m vertices. Instead, we consider a backwards-selection likelihood-ratio test algorithm. We first start with the complete graph G = K(m), and call the corresponding undirected graphical model ℳ(G) the parent model. Then for each edge e in E(G), we repeatedly apply the likelihood-ratio test to test the relative fit of the model ℳ(G-e), the child model, vs. ℳ(G), the parent model, where ℳ(G-e) ⊆ ℳ(G). More details on this iterative process can be found in Algorithm 4.1.3. For our dataset, we use the alcohol dataset found at https://www.kaggle.com/datasets/sooyoungher/smoking-drinking-dataset, where the four attributes we use are "Gender" (male, female), "Age", "Total cholesterol (mg/dL)", and "Drinks alcohol or not?". After testing the goodness-of-fit of three independence models corresponding to the independence statements "Gender vs Drink or not?", "Age vs Drink or not?", and "Total cholesterol vs Drink or not?", we found that the data came from a distribution from the two independence models corresponding to "Age vs Drink or not?" and "Total cholesterol vs Drink or not?". And after applying the backwards-selection likelihood-ratio method on the alcohol dataset, we found that the data came from a distribution from the undirected graphical model associated to the complete graph minus the edge {"Total cholesterol", "Drink or not?"}.
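The sketch below gives a generic version of the backward-selection likelihood-ratio idea: starting from the complete graph, repeatedly test whether dropping an edge (the child model) fits as well as the current graph (the parent model). The `loglik` and `df_drop` callables are placeholders for a model-fitting routine the user supplies; this is not a reproduction of Algorithm 4.1.3 or of the Markov-basis Metropolis-Hastings test.

```python
from itertools import combinations
from scipy.stats import chi2

def lr_test(loglik_parent, loglik_child, df_diff):
    """Asymptotic likelihood-ratio p-value for a child model nested in its parent."""
    return chi2.sf(2.0 * (loglik_parent - loglik_child), df_diff)

def backward_select(vertices, loglik, df_drop, alpha=0.05):
    """Drop edges from the complete graph while the child model is not rejected.

    loglik(edges)        -> maximized log-likelihood of the graphical model (user-supplied)
    df_drop(edge, edges) -> degrees of freedom gained by removing `edge` (user-supplied)
    """
    edges = set(combinations(vertices, 2))
    while edges:
        parent_ll = loglik(edges)
        pvals = {e: lr_test(parent_ll, loglik(edges - {e}), df_drop(e, edges))
                 for e in edges}
        edge, p = max(pvals.items(), key=lambda kv: kv[1])
        if p <= alpha:        # every remaining edge is needed; stop
            break
        edges.remove(edge)    # child model fits as well as the parent; drop the edge
    return edges
```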
- Title
- Development of a Model To Investigate Inflammation Using Peripheral Blood Mononucleated Cells
- Creator
- Geevarghese Alex, Peter
- Date
- 2023
- Description
- Modern society faces one of its biggest health risks in high-calorie, diet-related postprandial inflammation. Uncontrolled consumption of energy-dense food may lead to chronic disease, and clinical studies have demonstrated this through the body's post-meal inflammatory response. We aimed to identify the causes of postprandial inflammation in response to various dietary treatments and to provide a model demonstrating them, using in vivo and in vitro techniques together with statistical analysis. The resulting model would help design specific treatments that minimize inflammation in response to diet. In addition to identifying key dietary components, the model also facilitates the design of individualized interventions to reduce inflammation, thereby improving long-term health outcomes. We aim to understand clinical observations of diet-induced postprandial inflammation at the molecular level and to contribute to reducing the impact of the chronic inflammatory disorders associated with postprandial inflammation.
- Title
- Large Language Model Based Machine Learning Techniques for Fake News Detection
- Creator
- Chen, Pin-Chien
- Date
- 2024
- Description
- With advanced technology, it is widely recognized that nearly everyone owns one or more personal devices. Consequently, people are evolving into content creators on social media or streaming platforms, sharing their personal ideas regardless of their education or expertise level. Distinguishing fake news is becoming increasingly crucial. However, recent research only presents comparisons of fake news detection between one or more models across different datasets. In this work, we applied Natural Language Processing (NLP) techniques with Naïve Bayes and DistilBERT machine learning methods, combining and augmenting four datasets. The results show that the balanced accuracy is higher than the average reported in recent studies. This suggests that our approach holds promise for improving fake news detection in the era of widespread content creation.
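For readers unfamiliar with the Naïve Bayes baseline mentioned above, a minimal scikit-learn pipeline with TF-IDF features and balanced accuracy scoring might look like the following; the texts and labels are placeholders, not the combined datasets used in the work.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline
from sklearn.model_selection import train_test_split
from sklearn.metrics import balanced_accuracy_score

# Placeholder corpus: 0 = real, 1 = fake.
texts = ["economy grows 3 percent", "aliens endorse candidate",
         "new vaccine approved", "celebrity secretly a robot"]
labels = [0, 1, 0, 1]

X_tr, X_te, y_tr, y_te = train_test_split(
    texts, labels, test_size=0.5, random_state=0, stratify=labels)

clf = make_pipeline(TfidfVectorizer(), MultinomialNB())
clf.fit(X_tr, y_tr)
print("balanced accuracy:", balanced_accuracy_score(y_te, clf.predict(X_te)))
```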
- Title
- DEVELOPMENT AND APPLICATION OF A NATIONALLY REPRESENTATIVE MODEL SET TO PREDICT THE IMPACTS OF CLIMATE CHANGE ON ENERGY CONSUMPTION AND INDOOR AIR QUALITY (IAQ) IN U.S. RESIDENCES
- Creator
- Fazli, Torkan
- Date
- 2020
- Description
- Americans spend most of their time inside residences, where they are exposed to a number of pollutants of both indoor and outdoor origin. Residential buildings also account for over 20% of total primary energy consumption in the U.S. and a similar proportion of greenhouse gas emissions. Moreover, climate change is expected to affect building energy use and indoor air quality (IAQ) through both building design (i.e., via our societal responses to climate change) and building operation (i.e., via changing meteorological and ambient air quality conditions). The overarching objectives of this work are to develop a set of combined building energy and indoor air mass balance models that are generally representative of both the current (i.e., ~2010s) and future (i.e., ~2050s) U.S. residential building stock, and to apply them using both current and future climate scenarios to estimate the impacts of climate change and climate change policies on building energy use, IAQ, and the prevalence of chronic health hazards in U.S. homes. The developed model set includes over 4000 individual building models with detailed characteristics of both building operation and indoor pollutant physics/chemistry, and is linked to a disability-adjusted life years (DALYs) approach for estimating chronic health outcomes associated with indoor pollutant exposure. The future building stock model incorporates a combination of predicted changes in future meteorological conditions, ambient air quality, the U.S. housing stock, and population demographics. Using the model set, we estimate that total site and source energy consumption for space conditioning in U.S. residences will decrease by ~37% and ~20%, respectively, by mid-century (~2050s) compared to 2012, driven by decreases in heating energy use across the building stock that are larger than coincident increases in cooling energy use in warmer climates. Indoor concentrations of most pollutants of ambient origin are expected to decrease, driven by predicted reductions in ambient concentrations due to tighter emissions controls, with one notable exception of ozone, which is expected to increase in future climate scenarios. This work provides the first known estimates of the potential magnitude of impacts of expected climate changes on building energy use, IAQ, and the prevalence of chronic health hazards in U.S. homes.
- Title
- Data-Driven Modeling for Advancing Near-Optimal Control of Water-Cooled Chillers
- Creator
- Salimian Rizi, Behzad
- Date
- 2023
- Description
- Hydronic heating and cooling systems are among the most common types of heating and cooling systems installed in older existing buildings, especially commercial buildings. The results of this study, based on the Commercial Building Energy Consumption Survey (CBECS), indicate that chillers provide cooling in more than half of the commercial office building floorspace in the U.S. Therefore, to address the need to improve the energy efficiency of chiller system operation, research studies have developed different models to investigate different chiller sequencing approaches. Engineering-based models and empirical models are among the popular approaches for developing prediction models. Engineering-based models utilize physical principles to calculate the thermal dynamics and energy behaviors of the systems and require detailed system information, while empirical models deploy machine learning algorithms to develop relationships between input and output data. Empirical models, compared to the engineering-based approach, are more practical for a system's energy prediction because of accessibility to required data, superiority in model implementation, and prediction accuracy. Moreover, selecting accurate chiller prediction models for chiller sequencing needs to consider the importance of each input variable and its contribution to the overall performance of a chiller system, as well as the ease of application and computational time. Among the empirical modeling methods, ensemble learning techniques overcome the instability of the learning algorithm as well as improve prediction accuracy and identify input variable importance. Ensemble models combine multiple individual models, often called base or weak models, to produce a more accurate and robust predictive model. Random Forest (RF) and Extreme Gradient Boosting (XGBoost) models are ensemble models that offer built-in mechanisms for assessing feature importance. These techniques work by measuring how much each feature contributes to the overall predictive performance of the ensemble. In the first objective of this work, the frequency of hydronic cooling systems in the U.S. building stock is explored for applying potential energy efficiency measures (EEMs) on chiller plants. Results show that central chillers inside the buildings are responsible for providing cooling for more than 50% of the commercial buildings with areas greater than 9,000 m2 (~100,000 ft2). In addition, hydronic cooling systems contribute to the highest Energy Use Intensity (EUI) among other systems, with an EUI of 410.0 kWh/m2 (130.0 kBtu/ft2). Therefore, the results of this objective support developing accurate prediction models to assess chiller performance parameters, with implications for chiller sequencing control strategies in older existing buildings. The second objective of the dissertation is to evaluate the performance of a chiller sequencing strategy for the existing water-cooled chiller plant in a high-rise commercial building and to develop highly accurate RF chiller models to investigate and determine the input variables of greatest importance to chiller power consumption predictions. The results show that the average values of mean absolute percentage error (MAPE) and root mean squared error (RMSE) for all three RF chiller models are 5.3% and 30 kW, respectively, for the validation dataset, which confirms good agreement between measured and predicted values. On the other hand, understanding prediction uncertainty is an important task for confidently reporting smaller savings estimates for different chiller sequencing control strategies. This study aims to quantify prediction uncertainty as a percentile for selecting an appropriate confidence level for chiller models, which could lead to better prediction of the peak electricity load and more efficient participation in demand response programs. The results show that by increasing the confidence level from 80% to 90%, the upper and lower bounds of the demand charge differ from the actual value by factors of 3.3 and 1.7, respectively. This demonstrates the significance of selecting appropriate confidence levels for implementation of chiller sequencing strategies and demand response programs in commercial buildings. In the third objective of this study, the accuracy of these prediction models with respect to preprocessing, selection of data, noise analysis, and the effect of chiller control system performance on the recorded data was investigated. This study therefore investigates the impacts of different data resolutions, levels of noise, and data smoothing methods on chiller power consumption and chiller COP prediction based on time-series Extreme Gradient Boosting (XGBoost) models. The results of applying the smoothing methods indicate that the performance of the chiller COP and chiller power consumption models improved by 2.8% and 4.8%, respectively. Overall, this study can guide the development of data-driven chiller power consumption and chiller COP prediction models in practice.
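As a sketch of the kind of data-driven chiller model discussed above, the code below trains a random forest on synthetic operating data and reports MAPE, RMSE, and built-in feature importances; the feature set, data, and hyperparameters are illustrative placeholders rather than the dissertation's models.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_percentage_error, mean_squared_error

# Synthetic stand-in for chiller operating data: three input variables
# (e.g. part-load ratio, chilled-water supply temp, condenser-water return temp).
rng = np.random.default_rng(0)
X = rng.uniform(size=(500, 3))
y = 300 * X[:, 0] + 20 * X[:, 1] - 15 * X[:, 2] + rng.normal(0, 5, 500)   # power, kW

model = RandomForestRegressor(n_estimators=200, random_state=0).fit(X[:400], y[:400])
pred = model.predict(X[400:])

mape = 100 * mean_absolute_percentage_error(y[400:], pred)
rmse = np.sqrt(mean_squared_error(y[400:], pred))
print(f"MAPE {mape:.1f}%  RMSE {rmse:.1f} kW  importances {model.feature_importances_}")
```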
- Title
- Scalable Indexing and Search in High-End Computing Systems
- Creator
- Orhean, Alexandru Iulian
- Date
- 2023
- Description
- Rapid advances in digital sensors, networks, storage, and computation, coupled with decreasing costs, are leading to the creation of huge collections of data. Increasing data volumes, particularly in science and engineering, have resulted in the widespread adoption of parallel and distributed file systems for storing and accessing data efficiently. However, as file system sizes and the amount of data "owned" by users grow, it is increasingly difficult to discover and locate data amongst the petabytes. While much research effort has focused on methods to efficiently store and process data, there has been relatively little focus on methods to efficiently explore, index, and search data using the same high-performance storage and compute systems. Users of large file systems either invest significant resources to implement specialized data catalogs for accessing and searching data, or resort to software tools that were not designed to exploit modern hardware. While it is now trivial to quickly discover websites from the billions accessible on the Internet, it remains surprisingly difficult for users to search for data on large-scale storage systems. We initially explored the prospect of using existing search engine building blocks (e.g., CLucene) to integrate search in a high-performance distributed file system (e.g., FusionFS) by proposing and building the FusionDex system, a distributed indexing and query model for unstructured data. We found indexing performance to be orders of magnitude slower than the theoretical speeds we could achieve in raw storage input and output, and sought to investigate a new clean-slate design for high-performance indexing and search. We proposed the SCANNS indexing framework to address the problem of efficiently indexing data in high-end systems, characterized by many-core architectures with multiple NUMA nodes and multiple PCIe NVMe storage devices. We designed SCANNS as a single-node framework that can be used as a building block for implementing high-performance indexed search engines, where the software architecture of the framework is scalable by design. The indexing pipeline is exposed and allows easy modification and tuning, enabling SCANNS to saturate storage, memory, and compute resources on different hardware. The proposed indexing framework uses a novel tokenizer and inverted index design to achieve large performance improvements both in indexing and in search latency. Given the large amounts and the variety of data found in scientific large-scale file systems, it stands to reason to try to bridge the gap between various data representations and to build and provide a more uniform search space. ScienceSearch is a search infrastructure for scientific data that uses machine learning to automate the creation of metadata tags from different data sources, such as published papers, proposals, images, and file system structure. ScienceSearch is a production system that is deployed on a container service platform at NERSC and provides search over data obtained from NCEM. We conducted a performance evaluation of the ScienceSearch infrastructure focusing on scalability trends in order to better understand the implications of performing search over an index built from the generated tags. Drawing from the insights gained from SCANNS and the performance evaluation of ScienceSearch, we explored the problems of efficiently building and searching persistent indexes that do not fit into main memory. The SCIPIS framework builds on top of SCANNS and further optimizes the inverted index design and indexing pipeline by exposing new tuning parameters that allow the user to further adapt the index to the characteristics of the input data. The proposed framework allows the user to quickly build a persistent index and to efficiently run TF-IDF queries over the built index. We evaluated SCIPIS over three kinds of datasets (logs, scientific data, and file system metadata) and showed that it achieves high indexing and search performance and good scalability across all datasets.
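To make the inverted-index and TF-IDF terminology concrete, here is a minimal in-memory example; it illustrates the general building blocks only and bears no relation to the actual SCANNS/SCIPIS data structures or performance techniques.

```python
import math
from collections import Counter, defaultdict

docs = {1: "fast parallel file system",
        2: "parallel index search",
        3: "file metadata search"}

# Inverted index: term -> {doc_id: term frequency}
index = defaultdict(dict)
for doc_id, text in docs.items():
    for term, tf in Counter(text.split()).items():
        index[term][doc_id] = tf

def tfidf_search(query, k=3):
    """Rank documents by summed TF-IDF contributions of the query terms."""
    scores = defaultdict(float)
    for term in query.split():
        postings = index.get(term, {})
        if not postings:
            continue
        idf = math.log(len(docs) / len(postings))
        for doc_id, tf in postings.items():
            scores[doc_id] += tf * idf
    return sorted(scores.items(), key=lambda kv: -kv[1])[:k]

print(tfidf_search("parallel search"))
```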
- Title
- Colored Pencil Drawings, undated
- Creator
- Henry, Mary Dill, 1913-2009
- Description
- Untitled colored pencil drawings by Mary Henry, date unknown. Inscription on verso: "William Winter Comments, PO Box 817, Sausalito"
- Collection
- Mary Dill Henry Papers, 1913-2021
- Title
- Computational Genomics of Human-Infecting Microsporidia Species from the Genus Encephalitozoon
- Creator
- Mascarenhas dos Santos, Anne Caroline
- Date
- 2023
- Description
- Microsporidia are obligate intracellular pathogens classified as category B priority pathogens by the National Institute of Allergy and Infectious Diseases (NIAID), a division of the National Institutes of Health (NIH). Microsporidian species from the genus Encephalitozoon infect humans and can cause encephalitis, keratoconjunctivitis, or enteric diseases in both immunocompromised and immunocompetent individuals. The main treatment for disseminated microsporidiosis available in the United States is albendazole, an anthelmintic benzimidazole that is also used to treat fungal infections, but species from the Encephalitozoonidae have already shown signs of resistance against this drug. The Encephalitozoonidae harbors highly specialized pathogens with the smallest known eukaryote genomes, with Encephalitozoon cuniculi featuring a genome of only 2.9 Mbp and coding for a proteome of roughly 2,000 proteins. Pathogens are in an everlasting race to quicken their adaptation pace against host defenses. This adaptation is often driven by gene duplication, recombination, and/or mutation, and due to the potentially disruptive nature of duplication and recombination processes, much of this evolution in pathogens takes place outside conserved genomic loci. As such, genes involved in virulence and drug resistance in pathogens are often localized in the (sub)telomeres rather than in chromosome cores. The small and streamlined nature of microsporidian genomes makes them excellent candidates to investigate the adaptation of pathogens to host defenses, the evolution of their virulence, and the development of their resistance to drugs from a genomic perspective. However, microsporidian genomes are highly divergent at the DNA sequence level, and the ones that have been sequenced so far are incomplete and lack the telomeres. This high level of sequence divergence hinders standard sequence homology-based functional annotations, blurring our understanding of what these organisms are capable of from a metabolic perspective. The gap in our knowledge of what is encoded in the microsporidia telomeres could lead to an underestimation of their pathogenic capabilities. Therefore, deciphering the functions of unknown proteins in microsporidia genomes and unraveling the content of their telomeres is important to fully assess their potential for adaptability to host defenses and predisposition to drug resistance. Likewise, a better understanding of the genetic diversity in microsporidia will help assess the extent to which host-pathogen interactions are shaping the adaptation of these parasites to humans. As observed in the COVID-19 pandemic, genetic diversity can influence the speed at which pathogens adapt to host defenses and thus can pose a big challenge to disease control. The development of strategies for controlling microsporidiosis outbreaks will likely benefit from the work performed in this thesis. As part of my PhD work, I investigated the virulence and host-adaptation capabilities of human-infecting microsporidia species from the genus Encephalitozoon with computational genomic approaches. This work included: 1) using structural homology to infer the functions of unknown proteins from the microsporidia proteome; 2) sequencing the complete genomes from telomere to telomere of three distinct Encephalitozoon spp. (E. cuniculi, E. hellem and E. intestinalis) to determine the genetic makeup of their telomeres and better understand the extent of their diversity; and 3) assessing the intraspecies genetic diversity that exists between Encephalitozoon species.
- Title
- Eating disorder support group utilization: Associations with psychological health and eating disorder psychopathology among support group attendees
- Creator
- Murray, Matthew F.
- Date
- 2023
- Description
- Individuals with eating disorders (EDs) report psychosocial impairments that may persist beyond ED symptom remission, suggesting a need to examine ED treatment-adjunctive services that foster psychosocial health. One promising resource is support groups, as evidence across medical and psychiatric illnesses shows associations between group utilization and wellbeing. However, virtually no literature has examined ED-specific support groups and psychosocial health, and it is also unknown how use of supportive services relates to ED symptoms. The present study examined associations between past-month ED support group attendance and participation frequency, psychosocial health indices, and ED symptoms. A total of 215 participants who attended weekly virtual clinician-moderated ED support groups completed measures of psychosocial health, internalized stigma of mental illness, psychosocial impairment from an ED, specific types of social support elicited in group, and ED psychopathology. Adjusting for past-month ED treatment, Benjamini-Hochberg-corrected partial correlation analyses indicated that more frequent attendance was negatively related to body dissatisfaction, purging, excessive exercise, and negative attitudes toward obesity, and positively related to social support. More frequent verbal and chat participation were positively related to emotional and informational support and social companionship. Chat participation was additionally negatively related to excessive exercise and negative attitudes toward obesity. Results suggest that utilizing and participating in clinician-moderated ED support groups could provide an outlet for ED symptom management and solicitation of social support. Findings highlight areas for further consideration in the delivery of and future research on ED support groups.
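For readers curious about the analysis named above, the sketch below runs partial correlations (residualizing out a covariate) and applies a Benjamini-Hochberg correction; the variables are random placeholders, the p-value is a simple approximation from the residual correlation, and this is not the study's dataset or exact procedure.

```python
import numpy as np
from scipy import stats
from statsmodels.stats.multitest import multipletests

def partial_corr(x, y, covar):
    """Pearson correlation of x and y after regressing the covariate out of both."""
    A = np.column_stack([np.ones_like(covar), covar])
    rx = x - A @ np.linalg.lstsq(A, x, rcond=None)[0]
    ry = y - A @ np.linalg.lstsq(A, y, rcond=None)[0]
    return stats.pearsonr(rx, ry)          # (r, approximate p-value)

rng = np.random.default_rng(0)
n = 200
treatment = rng.normal(size=n)             # covariate: past-month ED treatment
attendance = rng.normal(size=n)            # predictor: attendance frequency
outcomes = {"body_dissatisfaction": rng.normal(size=n),
            "social_support": rng.normal(size=n)}

pvals = [partial_corr(attendance, v, treatment)[1] for v in outcomes.values()]
reject, p_adj, _, _ = multipletests(pvals, alpha=0.05, method="fdr_bh")
print(dict(zip(outcomes, p_adj)))
```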
- Title
- Optimization methods and machine learning model for improved projection of energy market dynamics
- Creator
- Saafi, Mohamed Ali
- Date
- 2023
- Description
- Since signing the legally binding Paris Agreement, governments have been striving to fulfill the decarbonization mission. To reduce carbon emissions from the transportation sector, countries around the world have created well-defined new energy vehicle development strategies that are further expanding into hydrogen vehicle technologies. In this study, we develop the Transportation Energy Analysis Model (TEAM) to investigate the impact of CO2 emissions policies on the future of the automotive industry. On the demand side, TEAM models consumer choice considering the impacts of technology cost, energy cost, refueling/charging availability, and consumer travel patterns. On the supply side, the module simulates the technology supply by the auto industry with the objective of maximizing industry profit under the constraints of government policies. Therefore, we apply different optimization methods to guarantee reaching the optimal automotive industry response each year up to 2050. From developing an upgraded differential evolution algorithm to applying response surface methodology to simplify the objective function, the goal is to enhance optimization performance and efficiency compared to adopting the standard genetic algorithm. Moreover, we investigate TEAM's robustness by applying a sensitivity analysis to find the key parameters of the model. Finally, based on the key sensitive parameters that drive the automotive industry, we develop a neural network to learn the market penetration model and predict the market shares in a competitive time by bypassing the total cost of ownership analysis and profit optimization. The central motivating hypothesis of this thesis is that modern optimization and modeling methods can be applied to obtain a computationally efficient, industry-relevant model to predict optimal market sales shares for light-duty vehicle technologies. In fact, developing a robust market penetration model that is optimized using sophisticated methods is a crucial tool for automotive companies, as it quantifies consumer behavior and delivers the optimal way to maximize their profits by highlighting the vehicle technologies that they could invest in. In this work, we prove that TEAM reaches the global solution to optimize not only the industry profits but also optimized alternative-fuel blends such as synthetic fuels. The time complexity of the model has been substantially improved, decreasing from hours using the genetic algorithm, to minutes using differential evolution, to milliseconds using the neural network.
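As an illustration of the differential evolution step mentioned above, the sketch below maximizes a toy industry "profit" over technology sales shares subject to a CO2-target penalty; the profit function, bounds, and numbers are invented placeholders, not TEAM's formulation.

```python
import numpy as np
from scipy.optimize import differential_evolution

def negative_profit(shares):
    """Toy objective: profit margin of a sales mix minus a fleet-CO2 penalty."""
    margin = np.array([1.0, 0.6, 0.4])        # per-technology profit margin
    co2 = np.array([180.0, 90.0, 0.0])        # per-technology gCO2/km
    mix = shares / shares.sum()               # normalize to a sales mix
    penalty = max(0.0, mix @ co2 - 95.0)      # exceedance of a 95 gCO2/km target
    return -(mix @ margin - 0.05 * penalty)   # negate: SciPy minimizes

result = differential_evolution(negative_profit, bounds=[(0.01, 1.0)] * 3, seed=0)
print("optimal mix:", result.x / result.x.sum(), "profit:", -result.fun)
```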
- Title
- Migration of Silver from Silver Zeolite/Low-Density Polyethylene Films into Food Stimulants
- Creator
- Sayeed, Maryam
- Date
- 2023
- Description
- Zeolites are naturally occurring or synthetic crystalline microporous aluminosilicate structures with remarkable catalytic, adsorption, and ion-exchange properties. Their unique framework of pores, channels, and cages with precise dimensions makes them an excellent fit for ion exchange and storage. Silver-exchanged zeolite (Ag/Y) composites may be incorporated into polymer matrices to create antimicrobial packaging materials. The slow release of Ag from nanosilver-enabled polymer nanocomposites (PNCs) may inhibit the growth of bacteria and other pathogens on the film's surface, improving food quality and reducing food waste. However, the migration of Ag ions from the film into food matrices is of great concern, as it could expose humans to high concentrations of a heavy metal from dietary sources. The amount of migration depends on various factors, including the potential form of Ag and its concentration in the film, the film thickness, and the storage conditions. The primary objective of this study is to investigate the effect of the form of Ag bound to the zeolite on the migration behavior of Ag from Ag/Y-incorporated low-density polyethylene (LDPE) films. For Ag/Y-incorporated LDPE PNCs with distinct Ag species, the Ag migration into water and Squirt (a commercial soft drink) was at least four times higher from films containing zeolites exchanged with ionic Ag versus zeolites exchanged with nanoparticulate Ag. Similarly, migration into a 9 wt % aqueous Domino sugar (granulated sucrose) solution was seven times higher in the ionic silver-incorporated film than in the nanoparticulate Ag film. This study suggests that it is important to consider the form of Ag in silver-exchanged zeolite while producing packaging materials, since the potential form of Ag in the PNCs might significantly affect Ag migration behavior.
- Title
- Evaluating Speech Separation Through Pre-Trained Deep Neural Network Models
- Creator
- Prabhakar, Deeksha
- Date
- 2023
- Description
- Speaker separation involves separating individual speakers from a mixture of voices or background noise, known as the "cocktail party problem." This refers to the ability to focus on a specific sound while filtering out other distractions. In this analysis, we propose the idea of obtaining features present in the original data and then evaluating the impact they have on the ability of the model to separate the mixed audio streams. The dataset is prepared such that these feature values can be used as predictor variables to various models such as logistic regression, decision trees, SVM (both RBF and linear kernels), XGBoost, and AdaBoost, to obtain the most contributing features, that is, the features that lead to a better separation. These results are then analyzed to determine which features affect separation of the audio streams the most. Initially, 400 audio streams are selected from the VoxCeleb dataset and combined to form 200 single utterances. After the mixes are obtained, the pre-trained SpeechBrain model sepformer-whamr is used. This model separates the audio mixes given as input and obtains two outputs that should be as close as possible to the original ones. A feature list from the 400 chosen audio streams is obtained, and then the effect of certain features on the model's capability to distinguish between multiple audio sources in a mixed recording is assessed. Two analysis parameters, permutation feature importance and SHAP values, are used to determine which features have the greatest effect on separation. Our hypothesis is that the features contributing the most to a good separation are invariant across datasets. To test this hypothesis, we obtain 1,000 audio streams from the Mozilla Common Voice dataset and perform the same experimental methodology described above. Our results demonstrate that the features we extract from the VoxCeleb dataset are indeed invariant and aid in separating the audio streams of the Mozilla Common Voice dataset.
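The feature-importance step described above can be approximated with scikit-learn's permutation importance, as in the sketch below; the features, the mock separation-quality score, and the regressor are placeholders rather than the actual VoxCeleb features or models used in the work.

```python
import numpy as np
from sklearn.ensemble import GradientBoostingRegressor
from sklearn.inspection import permutation_importance

# Mock data: four audio features per utterance and a synthetic separation-quality score.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 4))                                  # e.g. pitch, energy, centroid, tempo
y = 2.0 * X[:, 0] - 1.0 * X[:, 2] + rng.normal(0, 0.5, 200)    # stand-in quality score

model = GradientBoostingRegressor(random_state=0).fit(X, y)
imp = permutation_importance(model, X, y, n_repeats=10, random_state=0)
print(imp.importances_mean)   # larger drop in score when shuffled = more important feature
```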
- Title
- Improving self-supervised monocular depth estimation from videos using forward and backward consistency
- Creator
- Shen, Hui
- Date
- 2020
- Description
- Recently, there has been rapid development in monocular depth estimation based on self-supervised learning. However, existing self-supervised learning methods are insufficient for estimating moving objects, occlusions, and large static areas. Uncertainty or vanishing easily occurs during depth inference. To address this problem, the model proposed in this thesis first further explores the temporal consistency in video and builds a multi-frame model for depth estimation; second, by taking advantage of optical flow, a motion mask is generated, with an additional photometric loss applied to the masked regions. Experiments are carried out on the KITTI dataset. The proposed model performs better than the baseline model in quantitative results, and, as seen from the depth maps, the scale uncertainty and incomplete depth situations are explicitly improved for moving objects and occlusions.
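A minimal PyTorch sketch of the masked photometric term described above is shown below: a standard L1 photometric loss plus an extra contribution restricted to an optical-flow-derived motion mask. The weighting and the plain L1 form are simplifications for illustration, not the thesis's exact loss.

```python
import torch

def masked_photometric_loss(target, reconstructed, motion_mask, extra_weight=0.5):
    """target, reconstructed: (B, 3, H, W) images; motion_mask: (B, 1, H, W) in {0, 1}."""
    l1 = (target - reconstructed).abs().mean(dim=1, keepdim=True)   # per-pixel photometric error
    base = l1.mean()                                                # loss over the whole image
    masked = (l1 * motion_mask).sum() / motion_mask.sum().clamp(min=1.0)  # loss on moving regions
    return base + extra_weight * masked

# Toy usage with random tensors standing in for warped/target frames and a mask.
tgt, rec = torch.rand(2, 3, 64, 64), torch.rand(2, 3, 64, 64)
mask = (torch.rand(2, 1, 64, 64) > 0.7).float()
print(masked_photometric_loss(tgt, rec, mask))
```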
- Title
- Development of validation guidelines for high pressure processing to inactivate pressure resistant and matrix-adapted Escherichia coli O157:H7, Salmonella spp. and Listeria monocytogenes in treated juices
- Creator
- Rolfe, Catherine
- Date
- 2020
- Description
- The fruit and vegetable juice industry has shown a growing trend toward minimally processed juices. A frequent technology used in the functional juice division is cold pressure, which refers to the application of high pressure processing (HPP) at low temperatures as a mild treatment to inactivate foodborne pathogens instead of thermal pasteurization. HPP juice manufacturers are required to demonstrate a 5-log reduction of the pertinent microorganism to comply with FDA Juice HACCP. The effectiveness of HPP on pathogen inactivation depends on processing parameters, juice composition, and packaging application, as well as the bacterial strains included in validation studies. Unlike thermal pasteurization, there is currently no consensus on validation study approaches for bacterial strain selection or preparation and no agreement on which HPP process parameters contribute to overall process efficacy. The purpose of this study was to develop validation guidelines for HPP inactivation and post-HPP recovery of pressure-resistant and matrix-adapted Escherichia coli O157:H7, Salmonella spp., and Listeria monocytogenes in juice systems. Ten strains of each microorganism were prepared in three growth conditions (neutral, cold-adapted, or acid-adapted) and assessed for barotolerance or sensitivity. Pressure-resistant and pressure-sensitive strains from each were used to evaluate HPP inactivation with increasing pressure levels (200 – 600 MPa) in two juice matrices (apple and orange). A 75-day shelf-life analysis was conducted on HPP-treated juices inoculated with acid-adapted resistant strains of each pathogen and examined for inactivation and recovery. Individual strains of E. coli O157:H7, Salmonella spp., and L. monocytogenes demonstrated significant (p < 0.05) differences in reduction levels in response to pressure treatment in high-acid environments. E. coli O157:H7 was the most barotolerant of the three microorganisms in multiple matrices. Bacterial screening resulted in identification of the pressure-resistant strains E. coli O157:H7 TW14359, Salmonella Cubana, and L. monocytogenes MAD328, and the pressure-sensitive strains E. coli O157:H7 SEA13B88, S. Anatum, and L. monocytogenes CDC. HPP inactivation in juice matrices (apple and orange) confirmed acid adaptation as the most advantageous of the growth conditions. Shelf-life analyses reached the required 5-log reduction in HPP-treated juices immediately following pressure treatment, after 24 h in cold storage, and after 4 days of cold storage for L. monocytogenes MAD328, S. Cubana, and E. coli O157:H7 TW14359, respectively. Recovery of L. monocytogenes in orange juice was observed with prolonged cold storage time. These results suggest the preferred inoculum preparation for HPP validation studies is the use of acid-adapted, pressure-resistant strains. At 586 – 600 MPa, critical inactivation (5-log reduction) was achieved during post-HPP cold storage, suggesting sufficient HPP lethality is reached at elevated pressure levels with a subsequent cold-holding duration.
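The 5-log criterion referenced above is just a base-10 ratio of counts before and after treatment; a one-line check, with purely illustrative CFU numbers, looks like this:

```python
import math

def log_reduction(initial_cfu, surviving_cfu):
    """Log10 reduction achieved by a treatment, from plate counts in CFU/mL."""
    return math.log10(initial_cfu / surviving_cfu)

print(log_reduction(1e7, 50))   # ~5.3-log reduction, which meets a 5-log requirement
```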