<?xml version='1.0' encoding='utf-8'?>
<mods xmlns="http://www.loc.gov/mods/v3" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" version="3.7" xsi:schemaLocation="http://www.loc.gov/mods/v3 http://www.loc.gov/standards/mods/v3/mods-3-7.xsd">
   <name>
      <role>
         <roleTerm type="text" authority="marcrelator" authorityURI="http://id.loc.gov/vocabulary/relators" valueURI="http://id.loc.gov/vocabulary/relators/cre">creator</roleTerm>
      </role>
      <namePart>Shen, Hui</namePart>
   </name>
   <titleInfo>
      <title>Improving self-supervised monocular depth estimation from videos using forward and backward consistency</title>
   </titleInfo>
   <originInfo>
      <dateCreated keyDate="yes">2020</dateCreated>
   </originInfo>
   <note displayLabel="Degree Awarded">Spring 2020</note>
   <typeOfResource authority="aat" valueURI="http://vocab.getty.edu/page/aat/300028029">Thesis</typeOfResource>
   <name type="corporate">
      <affiliation>Illinois Institute of Technology</affiliation>
   </name>
   <name type="corporate">
      <namePart>ECE / Electrical and Computer Engineering</namePart>
   </name>
   <name authority="wikidata" authorityURI="https://www.wikidata.org" valueURI="https://www.wikidata.org/wiki/Q102410753">
      <role>
         <roleTerm type="text" authority="marcrelator" authorityURI="http://id.loc.gov/vocabulary/relators" valueURI="http://id.loc.gov/vocabulary/relators/cre">advisor</roleTerm>
      </role>
      <namePart>Kim, Joohee</namePart>
   </name>
   <subject>
      <topic>Electrical engineering</topic>
   </subject>
   <subject>
      <topic>Consistency</topic>
   </subject>
   <subject>
      <topic>Depth estimation</topic>
   </subject>
   <subject>
      <topic>Image reconstruction</topic>
   </subject>
   <subject>
      <topic>Motion estimation</topic>
   </subject>
   <subject>
      <topic>Occlusion</topic>
   </subject>
   <subject>
      <topic>Optical flow</topic>
   </subject>
   <language>
      <languageTerm type="code" authority="rfc3066">en</languageTerm>
   </language>
   <abstract>Recently, there has been a rapid development in monocular depth estimation based on self-supervised learning. However, these existing self-supervised learning methods are insufficient for estimating motion objects, occlusions, and large static areas. Uncertainty or vanishing easily occurs during depth inferencing. To address this problem, the model proposed in this thesis further explores the consistency in video and builds a multi-frame model for depth estimation; secondly, by taking advantage of the optical flow, a motion mask is generated, with additional photometric loss applied for those masked regions. Experiments are carried out on the KITTI dataset. The proposed model performs better than the baseline model in quantitative results, and as seen from the depth map, the scale uncertainty and depth incomplete situations are improved in motion objects and occlusions explicitly.</abstract>
   <physicalDescription>
      <digitalOrigin>born digital</digitalOrigin>
      <internetMediaType>application/pdf</internetMediaType>
   </physicalDescription>
   <accessCondition type="useAndReproduction" displayLabel="rightsstatements.org">In
                Copyright</accessCondition>
   <accessCondition type="useAndReproduction" displayLabel="rightsstatements.orgURI">http://rightsstatements.org/page/InC/1.0/</accessCondition>
   <accessCondition type="restrictionOnAccess">Restricted Access</accessCondition>
<identifier type="hdl">http://hdl.handle.net/10560/islandora:1025020</identifier></mods>