Fast and Scalable Structure-from-Motion for High-precision Mobile Augmented Reality Systems

dc.contributor.author: Bae, Hyojoon
dc.contributor.committeechair: White, Christopher Jules
dc.contributor.committeemember: Reed, Jeffrey H.
dc.contributor.committeemember: Athanas, Peter M.
dc.contributor.committeemember: Golparvar-Fard, Mani
dc.contributor.committeemember: Clancy, Thomas Charles III
dc.contributor.department: Electrical and Computer Engineering
dc.date.accessioned: 2014-04-25T08:01:15Z
dc.date.available: 2014-04-25T08:01:15Z
dc.date.issued: 2014-04-24
dc.description.abstract: A key problem in mobile computing is providing people access to the cyber-information associated with their surrounding physical objects. Mobile augmented reality is one of the emerging techniques that address this problem by letting users see cyber-information overlaid on imagery of the associated real-world physical objects. Consequently, many mobile augmented reality approaches have been proposed that identify and visualize relevant cyber-information on users' mobile devices by intelligently interpreting users' 3D positions and orientations and their surroundings. However, existing approaches for mobile augmented reality primarily rely on Radio Frequency (RF)-based location tracking technologies (e.g., Global Positioning Systems or Wireless Local Area Networks), which typically do not provide sufficient precision in RF-denied areas, or which require additional hardware and custom mobile devices.

To remove the dependency on external location tracking technologies, this dissertation presents a new vision-based, context-aware approach for mobile augmented reality that allows users to query and access semantically rich 3D cyber-information related to real-world physical objects and see it precisely overlaid on imagery of those objects. The approach requires no RF-based location tracking modules, no external hardware attached to the mobile device, and no optical or fiducial markers for localizing the user. Rather, the user's 3D location and orientation are derived automatically and purely by comparing images from the user's mobile device against a 3D point cloud model generated from a set of pre-collected photographs.

A further challenge of mobile augmented reality is creating 3D cyber-information and associating it with real-world physical objects, especially through the limited 2D user interfaces of standard mobile devices. To address this challenge, this research provides a new image-based 3D cyber-physical content authoring method designed specifically for the limited screen sizes and capabilities of commodity mobile devices. The new approach not only provides a method for creating 3D cyber-information with standard mobile devices, but also automatically associates user-created cyber-information with real-world physical objects in 3D.

Finally, this dissertation addresses a key challenge of scalability for mobile augmented reality. In general, mobile augmented reality must work regardless of the user's location and environment, both in physical scale, such as the size of objects, and in cyber-information scale, such as the total number of cyber-information entities associated with physical objects. However, many existing approaches have been tested only on limited real-world use cases and face challenges in scaling. By designing fast direct 2D-to-3D matching algorithms for localization and applying a caching scheme, the proposed research consistently supports near real-time localization and information association regardless of the user's location, the size of the physical objects, and the number of cyber-physical information items.
To realize these research objectives, five research methods are developed and validated: 1) Hybrid 4-Dimensional Augmented Reality (HD4AR), 2) plane-transformation-based 3D cyber-physical content authoring from a single 2D image, 3) cached k-d tree generation for fast direct 2D-to-3D matching, 4) a double-stage matching algorithm with a single indexed k-d tree, and 5) k-means clustering of 3D physical models with geo-information. After discussing each solution in technical detail, the perceived benefits and limitations of the research are presented along with validation results.
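
To make the localization idea concrete, here is a minimal, illustrative sketch of direct 2D-to-3D matching followed by pose recovery; it is not the dissertation's actual implementation, and all function and variable names are hypothetical. Descriptors of features detected in a query image are matched against the descriptors of a pre-built Structure-from-Motion point cloud via a k-d tree, and the 6-DOF camera pose is then estimated with PnP and RANSAC, here using SciPy and OpenCV.

import numpy as np
import cv2
from scipy.spatial import cKDTree

def localize(query_desc, query_kp, model_desc, model_xyz, K):
    """Estimate a 6-DOF camera pose against an SfM point cloud.

    query_desc : (N, 128) float32, feature descriptors of the query image (e.g., SIFT)
    query_kp   : (N, 2)   float32, pixel coordinates of those features
    model_desc : (M, 128) float32, one descriptor per reconstructed 3D point
    model_xyz  : (M, 3)   float32, 3D points of the SfM point cloud
    K          : (3, 3)   camera intrinsic matrix
    """
    # Index the model descriptors in a k-d tree. In a cached scheme, this
    # tree is built offline once per model and reused for every query image.
    tree = cKDTree(model_desc)

    # Direct 2D-to-3D matching: for each query descriptor, find the two
    # nearest model descriptors and keep matches passing Lowe's ratio test.
    dist, idx = tree.query(query_desc, k=2)
    keep = dist[:, 0] < 0.7 * dist[:, 1]
    pts2d = query_kp[keep].astype(np.float32)           # matched pixel coords
    pts3d = model_xyz[idx[keep, 0]].astype(np.float32)  # matched world coords

    # Robust pose estimation from the 2D-3D correspondences (PnP + RANSAC).
    ok, rvec, tvec, inliers = cv2.solvePnPRansac(pts3d, pts2d, K, None)
    return (rvec, tvec) if ok else None

With the tree cached, per-query cost is dominated by nearest-neighbor lookups and robust pose estimation, which is what makes near real-time localization plausible as the model grows.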
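Similarly, a minimal sketch, again an assumption for illustration rather than the validated method, of k-means clustering of geo-tagged 3D models: models are grouped by geographic position so that, at query time, only the cluster nearest the user's coarse location needs to be loaded and matched, keeping the matching cost bounded as the total number of models grows.

import numpy as np
from sklearn.cluster import KMeans

def build_geo_clusters(model_latlon, n_clusters=8):
    # Group geo-tagged 3D models by their (latitude, longitude) positions.
    # Euclidean k-means on lat/lon is a simplification that is adequate at
    # local scales; a projected coordinate system would be more rigorous.
    km = KMeans(n_clusters=n_clusters, n_init=10, random_state=0)
    labels = km.fit_predict(np.asarray(model_latlon, dtype=float))
    return km, labels

def nearest_cluster(km, user_latlon):
    # Pick the geo-cluster whose centroid is closest to the user's coarse
    # position (e.g., a last known GPS fix); only that cluster's models and
    # cached k-d tree are then used for 2D-to-3D matching.
    return int(km.predict(np.asarray(user_latlon, dtype=float).reshape(1, -1))[0])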
dc.description.degree: Ph. D.
dc.format.medium: ETD
dc.identifier.other: vt_gsexam:2637
dc.identifier.uri: http://hdl.handle.net/10919/47679
dc.publisher: Virginia Tech
dc.rights: In Copyright
dc.rights.uri: http://rightsstatements.org/vocab/InC/1.0/
dc.subject: 3D Reconstruction
dc.subject: Structure-from-Motion
dc.subject: 3D Cyber-physical Modeling
dc.subject: Direct 2D-to-3D Matching
dc.subject: Image-based Localization
dc.subject: Mobile Augmented Reality
dc.title: Fast and Scalable Structure-from-Motion for High-precision Mobile Augmented Reality Systems
dc.type: Dissertation
thesis.degree.discipline: Computer Engineering
thesis.degree.grantor: Virginia Polytechnic Institute and State University
thesis.degree.level: doctoral
thesis.degree.name: Ph. D.

Files

Original bundle (2 of 2 files)

Name: Bae_H_D_2014.pdf
Size: 22.02 MB
Format: Adobe Portable Document Format

Name: Bae_H_D_2014_support_1.pdf
Size: 114.09 KB
Format: Adobe Portable Document Format
Description: Supporting documents