dc.contributor.author | Kirchhoff, Allan Richard | en |
dc.date.accessioned | 2015-05-23T08:06:00Z | en |
dc.date.available | 2015-05-23T08:06:00Z | en |
dc.date.issued | 2014-10-16 | en |
dc.identifier.other | vt_gsexam:3492 | en |
dc.identifier.uri | http://hdl.handle.net/10919/52569 | en |
dc.description.abstract | Unmanned ground vehicles (UGVs) are increasingly being used for civilian and military
applications. Passive sensing, such as visible cameras, are being used for navigation and object
detection. An additional object of interest in many environments is text. Text information can
supplement the autonomy of unmanned ground vehicles. Text most often appears in the
environment in the form of road signs and storefront signs. Road hazard information, unmapped
route detours and traffic information are available to human drivers through road signs. Premade
road maps lack these traffic details, but with text localization the vehicle could fill the
information gaps. Leading text localization algorithms achieve ~60% accuracy; however,
practical applications are cited to require at least 80% accuracy [49].
The goal of this thesis is to test existing text localization algorithms against challenging scenes,
identify the best candidate and optimize it for scenes a UGV would encounter. Promising text
localization methods were tested against a custom dataset created to best represent scenes a UGV
would encounter. The dataset includes road signs and storefront signs against complex
background. The methods tested were adaptive thresholding, the stroke filter and the stroke
width transform. A temporal tracking proof of concept was also tested. It tracked text through a
series of frames in order to reduce false positives.
Best results were obtained using the stroke width transform with temporal tracking which
achieved an accuracy of 79%. That level of performance approaches requirements for use in
practical applications. Without temporal tracking the stroke width transform yielded an accuracy
of 46%. The runtime was 8.9 seconds per image, which is 44.5 times slower than necessary for
real-time object tracking. Converting the MATLAB code to C++ and running the text
localization on a GPU could provide the necessary speedup | en |
dc.format.medium | ETD | en |
dc.publisher | Virginia Tech | en |
dc.rights | In Copyright | en |
dc.rights.uri | http://rightsstatements.org/vocab/InC/1.0/ | en |
dc.subject | text localization | en |
dc.subject | Optical Character Recognition | en |
dc.subject | Unmanned Ground Vehicles | en |
dc.subject | Stroke Width Transform | en |
dc.subject | Stroke Filter | en |
dc.subject | Adaptive Thresholding | en |
dc.subject | Machine Vision | en |
dc.subject | Robotic Perception | en |
dc.subject | Machine Learning | en |
dc.subject | Support Vector Machine | en |
dc.title | Text Localization for Unmanned Ground Vehicles | en |
dc.type | Thesis | en |
dc.contributor.department | Mechanical Engineering | en |
dc.description.degree | Master of Science | en |
thesis.degree.name | Master of Science | en |
thesis.degree.level | masters | en |
thesis.degree.grantor | Virginia Polytechnic Institute and State University | en |
thesis.degree.discipline | Mechanical Engineering | en |
dc.contributor.committeechair | Wicks, Alfred L. | en |
dc.contributor.committeemember | Parikh, Devi | en |
dc.contributor.committeemember | Bird, John P. | en |