Tracking Text in Mixed Mode Documents

dc.contributor.authorBixler, J. Patricken
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2013-06-19T14:35:42Zen
dc.date.available2013-06-19T14:35:42Zen
dc.date.issued1988en
dc.description.abstractThis paper describes a method for extracting arbitrarily oriented text in documents containing both text and graphics. The technique presented is inspired by the tracking algorithms frequently found in raster to vector conversion systems. By identifying text components in the document, reducing the resolution of the image by the size of the characters, and then tracking the centers of the character components, all text strings can be removed and subsequently reoriented to the horizontal. They can then be presented for automated character recognition. A by-product of the method is that characters are automatically grouped together to form words and/or phrases. We give a detailed description of the algorithm, discuss its strengths and weaknesses, and present some sample results obtained from a typical city street map.en
dc.format.mimetypeapplication/pdfen
dc.identifierhttp://eprints.cs.vt.edu/archive/00000104/en
dc.identifier.sourceurlhttp://eprints.cs.vt.edu/archive/00000104/01/TR-88-19.pdfen
dc.identifier.trnumberTR-88-19en
dc.identifier.urihttp://hdl.handle.net/10919/19984en
dc.language.isoenen
dc.publisherDepartment of Computer Science, Virginia Polytechnic Institute & State Universityen
dc.relation.ispartofHistorical Collection(Till Dec 2001)en
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.titleTracking Text in Mixed Mode Documentsen
dc.typeTechnical reporten
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TR-88-19.pdf
Size:
871.64 KB
Format:
Adobe Portable Document Format