An O(n log n) Algorithm for Finding Minimal Perfect Hash Functions

dc.contributor.authorFox, Edward A.en
dc.contributor.authorHeath, Lenwood S.en
dc.contributor.authorChen, Qi-Fanen
dc.contributor.departmentComputer Scienceen
dc.date.accessioned2013-06-19T14:37:14Zen
dc.date.available2013-06-19T14:37:14Zen
dc.date.issued1989en
dc.description.abstractWe describe the first practical algorithm for finding minimal perfect hash functions that can be used to access very large databases. This method extends earlier work wherein an O(n3) algorithm was devised, building upon prior work by Sager that described an O(n4) algorithm. Our new O(n log n) expected time algorithm makes use of three key insights: applying randomness whereever possible, ordering our search for hash functions based on the degree of the vertices in a graph that represents word dependencies, and viewing hash value assignment in terms of adding circular patterns of related words to a partially filled disk. While ultimately applicable to a wide variety of data and file access needs, this algorithm has already proven useful in aiding our work in improving performance of CD-ROM systems and our construction of a Large External Network Database (LEND) for semantic networks and hypertext/hypermedia collections. Virginia Disc One includes a demonstration of a minimal perfect hash function running on a PC to access a 130,198 word list on that CD-ROM, and several other microcomputer, minicomputer, and parallel processor versions and applications of our algorithm have also been developed. Tests with a French word list of 420,878 entries and a library catalog key set with over 1.2 million keys have shown that our methods work with very large databases.en
dc.format.mimetypeapplication/pdfen
dc.identifierhttp://eprints.cs.vt.edu/archive/00000147/en
dc.identifier.sourceurlhttp://eprints.cs.vt.edu/archive/00000147/01/TR-89-10.pdfen
dc.identifier.trnumberTR-89-10en
dc.identifier.urihttp://hdl.handle.net/10919/19510en
dc.language.isoenen
dc.publisherDepartment of Computer Science, Virginia Polytechnic Institute & State Universityen
dc.relation.ispartofHistorical Collection(Till Dec 2001)en
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.titleAn O(n log n) Algorithm for Finding Minimal Perfect Hash Functionsen
dc.typeTechnical reporten
dc.type.dcmitypeTexten

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
TR-89-10.pdf
Size:
2.01 MB
Format:
Adobe Portable Document Format