A Specialized Data Crawler for Cross-Laminated Timber Information Resources

dc.contributor.authorThomas, Eden
dc.contributor.authorEspinoza, Omar A.en
dc.contributor.authorBora, Rahulen
dc.contributor.authorBuehlmann, Ursen
dc.contributor.departmentSustainable Biomaterialsen
dc.date.accessioned2021-04-14T12:54:50Zen
dc.date.available2021-04-14T12:54:50Zen
dc.date.issued2020en
dc.description.abstractThe Internet is composed of more than 6.2 billion Web pages and grows larger every day. As the number of links and specialty subject areas grows, it becomes ever more difficult to find pertinent information. For some subject areas, special-purpose data crawlers continually search the Internet for specific information; examples include real estate, air travel, auto sales, and others. The use of such special-purpose data crawlers (i.e., targeted crawlers and knowledge databases) also allows the collection and analysis of agricultural and forestry data. Such single-purpose crawlers can search for hundreds of key words and use machine learning to determine if what is found is relevant. In this article, we examine the design and data return of such a specialty knowledge database and crawler system developed to find information related to cross-laminated timber (CLT). Our search engine uses intelligent software to locate and update pertinent references related to CLT as well as to categorize information with respect to common application and interest areas. At the time of this publication, the CLT knowledge database has cataloged nearly 3,000 publications regarding various aspects of CLT.en
dc.description.adminPublic domain – authored by a U.S. government employeeen
dc.description.notesThe work on which this article is based was funded in whole or in part through a grant awarded by the Wood Innovations Program, USDA Forest Service.en
dc.description.sponsorshipWood Innovations Program, USDA Forest Serviceen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.13073/FPJ-D-20-00017en
dc.identifier.issn0015-7473en
dc.identifier.issue3en
dc.identifier.urihttp://hdl.handle.net/10919/103016en
dc.identifier.volume70en
dc.language.isoenen
dc.rightsPublic Domainen
dc.rights.urihttp://creativecommons.org/publicdomain/mark/1.0/en
dc.titleA Specialized Data Crawler for Cross-Laminated Timber Information Resourcesen
dc.title.serialForest Products Journalen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
dc.type.dcmitypeStillImageen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
nrs_2020_thomas_002.pdf
Size:
224.99 KB
Format:
Adobe Portable Document Format
Description: