US State Tourism

dc.contributor.authorVerelly, Abhinaven
dc.contributor.authorDavid, Gruhnen
dc.contributor.authorBhattarai, Ashutoshen
dc.contributor.authorGrishaw, Shaneen
dc.date.accessioned2021-05-14T01:07:36Zen
dc.date.available2021-05-14T01:07:36Zen
dc.date.issued2021-05-13en
dc.description.abstractEach state in the United States has its own state-run website, which is used as a means to attract new tourists to that location. Each of these sites is typically used to highlight any big attractions in that state. Any travel tips, facts regarding that location, blog posts, ratings from other individuals that have traveled there, or any other useful information that may attract potential tourists are also included. These websites are maintained and funded directly by occupancy taxes. Occupancy taxes are a form of state tax that an individual pays whenever one stays in a hotel or visits any attractions in that state. As such, the main goal of these websites is to attract new tourists to their location. These websites are maintained and paid for by past tourists who have visited that state. Funding for future state tourism is determined by how many previous tourists have visited the state and paid the occupancy tax. Researchers need to be able to determine which elements of the website are most beneficial in attracting tourists. This can be determined by examining past tourism websites and looking for any patterns that would determine what worked well and what didn’t. These patterns can then be used to determine what was successful and use that information to make better-informed decisions. Our client, Dr. Florian Zach of the Howard Feiertag Department of Hospitality and Tourism Management, plans to use the historical analysis done by our team, to further help his research on trends in state tourism websites content. Different iterations of each state tourism website are stored as snapshots on the Internet Archive and can be accessed to see changes that took place in that website. Our team was given Parquet files of these snapshots for the states of California, Colorado, and Virginia dating back to 1998. The goal of the project was to assist Dr. Zach by using these Parquet files to perform data extraction and visualization on tourism patterns. This can then be expanded to other states’ tourism websites in the future. We used a combination of Python’s Pandas library, Jupyter Notebook, and BeautifulSoup to examine and extract relevant pieces of data from the given Parquet files. This data was extracted into various different categories, each with its own designated folder. These categories were raw text, images, background colors and background images, internal and external links, and meta tags. With this data sorted into the appropriate folders, we are then able to determine specific patterns such as what colored background was used the most. With our data extraction portion of this project completed along with the visualization, we hope to pass this on to future teams so that they are able to expand on our current project for the rest of the states.en
dc.description.notesUSStateTourismFinalReport.docx (Word doc Version) and USStateTourismFinalReport.pdf (PDF version) are the two versions for our final report. USStateTourismPresentationFinal.pptx (Powerpoint version) and USStateTourismPresentationFinal.pdf (PDF version) are the two versions for our final presentation.en
dc.description.sponsorshipNSF IIS-1619028, Global Event and Trend Archive Research (GETAR)en
dc.description.sponsorshipNSF CMMI-1638207, Coordinated, Behaviorally-Aware Recovery for Transportation and Power Disruptions (CBAR-tpd)en
dc.identifier.urihttp://hdl.handle.net/10919/103269en
dc.language.isoen_USen
dc.publisherVirginia Tech.en
dc.subjectPythonen
dc.subjectData Analyticsen
dc.subjectVisualizaitonen
dc.subjectBeautifulSoupen
dc.subjectpyarrowen
dc.subjectJupyter Notebooken
dc.subjectMatplotliben
dc.subjectTourismen
dc.subjectWeb scrapingen
dc.titleUS State Tourismen
dc.typePresentationen
dc.typeReporten

Files

Original bundle
Now showing 1 - 4 of 4
Loading...
Thumbnail Image
Name:
USStateTourismPresentationFinal.pdf
Size:
2.57 MB
Format:
Adobe Portable Document Format
Name:
USStateTourismPresentationFinal.pptx
Size:
10.5 MB
Format:
Microsoft Powerpoint XML
Loading...
Thumbnail Image
Name:
USStateTourismFinalReport.pdf
Size:
4.11 MB
Format:
Adobe Portable Document Format
Name:
USStateTourismFinalReport.docx
Size:
3.91 MB
Format:
Microsoft Word XML
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: