Cholera Database

dc.contributor.authorCroxall, Emilyen
dc.contributor.authorRoberto, Michaelen
dc.contributor.authorSharma, Hemakshien
dc.contributor.authorAlcantara, Gabrielaen
dc.contributor.authorGarcía Solares, Andrésen
dc.date.accessioned2020-05-13T02:46:49Zen
dc.date.available2020-05-13T02:46:49Zen
dc.date.issued2020-05-12en
dc.description.abstractThis project involved work toward a database of Cholera records from 2010 – 2020. The WHO repository was used to extract and normalize data to build CSV files. Each year where data is available has a CSV file containing location and total number of cases in the location. The ProMED repository was used to collect data for the same timeframe. The data was extracted, condensed, and tagged for easier manual viewing. Data for all years available is given in one CSV file. Data from WHO can be viewed in logarithmically colored maps based on the number of cases in each location. These visualizations are produced for each year in the study. The data from ProMED can be viewed in bar graphs which graph the number of articles that occur and in what weeks the articles are written for each country. These visualizations can be seen or downloaded at choleradb.cs.vt.edu. Additionally, all the CSV files of data produced are available for download on our website. Due to the complexity of NLP and the inconsistencies in the ProMED articles, our data is not completely normalized and requires some manual work. Unforeseen circumstances, including the COVID-19 crisis, slowed the project’s progress. Therefore, the ProMED data extraction did not proceed further, other data repositories have not been explored, and interactive visualizations have not been built. The results of this project are compiled datasets and data visualizations from the WHO and ProMED repositories. These are useful to our client for future analysis as well as anyone else who may be interested in the trends of Cholera outbreaks. The results of data collection are formatted for easy analysis and reading. The graphics provide a simple visual for those who are more interested in higher level analysis. This project can be useful to developers who are working on data extraction and representation in the field of epidemiology or other case based global studies. In the future, more repositories can be explored for more extensive results. Additionally, further work can be done with the ProMED set developed in order to condense it further and eliminate the need for any manual analysis after our program is run. The results of this project are all available publicly on choleradb.cs.vt.edu, including for download. All code is open source and available on Gitlab.en
dc.description.notesCode Repository: https://git.cs.vt.edu/mikero/choleradatabase Files: CholeraDBPresentation.mp4: this is a video recording of our final presentation being presented. CholeraDBPresentation.pptx: this is the PowerPoint version of our final presentation. CholeraDBPresentation.pdf: this is a PDF version of our final presentation. CholeraDBReport.docx: this is a Word document of our report. CholeraDBReport.pdf: this is a PDF version of our report.en
dc.identifier.urihttp://hdl.handle.net/10919/98231en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsCreative Commons CC0 1.0 Universal Public Domain Dedicationen
dc.rights.urihttp://creativecommons.org/publicdomain/zero/1.0/en
dc.subjectWHOen
dc.subjectWorld Health Organizationen
dc.subjectProMEDen
dc.subjectCholeraen
dc.subjectCholera Databaseen
dc.subjectDatabaseen
dc.subjectPythonen
dc.subjectspaCyen
dc.subjectNLPen
dc.subjectBeautifulSoupen
dc.subjectWebsiteen
dc.titleCholera Databaseen
dc.typePresentationen
dc.typeReporten
dc.typeVideoen

Files

Original bundle
Now showing 1 - 5 of 5
Loading...
Thumbnail Image
Name:
CholeraDBPresentation.pdf
Size:
1.77 MB
Format:
Adobe Portable Document Format
Name:
CholeraDBPresentation.pptx
Size:
7.74 MB
Format:
Microsoft Powerpoint XML
Name:
CholeraDBReport.docx
Size:
1.93 MB
Format:
Microsoft Word XML
Loading...
Thumbnail Image
Name:
CholeraDBReport.pdf
Size:
2.33 MB
Format:
Adobe Portable Document Format
Name:
CholeraDBPresentation.mp4
Size:
17.98 MB
Format:
MP4 Container format for video files
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: