Crisis Events Information Extraction

dc.contributor.authorRabbani, Eitsaamen
dc.contributor.authorSpies, Willen
dc.contributor.authorGregory, Sullyen
dc.contributor.authorBrown, Briannaen
dc.contributor.authorSaikrishna, Nishesh en
dc.date.accessioned2024-09-03T18:25:31Zen
dc.date.available2024-09-03T18:25:31Zen
dc.date.issued2024-05-01en
dc.descriptionThe documents titled "Crisis Events Information Extraction Presentation" in both PDF and PPTX formats represent our final presentations. These presentations portray the progress achieved throughout the development stages, provide an overview of the problem, outline our solution strategy, and detail the functionality of our application. The documents tiled "Crisis Events Information Extraction Report" in both PDF and DOCX formats are our in-depth reports that portray everything that needs to be known about this project. This includes our abstract, introduction, requirements, design, implementation, testing, user's and developer's manuals, and lessons learned. The "Crisis Events Extraction Information Extraction Code" is a zip file that contains all the code needed for our application to run.en
dc.description.abstractUnfortunately, crises occur quite frequently throughout the world. In an increasingly digital age, where most news outlets post articles about events online, there are often tens or even hundreds of articles about the same event. Although the information found in each article is often similar, some information may be specific to a certain article or news outlet. And, as each news outlet usually writes a lengthy article for each crisis event that happens, it can be hard to quickly locate and learn the basic, important information about a given crisis event. This web app project aims to expedite this lengthy process by consolidating any number of articles about a crisis event into who, what, where, when, and how (WWWWH). This information extraction is accomplished using machine learning for named entity recognition and dependency parsing. The extracted WWWWH info is displayed to the user in an easily digestible table, which allows for users to quickly learn the essential information regarding any given crisis event. Both the user’s input and the output data will be saved to a database, so that users can see their previous usages of the program again at any time. While users must manually input web articles into the program, whether as links or .txt files, there is potential in the future to use a web crawler to automate this initial article gathering. The stack for this applications utilizes the MERN Stack. MongoDB was chosen due to its flexible document structure. For the back-end features such as natural language processing and our server we utilized Python and Express/Node.js. The front-end consists of React which is used to fetch our data and utilizes component libraries such as MUI for consistent design language. The deliverables for this project include our Final Presentation and Final Report which show our progress throughout the development stages, and finally our code for the application which are submitted to our professor and client, Mohamed Farag.en
dc.identifier.urihttps://hdl.handle.net/10919/121063en
dc.language.isoen_USen
dc.publisherVirginia Techen
dc.rightsCC0 1.0 Universalen
dc.rights.urihttp://creativecommons.org/publicdomain/zero/1.0/en
dc.subjectExtractionen
dc.subjectNatural Language Processingen
dc.subjectWARCen
dc.subjectWebpagesen
dc.subjectArchiveen
dc.subjectPythonen
dc.subjectJavaScripten
dc.subjectReacten
dc.subjectMongoDBen
dc.titleCrisis Events Information Extractionen
dc.typeReporten
dc.typePresentationen
dc.typeSoftwareen

Files

Original bundle
Now showing 1 - 5 of 5
Name:
Crisis Events Extraction Report.docx
Size:
1.78 MB
Format:
Microsoft Word XML
Loading...
Thumbnail Image
Name:
Crisis Events Extraction Report.pdf
Size:
1.41 MB
Format:
Adobe Portable Document Format
Loading...
Thumbnail Image
Name:
Crisis Events Information Extraction Presentation.pdf
Size:
1.04 MB
Format:
Adobe Portable Document Format
Name:
Crisis Events Information Extraction Presentation.pptx
Size:
2.2 MB
Format:
Microsoft Powerpoint XML
Name:
Crisis Events Extraction Information Extraction Code.zip
Size:
131.1 MB
Format:
License bundle
Now showing 1 - 1 of 1
Name:
license.txt
Size:
1.5 KB
Format:
Item-specific license agreed upon to submission
Description: