Artificial Intelligence (AI)-based Semantic Communications with Multimodal Data: Framework and Implementation

dc.contributor.authorDeRieux, Jean-Luc Aristideen
dc.contributor.committeechairSaad, Waliden
dc.contributor.committeememberDhillon, Harpreet Singhen
dc.contributor.committeememberRamakrishnan, Narendranen
dc.contributor.departmentElectrical Engineeringen
dc.date.accessioned2025-09-23T08:00:19Zen
dc.date.available2025-09-23T08:00:19Zen
dc.date.issued2025-09-22en
dc.description.abstractSemantic communication (SC) has emerged as an effective paradigm for reducing the bandwidth needs of wireless services by exploiting the so-called "semantics" or meaning behind the data. To date, existing works in this area either focus on multimodal approaches only and omit context-aware recovery or embed it in cross-modal settings, such as audio-to-video, rather than providing a unified, modality-agnostic method. These works also impose substantial architecture redesigns for additional modalities support and are not easily extensible. In contrast to prior work, in this thesis, a novel semantic framework called the semantic context-aware framework for adaptive multimodal reasoning (SCE-FOAM) is proposed. SCE-FOAM is a multimodal semantic framework that enables compact transmission, efficient reconstruction, and contextually-aware predictions using a unique microservice-based architecture. This unique design simultaneously offers an extensible and modular platform for incorporating new modalities and enabling scalable deployment strategies. Experimental results show that SCE-FOAM can achieve data reductions up to 50% for text, 94.56% for audio, and 98.70% for images, respectively. Lastly, the proposed contextual‑prediction model achieves an average accuracy of 90% across all modalities. In addition, a heuristic-based extension of the deferred acceptance (DA) matching algorithm is proposed. The extension enables network node matches that incorporate exploration, coverage, and diversification heuristics. In summary, this thesis presents a unified, extensible, context-aware, multimodal SC framework and a heuristic extension to the DA matching algorithm.en
dc.description.abstractgeneralSemantic communication is a method for a device to talk with other devices on a wireless network using their own unique and dynamic language, rather than traditional methods that rely on information represented as binary 1's and 0's. By representing information in a traditional binary form, the devices are prevented from understanding the meaning of the message, which can be both inefficient and error-prone when information is being sent or received. By using semantic communication, the devices are endowed with understanding the underlying meaning, or semantics, of the information. This means devices will be able to represent information in their unique language, which can be more efficient than traditional methods and can allow devices to understand context when information is missing. Conversely, in this thesis, a novel semantic framework called the semantic context-aware framework for adaptive multimodal reasoning (SCE-FOAM) is proposed. This framework supports common media types such as text, audio, and images, and is able to efficiently represent information in minimized forms. Additionally, the framework allows for context recovery, which enables devices to predict missing information when their connection is lost. Lastly, it is designed to be easily used by software developers and researchers, which will encourage widespread adoption of SCE-FOAM. In summary, this thesis contributed a semantic framework that supports multimedia (text, audio, image) messages, context recovery, and is designed to be easily used for widespread adoption in the semantic communication field.en
dc.description.degreeMaster of Scienceen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:44664en
dc.identifier.urihttps://hdl.handle.net/10919/137817en
dc.language.isoenen
dc.publisherVirginia Techen
dc.rightsCreative Commons Attribution 4.0 Internationalen
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/en
dc.subjectSemanticsen
dc.subjectContextual Learningen
dc.subjectMultimodalen
dc.subjectMicroserviceen
dc.subjectMatchingen
dc.titleArtificial Intelligence (AI)-based Semantic Communications with Multimodal Data: Framework and Implementationen
dc.typeThesisen
thesis.degree.disciplineElectrical Engineeringen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
DeRieux_JA_T_2025.pdf
Size:
38.28 MB
Format:
Adobe Portable Document Format

Collections