Publication | Closed Access
Cross-Modal Semantic Communications
61
Citations
13
References
2022
Year
Natural Language ProcessingEngineeringMultimodal TranslationSemantic CommunicationsComputational LinguisticsMulti-modal Signal ProcessingMultimodal InteractionMultimodal Signal ProcessingComputer ScienceCommunicationSemanticsSemantic WebCross-modal Semantic CommunicationsCompact Semantic InformationLanguage StudiesLinguisticsSpeech Recognition
Multi‑modal services such as video, audio, and haptic aim for immersive, low‑latency, high‑reliability experiences, yet their structural differences create challenges; semantic communications promise to break modality barriers by extracting compact semantic information, but ensuring reliability remains difficult due to polysemy and ambiguity. The study proposes a cross‑modal semantic communication paradigm to address reliability challenges in semantic communications. The paradigm comprises a cross‑modal knowledge graph providing background knowledge and signal patches, a semantic encoder that infers implicit semantics to reduce polysemy, and a decoder that enforces consistency at bit and semantic levels to mitigate decoding ambiguity. Simulation results show that the integrated modules collaboratively improve system reliability.
It is well known that multi-modal services, including video, audio, and haptic signals, aim to provide immersive experience with low latency and high reliability. Although multi-modal signals have differences in structure, transmission delay, and jitter, there are potential cross-modal relations among these multi-modal signals that can be explored to realize multi-modal signal processing and streaming. As a revolutionary communication paradigm, semantic communications can not only provide human-oriented services by delivering the intended meanings, but also break the modality barriers through compact semantic information extracted from multi-modal signals, which can help to achieve cross-modal applications. However, due to the polysemy and ambiguity issues, how to guarantee high reliability for semantic communications is a tremendous challenge. To overcome this dilemma, this article designs a cross-modal semantic communication paradigm that involves three modules: The cross-modal knowledge graph (CKG) provides essential background knowledge and signal patches for encoding and decoding respectively; the cross-modal semantic encoder infers potential implicit semantics to reduce encoding polysemy; and the cross-modal semantic decoder ensures consistency between source signals and recovered signals in terms of bit level and semantic level in order to reduce decoding ambiguity. They collaboratively improve the reliability of the communication system, which is verified by the simulation result.
| Year | Citations | |
|---|---|---|
Page 1
Page 1