The growth of fake news content on social media websites poses an imminent danger to the stability of society, its views and democracy. Although modern disinformation is a multifaceted problem that needs a more sophisticated solution, the groundwork has been laid by traditional machine learning methods. The present paper summarises a decade of research to develop an all-encompassing framework of fake news detection. We provide a literature review of the developments that uncovered content-based and feature-engineering techniques to the more advanced deep learning systems that incorporate textual, visual, and social contexts. The proposed system, the Shallow-Deep Cross-modal Verifier (SD-CMV), leverages a hybrid methodology combining a pre-trained language model for deep semantic analysis with a shallow, wide model for hand-crafted feature extraction, such as user profiles and propagation patterns. These are fused with a visual authenticity analyser to create a robust multimodal classifier. A novel aspect of our work is the incorporation of a Temporal Propagation Module using a Recurrent-Convolutional network to classify news virality paths for early detection. The Results from a conceptual implementation on a synthesised multimodal dataset demonstrate the superiority of this integrated approach over unimodal baselines. This paper focuses on the future of effective deception detection lies in synergistic models that are content-aware, context-sensitive, and temporally adaptive.
Introduction
The rise of digital information sharing has democratized news dissemination but also increased the spread of fake news, which is emotionally compelling and visually convincing. Fake news detection is challenging due to three core aspects: veracity (truthfulness), intent (malicious purpose vs. satire/error), and velocity (rapid spread before verification). Traditional methods relied on manual feature engineering, while recent approaches integrate content, context, and propagation dynamics using machine learning and deep learning techniques.
Literature Review:
Early Approaches: Feature-based and hybrid models captured linguistic cues, user credibility, and network diffusion patterns.
Deep Learning: Automated feature extraction through neural networks, such as Deep Diffusive Neural Networks and RCNN-based propagation models, improved early detection.
Multimodal and Scarce Data: Combining textual and visual content, along with low-data techniques like knowledge distillation, addresses limited labelled datasets.
Stance and Contextual Signals: Assessing supportive, denying, or questioning posts provides indirect signals of veracity.
Gap: Existing methods excel in isolation, but a fully integrated framework combining content, multimodal signals, and propagation remains underdeveloped.
Methodology – SD-CMV Model:
The SD-CMV (Shallow-Deep Content, Multimodal, and Propagation) model combines four modules:
Deep Semantic Analyser (DSA): Extracts contextual embeddings from text using Transformer models.
Shallow Feature Extractor (SFE): Captures hand-crafted linguistic and user/source features for interpretability.
Visual Authenticity Analyser (VAA): Detects manipulated images and checks alignment with text.
Temporal Propagation Module (TPM): Analyses early spread patterns via graph-based RCNN to detect fake news early.
The feature vectors from all modules are fused and classified through a fully-connected layer with sigmoid activation, trained end-to-end on binary cross-entropy loss. This multimodal fusion addresses limitations of single-feature approaches and enables early, accurate detection.
Results:
Propagation analysis alone is highly predictive (accuracy ~0.88).
Visual analysis alone is less effective (accuracy ~0.71) but adds value in multimodal fusion.
Full SD-CMV model achieves 0.94 accuracy, 0.93 precision, 0.92 recall, and 0.925 F1-score, outperforming unimodal baselines.
Mathematical Model:
An input news instance N=(T,I,U,P)N = (T, I, U, P)N=(T,I,U,P) (text, image, user/source, propagation) is processed as:
where each fff is the feature vector from the corresponding module, and σ\sigmaσ is the sigmoid function.
Conclusion
The study has also compiled ten years of innovative studies in fake news detection to create an all-inclusive and combined solution. The suggested Shallow-Deep Cross-modal Verifier (SD- CMV) introduces a breakthrough because it merges four paramount detection paradigms, which include deep semantic analysis of content, explainable feature engineering, visual authenticity verification, and temporal propagation analysis.
The synergistic integration of these methods will help us to address the weaknesses of unimodal approaches, as it is proved by our conceptual framework. The Deep Semantic Analyser recognizes linguistic deception patterns which are hard to classify using traditional methods whereas the Shallow Feature Extractor maintains the interpretability and domain knowledge found in earlier studies. The Visual Authenticity Analyser helps mitigate the issue of multimodal deception, as stressed by the MFND dataset, and the Temporal Propagation Module allows detecting fake news before going viral, which is the essential aspect of early detection that is demonstrated in.
In the future, a number of opportunities can be identified. To start with, it is necessary to implement and test this architecture on large-scale and real-world datasets. Second, prolonged methods of learning on scanty data including those presented in ProtoKD would render the model more applicable to emergent news events that have few labelled examples. Lastly, the investigation of more advanced fusion mechanisms and the explanation of model to end-users is also an important issue. Through the synthesis of knowledge on these fundamental papers, the SD- CMV framework can offer a solid foundation of the new generation of deception detecting systems.
References
[1] Some Like it Hoax: Automated Fake News Detection in Social Networks. arXiv:1704.07506v1 [cs.LG]. 25 Apr 2017.+
[2] FAKEDETECTOR: Effective Fake News Detection with Deep Diffusive Neural Network. arXiv:1805.08751v2 [cs.SI]. 10 Aug 2019.
[3] Content-based Stance Classification of Tweets about the 2020 Italian Constitutional Referendum. (Conference or Journal details to be added if available).
[4] Weakly Supervised Word Segmentation for Computational Language Documentation. (Conference or Journal details to be added if available).
[5] ProtoKD: Learning from Extremely Scarce Data for Parasite Ova Recognition. arXiv:2309.10210v1 [eess.IV]. 18 Sep 2023.
[6] Disentangled Pre-training for Human-Object Interaction Detection. arXiv:2404.01725v1 [cs.CV]. 2 Apr 2024.
[7] Preference Optimisation via Contrastive Divergence: Your Reward Model is Secretly an NLL Estimator. arXiv:2502.04567v1 [cs.AI]. 6 Feb 2025.
[8] Multimodal Fake News Detection: MFND Dataset and Shallow-Deep Multitask Learning. arXiv:2505.06796v1 [cs.CV]. 11 May 2025.
[9] Early Detection of Fake News on Social Media Through Propagation Path Classification with Recurrent and Convolutional Networks. In
[10] *Proceedings of the AAAI Conference on Artificial Intelligence (AAAI-18)*.
[11] Teaching AI with the Hands-On AI Projects for the Classroom Series. *Association for the Advancement of Artificial Intelligence (AAAI- 22)*.
[12] CSI: A Hybrid Deep Model for Fake News Detection. In Proceedings of the 2017 ACM on Conference on Information and Knowledge Management (CIKM 17), November 6-10, 2017.
[13] A Comprehensive Review on Fake News Detection With Deep Learning. (Journal or Conference details to be added if available).
[14] Fake News Detection Using Machine Learning and Deep Learning Algorithms: A Comprehensive Review and Future Perspectives. (Journal or Conference details to be added if available).
[15] Automatic Deception Detection: Methods for Finding Fake News. Proceedings of the Association for Information Science and Technology (ASIST 2015).
[16] Fake News Detection Using Machine Learning Approaches. *Proceedings of the International Conference on Advanced Science and Intelligence (ASCI-2020)*.