Image steganalysis using active learning and hyperparameter optimization

Bohang L; Li N; Yang J; Alfarraj O; Albelhai F; Tolba A; Shaikh ZA; Alizadehsani R; Pławiak P; Yee PL

doi:10.1038/s41598-025-92082-w

Image steganalysis using active learning and hyperparameter optimization

Bohang L ¹ , Li N ² , Yang J ³ , Alfarraj O ⁴ , Albelhai F ⁴ , Tolba A ⁴ Show all authors , Shaikh ZA ⁵ , Alizadehsani R ⁶ , Pławiak P ⁷ , Yee PL ⁸

Affiliations

¹ Data science, Shopee, Singapore, 118265, Singapore
² Fu Foundation School of Engineering and Applied Science, Columbia University, New York, 10027, USA
³ Department of Computer System and Technology, Faculty of Computer Science and Information Technology, Universiti Malaya, 50603, Kuala Lumpur, Malaysia. s2147529@siswa.um.edu.my
⁴ Computer Science Department, Community College, King Saud University, 11437, Riyadh, Saudi Arabia
⁵ Department of Computer Science and Information Technology, Benazir Bhutto Shaheed University Lyari, Karachi, 75660, Pakistan
⁶ Institute for Intelligent Systems Research and Innovation (IISRI) Deakin University, Waurn Ponds, Australia
⁷ Department of Computer Science, Faculty of Computer Science and Telecommunications, Cracow University of Technology, Warszawska 24, 31-155, Krakow, Poland. pawel.plawiak@pk.edu.pl
⁸ Department of Computer System and Technology, Faculty of Computer Science and Information Technology, Universiti Malaya, 50603, Kuala Lumpur, Malaysia

Sci Rep, 2025 Mar 01;15(1):7340.

PMID: 40025136 DOI: 10.1038/s41598-025-92082-w

Abstract

Image steganalysis, detecting hidden data in digital images, is essential for enhancing digital security. Traditional steganalysis methods typically rely on large, pre-labeled image datasets, which are difficult and costly to compile. To address this, this paper introduces an innovative approach that combines active learning and off-policy Deep Reinforcement Learning (DRL) to improve image steganalysis with minimal labeled data. Active learning allows the model to intelligently choose which unlabeled images should be annotated, thus reducing the amount of labeled data needed for effective training. Traditional active learning strategies often use static selection methods that restrict flexibility and do not adjust well to dynamic environments. To overcome this, our method incorporates off-policy DRL for strategic data selection. The off-policy in DRL can increase sample efficiency and significantly enhance learning outcomes. We also use the Differential Evolution (DE) algorithm to fine-tune the hyperparameters of the model, reducing its sensitivity to different settings and ensuring more stable results. Our testing on the extensive BossBase 1.01 and BOWS-2 datasets demonstrates the robust ability of the approach to distinguish between unaltered and steganographic images, achieving an average F-measure of 93.152% on BossBase 1.01 and 91.834% on the BOWS-2 dataset. In summary, this research enhances digital security by employing advanced image steganalysis to detect hidden data, significantly improving detection accuracy with minimal labeled data.

* Title and MeSH Headings from MEDLINE®/PubMed®, a database of the U.S. National Library of Medicine.

Similar publications