Intellectual property protection of neural network architectures via steganography and IEEE 754 standard

Arevalo-Ancona, Rodrigo Eduardo; Cedillo-Hernandez, Manuel

doi:10.1108/ACI-06-2025-0236

Purpose

Neural networks are used in diverse applications, making them vulnerable to tampering and reinforcing the need for ownership authentication. The proposed method is based on a steganographic technique that embeds binary information into the weights using the IEEE 754 representation to enhance the security of the neural network and ownership authentication.

Design/methodology/approach

The proposed method is assessed using a variational autoencoder. Moreover, this technique can be extended to other neural networks. Ownership information is embedded within the most stable layers of the neural network, determined via gradient-based analysis, to enhance robustness against common model alterations, including fine-tuning, compression, pruning, overwriting, noise injection and weight quantization.

Findings

The experimental results confirm minimal impact on model performance and ensure reliable data recovery. The bit error rate evaluates the robustness of the proposed method, which obtained values ranging from 0.0131 to 0.129 for different weight pruning (10–50%). These results were further corroborated by extensive experimental validation.

Originality/value

The proposed method introduces a steganographic technique that embeds ownership information using the IEEE 754 representation. Unlike existing techniques, this approach embeds information into the weights without modifying the model structure and maintains the model’s performance without structural changes.

1. Introduction

With the rapid development and widespread application of artificial intelligence (AI) systems based on neural networks, the need for robust intellectual property protection has become critical. Therefore, reliable ownership authentication methods are designed to prevent unauthorized use and tampering [1–4]. Different protection methodologies have been proposed, including steganography, watermarking, fingerprinting and backdoor-based verifications, offering copyright protection, enabling ownership verification and supporting authentication [5–9]. White-box protection methods embed information into the parameters of a neural network to verify ownership. Often, these techniques require access to the neural network parameters to ensure intellectual property protection and ownership verification of neural networks [10–13]. Protecting neural networks against pruning and compression remains a critical research area, as these techniques reduce model size and computational cost.

Recent methods have aimed to preserve model accuracy while embedding ownership information. Watermark embedding during training enhance protection effectiveness while maintaining model accuracy. However, retraining the network may remove the watermark [14]. Ownership data embedding on the discrete cosine transform (DCT) coefficients enhances resistance to pruning without affecting accuracy. However, it has limited payload embedding capacity [15]. Multi-watermark embedding has been demonstrated to have a minimal impact on model performance, and it is designed for robust information hiding. The watermark extraction may be computationally intensive [16]. Hybrid methods use zero-watermarking bit and neural network optimization as a second watermarking layer to enhance robustness. This makes it difficult to balance robustness, imperceptibility and efficiency [17]. The parameter regularizer function for embedding during training or fine-tuning allows data embedding across training. Its robustness is limited to certain attacks [18]. White-box feature embedding via decision tree selection enables relevant feature selection for embedding control. However, it is vulnerable to model extraction attacks [19]. The implementation of the Power Function Mapping (PF-Mapping) as an activation function embeds information with a secret key, but it is vulnerable to pruning [20]. DCT middle frequency coefficients replacement maintains the classification accuracy. The method has limited payload embedding [21]. Kernel information embedding generates robustness against fine-tuning and pruning. It requires high computational processing [22]. The protection of neural networks using layers is achieved through keys that modify the parameters, enabling multiple verification checks. If these layers are removed, model performance decreases [23]. ChainMarks embeds watermarks using cryptographic sequences for ownership verification even if the model has been modified or retrained. However, it creates a dependency on the original data [24]. Most methods focus on embedding information in the neural network parameters and assess performance based on the original task.

The proposed method enhances the robustness of neural networks against pruning, fine-tuning, overwriting and quantization attacks by embedding binary ownership information into the model weights. This is achieved through a steganographic bit replacement technique (15th bit) using the IEEE 754 floating-point representation to modify specific bits without affecting the performance. This approach was implemented in a variational autoencoder (VAE), but it is generalizable to other architectures. To minimize performance degradation, ownership information is embedded in stable layers with low gradient magnitudes, as these layers are less sensitive to small parameter changes. Experimental results demonstrate that the embedding process is imperceptible in terms of accuracy loss and robustness against optimization attacks. The main contributions are:

The bit replacement technique is used for the binary ownership information embedding into neural network weights, ensuring imperceptibility and preserving the model performance.
The incorporation of the IEEE 754 standard ensures that the embedded information remains undetectable and can be recovered for neural network ownership verification.
The combination of IEEE 754 representation and the 15th bit replacement from the selected weight value improves the robustness against attacks, including fine-tuning, pruning and parameter overwriting.
Gradient analysis identifies stable layers to enhance security and robustness against structural optimizations of the model and unauthorized tampering.

This methodology contributes to neural network security by introducing a robust and efficient steganographic framework for intellectual property protection.

2. Methods

The proposed method embeds a binary sequence into the weights by modifying specific bits in their IEEE 754 representation, enabling imperceptible ownership information. During authentication, the embedded sequence is retrieved to verify model ownership.

Figure 1 illustrates the data embedding and extraction process. A VAE is employed to demonstrate the effectiveness of the proposed method, preserving its performance by reconstructing the original image in case of tampering.

Figure 1

The flowchart shows the process of embedding and extracting a binary sequence from neural network weights.

View large Download slide

The figure presents the complete workflow for neural network protection, authentication, and performance evaluation based on a variational autoencoder (V A E). The process starts with an image dataset O subscript k, where each image is resized to 128 × 128 pixels before being processed by the encoder. The encoder consists of consecutive layers with 32, 64, 128, 256, 512, and 1024 neurons, which progressively extract feature representations. The encoder output is mapped into a latent space through a reparameterization operation defined as z=μ+σϵ. The latent representation is then passed to the decoder, which reconstructs the images using layers with 1024, 512, 256, 128, 64, 32, and 3 neurons, enabling image reconstruction for performance evaluation of the model. In parallel, encoder weight extraction is performed using the average gradient of each layer to identify the most stable layers. The selected weights are converted into their I E E E 754 floating-point binary representation. For neural network protection, a binary watermark is embedded by replacing the 15th bit of each selected weight. The modified binary values are then converted back to decimal form and reintegrated into the encoder, producing modified encoder weights. For neural network authentication, the same stable layer selection and I E E E 754 conversion process is applied to extract the embedded watermark from the 15th bit of the selected weights. The extracted binary sequence is compared with the original watermark to verify model ownership and integrity.

Binary sequence embedding and extraction from neural network weights for model authentication

2.1 Weights extraction and selection

The weights from stable layers are selected to embed the binary sequence by modifying specific bits of each one on its IEEE 754 floating representation [25]. The selection of stable layers minimizes the performance degradation of the neural network and improves the robustness of the model against manipulations. Layer stability is estimated by analyzing the gradient magnitudes, where lower values indicate more stability. The gradient $▿$ measures how a parameter (weight or bias) changes. For a given weight $w$ and a dataset $O_{k}$ ⁠, k = 1, …, L, the gradient is defined as (1):

∆_{w} L o s s (O_{k}, w) = \frac{\partial L o s s (O_{k}, w)}{\partial w}

(1)

The model parameters were updated during training by using the gradient descent algorithm (2) to minimize the loss function.

w \leftarrow w - η \cdot ▿_{w} L o s s (O_{k}; w)

(2)

where η is the learning rate and w is updated (⁠ $\leftarrow$ ⁠) with gradient descent. Therefore, the average gradient magnitude $G_{k}$ is computed across the dataset, $O_{k}$ which is calculated as (3):

G_{k} = \frac{1}{k} \sum | ▿_{w} L o s s (O_{k}; w) ‖

(3)

After calculating the gradient magnitude, a list containing each layer (layer) and the corresponding average gradient magnitude, $G_{k}$ ⁠, is generated. This list is sorted by SG in ascending order (⁠ $↑$ ⁠) according to the $G_{k}$ values to identify the most stable layers (4):

S G = s o r t ({l a y e r, G_{k}}, b y G_{k} ↑)

(4)

The first four layers in SG with the smallest gradients were selected to embed binary information. This strategy improves robustness since these stable layers are less susceptible to fine-tuning. Then, the weight tensor from encoder $W_{E}$ (5) is extracted to embed the binary information on the stable layers.

W_{E} = {w_{i j} | i = 1, . . ., m; j = 1, . . ., n}

(5)

where i and j are the index of the layer and the neuron, respectively. Therefore, the binary sequence was embedded in the neural network weights, using the bit-replacement steganography method.

Figure 2 presents the pseudocode steps to illustrate the selection of the stable weights procedure employed during the embedding process.

Figure 2

Steps for stable weight parameters selection using gradient-based ranking.

View large Download slide

Figure 2 illustrates the algorithm used to select stable weight parameters from a trained neural network model. The input to the algorithm is the set of trained weight parameters w obtained from model M, and the output is the set of selected stable encoder weights W subscript E. The process begins by initializing a list S subscript G to store the average gradient values of each layer. For each weight node in the model, the gradient is computed. Then, for each layer in the model over the dataset O subscript k, the average gradient magnitude G subscript k is calculated and stored together with the corresponding layer index in S subscript G. The list S subscript G is subsequently sorted in ascending order according to the average gradient magnitude, allowing the layers with the smallest gradient values to be identified as the most stable. The first N layers with the lowest G subscript k values are selected. Finally, the weight parameters corresponding to the selected stable layers are extracted from the encoder and stored in the set W subscript E, which is returned as the output of the algorithm.

Pseudocode for stable weights selection

2.2 Steganography 15th bit-replacement on the neural network weights

Ownership information is embedded in the neural network by replacing the 15th bit of the IEEE 754 floating-point representation in its 32-bit format to encode the watermark. The IEEE 754 format increases security and minimizes perceptibility. This format transforms each floating-point number into its binary representation using three components (6): (1) Sign (S, 1 bit), (2) Exponent (E, 8 bits) and (3) Mantissa (M, 23 bits).

w_{I E E E} = S (1 bit) | E (8 bits) | M (23 bits)

(6)

The Sign (S) indicate whether the number is positive or negative, represented by the most significant bit (MSB) (7):

S = {\begin{cases} \begin{array}{c} 0 & i f & (+) p o s i t i v e \end{array} \\ \begin{array}{c} 1 & i f & (-) n e g a t i v e \end{array} \end{cases}

(7)

The Exponent (E) defines the magnitude of the represented number. In the 32-bit format, the exponent is stored in 8 bits. For example, the binary number 1000.01₂ can be normalized and represented as: 1.0000 $1_{2}$ x $2^{3}$ ⁠, E = 3. Therefore, the stored exponent value $E_{r}$ is computed as (8) by adding a bias of 127, which is specific to the 32-bit floating-point format of the IEEE 754 representation.

E_{r} = E + b i a s

(8)

The Mantissa (M) represents the fractional part of a decimal number. This element consists of 23 bits denoted as ${f r}_{1}, \dots, {f r}_{v}$ ⁠, where each ${f r}_{i} \in {0, 1}$ ⁠, and it is expressed as (9):

M = 1 . f r_{1}, . . ., f r_{v}

(9)

This binary representation is obtained through the following process: 1) Multiply the fractional part by 2. 2) The integer part of the result becomes the next bit. 3) The remaining fractional part is used for the next iteration. For example, for 0.75 in binary: Step 1: 0.75 × 2 = 1.5, integer part = 1 (first bit), fractional part = 0.5. Step 2: 0.5 × 2 = 1.0, integer part = 1 (second bit) and fractional part = 0. Conversion ends since the fractional part is zero. The binary representation of 0.75 is 0.11₂. Therefore, the Mantissa is M = 11,000…0 (21 zeros added to complete 23 bits).

Once the IEEE 754 representation of the weights is obtained, the 15th bit is modified ${m w}_{I E E E}$ with the binary sequence $s_{b}$ (10). The 15th bit was selected because its modification has a minimal impact on the performance of the model.

m w_{I E E E} = {w_{I E E E} (15^{t h} b i t) = s_{b} (l)

(10)

where l is the corresponding bit from the binary sequence $s_{b}$ ⁠. Therefore, the modified weights ${m w}_{I E E E}$ are converted to their decimal representations. The recovery of the original decimal values from the IEEE 754 representation involves extracting the sign bit $S_{r}$ (11), exponent E’ (12) and mantissa $M_{r}$ (13), with the final value computed using (14):

S_{r} = {\begin{cases} \begin{array}{c} (+) p o s i t i v e & i f & {(- 1)}^{0} = 1 \end{array} \\ \begin{array}{c} (-) n e g a t i v e & i f & {(- 1)}^{1} = - 1 \end{array} \end{cases}

(11)

E^{'} = E_{r} - 127

(12)

M_{r} = 1 + \sum_{i = 1}^{23} f r_{v} {(2)}^{- 1}

(13)

X = {(- 1)}^{S_{r}} (M) {(2)}^{E^{'}}

(14)

where $S_{r}$ is the bit sign (0 = positive, 1 = negative), $E_{r}$ is the decimal value from the stored exponent, ${f r}_{v}$ is the decimal value from the Mantissa M and X denotes the value of the number in decimal obtained from its IEEE 754 representation. Finally, the weights parameters were restored to their decimal forms.

Figure 3 illustrates the pseudocode of the watermark embedding process, detailing the steps required to integrate the binary sequence into the selected stable weights.

Figure 3

Steps for modifying weights using I E E E 754 floating-point conversion.

View large Download slide

Figure 3 illustrates the algorithm used to embed a binary watermark into the stable weight parameters of the neural network. The input to the algorithm is the set of stable encoder weights W subscript E, and the output is the set of modified weight parameters. For each weight in W subscript E, the weight is first converted into its I E E E 754 floating-point binary representation. A binary watermark bit is then embedded by replacing the 15th bit of the I E E E 754 representation with the corresponding watermark bit. After the bit replacement, the modified binary weight is converted back into its decimal floating-point representation. Finally, each original stable weight is updated with its corresponding modified decimal value, resulting in the watermarked weight set.

Pseudocode for watermark embedding

2.3 Ownership authentication of deep learning models

The ownership authentication protects against unauthorized use, ensuring the integrity of the model by extracting the embedded information into the weight parameters from the neural network.

First, it is necessary to locate the modified by calculating the gradient to identify the most stable layers. During verification, the gradients are recalculated to accurately locate these stable layers. For this reason, the authentication process uses the modified weight tensor ${m w}_{I E E E}$ (15), which is converted to IEEE 754 format to extract the embedded sequence using in (16), using (6)-(8).

W_{M E} = {m w_{i j} | i = 1, . . ., m; j = 1, . . ., n}

(15)

m w_{I E E E} = S (1 bit) | E (8 bits) | M (23 bits)

(16)

Therefore, the 15th bit of each weight in the IEEE 754 representation is extracted to recover the embedded binary information $s_{b r}$ ⁠, (17), which is used to verify the authenticity of the model.

s_{b r} (l) = {s_{b r} = 15^{t h} b i t m w_{I E E E 754}

(17)

The authentication process verifies the ownership of the neural network by retrieving the embedded information. Furthermore, the proposed method preserves the model accuracy and provides a secure methodology for intellectual property protection.

Figure 4 presents the pseudocode for watermark retrieval, showing the procedure used to extract the embedded binary sequence from the stable weights.

Figure 4

Steps for retrieving the watermark from the neural network weights.

View large Download slide

Figure 4 illustrates the algorithm used to retrieve the embedded binary watermark from the modified stable weight parameters of the neural network. The input to the algorithm is the set of modified stable weights WME, and the output is the retrieved watermark sequence. For each weight in W subscript ME, the weight is converted into its I E E E 754 floating-point binary representation. The watermark retrieval process consists of extracting the 15th bit from each binary weight representation. The extracted bits are collected sequentially to form the retrieved watermark s subscript br, which is returned as the output of the algorithm and used for neural network authentication.

Pseudocode for watermark retrieval

3. Results

This section presents the validation of the robustness against attacks such as quantization, pruning, fine-tuning, and noise injection. In addition, the VAE performance was assessed under image forgery scenarios involving object addition and removal. For evaluation, the MICC-F220 [26], coverage [27], realistic tampering [28] and a proprietary dataset were employed to provide a comprehensive assessment. Results demonstrate that the proposed method preserves the performance of the neural network even when the weights are modified. The algorithm was implemented in Python with PyTorch and executed on a system with a NVIDIA GeForce-4050 GPU and an Intel-CoreUltra-7 processor.

3.1 Neural network performance with modified weights (image reconstruction)

The neural network performance was evaluated by comparing the quality of the reconstructed image against the original image. PSNR measures the ratio between the original image $I_{o}$ and the noise in the reconstructed image $I_{r}$ (18).

P S N R = 10 \log_{10} (\frac{255^{2}}{M S E})

(18)

where MSE is the mean squared error (19).

M S E = \frac{1}{R \cdot C} \sum_{x = 0}^{R - 1} \sum_{y = 0}^{C - 1} {(I_{o} (x, y) - I_{r} (x, y))}^{2}

(19)

R and C denote the number of rows and columns and (x, y) the pixel coordinates. In contrast, SSIM evaluates image quality based on luminance, texture and structural similarity (20).

S S I M (I_{o}, I_{r}) = \frac{(2 μ_{I_{o}} μ_{I_{r}} + C_{1}) (2 σ_{I_{o}, I_{r}} + C_{2})}{(μ_{I_{o}}^{2} + μ_{I_{r}}^{2} + C_{1}) (σ_{I_{o}}^{2} + σ_{I_{r}}^{2} + C_{2})}

(20)

where $μ_{I_{o}}, μ_{I_{r}}$ represent the mean values, $σ_{I_{o}}, σ_{I_{r}}$ are the image variances and $σ_{I_{o}, I_{r}}$ is the covariance. Finally, $C_{1}$ and $C_{2}$ are constant A higher SSIM indicates structural similarity between the reconstructed and original images.

Table 1 shows that the reconstruction of manipulated images using the neural network with modified weights is comparable to the performance of models with unmodified weights, as it is reflected in the PSNR and SSIM values.

Table 1

Image reconstruction with modified weights

Figure 5 compares the image reconstruction accuracy for 20 random images. This comparison shows that SSIM values from the modified and the original model are similar, demonstrating that embedding information has minimal impact on reconstruction quality.

Figure 5

The graphs compare the S S I M values of modified and original weights across different image databases.

View large Download slide

Figure 5 presents a comparison of the S S I M between neural networks using original weights and networks using modified (watermarked) weights across four different image databases. In all graphs, the horizontal axis represents the number of images used for evaluation, while the vertical axis represents the S S I M value. Each graph includes two curves: a solid line corresponding to the modified weights and a dashed line corresponding to the original weights. All evaluations are performed using 20 images. Graph (a), corresponding to the M I C C-F220 database, shows relatively stable S S I M values for both configurations, with the modified weights maintaining S S I M values around 0.80, while the original weights consistently achieve higher S S I M values close to 0.90. Graph (b), corresponding to the Realistic Tampering database, exhibits noticeable fluctuations in S S I M values. Both modified and original weights show variations across the evaluated images, with S S I M values decreasing toward the end of the evaluation. Graph (c), corresponding to the Coverage database, shows similar fluctuation patterns for both modified and original weights, with S S I M values ranging from mid to high levels. A slight decrease is observed for the original weights at higher image counts. Graph (d), corresponding to the High-Resolution database, demonstrates the largest variation in S S I M values. The modified weights exhibit a wide range of S S I M values, while the original weights show both flat regions and sharper decreases at specific image counts. Overall, the figure illustrates that the use of modified weights for watermark embedding preserves acceptable image reconstruction quality while maintaining S S I M behavior comparable to that of the original network across different datasets.

SSIM comparison of reconstructed images using original and modified VAE weights: (a) SSIM MICC-F220, (b) SSIM realistic tampering, (c) SSIM coverage and (d) SSIM high resolution

Figure 6 presents the processing time required for the recovery of the embedded information and image reconstruction. Images from the high-resolution dataset need more processing time during processing and reconstruction for their size.

Figure 6

Comparison of the processing time across four image databases from 20 images.

View large Download slide

Figure 6 presents a comparison of the authentication and image reconstruction processing time of the proposed neural network across different image databases. The horizontal axis represents the number of images used in the evaluation, while the vertical axis represents the processing time in seconds. Each curve corresponds to a different dataset, including M I C C-F220, Realistic Tampering, Coverage, and High-Resolution databases. The High-Resolution database exhibits the highest and most consistent processing time across all evaluated image counts, reflecting the increased computational complexity associated with higher-resolution images. In contrast, the M I C C-F220 database shows the lowest processing times, remaining close to a constant value with only a slight increase at higher image counts. The Realistic Tampering and Coverage databases present moderate processing times, with occasional peaks indicating increased computational demand during authentication and image reconstruction. Despite these fluctuations, the overall processing time remains within a narrow range for all datasets. Overall, the figure demonstrates that the proposed neural network maintains efficient and scalable performance across different databases, even when processing images of varying complexity and resolution.

Processing time for information retrieval

3.2 Information embedding imperceptibility

The evaluation of the information embedding imperceptibility used two binary sequences of the histograms with different payloads.

Figure 7 demonstrates that the embedding process preserves the statistical distribution of the neural network weights. The comparison is made with two payload configurations: 152 and 1,500 embedded bits. The results show the histograms of the original and watermarked weights are similar, confirming that the embedded information is imperceptible. This indicates that the embedding method maintains weight integrity without introducing detectable changes and preserves the statistical distribution of the parameters.

Figure 7

The histograms compare original and modified weights at 1500 bits and 152 bits, showing similar weight value distributions.

View large Download slide

Figure 7 presents a comparison of the distributions of original and modified neural network weights for two different watermark sizes. The figure consists of four histograms arranged in a two-by-two layout. The top row corresponds to a watermark size of 1500 bits, while the bottom row corresponds to a watermark size of 152 bits. The top-left histogram shows the distribution of the original weights for the 1500-bit case, while the top-right histogram shows the distribution of the modified weights after watermark embedding. In both cases, the weight values are concentrated around zero, with fewer values appearing toward the positive and negative extremes. The bottom-left histogram shows the distribution of the original weights for the 152-bit case, and the bottom-right histogram shows the distribution of the modified weights. Despite the smaller number of weights, both distributions remain centered near zero and exhibit similar shapes. Overall, the figure demonstrates that the watermark embedding process does not significantly alter the statistical distribution of the network weights, regardless of the watermark size, thereby preserving the original characteristics of the trained model.

Histogram comparison (a) 1,500 original weights (b) 1,500 modified weights, (c) 152 original weights and (d) 152 modified weights

3.3 Information retrieval from neural network weights

To assess the efficiency of the recovery process, the bit error rate (BER) was used (21) to measure the number of erroneous bits.

B E R = \frac{\sum (s_{b} (l) \oplus s_{b r} (l))}{(L^{'})}

(21)

where $\oplus$ is the XOR operation, L′ is the length of the signal $s_{b}$ ⁠, $s_{b}$ and $s_{b r}$ are the binary information sequence and the recovered binary sequence, respectively.

Table 2 presents image reconstruction performance and information retrieval for different embedding sequence lengths. The number of embedded bits increases from 15,200 bits to 22,829,811 bits, the SSIM of the reconstructed images shows a slight decrease (SSIM_decrease<0.1) between the original model and the modified model, demonstrating that the method does not modify the performance, while the BER from the retrieval information remains zero. These results confirm the robustness of the proposed method and do not compromise the neural network performance.

Table 2

Impact of embedded bit length on neural network performance

Database		Original weights	Bits embedded (15,200 bits)	Bits embedded (total weights = 22,829,811)
MICC-F220	Reconstruction SSIM	0.8135	0.8053	0.8042
MICC-F220	Retrieved information BER	–	0	0
Realistic tampering	Reconstruction SSIM	0.7398	0.7365	0.7353
Realistic tampering	Retrieved information BER	–	0	0
Coverage	Reconstruction SSIM	0.8018	0.8003	0.7998
Coverage	Retrieved information BER	–	0	0
High-resolution	Reconstruction SSIM	0.7234	0.7194	0.7205
High-resolution	Retrieved information BER	–	0	0

Database		Original weights	Bits embedded (15,200 bits)	Bits embedded (total weights = 22,829,811)
MICC-F220	Reconstruction SSIM	0.8135	0.8053	0.8042
MICC-F220	Retrieved information BER	–	0	0
Realistic tampering	Reconstruction SSIM	0.7398	0.7365	0.7353
Realistic tampering	Retrieved information BER	–	0	0
Coverage	Reconstruction SSIM	0.8018	0.8003	0.7998
Coverage	Retrieved information BER	–	0	0
High-resolution	Reconstruction SSIM	0.7234	0.7194	0.7205
High-resolution	Retrieved information BER	–	0	0

Table 3 evaluates the robustness against pruning attacks. Two scenarios were tested: pruning all four embedding layers and pruning only two layers. Results show the method preserves the information even at pruning of 50%. To reduce information loss, redundant embedding in bias parameters is suggested.

Table 3

Sequence retrieval BER from neural network weights pruning

Binary sequence of 15,200 bits
Pruning	4-layer pruning	2-layer pruning
10%	0.0351	0.0221
30%	0.1032	0.0651
50%	0.1837	0.1193

Table 4 shows when information is redundantly embedded in bias, the BER is reduced to 0 because pruning techniques are typically applied to the weights. Furthermore, the method was evaluated under a different manipulation, including quantization, fine-tuning, weight bit values overwriting and noise injection.

Table 4

Binary information sequence retrieval BER from bias redundancy

Binary sequence of 15,200 bits
Pruning	4-layer pruning	2-layer pruning
10%	0	0
30%	0	0
50%	0	0

Table 5 presents the robustness of the recovery process for different attacks. Modifications to the weights have a minimal effect on information recovery. However, higher noise levels increase the BER, which affects the accuracy of the recovered information. Bit overwriting shows no impact on the retrieval of the embedded information, which demonstrates the selection of the 15th bit is resistant to manipulations.

Table 5

Binary information sequence retrieval BER under different attacks

Binary sequence of 152 bits
Attack	BER
Gaussian noise µ = 0, σ² = 0.00001	0.2763
Speckle noise (σ² = 0.0005)	0.1315
Salt and pepper noise (δ = 0.02)	Weights = 0.0263 Bias = 0.0065
Salt and pepper noise (δ = 0.2)	Weights = 0.125 Bias = 0.1118
Bit overwriting on the LSB	0
Bit overwriting on the 25^th position	0
Quantization	0.175

Binary sequence of 152 bits
Attack	BER
Gaussian noise µ = 0, σ² = 0.00001	0.2763
Speckle noise (σ² = 0.0005)	0.1315
Salt and pepper noise (δ = 0.02)	Weights = 0.0263Bias = 0.0065
Salt and pepper noise (δ = 0.2)	Weights = 0.125Bias = 0.1118
Bit overwriting on the LSB	0
Bit overwriting on the 25^th position	0
Quantization	0.175

Table 6 presents the BER of the sequence retrieval after fine-tuning for different numbers of epochs. The results indicate that fine-tuning has an impact on the embedded data since LSB were modified while the bits containing the embedded information remain intact. The BER remains low across all fine-tuning epochs, demonstrating the stability of the proposed method, which leverages embedding information using the gradient to select the most stable layers.

Table 6

Binary information sequence retrieval BER from fine-tuning

Binary sequence of 152 bits
Fine-tuning	BER
5 epochs	0.0328
10 epochs	0.0328
20 epochs	0.0197
50 epochs	0.0131
100 epochs	0.0394

3.4 Method comparison

The proposed method was compared with existing techniques reported in the literature, in which most approaches primarily focused on evaluating the performance of a neural network with embedded data. However, some methodologies focus on information recovery under specific network alterations.

Table 7 presents a comparative analysis with other methods and demonstrate the robustness of the proposed method. Kernel embedding and multi-bit replacement methods achieve a perfect recovery under some scenarios; their evaluations are limited to specific neural network optimizations. In contrast, the proposed method was comprehensively tested under more parameters optimizations and manipulations. Most of the previous studies are tested on classification networks, which may not illustrate the impact of embedded information on the network.

Table 7

Performance comparison

Author	Methodology	Attacks
[18]	Kernel embedding	Overwriting BER = 0 Fine-Tunning BER = 0
[20]	Multi-bit replacement and image embedding	Pruning BER = 0.10 Quantization BER = 0
[21]	Multiple watermarking embedding different keys	Pruning BER = 0.25
Proposed method	IEEE 754 bit replacement	Fine-Tunning BER = 0.394 Pruning BER = 0.1444 Noise Injection BER = 0.1888 Overwriting BER = 0 Quantization BER = 0.175

Author	Methodology	Attacks
[18]	Kernel embedding	OverwritingBER = 0Fine-TunningBER = 0
[20]	Multi-bit replacement and image embedding	PruningBER = 0.10QuantizationBER = 0
[21]	Multiple watermarking embedding different keys	PruningBER = 0.25
Proposed method	IEEE 754 bit replacement	Fine-TunningBER = 0.394PruningBER = 0.1444Noise InjectionBER = 0.1888OverwritingBER = 0QuantizationBER = 0.175

4. Discussion

Table 1 shows that the proposed method does not compromise the forgery image reconstruction performance of the neural network with modified weights, although reconstruction quality slightly decreases for more complex manipulations compared to the unmodified network. In this context, Figure 7 illustrates the imperceptibility of the embedded information, as the histogram remains unchanged.

Table 2 evaluates the method under pruning attacks, achieving high information retrieval accuracy even with 50% pruning. Table 5 shows the robustness of the approach during neural network optimization. Bit overwriting does not affect the retrieval of the embedded data. However, the proposed method has some limitations. High noise levels can alter the values of the modified weights, including the bits containing the embedded information, which increases the BER and reduces retrieval accuracy. Similarly, high pruning removes a significant number of the network weights, eliminating those that were modified with the embedded data, compromising the information retrieval process.

5. Conclusions

This paper introduces a steganographic method using the IEEE 754 standard to embed ownership information directly into neural network weights. The technique ensures imperceptibility and maintains model performance. The proposed method shows robustness for model optimizations such as pruning, quantization and fine-tuning. Experimental results demonstrate the imperceptibility of the embedded information as the statistical distribution of the neural network weights remains unchanged. The results demonstrate robustness in information retrieval, even when up to 50% of the weights from the selected are pruned. Nevertheless, a slight increase in the BER is observed as the pruning ratio grows, which can be attributed to the removal of some modified weight.

Future work will extend this approach to protect the neural network and the associated data against manipulation. Additionally, the network protection mechanisms could be leveraged in the detection of images from generative models by integrating structural components. This design includes activation functions into the inference process by incorporating watermark into the data. Additionally, it is considered the adaptation of image watermarking techniques such as zero-watermarking or data hiding.

Ethics statement

This study did not involve human participants or animals; therefore, ethical approval was not require ethical approval.

The authors thank to the Secretaría de Ciencia, Humanidades, Tecnología e Innovación (SECIHTI) of Mexico and the Instituto Politécnico Nacional for the support provided during the realization of this research.

References

1.

Lin

H

,

Shen

S

,

Lyu

H

.

Protecting IP of deep neural networks with watermarking using logistic disorder generation trigger sets

.

Multimedia Tools Appl

.

2024

;

83

(

4

):

10735

-

54

. doi:

https://doi.org/10.1007/s11042-023-15980-z

.

Google Scholar

Crossref

2.

Liu

J

,

Li

Y

,

Guo

Y

,

Liu

Y

,

Tang

J

,

Nie

Y

.

Generation and countermeasures of adversarial examples on vision: a survey

.

Artif Intell Rev

.

2024

;

57

(

8

):

199

. doi:

https://doi.org/10.1007/s10462-024-10841-z

.

Google Scholar

Crossref

3.

Wang

S

,

Nepal

S

,

Rudolph

C

,

Grobler

M

,

Chen

S

,

Chen

T

.

Backdoor attacks against transfer learning with pre-trained deep learning models

.

IEEE Trans Serv Comput

.

2022

;

15

(

3

):

1526

-

39

. doi:

https://doi.org/10.1109/TSC.2020.3000900

.

Google Scholar

Crossref

4.

Xu

G

,

Li

H

,

Ren

H

,

Yang

K

,

Deng

RH

.

Data security issues in deep learning: attacks, countermeasures, and opportunities

.

IEEE Commun Mag

.

2019

;

57

(

11

):

116

-

22

. doi:

https://doi.org/10.1109/MCOM.001.1900091

.

Google Scholar

Crossref

5.

Wang

R

,

Lin

C

,

Zhao

Q

,

Zhu

F

.

Watermark Faker: towards forgery of digital image watermarking

. In:

Proc IEEE Int Conf Multimedia Expo (ICME)

.

2021

.

1

, -

6

. doi:

https://doi.org/10.1109/ICME51207.2021.9428410

.

Google Scholar

Crossref

6.

Puttagunta

MK

,

Ravi

S

,

Kennedy Babu

CN

.

Adversarial examples: attacks and defences on medical deep learning systems

.

Multimedia Tools Appl

.

2023

;

82

:

33773

-

809

. doi:

https://doi.org/10.1007/s11042-023-14702-9

.

Google Scholar

Crossref

7.

Sun

X

,

Sun

S

.

Adversarial robustness and attacks for multi-view deep models

.

Eng Appl Artif Intell

.

2021

;

97

: 104085. doi:

https://doi.org/10.1016/j.engappai.2020.104085

.

Google Scholar

8.

Apruzzese

G

,

Andreolini

M

,

Ferretti

L

,

Marchetti

M

,

Colajanni

M

.

Modeling realistic adversarial attacks against network intrusion detection systems

.

Digit Threats Res Pract

.

2022

;

3

(

3

):

1

-

19

. doi:

https://doi.org/10.1145/3469659

.

Google Scholar

Crossref

9.

Anthi

E

,

Williams

L

,

Rhode

M

,

Burnap

P

,

Wedgbury

A

.

Adversarial attacks on machine learning cybersecurity defences in industrial control systems

.

J Inf Secur Appl

.

2021

;

58

: 102717. doi:

https://doi.org/10.1016/j.jisa.2020.102717

.

Google Scholar

10.

Uchida

Y

,

Nagai

Y

,

Sakazawa

S

,

Satoh

S

.

Embedding watermarks into deep neural networks

. In:

Proc ACM Int Conf Multimedia Retrieval (ICMR)

;

2017

. p.

269

-

77

. doi:

https://doi.org/10.1145/3078971.3078974

.

Google Scholar

Crossref

11.

Tyagi

T

,

Singh

AK

.

Deep learning models security: a systematic review

.

Comput Electr Eng

.

2024

;

120

(

Pt B

): 109792. doi:

https://doi.org/10.1016/j.compeleceng.2024.109792

.

Google Scholar

12.

Ingle

G

,

Pawale

S

.

Enhancing adversarial defense in neural networks by combining feature masking and gradient manipulation on the MNIST dataset

.

Int J Adv Comput Sci Appl

.

2024

;

15

(

1

). doi:

https://doi.org/10.14569/IJACSA.2024.01501114

.

Google Scholar

13.

Qin

R

,

Wang

L

,

Du

X

,

Xie

P

,

Chen

X

,

Yan

B

.

Adversarial robustness in deep neural networks based on variable attributes of the stochastic ensemble model

.

Front Neurorobot

.

2023

;

17

: 1205370. doi:

https://doi.org/10.3389/fnbot.2023.1205370

.

Google Scholar

14.

Zhang

J

,

Gu

Z

,

Jang

J

,

Wu

H

,

Stoecklin

MP

,

Huang

H

,

Molloy

I

.

Protecting intellectual property of deep neural networks with watermarking

. In:

Proc Asia Conference on Computer and Communications Security (ASIACCS ’18)

;

2018

. p.

159

-

72

. doi:

https://doi.org/10.1145/3196494.3196550

.

Google Scholar

Crossref

15.

Li

L

,

Bai

Y

,

Chang

C-C

,

Fan

Y

,

Gu

W

,

Emam

M

.

Anti-pruning multi-watermarking for ownership proof of steganographic autoencoders

.

J Inf Secur Appl

.

2023

;

76

: 103548. doi:

https://doi.org/10.1016/j.jisa.2023.103548

.

Google Scholar

16.

Gu

W

,

Chang

C-C

,

Bai

Y

,

Fan

Y

,

Tao

L

,

Li

L

.

Multipurpose watermarking approach for copyright and integrity of steganographic autoencoder models

.

Secur Commun Netw

.

2021

;

2021

:

12

. doi:

https://doi.org/10.1155/2021/9936661

.

Google Scholar

Crossref

17.

Fkirin

A

,

Moursi

A

,

Attiya

G

,

El-Sayed

A.

,

Shouman

MA

.

Hybrid two-level protection system for preserving pre-trained DNN models ownership

.

Neural Comput Appl

.

2024

;

36

(

34

):

21415

-

49

. doi:

https://doi.org/10.1007/s00521-024-10304-0

.

Google Scholar

Crossref

18.

Nagai

Y

,

Uchida

Y

,

Sakazawa

S

,

Satoh

S

.

Digital watermarking for deep neural networks

.

Int J Multimed Inf Retr

.

2018

;

7

(

1

):

3

-

16

. doi:

https://doi.org/10.1007/s13735-018-0147-1

.

Google Scholar

Crossref

19.

Wong

WK

,

Juwono

FH

,

Eswaran

S

,

Motelebi

F

.

Intrusion detection system model: a white-box decision tree with feature selection optimization

.

Neural Comput Appl

.

2025

;

37

(

7

):

5655

-

70

. doi:

https://doi.org/10.1007/s00521-024-10942-4

.

Google Scholar

Crossref

20.

Li

L

,

Zhang

W

,

Barni

M

.

Universal blackmarks: key-image-free blackbox multi-bit watermarking of deep neural networks

.

IEEE Signal Process Lett

.

2023

;

30

:

36

-

40

. doi:

https://doi.org/10.1109/LSP.2023.3239737

.

Google Scholar

Crossref

21.

Kakikura

S

,

Kang

H

,

Iwamura

K

.

Deep learning model protection using negative correlation-based watermarking with best embedding regions

. In:

Proc IEEE Int Conf Tools Artif Intell (ICTAI)

;

2021

. p.

1345

-

51

. doi:

https://doi.org/10.1109/ICTAI52525.2021.00031

.

Google Scholar

Crossref

22.

Xie

C

,

Yi

P

,

Zhang

B

,

Zou

F

.

DeepMark: embedding watermarks into deep neural network using pruning

. In:

Proc IEEE Int Conf Trust

.

Secur Priv Comput Commun

;

2021

. p.

144

-

51

. doi:

https://doi.org/10.1109/TrustCom53373.2021.00028

.

Google Scholar

Crossref

23.

Stang

J

,

Krauß

T

,

Dmitrienko

A

.

DNNShield: embedding identifiers for deep neural network ownership verification

.

arXiv:2403.06581

;

2024

. doi:

https://doi.org/10.48550/arXiv.2403.06581

.

Google Scholar

24.

Choi

B

,

Wang

S

,

Choi

I

,

Sun

K

.

ChainMarks: securing DNN watermark with cryptographic chain

. In:

Proc ACM Asia Conference on Computer and Communications Security

;

2025

. p.

442

-

55

. doi:

https://doi.org/10.1145/3708821.3736214

.

Google Scholar

Crossref

25.

Bagnara

R

,

Bagnara

A

,

Biselli

F

,

Chiari

M

,

Gori

R

.

Correct approximation of IEEE 754 floating-point arithmetic for program verification

.

Constraints

.

2022

;

27

(

1-2

):

29

-

69

. doi:

https://doi.org/10.1007/s10601-021-09322-9

.

Google Scholar

Crossref

26.

Amerini

I

,

Ballan

L

,

Caldelli

R

,

Del Bimbo

A

,

Serra

G

.

A SIFT-based forensic method for copy-move attack detection and transformation recovery

.

IEEE Trans Inf Forensics Secur

.

2011

;

6

(

3

):

1099

-

112

. doi:

https://doi.org/10.1109/TIFS.2011.2129512

.

Google Scholar

Crossref

27.

Wen

B

,

Zhu

Y

,

Subramanian

R

,

Ng

T

,

Shen

X

,

Winkler

S

.

COVERAGE – a novel database for copy-move forgery detection

. In:

Proc IEEE Int Conf Image Process (ICIP)

;

2016

. doi:

https://doi.org/10.1109/ICIP.2016.7532339

.

Google Scholar

Crossref

28.

Korus

P

,

Huang

J

.

Multi-scale analysis strategies in PRNU-based tampering localization

.

IEEE Trans Inf Forensics Secur

.

2017

;

12

(

4

):

809

-

24

. doi:

https://doi.org/10.1109/TIFS.2016.2636089

.

Google Scholar

Crossref

2025

Rodrigo Eduardo Arevalo-Ancona and Manuel Cedillo-Hernandez

Published in Applied Computing and Informatics. Published by Emerald Publishing Limited. This article is published under the Creative Commons Attribution (CC BY 4.0) licence. Anyone may reproduce, distribute, translate and create derivative works of this article (for both commercial and non-commercial purposes), subject to full attribution to the original publication and authors. The full terms of this licence may be seen at Link to the terms of the CC BY 4.0 licence.

Intellectual property protection of neural network architectures via steganography and IEEE 754 standard

1. Introduction

2. Methods

2.1 Weights extraction and selection

2.2 Steganography 15th bit-replacement on the neural network weights

2.3 Ownership authentication of deep learning models

3. Results

3.1 Neural network performance with modified weights (image reconstruction)

3.2 Information embedding imperceptibility

3.3 Information retrieval from neural network weights

3.4 Method comparison

4. Discussion

5. Conclusions

Ethics statement

References

Email Alerts

Cited By

Intellectual property protection of neural network architectures via steganography and IEEE 754 standard Open Access

1. Introduction

2. Methods

2.1 Weights extraction and selection

2.2 Steganography 15th bit-replacement on the neural network weights

2.3 Ownership authentication of deep learning models

3. Results

3.1 Neural network performance with modified weights (image reconstruction)

3.2 Information embedding imperceptibility

3.3 Information retrieval from neural network weights

3.4 Method comparison

4. Discussion

5. Conclusions

Ethics statement

References

Email Alerts

Suggested Reading

Related Chapters

Recommended for you

Cited By

Intellectual property protection of neural network architectures via steganography and IEEE 754 standard