An Adaptive Audio Watermarking Scheme Method Based on Kernel Fuzzy C-means Clustering
Автор: Honghong Chen, Zulin Zhang
Журнал: International Journal of Education and Management Engineering(IJEME) @ijeme
Статья в выпуске: 1 vol.2, 2012 года.
Бесплатный доступ
In this paper, we propose an adaptive audio watermarking scheme according to local audio features. Firstly, the original audio signal is partitioned into audio frames and these audio frames are transformed into DWT domain respectively. Next, the local features of each audio frame are extracted respectively, and these features are used to train kernel fuzzy c-means (KFCM) clustering algorithm. According to well-trained KFCM, the audio frames to embed the watermark are selected and their embedding strengths are determined adaptively. The experimental results show the proposed method is robust to common signal processing operations such as lossy compression (MP3), filtering, re-sampling, re-quantizing, etc.
Audio signal, audio watermarking, adaptive watermarking, kernel fuzzy c-means clustering algorithm
Короткий адрес: https://sciup.org/15013651
IDR: 15013651
Текст научной статьи An Adaptive Audio Watermarking Scheme Method Based on Kernel Fuzzy C-means Clustering
Digital audio watermarking technique provides efficient tools for ensuring that product ownership of audio multimedia is preserved, even if multimedia data is attracted by attackers. For an audio watermarking system, imperceptibility and robustness are its two basic requirements. Imperceptibility refers to the perceptual quality of the data being protected. For audio data, digital watermark should be inaudible. The digital watermark should also be robust to audio signal processing. Ideally, the amount of signal distortion necessary to remove the watermark can degrade the desired audio quality to point of becoming commercially valueless. Typical audio processing operations include lossy compression (such as MP3), additive noise, filtering, resampling, etc. Currently, many watermarking techniques have been developed, such as quantization-based techniques, relationship-based techniques, physiological model-based techniques, adaptive techniques, etc.
Recently, some machine learning methods, such as neural networks and support vector machines, are introduced into digital audio watermarking technique, and can simultaneously improve robustness and audible quality of the watermarked audio signal. In [1], neural networks were introduced into a nonblind audio watermarking scheme, which was used to estimate the watermark scaling factor intelligently from the host audio
Corresponding author:
signal, and make the watermark detection more robust against common attacks. Wang et al.[2] introduced SVM for audio watermark detection in DWT domain, which considered the watermark extraction as a two-class problem. In [3], Serap et al. presented an audio watermark decoding process based on SVM, which combined the watermark decoding and detection problems into a single classification problem. Xu et al. [4] proposed a DWT-based audio watermarking method using support vector regression (SVR) which was used to learning the correlation between the sub-audios obtained by sub-sampling technique. Peng et al.[5] proposed an audio watermarking method in multiwavelet domain based on SVM, where SVM was used to learn nonlinear relationship between local audio features in order to extract watermark signal.
In this paper, we propose an adaptive audio watermarking scheme. Firstly, we divide audio signal into audio frames, and these audio frames are transformed into DWT domain respectively. Secondly, we extract its local features for each audio frame, that is, its local energy and the maximal peaks of its all subbands. Next, through running kernel fuzzy c-means (KFCM) clustering algorithm on these features, we obtain the maximal fuzzy membership degree for each audio frame. According the fuzzy membership degree, we can select the audio frames to be embedded watermark and determine the embedding strength of each audio frame adaptively. The proposed method can simultaneously improve the robustness and audible quality of the watermarked audio signal.
The rest of this paper is organized as follows: Section II introduces the concept of kernel fuzzy c-means clustering algorithm. In Section III, we present the proposed audio watermarking approach. Simulation results for several watermarked audio manipulations are presented in Section IV. Finally, we draw our conclusions in Section V.
-
2. Kernel Fuzzy C-means Algorithm
Given an unlabeled data set X ={ x 1 , x 2 ,…, x m } ⊆ Rd , and a nonlinear mapping φ : Rd → F from this input space to a high-dimensional feature space F . By applying the nonlinear mapping φ , the dot product xi ⋅ x j in the space is mapped to φ ( x i ) ⋅φ ( x j ) in the feature space. The key notion in kernel-based learning is that the mapping φ need not be explicitly specified. The dot product φ ( xi ) ⋅φ ( x j ) in the high-dimensional feature space can be calculated through the kernel function K ( x i , x j ) in the input space. The kernel fuzzy c-means algorithm in the feature space F by a mapping φ minimizes the function Jr [6],[7]:
cm
J r ( X ) = ∑∑ ( µ ij ) r || φ ( x j ) - ν i φ ||2 i = 1 j = 1
where µij is the membership degree of data point xj to the ith fuzzy cluster, and r is a fuzziness coefficient. The ith cluster centroid is νiφ =ni-1∑mj=1(µij)rφ(xj) and ni=∑mj=1(µij)r
The key notion in the kernel fuzzy c-means algorithm lies in the calculation of the distance in the feature space. The distance between φ(xj) and νiφ in the feature space is calculated through the kernel in the input space:
II Ф(Xj) -vt II2
ф ( x j ) • ф ( x j ) - 2 Ф (x j ) •
Е mm=1(^ik) rф( xk) е mj m.) r
E m i( M ik ) r ф ( x k ) Е mi- 1( Mu ) r ф ( x i )
+---
е m = i ( M ik ) r е m j M il ) r
= ф ( x j ) • ф ( x j ) -
2E m=1(Mik) r ф(xk) •ф( xj-)
е mm = i ( M ik ) r
+ Е m=1 E L(Mik) r(Mu) rф( xk) •ф( x)
= K ( x j , xj) -
( Е m = i ( M ik ) r ) 2
2 E m j M ik ) r K ( X k , X j )
е m = i ( M ik ) r
+ е "=1 е mj Mik) r (Mil) rK (Xk, xi)
( Е m = = i ( M ik ) r )2
,
Where K ( xk , X i ) = ф (xk ) • ф (X i ) By using n( = E mm = 1( M ij ) r , we have
2 2m ll ^xXj)-vfii = K(Xj, Xj) - n~ EM)) k(Xk, Xj)
mm
+ EE ( M k ) r ( Mil ) r K ( x ) , X )
n i k =1 i =1
Therefore, the objective function can be rewritten as follows:
cm
J t ( X ) = EE ( M ij ) T ( K ( X j , X j )
i =1 j =1
mm
-
- ~EE ( M ik ) T ( M ii ) T K ( X k , X i ) n i k =1 i =1
The kernel fuzzy c-means algorithm iteratively updates the new membership degree M ij at each iteration. The update of M ij in the feature space is defined through the kernel in the input space as follows:
M j
Г
c
E k=1 V
( II Ф (X j ) - v f II2 V^( X j ) - v k II2
1/( r -1) A 1
From (3), the kernel fuzzy c-means algorithm does not need to calculate the cluster centroids because the centroid information is considered in updating the membership degree M ij ■
The kernel fuzzy c-means algorithm can be summarized as follows:
Step 1: Fix c , t ma x , r > 1 and e > 0 for some positive constant.
Step 2: Initialize the memberships µ i 0 j .
Step 3: For $t=1,2, , do:
Список литературы An Adaptive Audio Watermarking Scheme Method Based on Kernel Fuzzy C-means Clustering
- H.J. Yang, J.C. Patra, C.W. Chan, “An artificial neural network-based scheme for robust watermarking of audio signals”, ICASSP'02, 1, 2002, pp.I-1029-1032.
- J. Wang, F.Z. Lin, “Digital audio watermarking based on support vector machine”. Journal of Computer Research and Development, 2005 Vol.42, No.9, pp.1605-1611. (in Chinese)
- S. Kirbiz, B. Gunsel, “Robust audio watermark decoding by supervised learning”, Proceedings of ICASSP 2006, 5, 2006, pp.V-761- V-764.
- X.-J. Xu, H. Peng, C.-Y. He, “DWT-based audio watermarking using support vector regression and subsampling”, In F.Masulli, S.Mitra, and G.Pasi (Eds.): WILF2007, LNAI 4578, 2007, pp. 136-144.
- H. Peng, X. Wang, W.X. Wang, J. Wang, D.Y. Hu, “Audio watermarking approach based on audio features in multiwavelet domain”, Journal of Computer Research and Development, 2010, 47(2), pp.216-222. (in Chinese)
- D.-W. Kima, K.Y. Leeb, D. Leea, K.H. Leea, “Evaluation of the performance of clustering algorithms in kernel-induced feature space”, Pattern Recognition, 38, 2005, pp.607-611.
- J.W. Liu, M.Z. Xu, “Kernelized fuzzy attribute c-means clustering algorithm”, Fuzzy Sets and Systems, 159, 2008, pp.2428-2445.
- http://www.petitcolas.net/fabien/steganography/mp3stego/index.html.
- S.Q. Wu, J.W.Huang, Y.Q. Shi, “Efficiently self-synchronized audio watermarking for assured audio data transmission”, IEEE Trans. on Broadcast, 2005, Vol 51, No.1 pp. 69-76.