A Time-Frequency Perspective on Audio Watermarking

Paper title

A Time-Frequency Perspective on Audio Watermarking

Author

Zhang, Haijian

Abstract

Existing audio watermarking methods usually treat the host audio signal as a function of either time or frequency alone, while its treatment in the joint time-frequency (TF) domain has received less attention. This paper proposes an audio watermarking framework from the perspective of TF analysis. The proposed framework treats the host audio signal in the 2-dimensional (2D) TF plane and selects a series of patches within the 2D TF image. These patches correspond to the TF clusters with minimum averaged energy and are used to form the feature vectors for watermark embedding. Classical spread spectrum embedding schemes are incorporated in the framework. The feature patches that carry the watermarks occupy only a few TF regions of the host audio signal, which improves imperceptibility. In addition, since each feature patch covers a neighborhood in the TF representation of the audio samples, the correlations among the samples within a single patch can be exploited for improved robustness against a series of processing attacks. Extensive experiments illustrate the effectiveness of the proposed system compared with its counterpart systems. The aim of this work is to shed some light on the notion of audio watermarking in the TF feature domain, which may lead to watermarking solutions that are more robust against malicious attacks.
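
The pipeline outlined in the abstract (TF representation, selection of minimum-energy patches, spread spectrum embedding) can be illustrated with a minimal sketch. This is not the paper's implementation: the function names, the non-overlapping rectangular-window transform, the patch size, and the embedding strength `alpha` are all assumptions chosen for brevity, and the sketch embeds into a feature vector without reconstructing the watermarked audio.

```python
import numpy as np

def spectrogram(x, frame=256):
    """Simplified TF representation (assumption): non-overlapping frames,
    rectangular window, one-sided FFT. Returns an array of shape (frames, bins)."""
    n = len(x) // frame
    frames = np.asarray(x[: n * frame], dtype=float).reshape(n, frame)
    return np.fft.rfft(frames, axis=1)

def lowest_energy_patches(S, patch=(8, 8), k=4):
    """Tile the TF plane into non-overlapping patches and return the top-left
    (frame, bin) index of the k patches with the smallest mean energy."""
    pf, pb = patch
    E = np.abs(S) ** 2
    nf, nb = E.shape[0] // pf, E.shape[1] // pb
    # Mean energy of every tile, shape (nf, nb).
    tiles = E[: nf * pf, : nb * pb].reshape(nf, pf, nb, pb).mean(axis=(1, 3))
    order = np.argsort(tiles, axis=None)[:k]
    rows, cols = np.unravel_index(order, tiles.shape)
    return [(int(r) * pf, int(c) * pb) for r, c in zip(rows, cols)]

def ss_embed(v, bit, pn, alpha):
    """Classical additive spread spectrum: v' = v + alpha * b * pn, b in {-1, +1}."""
    b = 1.0 if bit else -1.0
    return v + alpha * b * pn

def ss_detect(v, pn):
    """Correlation detector: the sign of <v, pn> gives the decoded bit."""
    return int(np.dot(v, pn) > 0)

# Usage on a synthetic low-energy feature vector (hypothetical values).
rng = np.random.default_rng(0)
pn = rng.choice([-1.0, 1.0], size=64)      # pseudo-noise spreading sequence
host = rng.uniform(0.0, 0.01, size=64)     # flattened 8x8 low-energy patch
marked = ss_embed(host, 1, pn, alpha=0.1)
decoded = ss_detect(marked, pn)
```

In this sketch the low energy of the selected patch matters because the host term in the correlation, `<host, pn>`, acts as interference for the detector. A smaller host energy lets a smaller `alpha` decode reliably.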
