论文标题
隐私权自动扬声器诊断
Privacy-preserving Automatic Speaker Diarization
论文作者
论文摘要
自动扬声器诊断(ASD)是一项具有多种应用程序的能力技术,涉及多个扬声器的录音,从而引起了隐私方面的特殊关注。实际上,在与服务器共享录制的远程设置中,客户端不仅放弃了他们对话的隐私,还放弃了可以从其声音中推断出的所有信息。但是,据我们所知,迄今为止,保护隐私的ASD系统的开发已被忽视。在这项工作中,我们使用两种加密技术,安全的多阶层计算(SMC)和安全模块化哈希来解决此问题,并将它们应用于级联的ASD系统的两个主要步骤:扬声器嵌入提取和集聚性层次结构集群。对于两个不同的SMC安全设置,我们的系统能够在性能和效率之间实现合理的权衡,实时因素为1.1和1.6。
Automatic Speaker Diarization (ASD) is an enabling technology with numerous applications, which deals with recordings of multiple speakers, raising special concerns in terms of privacy. In fact, in remote settings, where recordings are shared with a server, clients relinquish not only the privacy of their conversation, but also of all the information that can be inferred from their voices. However, to the best of our knowledge, the development of privacy-preserving ASD systems has been overlooked thus far. In this work, we tackle this problem using a combination of two cryptographic techniques, Secure Multiparty Computation (SMC) and Secure Modular Hashing, and apply them to the two main steps of a cascaded ASD system: speaker embedding extraction and agglomerative hierarchical clustering. Our system is able to achieve a reasonable trade-off between performance and efficiency, presenting real-time factors of 1.1 and 1.6, for two different SMC security settings.