基于聚类相关性约束的(s,l)-多样性匿名方法

来源期刊:中南大学学报(自然科学版)2015年第10期

论文作者:杨静 张冰 张健沛 谢静

文章页码:3733 - 3743

关键词:(s,l)-多样性;相关性约束;匿名;聚类;隐私保护

Key words:(s,l)-diversity; correlativity constrained; anonymity; clustering; privacy preservation

摘    要:针对传统l-多样性模型易形成敏感值高度相关的等价类问题,提出一种约束等价类中敏感值相关性的(s,l)-多样性模型。该模型在传统l-多样性模型的基础上,以敏感集合中非敏感属性值的分布度量敏感值的相关性,通过等价类中敏感值相关性的约束来降低高相关性敏感值产生的信息泄露。同时,使用属性值间相关性作为距离度量基准,提出一种(s,l)-多样性聚类算法(SLCA)来实现该匿名模型,以降低数据泛化过程中的信息损失。研究结果表明:SLCA算法具有较小的信息损失量与较短的运行时间,能够有效地降低等价类中敏感值的相关性,更好地防止个体敏感信息泄露。

Abstract: In allusion to the problem of traditional data anonymity models constructing equivalence class with high correlative sensitive values, (s,l)-diversity was proposed which limited the correlativity of sensitive values in the equivalence classes. This diversity model was based on traditional l-diversity model, and it measured the correlativity of the sensitive attribute values to decrease the information loss by equivalence classes with high corrective sensitive values. At the same time, a (s,l)-diversity clustering algorithm named SLCA was proposed to achieve (s,l)-diversity, and the SLCA algorithm measured the distance between tuples by measuring the correlativity of attribute values, which greatly decreased the information loss during data generation. The results show that SLCA algorithm is more effective in terms of both information loss and execution time, and SLCA algorithm can effectively decrease the correlativity of the sensitive values in the equivalence classes to protect the privacy security of the data sets.

有色金属在线官网  |   会议  |   在线投稿  |   购买纸书  |   科技图书馆

中南大学出版社 技术支持 版权声明   电话:0731-88830515 88830516   传真:0731-88710482   Email:administrator@cnnmol.com

互联网出版许可证:(署)网出证(京)字第342号   京ICP备17050991号-6      京公网安备11010802042557号