Spherical Microaggregation: Anonymizing Sparse Vector Spaces
2015 (English). In: Computers & Security (Print), ISSN 0167-4048, E-ISSN 1872-6208, Vol. 49, 28-44 p. Article in journal (Refereed), Published
Unstructured texts are a very popular data type, yet they remain largely unexplored in the privacy-preserving data mining field. We consider the problem of providing public information about a set of confidential documents. To that end, we have developed a method to protect a Vector Space Model (VSM) so that it can be made public even if the documents it represents are private. This method is inspired by microaggregation, a popular protection method from statistical disclosure control, and is adapted to work with sparse and high-dimensional data sets.
Place, publisher, year, edition, pages
Elsevier, 2015. Vol. 49, 28-44 p.
Electrical Engineering, Electronic Engineering, Information Engineering
Research subject
Technology
Identifiers
URN: urn:nbn:se:his:diva-10653
DOI: 10.1016/j.cose.2014.11.005
ISI: 000350519300003
ScopusID: 2-s2.0-84918506621
OAI: oai:DiVA.org:his-10653
DiVA: diva2:787893
Funder
EU, FP7, Seventh Framework Programme