Högskolan i Skövde

his.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Rounding based continuous data discretization for statistical disclosure control
University of Skövde, School of Informatics. University of Skövde, Informatics Research Environment. (Skövde Artificial Intelligence Lab (SAIL))ORCID iD: 0000-0002-2564-0683
University of Skövde, School of Informatics. University of Skövde, Informatics Research Environment. Hamilton Institute, Maynooth University, Maynooth, Ireland. (Skövde Artificial Intelligence Lab (SAIL))ORCID iD: 0000-0002-0368-8037
2023 (English)In: Journal of Ambient Intelligence and Humanized Computing, ISSN 1868-5137, E-ISSN 1868-5145, Vol. 14, no 11, p. 15139-15157Article in journal (Refereed) Published
Abstract [en]

“Rounding” can be understood as a way to coarsen continuous data. That is, low level and infrequent values are replaced by high-level and more frequent representative values. This concept is explored as a method for data privacy with techniques like rounding, microaggregation, and generalisation. This concept is explored as a method for data privacy in statistical disclosure control literature with perturbative techniques like rounding, microaggregation and non-perturbative methods like generalisation. Even though “rounding” is well known as a numerical data protection method, it has not been studied in depth or evaluated empirically to the best of our knowledge. This work is motivated by three objectives, (1) to study the alternative methods of obtaining the rounding values to represent a given continuous variable, (2) to empirically evaluate rounding as a data protection technique based on information loss (IL) and disclosure risk (DR), and (3) to analyse the impact of data rounding on machine learning based models. Here, in order to obtain the rounding values we consider discretization methods introduced in the unsupervised machine learning literature along with microaggregation and re-sampling based approaches. The results indicate that microaggregation based techniques are preferred over unsupervised discretization methods due to their fair trade-off between IL and DR. 

Place, publisher, year, edition, pages
Springer, 2023. Vol. 14, no 11, p. 15139-15157
Keywords [en]
Micro data protection, Rounding for micro data, Unsupervised discretization, Discrete event simulation, Economic and social effects, Machine learning, Numerical methods, Volume measurement, Data protection techniques, Discretization method, Numerical data protection methods, Perturbative techniques, Statistical disclosure Control, Unsupervised machine learning, Data privacy
National Category
Computer Sciences
Research subject
Skövde Artificial Intelligence Lab (SAIL)
Identifiers
URN: urn:nbn:se:his:diva-17858DOI: 10.1007/s12652-019-01489-7Scopus ID: 2-s2.0-85074009425OAI: oai:DiVA.org:his-17858DiVA, id: diva2:1368632
Part of project
Disclosure risk and transparency in big data privacy, Swedish Research Council
Funder
Swedish Research Council, 2016-03346
Note

CC BY 4.0

Published: 25 September 2019

Correspondence to Navoda Senavirathne.

This work is supported by Vetenskapsrådet project: “Disclosure risk and transparency in big data privacy” (VR 2016-03346, 2017-2020)

DRIAT

Available from: 2019-11-07 Created: 2019-11-07 Last updated: 2024-02-13Bibliographically approved

Open Access in DiVA

fulltext(1928 kB)60 downloads
File information
File name FULLTEXT02.pdfFile size 1928 kBChecksum SHA-512
92de236ab51722f8ee007a8eb9c9cdc7f1f77b010fbdfa376970d26f0e44cafb450bb44786cf7444e3794a1b7181cf27697c5f2baf2438be60c06b75e0f64806
Type fulltextMimetype application/pdf

Other links

Publisher's full textScopus

Authority records

Senavirathne, NavodaTorra, Vicenç

Search in DiVA

By author/editor
Senavirathne, NavodaTorra, Vicenç
By organisation
School of InformaticsInformatics Research Environment
In the same journal
Journal of Ambient Intelligence and Humanized Computing
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 337 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

doi
urn-nbn

Altmetric score

doi
urn-nbn
Total: 528 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • apa-cv
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf