Progress so far?
Posted: Sun Feb 09, 2025 6:13 am
In addition, the program will include the ability to produce synthetic data versions of the confidential sources, which will allow prospective researchers to examine a ‘fake’ version of the data before making an application to access the real data.
The program can save scripts to enable a data producer to re-run the anonymisation specification on later sources of data, in order to produce a consistent trend of datasets.
RUM is based on an existing R program lebanon rcs data called ‘sdcMicroGUI’. Much of the work involves adapting this existing package to improve the feel and user experience. In addition, the underlying R code is being examined to ensure that the anonymisation techniques conform to current specifications as used by data providers who supply data to the UK Data Service, e.g. the Government Statistical Service SDC guidelines.
Progress is going well, and our achievements to date include:
The ability to preview changes to a dataset before the anonymisation techniques are applied
Inclusion of variable metadata to enable speedy sorting of variables into categorical and continuous buckets
simpler functionality for importing and exporting data
Following a round of User Experience (UX) testing with staff internally at the Archive before Christmas, our partners at the Norwegian Social Science Data Services (NSD) are working on a number of improvements to the overall tool flow.
The program can save scripts to enable a data producer to re-run the anonymisation specification on later sources of data, in order to produce a consistent trend of datasets.
RUM is based on an existing R program lebanon rcs data called ‘sdcMicroGUI’. Much of the work involves adapting this existing package to improve the feel and user experience. In addition, the underlying R code is being examined to ensure that the anonymisation techniques conform to current specifications as used by data providers who supply data to the UK Data Service, e.g. the Government Statistical Service SDC guidelines.
Progress is going well, and our achievements to date include:
The ability to preview changes to a dataset before the anonymisation techniques are applied
Inclusion of variable metadata to enable speedy sorting of variables into categorical and continuous buckets
simpler functionality for importing and exporting data
Following a round of User Experience (UX) testing with staff internally at the Archive before Christmas, our partners at the Norwegian Social Science Data Services (NSD) are working on a number of improvements to the overall tool flow.