"Privacy Protection Practice for Data Mining with Multiple Data Sources" by Pauline O'Shaughnessy and Yan Xia Lin

In the age of data, data mining provides feasible tools with which to handle large datasets consisting of data from multiple sources. However, there is limited research on retrieving statistical information from data when data are confidential and cannot be shared directly. In this paper, we address this problem and propose a framework for performing data analysis using data from multiple sources without revealing true values for privacy purposes. The proposed framework includes three steps. First, data custodians individually mask data before publishing; then, the masked data collection is used to reconstruct the density function of the original dataset, from which resampled values are generated; last, existing data mining techniques are applied directly to the resampled data. This framework utilises the technique of reconstructing an original density function from noise-masked data using the moment-based density estimation method, which plays an essential role. Simulation studies sho ....
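The masking step and the role of moments can be illustrated with a minimal sketch. It assumes multiplicative noise (suggested by the keywords, not spelled out in the abstract) and relies on the standard identity E[Y^k] = E[X^k] E[C^k] for Y = X·C with C independent of X. The Gamma data, the Uniform(0.5, 1.5) noise, and the names `uniform_raw_moment` and `recover_moments` are illustrative choices, not taken from the paper. The sketch stops at recovering moments; the paper's density reconstruction and resampling steps are not reproduced here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical confidential values held by one data custodian.
x = rng.gamma(shape=2.0, scale=3.0, size=200_000)

# Masking (assumed multiplicative): Y = X * C, with C drawn independently
# from a publicly known distribution. Uniform(0.5, 1.5) is an illustrative choice.
a, b = 0.5, 1.5
c = rng.uniform(a, b, size=x.size)
y = x * c  # only the masked values y would be published


def uniform_raw_moment(k, a, b):
    """k-th raw moment of Uniform(a, b)."""
    return (b ** (k + 1) - a ** (k + 1)) / ((k + 1) * (b - a))


def recover_moments(y, max_order, a, b):
    """Estimate E[X^k], k = 1..max_order, from masked data only,
    using E[Y^k] = E[X^k] * E[C^k] for independent X and C."""
    return [
        np.mean(y ** k) / uniform_raw_moment(k, a, b)
        for k in range(1, max_order + 1)
    ]


est = recover_moments(y, 3, a, b)          # uses masked data only
true = [np.mean(x ** k) for k in range(1, 4)]  # for comparison; needs the raw data

for k, (e, t) in enumerate(zip(est, true), start=1):
    print(f"order {k}: estimated {e:.3f}, from unmasked data {t:.3f}")
```

Moments recovered this way are the kind of input a moment-based density estimator works from, which is why the noise distribution must be known to whoever performs the reconstruction.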

Data Masking, Data Mining, Multiplicative Noise, Sample Size Calculation