Big Data are huge amounts of digital information that rarely result from properly planned surveys; as a consequence they often contain redundant observations. When the aim is to answer particular questions of interest, we suggest selecting a subsample of units that contains the majority of the information to achieve this goal. Selection methods driven by the theory of optimal design incorporate the inferential purposes and thus perform better than standard sampling schemes.
|Numero di pagine||9|
|Rivista||Journal of Quality Technology|
|Stato di pubblicazione||Pubblicato - 2022|
- design efficiency
- finite population sampling
- optimal design theory