National Information Platforms for Nutrition Data Quality Toolkit
An R package of practical analytical methods for assessing the quality of variables in datasets.
National Information Platforms for Nutrition (NiPN) is a European Commission initiative that supports countries in strengthening their nutrition information systems and improving the analysis of their data, so that the strategic decisions they face in preventing malnutrition and its consequences are better informed.
As part of this mandate, NiPN commissioned the development of a toolkit to assess the quality of various nutrition-specific and nutrition-related data. This R package is a companion to that toolkit of practical analytical methods, which can be applied to variables in datasets to assess their quality.
The toolkit focuses on the data required to assess anthropometric status: measurements of weight, height or length, and mid-upper arm circumference (MUAC), together with sex and age. Many of the presented methods can also be applied to other types of data. NiPN may commission additional toolkits to examine other variables or other types of variables.
Data quality is assessed by:
Range checks and value checks to identify univariate outliers
Scatterplots and statistical methods to identify bivariate outliers
Use of flags to identify outliers in anthropometric indices
Examining the distribution and the statistics of the distribution of measurements and anthropometric indices
Assessing the extent of digit preference in recorded measurements
Assessing the extent of age heaping in recorded ages
Examining the sex ratio
Examining age distributions and age by sex distributions
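Several of the checks listed above can be illustrated with a short, language-neutral sketch. The package itself is written in R; the Python code below does not use or reproduce the package's functions. All helper names, the sample data, and the 2 to 35 kg plausibility range are hypothetical and chosen only for illustration. The chi-square statistic and the normal-approximation test are common choices for these checks, not necessarily the ones the toolkit applies.

```python
from collections import Counter
from math import erfc, sqrt


def range_check(values, low, high):
    """Return the positions of values that fall outside [low, high]."""
    return [i for i, v in enumerate(values) if not (low <= v <= high)]


def final_digit_counts(values):
    """Count the final digit (0-9) of measurements recorded to one decimal place."""
    counts = Counter(int(round(v * 10)) % 10 for v in values)
    return [counts.get(d, 0) for d in range(10)]


def chi_square_uniform(counts):
    """Chi-square statistic comparing observed counts with equal expected counts."""
    expected = sum(counts) / len(counts)
    return sum((c - expected) ** 2 / expected for c in counts)


def remainder_counts(ages, divisor):
    """Tabulate age remainders, e.g. divisor=12 for ages recorded in months."""
    return Counter(a % divisor for a in ages)


def sex_ratio_test(n_male, n_female, expected_male=0.5):
    """Normal-approximation z test of the observed proportion of males."""
    n = n_male + n_female
    p = n_male / n
    z = (p - expected_male) / sqrt(expected_male * (1 - expected_male) / n)
    return z, erfc(abs(z) / sqrt(2))  # z statistic and two-sided p-value


# Hypothetical sample data, for illustration only.
weights_kg = [12.0, 11.5, 13.0, 10.0, 12.5, 14.0, 9.0, 11.0, 99.9, 12.0]
ages_months = [12, 24, 36, 13, 24, 48, 7, 12]

print("Out-of-range positions:", range_check(weights_kg, 2.0, 35.0))
print("Final digit counts:", final_digit_counts(weights_kg))
print("Digit preference chi-square:", chi_square_uniform(final_digit_counts(weights_kg)))
print("Age remainders (mod 12):", dict(remainder_counts(ages_months, 12)))
print("Sex ratio z and p:", sex_ratio_test(60, 40))
```

In this sketch, a large chi-square value for the final digits suggests that measurements were rounded to preferred digits such as 0 or 5, and a concentration of age remainders at 0 suggests that ages were heaped at whole years.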