The removal of invalid values from dirty data sets. This can be a manual process or a variety of automated processes depending on the type of invalid data.