ruler offers a set of tools for creating tidy data validation reports using the dplyr grammar of data manipulation. It is designed to be flexible and extensible, both in how rules are created and in how their output is used.
The common workflow is:
Define dplyr-style packs of rules for basic data units (data, group, column, row, cell) to obey.
Expose some data to those rules. The result is the same data, possibly with an added exposure attribute. The exposure contains information about the applied packs and a tidy data validation report.
Use the data and exposure to perform some actions: make assertions about rule breakers, impute data, remove outliers, and so on.
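
The three steps above can be sketched as follows. This is a minimal illustration rather than a canonical recipe: the built-in mtcars dataset, the pack name dims, and the rule names nrow_low and ncol_high are arbitrary choices made for this example, and the thresholds are picked so that one rule is obeyed and one is broken.

```r
library(dplyr)
library(ruler)

# Step 1: define a data rule pack. Each summarised column is one rule,
# where TRUE means the rule is obeyed. Names here are illustrative only.
my_data_packs <- data_packs(
  dims = . %>% summarise(
    nrow_low = nrow(.) >= 10,  # mtcars has 32 rows, so this is obeyed
    ncol_high = ncol(.) <= 5   # mtcars has 11 columns, so this is broken
  )
)

# Step 2: expose the data to the rules. The data itself is unchanged;
# the validation results are stored in an exposure attribute.
exposed <- mtcars %>% expose(my_data_packs)

# Step 3: use the exposure. By default the tidy report lists rule breakers.
report <- get_report(exposed)
print(report)
```

From here, the exposed data could be passed to assert_any_breaker() to signal a problem when any rule is broken, or the report could drive custom actions such as imputation.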
To learn more about ruler, browse the vignettes with browseVignettes(package = "ruler"). The preferred order is:
Design process and exposure format.
Rule packs.
Validation.