The goal of convo is to enable the creation of a controlled vocabulary for naming columns in a relational dataset, as described in my blog post Column Names as Contracts. This controlled vocabulary can then be used to check a set of names for adherence, to automate documentation, and to generate data checks via the pointblank package.
You can install the development version of convo from GitHub with:
devtools::install_github("emilyriederer/convo")
- Define a controlled vocabulary (convo) in R or YAML, including valid name stubs at different levels of the ontology and optional descriptions or validation checks
- Derive controlled vocabularies (convos) from a set of variables
- Evaluate a set of names against a convo and identify violations
- Compare convo objects and/or stub lists with set-like operations (union, intersect, setdiff) to identify new candidates for inclusion
- Generate a pointblank validation agent or YAML file from a convo object for data validation
- Automate documentation (describe_*() functions)
- convo specifications (not just stub lists)
- Integration with the dm package to validate names across a schema

Main pieces of functionality are illustrated in the Quick Start Guide on the package website.
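
To make the idea of checking names against a controlled vocabulary concrete, here is a minimal base R sketch. It does not use the convo API. The vocabulary, the underscore separator, the column names, and the helper find_violations() are all hypothetical, chosen only for illustration. It shows the two ideas from the list above: flagging names whose stub at a given level is not allowed, and using setdiff to surface new candidate stubs.

```r
# Hypothetical vocabulary: allowed stubs at each level of a column name.
# Level 1 is a measure type, level 2 is a subject. This is NOT the convo API.
vocab <- list(
  c("ID", "IND", "AMT", "N"),
  c("CUST", "ACCT", "SPEND")
)

# For each level, return the names whose stub at that level is missing
# or not in the allowed set. Assumes stubs are separated by `sep`.
find_violations <- function(col_names, vocab, sep = "_") {
  parts <- strsplit(col_names, sep, fixed = TRUE)
  lapply(seq_along(vocab), function(level) {
    stubs <- vapply(
      parts,
      function(p) if (length(p) >= level) p[level] else NA_character_,
      character(1)
    )
    col_names[is.na(stubs) | !(stubs %in% vocab[[level]])]
  })
}

col_names <- c("ID_CUST", "AMT_SPEND", "CAT_CUST", "N_VISIT")
violations <- find_violations(col_names, vocab)

# Set-like comparison: level-2 stubs seen in the data but absent from
# the vocabulary are candidates for inclusion.
observed_l2 <- vapply(
  strsplit(col_names, "_", fixed = TRUE),
  function(p) p[2],
  character(1)
)
candidates <- setdiff(observed_l2, vocab[[2]])

print(violations)
print(candidates)
```

In this sketch, violations is a list with one element per level, so a name can be reported at the specific level where it breaks the vocabulary. The package itself offers richer behavior than this; see the Quick Start Guide for the actual functions.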