My internet search was unsuccessful, so I hope somebody here has an idea. Suppose I collaborate with a colleague on a dataset: he preprocesses the data, which I then read into R.
What I am looking for is a command-line tool or an R package that lets me define a data format, e.g., something like this:
data description
date: date
age: positive integer
weight: positive float
job: string, NA
Then I would like to check the validity of a given dataset against that description, either on the command line or in R. Nice to have, but not necessary: generating test data from a given data structure description.
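To make the request more concrete, here is a rough base-R sketch of the kind of check I have in mind. Everything in it is made up for illustration: `spec`, `validate`, and the example data frames `good` and `bad` are hypothetical names, and the rules are just my reading of the description above (for instance, I assume dates arrive as `Date` objects and that only `job` may contain NA). I am hoping an existing tool does this more cleanly.

```r
# Hypothetical spec: one predicate per column, mirroring the description above.
# Each predicate returns a single TRUE/FALSE for the whole column.
spec <- list(
  date   = function(x) inherits(x, "Date") && !anyNA(x),
  age    = function(x) is.numeric(x) && !anyNA(x) && all(x > 0 & x == round(x)),
  weight = function(x) is.numeric(x) && !anyNA(x) && all(x > 0),
  job    = function(x) is.character(x)  # NA is allowed here
)

# Hypothetical helper: apply every predicate to its column and
# return a named logical vector (TRUE = column conforms to the spec).
validate <- function(df, spec) {
  missing_cols <- setdiff(names(spec), names(df))
  if (length(missing_cols) > 0) {
    stop("missing columns: ", paste(missing_cols, collapse = ", "))
  }
  vapply(names(spec), function(col) isTRUE(spec[[col]](df[[col]])), logical(1))
}

# Made-up example data: `good` conforms, `bad` has a negative age.
good <- data.frame(
  date   = as.Date(c("2020-01-01", "2020-01-02")),
  age    = c(31L, 45L),
  weight = c(70.5, 82.0),
  job    = c("nurse", NA),
  stringsAsFactors = FALSE
)
bad <- good
bad$age[2] <- -3L

print(validate(good, spec))
print(validate(bad, spec))
```

With this sketch, every column of `good` should pass, while `age` should be flagged for `bad`. What I would like on top of this is a declarative format for the description itself, rather than hand-written predicates.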
Thank you for your ideas/hints!