Loupe: compare new and old data in continuously updated timeseries

A loupe is a simple, small magnification device used to examine small details more closely.

Usage

loupe(df_current, df_previous, datetime_variable, ...)

Arguments

df_current: data.frame, the newest/current version of dataset x.
df_previous: data.frame, the old version of dataset, for example x - t1.
datetime_variable: string, which variable to use as unique ID to join df_current and df_previous. Usually a "datetime" variable.
...: Other waldo::compare() arguments can be supplied here, such as tolerance or max_diffs. See ?waldo::compare() for a full list.

Value

A boolean where TRUE indicates no changes to previous data and FALSE indicates unexpected changes.

Details

This function is intended to aid in the verification of continually updating timeseries data where we expect new values but want to ensure previous values remains unchanged.

This function matches two dataframe objects by their unique identifier (usually "time" or "datetime in a timeseries).

It informs the user of new (unmatched) rows which have appeared, and then returns a waldo::compare() call to give a detailed breakdown of changes.

The main assumption is that df_current and df_previous are a newer and older versions of the same data, and that the datetime_variable variable name always remains the same. Elsewhere new columns can of appear, and these will be returned in the report.

The underlying functionality is handled by create_object_list().

Examples