Hierarchical clustering of performances, using audience crossover data to calculate the effective "distance" between performances.
Distance is calculated using Simpson's distance metric
Distance is corrected by time using a binomial regression on time between performances
Clustering is hierarchical using Ward D2 distance
Mapping is classic multidimensional scaling, i.e. PCA
Usage
performance_map_report
# S3 method for class 'performance_map'
read(
report,
since = Sys.Date() - 365 * 5,
until = Sys.Date() + 365,
filter_expr = NULL,
...
)
# S3 method for class 'performance_map'
process(report, n_clusters = 8, ...)
# S3 method for class 'performance_map'
write(report, n_clusters = 8, highlight_since = Sys.Date() - 365, ...)
Arguments
- report
report
object- since
POSIXct
performance data on/after this date will be returned- until
POSIXct
performance data on/before this date will be returned- filter_expr
expression
to use when filtering performances, i.e.read_tessi("performances") %>% filter(!!filter_expr)
- ...
not used
- n_clusters
integer
number of clusters to make- highlight_since
POSIXct
performances on/after this date will be highlighted in the returned plots