The Supersystem is a multinational project led by the University of Cambridge that extends statistical methods for the analysis of incomplete data. Working with colleagues at the University of Notre Dame, Yale, Duke and Princeton, the project has produced applications for use by statisticians and scientists.

Background
Many real-life situations involve data that are not completely observed, so methods for analysing incomplete data apply to a wide range of problems. The Supersystem, based at the University of Cambridge, builds statistical methods for analysing such data.

The statistical problems the Supersystem addresses often arise in the social, health, environmental and life sciences, where data are frequently subject to censoring and missingness. Methods for incomplete data are often incompatible with data analysis software packages, leaving users with problems such as the inability to model complex distributions in a single framework and difficulty handling categorical data, including ordinal data. The Supersystem aims to change this by exploiting recent developments in the theory of probabilistic graphical models and relational databases.

As well as developing methods for statistical analysis, the project aims to create user-friendly software for statistical applications, linking statisticians and applied researchers in a mutually beneficial way.

Applications
The Supersystem develops software for the statistical analysis of incomplete data, and its applications are intended for use by statisticians and scientists. These include:
SIMLR: A statistical analysis package for the comparison and analysis of categorical and ordinal data
SuperLearner: A flexible framework for ensemble learning
SuperStopper: A framework for dealing with missing data in mixed-model designs
SAS+Cast: A package for multivariate data analysis
SuperSnowball: A framework for Bayesian variable selection
Switch: Statistical analysis in heterogeneous population samples
Ultimax: Statistical analysis of health monitoring data

