StatsCube is a gCube application bundle offering facilities to practitioners working with a rich array of information, ranging from observational data to statistical data. It included the following applications.
Code List Discovery This application offers facilities for browsing and discovering available Code Lists from a set of repositories. It offers both a Google-like approach and an advanced search allowing users to characterise in detail the information they are looking for.
Code List Management This application offers facilities for managing code lists, i.e., recognised controlled vocabularies. It includes facilities for code lists creation (also via ingestion), collaborative curation, and publishing.
Statistical Service This application offers facilities for efficiently and effectively executing a rich array of statistical data processing algorithms. The application relies on the distributed and elastic computing capacities offered by the underlying infrastructure. It offers a set of off-the-shelf algorithms including clustering algorithms such as DBScan. Moreover, it enables a simple integration and execution of user-defined algorithms expressed in a number of programming and scripting languages including R. It currently embeds more than 110 different algorithms ranging from Anomalies Detection, Classification, Clustering, Simulation, Training, Bayesian Methods, Trends, and many more. These algorithms are then is executed on a distributed infrastructure by completely hiding the complexity of such an execution while ensuring robustness, throughput, fault-tolerance, and privacy.
Tabular Data Discovery This application offers facilities for browsing and discovery available Tabular Data resources from a set of repositories. It offers both a Google-like approach and an advanced search allowing users to characterise in detail the information they are looking for.
Tabular Data Enrichment This application offers facilities for augmenting a tabular dataset having geospatial and temporal attributes with selected physical and chemical environmental parameters acquired dynamically by authoritative sources. These parameters might include salinity, temperature, ice concentration, etc. The application properly adapts the parameters to the spatial and temporal resolution of the tabular dataset.
Tabular Data Management This application offers facilities for managing tabular data. In particular, it offers facilities for supporting the entire workflow of tasks on tabular data including tabular data creation, collaborative curation and publishing. Moreover, it offers a number of facilities for tabular data manipulation including filtering, grouping, unions and intersections. In addition to that, it is equipped with a powerful mechanism for versioning and a rich set of metadata for describing the tabula data resource including provenance.
Tabular Data Processing This application offers facilities for performing data mining tasks on tabular data. In particular, it relies on the Statistical Service to offer effective tabular data manipulation facilities including geocoding, maps projection, clustering, outlier identification, hidden trends, trends comparison, and many more.