Dataframe
 
0.15

Summary statistics

Last modified: 14 January 2025

Every summary statistic can be used in aggregations.

sum, mean, and std are available for numeric columns of types Int, Double, Float, BigDecimal, Long, and Byte.

min, max, and median are available for Comparable columns.

When a statistic x is applied to several columns, it can be computed in one of several modes:

  • x(): DataRow computes a separate value for every suitable column

  • x { columns }: Value computes a single value across all given columns

  • xFor { columns }: DataRow computes a separate value for every given column

  • xOf { rowExpression }: Value computes a single value across the results of a row expression evaluated for every row
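
The four modes can be compared side by side using sum as the statistic x. The following is a minimal sketch, not an excerpt from the library documentation: the column names (name, age, weight), the sample values, and the helper sampleFrame are illustrative assumptions, and the columns are addressed through column accessors declared with column<Int>().

```kotlin
import org.jetbrains.kotlinx.dataframe.DataFrame
import org.jetbrains.kotlinx.dataframe.api.*

// Column accessors for the illustrative columns (names are assumptions).
val age by column<Int>()
val weight by column<Int>()

// Hypothetical sample data, used only for this sketch.
fun sampleFrame(): DataFrame<*> = dataFrameOf("name", "age", "weight")(
    "Alice", 15, 54,
    "Bob", 45, 87,
    "Charlie", 20, 70,
)

fun main() {
    val df = sampleFrame()

    // x(): one value per suitable (here: numeric) column, returned as a DataRow.
    println(df.sum())

    // x { columns }: a single value across all values of the selected columns.
    println(df.sum { age and weight })

    // xFor { columns }: one value per selected column, returned as a DataRow.
    println(df.sumFor { age and weight })

    // xOf { rowExpression }: a single value across the per-row expression results.
    println(df.sumOf { age() + weight() })
}
```

In this sketch the second and fourth calls produce the same total, because summing all values of both columns is equivalent to summing the per-row sums; for other statistics such as mean or median the two modes generally differ.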

The min and max statistics have an additional mode, by:

  • minBy { rowExpression }: DataRow finds the row with the minimal result of the expression
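
As a hedged illustration of the by mode, the sketch below returns the whole row whose expression result is smallest, rather than the smallest value itself. The data, the column names, and the helper functions sampleFrame and youngest are assumptions made for this example.

```kotlin
import org.jetbrains.kotlinx.dataframe.DataFrame
import org.jetbrains.kotlinx.dataframe.DataRow
import org.jetbrains.kotlinx.dataframe.api.*

// Column accessor for the illustrative age column (name is an assumption).
val age by column<Int>()

// Hypothetical sample data, used only for this sketch.
fun sampleFrame(): DataFrame<*> = dataFrameOf("name", "age", "weight")(
    "Alice", 15, 54,
    "Bob", 45, 87,
    "Charlie", 20, 70,
)

// minBy evaluates the row expression for every row and
// returns the row that yields the minimal result.
fun youngest(): DataRow<*> = sampleFrame().minBy { age() }

fun main() {
    val row = youngest()
    // The full row is available, so other columns can be read from it.
    println(row["name"])
    println(row)
}
```

By contrast, a call such as df.min { age } would yield only the minimal age value, without the rest of the row.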