Data Schemas

The Kotlin DataFrame library provides typed data access by generating extension properties for the type DataFrame<T> (and for DataRow<T>), where T is a marker class that represents the DataSchema of the DataFrame.

A schema of a DataFrame is a mapping from column names to column types.
This data schema can be expressed as a Kotlin class or interface.
If the DataFrame is hierarchical (that is, it contains a column group or a column of DataFrames), the data schema reflects this structure, with a separate class representing the schema of each column group or nested DataFrame.

name     info
         age    height
Alice    23     175.5
Bob      27     160.2
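
To make the table above concrete, here is a minimal sketch of how such a hierarchical DataFrame could be built and its schema inspected. It assumes the `org.jetbrains.kotlinx:dataframe` library is on the classpath. The helper name `buildPeople` is hypothetical, and the exact text printed by `schema()` may differ between library versions.

```kotlin
import org.jetbrains.kotlinx.dataframe.AnyFrame
import org.jetbrains.kotlinx.dataframe.api.*

// Hypothetical helper: builds the two-row example table.
fun buildPeople(): AnyFrame =
    dataFrameOf("name", "age", "height")(
        "Alice", 23, 175.5f,
        "Bob", 27, 160.2f,
    )
        // Nest "age" and "height" under a column group named "info"
        .group("age", "height").into("info")

fun main() {
    val df = buildPeople()
    // Prints the column names with their types, including the nested "info" group
    println(df.schema())
}
```

The printed schema is the runtime counterpart of the class-based description: each top-level column maps to a type, and the "info" group carries its own nested mapping.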

// Data schema of the "info" column group
@DataSchema
data class Info(
    val age: Int,
    val height: Float
)

// Data schema of the entire DataFrame
@DataSchema
data class Person(
    val info: Info,
    val name: String
)

Extension properties for DataFrame<Person> are generated based on this schema and allow accessing columns or using them in operations:

// Assuming `df` has type `DataFrame<Person>`

// Get "age" column from "info" group
df.info.age

// Select "name" and "height" columns
df.select { name and info.height }

// Filter rows by "age"
df.filter { info.age >= 18 }
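
The snippets above rely on generated extension properties, which are normally produced by the library's tooling. Where they are not available, a rough equivalent can be written with string-based column access. The sketch below assumes the `org.jetbrains.kotlinx:dataframe` library is on the classpath; `typedPeople` is a hypothetical helper, and `cast` is used here only to attach the `Person` marker to an untyped DataFrame.

```kotlin
import org.jetbrains.kotlinx.dataframe.DataFrame
import org.jetbrains.kotlinx.dataframe.annotations.DataSchema
import org.jetbrains.kotlinx.dataframe.api.*

// Same schema classes as in the text, repeated so the sketch is self-contained
@DataSchema
data class Info(val age: Int, val height: Float)

@DataSchema
data class Person(val info: Info, val name: String)

// Hypothetical helper: builds the example table and marks it with the Person schema.
fun typedPeople(): DataFrame<Person> =
    dataFrameOf("name", "age", "height")(
        "Alice", 23, 175.5f,
        "Bob", 27, 160.2f,
    )
        .group("age", "height").into("info")
        // Attaches the compile-time schema marker to the untyped DataFrame
        .cast<Person>()

fun main() {
    val df = typedPeople()
    // String-based filter on the nested "age" column of the "info" group
    val adults = df.filter { "info"["age"]<Int>() >= 18 }
    println(adults.rowsCount())
}
```

The string-based form is checked only at runtime, so a misspelled column name or a wrong type argument fails when the code runs rather than when it compiles. That is the gap the generated extension properties are meant to close.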
