Data Schema Generation from an Existing DataFrame
Utility functions that generate useful Kotlin definitions (returned as a String) based on the current DataFrame schema.
inline fun <reified T> DataFrame<T>.generateDataClasses(
markerName: String? = null,
extensionProperties: Boolean = false,
visibility: MarkerVisibility = MarkerVisibility.IMPLICIT_PUBLIC,
useFqNames: Boolean = false,
nameNormalizer: NameNormalizer = NameNormalizer.default,
): CodeString
Generates Kotlin data classes corresponding to the DataFrame schema (including all nested DataFrame columns and column groups).
Useful when you want to:
- Work with the data as regular Kotlin data classes.
- Convert a DataFrame to instances of these data classes with df.toListOf<DataClassType>().
- Work with data class serialization (see the sketch further below).
- Extract structured types for further use in your application.
- markerName: String? - The base name to use for the generated data classes. If null, the simple name of the T type argument of DataFrame is used. Default: null.
- extensionProperties: Boolean - Whether to generate extension properties in addition to the data class declarations. Useful if you don't use the compiler plugin; otherwise they are not needed, since the compiler plugin, notebooks, and the older Gradle/KSP plugin generate them automatically. Default: false.
- visibility: MarkerVisibility - Visibility modifier for the generated declarations. Default: MarkerVisibility.IMPLICIT_PUBLIC.
- useFqNames: Boolean - If true, fully qualified type names are used in the generated code. Default: false.
- nameNormalizer: NameNormalizer - Strategy for converting column names (with spaces, underscores, etc.) to Kotlin-style identifiers. Generated properties still refer to columns by their actual name via the @ColumnName annotation (see the sketch after this list). Default: NameNormalizer.default.
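For instance, a minimal sketch of name normalization, assuming a column literally named "order id" (this sample column is made up for illustration, and the generated code shown in the comments is approximate):

val raw = dataFrameOf("order id")(102, 103)
raw.generateDataClasses("Order")
// Expected shape of the generated declaration (approximate):
// @DataSchema
// data class Order(
//     @ColumnName("order id")
//     val orderId: Int
// )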
Returns: CodeString - a value class wrapper for String, containing the generated Kotlin code of data class declarations and, optionally, extension properties.
df.generateDataClasses("Customer")
Output:

@DataSchema
data class Customer1(
    val amount: Double,
    val orderId: Int
)

@DataSchema
data class Customer(
    val orders: List<Customer1>,
    val user: String
)
Add these classes to your project and convert the DataFrame to a list of typed objects:
val customers: List<Customer> = df.cast<Customer>().toList()
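For the serialization use case mentioned above, a hedged sketch, assuming kotlinx.serialization is configured in the project and @Serializable is added to the generated classes by hand (the sample data is made up):

import kotlinx.serialization.Serializable
import kotlinx.serialization.encodeToString
import kotlinx.serialization.json.Json

// The generated classes, with @Serializable added manually.
@Serializable
data class Customer1(val amount: Double, val orderId: Int)

@Serializable
data class Customer(val orders: List<Customer1>, val user: String)

fun main() {
    // customers as obtained above via df.cast<Customer>().toList()
    val customers = listOf(Customer(listOf(Customer1(12.5, 102)), "Alice"))
    println(Json.encodeToString(customers))
    // prints: [{"orders":[{"amount":12.5,"orderId":102}],"user":"Alice"}]
}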
inline fun <reified T> DataFrame<T>.generateInterfaces(): CodeString
fun <T> DataFrame<T>.generateInterfaces(markerName: String): CodeString
Generates @DataSchema Kotlin interfaces for this DataFrame (including all nested DataFrame columns and column groups).
This is useful when working with the compiler plugin in cases where the schema cannot be inferred automatically from the source.
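A typical flow, sketched under the assumption that the data comes from a runtime-loaded CSV, so the plugin cannot see its schema at compile time (the file name, and the readCsv import from the CSV module, are assumptions here):

import org.jetbrains.kotlinx.dataframe.DataFrame
import org.jetbrains.kotlinx.dataframe.io.readCsv

// The schema is only known at runtime, so the compiler plugin cannot infer it.
val df = DataFrame.readCsv("orders.csv") // hypothetical file
// Print the interfaces once and paste them into the project sources.
println(df.generateInterfaces("Order"))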
- markerName: String - The base name to use for the generated interfaces. With the parameterless overload, the simple name of the T type argument of DataFrame is used instead.
Returns: CodeString - a value class wrapper for String, containing the generated Kotlin code of @DataSchema interfaces without extension properties.
df.generateInterfaces()
Output:

@DataSchema(isOpen = false)
interface _DataFrameType11 {
    val amount: kotlin.Double
    val orderId: kotlin.Int
}

@DataSchema
interface _DataFrameType1 {
    val orders: List<_DataFrameType11>
    val user: kotlin.String
}
By adding these interfaces to your project with the compiler plugin enabled,
you'll gain full support for the extension properties API and type-safe operations.
Use cast to apply the generated schema to a DataFrame:
df.cast<_DataFrameType1>().filter { orders.all { orderId >= 102 } }
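Once the generated interfaces are in your sources and the plugin is enabled, column access is typed as well (a minimal sketch, assuming the declarations above):

val typed = df.cast<_DataFrameType1>()
// The compiler plugin resolves `user` as a typed extension property.
val users: List<String> = typed.user.toList()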