Package org.jetbrains.kotlinx.dl.dataset.embedded

Functions

cifar10Paths

fun cifar10Paths(cacheDirectory: File = File("cache")): Pair<String, String>

Returns paths to images and its labels for the Cifar'10 dataset.

dogsCatsDatasetPath

fun dogsCatsDatasetPath(cacheDirectory: File = File("cache")): String

Returns path to images of the Dogs-vs-Cats dataset.

dogsCatsSmallDatasetPath

fun dogsCatsSmallDatasetPath(cacheDirectory: File = File("cache")): String

Returns path to images of the subset of the Dogs-vs-Cats dataset.

extractCifar10Images

fun extractCifar10Images(archiveName: String): Array<FloatArray>

Loads images from archiveName to heap memory and applies basic normalization preprocessing.

extractCifar10Labels

fun extractCifar10Labels(pathToLabels: String, numClasses: Int): FloatArray

Loads labels from pathToLabels csv file to heap memory and converts to Floats.

extractCifar10LabelsAnsSort

fun extractCifar10LabelsAnsSort(pathToLabels: String, numClasses: Int): FloatArray

Loads labels from pathToLabels csv file to heap memory and converts to Floats, after that it sorts it to have the same order as image files.

extractFashionImages

fun extractFashionImages(archivePath: String): Array<FloatArray>

Extracts Fashion Mnist images from archivePath.

extractFashionLabels

fun extractFashionLabels(archivePath: String, numClasses: Int): Array<FloatArray>

Extracts Fashion Mnist labels from archivePath with number of classes numClasses.

extractImages

fun extractImages(archivePath: String): Array<FloatArray>

Extracts (Fashion) Mnist images from archivePath.

extractLabels

fun extractLabels(archivePath: String, numClasses: Int): FloatArray

Extracts (Fashion) Mnist labels from archivePath with number of classes numClasses.

fashionMnist

fun fashionMnist(cacheDirectory: File = File("cache")): Pair<OnHeapDataset, OnHeapDataset>

Loads the Fashion-MNIST dataset.

freeSpokenDigitDatasetPath

fun freeSpokenDigitDatasetPath(cacheDirectory: File = File("cache")): String

Returns path to sound data files from Free Spoken Digits Dataset.

freeSpokenDigits

fun freeSpokenDigits(cacheDirectory: File = File("cache"), maxTestIndex: Int = 5): Pair<OnHeapDataset, OnHeapDataset>

Loads the Free Spoken Digits Dataset. This is a dataset of wav sound files of the 10 digits spoken by different people many times each. The test set officially consists of the first 10% of the recordings. Recordings numbered 0-4 (inclusive) are in the test and 5-49 are in the training set.

mnist

fun mnist(cacheDirectory: File = File("cache")): Pair<OnHeapDataset, OnHeapDataset>

Loads the MNIST dataset. This is a dataset of 60,000 28x28 grayscale images of the 10 digits, along with a test set of 10,000 images. More info can be found at the MNIST homepage.

mnist3D