mlr3spatiotempcv: Spatiotemporal Resampling Methods for Machine Learning in R
Dublin Core
Title
mlr3spatiotempcv: Spatiotemporal Resampling Methods for Machine Learning in R
Subject
cross-validation, predictive performance, machine learning, autocorrelation, spatial, temporal, R.
Description
Spatial and spatiotemporal machine-learning models require a suitable framework for
their model assessment, model selection, and hyperparameter tuning, in order to avoid
error estimation bias and over-fitting. This contribution provides an overview of the
state-of-the-art in spatial and spatiotemporal cross-validation techniques and their implementations in R while introducing the R package mlr3spatiotempcv as an extension
package of the machine-learning framework mlr3. Currently various R packages implementing different spatiotemporal partitioning strategies exist: blockCV, CAST, skmeans
and sperrorest. The goal of mlr3spatiotempcv is to gather the available spatiotemporal
resampling methods in R and make them available to users through a simple and common
interface. This is made possible by integrating the package directly into the mlr3 machinelearning framework, which already has support for generic non-spatiotemporal resampling
methods such as random partitioning. One advantage is the use of a consistent nomenclature in an overarching machine-learning toolkit instead of a varying package-specific
syntax, making it easier for users to choose from a variety of spatiotemporal resampling
methods. This package avoids giving recommendations which method to use in practice
as this decision depends on the predictive task at hand, the autocorrelation within the
data, and the spatial structure of the sampling design or geographic objects being studied.
their model assessment, model selection, and hyperparameter tuning, in order to avoid
error estimation bias and over-fitting. This contribution provides an overview of the
state-of-the-art in spatial and spatiotemporal cross-validation techniques and their implementations in R while introducing the R package mlr3spatiotempcv as an extension
package of the machine-learning framework mlr3. Currently various R packages implementing different spatiotemporal partitioning strategies exist: blockCV, CAST, skmeans
and sperrorest. The goal of mlr3spatiotempcv is to gather the available spatiotemporal
resampling methods in R and make them available to users through a simple and common
interface. This is made possible by integrating the package directly into the mlr3 machinelearning framework, which already has support for generic non-spatiotemporal resampling
methods such as random partitioning. One advantage is the use of a consistent nomenclature in an overarching machine-learning toolkit instead of a varying package-specific
syntax, making it easier for users to choose from a variety of spatiotemporal resampling
methods. This package avoids giving recommendations which method to use in practice
as this decision depends on the predictive task at hand, the autocorrelation within the
data, and the spatial structure of the sampling design or geographic objects being studied.
Creator
Patrick Schratz
Source
https://www.jstatsoft.org/article/view/v111i07
Publisher
Friedrich Schiller University Jena
Date
November 2024
Contributor
Fajar bagus W
Format
PDF
Language
English
Type
Text
Files
Collection
Citation
Patrick Schratz, “mlr3spatiotempcv: Spatiotemporal Resampling Methods for Machine Learning in R,” Repository Horizon University Indonesia, accessed April 4, 2025, https://repository.horizon.ac.id/items/show/8351.