Split-Apply-Combine with Dynamic Grouping

Dublin Core

Title

Split-Apply-Combine with Dynamic Grouping

Description

Partitioning a data set by one or more of its attributes and computing an aggregate for each part is one of the most common operations in data analyses. There are use cases where the partitioning is determined dynamically by collapsing smaller subsets into larger ones, to ensure sufficient support for the computed aggregate. These use cases are not supported by software implementing split-apply-combine types of operations. This paper presents the R package accumulate that offers convenient interfaces for defining grouped aggregation where the grouping itself is dynamically determined, based on user-defined conditions on subsets, and a user-defined subset collapsing scheme. The formal underlying algorithm is described and analyzed as well.
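The dynamic grouping idea in the description can be illustrated with a minimal Python sketch. It is not the accumulate package's R interface, and it is not taken from the paper. The names `aggregate_with_collapse`, `levels`, `test` and `aggregate` are hypothetical, and the sketch assumes the collapsing scheme is nested, meaning every coarser key can be derived from any record in the finer group.

```python
from collections import defaultdict
from statistics import mean


def aggregate_with_collapse(records, levels, test, aggregate):
    """For each finest-level group, climb `levels` until `test` passes.

    Hypothetical helper: `levels` is a list of key functions ordered from
    finest to coarsest, `test` is the user-defined condition on a subset,
    and `aggregate` computes the value reported for the accepted subset.
    """
    # Split the records once per collapsing level.
    splits = []
    for key in levels:
        parts = defaultdict(list)
        for rec in records:
            parts[key(rec)].append(rec)
        splits.append(parts)

    results = {}
    for target, members in splits[0].items():
        sample = members[0]  # assumption: any member maps to the coarser keys
        results[target] = None  # stays None if no level passes the test
        for level, (key, parts) in enumerate(zip(levels, splits)):
            subset = parts[key(sample)]
            if test(subset):
                results[target] = (level, aggregate(subset))
                break
    return results


# Illustrative data, invented for this sketch.
records = [
    {"region": "A", "sector": "x", "value": 10.0},
    {"region": "A", "sector": "x", "value": 14.0},
    {"region": "A", "sector": "y", "value": 30.0},
    {"region": "B", "sector": "x", "value": 5.0},
]

# Collapsing scheme: (region, sector) -> (region,) -> everything.
levels = [
    lambda r: (r["region"], r["sector"]),
    lambda r: (r["region"],),
    lambda r: (),
]

result = aggregate_with_collapse(
    records,
    levels,
    test=lambda subset: len(subset) >= 2,  # require at least two records
    aggregate=lambda subset: mean(r["value"] for r in subset),
)
```

With these invented records, group ("A", "x") is aggregated at its own level, ("A", "y") collapses to region "A", and ("B", "x") collapses to the full data set. Each result carries the collapsing level that was used.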

Creator

Mark P. J. van der Loo

Source

https://www.jstatsoft.org/article/view/v112i04

Publisher

OJS/PKP

Date

29 March 2025

Contributor

FAJAR BAGUS W

Format

PDF

Language

English

Type

Text

Citation

Mark P. J. van der Loo, “Split-Apply-Combine with Dynamic Grouping,” Repository Horizon University Indonesia, accessed January 12, 2026, https://repository.horizon.ac.id/items/show/9841.