Package: stddiff.spark 1.0

Alicja Januszkiewicz

stddiff.spark: Calculate the Standardized Difference for Numeric, Binary and Category Variables in Apache Spark

Provides functions to compute standardized differences for numeric, binary, and categorical variables on Apache Spark DataFrames using 'sparklyr'. The implementation mirrors the methods used in the 'stddiff' package but operates on distributed data. See Zhicheng Du, Yuantao Hao (2022) <doi:10.32614/CRAN.package.stddiff> for reference.

Authors:Alicja Januszkiewicz [aut, cre, cph]

stddiff.spark_1.0.tar.gz
stddiff.spark_1.0.zip(r-4.7)stddiff.spark_1.0.zip(r-4.6)stddiff.spark_1.0.zip(r-4.5)
stddiff.spark_1.0.tgz(r-4.6-any)stddiff.spark_1.0.tgz(r-4.5-any)
stddiff.spark_1.0.tar.gz(r-4.7-any)stddiff.spark_1.0.tar.gz(r-4.6-any)
stddiff.spark_1.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html
card.svg |card.png
stddiff.spark/json (API)
NEWS

# Install 'stddiff.spark' in R:
install.packages('stddiff.spark', repos = c('https://alicja-januszkiewicz.r-universe.dev', 'https://cloud.r-project.org'))

Bug tracker:https://github.com/alicja-januszkiewicz/stddiff.spark/issues

On CRAN:

Conda:

apache-sparkdescriptive-statisticssparklyr

3.88 score 147 downloads 3 mentions 3 exports 38 dependencies

Last updated from:184e3b2b83. Checks:9 OK. Indexed: yes.

TargetResultTimeFilesSyslog
linux-devel-x86_64OK122
source / vignettesOK165
linux-release-x86_64OK128
macos-release-arm64OK125
macos-oldrel-arm64OK156
windows-develOK88
windows-releaseOK74
windows-oldrelOK64
wasm-releaseOK91

Exports:stddiff.binarystddiff.categorystddiff.numeric

Dependencies:askpassblobclicodetoolsconfigcpp11curlDBIdbplyrdplyrgenericsglobalsgluehttrjsonlitelifecyclemagrittrmimeopensslpillarpkgconfigpurrrR6rlangrstudioapisparklyrstringistringrsystibbletidyrtidyselectutf8uuidvctrswithrxml2yaml