Package: stddiff.spark 1.0

Alicja Januszkiewicz
stddiff.spark: Calculate the Standardized Difference for Numeric, Binary and Category Variables in Apache Spark
Provides functions to compute standardized differences for numeric, binary, and categorical variables on Apache Spark DataFrames using 'sparklyr'. The implementation mirrors the methods used in the 'stddiff' package but operates on distributed data. See Zhicheng Du, Yuantao Hao (2022) <doi:10.32614/CRAN.package.stddiff> for reference.
Authors:
stddiff.spark_1.0.tar.gz
stddiff.spark_1.0.zip(r-4.7)stddiff.spark_1.0.zip(r-4.6)stddiff.spark_1.0.zip(r-4.5)
stddiff.spark_1.0.tgz(r-4.6-any)stddiff.spark_1.0.tgz(r-4.5-any)
stddiff.spark_1.0.tar.gz(r-4.7-any)stddiff.spark_1.0.tar.gz(r-4.6-any)
stddiff.spark_1.0.tgz(r-4.6-emscripten)
manual.pdf |manual.html✨
card.svg |card.png
stddiff.spark/json (API)
NEWS
| # Install 'stddiff.spark' in R: |
| install.packages('stddiff.spark', repos = c('https://alicja-januszkiewicz.r-universe.dev', 'https://cloud.r-project.org')) |
Bug tracker:https://github.com/alicja-januszkiewicz/stddiff.spark/issues
apache-sparkdescriptive-statisticssparklyr
Last updated from:184e3b2b83. Checks:9 OK. Indexed: yes.
| Target | Result | Time | Files | Syslog |
|---|---|---|---|---|
| linux-devel-x86_64 | OK | 122 | ||
| source / vignettes | OK | 165 | ||
| linux-release-x86_64 | OK | 128 | ||
| macos-release-arm64 | OK | 125 | ||
| macos-oldrel-arm64 | OK | 156 | ||
| windows-devel | OK | 88 | ||
| windows-release | OK | 74 | ||
| windows-oldrel | OK | 64 | ||
| wasm-release | OK | 91 |
Exports:stddiff.binarystddiff.categorystddiff.numeric
Dependencies:askpassblobclicodetoolsconfigcpp11curlDBIdbplyrdplyrgenericsglobalsgluehttrjsonlitelifecyclemagrittrmimeopensslpillarpkgconfigpurrrR6rlangrstudioapisparklyrstringistringrsystibbletidyrtidyselectutf8uuidvctrswithrxml2yaml
Readme and manuals
Help Manual
| Help page | Topics |
|---|---|
| Compute Standardized Differences for Binary Variables (Spark) | stddiff.binary |
| Compute Standardized Differences for Categorical Variables (Spark) | stddiff.category |
| Compute Standardized Differences for Numeric Variables (Spark) | stddiff.numeric |