stddiff.spark - Calculate the Standardized Difference for Numeric, Binary and Category Variables in Apache Spark
Provides functions to compute standardized differences for numeric, binary, and categorical variables on Apache Spark DataFrames using 'sparklyr'. The implementation mirrors the methods used in the 'stddiff' package but operates on distributed data. See Zhicheng Du, Yuantao Hao (2022) <doi:10.32614/CRAN.package.stddiff> for reference.
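As a rough illustration of the quantity the description refers to, the sketch below computes the commonly used standardized difference for a numeric and a binary variable from per-group summary statistics, which is the kind of summary a distributed aggregation could produce. It is plain Python and does not use the stddiff.spark or sparklyr API; the function names `smd_numeric` and `smd_binary` are hypothetical, and the package's own conventions (for example, sign or absolute value) may differ.

```python
import math


def smd_numeric(mean_t, var_t, mean_c, var_c):
    """Standardized difference for a numeric variable.

    Takes the mean and variance of the treated (t) and control (c)
    groups. Hypothetical helper, not part of stddiff.spark.
    """
    # Pool the two group variances with equal weight.
    pooled_sd = math.sqrt((var_t + var_c) / 2.0)
    return (mean_t - mean_c) / pooled_sd


def smd_binary(p_t, p_c):
    """Standardized difference for a binary variable.

    Takes the proportion of ones in each group. Hypothetical helper,
    not part of stddiff.spark.
    """
    # A Bernoulli variable with proportion p has variance p * (1 - p).
    pooled_sd = math.sqrt((p_t * (1 - p_t) + p_c * (1 - p_c)) / 2.0)
    return (p_t - p_c) / pooled_sd


if __name__ == "__main__":
    # Group means 10 and 8, both with variance 4.
    print(smd_numeric(10.0, 4.0, 8.0, 4.0))
    # Proportions 0.6 and 0.4.
    print(smd_binary(0.6, 0.4))
```

Both functions raise `ZeroDivisionError` when the pooled variance is zero, for instance when a binary variable is constant in both groups; a caller would need to handle that case separately.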
apache-spark, descriptive-statistics, sparklyr
3.88 score 147 downloads