varbin: Optimal Binning of Continuous and Categorical Variables

Tool for easy and efficient discretization of continuous and categorical data. The package calculates the most optimal binning of a given explanatory variable with respect to a user-specified target variable. The purpose is to assign a unique Weight-of-Evidence value to each of the calculated binpoints in order to recode the original variable. The package allows users to impose certain restrictions on the functional form on the resulting binning while maximizing the overall information value in the original data. The package is well suited for logistic scoring models where input variables may be subject to restrictions such as linearity by e.g. regulatory authorities. An excellent source describing in detail the development of scorecards, and the role of Weight-of-Evidence coding in credit scoring is (Siddiqi 2006, ISBN: 978–0-471–75451–0). The package utilizes the discrete nature of decision trees and Isotonic Regression to accommodate the trade-off between flexible functional forms and maximum information value.

Version: 0.2.1
Imports: rpart
Published: 2018-10-12
Author: Daniel Safai
Maintainer: Daniel Safai <danielsafai at gmail.com>
License: GPL-2 | GPL-3 [expanded from: GPL (≥ 2)]
NeedsCompilation: no
CRAN checks: varbin results

Documentation:

Reference manual: varbin.pdf

Downloads:

Package source: varbin_0.2.1.tar.gz
Windows binaries: r-devel: varbin_0.2.1.zip, r-release: varbin_0.2.1.zip, r-oldrel: varbin_0.2.1.zip
macOS binaries: r-release (arm64): varbin_0.2.1.tgz, r-oldrel (arm64): varbin_0.2.1.tgz, r-release (x86_64): varbin_0.2.1.tgz

Linking:

Please use the canonical form https://CRAN.R-project.org/package=varbin to link to this page.