In statistics, Naive Bayes classifiers are a family of simple "probabilistic classifiers" based on applying Bayes' theorem with strong (naive) independence assumptions between the features. They are among the simplest Bayesian network models, but when coupled with kernel density estimation they can achieve higher accuracy levels.
Naive Bayes classifiers are highly scalable, requiring a number of parameters linear in the number of variables (features/predictors) in a learning problem. Maximum-likelihood training can be done by evaluating a closed-form expression, which takes linear time, rather than by expensive iterative approximation as used for many other types of classifiers.
In the statistics and computer science literature, naive Bayes models are known under a variety of names, including simple Bayes and independence Bayes. All these names reference the use of Bayes' theorem in the classifier's decision rule, but naïve Bayes is not (necessarily) a Bayesian method.
In computer graphics, a digital differential analyzer (DDA) is hardware or software used for interpolation of variables over an interval between start and end point. DDAs are used for rasterization of lines, triangles and polygons. They can be extended to non linear functions, such as perspective correct texture mapping, quadratic curves, and traversing voxels.
Here we are going to implement the Naive Bayes (caret methods "naive_bayes" and "nb") and DDA algorithms using the Telecom Churn dataset.
library(DBI)
library(corrgram)
library(caret)
library(gridExtra)
library(ggpubr)
Today is a good day to start parallelizing your code. The common motivation behind parallel computing is that some computation is taking too long. For some people that means any computation taking more than 3 minutes — this is because parallelization is incredibly simple, and most tasks that take time are embarrassingly parallel. Here are a few common tasks that fit the description:
# Register a parallel backend so caret can evaluate resampling folds
# concurrently. The original code loaded BOTH doParallel and doMC
# unconditionally; doMC relies on forking and does not work on Windows,
# so choose the backend based on the operating system instead.
if (.Platform$OS.type == "windows") {
  # Windows: socket-based cluster via doParallel
  library(doParallel)
  cl <- makeCluster(detectCores(), type = 'PSOCK')
  registerDoParallel(cl)
} else {
  # Mac OSX and UNIX-like systems: fork-based backend via doMC
  library(doMC)
  registerDoMC(cores = 4)
}
# The working directory must contain the CSV file:
# getwd()
# setwd("...YOUR WORKING DIRECTORY WITH A DATASET...")
# getwd()

# Load the dataset
dataSet <- read.csv("TelcoCustomerChurnDataset.csv", header = TRUE, sep = ',')

# Column names of the data frame
colnames(dataSet)

# First 10 rows of the dataset
head(dataSet, n = 10)

# Last 10 rows of the dataset
tail(dataSet, n = 10)

# Dimension of the dataset (rows x columns)
dim(dataSet)

# Frequency table of the column classes
table(unlist(lapply(dataSet, class)))
# Check the data type of each individual column.
# The original code called data.class() once per column (21 nearly
# identical statements); applying it over the vector of column names
# removes the repetition and returns one named character vector.
inspectedCols <- c("Account_Length", "Vmail_Message", "Day_Mins", "Eve_Mins",
                   "Night_Mins", "Intl_Mins", "CustServ_Calls", "Intl_Plan",
                   "Vmail_Plan", "Day_Calls", "Day_Charge", "Eve_Calls",
                   "Eve_Charge", "Night_Calls", "Night_Charge", "Intl_Calls",
                   "Intl_Charge", "State", "Phone", "Churn")
vapply(dataSet[inspectedCols], data.class, character(1))
# Encode the categorical columns as numeric codes.
# Since R 4.0, read.csv() returns character columns (stringsAsFactors is
# FALSE by default), and as.numeric() on a character vector like "yes"/"no"
# produces NA with a warning. Converting through factor() first yields
# stable integer codes, and is equivalent to the original behavior when
# the columns were read as factors under older R versions.
dataSet$Intl_Plan  <- as.numeric(factor(dataSet$Intl_Plan))
dataSet$Vmail_Plan <- as.numeric(factor(dataSet$Vmail_Plan))
dataSet$State      <- as.numeric(factor(dataSet$State))
# Re-check the data types of each column after the conversion
table(unlist(lapply(dataSet, class)))
# Count missing values per row
rowSums(is.na(dataSet))
# Count missing values per column
colSums(is.na(dataSet))

# Display the missing-data pattern with the mice package
library(mice)
md.pattern(dataSet)

# Visualize missingness with the VIM package
library(VIM)
missingness_plot <- aggr(
  dataSet,
  col = c('navyblue', 'yellow'),
  numbers = TRUE,
  sortVars = TRUE,
  labels = names(dataSet[1:21]),
  cex.axis = .9,
  gap = 3,
  ylab = c("Missing data", "Pattern")
)
After this inspection, we can claim that the dataset contains no missing values.
# Names of the numeric-typed columns: every column in 1:20 except
# column 8 (the Churn class label).
numericalCols <- names(dataSet)[setdiff(1:20, 8)]
Difference between the lapply and sapply functions (we will use both in the next two cells):
We use lapply - when we want to apply a function to each element of a list in turn and get a list back.
We use sapply - when we want to apply a function to each element of a list in turn, but we want a vector back, rather than a list.
# Column-wise summary statistics, first as lists via lapply() ...
lapply(dataSet[numericalCols], sum)      # sum
lapply(dataSet[numericalCols], mean)     # mean
lapply(dataSet[numericalCols], median)   # median
lapply(dataSet[numericalCols], min)      # min
lapply(dataSet[numericalCols], max)      # max
lapply(dataSet[numericalCols], length)   # length

# ... then as named vectors via sapply()
sapply(dataSet[numericalCols], sum)      # sum
sapply(dataSet[numericalCols], mean)     # mean
sapply(dataSet[numericalCols], median)   # median
sapply(dataSet[numericalCols], min)      # min
sapply(dataSet[numericalCols], max)      # max
sapply(dataSet[numericalCols], length)   # length
In the next few cells, you will find three different options on how to aggregate data.
# OPTION 1: aggregate() over all numeric variables at once, grouped by Churn
aggregate(dataSet[numericalCols], by = list(dataSet$Churn), FUN = summary)

# OPTION 2: aggregate() one variable at a time
aggregate(dataSet$Intl_Mins,  by = list(dataSet$Churn), FUN = summary)
aggregate(dataSet$Day_Mins,   by = list(dataSet$Churn), FUN = summary)
aggregate(dataSet$Night_Mins, by = list(dataSet$Churn), FUN = summary)

# OPTION 3: by() instead of aggregate(); dataSet[8] is the grouping column
# (presumably Churn, matching OPTIONs 1 and 2 -- TODO confirm column order)
by(dataSet$Intl_Mins,  dataSet[8], summary)
by(dataSet$Day_Mins,   dataSet[8], summary)
by(dataSet$Night_Mins, dataSet[8], summary)
# Correlations / covariances among a subset of the numeric variables
library(Hmisc)
corCols <- c(2, 5, 11, 13, 16, 18)
cor(dataSet[corCols], use = "complete.obs", method = "kendall")
cov(dataSet[corCols], use = "complete.obs")
# Pearson correlations together with significance levels
rcorr(as.matrix(dataSet[corCols]), type = "pearson")
# Pie chart of the class distribution, with sample sizes in the labels
class_counts <- table(dataSet$Churn)
slice_labels <- paste(names(class_counts), "\n", class_counts, sep = "")
pie(class_counts,
    labels = slice_labels,
    col = rainbow(length(slice_labels)),
    main = "Pie Chart of Classes\n (with sample sizes)")

# Bar plots of the class variable: counts and proportions,
# vertical and horizontal
par(mfrow = c(1, 1))
barplot(table(dataSet$Churn),
        ylab = "Count", col = c("darkblue", "red"))
barplot(prop.table(table(dataSet$Churn)),
        ylab = "Proportion", col = c("darkblue", "red"))
barplot(table(dataSet$Churn),
        xlab = "Count", horiz = TRUE, col = c("darkblue", "red"))
barplot(prop.table(table(dataSet$Churn)),
        xlab = "Proportion", horiz = TRUE, col = c("darkblue", "red"))
# Scatterplot matrix from the gclus package
library(gclus)
plot_data <- dataSet[c(2, 5, 11, 13, 16, 18)]
abs_corr  <- abs(cor(plot_data))    # absolute pairwise correlations
panel_col <- dmat.color(abs_corr)   # panel colors derived from correlations
# Reorder so the most correlated variables sit closest to the diagonal
var_order <- order.single(abs_corr)
cpairs(plot_data, var_order,
       panel.colors = panel_col, gap = .5,
       main = "Variables Ordered and Colored by Correlation")

# Correlogram of the same variables
corrgram(dataSet[c(2, 5, 11, 13, 16, 18)],
         order = TRUE,
         lower.panel = panel.shade, upper.panel = panel.pie,
         text.panel = panel.txt, main = " ")
# More graphs on correlations among the data
# Using "Hmisc"
corr_res <- rcorr(as.matrix(dataSet[, c(2, 5, 11, 13, 16, 18)]))
# Correlation coefficients
corr_res$r
# p-values
corr_res$P

# Using "corrplot"
library(corrplot)
library(RColorBrewer)
corrplot(corr_res$r, type = "upper", order = "hclust",
         col = brewer.pal(n = 8, name = "RdYlBu"),
         tl.col = "black", tl.srt = 45)
corrplot(corr_res$r, type = "lower", order = "hclust",
         col = brewer.pal(n = 8, name = "RdYlBu"),
         tl.col = "black", tl.srt = 45)

# Using PerformanceAnalytics
library(PerformanceAnalytics)
numeric_subset <- dataSet[, c(2, 5, 11, 13, 16, 18)]
chart.Correlation(numeric_subset, histogram = TRUE, pch = 19)

# Using a colored heatmap
heat_palette <- colorRampPalette(c("blue", "white", "red"))(20)
heatmap(x = corr_res$r, col = heat_palette, symm = TRUE)
We should notice that Night_Mins and Night_Charge have a strong, linear, positive relationship.
# Split the data into a 75% training / 25% testing partition, stratified
# on the Churn class. set.seed() makes the partition reproducible across
# runs -- the original code produced a different split every time.
set.seed(7)
train_test_index <- createDataPartition(dataSet$Churn, p = 0.75, list = FALSE)
# Keep columns 1:20 as the modeling frame (Churn is among them; column 21
# is dropped -- presumably a non-predictive field, TODO confirm)
training_dataset <- dataSet[, c(1:20)][train_test_index, ]
testing_dataset  <- dataSet[, c(1:20)][-train_test_index, ]
dim(training_dataset)
dim(testing_dataset)
# Cross-validation settings shared by every model below:
# 2x repeated 2-fold CV, grid search, parallel fold evaluation.
control <- trainControl(method = "repeatedcv", # repeatedcv / adaptive_cv
                        number = 2, repeats = 2,
                        verbose = TRUE, search = "grid",
                        allowParallel = TRUE)
# Model-selection metric and default tuning-grid size.
metric <- "Accuracy"
# Fixed: assignment used '=', which at top level is reserved for
# argument binding by convention -- use '<-' for assignment.
tuneLength <- 2
# List the models caret supports, and details for the three used here
names(getModelInfo())
getModelInfo("naive_bayes"); getModelInfo("nb"); getModelInfo("dda");
# Baseline fits: default tuning, no preprocessing

# naive_bayes
fit.naive_bayes <- caret::train(
  Churn ~ ., data = training_dataset, method = "naive_bayes",
  metric = metric, trControl = control, verbose = TRUE
)
print(fit.naive_bayes)

# nb
fit.nb <- caret::train(
  Churn ~ ., data = training_dataset, method = "nb",
  metric = metric, trControl = control, verbose = TRUE
)
print(fit.nb)

# dda
fit.dda <- caret::train(
  Churn ~ ., data = training_dataset, method = "dda",
  metric = metric, trControl = control, verbose = TRUE
)
print(fit.dda)
# Same three models, now with centering and scaling of the predictors

# naive_bayes
fit.naive_bayes_preProc <- caret::train(
  Churn ~ ., data = training_dataset, method = "naive_bayes",
  metric = metric, trControl = control,
  preProc = c("center", "scale"), verbose = TRUE
)
print(fit.naive_bayes_preProc)

# nb
fit.nb_preProc <- caret::train(
  Churn ~ ., data = training_dataset, method = "nb",
  metric = metric, trControl = control,
  preProc = c("center", "scale"), verbose = TRUE
)
print(fit.nb_preProc)

# dda
fit.dda_preProc <- caret::train(
  Churn ~ ., data = training_dataset, method = "dda",
  metric = metric, trControl = control,
  preProc = c("center", "scale"), verbose = TRUE
)
print(fit.dda_preProc)
# Same three models with preprocessing plus an automatic tuning grid
# of size tuneLength

# naive_bayes
fit.naive_bayes_automaticGrid <- caret::train(
  Churn ~ ., data = training_dataset, method = "naive_bayes",
  metric = metric, trControl = control,
  preProc = c("center", "scale"),
  tuneLength = tuneLength, verbose = TRUE
)
print(fit.naive_bayes_automaticGrid)

# nb
fit.nb_automaticGrid <- caret::train(
  Churn ~ ., data = training_dataset, method = "nb",
  metric = metric, trControl = control,
  preProc = c("center", "scale"),
  tuneLength = tuneLength, verbose = TRUE
)
print(fit.nb_automaticGrid)

# dda
fit.dda_automaticGrid <- caret::train(
  Churn ~ ., data = training_dataset, method = "dda",
  metric = metric, trControl = control,
  preProc = c("center", "scale"),
  tuneLength = tuneLength, verbose = TRUE
)
print(fit.dda_automaticGrid)
# naive_bayes with a manually specified tuning grid.
# Fixed: usekernel is a logical parameter -- the original supplied the
# strings "FALSE"/"TRUE", which only work through implicit coercion and
# make the grid a character column. 1:10 replaces the equivalent
# seq(from = 1, to = 10, by = 1) calls.
grid <- expand.grid(usekernel = c(FALSE, TRUE),
                    laplace = 1:10,
                    adjust = 1:10)
fit.naive_bayes_manualGrid <- caret::train(Churn~., data=training_dataset, method="naive_bayes",
                                           metric=metric,
                                           trControl=control,
                                           preProc=c("center", "scale"),
                                           tuneGrid = grid,
                                           verbose = TRUE
)
print(fit.naive_bayes_manualGrid)
plot(fit.naive_bayes_manualGrid)
# nb (klaR NaiveBayes) with a manually specified tuning grid:
# fL = Laplace correction, usekernel = kernel density vs. normal density,
# adjust = kernel bandwidth factor.
# Fixed: usekernel must be logical, not the strings "FALSE"/"TRUE".
grid <- expand.grid(usekernel = c(FALSE, TRUE),
                    fL = seq(from = 1, to = 10, by = 3),
                    adjust = seq(from = 1, to = 10, by = 3))
fit.nb_manualGrid <- caret::train(Churn~., data=training_dataset, method="nb",
                                  metric=metric,
                                  trControl=control,
                                  preProc=c("center", "scale"),
                                  tuneGrid = grid,
                                  verbose = TRUE
)
print(fit.nb_manualGrid)
# dda with a manual tuning grid over model type and shrinkage strategy
dda_models     <- c("Linear", "Quadratic")
dda_shrinkages <- c("Mean", "None", "Variance")
grid <- expand.grid(model = dda_models, shrinkage = dda_shrinkages)
fit.dda_manualGrid <- caret::train(
  Churn ~ ., data = training_dataset, method = "dda",
  metric = metric, trControl = control,
  preProc = c("center", "scale"),
  tuneGrid = grid, verbose = TRUE
)
print(fit.dda_manualGrid)
plot(fit.dda_manualGrid)
# Compare the resampling distributions of the trained models.
# The dda fits stay commented out, as in the original analysis.
model_list <- list(
  trained_Model_1  = fit.naive_bayes,
  trained_Model_2  = fit.nb,
  # trained_Model_3  = fit.dda,
  trained_Model_4  = fit.naive_bayes_preProc,
  trained_Model_5  = fit.nb_preProc,
  # trained_Model_6  = fit.dda_preProc,
  trained_Model_7  = fit.naive_bayes_automaticGrid,
  trained_Model_8  = fit.nb_automaticGrid,
  # trained_Model_9  = fit.dda_automaticGrid,
  trained_Model_10 = fit.naive_bayes_manualGrid,
  trained_Model_11 = fit.nb_manualGrid
  # trained_Model_12 = fit.dda_manualGrid
)
results <- resamples(model_list)
summary(results)
dotplot(results)
bwplot(results)
# Evaluate the chosen model on the held-out test set.
best_trained_model <- fit.naive_bayes_automaticGrid
predictions <- predict(best_trained_model, newdata = testing_dataset)
res_ <- caret::confusionMatrix(table(predictions, testing_dataset$Churn))
# Fixed: print() renders "\n" literally; cat() emits the newline.
cat("Results from the BEST trained model ... ...\n")
print(round(res_$overall, digits = 3))
# Persist the trained model for later reuse
# getwd()
saveRDS(best_trained_model, "./best_trained_model.rds")
# Load the saved model back from disk
# getwd()
saved_model <- readRDS("./best_trained_model.rds")
print(saved_model)

# Make predictions on "new data" using the final model.
# NOTE(review): dataSet is the full dataset, including the training rows,
# so these numbers overstate out-of-sample performance -- confirm intent.
final_predictions <- predict(saved_model, dataSet[1:20])
# Fixed: the original computed confusionMatrix() twice with identical
# arguments; compute it once and reuse the result.
res_ <- confusionMatrix(table(final_predictions, dataSet$Churn))
print(res_)
# Fixed: print() renders "\n" literally; cat() emits the newline.
cat("Results from the BEST trained model ... ...\n")
print(round(res_$overall, digits = 3))
print(res_$table)
fourfoldplot(res_$table, color = c("#CC6666", "#99CC99"),
             conf.level = 0, margin = 1, main = "Confusion Matrix")