Statistical Modeling and Analysis for Database Marketing: Effective Techniques for Mining Big Data

by ;
Edition: 1st
Format: Hardcover
Pub. Date: 2003-05-28
Publisher(s): Chapman & Hall/
List Price: $99.95

Rent Textbook

Select for Price
There was a problem. Please try again later.

New Textbook

We're Sorry
Sold Out

Used Textbook

We're Sorry
Sold Out

eTextbook

We're Sorry
Not Available

How Marketplace Works:

  • This item is offered by an independent seller and not shipped from our warehouse
  • Item details like edition and cover design may differ from our description; see seller's comments before ordering.
  • Sellers much confirm and ship within two business days; otherwise, the order will be cancelled and refunded.
  • Marketplace purchases cannot be returned to eCampus.com. Contact the seller directly for inquiries; if no response within two days, contact customer service.
  • Additional shipping costs apply to Marketplace purchases. Review shipping costs at checkout.

Summary

Traditional statistical methods are limited in their ability to meet the modern challenge of mining large amounts of data. Data miners, analysts, and statisticians are searching for innovative new data mining techniques with greater predictive power, an attribute critical for reliable models and analyses.Statistical Modeling and Analysis for Database Marketing: Effective Techniques for Mining Big Data delivers a collection of successful database marketing methodologies for big data. This compendium solves common database marketing problems by applying new hybrid modeling techniques that combine traditional statistical and new machine learning methods. The book delivers a thorough analysis of these cutting-edge techniques, which include non-statistical machine learning and genetic intelligent hybrid models.By following the step-by-step procedures detailed in the text, database marketing professionals can learn how to apply the proper statistical techniques to any database marketing challenge. The practical case studies and examples provided involve real problems and real data, and are taken from a variety of industries, including banking, insurance, finance, retail, and telecommunications.

Table of Contents

Introduction
The Personal Computer and Statisticsp. 1
Statistics and Data Analysisp. 3
EDAp. 4
The EDA Paradigmp. 6
EDA Weaknessesp. 7
Small and Big Datap. 8
Data Size Characteristicsp. 8
Data Size: Personal Observation of Onep. 9
Data Mining Paradigmp. 9
Statistics and Machine Learningp. 11
Statistical Learningp. 12
Two Simple Data Mining Methods for Variable Assessment
Correlation Coefficientp. 15
Scatterplotsp. 17
Data Miningp. 18
Example #1p. 18
Example #2p. 21
Smoothed Scatterplotp. 21
General Association Testp. 26
Summaryp. 27
Logistic Regression: The Workhorse of Database Response Modeling
Logistic Regression Modelp. 32
Illustrationp. 32
Scoring a LRMp. 33
Case Studyp. 35
Candidate Predictor and Dependent Variablesp. 36
Logits and Logit Plotsp. 36
Logits for Case Studyp. 37
The Importance of Straight Datap. 38
Re-expressing for Straight Datap. 39
Ladder of Powersp. 39
Bulging Rulep. 40
Measuring Straight Datap. 41
Straight Data for Case Studyp. 42
Re-expressing FD2_OPENp. 44
Re-expressing INVESTMENTp. 44
Techniques When Bulging Rule Does Not Applyp. 46
Fitted Logit Plotp. 46
Smooth Predicted vs. Actual Plotp. 47
Re-expressing MOS_OPENp. 47
Smooth Predicted vs. Actual Plot for MOS_OPENp. 49
Assessing the Importance of Variablesp. 50
Computing the G Statisticp. 52
Importance of a Single Variablep. 53
Importance of a Subset of Variablesp. 53
Comparing the Importance of Different Subsets of Variablesp. 53
Important Variables for Case Studyp. 54
Importance of the Predictor Variablesp. 55
Relative Importance of the Variablesp. 56
Selecting the Best Subsetp. 57
Best Subset of Variables for Case Studyp. 58
Visual Indicators of Goodness of Model Predictionsp. 59
Smooth Residual by Score Groups Plotp. 59
Smooth Residual by Score Groups Plot for Case Studyp. 60
Smooth Actual vs. Predicted by Decile Groups Plotp. 62
Smooth Actual vs. Predicted by Decile Groups Plot for Case Studyp. 63
Smooth Actual vs. Predicted by Score Groups Plotp. 65
Smooth Actual vs. Predicted by Score Groups Plot for Case Studyp. 65
Evaluating the Data Mining Workp. 68
Comparison of Smooth Residual by Score Groups Plots: EDA vs. NonEDA Modelsp. 69
Comparison of Smooth Actual vs. Predicted by Decile Groups Plots: EDA vs. NonEDA Modelsp. 71
Comparison of Smooth Actual vs. Predicted by Score Groups Plots: EDA vs. NonEDA Modelsp. 71
Summary of the Data Mining Workp. 71
Smoothing a Categorical Variablep. 74
Smoothing FD_TYPE with CHAIDp. 75
Importance of CH_FTY_1 and CH_FTY_2p. 78
Additional Data Mining Work for Case Studyp. 78
Comparison of Smooth Residual by Score Group Plots: 4var- vs. 3var-EDA Modelsp. 79
Comparison of Smooth Actual vs. Predicted by Decile Groups Plots: 4var- vs. 3var-EDA Modelsp. 81
Comparison of Smooth Actual vs. Predicted by Score Groups Plots: 4var- vs. 3var-EDA Modelsp. 82
Final Summary of the Additional Data Mining Workp. 84
Summaryp. 85
Ordinary Regression: The Workhorse of Database Profit Modeling
Ordinary Regression Modelp. 87
Illustrationp. 88
Scoring A OLS Profit Modelp. 89
Mini Case Studyp. 91
Straight Data for Mini Case Studyp. 91
Re-expressing INCOMEp. 93
Re-expressing AGEp. 95
Smooth Predicted vs. Actual Plotp. 96
Assessing the Importance of Variablesp. 98
Defining the F Statistic and R-squaredp. 98
Importance of a Single Variablep. 99
Importance of a Subset of Variablesp. 99
Comparing the Importance of Different Subsets of Variablesp. 99
Important Variables for Mini Case Studyp. 100
Relative Importance of the Variablesp. 101
Selecting the Best Subsetp. 101
Best Subset of Variable for Case Studyp. 102
PROFIT Model with gINCOME and AGEp. 103
Best PROFIT Modelp. 106
Suppressor Variable AGEp. 106
Summaryp. 108
CHAID for Interpreting a Logistic Regression Model
Logistic Regression Modelp. 111
Database Marketing Response Model Case Studyp. 112
Odds Ratiop. 113
CHAIDp. 114
Proposed CHAID-Based Methodp. 114
Multivariable CHAID Treesp. 117
CHAID Market Segmentationp. 121
CHAID Tree Graphsp. 123
Summaryp. 126
The Importance of the Regression Coefficient
The Ordinary Regression Modelp. 129
Four Questionsp. 130
Important Predictor Variablesp. 130
P-Values and Big Datap. 132
Returning to Question #1p. 132
Predictor Variable's Effect on Predictionp. 133
The Caveatp. 134
Returning to Question #2p. 136
Ranking Predictor Variables by Effect On Predictionp. 136
Returning to Question #3p. 138
Returning to Question #4p. 138
Summaryp. 139
The Predictive Contribution Coefficient: A Measure of Predictive Importance
Backgroundp. 141
Illustration of Decision Rulep. 143
Predictive Contribution Coefficientp. 145
Calculation of Predictive Contribution Coefficientp. 146
Extra Illustration of Predictive Contribution Coefficientp. 148
Summaryp. 152
CHAID for Specifying a Model with Interaction Variables
Interaction Variablesp. 155
Strategy for Modeling with Interaction Variablesp. 156
Strategy Based on the Notion of a Special Pointp. 156
Example of a Response Model with an Interaction Variablep. 157
CHAID for Uncovering Relationshipsp. 159
Illustration of CHAID for Specifying a Modelp. 160
An Exploratory Lookp. 164
Database Implicationp. 165
Summaryp. 166
Market Segment Classification Modeling with Logistic Regression
Binary Logistic Regressionp. 169
Necessary Notationp. 170
Polychotomous Logistic Regression Modelp. 171
Model Building with PLRp. 172
Market Segmentation Classification Modelp. 172
Survey of Cellular Phone Usersp. 173
CHAID Analysisp. 174
CHAID Tree Graphsp. 177
Market Segment Classification Modelp. 180
Summaryp. 182
CHAID as a Method for Filling in Missing Values
Introduction to the Problem of Missing Datap. 185
Missing-Data Assumptionp. 188
CHAID Imputationp. 189
Illustrationp. 190
CHAID Mean-Value Imputation for a Continuous Variablep. 191
Many Mean-Value CHAID Imputations for a Continuous Variablep. 192
Regression-Tree Imputation for LIF_DOLp. 193
CHAID Most-Likely Category Imputation for a Categorical Variablep. 196
CHAID Most-Likely Category Imputation for GENDERp. 196
Classification Tree Imputation for GENDERp. 198
Summaryp. 200
Identifying Your Best Customers: Descriptive, Predictive and Look-Alike Profiling
Some Definitionsp. 203
Illustration of a Flawed Targeting Effortp. 204
Well-Defined Targeting Effortp. 205
Predictive Profilesp. 208
Continuous Treesp. 212
Look-Alike Profilingp. 215
Look-Alike Tree Characteristicsp. 216
Summaryp. 219
Assessment of Database Marketing Models
Accuracy for Response Modelp. 221
Accuracy for Profit Modelp. 222
Decile Analysis and Cum Lift for Response Modelp. 223
Decile Analysis and Cum Lift for Profit Modelp. 227
Precision for Response Modelp. 229
Precision for Profit Modelp. 231
Construction of SWMADp. 231
Separability for Response and Profit Modelsp. 233
Guidelines for Using Cum Lift, HL/SWMAD and CVp. 233
Summaryp. 234
Bootstrapping in Database Marketing: A New Approach for Validating Models
Traditional Model Validationp. 237
Illustrationp. 238
Three Questionsp. 239
The Bootstrapp. 240
Traditional Construction of Confidence Intervalsp. 241
How to Bootstrapp. 242
Simple Illustrationp. 242
Bootstrap Decile Analysis Validationp. 244
Another Questionp. 245
Bootstrap Assessment of Model Implementation Performancep. 246
Illustrationp. 249
Bootstrap Assessment of Model Efficiencyp. 253
Summaryp. 255
Visualization of Database Models
Brief History of the Graphp. 257
Star Graph Basicsp. 258
Illustrationp. 260
Star Graphs for Single Variablesp. 261
Star Graphs for Many Variables Considered Jointlyp. 262
Profile Curves Methodp. 264
Profile Curves Basicsp. 264
Profile Analysisp. 265
Illustrationp. 265
Profile Curves for RESPONSE Modelp. 269
Decile-Group Profile Curvesp. 271
Summaryp. 274
SAS Code for Star Graphs for Each Demographic Variable about the Decilesp. 275
SAS Code for Star Graphs for Each Decile about the Demographic Variablesp. 277
SAS Code for Profile Curves: All Decilesp. 281
Genetic Modeling in Database Marketing: The GenIQ Model
What Is Optimization?p. 285
What Is Genetic Modeling?p. 286
Genetic Modeling: An Illustrationp. 287
Reproductionp. 290
Crossoverp. 290
Mutationp. 292
Parameters for Controlling a Genetic Model Runp. 293
Genetic Modeling: Strengths and Limitationsp. 293
Goals of Modeling in Database Marketingp. 294
The GenIQ Response Modelp. 295
The GenIQ Profit Modelp. 295
Case Study--Response Modelp. 296
Case Study--Profit Modelp. 299
Summaryp. 302
Finding the Best Variables for Database Marketing Models
Backgroundp. 303
Weakness in the Variable Selection Methodsp. 305
Goals of Modeling in Database Marketingp. 307
Variable Selection with GenIQp. 308
GenIQ Modelingp. 310
GenIQ-Structure Identificationp. 312
GenIQ Variable Selectionp. 316
Nonlinear Alternative to Logistic Regression Modelp. 316
Summaryp. 321
Interpretation of Coefficient-Free Models
The Linear Regression Coefficientp. 323
Illustration for the Simple Ordinary Regression Modelp. 324
Illustration for the Simple Logistic Regression Modelp. 325
The Quasi-Regression Coefficient for Simple Regression Modelsp. 326
Illustration of Quasi-RC for the Simple Ordinary Regression Modelp. 326
Illustration of Quasi-RC for the Simple Logistic Regression Modelp. 328
Illustration of Quasi-RC for Nonlinear Predictionsp. 331
Partial Quasi-RC for the Everymodelp. 331
Calculating the Partial Quasi-RC for the Everymodelp. 333
Illustration for the Multiple Logistic Regression Modelp. 335
Quasi-RC for a Coefficient-Free Modelp. 340
Illustration of Quasi-RC for a Coefficient-Free Modelp. 341
Summaryp. 348
Indexp. 351
Table of Contents provided by Rittenhouse. All Rights Reserved.

An electronic version of this book is available through VitalSource.

This book is viewable on PC, Mac, iPhone, iPad, iPod Touch, and most smartphones.

By purchasing, you will be able to view this book online, as well as download it, for the chosen number of days.

Digital License

You are licensing a digital product for a set duration. Durations are set forth in the product description, with "Lifetime" typically meaning five (5) years of online access and permanent download to a supported device. All licenses are non-transferable.

More details can be found here.

A downloadable version of this book is available through the eCampus Reader or compatible Adobe readers.

Applications are available on iOS, Android, PC, Mac, and Windows Mobile platforms.

Please view the compatibility matrix prior to purchase.