Data defines the model by dint of genetic programming, producing the best decile table.

Finding the Best Variables for Database Marketing Models
Bruce Ratner, Ph.D.

Finding the best possible subset of variables to put in a model has been a frustrating exercise. Many methods of variable selection exist, but none of them is perfect. Furthermore, none use a criterion that addresses the specific needs of direct/database marketing (DM) models. The purpose of this article to is to present a new methodology – the GenIQ Model© – that uses the machine learning aprroach of genetic programming to isolate the variables. Pointedly, the GenIQ Model automatically determines the best set of predictor variables (from the original variables, and newly constructed genetically data-mined variables) based on a virtually unbiased assessment of all variables under consideration, an achievement not possible with statistical methods. Most significantly, genetic modeling is used to address the specific needs of DM models, viz., optimizing the decile table, which has trandscended its DM origin, and now serves as a universal measure of model performance. Moreover, GenIQ offers exceptional predictions with minimal error variance, and a unique feature accommodating dirty and incomplete data. GenIQ can handle both classification (e.g., target yes-no response variable) and regression (e.g., target continuous sales variable) problems with categorical, ordinal and continuous candidate predictor variables. Case studies are reported showing the potential power, and future prominence of GenIQ in the data analyst's toolkit.

For more information about this article, call me at 516.791.3544, or e-mail,
My publisher owns the copyright of the article, about which this abstract addresses. The article will appear in my forthcoming book.
My publisher has granted me permission to discuss orally the article's content, but by no means provide an outline, draft or proof-ready of the article.

Sign-up for a free GenIQ webcast: Click here.