Data defines the model by dint of genetic programming, producing the best decile table.


The Importance of Straight Data: Simplicity and Desirability for Good Model Building Practice
Bruce Ratner, Ph.D.

The purpose of this article is to show the importance of straight data for the simplicity and desirability it brings for good model building practice. I illustrate the topic sentence by giving details of what to do when an observed relationship between two variables depicted in a scatterplot is masking an acute underlying relationship. Data mining is employed to unmask and straighten the obtuse relationship. The correlation coefficient is used to quantify the strength of the exposed relationship, which possesses straight-line simplicity.

For an interesting read, click here.


For more information about this article, call Bruce Ratner at 516.791.3544; or e-mail at br@dmstat1.com.
Sign-up for a free GenIQ webcast: Click here.