Data defines the model by dint of genetic programming, producing the best decile table.


Creating a Variable List of Big Data
Bruce Ratner, PhD
Live chat by Boldchat
Live chat by Boldchat

The first processing step of BIG data is getting a variable list - in text format - of those 100s - 1000s variables.  This report provides a SAS-code program for creating a variable list of a big (even small, if one is lazy) dataset. The program should be a welcomed entry in the tool kit of data analysts who frequently work with BIG data.


********** SAS-code Program **********

Data set IN is found in Technical Report #6.

proc contents data= IN
out = vars (keep = name type)
noprint;
run;
proc sql noprint;
select name into : varlist separated by ' '
from vars;
quit;
%put _global_ ;

********** end of program **********

The variable list is found in the log:

GLOBAL VARLIST GENDER GENDER_ GENDER_F GENDER_M ID MARITAL MARITAL_ MARITAL_D MARITAL_M MARITAL_S

For more information about this article, call Bruce Ratner at 516.791.3544 or 1 800 DM STAT-1; or e-mail at br@dmstat1.com.
Sign-up for a free GenIQ webcast: Click here.