GLOSSARY (MARKETING RESEARCH)

ANALYSIS OF VARIANCE (ANOVA)
  • ANOVA SHOWS WHETHER A VARIABLE IS RELATED TO ONE OR TWO GROUP VARIABLES.
  • ANOVA ALSO SHOWS IF MULTIPLE MEASURES OF A NUMERIC VARIABLE DIFFER FROM EACH OTHER MORE THAN COULD BE EXPECTED DUE TO CHANCE.
  • THERE ARE 2 TYPES OF ANOVA.
  1. ONE-WAY ANOVA SHOWS HOW A GROUP MEMBERSHIP VARIABLE AFFECTS THE VALUES OF ANOTHER VARIABLE. THE VARIABLE WHOSE VARIATION YOU WOULD LIKE TO ANALYZE IS CALLED THE DEPENDENT VARIABLE (TO WHAT EXTENT DO ITS ANSWERS DEPEND ON THE GROUP MEMBERSHIP VARIABLE). IT MUST BE A NUMERIC VARIABLE (THE KIND IN WHICH A NUMBER IS THE ACTUAL ANSWER). RATINGS, DOLLAR AMOUNTS, AND QUANTITIES ARE SOME EXAMPLES.
  2. TWO-WAY ANOVA SHOWS HOW GROUP MEMBERSHIP VARIABLES AFFECT THE VALUES OF ANOTHER VARIABLE. THE TWO-WAY METHOD LETS YOU EXAMINE INTERACTION EFFECTS. THESE ARE THE EFFECTS THAT TWO GROUP VARIABLES MAY HAVE IN COMBINATION, APART FROM ANY EFFECTS EACH MAY HAVE SEPARATELY. THE INTERACTION EFFECT CAN SOMETIMES UNCOVER IMPORTANT ASPECTS OF THE RELATIONSHIPS BETWEEN THREE VARIABLES.

CENTRAL LIMIT THEOREM
AS THE SAMPLE SIZE (NUMBER OF OBSERVATIONS IN EACH SAMPLE) GETS LARGE ENOUGH, THE SAMPLING DISTRIBUTION OF THE MEAN CAN BE APPROXIMATED BY THE NORMAL DISTRIBUTION . THIS IS TRUE REGARDLESS OF THE SHAPE OF THE DISTRIBUTION OF THE INDIVIDUAL VALUES IN THE POPULATION.


A GREAT DEAL OF STATISTICAL RESEARCH HAS GONE INTO THIS ISSUE. AS A GENERAL RULE, STATISTICIANS HAVE FOUND THAT FOR MANY POPULATION DISTRIBUTIONS, ONCE THE SAMPLE SIZE IS AT LEAST 30, THE SAMPLING DISTRIBUTION OF THE MEAN WILL BE APPROXIMATELY NORMAL. HOWEVER, WE MAY BE ABLE TO APPLY THE CENTRAL LIMIT THEOREM FOR EVEN SMALLER SAMPLE SIZES IF A GREAT DEAL OF INFORMATION IS ALREADY KNOWN ABOUT THE TARGET POPULATION.

CLUSTER ANALYSIS
  • CLUSTER ANALYSIS IS USED FOR CLASSIFYING OBJECTS OR CASES, AND SOMETIMES VARIABLES, INTO RELATIVELY HOMOGENEOUS GROUPS.
  • THE GROUPS OF CLUSTERS ARE SUGGESTED BY THE DATA AND ARE NOT DEFINED A PRIORI.
  • RESEARCHERS AT PENN AND ASSOCIATES SELECT VARIABLES ON WHICH THE CLUSTERING IS DONE.

    

DATA COLLECTION METHODS
THERE ARE SEVERAL DATA COLLECTION METHODS AND EACH HAVE VARIOUS ADVANTAGES.
SPEED
  • EMAIL AND WEB PAGE SURVEYS ARE THE FASTEST METHODS, FOLLOWED BY TELEPHONE INTERVIEWING. INTERVIEWING BY MAIL IS THE SLOWEST.

COST
  • PERSONAL INTERVIEWS ARE THE MOST EXPENSIVE FOLLOWED BY TELEPHONE AND THEN MAIL. EMAIL AND WEB PAGE SURVEYS ARE THE LEAST EXPENSIVE FOR LARGE SAMPLES.

INTERNET USAGE
  • EMAIL AND WEB PAGE SURVEYS OFFER FANTASTIC ADVANTAGES - COST, SPEED, AND DETAIL.

SENSITIVE QUESTIONS
  • PEOPLE ARE MORE LIKELY TO ANSWER SENSITIVE QUESTIONS WHEN INTERVIEWED DIRECTLY.

FACTOR ANALYSIS
  • FACTOR ANALYSIS IS A CLASS OF PROCEDURES USED FOR REDUCING AND SUMMARIZING DATA.
  • EACH VARIABLE IS EXPRESSED AS A LINEAR COMBINATION OF THE UNDERLYING FACTORS. LIKEWISE, THE FACTORS THEMSELVES CAN BE EXPRESSED AS LINEAR COMBINATIONS OF THE OBSERVED VARIABLES. THE NUMBER OF FACTORS THAT SHOULD BE EXTRACTED CAN BE DETERMINED A PRIORI.
  • A ROTATION (VARIMAX) TRANSFORMS THE FACTOR MATRIX MAKING IT SIMPLER AND EASIER TO INTERPRET.


THE NORMAL DISTRIBUTION

    

  • THE NORMAL DISTRIBUTION IS BELL-SHAPED AND SYMMETRICAL IN APPEARANCE.
  • ITS MEASURES OF CENTRAL TENDENCY (MEAN, MEDIAN, AND MODE) ARE ALL IDENTICAL.
  • NUMEROUS CONTINUOUS PHENOMENA SEEM TO FOLLOW OR CAN APPROXIMATE THE NORMAL DISTRIBUTION.
  • THE NORMAL DISTRIBUTION PROVIDES THE BASIS FOR CLASSICAL STATISTICAL INFERENCE BECAUSE OF ITS RELATIONSHIP TO THE CENTRAL LIMIT THEOREM.
  • ITS "MIDDLE SPREAD" IS EQUAL TO 1.33 STANDARD DEVIATIONS. THIS MEANS THAT THE INTER-QUARTILE RANGE IS CONTAINED WITHIN AN INTERVAL OF TWO-THIRDS OF A STANDARD DEVIATION BELOW THE MEAN TO TWO-THIRDS OF A STANDARD DEVIATION ABOVE THE MEAN.
  • THE NORMAL DISTRIBUTION IS DEFINED BY THE POPULATION MEAN (m) AND THE POPULATION STANDARD DEVIATION (s).
  • ANY NORMAL RANDOM VARIABLE X CAN BE CONVERTED TO A STANDARDIZED NORMAL RANDOM VARIABLE z BY THE FORMULA:
z = X - m
         s

(THE RANDOM VARIABLE IS ALWAYS NORMALLY DISTRIBUTED WITH A MEAN OF 0 AND A STANDARD DEVIATION OF 1).

REGRESSION
THE CORRELATION COEFFICIENT, r, MEASURES THE LINEAR ASSOCIATION BETWEEN TWO METRIC (INTERVAL OR RATIO SCALED) VARIABLES. ITS SQUARE MEASURES THE PROPORTION OF VARIATION IN ONE VARIABLE EXPLAINED BY THE OTHER. THE PARTIAL CORRELATION COEFFICIENT MEASURES ADDITIONAL VARIABLES. THE ORDER OF A PARTIAL CORRELATION INDICATES HOW MANY VARIABLES ARE BEING CONTROLLED.

RESEARCH GOALS
TYPICAL GOALS FOR A SURVEY (SEE SAMPLE SURVEYS ) INCLUDE THE FOLLOWING: