Response Variable — Definition, Formula & Examples
A response variable is the outcome you measure in a study to see whether it changed as a result of some treatment or condition. It "responds" to changes in the explanatory variable.
In a statistical study or designed experiment, the response variable (denoted ) is the dependent quantity whose variation the researcher seeks to explain or predict using one or more explanatory variables. Its observed values form the basis for inference about the effect of treatments or predictors.
How It Works
When you design an experiment, you deliberately change the explanatory variable (the factor you control) and then measure the response variable to see what happens. For example, if you test three different fertilizers on plant growth, the fertilizer type is the explanatory variable and the plant height after six weeks is the response variable. In observational studies you do not manipulate anything, but you still label the outcome of interest as the response variable. Correctly identifying the response variable is the first step in choosing an appropriate analysis — whether that is a two-sample -test, regression, or ANOVA.
Example
Problem: A researcher wants to know whether study time affects exam scores. She records the number of hours 5 students studied and their exam scores (out of 100). Identify the response variable, the explanatory variable, and describe what a scatter plot of the data would show.
Step 1: Ask: which variable is the outcome the researcher wants to explain? Exam score is the outcome she cares about, so exam score is the response variable.
Step 2: Ask: which variable might cause or predict changes in the response? Hours studied is the factor she thinks influences scores, so hours studied is the explanatory variable.
Step 3: On a scatter plot, the explanatory variable (hours studied) goes on the horizontal axis and the response variable (exam score) goes on the vertical axis. Each dot represents one student's data pair, such as (2, 65), (4, 78), (5, 82), (7, 90), (8, 95).
Answer: The response variable is exam score. It is plotted on the -axis because it is the outcome being measured.
Another Example
Problem: A pharmaceutical company tests a new drug at doses of 0 mg, 50 mg, and 100 mg on 60 patients and measures their blood-pressure reduction (in mmHg). Identify the response variable and explain why.
Step 1: The treatment being manipulated is the drug dose (0 mg, 50 mg, 100 mg). This is the explanatory variable.
Step 2: The measurement taken to judge effectiveness is blood-pressure reduction. This is the response variable because it is the outcome the company hopes the drug will change.
Answer: Blood-pressure reduction (mmHg) is the response variable because it is the outcome measured to evaluate the treatment's effect.
Visualization
Why It Matters
In AP Statistics, nearly every free-response question about experimental design asks you to identify the response variable. Getting it right determines whether you set up hypotheses, graphs, and regression equations correctly. Beyond the classroom, researchers in medicine, agriculture, and social science must clearly define their response variable before collecting data — otherwise their conclusions lack focus and validity.
Common Mistakes
Mistake: Swapping the response and explanatory variables when setting up a regression or scatter plot.
Correction: Remember: the response variable always goes on the -axis. Ask yourself, "Which variable am I trying to predict or explain?" That one is the response.
Mistake: Assuming the response variable must be numerical.
Correction: Response variables can be categorical — for instance, whether a patient recovers (yes/no). The type of response variable determines which test you use (e.g., chi-square instead of a -test).
