A friendly reminder that when all the variables are measured, causal claims aren't supportable. Photo credit: Sleeping: MicrostockAsia/Deposit Photos
CNN's headline was "sleep this way to add 5 years to your life." This headline should immediately kick in your "correlation is not causation" spidey-sense.
a) What makes this headline causal?
b) What are the two variables in the headline?
c) Why might you suspect this headline is based on a correlational study (rather than an experiment)?
Now that these preliminaries are out of the way, let's find out if the study behind the headline can support causation.
Here's the splashy intro:
Want to live longer? Then prioritize sleep in your life: Following five good sleep habits added nearly five years to a man's life expectancy and almost 2.5 years to a woman's life, a new study found.
The article then provides advice such as getting at least 7 hours of sleep, and so on. The journalist links that advice to the following research summary:
"Recent studies have shown irregularity in sleep timing and duration have been linked to metabolic abnormalities ...," he said. "Encouraging maintenance of regular sleep schedules with consistent sleep durations may be an important part of lifestyle recommendations for the prevention of heart disease."
d) In the example above, what are the two variables in the study? What makes the study correlational?
e) Apply the three causal criteria to the example in d). The study can support covariance (why?). What about temporal precedence? What about internal validity? What specific third variables might be responsible for the link between sleep timing and metabolic abnormalities?
Later in the piece, CNN shares data from another study, this one using multiple regression:
The preliminary study, presented Thursday at an annual meeting of the American College of Cardiology, analyzed data from over 172,000 people who answered sleep questionnaires between 2013 and 2018 as part of the National Health Interview Survey. The annual survey is done by the CDC and the National Center for Health Statistics.
Each of the five healthy sleep habits — falling asleep easily, staying asleep, getting seven to eight hours of zzz's, waking up rested and foregoing sleep meds — was assigned a number. People were scored on how many of the five habits they had.
About four years later, researchers compared those scores with National Death Index records to see if their sleep behaviors contributed to an early death from certain diseases or any cause.
The team then factored out other potential causes for a higher risk of dying, such as alcohol consumption, lower socioeconomic status and existing medical conditions.
f) Make a table of the variables in this study:
Variable name | How was this variable operationalized? | What are the variable's possible levels? | Is this manipulated or measured? |
| | | |
| | | |
(add rows as needed) | | | |
g) The study directly above meets both covariance and temporal precedence. Can you explain why?
h) Imagine that a critic says of the study directly above, "Hmm, it could be that sleep habits predicted early death because of alcohol consumption. Maybe people who drink more alcohol are more likely to have unhealthy sleep, and they are also more likely to die early." What do you say?
Finally, if you're looking for an example of a moderator, we've got you:
Men who followed all five of the healthy sleep habits had a life expectancy that was 4.7 years greater than people who had none or only one of the five elements of low-risk sleep, the study found.
The impact of healthy sleep habits was much lower for women: Those who followed all five sleep habits gained 2.4 years compared with those who did none or only one.
i) Put this into a moderator sentence: _____ moderates the relationship between _____ and _____ such that ____.
Thanks to Stephen Chew for sharing yet another great example from the "correlation is not causation" files.
Selected answers
a) The headline is causal because it takes the form of advice: if you do X, it will cause Y (here, if you sleep better, it will cause an increased life span).
b) Type of sleep and lifespan are the two variables.
c) You might suspect this headline is based on a correlational study (rather than an experiment) because lifespan is definitely a measured variable (you can't manipulate it), and sleep seems likely to be measured as well. Of course, it's possible to randomly assign volunteers to one of two sleep patterns, so you'll want to read the article closely to see!
d) One variable is irregularity in sleep timing and duration; the other is metabolic abnormalities.
e) The study can support covariance because the results showed a correlation between sleep irregularity and metabolic abnormalities. The study may not have temporal precedence: it's not clear whether the variables were measured at the same time or whether sleep irregularity was actually measured first. What about internal validity? It's possible that some pre-existing medical condition (such as depression or addiction) goes with both sleep irregularities and metabolic abnormalities. Other third variables might work, too. Make sure you explain how the variable you are thinking of links to both sleep irregularities and metabolic abnormalities.
f)
Variable name | How was this variable operationalized? | What are the variable's possible levels? | Is this manipulated or measured? |
Healthiness of sleep habits | Survey; summed five habits (falling asleep easily, staying asleep, etc.) | 0 to 5 | Measured |
Early death | National Death Index records | Alive or dead | Measured |
Alcohol consumption | Doesn't say how this was operationalized | Not clear (low to high) | Measured |
SES | Doesn't say how this was operationalized | Not clear (low to high) | Measured |
g) It meets covariance because the results show an association between healthiness of sleep habits and early death. It meets temporal precedence because the sleep habits were measured first, and then people were followed for four years, when early death was measured.
h) This argument doesn't work as a third variable critique because the researchers controlled for alcohol consumption in their multiple regression (The CNN article said, "The team then factored out other potential causes for a higher risk of dying, such as alcohol consumption, lower socioeconomic status and existing medical conditions").
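If it helps to see what "factoring out" a third variable does, here is a minimal sketch in Python. Everything in it is invented for illustration: the eight data points, the `risk` index, and the helper names `centered_sum`, `simple_slope`, and `two_predictor_slopes` are assumptions, not anything from the study or the CNN article. In this toy world, risk depends only on alcohol, and alcohol also lowers the sleep score, which is exactly the scenario the critic describes.

```python
# Toy illustration only: made-up numbers, NOT data from the sleep study.
# Assumed world: risk depends ONLY on alcohol, and alcohol also lowers the
# sleep score, so sleep and risk covary with no causal link between them.

def centered_sum(xs, ys):
    """Sum of cross-products of deviations from each list's mean."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys))

def simple_slope(x, y):
    """Slope from regressing y on a single predictor x."""
    return centered_sum(x, y) / centered_sum(x, x)

def two_predictor_slopes(x1, x2, y):
    """Slopes from regressing y on x1 and x2 together (least squares)."""
    s11, s22 = centered_sum(x1, x1), centered_sum(x2, x2)
    s12 = centered_sum(x1, x2)
    s1y, s2y = centered_sum(x1, y), centered_sum(x2, y)
    det = s11 * s22 - s12 ** 2
    b1 = (s1y * s22 - s2y * s12) / det
    b2 = (s2y * s11 - s1y * s12) / det
    return b1, b2

alcohol = [0, 0, 1, 1, 2, 2, 3, 3]      # hypothetical drinks per day
sleep = [3, 5, 2, 4, 1, 3, 0, 2]        # hypothetical 0-to-5 sleep score
risk = [10 + 3 * a for a in alcohol]    # hypothetical risk index, alcohol only

naive = simple_slope(sleep, risk)
b_sleep, b_alcohol = two_predictor_slopes(sleep, alcohol, risk)

print(f"sleep slope, alcohol ignored:    {naive:.2f}")
print(f"sleep slope, alcohol controlled: {b_sleep:.2f}")
print(f"alcohol slope:                   {b_alcohol:.2f}")
```

With alcohol ignored, better sleep appears to go with lower risk. Once alcohol is in the model, the sleep slope drops to zero, because alcohol was doing all the work. That is the pattern the critic's story predicts. The researchers reported that the sleep association remained after they factored out alcohol, so alcohol alone cannot be the whole explanation.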
i) Gender moderates the relationship between healthy sleep habits and life expectancy such that the relationship is present for both men and women, but is stronger for men.
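One way to think about a moderator is as a "difference between differences." The short Python sketch below uses only the two figures quoted in the article (4.7 years for men, 2.4 years for women). The variable names are my own, and the comparison is purely descriptive: it says nothing about whether the gap between men and women is statistically reliable.

```python
# Years of life expectancy gained by people with all five sleep habits,
# compared with people who had none or only one (figures quoted by CNN).
gain_years = {"men": 4.7, "women": 2.4}

# If gender did NOT moderate the relationship, the two gains would be about
# the same and this difference between differences would be close to zero.
gap = round(gain_years["men"] - gain_years["women"], 1)

present_for_both = all(gain > 0 for gain in gain_years.values())
stronger_group = max(gain_years, key=gain_years.get)

print(f"Relationship present for both groups: {present_for_both}")
print(f"Stronger for: {stronger_group} (by {gap} years)")
```

The relationship runs in the same direction in both groups, but its size depends on gender, which is what the moderator sentence expresses.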