Modelling dichotomous outcomes
In the social sciences we are often confronted with phenomena and concepts measured not on a numeric scale but on a categorical scale. Linear regression analysis has provided us with a general approach which can be generalised to categorical dependent variables, but this generalisation relies on a mathematical transformation of the dependent variable using the natural logarithm. Luckily, we have powerful software to take care of these transformations for us, and we can instead focus on understanding the sociological importance of estimating the probability of cases (people) with certain characteristics to belong to one outcome category as opposed to another. We will focus on the basic case where there are only two (dichotomous, binary) outcome categories, which can be modelled using logistic regression, another versatile and foundational statistical method that is probably the most commonly employed in sociological research.
Readings
Statistics:
- IMS: Chapter 9 (“Logistic regression”)
- Connelly, Roxanne, Vernon Gayle, and Paul S. Lambert. 2016. “Statistical Modelling of Key Variables in Social Survey Data Analysis”. Methodological Innovations9:205979911663800.
Application:
Delhey, Jan, and Kenneth Newton.
- “Who Trusts?: The Origins of Social Trust in Seven Societies.” European Societies 5(2): 93–137.
Download PDF: Delhey & Newton (2003) Who trusts
Advanced topics:
Mood, Carina.
- “Logistic Regression: Why We Cannot Do What We Think We Can Do, and What We Can Do About It.” European Sociological Review 26(1): 67–82.
Breen, Richard, Kristian Bernt Karlson, and Anders Holm.
- “Interpreting and Understanding Logits, Probits, and Other Nonlinear Probability Models.” Annual Review of Sociology 44(1): 39–54.