Hi!
I just read through Regression Models with Ordered Categorical Outcomes — PyMC example gallery and am thinking about applying such a model to my data (currently using logistic regression because the response can be interpreted in a binary way).
Is my understanding correct that, by default, a logistic distribution is assumed for the latent continuous response variable Z? In my data there are four categories, and categories 1 and 4 have more than twice as much data as categories 2 and 3. I’m wondering how best to deal with this situation, and whether a logit or probit link is appropriate here.
Thanks!
The “amount of data” shouldn’t matter?
The shape of the latent distributions has pretty subtle/minor effects, it’s usually just chosen for computational convenience.
I like this resource: Ordinal Regression
Thanks both! Hopefully these will clear up any misconceptions I seem to have.
I think this section from Ordinal Regression explains it:
This construction holds for any probability distribution over X. Differences in the shape of the distribution can be compensated by reconfiguring the interior cut points to achieve the desired ordinal probabilities. Consequently we have the luxury of selecting a probability distribution based on computational convenience, in particular the expense of the cumulative distribution function and the ultimate cost of computing the interval probabilities.
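The compensation described in the quote is easy to demonstrate numerically. In this sketch (the category proportions are made up to mirror the question's "categories 1 and 4 are largest" pattern), the same target ordinal probabilities are recovered under either a logistic or a normal latent distribution simply by choosing different cutpoints via each distribution's quantile function:

```python
import numpy as np
from scipy.stats import logistic, norm

target = np.array([0.35, 0.15, 0.15, 0.35])   # desired P(y = k), k = 1..4
cum = np.cumsum(target)[:-1]                  # interior cumulative probabilities

cuts_logistic = logistic.ppf(cum)             # cutpoints under a logistic latent
cuts_normal = norm.ppf(cum)                   # cutpoints under a normal latent

# Both sets of cutpoints reproduce the same ordinal probabilities
# as differences of the corresponding cdf at the cutpoints:
p_logistic = np.diff(np.concatenate(([0.0], logistic.cdf(cuts_logistic), [1.0])))
p_normal = np.diff(np.concatenate(([0.0], norm.cdf(cuts_normal), [1.0])))
```

The cutpoints themselves differ between the two latent distributions, but the implied category probabilities are identical, which is exactly why the choice of distribution is mostly a matter of computational convenience.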
Thanks again!
I guess the shape matters a bit in relation to the prior on the cutpoints, but it would be weird to pick an exotic latent distribution to match a specific cutpoint prior rather than the other way around.
The continuous response variable is typically marginalized out rather than realized explicitly. The ordinal logit distribution then takes on the character of differences of logistic cdfs based on the cutpoints. The reason this is done for Hamiltonian Monte Carlo is that the result remains continuously differentiable. If you include the latent variable, then you get discontinuities in derivatives that are not good for HMC sampling (the feedback essentially gets “cut”).
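Concretely, the marginalized category probabilities are differences of the logistic cdf evaluated at cutpoints shifted by the linear predictor. A minimal sketch (the cutpoints and linear predictor `eta` here are arbitrary illustrative values):

```python
import numpy as np
from scipy.special import expit   # logistic cdf (inverse logit)

def ordered_logit_probs(eta, cutpoints):
    """P(y = k) for an ordered-logit model with the latent variable
    marginalized out: each probability is a difference of logistic cdfs."""
    # Pad with -inf and +inf so every category is a cdf difference.
    c = np.concatenate(([-np.inf], cutpoints, [np.inf]))
    cdf = expit(c - eta)          # logistic cdf at each (shifted) cutpoint
    return np.diff(cdf)           # one probability per category

probs = ordered_logit_probs(eta=0.5, cutpoints=np.array([-1.0, 0.0, 1.0]))
```

Because `expit` is smooth in both `eta` and the cutpoints, the resulting log-likelihood is continuously differentiable, which is what HMC needs.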
You can replace the logistic cdf with another cdf to get a different latent-distribution assumption. The reason logit is popular is that its cdf has a simple closed form: the inverse logit function 1 / (1 + \exp(-x)). If you move to ordinal probit, this is replaced with \Phi(x), the standard normal cdf, which is more expensive to compute because it has no closed form.
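To make the contrast concrete, here is a sketch of the two link functions. The inverse logit is a one-line elementary expression, while the standard normal cdf is typically written in terms of the error function `erf`, which itself has no elementary closed form and must be evaluated numerically:

```python
import numpy as np
from scipy.special import erf

def inv_logit(x):
    # Closed-form logistic cdf: one elementary expression.
    return 1.0 / (1.0 + np.exp(-x))

def phi(x):
    # Standard normal cdf via erf; erf has no elementary closed form
    # and is computed by numerical approximation under the hood.
    return 0.5 * (1.0 + erf(x / np.sqrt(2.0)))

x = np.linspace(-3.0, 3.0, 13)
logit_vals = inv_logit(x)   # ordinal logit link
probit_vals = phi(x)        # ordinal probit link
```

Both are smooth sigmoids through (0, 0.5), so swapping one for the other changes the model only through the tails and the cost of evaluation.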