binary data

each observation is either 0 or 1 (failure or success)

binomial data

the number of successes out of a known number of trials, often expressed as a proportion between 0 and 1

what is the question in binary/binomial regression?

which variables influence the probability p of the outcome
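
As a sketch of how this question is usually written down, assuming the logit link (the common default for binomial models; the notes do not name a link) and hypothetical predictors x_1, ..., x_k:

```latex
\operatorname{logit}(p) = \log\frac{p}{1-p} = \beta_0 + \beta_1 x_1 + \dots + \beta_k x_k
```

Under this assumption, a predictor x_j influences p when its coefficient \beta_j differs from 0.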

what are the problems when using a linear regression for binomial data?

- linear regression can predict probabilities below 0 or above 1, which are impossible
- residuals are not normally distributed
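
A minimal sketch of the first problem, using made-up data (the values of xs and ys are assumptions for illustration only). It fits a straight line to a 0/1 outcome by ordinary least squares and evaluates it at the smallest and largest observed x:

```python
# Made-up illustration data: a binary outcome that switches from 0 to 1.
xs = [1, 2, 3, 4, 5, 6, 7, 8]
ys = [0, 0, 0, 0, 1, 1, 1, 1]

n = len(xs)
mean_x = sum(xs) / n
mean_y = sum(ys) / n

# Closed-form ordinary least squares for a single predictor.
sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
sxx = sum((x - mean_x) ** 2 for x in xs)
slope = sxy / sxx
intercept = mean_y - slope * mean_x


def predict(x):
    """Predicted 'probability' from the straight line."""
    return intercept + slope * x


# Even inside the observed range, the line leaves the interval [0, 1].
print(predict(1), predict(8))
```

The fitted line gives a negative value at x = 1 and a value above 1 at x = 8, neither of which can be a probability.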

what is the problem when using a poisson regression for binomial data?

impossible predictions: the predicted number of successes can exceed the number of trials
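
A rough sketch of why this happens, assuming the usual log link for the Poisson model and the logit link for the binomial model. The coefficients b0 and b1 and the number of trials are hypothetical, and the same linear predictor is reused in both functions only to compare the two links, not to mimic fitted models:

```python
import math

n_trials = 10        # hypothetical number of trials per observation
b0, b1 = 0.5, 0.4    # hypothetical coefficients on the link scale


def poisson_mean(x):
    """Expected count under a log link: unbounded above."""
    return math.exp(b0 + b1 * x)


def binomial_mean(x):
    """Expected successes under a logit link: never exceeds n_trials."""
    p = 1 / (1 + math.exp(-(b0 + b1 * x)))
    return n_trials * p


for x in (0, 3, 6):
    print(x, round(poisson_mean(x), 2), round(binomial_mean(x), 2))
```

At x = 6 the log-link mean is larger than the 10 available trials, while the logit-link mean stays below 10.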

count data definition

- theoretically no upper limit
- counts cannot be expressed as a proportion

binomial data definition

- aggregated version of many binary experiments
- there is an upper limit (the number of trials)
- success can be expressed as a proportion
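
The three points above can be sketched with a small simulation. The number of trials, the success probability and the seed are assumptions chosen for illustration:

```python
import random

random.seed(1)       # fixed seed so the sketch is repeatable

n_trials = 20        # hypothetical number of binary experiments
p_success = 0.3      # hypothetical success probability

# Each experiment is binary: 1 = success, 0 = failure.
binary_outcomes = [1 if random.random() < p_success else 0
                   for _ in range(n_trials)]

# Aggregating the binary experiments gives one binomial observation.
successes = sum(binary_outcomes)      # bounded above by n_trials
proportion = successes / n_trials     # always between 0 and 1

print(successes, n_trials, proportion)
```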

when can we use the diagnostic plots with a binomial model?

when we have aggregated data (successes out of several trials per observation), not individual 0/1 outcomes

what does overdispersion mean?

extra variability: the data vary more than the binomial model assumes
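
One common way to quantify this is the Pearson dispersion statistic (Pearson chi-square divided by the residual degrees of freedom); values well above 1 suggest overdispersion. The sketch below assumes made-up data and fitted probabilities from a hypothetical intercept-only model; the function name is illustrative:

```python
def pearson_dispersion(successes, trials, fitted_probs, n_params):
    """Pearson chi-square divided by the residual degrees of freedom."""
    chi2 = 0.0
    for y, n, p in zip(successes, trials, fitted_probs):
        expected = n * p              # binomial mean
        variance = n * p * (1 - p)    # binomial variance
        chi2 += (y - expected) ** 2 / variance
    residual_df = len(successes) - n_params
    return chi2 / residual_df


# Made-up data: 6 observations with 10 trials each.
successes = [2, 9, 1, 10, 0, 8]
trials = [10] * 6
# Intercept-only fit: every observation gets the overall proportion 30/60.
fitted_probs = [0.5] * 6

dispersion = pearson_dispersion(successes, trials, fitted_probs, n_params=1)
print(dispersion)
```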

how is the variance determined in the binomial regression?

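the variance is determined by the mean, not estimated separately

Var(Y) = π(1 - π) for a single binary outcome, and Var(Y) = nπ(1 - π) for the number of successes in n trials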
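
A small sketch of this relationship. For the number of successes in n trials the variance is nπ(1 - π), and n = 1 gives the single binary outcome case; the function name is illustrative:

```python
def binomial_variance(n_trials, pi):
    """Variance of the number of successes in n_trials with probability pi."""
    return n_trials * pi * (1 - pi)


# Once pi (and therefore the mean) is known, the variance is fixed.
for pi in (0.1, 0.5, 0.9):
    print(pi, binomial_variance(1, pi), binomial_variance(10, pi))
```

The variance is largest at π = 0.5 and shrinks towards 0 as π approaches 0 or 1.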

what is the problem with overdispersion?

the standard errors are underestimated, so the p-values are too small
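
A rough sketch of the effect, assuming a normal approximation for the test and the usual correction of multiplying the standard error by the square root of the dispersion. The estimate, standard error and dispersion are hypothetical numbers:

```python
import math


def two_sided_p(estimate, std_error):
    """Two-sided p-value for estimate / std_error under a standard normal."""
    z = abs(estimate / std_error)
    return math.erfc(z / math.sqrt(2))


estimate = 0.8      # hypothetical coefficient
naive_se = 0.3      # hypothetical standard error ignoring overdispersion
dispersion = 4.0    # hypothetical dispersion estimate (1 = none)

# Inflate the standard error to account for the extra variability.
adjusted_se = naive_se * math.sqrt(dispersion)

naive_p = two_sided_p(estimate, naive_se)
adjusted_p = two_sided_p(estimate, adjusted_se)
print(naive_p, adjusted_p)
```

With these numbers the naive p-value falls below 0.05 while the adjusted one does not, so ignoring overdispersion would overstate the evidence.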

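how to account for an overdispersed binomial model in R?

switch to a quasibinomial model, i.e. fit with glm(..., family = quasibinomial), which estimates a dispersion parameter and scales the standard errors accordingly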

