Count and two-part regression tables

library(spicy)
library(pscl)     # zeroinfl(), hurdle()

This vignette covers count regression — models for a non-negative integer outcome such as number of publications, doctor visits, or species counts. The companion vignette Publication-ready regression tables covers the shared mechanics (vcov, ci_level, output formats, multi-model layouts, broom integration); here we focus on what is specific to counts: rate ratios, overdispersion, and above all the two-part models whose zero component table_regression() renders as its own labelled block.

table_regression() supports the full count-model escalation:

stats::glm(family = poisson()) — the Poisson baseline;
MASS::glm.nb() — negative binomial, for overdispersion;
pscl::zeroinfl() and pscl::hurdle() — the two-part models (Zeileis, Kleiber & Jackman 2008);
glmmTMB::glmmTMB() — all of the above with random effects.

The running example is pscl::bioChemists (Long 1990): the number of articles published by 915 biochemistry PhD students in the last three years of their doctorate, with gender, marital status, children under six, and the mentor’s article count as predictors. Thirty percent of the students published nothing:

data("bioChemists", package = "pscl")
mean(bioChemists$art == 0)
#> [1] 0.3005464

Count models in one paragraph

A Poisson regression models the rate of events through a log link, so exponentiate = TRUE turns each coefficient into an incidence rate ratio (IRR) — a multiplicative effect on the expected count. Real count data usually violate the Poisson assumption in one or both of two ways: the variance exceeds the mean (overdispersion — the negative binomial’s job), and there are more zeros than the count process predicts (excess zeros — the job of the zero-inflated and hurdle models, which add a second, binary submodel for the zeros). Each extension changes what the table must show, and the sections below follow that escalation.

Poisson baseline

fit_pois <- glm(art ~ fem + mar + kid5 + ment, data = bioChemists,
                family = poisson())
table_regression(fit_pois, exponentiate = TRUE)
#> Poisson regression: art
#> 
#>  Variable        │   IRR     SE      95% CI       p   
#> ─────────────────┼────────────────────────────────────
#>  (Intercept)     │    1.41  0.08  [1.26, 1.59]  <.001 
#>  fem:            │                                    
#>    Men (ref.)    │     –     –         –         –    
#>    Women         │    0.80  0.04  [0.72, 0.89]  <.001 
#>  mar:            │                                    
#>    Single (ref.) │     –     –         –         –    
#>    Married       │    1.16  0.07  [1.03, 1.31]   .013 
#>  kid5            │    0.83  0.03  [0.77, 0.90]  <.001 
#>  ment            │    1.03  0.00  [1.02, 1.03]  <.001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  n               │  915                               
#>  R² (McFadden)   │    0.05                            
#>  R² (Nagelkerke) │    0.19                            
#>  AIC             │ 3312.3                             
#> 
#> Note. Poisson regression.
#> Std. errors: classical (Fisher information).
#> IRR = incidence rate ratio.
#> Coefficients exponentiated and displayed as IRR; SE on the IRR scale (delta method); CI bounds exponentiated (asymmetric).

Each IRR multiplies the expected article count: women publish about 20% less than men (IRR 0.80), and each additional mentor publication raises a student’s rate by 3% (IRR 1.03). Inference is Wald-z on the link scale; the footer records the SE and CI conventions.

One remark for rate data: the observation window here is a fixed three years for everyone, so no exposure adjustment is needed. With unequal exposure — person-years, plot area, time at risk — add offset(log(exposure)) to the formula as usual; the offset is absorbed silently (no spurious coefficient row) and the IRRs read as rate ratios per unit of exposure.

Overdispersion: the negative binomial

The articles’ variance is far above their mean, and a Poisson model reacts by understating the standard errors. MASS::glm.nb() adds a dispersion parameter:

fit_nb <- MASS::glm.nb(art ~ fem + mar + kid5 + ment, data = bioChemists)
table_regression(fit_nb, exponentiate = TRUE)
#> Negative-binomial regression: art
#> 
#>  Variable        │   IRR     SE      95% CI       p   
#> ─────────────────┼────────────────────────────────────
#>  (Intercept)     │    1.35  0.11  [1.15, 1.59]  <.001 
#>  fem:            │                                    
#>    Men (ref.)    │     –     –         –         –    
#>    Women         │    0.81  0.06  [0.70, 0.93]   .003 
#>  mar:            │                                    
#>    Single (ref.) │     –     –         –         –    
#>    Married       │    1.16  0.09  [0.99, 1.36]   .072 
#>  kid5            │    0.84  0.04  [0.76, 0.93]  <.001 
#>  ment            │    1.03  0.00  [1.02, 1.04]  <.001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  n               │  915                               
#>  R² (McFadden)   │    0.03                            
#>  R² (Nagelkerke) │    0.11                            
#>  AIC             │ 3134.1                             
#> 
#> Note. Negative-binomial regression.
#> Std. errors: Model-based (asymptotic).
#> IRR = incidence rate ratio.
#> Coefficients exponentiated and displayed as IRR; SE on the IRR scale (delta method); CI bounds exponentiated (asymmetric).

The IRRs barely move, but the inference does: marriage drops from p = .013 under Poisson to p = .072 here — the Poisson significance was an artifact of ignored overdispersion. AIC falls from 3312 to 3134, and that comparison returns below.

Excess zeros: zero-inflation and hurdle

Two-part models tell two different stories about the zeros, and the difference matters for interpretation:

A zero-inflated model (zeroinfl(); Lambert 1992) says some zeros are structural: a latent group that would never publish regardless of the count process. Its zero component models the probability of being such a structural zero.
A hurdle model (hurdle(); Mullahy 1986) says all zeros come from one hurdle: first you publish at all or you do not, then a truncated count process decides how much. Its zero component models the probability of a nonzero count — the opposite direction.

table_regression() renders the count coefficients first and the zero component as a labelled block, with a footer line stating exactly what the block models. Under exponentiate = TRUE the exponentiation is per block: count coefficients become IRR, and the logit zero component becomes odds ratios:

fit_zip <- zeroinfl(art ~ fem + mar + kid5 + ment | ment,
                    data = bioChemists)
table_regression(fit_zip, exponentiate = TRUE)
#> Poisson zero-inflated regression: art
#> 
#>  Variable        │   IRR     SE      95% CI       p   
#> ─────────────────┼────────────────────────────────────
#>  (Intercept)     │    1.84  0.12  [1.61, 2.10]  <.001 
#>  fem:            │                                    
#>    Men (ref.)    │     –     –         –         –    
#>    Women         │    0.80  0.05  [0.72, 0.90]  <.001 
#>  mar:            │                                    
#>    Single (ref.) │     –     –         –         –    
#>    Married       │    1.14  0.08  [1.01, 1.30]   .041 
#>  kid5            │    0.85  0.04  [0.78, 0.93]  <.001 
#>  ment            │    1.02  0.00  [1.01, 1.02]  <.001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  Zero-inflation: │                                    
#>    (Intercept)   │    0.50  0.10  [0.34, 0.75]  <.001 
#>    ment          │    0.88  0.04  [0.81, 0.95]   .001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  n               │  915                               
#>  AIC             │ 3225.5                             
#> 
#> Note. Poisson zero-inflated regression.
#> Std. errors: Wald asymptotic (z).
#> Zero-inflation component: log-odds of a structural (excess) zero. Coefficients exponentiated and displayed as odds ratios.
#> IRR = incidence rate ratio.
#> Coefficients exponentiated and displayed as IRR; SE on the IRR scale (delta method); CI bounds exponentiated (asymmetric).

fit_hur <- hurdle(art ~ fem + mar + kid5 + ment | ment,
                  data = bioChemists)
table_regression(fit_hur, exponentiate = TRUE)
#> Poisson hurdle regression: art
#> 
#>  Variable        │   IRR     SE      95% CI       p   
#> ─────────────────┼────────────────────────────────────
#>  (Intercept)     │    1.88  0.13  [1.63, 2.16]  <.001 
#>  fem:            │                                    
#>    Men (ref.)    │     –     –         –         –    
#>    Women         │    0.80  0.05  [0.70, 0.90]  <.001 
#>  mar:            │                                    
#>    Single (ref.) │     –     –         –         –    
#>    Married       │    1.10  0.08  [0.96, 1.27]   .169 
#>  kid5            │    0.87  0.04  [0.79, 0.95]   .003 
#>  ment            │    1.02  0.00  [1.01, 1.02]  <.001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  Zero hurdle:    │                                    
#>    (Intercept)   │    1.28  0.14  [1.04, 1.58]   .021 
#>    ment          │    1.08  0.01  [1.06, 1.11]  <.001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  n               │  915                               
#>  AIC             │ 3233.8                             
#> 
#> Note. Poisson hurdle regression.
#> Std. errors: Wald asymptotic (z).
#> Zero hurdle component: log-odds of a nonzero count. Coefficients exponentiated and displayed as odds ratios.
#> IRR = incidence rate ratio.
#> Coefficients exponentiated and displayed as IRR; SE on the IRR scale (delta method); CI bounds exponentiated (asymmetric).

Compare the ment row in the two zero blocks — the same predictor, the same substantive finding, and opposite-looking odds ratios:

Zero-inflation block: OR 0.88 — each mentor publication lowers the odds of being a structural zero.
Zero hurdle block: OR 1.08 — each mentor publication raises the odds of publishing at all.

Both say productive mentors get students over the zero. Reading either number without its block label invites a sign error; the block labels and their footer lines exist to prevent it. As for choosing between the two: fit rarely decides — here the AICs are nearly tied (3225.5 vs 3233.8) — so let the substantive story choose. Zero-inflation fits when some units plausibly can never experience the event (a latent never-publisher class); a hurdle fits when everyone faces the same all-or-nothing first stage. Zero-component coefficients are substantive hypotheses: they take significance stars and join the p_adjust family like any other coefficient. To display only the count part, set show_components = FALSE (the model is still estimated in full).

Which count model? Side by side

The escalation is a model-comparison question, and the table lays it out. Per Long (1997) and Cameron & Trivedi (2013), much of what looks like zero-inflation is overdispersion — the AIC row makes the point. (The once-standard Vuong test is not used here: it is no longer recommended for testing zero-inflation — Wilson 2015 — so information criteria and substantive reasoning carry the comparison.)

fit_zinb <- zeroinfl(art ~ fem + mar + kid5 + ment | ment,
                     data = bioChemists, dist = "negbin")
table_regression(
  list(Poisson = fit_pois, "Neg. binomial" = fit_nb, ZINB = fit_zinb),
  show_columns = c("b", "p"), exponentiate = TRUE
)
#> Regression comparison: art
#> 
#>                       Poisson      Neg. binomial        ZINB      
#>                    ──────────────  ──────────────  ────────────── 
#>  Variable        │   IRR      p      IRR      p      IRR      p   
#> ─────────────────┼────────────────────────────────────────────────
#>  (Intercept)     │    1.41  <.001     1.35  <.001     1.51  <.001 
#>  fem:            │                                                
#>    Men (ref.)    │     –     –         –     –         –     –    
#>    Women         │    0.80  <.001     0.81   .003     0.81   .003 
#>  mar:            │                                                
#>    Single (ref.) │     –     –         –     –         –     –    
#>    Married       │    1.16   .013     1.16   .072     1.15   .085 
#>  kid5            │    0.83  <.001     0.84  <.001     0.85   .001 
#>  ment            │    1.03  <.001     1.03  <.001     1.02  <.001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  Zero-inflation: │                                                
#>    (Intercept)   │                                    0.45   .022 
#>    ment          │                                    0.54   .013 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  n               │  915             915             915           
#>  R² (McFadden)   │    0.05            0.03                        
#>  R² (Nagelkerke) │    0.19            0.11                        
#>  AIC             │ 3312.3          3134.1          3122.5         
#> 
#> Note. Model 1: Poisson regression; Model 2: negative-binomial regression; Model 3: negative-binomial zero-inflated regression.
#> Std. errors:
#>   Model 1: classical (Fisher information)
#>   Model 2: Model-based (asymptotic)
#>   Model 3: Wald asymptotic (z)
#> Zero-inflation component: log-odds of a structural (excess) zero. Coefficients exponentiated and displayed as odds ratios.
#> IRR = incidence rate ratio.
#> Coefficients exponentiated and displayed as IRR; CI bounds exponentiated.

The negative binomial alone recovers most of the fit (AIC 3312 → 3134); adding the inflation part on top buys a further, modest improvement (3122). The zero block simply stays empty in the columns of models that have none. Substantively the IRRs agree across all three — what changes is the inference and the account of the zeros.

One number per predictor: the combined AME

Rate ratios live inside one component. For a single response-scale summary that spans both parts, request average marginal effects: the AME of a two-part model is the effect on the overall expected count E(Y), combining the count and zero processes (computed by marginaleffects::avg_slopes() on the full model):

table_regression(fit_zip, show_columns = c("b", "ame", "p"))
#> Poisson zero-inflated regression: art
#> 
#>  Variable        │    B      AME     p   
#> ─────────────────┼───────────────────────
#>  (Intercept)     │    0.61         <.001 
#>  fem:            │                       
#>    Men (ref.)    │     –            –    
#>    Women         │   -0.22  -0.36  <.001 
#>  mar:            │                       
#>    Single (ref.) │     –            –    
#>    Married       │    0.13   0.22   .041 
#>  kid5            │   -0.16  -0.28  <.001 
#>  ment            │    0.02   0.06  <.001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  Zero-inflation: │                       
#>    (Intercept)   │   -0.69         <.001 
#>    ment          │   -0.13          .001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  n               │  915                  
#>  AIC             │ 3225.5                
#> 
#> Note. Poisson zero-inflated regression.
#> Std. errors: Wald asymptotic (z).
#> Zero-inflation component: log-odds of a structural (excess) zero.
#> AME = average marginal effect.

Being a woman costs about a third of an article (AME −0.36) once both channels are combined; the zero-component rows carry no AME of their own because the combined effect already includes them.

Cluster-robust variance

Students cluster in labs, journals, cohorts. The CR* family covers both components with one estimator — sandwich::vcovCL() on the full score matrix, so the count and zero rows shift together. For two-part models every CR* variant maps to that same estimator, and the footer records it as CL:

set.seed(1)
bioChemists$lab <- factor(sample(1:60, nrow(bioChemists), replace = TRUE))
table_regression(fit_zip, vcov = "CR0", cluster = bioChemists$lab,
                 show_columns = c("b", "se", "p"))
#> Poisson zero-inflated regression: art
#> 
#>  Variable        │    B      SE     p   
#> ─────────────────┼──────────────────────
#>  (Intercept)     │    0.61  0.09  <.001 
#>  fem:            │                      
#>    Men (ref.)    │     –     –     –    
#>    Women         │   -0.22  0.07  <.001 
#>  mar:            │                      
#>    Single (ref.) │     –     –     –    
#>    Married       │    0.13  0.09   .147 
#>  kid5            │   -0.16  0.07   .013 
#>  ment            │    0.02  0.00  <.001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  Zero-inflation: │                      
#>    (Intercept)   │   -0.69  0.26   .010 
#>    ment          │   -0.13  0.05   .017 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  n               │  915                 
#>  AIC             │ 3225.5               
#> 
#> Note. Poisson zero-inflated regression.
#> Std. errors: cluster-robust (CL), clusters by lab.
#> Zero-inflation component: log-odds of a structural (excess) zero.

With clustering acknowledged (here a synthetic 60-lab assignment for illustration), marriage loses its significance (p = .041 → .147) — the usual fate of borderline effects under a proper dependence structure. HC* and the resampling estimators have no two-part backend and are refused with a clear error.

Mixed counts: `glmmTMB`

When counts are also grouped, glmmTMB (Brooks et al. 2017) combines everything above with random effects: the zero-inflation block, the random-effects block, and per-block exponentiation appear in one table — here salamander counts in mined and unmined streams (Price et al. 2016):

data("Salamanders", package = "glmmTMB")
fit_mix <- glmmTMB::glmmTMB(
  count ~ mined + (1 | site),
  ziformula = ~ mined,
  family = glmmTMB::nbinom2, data = Salamanders
)
table_regression(fit_mix, exponentiate = TRUE)
#> Negative-binomial mixed-effects regression (glmmTMB) (zero-inflated): count
#> 
#>  Variable             │   IRR     SE      95% CI       p   
#> ──────────────────────┼────────────────────────────────────
#>  (Intercept)          │    0.57  0.21  [0.28, 1.19]   .133 
#>  mined:               │                                    
#>    yes (ref.)         │     –     –         –         –    
#>    no                 │    4.34  1.57  [2.14, 8.81]  <.001 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  Zero-inflation:      │                                    
#>    (Intercept)        │    1.27  0.62  [0.49, 3.30]   .621 
#>    mined: yes (ref.)  │     –     –         –         –    
#>    mined: no          │    0.10  0.08  [0.02, 0.49]   .004 
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  Random effects:      │                                    
#>    σ site (Intercept) │    0.37  0.17  [0.16, 0.81]   –    
#> ╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌┼╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
#>  n                    │  644                               
#>  N (site)             │   23                               
#>  R² (marginal)        │    0.44                            
#>  R² (conditional)     │    0.56                            
#>  AIC                  │ 1741.4                             
#>  BIC                  │ 1768.2                             
#> 
#> Note. Negative-binomial mixed-effects regression (glmmTMB) (zero-inflated).
#> Std. errors: Wald asymptotic (z).
#> p-values: Wald-z asymptotic (glmmTMB).
#> Random effects (ML).
#> Zero-inflation component: log-odds of a structural (excess) zero. Coefficients exponentiated and displayed as odds ratios.
#> IRR = incidence rate ratio.
#> Coefficients exponentiated and displayed as IRR; SE on the IRR scale (delta method); CI bounds exponentiated (asymmetric).

Unmined streams host over four times the salamander rate (IRR 4.34) and have far lower odds of a structural zero (OR 0.10) — the two channels tell one coherent ecological story. The random-effects block and its conventions are covered in the companion vignette Mixed-effects regression tables.

Output formats

Everything above used the default console output. The same structure — zero block included, as the last rows below — carries to a raw data frame, a long broom-style tibble, and the rich gt, flextable, tinytable, Excel, or Word targets:

table_regression(fit_zip, output = "data.frame")
#>           Variable       B   SE         95% CI     p
#> 1      (Intercept)    0.61 0.07 [ 0.48,  0.74] <.001
#> 2             fem:                                  
#> 3       Men (ref.)     –    –         –         –   
#> 4            Women   -0.22 0.06 [-0.33, -0.10] <.001
#> 5             mar:                                  
#> 6    Single (ref.)     –    –         –         –   
#> 7          Married    0.13 0.07 [ 0.01,  0.26]  .041
#> 8             kid5   -0.16 0.04 [-0.25, -0.08] <.001
#> 9             ment    0.02 0.00 [ 0.01,  0.02] <.001
#> 10 Zero-inflation:                                  
#> 11     (Intercept)   -0.69 0.21 [-1.09, -0.28] <.001
#> 12            ment   -0.13 0.04 [-0.21, -0.05]  .001
#> 13               n  915                             
#> 14             AIC 3225.5

table_regression(fit_zip, exponentiate = TRUE, output = "gt")

In broom::tidy() the zero-component terms carry a zero_ prefix, so the two parts separate cleanly:

td <- broom::tidy(table_regression(fit_zip))
td[, c("term", "estimate_type", "estimate", "p.value")]
#> # A tibble: 7 × 4
#>   term             estimate_type estimate  p.value
#>   <chr>            <chr>            <dbl>    <dbl>
#> 1 (Intercept)      B               0.609  2.37e-19
#> 2 femWomen         B              -0.218  2.05e- 4
#> 3 marMarried       B               0.135  4.07e- 2
#> 4 kid5             B              -0.163  1.75e- 4
#> 5 ment             B               0.0182 1.91e-16
#> 6 zero_(Intercept) B              -0.686  8.47e- 4
#> 7 zero_ment        B              -0.130  1.22e- 3

References

Brooks, M. E., Kristensen, K., van Benthem, K. J., Magnusson, A., Berg, C. W., Nielsen, A., Skaug, H. J., Mächler, M., & Bolker, B. M. (2017). glmmTMB balances speed and flexibility among packages for zero-inflated generalized linear mixed modeling. The R Journal, 9(2), 378–400.
Cameron, A. C., & Trivedi, P. K. (2013). Regression Analysis of Count Data (2nd ed.). Cambridge University Press.
Lambert, D. (1992). Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics, 34(1), 1–14.
Long, J. S. (1990). The origins of sex differences in science. Social Forces, 68(4), 1297–1316.
Long, J. S. (1997). Regression Models for Categorical and Limited Dependent Variables. Sage.
Mullahy, J. (1986). Specification and testing of some modified count data models. Journal of Econometrics, 33(3), 341–365.
Price, S. J., Muncy, B. L., Bonner, S. J., Drayer, A. N., & Barton, C. D. (2016). Effects of mountaintop removal mining and valley filling on the occupancy and abundance of stream salamanders. Journal of Applied Ecology, 53(2), 459–468.
Wilson, P. (2015). The misuse of the Vuong test for non-nested models to test for zero-inflation. Economics Letters, 127, 51–53.
Zeileis, A., Kleiber, C., & Jackman, S. (2008). Regression models for count data in R. Journal of Statistical Software, 27(8).