Network


Latest external collaboration on country level. Dive into details by clicking on the dots.

Hotspot


Dive into the research topics where Graham M. Smith is active.

Publication


Featured researches published by Graham M. Smith.


Archive | 2009

Zero-Truncated and Zero-Inflated Models for Count Data

Alain F. Zuur; Elena N. Ieno; Neil J. Walker; Anatoly A. Saveliev; Graham M. Smith

In this chapter, we discuss models for zero-truncated and zero-inflated count data. Zero truncated means the response variable cannot have a value of 0. A typical example from the medical literature is the duration patients are in hospital. For ecological data, think of response variables like the time a whale is at the surface before re-submerging, counts of fin rays on fish (e.g. used for stock identification), dolphin group size, age of an animal in years or months, or the number of days that carcasses of road-killed animals (amphibians, owls, birds, snakes, carnivores, small mammals, etc.) remain on the road. These are all examples for which the response variable cannot take a value of 0.


Archive | 2009

Mixed Effects Modelling for Nested Data

Alain F. Zuur; Elena N. Ieno; Neil J. Walker; Anatoly A. Saveliev; Graham M. Smith

In this chapter, we continue with Gaussian linear and additive mixed modelling methods and discuss their application on nested data. Nested data is also referred to as hierarchical data or multilevel data in other scientific fields (Snijders and Boskers, 1999; Raudenbush and Bryk, 2002).


Archive | 2009

Limitations of Linear Regression Applied on Ecological Data

Alain F. Zuur; Elena N. Ieno; Neil J. Walker; Anatoly A. Saveliev; Graham M. Smith

This chapter revises the basic concepts of linear regression, shows how to apply linear regression in R, discusses model validation, and outlines the limitations of linear regression when applied to ecological data. Later chapters present methods to overcome some of these limitations; but as always before doing any complicated statistical analyses, we begin with a detailed data exploration. The key concepts to consider at this stage are outliers, collinearity, and the type of relationships between the variables. Failure to apply this initial data exploration may result in an inappropriate analysis forcing you to reanalyse your data and rewrite your paper, thesis, or report.


Archive | 2009

GLM and GAM for Count Data

Alain F. Zuur; Elena N. Ieno; Neil J. Walker; Anatoly A. Saveliev; Graham M. Smith

A generalised linear model (GLM) or a generalised additive model (GAM) consists of three steps: (i) the distribution of the response variable, (ii) the specification of the systematic component in terms of explanatory variables, and (iii) the link between the mean of the response variable and the systematic part. In Chapter 8, we discussed several different distributions for the response variable: Normal, Poisson, negative binomial, geometric, gamma, Bernoulli, and binomial distributions. One of these distributions can be used for the first step mentioned above. In fact, later in Chapter 11, we see how you can also use a mixture of two distributions for the response variable; but in this chapter, we only work with one distribution at a time.


Archive | 2009

GLMM and GAMM

Alain F. Zuur; Elena N. Ieno; Neil J. Walker; Anatoly A. Saveliev; Graham M. Smith

In Chapters 2 and 3, we reviewed linear regression and additive modelling techniques. In Chapters 4–7, we showed how to extend these methods to allow for heterogeneity, nested data, and temporal or spatial correlation structures. The resulting methods were called linear mixed modelling and additive mixed modelling (see the left hand pathway of Fig. 13.1). In Chapter 9, we introduced generalised linear modelling (GLM) and generalised additive modelling (GAM), and applied them to absence–presence data, proportional data, and count data. We used the Bernoulli and binomial distributions for 0–1 data (the 0 stands for absence and the 1 for presence), and proportional data (Y successes out of n independent trials), and we used the Poisson distribution for count data. However, one of the underlying assumptions of theses approaches (GLM and GAM) is that the data are independent, which is not always the case. In this chapter, we take this into account and extend the GLM and GAM models to allow for correlation between the observations, and nested data structures. It should come as no surprise that these methods are called generalised linear mixed modelling (GLMM) and generalised additive mixed modelling (GAMM); see the right hand pathway of Fig. 13.1.


Archive | 2009

Dealing with Heterogeneity

Alain F. Zuur; Elena N. Ieno; Neil J. Walker; Anatoly A. Saveliev; Graham M. Smith

This chapter, and the following three chapters, discuss solutions to the problems introduced in Chapters 2 and 3: heterogeneity, nested data, temporal correlation, and spatial correlation. We use both the linear regression model and the additive model as starting points. Figure 4.1 shows an overview of the methods we discuss in Chapters 4, 5, 6, and 7. In all these chapters, the model consists of a fixed term and a random term. The fixed term describes the response variable Y as a function of the explanatory variables via α + β 1 × X 1 + … + β q × X q in linear regression or α + f 1(X 1)+…+ f q (X q ) in additive modelling.


Archive | 2009

Things are not Always Linear; Additive Modelling

Alain F. Zuur; Elena N. Ieno; Neil J. Walker; Anatoly A. Saveliev; Graham M. Smith

In the previous chapter, we looked at linear regression, and although the word linear implies modelling only linear relationships, this is not necessarily the case. A model of the form Y i = α + β 1 × X i + β 2 × X i 2 + ɛ i is a linear regression model, but the relationship between Y i and X i is modelled using a second-order polynomial function. The same holds if an interaction term is used. For example, in Chapter 2, we modelled the biomass of wedge clams as a function of length, month and the interaction between length and month. But a scatterplot between biomass and length may not necessarily show a linear pattern.


Archive | 2007

Time series analysis of Hawaiian waterbirds

J. M. Reed; Chris S. Elphick; Alain F. Zuur; Elena N. Ieno; Graham M. Smith

Surveys to monitor changes in population size over time are of interest for a variety of research questions and management goals. For example, population biologists require survey data collected over time to test hypotheses concerning the patterns and mechanisms of population regulation or to evaluate the effects on population size of interactions caused by competition and predation. Resource managers use changes in population size to (i) evaluate the effectiveness of management actions that are designed to increase or decrease numbers, (ii) monitor changes in indicator species, and (iii) quantify the effects of environmental change. Monitoring population size over time is particularly important to species conservation, where population decline is one key to identifying species that are at risk of extinction.


Archive | 2009

GLM and GAM for Absence–Presence and Proportional Data

Alain F. Zuur; Elena N. Ieno; Neil J. Walker; Anatoly A. Saveliev; Graham M. Smith

In the previous chapter, count data with no upper limit were analysed using Poisson generalised linear modelling (GLM) and negative binomial GLM. In Section 10.2 of this chapter, we discuss GLMs for 0−1 data, also called absence–presence or binary data, and in Section 10.3 GLM for proportional data are presented. In the final section, generalised additive modelling (GAM) for these types of data is introduced. A GLM for 0−1 data, or proportional data, is also called logistic regression.


Archive | 2007

Investigating the effects of rice farming on aquatic birds with mixed modelling

Chris S. Elphick; Alain F. Zuur; Elena N. Ieno; Graham M. Smith

Ecologists are frequently interested in describing differences among the ecological communities that occur in habitats with different characteristics. In an ideal world, experimental methods would standardise situations such that each habitat variable could be altered separately in order to investigate their individual effects. This approach works well in simple ecosystems that can be replicated at small spatial scales. Unfortunately, the world is not always simple and many situations cannot be experimentally manipulated. Investigating specific applied questions, in particular, often can be done only at the spatial scales at which the applied phenomena occur and within the logistical constraints imposed by the system under study. In such cases, one is often left with the choice between collecting “messy” data that are difficult to analyse or avoiding the research questions entirely. In this chapter, we investigate just such a case, in which applied ecological questions were of interest, but experimental influence over the system was not possible.

Collaboration


Dive into the Graham M. Smith's collaboration.

Top Co-Authors

Avatar
Top Co-Authors

Avatar

Neil J. Walker

Central Science Laboratory

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

A. Ward

Central Science Laboratory

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar
Top Co-Authors

Avatar

I.G. Priede

University of Aberdeen

View shared research outputs
Top Co-Authors

Avatar
Top Co-Authors

Avatar

Mark Everard

University of the West of England

View shared research outputs
Researchain Logo
Decentralizing Knowledge