Multiple sites

This page shows how to move from a univariate to a multivariate data format to perform a multiple site analysis, considering each site as a different trait. In this example, the canterbury.csv data set contains only three variables: Site, Id (they correspond to genotypes) and yield. Site has three levels (Avonhead, Ilam and Yaldhurst) indicating the sites and yield is a measure of productivity.

# Loading asreml-r package
library(asreml)

# Reading csv file. You could also use asreml.read.table
d <- read.csv('~/examples/canterbury.csv', header = TRUE)

# Creating a yield variable for each site, with NA for the
# other two sites (NA = missing value in R)
d$y1 <- ifelse(d$Site == 'Avonhead', d$yield, NA)
d$y2 <- ifelse(d$Site == 'Ilam', d$yield, NA)
d$y3 <- ifelse(d$Site == 'Yaldhurst', d$yield, NA)

We can now run the equivalent to three univariate analyses, by binding the three columns (using cbind) that contain each of the different traits: y1, y2 and y3. The direct sum (dsum) specified in the residual part is representing a structure with no covariances between the traits for Id and between error terms (units), but a different variance component for each level of trait.

## Very simple multivariate model
m1 <- asreml(cbind(y1,y2,y3) ~ trait,
             random = ~ diag(trait):Id,
             residual = ~ dsum( ~ units | trait),
             data = d)

summary(m1)$varcomp

We can confirm that the runs are equivalent to univariate analyses by fitting the univariate models. For example:

# It should be equivalent to site specific univariate runs
# for example, for site 1
s1 <- asreml(yield ~ 1, random = ~ Id, data = d, 
             subset = Site == 'Yaldhurst')
summary(s1)$varcomp

We can now complicate the multivariate analyses to allow for correlation between sites, which is really what we are after.

# First I make up some starting values, assuming a genetic correlation
# of 0.7 between sites and variances from model 1 (m1)
# corgh fits a correlation structure with heterogeneous variances
start.values <- c(0.7, 0.7, 0.7, 3.5, 0.3, 2.4)
m2 <- asreml(cbind(y1,y2,y3) ~ trait,
             random = ~ corgh(trait, init = start.values):Id,
             residual = ~ dsum( ~ units | trait),
             data = d)

summary(m2)$varcomp

Updated on September 22, 2021

Was this article helpful?

Yes No