Agricultural and Food Science, Vol. 17 (2008): 63-72 A G R I C U L T U R A L A N D F O O D S C I E N C E Vol. 17 (2008): 63-72. 63 © Agricultural and Food Science Manuscript received July 2006 Recommendable block sizes: a case study on Finnish official variety trials of barley cultivars Lauri Jauhiainen, Jukka Öfversten MTT Agrifood Research Finland, Services Unit, FI-31600 Jokioinen, Finland, e-mail:lauri.jauhiainen@mtt.fi Arjo Kangas MTT Agrifood Research Finland, Plant Production Research, FI-61400 Ylistaro, Finland Well-established results in the current statistical literature imply that plant breeders should use incomplete block designs wherever spatial variability exists and the number of treatments is large. But the theoretical position does not indicate the recommendable number of cultivars in an incomplete block. In this study we used data from 28 official variety trials conducted in Finland during the period 2001–2005 to study the effect of block size on the efficiency of testing pairwise yield differences of barley cultivars and cultivar rankings. In previous trials some 6–7 cultivars have usually been included in one block. Our results imply that the efficiency of testing procedures could be improved by using a block size as small as 4–5. The results further imply that if an experiment with an incomplete block design is well planned to mitigate the effects of within-block heterogeneity, the spatial mixed model techniques and the conventional analysis of variance techniques have approximately the same efficiency in testing pairwise yield differences. Thus, if appropri- ate blocking strategies are used in planning the trials, there is usually no need to change the conventional practice followed in statistical analysis. Key-words: spatial variation, experimental design, block size, statistical analysis Introduction It is well known that soil heterogeneity complicates field experiments. The statistical issues that arise can be dealt with in two ways. On the one hand we can use experimental designs that account for spatial effects in the soil and, on the other, we can use sta- tistical analyses that account for spatial dependence in the data obtained. In designing experiments, both randomised complete block designs and incomplete block designs have been widely used. The data obtained have been conventionally analysed using analysis of variance techniques complying with A G R I C U L T U R A L A N D F O O D S C I E N C E Jauhiainen, L. et al. Recommendable block sizes for variety trials 64 A G R I C U L T U R A L A N D F O O D S C I E N C E Vol. 17 (2008): 63-72. 65 the experimental design but ignoring the poten- tial spatial dependency of the residual variances. Several more complicated attempts to account for spatial variability have also been made. The earlier ones were based on nearest neighbour adjustments introduced by Papadakis (1937) and later improved e.g. by Kempton (1981). In recent years, spatial mixed models with spatial covariance structures have been proposed. A lot of work has been done to compare the efficiency of the different methods for analysing data from variety trials. The results obtained sug- gest that in most cases the spatial mixed model gives the most efficient analysis (e.g. Kravchenko et al. 2006). This leads to the question of whether the choice of experimental design has any effect on the efficiency of spatial mixed model analysis. The power analyses made by Stroup (2002) give an explicit answer to this question: the design does matter. Even when the spatial structure is known and can therefore be specified exactly, incomplete block designs specifically intended to mitigate the effects of within-block heterogeneity are more powerful than the complete block designs used in conjunction with spatial mixed models. Spatial mixed model techniques used together with incom- plete block designs are even more powerful. The results of Stroup (2002) lead to the con- clusion that researchers should use incomplete block designs wherever spatial variability exists and the number of treatments is large. But there still remains at least one open question: what is the recommendable number of treatments in one block? The answer to this question clearly depends on the nature of the spatial variability. In practice the variability is always unknown, but it is linked to geographic conditions such as the regional location of the experimental site and the soil characteris- tics of the experimental fields concerned, and also depends on the shape and layout of the field plots used in experimentation. Moreover, it depends on the response variables recorded for the analyses (Watson 2000). In this study we used Finnish official variety tri- al data on barley cultivars to investigate the effect of block size in testing pairwise yield differences and cultivar rankings. The results were intended to help in determining the recommendable block sizes of the incomplete block designs needed for forthcoming Finnish variety trials on barley cul- tivars. The research approach was made possible by the long and well-organised tradition of official variety testing supervised by MTT Agrifood Re- search Finland. Variety trials on barley have been carried out continuously for more than 30 years now. The main response variable considered has always been the grain yield. During the long testing period most of the operating procedures related to the experimentation have become well established. In particular, the shape of experimental plots and the practices for laying them out in the field have been standardised. But there have never been any well-established principles for determining appro- priate block sizes for the incomplete block designs predominantly used. Material and methods Trial data We used Finnish official variety trial data originating from 28 trials conducted in 2001–2005 at 8 sites in southern Finland. In each trial the cultivars to be tested were arranged in the field using incomplete block designs with four replications. The number of cultivars in one trial varied from 24 to 43 and the block size ranged from 4 to 7 plots. Within each trial the block sizes never differed more than one plot. The data are given in Table 1. Plots were 1.5–1.75 metres wide and 8–10 metres long. Trials were carried out according to well standardised and documented cultivation recommendations (Järvi et al. 1998). Spatial models A variogram measures the correlation between data pairs as a function of the displacement between the pairs (Brooker 2001). In our data, detailed layout A G R I C U L T U R A L A N D F O O D S C I E N C E Jauhiainen, L. et al. Recommendable block sizes for variety trials 64 A G R I C U L T U R A L A N D F O O D S C I E N C E Vol. 17 (2008): 63-72. 65 information was available on 9 trials. Using these trials a semivariogram g(h) measuring the spatial dependency was calculated as: (1) where yi and yj are observed yields from field plot i and j, respectively, dij is the distance between these plots, and N(h) is the number of field plots separated by the same distance h. The semivariogram was computed in different directions to ensure that the spatial variation was isotropic. Variogram γ(h) was then modelled from the semivariogram values using a spherical model with a nugget effect as follows: (2) where r, c0 and c refer to the range, the nugget and the sill, respectively. The semivariograms and pa- rameter estimates of the modelled variogram were calculated using VarioWin software (Pannatier, 1996). Before the analysis, the effect of cultivars ( ) ( ) ( )∑ = −= hd ji ij yy hN hg 2 2 1 Table 1: Trials (years and names of locations), numbers of cultivars tested, used block size or range of original block sizes, average grain yields and standard error of difference (SED). Site year* number of cultivars original block size average yield SED Mietoinen 2001 41 5-6 5039 146 2002a 32 4 3260 224 2002b 24 4 3562 120 2003a 28 4 2711 132 2003b 25 5 3572 171 2004a 37 5-6 6754 308 2004b 28 4 5690 316 2005a 42 6 5820 155 2005b 25 5 5423 423 Pälkäne 2001 31 5-6 5157 191 2002 35 5 4190 271 2003 32 4 4988 319 2004 43 6-7 4977 352 Mikkeli 2001 31 5-6 5439 415 2002 25 5 4846 314 2004 32 4 4809 221 2005 30 5 5800 377 Jokioinen 2001 32 4 6382 171 2002 26 4-5 4549 160 2003 25 5 5288 218 Ylistaro 2002a 33 5-6 4844 319 2002b 24 4 3714 318 2004a 43 6-7 5159 312 2004b 37 5-6 652 276 Tuusula 2001 31 5-6 6725 176 Pernaja 2004 26 4-5 5954 218 Inkoo 2004 38 5-6 5327 272 *two trials conducted at Ylistaro and Mietoinen in the same year were placed at separate fields with different soil types ( ) 2 3 2 1 h c c r h r h for h r c c for h r> 0 3 0 # = + - + c d n Z [ \ ]] ] * 4 A G R I C U L T U R A L A N D F O O D S C I E N C E Jauhiainen, L. et al. Recommendable block sizes for variety trials 66 A G R I C U L T U R A L A N D F O O D S C I E N C E Vol. 17 (2008): 63-72. 67 was removed from the data using the one-way ANOVA. Kriging is one technique among many for interpolating a variable from sample points. It has the advantage that the estimated points are obtained with minimum variance (Lark 2000). The estimated values of the parameters of equation (2) were therefore used to predict values at the nodes of a 30 cm grid by ordinary punctual kriging using the KRIGE2D procedure included in SAS software (SAS 1999). A yield map was created using these kriging estimates. To indicate the effect of block size and the re- sulting efficiency of an experimental design we used standard errors of pairwise yield differences (SED). To estimate such standard errors we used the following spatial mixed model: yij= µ + αi + εij (3) where μ and αi are fixed effects associated with the grand mean and ith cultivar, respectively, and the residuals εij are spatially correlated with a covariance structure defined in equation (2). SED for all pairs of cultivars were calculated during the analysis. They were found to vary with the locations of the cultivars in the field. To measure the efficiency of a particular design and analysis we therefore used the mean of all the SED values associated with this experiment. Superimposing new blocking on the data In this study we used field data in which each repli- cation included all the experimental plots arranged in a row. This made it possible to superimpose new blocking strategies on the existing data. New blocking for a block size of 2 was made by dividing one replication into several blocks of 2 plots. If the number of plots was odd, we randomly included a block containing 3 consecutive plots. New block- ings for block sizes of 3, 4, 5, 6–7, 8–10 and 14–20 were made in the same way. Differences between cultivars were analysed using the following traditional mixed model for incomplete block design: yijk= µ + rj + bk(rj ) + αi + εijk (4) where yijk is the yield of the i th cultivar from the jth replication and the kth block, µ is the intercept, αi is the effect of the ith cultivar, rj is the effect of the jth replication, bk(rj) is the effect of the k th incom- plete block nested within the jth replication and εijk is the residual error. rj , bk(rj) and εijk are assumed to be random in the model used. The assumptions for the random effects were: bk(rj) ∼ iid N(0,σ 2 b ), rj ∼ iid N(0,σ 2 r ), εijk ∼ iid N(0,σ 2), and all the effects are independent of one other. For comparison (Fig. 1), the data were analysed using ANOVA for the randomised complete block design based on equa- tion (4) and without the bk(rj) effect. The parameters of the models were estimated using the restricted maximum likelihood (REML) method with the SAS system and MIXED procedure (SAS 1999). Standard errors for estimated pairwise cultivar effects were calculated and used as in the spatial analysis. The incomplete block designs are, however, featured by the fact that the frequency of occurrence of the same pair in the same incomplete block has an effect on the standard error. For each block size A verage SED (% of RCBD*) 50 60 70 80 90 100 1 4 7 10 13 16 19 22 25 28 31 Block size *randomized complete block design Fig. 1. Average standard error of the difference (SED) of two cultivars as a function of a block size. A SED val- ue for randomized complete block design was fixed at 100%. Dashed line is based on all trials whereas solid line is calculated without seven trials in which the dif- ference between analyses based on incomplete and com- plete block designs was not found. A G R I C U L T U R A L A N D F O O D S C I E N C E Jauhiainen, L. et al. Recommendable block sizes for variety trials 66 A G R I C U L T U R A L A N D F O O D S C I E N C E Vol. 17 (2008): 63-72. 67 we therefore drew Box and Whisker plot to describe symmetry of the distribution of standard errors. In addition, we calculated the standard deviation of the respective standard error. This was then used to identify block sizes which were more robust and less sensitive to spatial variation. For each trial the results of the complete block analysis and the incomplete block analysis were used to rank the cultivars according to their es- timated mean yields and differences between the rankings. Further, for each trial, maximum and av- erage differences between the rankings of all pairs of cultivars were calculated. These were then used to estimate the validity and relevance of our re- sults. Results The spatial variation in all trials could be consid- ered isotropic, because no clear differences were observed in the variograms computed in different directions. Estimates for the parameters of the spatial mixed models are given in Table 2. All models had a clear nugget effect. The estimate for the range parameter was fairly similar in all trials, whereas sill and nugget effects varied considerably. The trial at Ylistaro in 2004 had low estimates for all three parameters, while Mietoinen 2005a had the smallest estimates for sill and nugget parameters (Table 2). The spatial dependence abated clearly in all trials when the distance between field plots increased from 2 to 10 metres i.e. the number of plots between two fixed plots increased from 0 to 5. When the block size reached 20 plots, the spatial dependence between the two outermost plots was negligible in most of the trials. Grain yield maps revealed that the areas of dif- ferent yield levels did not follow the shape of repli- cations or blocks. In many cases, such as Mietoinen in 2004 (Fig. 2), growing conditions could be seen to vary widely within one replication. In fact the yield could differ by over 500 kg/ha (10%) within a distance of five plots (e.g. Mikkeli 2005, Fig. 3). The smallest SED values were reached when the block size was 3, 4 or 5 plots (Fig. 1). With such block sizes the SED value was on average 19% smaller than that for the randomised complete blocks design. For the historically widely used block size of 6–7 plots/block, the SED value was only slightly higher (1–2%) than for block sizes of 3, 4 or 5. The SED value increased rapidly when the block size was 2 or higher than 9. However, blocking had no effect on SED in 7 trials, and therefore the average SED value was also calcu- lated without these trials (dashed line in Fig. 1). Table 2: Parameter estimates for the spatial mixed model and effectiveness of the model compared to the analysis based on incomplete block designs. Site year nugget sill range relative SED* Mietoinen 2004a 70 000 150 000 37 100 2004b 111 000 207 000 53 96 2005a 26 000 23 000 37 98 2005b 70 000 210 000 33 93 Pälkäne 2004 140 000 110 000 43 100 Mikkeli 2004 38 000 92 000 28 98 2005 40 000 50 000 25 100 Ylistaro 2002 100 000 300 000 38 72 2004a 40 000 50 000 25 65 *compared to the most accurate analysis of incomplete block designs, SED=standard error of difference A G R I C U L T U R A L A N D F O O D S C I E N C E Jauhiainen, L. et al. Recommendable block sizes for variety trials 68 A G R I C U L T U R A L A N D F O O D S C I E N C E Vol. 17 (2008): 63-72. 69 Three of these 7 trials were conducted in 2002 (at Jokioinen, Mikkeli and Ylistaro), the year with the lowest grain yield level (Table 1). Experimental fields were very homogeneous in only two of these trials, but in the remaining five trials the random variation in the field was clearly above average. No other similarities between trials were found. Block sizes of 3, 4 or 5 were the best for all sites. The advantage of incomplete block designs over the randomised complete block designs was highest at Mietoinen (28%) and lowest at Tuusula (0%, based on one trial) and Jokioinen (3%). A singe trial with the highest advantage was found from Pälkäne (in 2001). When the trial was ana- lysed using ANOVA for a randomised complete block designs the SED value of the trial was 427 kg/ha, the second highest SED in all 28 trials ana- lysed using the same model. When the same trial was processed as an incomplete block designs with block size 5, the SED value fell to 191 kg/ha. This was lower than the average for all 28 trials (=240 kg/ha). The highest SED value (451 kg/ha) was reached at Ylistaro in 2002. Optimal block size for this trial was 3 plots in a block. In this case the SED value was 41% lower, at 273 kg/ha. Both alternative strategies - the spatial mixed model technique and the incomplete block design - were found to account for the variation in the field, but the effect of the strategies varied among experimental sites (Table 2). The spatial mixed model was efficient in the trials at Ylistaro, where the difference between methods was 28–35%. However, only a slight difference was found at other sites. This could not be explained in terms of the sill, nugget and range parameters or the nug- get/sill ratio. In the analysis of randomized complete block designs SED was the same for all pairs of culti- vars but in the incomplete block analysis it varied according to the superimposed blocking. The dis- tribution of SED was not altogether symmetrical. The lowest SED values were found when the same pair of cultivars appeared several times in the same block. Fig. 4 demonstrates the shape of skewness at Mikkeli. The same shape repeated itself at all Yield level 6000 6250 6500 6750 7000 Fig. 2. Grain yield map including the outline of plots for trial at Mietoinen in 2004. Plots of one replication are in one row. 5200 5500 56505350Yield level Fig. 3. Grain yield map of one replication for trial at Mikkeli in 2005. Yield level decreased almost 500 kg/ha during five plots. Fig. 4. Box and Whisker plots for the standard error of differences (SED). Plots are drawn for the trials set up at Mikkeli in 2001, 2002, 2004 and 2005. Each trial was analysed using block sizes of 2, 3, 4, 5, 6–7, 8–10, 14–20 and 25–32=max. A G R I C U L T U R A L A N D F O O D S C I E N C E Jauhiainen, L. et al. Recommendable block sizes for variety trials 68 A G R I C U L T U R A L A N D F O O D S C I E N C E Vol. 17 (2008): 63-72. 69 the locations. In addition, the standard deviation of SED was found to decrease as the block size increased. The sharpest fall occurred with block sizes 2 and 3 (Fig. 5). If the within field variation was small the variance component for the block effect was small, too, and the variation between the SED values was not practically important. In such cases the two alternative analyses resulted in well–matched rankings where the average varia- tion in the ranking of a cultivar within one trial was only two positions (Fig. 6). On the other hand, if the within field variation was large the maximum change in the rankings could be more than 10 po- sitions (Fig. 7). In a few cases the change could be more than 20 positions. For example, consider- ing the trial Mietoinen 2003b, the complete block analysis ranked the cultivars Scarlett and Maaren at positions 6 and 23, respectively. When the same data were analysed using a superimposed incom- plete block design with block size 3, Scarlett and Maaren were ranked 18 and 14, respectively. The latter analysis further adjusted the estimated differ- ence between the mean yields of the two cultivars from 536 kg/ha to -120 kg/ha. The last result is in accordance with the results from official variety testing which propose an estimated difference of -202 kg/ha (Kangas et al. 2006). Discussion The precision of experimental results can be im- proved by keeping experimental error low. This can be done by: i) increasing the number of replicates in the experiment, ii) using uniform fields and manag- ing them in the same way, iii) using experimental designs that control the variation within a replicate, iv) choosing an optimal field layout that reduces the variation within replicates, and v) using statistical models that take spatial variation into account. In cost-effective experiments the number of replicates in an experiment must be kept as low as possible, and so attention must be given instead to the other issues referred to above if improved results are to be obtained. 0 3 6 9 12 15 1 4 7 10 13 16 19 22 25 28 31 Block size SD of SED (kg/ha) Fig. 5. Standard deviation of standard error of difference as a function of a block size. Dashed line is based on all trials whereas solid line is calculated using seven trials in which within field variation was the largest. average change of rankings 0 1 2 3 4 1 4 7 10 13 16 19 22 25 28 31 Block size Fig. 6. Average change of rankings of cultivars as a func- tion of a block size. Dashed line is based on all trials where- as solid line is calculated using seven trials in which with- in field variation was largest. maximum change of rankings 0 3 6 9 12 15 1 4 7 10 13 16 19 22 25 28 31 Block size Fig. 7. Maximum change of rankings as a function of a block size. Dashed line is based on all trials whereas solid line is calculated using seven trials in which within field variation was largest. 400000 300000 200000 100000 0 Fig. 8. Estimated spherical variogram for nine varie- ty trials. A G R I C U L T U R A L A N D F O O D S C I E N C E Jauhiainen, L. et al. Recommendable block sizes for variety trials 70 A G R I C U L T U R A L A N D F O O D S C I E N C E Vol. 17 (2008): 63-72. 71 In this study we used the results obtained by Stroup (2002) showing the uniform superiority of incomplete block designs and spatial mixed model analysis in variety trials. These results were ob- tained by comparing the power of pairwise tests over a range of hypothetical spatial dependence patterns, experimental designs and yield record- ings. Recommendable block sizes for practical test- ing situations can also be determined by applying power analysis. A possible strategy would be to use empirical data to estimate the extent of variability in practical test fields, compute the power of pair- wise tests over the range of covariance structures implied by the estimated variability and then use designs that appear to be the most powerful. The data available to us, however, allowed a more di- rect approach. The characteristic feature of our data was that within each replication the blocks were arranged in a row. This made it possible to super- impose new blocking structures on the data and so study the effect of block size on the efficiency of testing pairwise yield differences. The results obtained showed that the analytical efficiency of spatial mixed model techniques and incomplete block analysis with duly planned block sizes were well-matched. They also suggested that the ranking of a cultivar could vary up to 10–20 positions if an incomplete block design was used instead of a complete block design. These results are in accordance with the results obtained by Stroup (2002) whose research frame is very close to that of ours. Our results also comply with the conclusions of Lopez and Arrue (1995). Focusing on incomplete blocks of size 2 they report that the incomplete block design is, on average, 24 % more efficient than the complete block design. Grondona et al. (1996) have analysed variety trial data with nineteen different spatial models and compared the results with traditional incomplete block analy- sis. They report that spatial analysis is in general more efficient in reducing residual variation than incomplete block analysis although there was no one model that best fit all the trials. Our findings are in agreement with their studies, too. We found that among the data for the different years, experimental sites and experimental fields there was a significant difference in the extent of soil variation and the associated spatial dependence. Based on graphs such as in Fig. 8, we concluded that spatial dependence was of minor importance in some 20% of the trials. Recommendations on optimal blocking strategies were then made using data from the trials where spatial dependence was of major importance. The justification for this was that wherever there is little soil variation, experi- mental design is always of minor importance. In planning an experiment the extent and strength of spatial dependence are usually unclear, and so it is wise to be prepared for the worst and use designs that are robust against strong spatial variation. It is clear that different response variables re- flect the variation in the soil differently. In this pa- per we report results concerning grain yield only. We have, however, studied other traits determin- ing the agricultural value of barley cultivars. The results show that in most cases the experimental designs that are efficient in testing pairwise dif- ferences of grain yields are also efficient in test- ing pairwise differences of many other traits. In particular, we have found that e.g. for the length of the stand the pattern of spatial dependence is very similar to that of grain yield and the same kind of incomplete block designs are therefore appropriate for studying both these traits. The incomplete block designs are problematic because the frequency of occurrence of the same pair in the same incomplete block has an effect on the standard errors of the differences between the pairs. When the block size is small and the number of cultivars is large many pairs of cultivars never occur in the same block. Our results showed that this is a very relevant problem with block sizes 2 and 3. A straightforward precept to use incomplete block designs is therefore not justifiable. Yet, in variety testing all comparisons between the cul- tivars are not relevant. In many cases only differ- ences between the old control cultivars and the new cultivars are of interest. So, the use of incomplete block designs with small block sizes like 2 or 3 can be recommended under the circumstances where the relative importance of different comparisons is known and can be taken into account in plan- ning the experimental designs. Because trials were planned for a certain block sizes, it was not pos- A G R I C U L T U R A L A N D F O O D S C I E N C E Jauhiainen, L. et al. Recommendable block sizes for variety trials 70 A G R I C U L T U R A L A N D F O O D S C I E N C E Vol. 17 (2008): 63-72. 71 sible to control the concurrence of pairs of cul- tivars during the superimposition of new blocks. This increased the skewness of the distribution of SED and emphasised the significance of the origi- nal planning process. However, in the current study skewness was not a major issue all told. In this study we concentrated only on variety trials of barley cultivars. The approach presented was possible because most of the operating pro- cedures related to the variety trials on barley had been standardised during the long period of these trials. Official variety trials have been carried out in Finland for other cereals over many years, too. In addition to barley, we have variety trials data on spring wheat, winter wheat, oats and rye covering a period of more than 30 years. Like variety tri- als on barley these trials are also well-established and standardised. The same techniques applied in studying the optimal block sizes for barley trials could therefore be applied in finding optimal block sizes for variety trials on these cereal species. Our experience shows that optimal block size is mainly related to spatial dependence and to a lesser extent to species, treatment types (e.g. va- riety, fertilization levels) or the number of levels of treatment. This makes it possible to generalise the results obtained in this study to other species and to other types of experiment. Traditionally, in- complete block designs have been used in MTT Agrifood Research Finland when the number of treatments is 16 or more. Similar practices are com- mon in other research organisations. According to our results, incomplete block designs can also be recommended in experiments with 7 to 15 treat- ments. During the course of this study we found that if an experiment with an incomplete block design is well planned for mitigating the effects of within- block heterogeneity, the spatial mixed model tech- niques and the conventional analysis of variance techniques have approximately the same efficiency in testing pairwise yield differences. In terms of the labour required the latter is less onerous. The principal conclusion from this study is therefore as follows: Finnish official variety trials and many similar field trials can be conducted more efficient- ly by using smaller block sizes, and if appropriate blocking strategies are used there is no reason to change the conventional practice followed in sta- tistical analysis. References Brooker, P. 2001. Modelling spatial variablility using soil profiles in the Riverland of South Australia. Environmen- tal International 27: 121–126. Grondona, M.O., Crossa, J., Fox, P.N., & Pfeiffer, W.H., 1996. Analysis of variety yield trials using two-dimen- sional separable ARIMA processes. Biometrics 52: 763–770. Järvi, A., Kangas, A., Mattila, I., Mäkelä, L., Rahkonen, A., Salo, Y., Vuorinen, M., & Öfversten, J. 1998. Viral- listen lajikekokeiden suoritusohjeet. [Guidelines for con- ducting official variety experiments]. Maatalouden Tut- kimuskeskuksen Julkaisuja. Sarja B. 14, 71 pp. Kangas, A., Laine, A., Niskanen, M., Salo, Y., Vuorinen, M., Jauhiainen, L., & Nikander, H., 2006. Virallisten lajike- kokeiden tulokset 1998–2005. Results of official variety trials 1998–2005. MTT:n selvityksiä 105, 210 pp. Kempton R.A., & Howes, C.W., 1981. The use of neigh- bouring plot values in the analysis of variety trials. Ap- plied Statistics 30: 59–70. Kravchenko, A.N., Robertson, G.R., Snap, S.S., & Smuck- er, A.J.M., 2006. Using information about spatial vari- ability to improve estimates of total soil carbon. Agron- omy Journal 98 (3): 823–829. Lark, R. 2000. Designing sampling grids from imprecise information on soil variability, an approach based on the fuzzy kriging variance. Geoderma 98: 35–59. Lopez, M.V., & Arrue, J.L., 1995. Efficiency of an incom- plete block design based on geostatistics for tillage ex- periments. Soil Science Society of America journal 59 (4): 1104–1111. Pannatier, Y. 1996. VARIOWIN: Software for Spatial Data Analysis in 2D, Springer-Verlag, New York, NY. Papadakis, J.S. 1937. Methode statistique pour des expe- riences sur champ. Bulletin de l’Institut d’Amelioration des Plantes, Thessalonike 23. SAS 1999. SAS/STAT User’s Guide, Version 8, Cary, NC: SAS Institute Inc. Stroup, W.W. 2002. Power analysis based on spatial mixed effects analysis: a tool for comparing design and analy- sis strategies in the presence of spatial variability. Jour- nal of Agricultural Biological, and Environmental Statis- tics 7, 491–511. Watson, S. 2000. Spatial dependence and block designs in spaced plant herbage trials. Journal of Agricultural Science, Cambridge, 245–258. A G R I C U L T U R A L A N D F O O D S C I E N C E Jauhiainen, L. et al. Recommendable block sizes for variety trials 72 Lajikekokeet ovat tyypillisesti kokeita, joissa ver- tailtavien käsittelyjen eli lajikkeiden määrä on suuri, jopa useita kymmeniä lajikkeita. Tutkimuksessa käytetty aineisto sisälsi 28 virallista ohran lajikekoetta vuosilta 2001–2005. Tyypillisessä epätäydellisten lohkojen koeasetelman mukaisesti suunnitellussa ohran lajike- kokeessa lohkot sisältävät 6–7 lajiketta, mutta tulosten perusteella vain 4–5 lajiketta kannattaisi sijoittaa samaan lohkoon. Tulosten perusteella voidaan olettaa epätäy- dellisten lohkojen koeasetelmista olevan hyötyä myös pienissä, jopa 7–10 käsittelyn peltokokeissa. Tulokset osoittavat lisäksi, että oikein käytettyinä epätäydellisten lohkojen koeasetelmien avulla pystytään huomioimaan pellolla esiintyvä sangen nopeakin vaihtelu niin hyvin, ettei tarvetta ole siirtyä käyttämään monimutkaisempia tilastollisia menetelmiä, joita ovat mm. koeruutujen väli- sen spatiaalisen riippuvuuden huomioivat sekamallit. Pellolla esiintyvä vaihtelu heikentää peltokokeista saatavien tulosten tarkkuutta. Tilanne on erityisen hankala vertailtavien käsittelyjen lukumäärän ollessa suuri, jolloin koekentän koko voi kasvaa niin suureksi, että kasvuolosuhteet saattavat olla kovinkin erilaiset eri koeruuduissa. Koeasetelman valinnalla on perinteisesti pyritty kontrolloimaan pellolla esiintyvää vaihtelua. Epätäydellisten lohkojen koeasetelmassa yhteen lohkoon valitaan vain osa vertailtavista käsittelyistä, kun taas täydellisten lohkojen koeasetelman mukaisesti suunni- tellussa kokeessa kaikki vertailtavat käsittelyt sisältyvät jokaiseen lohkoon. Lohkomisen jälkeen vain lohkon sisäisellä kasvuolosuhteiden vaihtelulla on merkitystä, joten isoissa kokeissa epätäydellisten lohkojen koeasetel- man on todettu johtavan tarkempiin tuloksia kuin täydel- listen lohkojen koeasetelman käytön. Käyttökelpoinen lohkon koko ei kuitenkaan ole tiedossa. SELOSTUS Suositeltava lohkon koko ohran lajikekokeissa Lauri Jauhiainen, Jukka Öfversten ja Arjo Kangas MTT Palveluyksikkö ja Kasvintuotannon tutkimus Recommendable block sizes: a case study on Finnish official variety trials of barley cultivars Introduction Material and methods Results Discussion References SELOSTUS