Application of Canonical Correlation for Soil –Vegetation Interrelationship in the Cocoa Belt of South Western Nigeria

This paper investigates the multivariate relationships between the soil and vegetation characteristics of plant communities of forest, fallow and cocoa for two reasons. The first is to bringing out the interactions among a set of independent variables and pinpoint the relative importance of each of these variables on a dependent variable be it a soil or a vegetation element. The second is to illustrate the changes that occur to the soil-vegetation properties as the tropical rainforest is cultivated to cocoa and field crops. The processes of choosing the study sites followed a random selection of sample points under three land use types: forest, fallow and cocoa (Theobroma cacao). A total of 300 sample points were selected under each of fallow and cocoa and 260 were selected under the forest. From each sample point soil samples were collected to a depth of 45 cm (i.e. 0-15 cm; 15-45 cm) placed into well-labelled polythene bags and taken to the laboratory for analysis. The results for the forest shows that the canonical variate of the vegetation variables account for 31% of the variance extracted by the soil factors, while the soil variables also account for 31% of the variance extracted by the vegetation variables while the result of the canonical correlation for fallow soil and vegetation extracted for the first and second canonical variates are 0.91 and 0.61 respectively. The result for the canonical correlation coefficients for cocoa for the first and second canonical variates is 0.68 and 0.51 respectively. The study concluded that while simple relationships were observed among soil and vegetation properties in the forest and fallow, complex relationships were recognised in the cocoa plant community.


Introduction
In the tropics an increasing area is being depleted and converted to secondary forest, tree crops plantation and fallow. Many factors contribute to the forest depletion, among which are farming practices, over-exploitation of forest resources, illegal encroachment into the forest reserves and other human activities (Orimoogunje, 2005). In the case of farming whenever the forest is opened up for cultivation, the plant-soil equilibrium is disrupted. It has however, been observed that the cultivation of tree crops in the tropical rain forest region has brought a different dimension to agricultural practice and environmental conditions in the study area.
The deterioration of organic matter content of the soil under tropical cultivation has been recognized as the chief cause of decline and falling productivity of the tropical soil (Ojeniyi and Agbede, 1980;Ekanade, 1989;Kio, 2002;Orimoogunje, 2005). Nye and Greenland (1960) and Mou tappa (1973) found that areas under fallow vegetation have less organic nutrients than areas not cultivated to crops at all. Getahun et al (2002) and Kio (2002) have also noted some correlation between the age of fallow and the amount of humus in the soil. Similar conclusions have been reached by Aweto (1981) and Areola et. al. (1982) in the forest and savannah areas of south-western Nigeria respectively. However, it has been established that soil organic matter increases as the fallow advances in age (Getahun et al, 2002). The equilibrium level is achieved as soon as the vegetal regrowth provides an adequate ground cover and generates litter comparable in quantity with what obtains in the rain forest (Nye and Greenland, 1960;Aweto, 1981). Soil is not only a means of tree anchorage, but is also a component of the nutrient and water cycle in the forest ecosystem, highly interrelated to the vegetation it supports. Hence, drastic changes in forest cover will influence soil properties, and these changes in soil properties may also influence the type of forest that can regenerate.
The objectives of this study, therefore, are to investigate the multivariate relationships between the soil and vegetation variables of plant communities of forest, fallow and cocoa in in The Cocoa Belt of South Western Nigeria the study area, bring out the interactions among these variables; pinpoint the relative importance of each of these variables to others be it soil or vegetation. And lastly to illustrate the changes that occur to the soil-vegetation properties as the tropical rainforest is cultivated to cocoa and field crops utilising canonical technique.

Study Area
The study area covers about 800 km 2 and is located in the central part of the Atakumosa Local Government Area of Osun State. It is located between latitudes 7 0 20' and 7 0 38' N and between longitudes 4 0 26'E and 4 0 30'E ( Figure 1).

Figure 1. The Study Area
The study area is essentially rural. The climate of the area is the tropical rain forest type characterized by both high mean temperature and rainfall. The latter is about 1500 mm while the former is about 27 0 C. However, the mean annual temperature variations rarely exceed 8 0 C. The natural vegetation is the evergreen tropical rain forest now restricted to patches. The natural vegetation of the study area has been greatly modified by man, especially through cultivation of land resulting in a mosaic of secondary forests, fallow regrowths and tree crops. The geology of the area comprises mainly fine-grained biotite gneiss which is heavily foliated making it susceptible to deep weathering. The soils overlying the solid geology has been classified as Red Yellow Latosols, Ferruginous Tropical Soils, Alfisols and Orthic Luvisols. According to Symth and Montgomery (1962), the top 25 cm of the soils are usually sandy and brownish grey to brown in colour. Between depths of 25 cm and 120 cm there is a marked stone/gravel layer. The texture of this layer is rarely very clayey and may be very sandy while the colour ranges from greyish brown to reddish brown. The soils oc-cupy level to gently sloping sites at high levels in the topography and are known to be highly suitable for both food and tree crops cultivation (Adejuwon and Jeje, 1975).

Selection of Sample Points
The topographical maps (1:50 000) and air photos (1:40 000) were used to demarcate the study area and locate study sites. The process of choosing the study followed a random selection of sample points under three land use types: forest, fallow and cocoa (Theobroma cacao). A total of 300 sample points were selected under each of fallow and cocoa while 260 were selected under forest. From each sample point soil samples were collected to a depth of 45 cm (i.e. 0-15 cm; 15-45 cm) placed into well-labelled polythene bags and taken to the laboratory for analysis.

Soil Analysis
The soil samples were air-dried and sieved through 2-mm diameter mesh after which standard laboratory techniques were used to determine particle size distribution, moisture content, bulk density, total porosity, pH, nitrate-nitrogen, available phosphorus, exchange calcium, magnesium, potassium and sodium, cation exchange capacity (C.E.C) and base saturation.
Bulk density and total porosity were determined by core method and particle size distribution by the hydrometer method. Soil pH was determined potentiometrically in 0.01M calcium chloride solution using a soil to calcium chloride solution ratio of 1:2 and organic matter was determined by the Walkley-Black method. Nitrate-nitrogen was determined by using 2, 4-phenoldisulphonic acid on the prepared soil sample solution and available phosphorus by the Bray No. 1 method. Soil samples were leached with 1N neutral ammonium acetate to obtain extracts used for the determination of exchangeable cations. Thereafter, exchangeable calcium, potassium and sodium were determined by flame analyser and exchangeable magnesium by atomic absorption spectrometer. CEC was the sum of exchangeable bases and exchange acidity while percent base saturation was calculated as the sum bases divided by the CEC and multiplied by 100.

Litter Analysis
Five samples of accumulated litter were collected from 0.5 cm square quadrats by making the five selected soil sample points from a 40 m x 25 m plot from the centres of the quadrats. In all, 860 accumulated litter samples were collected, that is 300 from each of cocoa plantations and fallow and 260 from the mature secondary forests.

Statistical Analysis
The canonical correlation analysis was developed by Hotelling (1935,1936) and it is designed to analyse the basic patterns of interrelationships between two sets of multivariate phenomena. The basic strategy of canonical correlation analysis is to derive a linear combination from each of the sets of variables in such a way that the correlation between the two linear combinations is maximized (Clark 1975;Johnston 1978). This technique has been widely applied in geographic studies (Aweto 1978;Young, 1981 Its usage in geographic research shows that canonical correlation analysis appears to be a powerful analytical tool for elucidating the basic patterns of interrelationships between two sets of multivariate data in the discipline. It is in the light of this that the technique of canonical correlation is applied in the present study to examine the pattern of interrelationships between a set of soil variables and a set of vegetation variables in the forest, fallow and cocoa plant communities. However, it has been observed that the results of canonical correlation analysis are often difficult to interpret especially when the number of variables involved is large. Thus, preliminary data "reduction" through the use of Factor Analysis or Principal Components Analysis is usually adopted in order to make interpretation of results easier. In fact, the studies of Young (1981), Patridge and Wilson (1989), Jafari et al., (2003) and Xian-Li et al., (2008) exemplify this practice. In this study, both sets of data on soil and vegetation properties were "reduced" by the technique of Principal Components Analysis using the SPSS computer program. Furthermore, the technique of varimax rotational transformation as outlined by Harman (1970) was used to simplify the loading pattern of the variables on the components. This was done because unrotated factors are not considered to be completely diagnostic of the underlying structure in a set of data (Cattel 1965). Keskin and Yasar (2007) demonstrated that this multivariate analysis can be used to simplify the relationship between morphological and biochemical traits of the egg plant. Following the same path, Baize et. al., (2009) used it to expose the relationships between concentrations of trace metals in wheat grains and soil. Rummel (1967) and Clark (1975), among others, have identified that the varimax rotational procedure has two main advantages. These are the preservation of the total variance accounted for by the unrotated factors and the maximization of each component associated with a distinct cluster of variables. All these make interpretation of the loading pattern easier since simple structure matrix in which few variables load highly on each component results (Thurston 1936).
The changes in, and interrelationships between soil and vegetation properties under cocoa over time viz-a-viz the situation under fallow and forest were analysed using canonical correlation.

Results and Discussion
The results of the canonical correlation analysis used in this study were extracted from Principal Component Analysis performed for the data and are given in Tables 1, 2 and 3 for the soil/vegetation components of the forest, fallow and cocoa plant communities respectively. Table 1 shows that only one canonical variate was extracted between soil and vegetation characteristics. The canonical correlation coefficient is 0.56, indicating a strong relationship between the two measurement domains. Cooley and Lohnes (1971) observed that, as a rule of the thumb, any canonical correlation of 0.30 or less could be regarded as trivial. Since canonical loadings are similar to component loadings, they serve to identify the canonical variates. An examination of the canonical variate in Table 1 shows that strong relationships exist between base saturation and textural properties in the soil measurement domain and cover/litter and biomass variables in the vegetation measurement domain. The relationship between these soil variables and the cover/litter variable is positive, while that with the biomass variable is negative. This latter situation is surprising since it contradicts the general trend obtained by scholars in the simple bivariate correlation analysis (for example, Ekanade, 1989). However, it is observed that the component scores which were used as inputs for the canonical correlation analysis showed that plots which have high biomass values have large negative component scores. It can therefore, be said that this variable is negatively defined with the consequent negative relationship with the soil variables that load highly on this canonical variate. Although the organic matter component did not load highly on the canonical variate (-0.40), its negative sign demands attention especially when it is considered that cover/litter component is positively defined. However, it is observed that all the plots with very high organic matter have their component scores negatively defined. In that respect, it can be conceived that the negative scores on the component of soil organic matter reacted with the positive ones for the cover/litter component to produce a misleading negative sign.  Table 1 further shows the eigenvalue which is the squared value of the canonical correlation coefficient. The importance of this value is that it gives the level of variance ac-in The Cocoa Belt of South Western Nigeria counted for in the soil variables by the vegetation factors and vice versa. In this canonical variate, therefore, the vegetation variables (cover/litter and biomass) account for 31% of the variance extracted by the soil factors, while the soil variables (base saturation and soil texture) also account for 31% of the variance extracted by the vegetation variables. In effect, it can be concluded that the variance accounted for by each measurement domain in the forest plant community is small. This is not surprising since it has been inferred by many scholars, when discussing the bivariate correlation analysis that as a result of the steady-state nature in the forest, low fluctuation persists between soil and vegetation since input usual almost equalizes output. Table 2 shows the coefficients of the canonical correlation for fallow soil and vegetation. Two canonical variates were extracted and the canonical correlation coefficients for the first and second canonical variates are 0.91 and 0.61 respectively. Following the criterion of Cooley and Lohnes (1971), these canonical correlation coefficients indicate strong relationships between the two measurement domains in the fallow plant community. An examination of the structure of the canonical factor matrix as shown in Table 2 further reveals that it has the characteristics of the simple structure matrix outlined for factor analysis by Thurston (1936). The advantage of a canonical factor having a simple structure lies in the fact that it makes the interpretation of results easy. The first canonical variate indicates strong positive relationship between total porosity /pH / calcium / potassium / magnesium / C.E.C. and base saturation of the topsoil and cover / biomass / litter components of the vegetation. The second canonical variate also indicates a strong positive relationship between organic matter / nitrate-nitrogen of both the topsoil and subsoil and tree density components. These two canonical variates show that reciprocal relationships exist between soil and vegetation properties in the fallow plant community. This conforms to scholars findings in using simple correlation analysis for plant community studies. However, the negative sign assigned to the cover / biomass / litter component variable in the second canonical variate should be noted. It implies that this component factor is inversely related to organic matter / nitrate-nitrogen component as against what obtained between these variables in the simple bivariate correlation analysis. Again, this trend has been traced to the fact that most of the cover / biomass / litter component scores that served as inputs for the canonical correlation analysis for this factor were negatively defined thereby producing the misleading negative sign.
The eigenvalue of the first canonical variate is 0.83 while it is 0.37 for the second canonical variate. This implies that the cover/biomass/litter factor accounts for a variance of 83% in the total porosity/pH/calcium/potassium/C.E.C/base saturation of the topsoil factor while the latter also accounts for the same amount of variance in the cover/biomass/litter factor. Similarly, tree density factor accounts for 37% of the total variance in organic matter/nitrate-nitrogen factor while a total of 37% variance in tree density factor is also ac-counted for by the organic matter/nitrate-nitrogen factor. The pattern of result for the canonical correlation analysis in the fallow plant community follows that observed in the simple bivariate correlation in which many of the soil properties are positively significantly correlated. The identification of most of the soil chemical properties, coupled with total porosity in the topsoil layer and organic matter and nitrate-nitrogen in both the topsoil layer and organic matter and nitrate-nitrogen in both the topsoil and subsoil layers as being strongly related to the vegetation properties, signifies the reciprocal relationship between soil and vegetation in the fallow plant community over time (Aweto, 1978).  Table 3 shows that two canonical variates were extracted between soil and vegetation properties in the cocoa plant community. The canonical correlation coefficients for the first and second canonical variates are 0.68 and 0.51 respectively. All these indicate strong relationships between soil and vegetation properties in the cocoa plant community. It is observed that the two canonical variates have the characteristics of the simple structure matrix as in the case of fallow. The first canonical variate shows that there is strong positive relationship between bulk density/total porosity component of the cocoa soil and the density/cover/yield/weed/litter component of the cocoa vegetation. The relatively high loading of the organic matter/nutrient status component on the first canonical variate should be noted. It implies that there is also a positive strong relationship between it and the density/cover/yield/weed/litter component factor. With the exception of weed species and cocoa yield in the latter component, it seems that the sustenance of most of the topsoil properties under cocoa is undertaken by the elements of this vegetation component. In return, these vegetation prop-erties also have their "food supply" from the soil elements.
It appears from the result of the first canonical variate that the soil physical properties are crucial in the cocoa soil (Smyth and Montgomery 1962;Are and Gyne-Jones 1974;Falade 1975). However, the fact that the loading of the bulk density/total porosity component takes on a positive sign appears misleading. It is the other way round when the relationships between total porosity and these vegetation properties are considered. Furthermore, an examination of the component scores fed in to generate the canonical correlation does not reveal the pattern of negatively defined scores as noted in the fallow plant community. In effect, what can probably be inferred from this result is that the positive and the negative interactions of bulk density and total porosity with these vegetation properties in the cocoa plant community even out.
The second canonical variate indicates strong negative relationship between nitrate-nitrogen in the topsoil and subsoil; and subsoil exchangeable cations/C.E.C/base saturation components on the one hand and vegetation biomass component on the other (Table 3). These inverse relationships indicate that in the cocoa plant community, these soil properties appear to be adversely affected as the cocoa vegetation biomass variables increase. These results agree with the observations made by Ekanade (1989) with respect to the simple bivariate correlation analysis. Table 3 also shows that the eigenvalues for the first and second correlation variates are 0.46 and 0.26 respectively. This means that the density/cover/yield /weed/litter component factor accounts for 46% of the total variance in bulk density/total porosity component factor while the latter also accounts for the same amount of total variance in the density/cover/yield/weed/litter component factor. Furthermore, the biomass component factor accounts for a total variance of 26% in the subsoil exchangeable cations/C.E.C/base saturation component factors. Similarly, these latter two soil component factors account for 26% of the total variance in the cocoa vegetation biomass variable. The first conspicuous difference in the canonical correlation analysis between forest, on the one hand, and fallow and cocoa plant communities, on the other, is the fact that in spite of the highest number of variables used for the forest, as compared with fallow and cocoa, only one canonical variate was extracted (see Tables 1 -3). This could probably be attributed to the steady-state in the forest as opposed to dynamic changes that are taking place in the fallow and cocoa plant communities. As evident from the discussions of the canonical correlation analysis between soil and vegetation properties in the forest, fallow and cocoa plant communities, it could be deduced that certain deteriorating changes do occur to many crucial soil physical and chemical properties under cocoa over time. It could also be inferred that tree density, foliage cover and accumulated litter appear to protect the cocoa soil, in a way, from the harsh tropical weather thereby helping to sustain its fertility to some extent.
It could be implied from the various treatments that in the forest, the influence of the steady-state condition was apparent while in the fallow, the tendency was for a reciprocal recuperative interaction between soil and vegetation over time. However, in the cocoa plant community, the analytic technique was unequivocal on the deteriorating impact of cocoa vegetation on certain crucial soil properties such as bulk density, total porosity, clay, organic matter, nitrate-nitrogen, calcium, magnesium, C.E.C. and base saturation. The above notwithstanding, it was also clearly indicated that certain elements of the cocoa vegetation appear to sustain the cocoa soil to some extent over time.
The canonical correlation indicates that increases in the cocoa biomass indices lead to deterioration in the quality of cocoa soil. This is especially so in such crucial soil chemical elements as nitrate-nitrogen, calcium, magnesium, potassium, C.E.C. and base saturation. The statistical technique indicates that two cocoa vegetation elements, foliage cover and accumulated litter, have advantageous effects on almost all of the cocoa soil elements. In effect, the maintenance of a closed canopy in the cocoa plant community may lead to greater accumulation on most elements of the cocoa soil.

Conclusions
From the outcome of the present exercise it can be concluded that while simple relationships were observed among soil and vegetation properties in the forest and fallow, complex relationships were recognised in the cocoa plant community. The study clearly indicates the steady-state nature of the forest plant community and the fact that the fallow plant community is tending towards equilibrium between soil and vegetation characteristics. The study further shows that the most affected soil properties to the extent of becoming limiting to cocoa as the latter advances in age include bulk density, total porosity, nitrate-nitrogen, calcium, magnesium, C.E.C. and base saturation. The limiting nature of these soil properties is indicated by their inverse relationships, especially in the topsoil, with the cocoa vegetation biomass and yield variables. The subsoil clay fractions are identified as being critical to the cocoa ecosystems, notwithstanding the age of cocoa. It could be true to conclude that the subsoil clays play a significant role in sustaining soil fertility under cocoa overtime.