Contents Page

Previous section

Isobel’s Home Page

Practical Geostatistics 2000

Courses

 

CHAPTER 4: Estimation

So far we have used the basic concepts and assumptions of Geostatistics to build ourselves a ‘model’ of the structure and continuity within the deposit. We have also (in Chapter 3) seen how this can lead to the production of ‘theoretical’ grade/tonnage curves and the study of how mining block size can influence final production Figures. It is now time we returned to our original problem of the estimation of ore reserves. The discussion in this (and the next) chapter will be confined to ‘local’ estimation, i.e. interest is confined to one portion of the deposit at a time. However, it should be borne in mind that the same techniques can be applied on a global scale, i.e. to the whole deposit at once. It should also be remembered that block-by-block or stope-by-stope estimates will lead inevitably to global estimates.

Let us, then, define the situation which is of interest to us. There is a point or an area or a volume of ground over which we do not know the grade (or value), but we wish to estimate it. Let us call this ‘unknown’ grade T, and the area (or point, or volume) of interest A. In order to produce an estimator we must have some information, usually in the form of samples. To be completely general, let us suppose n samples with values of g1,g2,g3...gn. This set of samples is generally denoted by S. From these samples we can form a ‘linear’ type of estimator --- that is, a weighted average. We must restrict ourselves to this type of estimator at this stage. The estimator is denoted by T* and is equal to:

where the w1, w2, w3...wn are the weights assigned to each sample. Most currently used local estimation techniques use a weighted average approach --- inverse distance techniques and so on. The simplest case of all is when all of the weights are the same, and T* is just the arithmetic mean of the sample values.

 

Easting

Northing

Grade

Point

(ft)

(ft)

U3O8

A

4150

2340

 

1

4170

2332

400

2

4200

2340

380

3

4160

2370

450

4

4150

2310

280

5

4080

2340

320

Fig 4.1. Hypothetical sampling and estimation situation --- a uranium deposit.

Table 4.1 Positions and values on hypothetical Uranium estimation problem

 

Now consider the setup of samples and ‘unknown’ which we originally discussed in the first chapter. Figure 4.1 shows the point of interest which lies at position A, and we have five ‘point’ samples lying around this position. The co-ordinates of these six points and the values of the samples are given in Table 4.1. The hypothetical deposit is a low-grade, large-tonnage uranium one, which is assumed to be isotropic. The semi-variogram model fitted to this deposit is a spherical one with a range of influence of 100 ft, a sill value (C) of 700 (p.p.m.)² and a nugget effect of 100 (p.p.m.)². Let us take the simplest possible estimation procedure. Take the value at the closest sample position (1) and ‘extend’ this to the unknown point. In doing so we incur an estimation error, e, which will be equal to the difference between the actual value T and the estimated value T*, which in this case equals g1. That is:

 


It is not too difficult to show that if there is no trend (at least locally), this estimator is unbiased. That is, if we make lots of similar estimations the average error will be zero.


The ‘reliability’ of the estimation can be measured by looking at the spread of the errors. If the errors take values consistently close to zero, then the estimator is a ‘good’ one. If the spread of values is large, then the estimator will be unreliable. The simplest stable measure of spread (statistically) is the standard deviation. The standard deviation of an estimation error --- or standard error as it is referred to in ordinary statistics --- will therefore measure the reliability of that estimator.

 

No matter how many estimations we perform, we cannot calculate the standard deviation of the errors since we do not know the value of the error made. Therefore we must look at the ‘theoretical’ form of the variance of the estimation error, i.e. the estimation variance:


The average would be made (theoretically) over the whole deposit. That is, the same estimation situation would be repeated over the whole deposit and the variance found. This cannot be done in practice, of course, so let us look closer at the form of this variance. It is found by taking the grade at point
A, subtracting the grade at point 1, squaring the result, repeating the process over all possible pairs of such points and then averaging the values. This sounds exactly like the definition of a variogram. In fact, it is the variogram between the two points A and (1). Given the distance between them (h)  we can evaluate this estimation variance simply by reading a value from the semi-variogram model (g )  and multiplying it by 2. This is one of the reasons why it is good policy to avoid confusing the variogram and the semi-variogram. Thus:


In the case of our particular example given in Fig. 4.1:


Given our knowledge about this deposit, i.e. the semi-variogram model, we can state (without too much fear of error) that the estimator used has a standard error of 25.4 p.p.m. Turning this standard error into a confidence interval, however, requires the assumption of some kind of probability distribution for the deposit. For instance if we hope that the Central Limit Theorem holds, we can say that a 95% confidence interval for
T would be given by T*± 1.96se, i.e. (350 p.p.m., 450 p.p.m.). On the other hand, if we were to assume a log-normal distribution for the errors, the 95% confidence interval would be given by (354 p.p.m., 453 p.p.m.).

 

Fig. 4.2. More realistic estimation --- the value of the block is required (uranium deposit).

 

Now, let us complicate the procedure a little. Instead of estimating the value at the point A, in a more realistic situation (at least in mining) we would be interested in the average grade over an area or block or some mining unit. In Fig. 4.2, a ‘panel’ 60 ft by 30 ft has been centred on the original point A. The estimation procedure then becomes:


The same arguments as previously still hold. The average error can be shown to be zero if there is no local trend. The estimation variance is still a variogram, but it is now the variogram between the grade at sample point (1) and the average grade over the panel
A. We saw in Chapter 3 that we could cope with average grades over samples if we wanted the semi-variogram between samples of the same size, but so far we have not considered the possibility of having two different sizes to compare. The model semi-variogram supplies us with the difference in grades between two points. We could find the value of the semi-variogram between the sample point and every point within the panel A, and we could average those values. Let us define this quantity as (S,A), read as ‘gamma-bar between the sample and every point in the panel’. The ‘bar’ notation is the standard one for arithmetic mean. This gamma-bar term will take the place of the g(h) in our previous relationship. However, what we really need is the semi-variogram between the average grade of panel and the sample, not between all the individual points within the panel and the sample. 2(S,A) would be the variance of the error made if we tried to estimate every point within the panel. To correct for this difference in emphasis we need to take into account the variation of the grades at points within the panel.

This was discussed in Chapter 3, and we evaluated it using the auxiliary function F(l,b). This was the average semi-variogram between all possible pairs of points within the panel. We can rewrite this in a more general way using the gamma-bar notation. That is, (A,A) will be the average semi-variogram value between every point in the panel and every point in the panel. In the case shown in Fig. 4.2, then, when using the value at sample point (1) to estimate the average grade of the panel, the estimation variance becomes:


The calculation of these gamma-bar terms will be discussed more fully later.


Now, let us complicate the mathematics still further. We actually have more than one sample available to us, so why not use them in the estimation procedure. Suppose we use the arithmetic mean of the samples as our
T*. This gives us the simplest form of the weighted average type of estimator. That is:


In this case the term
(S,A) is the average semi-variogram value between each point in the ‘sample set’ S and each point in the panel A. The term (A,A) is still the average semi-variogram between each point in the panel and each point in the panel. However, now we have yet another source of spurious variation. We only consider the average grade of the samples as the estimator, but (S,A) takes the individual grades into account. Thus we have also to subtract a (S,S) term from the variance, where this is the average semi-variogram value between each point in the sample set and each point in the sample set (i.e. 25 ‘pairs’ of samples). The final version of the estimation variance then becomes:


The arithmetic mean is often known in Geostatistics as an extension estimator, and the above variance is referred to as the extension variance. To distinguish this variance from the more general estimation variance for a weighted average, the subscript
e is used rather than the general e.

 

 

Contents Page

Next section

Isobel’s Home Page

Practical Geostatistics 2000

Courses