Skip to article frontmatterSkip to article content

Data Misfit

Authors
Affiliations
University of British Columbia
University of British Columbia

The observations dobsd^{obs} encode our knowledge about the model obtained from the survey. Any candidate solution mm must produce simulated (or predicted) data dd that “fit” the observations. Central to this goal is the choice of a “criterion” for measuring misfit between observed and predicted data, and a “tolerance” for deciding what value constitutes an acceptable fit. This is a subject on which an enormous amount of literature has been written but a good reference, written for the inverse problem, is Parker (1994). When errors are Gaussian with zero mean and known standard deviation, then the L2L_2 -norm, is an appropriate choice. We define the misfit to be

ϕd=j=1N(djdjobsϵj)2\phi_d=\sum^N_{j=1}\left(\frac{d_j-d_j^{obs}}{\epsilon_j}\right)^2

where djobsd_j^{obs} is the observation, ϵj\epsilon_j is its estimated standard deviation, and djd_j is the predicted datum. The quantity

ηj=djdjobsϵj\eta_j = \frac {d_j - d_j^{obs}} {\epsilon_j}

is a random variable with zero mean and unit standard deviation and ϕd\phi_d , which is the sum of squares of these variables, is the well known χ2\chi^2 statistical variable. It has an expected value ((3)) and variance ((4)).

E[χ2]=NE[\chi^2] = N
Var[χ2]=2NVar[\chi^2] = 2N

Thus if we are attempting to find a model mm that acceptably fits the data, then models with ϕdN\phi_d \simeq N are good candidates. For many problems we often denote a target misfit ϕd\phi_d^* to be

ϕd=N\phi_d^*=N

but this must only be regarded as a reasonable estimate and some flexibility should be entertained.

In times past, it was felt that getting an acceptable fit to the data was a sufficient criterion for having a successful inversion. The observed data dobs\mathbf{d}_{obs} are inverted to produce a model m\mathbf{m}, which is used to forward model the predicted data d\mathbf{d}. The observed and predicted data are then compared using the data misfit measure. If ϕd<ϕd\phi_d<\phi_d^* the model m\mathbf{m} is accepted, but if not, the inversion parameters are adjusted and the process is repeated until an acceptable model is reached. The workflow procedure is delineated below.

Finding a model that fits the data is a necessary component of an inversion algorithm, but as shown in the next section, it is not sufficient.

References
  1. Parker, R. L. (1994). Geophysical Inverse Theory (Vol. 1). Princeton University Press. 10.2307/j.ctvs32s89