Interpolation
Interpolation in petroleum geoscience and engineering is the mathematical process of estimating values of a property (porosity, permeability, formation thickness, oil saturation, seismic velocity) at unsampled locations between known data points by using a mathematical function that is fit to the known data and then evaluated at the desired location; interpolation assumes that the property varies smoothly and continuously between data points (rather than jumping discontinuously), so that values at intermediate locations can be reasonably estimated from nearby measurements; in well log analysis, interpolation is used to estimate log properties at depths where measurements are missing due to tool failures, washed-out intervals, or gaps between logging runs; in reservoir modeling, spatial interpolation methods (kriging, inverse distance weighting, natural neighbor, and others) are used to distribute petrophysical properties from well locations to the grid cells of a 3D geological model that covers the entire field, with the interpolated values at undrilled locations being the foundation of volumetric calculations and production forecasts; in seismic processing, interpolation of missing traces (regularization) fills gaps in the acquisition footprint that would otherwise create artifacts in the migrated image; the quality of an interpolation depends critically on the density of data points relative to the spatial variability of the property being interpolated, with high data density relative to variability producing reliable interpolations and sparse data with high variability producing interpolated values that may be geologically meaningless regardless of the mathematical sophistication of the interpolation algorithm used.
Key Takeaways
- Kriging, the geostatistical interpolation method developed by the South African mining engineer Danie Krige in the 1950s and formalized by Georges Matheron in the 1960s, is the most widely used spatial interpolation method in petroleum reservoir modeling because it uses the variogram (a spatial correlation function that describes how the property's variance changes with separation distance between data points) to weight the known data points optimally, minimizing the estimation error at unsampled locations; unlike simpler methods such as inverse distance weighting (which gives more influence to nearby points without regard to the actual spatial correlation structure of the data), kriging uses the statistical relationship between the data values and their spatial arrangement to produce interpolated estimates that honor the geological continuity implied by the variogram; kriging also provides an estimation variance (uncertainty measure) for each interpolated value, allowing the interpreter to identify where the interpolation is well-constrained (many nearby data points, small estimation variance) and where it is highly uncertain (sparse data coverage, large estimation variance); the limitation of kriging is that it produces smooth interpolated surfaces that reproduce the input data exactly at data points but may not represent the full variability of the property between wells, which is why multiple stochastic realizations (using sequential Gaussian simulation or similar methods) are often used alongside kriging to represent the geological uncertainty in the interpolated distribution.
- The variogram (or its equivalent, the covariance function) used in kriging interpolation is one of the most important and often most poorly constrained inputs in petroleum reservoir geostatistics: the variogram quantifies the spatial correlation structure of the property being interpolated — the correlation range (the distance beyond which data points have negligible influence on the interpolated value), the sill (the total variance of the property), and the nugget (the variability at zero separation distance, reflecting measurement error and sub-scale variability); fitting a variogram model to experimental variograms computed from the available well data is straightforward when the data density is high and the property is relatively homogeneous, but becomes highly uncertain when the well spacing is larger than the geological correlation range or when the geological variability is strongly directional (higher correlation in the depositional direction than perpendicular to it); the variogram must be estimated from well data alone if seismic-derived property estimates are not available, and with typical North Sea or Permian Basin well spacings of 500-2,000 meters, the variogram is often determined by only a handful of data pairs at short separation distances and many uncertain assumptions about the geological process that controls the spatial correlation structure.
- Seismic attribute interpolation (also called trace interpolation or data regularization) addresses the requirement that seismic migration algorithms need data at regular spatial intervals to produce artifact-free images, while real acquisition footprints have irregular geometry due to cable feathering (in marine surveys), surface obstacles (in land surveys), and source-receiver offset distribution variations: the interpolation fills in missing or sparse traces using information from neighboring traces, exploiting the fact that the seismic wavefield at a missing location can be predicted from the wavefield at nearby recorded locations using plane-wave decomposition, Fourier-domain interpolation (F-K or F-X interpolation), or prediction-error filtering; successful trace interpolation allows surveys with irregular acquisition footprints to be processed with migration algorithms that assume regular geometry, significantly improving image quality; the limitation is that interpolation cannot reliably reconstruct data at locations separated from the nearest recorded traces by more than the spatial aliasing wavelength (the distance at which the wavefield changes by more than half a wavelength between adjacent traces), and aggressive interpolation of highly aliased data can introduce artifacts that reduce rather than improve image quality.
- Temporal interpolation between well test pressure points is used in pressure transient analysis when the downhole pressure gauge recording rate is low relative to the rate of pressure change during early-time buildup or drawdown: downhole pressure gauges in modern wells typically sample at 1 second or finer intervals during transients, but legacy gauges or cost-optimized recording schemes may sample at 1-5 minute intervals, missing the early-time pressure behavior that contains information about the wellbore storage coefficient, the damaged zone permeability, and the skin factor; interpolating between sparse pressure measurements to create a denser time series for derivative calculation may produce a smooth-looking derivative that appears to confirm the expected radial flow regime but actually obscures real features in the early-time behavior; the best practice for wells where early-time data is critical is to use high-frequency recording during the period of maximum rate of pressure change (the first few minutes to hours of buildup), and to store data at lower frequency only during the late-time period when the derivative has stabilized and the rate of pressure change is slow enough that sparse sampling is adequate.
- The distinction between interpolation (estimating values within the range of the known data) and extrapolation (estimating values beyond the range of the known data) is critical in petroleum engineering because extrapolated values have much higher uncertainty than interpolated values and may be physically unrealistic: a porosity-depth relationship calibrated from wells with depths between 1,000 and 3,000 meters can be interpolated within that range with reasonable confidence, but extrapolating the same relationship to 5,000 meters depth (beyond the calibration range) may predict negative porosity or porosities inconsistent with the mechanical compaction physics of the formation; similarly, a production decline curve fit to the first two years of well production can be used to interpolate the expected rate at any time within those two years with good confidence, but extrapolating the same curve to predict the 20-year cumulative production requires assumptions about decline behavior that may not hold if the well undergoes artificial lift, recompletion, or reservoir depletion effects not represented in the early production data; distinguishing interpolation from extrapolation in both the spatial and temporal domains is a fundamental quality control step in petroleum engineering analysis.
Fast Facts
The geostatistical interpolation methods widely used in petroleum reservoir modeling today were developed not in the oil industry but in the South African gold mining industry of the 1950s, where Danie Krige was trying to estimate gold grade distributions between sampling locations in Witwatersrand gold mines. Georges Matheron at the Ecole des Mines de Paris formalized Krige's empirical methods into the rigorous mathematical framework of geostatistics in the 1960s and named the interpolation method "kriging" in Krige's honor. The transfer of these methods from mining to petroleum engineering occurred primarily in the 1980s and 1990s as the computing power needed for 3D geostatistical modeling became available and as the petroleum industry recognized that its data characteristics (sparse well data with dense seismic coverage) matched the epistemological challenge that geostatistics had been designed to address.
What Is Interpolation?
Interpolation is the art of filling in the blanks between what you know. In reservoir modeling, you measure porosity in three wells and need to know what it is across the entire 50-square-kilometer field. In seismic processing, you recorded traces every 25 meters and need them every 12.5 meters for the migration to work properly. In production forecasting, you measured pressure buildup at the beginning and end of a test and need to know the pressure at every intermediate time for derivative analysis. The interpolation algorithm provides the estimated values at unsampled points, using the mathematical structure of the known data to infer what is most likely in between. The critical insight is that interpolation is an estimate, not a measurement, and the quality of that estimate depends entirely on how much the property varies between data points compared to the data density. No algorithm, however sophisticated, can reliably interpolate a highly variable property from sparse data. The algorithm can only make the best of the information available, and it cannot know what it doesn't know.
Synonyms and Related Terminology
Interpolation methods in reservoir modeling are often named specifically: kriging, IDW (inverse distance weighting), natural neighbor, spline interpolation, or moving average. Related terms include kriging (the geostatistical interpolation method that uses the variogram to optimally weight nearby data points, widely used for spatial interpolation of reservoir properties in petroleum geological modeling), variogram (the spatial correlation function that quantifies how the variance of a property changes with separation distance, the fundamental input to kriging and other geostatistical interpolation methods), extrapolation (the estimation of values beyond the range of the known data, which has much higher uncertainty than interpolation within the data range), geostatistics (the application of statistical methods to spatially distributed geological data, of which interpolation and simulation are the primary tools for reservoir property distribution), and regularization (the seismic processing step of interpolating missing or irregularly sampled traces to create a regular acquisition grid for migration, also called trace interpolation or data reconstruction).
Why the Distance Between Your Data Points Is the Constraint That Mathematics Cannot Overcome
Every interpolation algorithm, from the simplest linear method to the most sophisticated geostatistical simulation, faces the same fundamental constraint: the quality of the interpolation cannot exceed what the data supports. If the property you are interpolating varies over distances shorter than your data spacing, the interpolation will systematically miss those variations regardless of the algorithm's theoretical elegance. In petroleum reservoir modeling, this translates to a straightforward but uncomfortable truth: a geological model built from wells spaced 2 kilometers apart cannot reliably represent heterogeneity at the scale of 100-meter channels, regardless of how sophisticated the geostatistical algorithm is. The model will look detailed and quantitative, but the details between wells are informed guesses constrained by statistical assumptions, not measurements. Understanding this limitation — and communicating it honestly in reserve estimates and development plans — is the professional responsibility of every petroleum engineer and geoscientist who uses interpolated data as the basis for business decisions.