Gaussian Collocated Cosimulation
Gaussian collocated cosimulation is a sequential Gaussian simulation algorithm that incorporates secondary data (such as seismic attributes) at every simulation grid node by exploiting a Markov-type screening hypothesis — the assumption that the primary variable (such as porosity) at unsampled locations is conditionally independent of all secondary data at distant locations, given knowledge of the collocated secondary value at the same location — which reduces the full co-kriging system to a computationally tractable simple kriging co-kriging equation that uses only the collocated secondary datum and the primary variable's spatial covariance structure; the algorithm produces stochastic realizations of the primary variable (porosity, net-to-gross, permeability) that honor all available primary data (well measurements), reproduce the primary variable's histogram and variogram, and locally integrate the spatial information provided by the secondary variable (seismic impedance, attribute cubes), making Gaussian collocated cosimulation the standard geostatistical technique for generating reservoir property models that are constrained by both well data and 3D seismic attributes in petroleum reservoir characterization workflows.
Key Takeaways
- The Markov screening hypothesis is the mathematical foundation of collocated cosimulation — instead of using all secondary data at all locations (which would require solving a very large co-kriging system with as many equations as there are simulation nodes times the number of secondary data points), the Markov-type approximation assumes that once the collocated secondary value at the target node is known, additional secondary data at other locations add no further information about the primary variable; this assumption is mathematically equivalent to assuming that the secondary variable's influence on the primary variable decays to zero beyond the collocated point, which is a simplification of reality but typically a good approximation when the secondary variable is sampled densely (as seismic is) relative to the primary variable correlation length; the practical consequence is that the collocated co-kriging system reduces to two equations with two unknowns (the primary spatial continuity model and the correlation coefficient between primary and secondary at the collocated point), which can be solved in microseconds per node compared to the minutes required for full co-kriging.
- Cross-correlation coefficient between primary and secondary variables controls the degree to which secondary data influences the simulated primary property field — a cross-correlation coefficient (rho) of 1.0 means the secondary variable perfectly predicts the primary variable and all simulated realizations will honor the secondary data exactly; a coefficient of 0.0 means the secondary data has no influence and the simulation degenerates to standard sequential Gaussian simulation (SGS) with only primary data conditioning; in practice, the cross-correlation between seismic impedance and porosity in carbonate reservoirs typically ranges from 0.3 to 0.7, meaning the seismic provides significant but imperfect guidance for the simulated porosity field; calibrating this cross-correlation coefficient requires co-located well log and seismic data at the well locations, where both primary (porosity from logs) and secondary (seismic impedance) values are known and their correlation can be measured directly.
- Sequential simulation path in Gaussian collocated cosimulation visits each grid node once in random order and at each node: (1) retrieves the simulated values of the primary variable at previously simulated neighboring nodes that fall within the search neighborhood, (2) retrieves the collocated secondary datum at the target node, (3) solves the collocated co-kriging system to obtain the conditional mean and variance of the primary variable at the target node given the neighboring primary data and the collocated secondary value, (4) draws a random value from the Gaussian distribution defined by that conditional mean and variance, and (5) adds this simulated value to the dataset of conditioning data for subsequent nodes; the random simulation path ensures that all directions in the model are treated equivalently and no systematic spatial bias is introduced by a fixed simulation order; repeating the process with different random seeds produces multiple equally probable realizations that quantify geological uncertainty.
- Normal score transform is a prerequisite for Gaussian collocated cosimulation because the algorithm assumes both the primary and secondary variables follow Gaussian (normal) distributions — well porosity data and seismic attributes typically follow non-Gaussian distributions (porosity bounded between 0 and 1, seismic attributes with skewed distributions from interference effects); the normal score transform maps the original data values to their equivalent quantiles in a standard Gaussian distribution (mean 0, variance 1), preserving the relative data ranking while creating Gaussian-distributed transformed values suitable for the simulation algorithm; after all realizations are completed, the simulated Gaussian values are back-transformed to the original variable distribution using the inverse normal score transform; the shape of the original distribution is reproduced in the final simulations while the spatial structure was modeled in the Gaussian space where geostatistical algorithms are mathematically valid.
- Co-simulation of multiple variables (porosity, net-to-gross, clay content, permeability) in the same sequential cosimulation framework maintains cross-variable correlations that are important for flow simulation — once porosity is simulated at all nodes using the collocated seismic attribute as secondary data, net-to-gross can be cosimulated using both the seismic attribute and the already-simulated porosity as secondary conditioning variables; permeability can then be cosimulated using porosity (through a porosity-permeability transform) and the seismic attribute; this sequential co-simulation chain propagates the seismic constraint through multiple interrelated reservoir properties consistently, unlike independent simulation of each property that would break the geological relationships between porosity, clay content, and permeability that control reservoir flow behavior.
Fast Facts
Gaussian collocated cosimulation was formalized by Alain Goovaerts and Clayton Deutsch in their 1997 textbook "Geostatistics for Natural Resources Evaluation," which provided the theoretical foundation and practical implementation details that made the technique accessible to reservoir geologists and engineers. The algorithm was implemented in the GSLIB (Geostatistical Software Library) public domain software package (Deutsch and Journel, 1992, 1998), specifically in the program COSGSIM (Collocated Sequential Gaussian Simulation), which enabled widespread adoption in academic and industrial reservoir modeling workflows. Today, Gaussian collocated cosimulation is built into all major commercial reservoir modeling packages (Petrel, RMS, Tempest, IRAP RMS), where it is typically the default method for generating porosity and net-to-gross realizations conditioned to seismic impedance inversions in 3D reservoir models.
What Is Gaussian Collocated Cosimulation?
Building a reservoir model requires populating a three-dimensional grid — often containing millions of cells — with estimates of properties like porosity, permeability, and fluid saturations. The challenge is that direct measurements (from wells) are sparse, while indirect measurements (from seismic) are dense but imprecise. Reservoir modelers want to use both: the accurate but sparse well data and the spatially comprehensive but noisy seismic data.
Gaussian collocated cosimulation is the algorithm that accomplishes this integration efficiently. The "Gaussian" part means the simulation works by drawing random values from normal distributions defined by the local statistics of the data. The "collocated" part refers to the Markov screening trick: instead of considering all the seismic data points across the entire model when simulating porosity at any given cell, the algorithm uses only the seismic value at that same cell location — the observation that a nearby seismic value tells you far less about the porosity at a point once you already know the seismic value at that exact point. This simplification makes the algorithm computationally feasible for million-cell grids while producing results that are nearly as good as the theoretically rigorous full co-kriging approach.
The result is a set of realizations — multiple equally probable reservoir property models — that all honor the well data, reproduce the expected geological variability (encoded in the variogram), and follow the spatial trends implied by the seismic. Running flow simulation on many such realizations quantifies how geological uncertainty translates into production forecast uncertainty, which is the information needed for risk-adjusted investment decisions in field development.
Gaussian Collocated Cosimulation in Reservoir Modeling Practice
Variogram modeling for the primary variable is a critical pre-processing step that defines the spatial continuity model used in the collocated cosimulation — the experimental variogram (a function describing how rapidly the primary variable becomes dissimilar with increasing separation distance in different directions) is computed from the available well data, typically by calculating the average squared difference between porosity values at well pairs separated by increasing lag distances in multiple azimuthal directions; the modeled variogram (a permissible covariance model such as a spherical, exponential, or Gaussian function fitted to the experimental variogram) provides the spatial continuity parameters (range, sill, nugget) in each principal direction (horizontal maximum continuity, horizontal minimum continuity, vertical) that control the size and shape of simulated spatial features; in reservoir settings with directional depositional fabrics (fluvial channels, turbidite lobes, deltaic systems), the variogram is anisotropic with much longer horizontal range in the depositional transport direction than perpendicular to it, and the variogram azimuth must be aligned with the geological interpretation of depositional strike.
Simulation validation uses the Q-Q plot of simulated versus input data distributions, variogram reproduction checks (do the simulated realizations reproduce the input variogram model within sampling uncertainty?), and cross-validation at well locations removed from the conditioning data to assess whether the simulation honors data at new locations with appropriate uncertainty; poor variogram reproduction in realizations typically indicates that the search neighborhood (number of conditioning data and search radius) is too small to adequately constrain the simulation at each node, requiring enlargement of the neighborhood parameters; over-smoothing (realizations that look too homogeneous compared to expected geological variability) typically indicates over-conditioning to secondary data (correlation coefficient too high) or a variogram range that is too long, requiring recalibration of the collocated correlation coefficient or variogram model against the actual data scatter.
Gaussian Collocated Cosimulation Across International Jurisdictions
Canada (AER / WCSB): WCSB reservoir modeling for oil sands and deep conventional reservoirs routinely uses Gaussian collocated cosimulation to integrate 3D seismic with well data in McMurray, Montney, and Duvernay formation reservoir models; AER requires that resource estimates and development plans for in situ oil sands projects include probabilistic reservoir descriptions that reflect geological uncertainty, and cosimulation-based modeling satisfies this requirement by producing multiple realizations that quantify porosity and bitumen saturation uncertainty at the lease scale; companies including Suncor, CNRL, Cenovus, and Imperial Oil use Petrel or RMS-based cosimulation workflows as the standard for McMurray oil sands reservoir characterization, with the seismic acoustic impedance inversion providing the secondary variable for cosimulated bitumen saturation and net-pay thickness used in SAGD well spacing design.