• Products
    • Products Overview
    • Oil and Gas Data
    • Oil and Gas Software
  • ML in Oil & Gas Blog
  • Resources
  • News & Events
    • Company News
    • Conference Presentations
  • About
    • About Novi Labs
    • Careers
  • Contact
Novi Labs
  • Skip to primary navigation
  • Skip to main content
  • Skip to footer
get started on your future of well planning with Novi

Novi Labs

  • Products
    • Products Overview
    • Oil and Gas Data
    • Oil and Gas Software
  • ML in Oil & Gas Blog
  • Resources
  • News & Events
    • Company News
    • Conference Presentations
  • About
    • About Novi Labs
    • Careers
  • Contact

Is it worth It? Quantifying the value of collecting and interpreting subsurface data. [URTeC 2022 Novi Paper Summary]

April 12, 2022

About the author

Jason Reed

Jason is a Managing Director at Novi Labs. Prior to Novi he worked for various operators and more recently transitioned to supporting digital solutions in the oil and gas space.

Rejoice geologists, geophysicists and petrophysicists!  We provide a novel approach to answer the question: Is it worth it for the industry to spend hundreds of millions of dollars each year to collect and interpret subsurface data?  In short, you bet!  

This blog post is a summary of our URTeC 2022 paper: “How Much Better Could We Have Done?  Using a Time Machine Method to Quantify the Impact of Incremental Geologic Data on Machine Learning Forecast Accuracy Control”

In this post:

  1. The method: using a “time machine” to quantify value of information
    1. The experiment: “base” subsurface vs. “rich” subsurface
      1. The results: how accurate were each of the models?
        1. The value: implications for NPV and leasing decisions
          1. Conclusions

            The method: using a “time machine” to quantify value of information

            Subsurface practitioners have always known how important our work is to guide upstream decisions; however, it has always been difficult to quantify the value of our contributions. Utilizing Novi’s self-service machine learning platform and its time machine method (yes, it’s as cool as its sounds) enables the ability to demonstrate the ROI of data collection an interpretation investments as a function of the actual NPV of wells and pads.

            Illustration of train/test split for time machine exercise.

            The Novi time machine simply simulates how a model would have worked in the past, by withholding all data from after a certain date and training the model on data from before that date.  This gives our users powerful counterfactual abilities to ask questions like “how would the advanced subsurface interpretation have impacted our forecasts back in 2019?”

            The figure above shows how you can simply go into the software product and select a date from a drop down calendar.  This informs the model that you want to use wells prior to that date as wells to train the model.  Any well turned online past that date will not be part of model training and the model will be blind to these holdout wells.  Using Novi Model Engine software, you can create forecasts for the holdout wells and compare the forecasts to the actual production that came from wells turned on line after the cut-off model date. 

            The experiment: “base” subsurface vs. “rich” subsurface

            Our evaluation focused on Howard County, a highly productive region of the Midland Basin that emerged as a hotspot much later than the rest of the Wolfberry play, making it an ideal candidate for counterfactual analysis. Would the addition of advanced subsurface data have enabled the model to identify this highly productive area?

            Only subsurface features were varied for this analysis.  The table below describes the differences between the features in “base” vs. “rich” models.  Engineering features including spacing, completions and production remained constant for both models that are used for the comparison.  

            These lists summarize the different feature inputs for the “base” vs the “enriched” models.

            The results: how accurate were each of the models?

            The 694 Howard County wells held out from the models by time machine produced a total of ~85 million barrels over their first year. The base model underpredicted this total by over 7%, while the enriched model nailed the production, getting within 1% of 85 million barrels.

            The rich subsurface model was more accurate across all reservoirs in Howard County.

            It is important to note that even though the model included all available subsurface features for the enriched version, model accuracy could be further improved through iterating the inclusion and exclusion of specific subsurface features.  The Novi Model Engine platform provides two means for feature selection.  The first option allows the user to select which features will be utilized in the model.  The second method is for the machine learning pipelines to auto-select the features that provide the most signal to the model outputs (i.e., production in this case).

            The value: implications for NPV and leasing decisions

            The enriched subsurface provides a more accurate forecast of production, but how does that translate to the assessment of acreage value?  In this example, five pads were randomly selected from the study to compare forecasted NPVs for each model.  The rich model predicted approximately $32MM additional NPV when compared to the base model.  Therefore, if decisions were being made based on the base model, then perhaps those lower NPV estimates would have influenced the decision to not drill and complete (or lease!) because economic hurdles were not met.  If the rich model drove the decision making process, then the drilling, completions and production would have proceeded with greater confidence because of the robust project NPV.  

            This figure summarizes the differences in NPV for each pad that was included in the pad comparison part of the study.

            This approach can influence a subsurface team’s priorities and focus additional subsurface data collection spend (e.g, which logs/core).   For example, if we find that geomechanical properties are key drivers to improve model accuracy and are important to explainability, then the team may want to collect and process additional sonic logs and/or invert seismic volumes for geomechanics.  Alternatively, if certain reservoir quality parameters (e.g., Vclay, Sw, porosity, etc.)  do not improve the model or contribute to explainability, then perhaps petrophysicists could divert time away from refining those reservoir quality models and focus in other areas.

            The time machine workflow is valuable for much more than estimating value of subsurface information, such as:

            • Building confidence for decision makers in use of machine learning models.
            • Providing context for well reviews.  How good were my AFE curves relative to my machine learning prediction and actuals?
            • Addressing questions such as: How does my model accuracy change when I introduce new spacing and/or engineering features?

            Conclusions

            In this analysis we:

            • Demonstrated that high quality subsurface features can increase predictive model accuracy.
            • Quantified how increased model accuracy supports improved financial decisions.  
            • Explained how an improved understanding of the value of subsurface interpretation  can help prioritize team workloads and data collection spend.

            Paper Details ::

            How Much Better Could We Have Done?  Using a Time Machine Method to Quantify the Impact of Incremental Geologic Data on Machine Learning Forecast Accuracy Control. ID: 3723907

            This paper was prepared for presentation at the Unconventional Resources Technology Conference held in Houston, Texas, USA, 20-22 June 2022.

            —-

            [Free Guide] Applying machine learning in the oil & gas industry can solve complex problems quickly and efficiently. But building accurate models can be challenging and incredibly time-consuming.

            If you don’t want to waste your time, resources, and motivation while building machine learning models, here’s how to avoid the most common mistakes.

            Filed Under: Machine Learning in Oil and Gas Blog

            Previous postOvercoming barriers to data-driven workflows: introducing Novi Model Engine
            If you would like more information, please reach out using the form below.
            • This field is for validation purposes and should be left unchanged.

            Footer

            Connect

            • LinkedIn
            • YouTube

            oil and gas analysis by novi

            Contact

            1905 Aldrich Street, Suite 220
            Austin, Texas 78723

            intro@novilabs.com
            512.368.9042

            • Home
            • Products
            • Resources
            • About
            • ML in O&G Blog
            • Privacy Policy

            Copyright © 2022 Novi Labs
            All rights reserved.