Total Organic Carbon and Clay Estimation in Shale Reservoirs Using Automatic Machine Learning

dc.contributor.authorHu, Yueen
dc.contributor.committeechairRipepi, Nino S.en
dc.contributor.committeememberChen, Chengen
dc.contributor.committeememberKeles, Cigdemen
dc.contributor.departmentMining Engineeringen
dc.date.accessioned2021-09-22T08:00:08Zen
dc.date.available2021-09-22T08:00:08Zen
dc.date.issued2021-09-21en
dc.description.abstractHigh total organic carbon (TOC) and low clay content are two criteria to identify the "sweet spots" in shale gas plays. Recently, machine learning has been proved to be effective to estimate TOC and clay from well loggings. The remaining questions are what algorithm we should choose in the first place and whether we can improve the already built models. Automatic machine learning (AutoML) appears as a promising tool to solve those realistic questions by training multiple models and compares them automatically. Two wells with conventional well loggings and elemental capture spectroscopy are selected from a shale gas play to test the AutoML's ability in TOC and clay estimation. TOC and clay content are extracted from the Schlumberger's ELAN interpretation and calibrated to cores. Generalizability is proved in the blind test well and the mean absolute test errors for TOC and clay estimation are 0.23% and 3.77%. 829 data points are used to generate the final models with the train-test ratio of 75:25. The mean absolute test errors are 0.26% and 2.68% for TOC and clay, respectively, which are very low for TOC ranging from 0-6% and clay from 35-65%. The results show the AutoML's success and efficiency in the estimation. The trained models are interpreted to understand the variables effects in predictions. 235 wells are selected through data quality checking and feed into the models to create TOC and clay distribution maps. The maps provide guidance on where to drill a new well for higher shale gas production.en
dc.description.abstractgeneralLocating "sweet spots", where the shale gas production is much higher than the average areas, is critical for a shale reservoir's successful commercial exploitation. Among the properties of shale, total organic carbon (TOC) and clay content are often selected to evaluate the gas production potential. For TOC and clay estimation, multiple machine learning models have been tested in recent studies and are proved successful. The questions are what algorithm to choose for a specific task and whether the already built models can be improved. Automatic machine learning (AutoML) has the potential to solve the problems by automatically training multiple models and comparing them to achieve the best performance. In our study, AutoML is tested to estimate TOC and clay using data from two gas wells in a shale gas field. First, one well is treated as blind test well and the other is used as trained well to examine the generalizability. The mean absolute errors for TOC and clay content are 0.23% and 3.77%, indicating reliable generalization. Final models are built using 829 data points which are split into train-test sets with the ratio of 75:25. The mean absolute test errors are 0.26% and 2.68% for TOC and clay, respectively, which are very low for TOC ranging from 0-6% and clay from 35-65%. Moreover, AutoML requires very limited human efforts and liberate researchers or engineers from tedious parameter-tuning process that is the critical part of machine learning. Trained models are interpreted to understand the mechanism behind the models. Distribution maps of TOC and clay are created by selecting 235 gas wells that pass the data quality checking, feeding them into trained models, and interpolating. The maps provide guidance on where to drill a new well for higher shale gas production.en
dc.description.degreeMaster of Scienceen
dc.format.mediumETDen
dc.identifier.othervt_gsexam:32427en
dc.identifier.urihttp://hdl.handle.net/10919/105040en
dc.publisherVirginia Techen
dc.rightsIn Copyrighten
dc.rights.urihttp://rightsstatements.org/vocab/InC/1.0/en
dc.subjectShale gasen
dc.subjectTotal organic carbonen
dc.subjectClay contenten
dc.subjectSweet spotsen
dc.subjectAutomatic machine learningen
dc.subjectEnsemble learningen
dc.titleTotal Organic Carbon and Clay Estimation in Shale Reservoirs Using Automatic Machine Learningen
dc.typeThesisen
thesis.degree.disciplineMining Engineeringen
thesis.degree.grantorVirginia Polytechnic Institute and State Universityen
thesis.degree.levelmastersen
thesis.degree.nameMaster of Scienceen

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Hu_Y_T_2021.pdf
Size:
4.96 MB
Format:
Adobe Portable Document Format

Collections