Unsupervised seismic facies classification using deep convolutional autoencoder
Vladimir Puzyrev * and Chris Elders School of Earth and Planetary Sciences and Curtin University Oil and Gas Innovation Centre, Curtin University, Perth, WA 6102, Australia.
Abstract
With the increased size and complexity of seismic surveys, manual labeling of seismic facies has become a significant challenge. Application of automatic methods for seismic facies interpretation could significantly reduce the manual labor and subjectivity of a particular interpreter present in conventional methods. A recently emerged group of methods is based on deep neural networks. These approaches are data-driven and require large labeled datasets for network training. We apply a deep convolutional autoencoder for unsupervised seismic facies classification, which does not require manually labeled examples. The facies maps are generated by clustering the deep-feature vectors obtained from the input data. Our method yields accurate results on real data and provides them instantaneously. The proposed approach opens up possibilities to analyze geological patterns in real time without human intervention.
Keywords:
Interpretation, Seismic facies,
Deep learning, Convolutional neural network, Autoencoder
1. Introduction
Seismic facies analysis plays a key role in the derivation of reservoir properties from seismic attributes (Brown, 2011). Accurate and fast interpretation of facies provides a reference for further analysis of geological conditions. With the widespread use of 3D seismic technology and the dramatic growth in the size and complexity of seismic data, manual facies analysis and interpretation of geological patterns has become an extremely time-consuming task. The interpretation results are inevitably affected by the subjectivity of the interpreter. To address these issues, automatic seismic interpretation tools have drawn a lot of attention in recent years with the latest advances in computing and data analysis. This trend was boosted by the recent meteoric rise of deep learning (DL) models in many fields of science and technology. These methods allow for the detection and exploitation of nonlinear dependencies in the data without specifying a particular model in advance or using hand-engineered features. A particular class of DL models, convolutional neural networks (CNN), have shown tremendous success in image processing, pattern recognition, and object detection systems. In the past few years, deep CNNs have been actively applied to various geophysical problems including detection of faults (Araya-Polo et al., 2017; Huang et al., 2017; Wu et al., 2019), seismic facies classification (Dramsch and Lüthje, 2018; Zhao, 2018; Duan et al., 2019), first-break picking (Yuan et al., 2018), seismic horizon picking (Shi et al., 2020), and many others. Seismic facies classification algorithms can be divided into two major categories, namely, supervised and unsupervised learning techniques. Supervised classification methods (Zhao et al., 2015; Qi et al., 2016; Liu et al., 2018) involve data labeling, i.e. manual facies interpretation, which is inevitably affected by differences in the interpreters' knowledge base and experience level.
* Corresponding author, [email protected]
Unsupervised facies classification algorithms are a data-driven way to determine the clusters present in the data, unbiased by the interpreter. The most common traditional unsupervised seismic facies classification methods include principal component analysis (PCA) (Dumay and Fournier, 1988), K-means clustering (Coléou et al., 2003; Galvis et al., 2017), and the self-organizing map (SOM) (de Matos et al., 2007; Saraswat and Sen, 2012). Deep neural networks can also be efficiently applied to clustering problems due to their inherent property of highly nonlinear transformation, which allows data with a highly complex structure to be transformed into more clustering-friendly representations (Aljalbout et al., 2018; Min et al., 2018; Chalapathy and Chawla, 2019). In recent years, several unsupervised algorithms based on DL have been developed, in particular based on the convolutional autoencoder (CAE) neural network architecture that is widely used for image segmentation, denoising, and other image processing problems. In a geophysical context, CAE has been used for interpolation of missing seismic traces (Wang et al., 2020), attenuation of marine seismic interference noise (Sun et al., 2020), and segmentation of rock images (Karimpouli and Tahmasebi, 2019). So far, however, this architecture has seen only limited use in facies classification applications. Qian et al. (2018) used a deep CAE scheme to extract features from 2D prestack gathers. Somewhat related is a group of "semi-supervised" methods that use both labeled and unlabeled data in training; a recent example of such an approach using generative adversarial networks can be found in Liu et al. (2020). In this study, we explore a deep CAE for unsupervised seismic facies classification. This approach is entirely data-driven, does not require manual data labeling, and provides the result instantaneously. The remainder of the paper is organized as follows.
First, we formulate the problem and describe the methodology in Section 2. In Section 3, we describe the architecture of the deep neural networks employed, their training process, as well as the details of the clustering algorithm. The performance of the method is investigated in Section 4 using marine seismic data from the Northern Carnarvon Basin, North West Shelf, Australia. Finally, the last section summarizes the outcomes and points out future research directions.
2. Methodology
Seismic data volumes are huge and consist of highly redundant data, which motivated previous applications of data reduction algorithms to preserve only the important features of the seismic character (Coléou et al., 2003). This redundancy makes DL methods a natural choice for handling large-scale seismic dataset processing, as these methods benefit from massive amounts of data. By exploiting different layers of abstraction, deep neural networks pick up both the low-level and high-level features in data. Seismic data interpretation can be formulated as an image segmentation problem. Considering a 2D seismic section as a one-channel image, we split it into many "tiles," thus turning the seismic facies recognition problem into an image classification problem. Deep convolutional networks are the state-of-the-art method for such problems. These networks can be viewed as a natural extension of neural networks for processing images and data with a grid-like topology. They achieve high performance in learning filters that represent repeating patterns and in extracting the most important features from segments of data regardless of the specific location of these features within the data. Convolutional layers use shared local filters, which is beneficial for processing data with a strong local structure (such as seismic data). A deep CAE is an unsupervised learning method that is able to learn hierarchical feature representations automatically from the input data. The workflow of our CAE-based facies interpretation consists of three stages:
1. CAE training. First, we split the vertical poststack seismic sections chosen as the training data into individual tiles, normalize this dataset, and give it as both input and output to the autoencoder. During training, the network learns the mapping from the data space to a lower-dimensional latent feature space.
2. Clustering.
Once the CAE is trained, we employ PCA to extract the dominant components of the feature vector, which, in turn, are given to the K-means clustering algorithm to generate the facies map on the training data. Another possible way is to apply clustering directly in the latent space; however, this approach may lead to poor performance (Mukherjee et al., 2019).
3. Pattern recognition. Given new seismic data, we generate its deep features using the encoder, extract the dominant components, and find the best-matching cluster to generate the facies map.
In order to increase the resolution of the resulting facies map, we extract tiles in an overlapping manner along the entire vertical section. Finally, we postprocess the obtained result to suppress the noise and remove spatially tiny structures. The method delivers the results in real time (the third stage takes several seconds using a single GPU). Details of the implementation using the open-source libraries TensorFlow (Abadi et al., 2016) and scikit-learn (Pedregosa et al., 2011) are described next.
3. Implementation details
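As a concrete illustration of the first-stage data preparation (splitting the vertical sections into tiles and normalizing over the training dataset), one possible NumPy sketch is given below. The function names and the channel-axis convention are ours, not the authors'; the 48-sample by 96-trace tile size follows the dimensions used later in this paper.

```python
import numpy as np

def normalize(data, stats=None):
    # Normalize amplitudes using statistics computed over the whole
    # training dataset; the same stats are reused for new (test) sections.
    if stats is None:
        stats = (data.mean(), data.std())
    mean, std = stats
    return (data - mean) / std, stats

def extract_tiles(section, tile_h=48, tile_w=96):
    # Split a 2D section (time samples x traces) into non-overlapping tiles.
    h, w = section.shape
    tiles = [section[i:i + tile_h, j:j + tile_w]
             for i in range(0, h - tile_h + 1, tile_h)
             for j in range(0, w - tile_w + 1, tile_w)]
    return np.stack(tiles)[..., np.newaxis]  # add a channel axis for the CAE
```

The stacked tile array can then be fed directly to the autoencoder as both its input and its target.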
An autoencoder is an unsupervised learning method based on training the neural network to approximate the data by itself via a bottleneck structure (Masci et al., 2011). Unlike traditional supervised DL methods, it does not require large amounts of labeled training examples and can automatically learn discriminative features in data. Various autoencoders have been applied to clustering and anomaly detection tasks in recent years (Guo et al., 2017; Ghasedi et al., 2017; Min et al., 2018; Chalapathy and Chawla, 2019). Convolutional autoencoders (CAE) are a special type of autoencoder that uses convolutional layers to extract high-level features from data while preserving local relations using the convolution kernel in each layer. This makes CAE a natural choice of autoencoder for processing images and data with local spatial connectivity, such as seismic data. An autoencoder consists of two major parts: the encoder E, which compresses the input x to lower-dimensional features h, and the decoder D, which takes the latent features as input and reconstructs the original data as closely as possible:
h = E(x), (1)
x̂ = D(h). (2)
Choosing the architecture of both the encoder and decoder as a deep CNN allows the autoencoder to learn hierarchical feature representations by exploiting deep features in the input data. This process of encoding can be viewed as a projection of the higher-dimensional input data onto a lower-dimensional space. The encoder, in other words, is a nonlinear function that allows automatic extraction of feature representations. The number of hidden features should be sufficient to describe enough of the variability of the data and thus be able to reveal details of the underlying geologic features. In the decoder, we use upsampling layers in a symmetric manner to restore the original data. The bottleneck structure of the CAE allows it to capture the most important features associated with the input seismic images in the hidden feature layer h.
This layer has much smaller dimensionality than the original data, which leads to the creation of a compressed set of information in h from which the original data x is restored through linear and nonlinear relationships. In other words, training of the CAE creates in h a more cost-effective representation of x. Figure 1 shows the architecture of the CAE used in the following examples. It is composed of 44 layers: 22 convolutional, batch normalization, and pooling layers in the encoder to build a deep representation of patterns in data, a feature layer, and 21 convolutional, batch normalization, and upscaling layers in the decoder. Each of the layers in the encoder and decoder networks contains between 32 and 256 filters for detecting various features in the input data. The number of filters used in convolutions increases by a factor of 2 after each pooling layer to enable the network to better learn features at higher abstractions, make the representation approximately invariant to small translations of the input, and reduce the overall number of free parameters. Due to this hierarchical structure, in which the features extracted at the previous level become the input at the next level, the network is able to analyze the data at different scales. Batch normalization (Ioffe and Szegedy, 2015) is used after the convolutional layers to improve training performance and for regularization purposes. Rectified linear units (ReLU) offer a nonsaturating nonlinearity that often works well in training and has low computational complexity; based on a series of experiments (Puzyrev, 2019), we choose leaky ReLU as the activation functions of the CAE. Two different sizes of the convolution kernel, namely 3 × 3 and 5 × 5, were tested in the model as described below. Like any other neural network, the CAE needs to be trained.
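The encoder-decoder stack described above (convolution, batch normalization, and leaky ReLU blocks with pooling in the encoder and upsampling in the decoder, filter counts doubling after each pooling layer) can be sketched in Keras. This condensed version is much shallower than the 44-layer network of Figure 1 and is only meant to illustrate the structure:

```python
# Condensed Keras sketch of the CAE; depth and filter counts are illustrative.
from tensorflow.keras import layers, models

def build_cae(input_shape=(48, 96, 1), base_filters=32):
    inp = layers.Input(shape=input_shape)
    x = inp
    # Encoder: three conv/pool stages with 32 -> 64 -> 128 filters
    for i in range(3):
        x = layers.Conv2D(base_filters * 2**i, 3, padding="same")(x)
        x = layers.BatchNormalization()(x)
        x = layers.LeakyReLU()(x)
        x = layers.MaxPooling2D(2)(x)
    encoded = x  # latent feature layer h
    # Decoder: mirror of the encoder with upsampling instead of pooling
    for i in reversed(range(3)):
        x = layers.Conv2D(base_filters * 2**i, 3, padding="same")(x)
        x = layers.BatchNormalization()(x)
        x = layers.LeakyReLU()(x)
        x = layers.UpSampling2D(2)(x)
    out = layers.Conv2D(1, 3, padding="same", activation="linear")(x)
    autoencoder = models.Model(inp, out)
    encoder = models.Model(inp, encoded)  # used later to extract features
    return autoencoder, encoder
```

Doubling `base_filters` per stage mirrors the 32-256 filter progression described above; the 5 × 5 kernel variant is obtained by changing the kernel size from 3.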
The training, however, does not require labeled examples and is performed using the input data as the output data, namely
x̂ = D(E(x)) ≈ x. (3)
This allows the expensive procedures of manual data preparation to be avoided. The weights and biases of the encoder and decoder networks are iteratively updated during training by minimizing the reconstruction error. While the training process is time-consuming for large datasets and deep networks, once it is finished, the network can be used at a very low computational cost.
3.1.1 Loss functions
The role of the loss function is to ensure that the learned representation preserves important features of the initial data. A rather common choice for regression problems is the mean squared error (MSE). This loss is also commonly used in denoising autoencoders as the distance measure between the original and decoded data. However, a custom loss function instead of the standard MSE can improve the performance of an autoencoder (Creswell et al., 2017). Cross-entropy losses are preferred over MSE for classification problems. Here, we use the following loss function, which is a weighted sum of the MSE over the normalized training dataset and the binary cross-entropy (BCE) of the binarized data:
L(x, x̂) = L_MSE(x, x̂) + λ L_BCE(x_b, x̂_b). (4)
The MSE and BCE losses are defined as follows:
L_MSE(x, x̂) = (1/N) Σ_i (x_i − x̂_i)², (5)
L_BCE(x, x̂) = −(1/N) Σ_i [x_i log x̂_i + (1 − x_i) log(1 − x̂_i)], (6)
where the subscript b in (4) denotes the binarized data and the sums run over the N samples. The role of the BCE loss is to penalize the structural difference between the original and reconstructed data. The Nesterov-accelerated adaptive moment estimation (Nadam) algorithm (Dozat, 2016) is employed for training. Table 1 reports the parameters of the networks and the minimum error (4) achieved on the training dataset. Both networks achieve similar accuracy on the training data, although the second one takes significantly longer to train due to the higher number of trainable parameters and the larger number of epochs required to reach the error plateau. At the same time, the facies maps generated with Model 2 are slightly smoother and contain less "noise," and thus we employ this network in the following numerical examples. Once the training is complete, the input data are split into several groups (clusters) as described in Section 3.3.
We train the network on high-quality public-domain 3D seismic data acquired in the Northern Carnarvon Basin, Australia's premier hydrocarbon province. This area has been extensively studied, not only in terms of hydrocarbon prospectivity, but also in terms of the structural, stratigraphic, sedimentological and geodynamic evolution of the continental margin. The Northern Carnarvon Basin developed during several successive phases of extension from the Late Carboniferous until the Early Cretaceous, followed by passive margin thermal subsidence to the present day. This has resulted in the accumulation of sedimentary sequences up to 15 km in thickness, which were deposited in a variety of different sedimentary environments and hence are represented by a number of different sedimentary and seismic facies (I'Anson et al., 2019; McHarg et al., 2019). The hc2000a 3D survey was acquired in 2000 and consists of 2324 in-lines and 4470 cross-lines with a 25 m in-line spacing and a 12.5 m cross-line spacing, covering the northern part of the Exmouth sub-basin and the adjacent Exmouth Plateau.
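A short NumPy reference implementation can clarify how the MSE and BCE terms of the combined loss interact. The weight `lam` and the binarization threshold below are our assumptions, since the exact values used for training are not stated here:

```python
import numpy as np

def combined_loss(x, x_hat, lam=1.0, threshold=0.5, eps=1e-7):
    # Weighted sum of MSE and BCE on binarized data; lam and threshold
    # are illustrative assumptions, not the paper's values.
    x = np.asarray(x, dtype=float)
    x_hat = np.asarray(x_hat, dtype=float)
    mse = np.mean((x - x_hat) ** 2)          # MSE term
    x_b = (x > threshold).astype(float)      # binarized target
    p = np.clip(x_hat, eps, 1.0 - eps)       # squeeze predictions into (0, 1)
    bce = -np.mean(x_b * np.log(p) + (1.0 - x_b) * np.log(1.0 - p))
    return mse + lam * bce                   # weighted sum
```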
The record length is 9000 ms with a 4 ms sample rate. The data are time migrated. We extract 60 2D vertical sections from the available 3D volume, which contain a range of seismic facies with different degrees of reflector continuity, frequency, and amplitude response. Data preprocessing is a crucial step in DL applications, which may seriously affect the final result. In this particular case, we normalize the data over the entire training dataset to avoid numerical problems in network training, since the range of amplitudes is large and such variations between samples may cause errors in gradient updates. Each vertical seismic section is split into non-overlapping tiles of 96 traces and 48 samples. This size was determined by a series of experiments: smaller sizes make classification of individual tiles more challenging, while larger tiles increase the chances of having several facies in one tile. 54 regularly spaced in-lines and cross-lines are used as the training data, which constitutes a sufficiently large and representative training set for a deep CAE to achieve high accuracy and generalization, i.e., to perform meaningful classification of facies both for the data used in training (training set) and for new, previously unseen data (test set). Figure 2 shows the quality of the CAE reconstruction for ten sample tiles representing various geological facies: the main reflector geometries are reconstructed with a high degree of accuracy, while the small amount of noise present in the original data is absent in the CAE output.
3.3 Clustering
Once the network is trained, we need to cluster the deep-feature vectors obtained from the input images for the subsequent construction of a facies map. In order to reduce the dimensionality of the feature vector h, we employ PCA to extract the dominant components. In Figure 3, we illustrate the contribution of each PCA component to the total explained variance for Models 1 and 2. The first 20 components in these two cases explain, respectively, more than 49% and 47% of the total variance. The obtained PCA components are used for clustering of the training samples. Clustering can be done by various unsupervised methods, including K-means clustering, fuzzy c-means, or self-organizing maps. Without loss of generality, to generate the facies maps in this study, we apply a K-means clustering algorithm (Hartigan and Wong, 1979) based on the distance between the principal components of the latent features extracted from the training data. Each of the 96 × 48 tiles is classified into one of several classes based on the similarity of their principal components. In order to increase the resolution of the resulting facies classification, we employ a sliding window over the entire vertical seismic section in an overlapping manner. Figure 4 illustrates this concept for the case when the fine grid is four times finer than the original grid in each dimension. For each sliding-window position, we run the algorithm to determine its facies class and add this class to the 16 fine cells contained inside the window using the weight matrix shown in Figure 4b. The facies class for each fine cell is determined as the one having the highest accumulated weight.
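The clustering and coarse-to-fine refinement steps just described can be sketched with scikit-learn and NumPy. The center-weighted 4 × 4 voting matrix below is our assumption, standing in for the weight matrix of Figure 4b:

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.cluster import KMeans

def cluster_features(features, n_components=20, n_clusters=5, seed=0):
    # Reduce the flattened deep-feature vectors with PCA, then group
    # them with K-means (stage 2 of the workflow).
    pca = PCA(n_components=n_components).fit(features)
    kmeans = KMeans(n_clusters=n_clusters, random_state=seed)
    labels = kmeans.fit_predict(pca.transform(features))
    return labels, pca, kmeans

def refine_facies(window_classes, n_classes, weights=None):
    # Coarse-to-fine voting as in Figure 4: each sliding-window class
    # votes, with a weight, for the 4 x 4 block of fine cells it covers;
    # each fine cell takes the class with the largest accumulated weight.
    if weights is None:  # assumed center-weighted matrix
        weights = np.array([[1., 2., 2., 1.],
                            [2., 4., 4., 2.],
                            [2., 4., 4., 2.],
                            [1., 2., 2., 1.]])
    nr, nc = window_classes.shape
    votes = np.zeros((n_classes, nr + 3, nc + 3))
    for r in range(nr):
        for c in range(nc):
            votes[window_classes[r, c], r:r + 4, c:c + 4] += weights
    return votes.argmax(axis=0)  # fine-grid facies map
```

For new data (stage 3), the fitted `pca` and `kmeans` objects are reused via `kmeans.predict(pca.transform(...))` rather than refit.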
4. Numerical results
In this section, we show the facies classifications generated for the Northern Carnarvon Basin seismic data. Figure 5a shows the CAE interpretation of an in-line section from the training dataset. Seismic facies appearing in this dataset are automatically divided into five classes. Tiles with no reflections (water) are identified precisely. The relatively high-amplitude and laterally continuous passive margin sequences (Muderong Shale and younger; red color) are distinguished from the continuous but lower-amplitude reflectors of the Barrow Group (yellow color). Incisions within the passive margin sequence are also recognized. The Barrow Group sediments are in turn distinguished from diffuse reflections in the underlying Upper Jurassic sequence (green color). However, the algorithm has failed to distinguish between the faulted and rotated Triassic sequences forming the basin flank and the more seismically transparent facies forming the basin fill. Some high-amplitude events, which may represent igneous intrusions, are also picked out. In order to obtain a cleaner result, we postprocess the algorithm output using a combination of morphological closing and median filtering techniques, which removes spatially tiny structures (Figure 5b). For comparison, we also show the results of manual interpretation of the same in-line section in Figure 5c. The distinction between the different formations is also clearly apparent in the cross-lines, as shown in Figure 6 for a cross-line section from the test dataset. The angular unconformity between the Barrow Group sediments (yellow) and the overlying passive margin sequence (red) is clear, although there is some "leakage" of the facies below the unconformity. Once again, there is little discrimination between faulted units and the rift fill, but high-amplitude igneous intrusions are clearly recognizable (purple).
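The postprocessing mentioned above (morphological closing followed by median filtering) can be implemented with SciPy's ndimage module; the filter sizes below are illustrative rather than the values used in the paper:

```python
import numpy as np
from scipy import ndimage

def postprocess(facies_map, closing_size=3, median_size=5):
    # Grey-scale morphological closing fills small gaps in each class,
    # then a median filter removes spatially tiny isolated structures.
    closed = ndimage.grey_closing(facies_map,
                                  size=(closing_size, closing_size))
    return ndimage.median_filter(closed, size=median_size)
```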
5. Discussion
Deep learning methods have attracted considerable interest from the geophysical community and made artificial intelligence one of the main focuses of attention in both academia and industry. Automatic methods for seismic data analysis are commonly applied nowadays for recognizing geologically meaningful patterns and identifying various features such as faults, sequence boundaries, and unconformities. Application of deep learning methods in seismic facies interpretation could significantly reduce the manual labor and the subjectivity of a particular interpreter present in conventional methods. Unsupervised learning methods are of particular interest, and their role in seismic interpretation is expected to grow rapidly in the near future. By employing the proposed CAE architecture, deep features are learned from poststack seismic images in an unsupervised way, thus eliminating the need for labeling training data and avoiding the interpreter's subjectivity in the definition and delineation of seismic facies. While the lack of need for labeled training data is a benefit of unsupervised methods, it also limits our ability to control the final result. Although the results compare favorably with the manual interpretation, there are still some obvious differences. The tiles are considered individually, while manual facies interpretation would rely heavily on their positions relative to each other. Future research may explore adding additional attributes, such as frequency variation with depth, to the training data. Another direction for future research is the extension of the method to 3D.
6. Conclusions
We applied a deep convolutional autoencoder to automatically extract the dominant features in seismic data and classify these data into various seismic facies. The method successfully extracted the dominant features of the seismic data in the training and test sets and classified them into five seismic facies. The workflow consists of three stages: training of the CAE, clustering of the training data, and recognition of seismic patterns in new data. Facies maps are generated by applying the K-means clustering algorithm to the principal components of the compressed features. Normalization of the input data and fine-tuning of the network architecture have an important effect on the clustering results. The first stage of the workflow is computationally expensive, since the network is trained on large datasets in order to learn the underlying dependencies in the data and be able to generalize to new inputs. However, the first and second stages are only performed once. Processing of new data is performed instantaneously and, as demonstrated by the examples in this study, allows for sufficiently accurate estimation of the basic seismic facies. We also note that the trained network can be adapted to other datasets by employing the concept of transfer learning and fine-tuning the network on the new data to improve training efficiency.
Acknowledgements
The authors acknowledge support from the Curtin University Oil and Gas Innovation Centre (CUOGIC) and the Institute for Geoscience Research (TIGeR). This work was supported by resources provided by the Pawsey Supercomputing Centre with funding from the Australian Government and the Government of Western Australia.
References
1. Abadi, M., Barham, P., Chen, J., Chen, Z., Davis, A., Dean, J., Devin, M., Ghemawat, S., Irving, G., Isard, M., and Kudlur, M., 2016, Tensorflow: A system for large-scale machine learning: 12th USENIX Symposium on Operating Systems Design and Implementation, 265-283.
2. Aljalbout, E., Golkov, V., Siddiqui, Y., Strobel, M., and Cremers, D., 2018, Clustering with deep learning: Taxonomy and new methods: arXiv preprint arXiv:1801.07648.
3. Araya-Polo, M., Dahlke, T., Frogner, C., Zhang, C., Poggio, T., and Hohl, D., 2017, Automated fault detection without seismic processing: The Leading Edge, 208-214.
4. Brown, A. R., 2011, Interpretation of three-dimensional seismic data: Society of Exploration Geophysicists and American Association of Petroleum Geologists.
5. Chalapathy, R., and Chawla, S., 2019, Deep learning for anomaly detection: A survey: arXiv preprint arXiv:1901.03407.
6. Coléou, T., Poupon, M., and Azbel, K., 2003, Unsupervised seismic facies classification: A review and comparison of techniques and implementation: The Leading Edge, no. 10, 942-953.
7. Creswell, A., Arulkumaran, K., and Bharath, A. A., 2017, On denoising autoencoders trained to minimise binary cross-entropy: arXiv preprint arXiv:1708.08487.
8. Dozat, T., 2016, Incorporating Nesterov momentum into Adam: ICLR 2016.
9. Dramsch, J. S., and Lüthje, M., 2018, Deep-learning seismic facies on state-of-the-art CNN architectures: SEG Technical Program Expanded Abstracts 2018, 2036-2040.
10. Duan, Y., Zheng, X., Hu, L., and Sun, L., 2019, Seismic facies analysis based on deep convolutional embedded clustering: Geophysics, no. 6, IM87-IM97.
11. Dumay, J., and Fournier, F., 1988, Multivariate statistical analyses applied to seismic facies recognition: Geophysics, no. 9, 1151-1159.
12. Galvis, I. S., Villa, Y., Duarte, C., Sierra, D., and Agudelo, W., 2017, Seismic attribute selection and clustering to detect and classify surface waves in multicomponent seismic data by using k-means algorithm: The Leading Edge, no. 3, 239-248.
13. Ghasedi Dizaji, K., Herandi, A., Deng, C., Cai, W., and Huang, H., 2017, Deep clustering via joint convolutional autoencoder embedding and relative entropy minimization: Proceedings of the IEEE International Conference on Computer Vision, 5736-5745.
14. Guo, X., Liu, X., Zhu, E., and Yin, J., 2017, Deep clustering with convolutional autoencoders: International Conference on Neural Information Processing, Springer, 373-382.
15. Hartigan, J. A., and Wong, M. A., 1979, Algorithm AS 136: A k-means clustering algorithm: Journal of the Royal Statistical Society. Series C (Applied Statistics), no. 1, 100-108.
16. Huang, L., Dong, X., and Clee, T. E., 2017, A scalable deep learning platform for identifying geologic features from seismic attributes: The Leading Edge, no. 3, 249-256.
17. I'Anson, A., Elders, C., and McHarg, S., 2019, Marginal fault systems of the Northern Carnarvon Basin: Evidence for multiple Palaeozoic extension events, North-West Shelf, Australia: Marine and Petroleum Geology, 211-229.
18. Ioffe, S., and Szegedy, C., 2015, Batch normalization: accelerating deep network training by reducing internal covariate shift: ICML'15 Proceedings of the 32nd International Conference on Machine Learning, 37, 448-456.
19. Karimpouli, S., and Tahmasebi, P., 2019, Segmentation of digital rock images using deep convolutional autoencoder networks: Computers and Geosciences, 142-150.
20. Liu, J., Dai, X., Gan, L., Liu, L., and Lu, W., 2018, Supervised seismic facies analysis based on image segmentation: Geophysics, no. 2, O25-O30.
21. Liu, M., Jervis, M., Li, W., and Nivlet, P., 2020, Seismic facies classification using supervised convolutional neural networks and semisupervised generative adversarial networks: Geophysics, no. 4, O47-O58.
22. Masci, J., Meier, U., Cireşan, D., and Schmidhuber, J., 2011, Stacked convolutional auto-encoders for hierarchical feature extraction: International Conference on Artificial Neural Networks, 52-59.
23. de Matos, M. C., Osorio, P. L., and Johann, P. R., 2007, Unsupervised seismic facies analysis using wavelet transform and self-organizing maps: Geophysics, no. 1, P9-P21.
24. McHarg, S., Elders, C., and Cunneen, J., 2019, Origin of basin-scale syn-extensional synclines on the southern margin of the Northern Carnarvon Basin, Western Australia: Journal of the Geological Society, no. 1, 115-128.
25. Min, E., Guo, X., Liu, Q., Zhang, G., Cui, J., and Long, J., 2018, A survey of clustering with deep learning: From the perspective of network architecture: IEEE Access, 39501-39514.
26. Mukherjee, S., Asnani, H., Lin, E., and Kannan, S., 2019, ClusterGAN: Latent space clustering in generative adversarial networks: Proceedings of the AAAI Conference on Artificial Intelligence, 33, 4610-4617.
27. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., et al., 2011, Scikit-learn: Machine learning in Python: Journal of Machine Learning Research, 2825-2830.
28. Puzyrev, V., 2019, Deep learning electromagnetic inversion with convolutional neural networks: Geophysical Journal International, no. 2, 817-832.
29. Qi, J., Lin, T., Zhao, T., Li, F., and Marfurt, K., 2016, Semisupervised multiattribute seismic facies analysis: Interpretation, no. 1, SB91-SB106.
30. Qian, F., Yin, M., Liu, X. Y., Wang, Y. J., Lu, C., and Hu, G. M., 2018, Unsupervised seismic facies analysis via deep convolutional autoencoders: Geophysics, no. 3, A39-A43.
31. Saraswat, P., and Sen, M. K., 2012, Artificial immune-based self-organizing maps for seismic-facies analysis: Geophysics, no. 4, O45-O53.
32. Shi, Y., Wu, X., and Fomel, S., 2020, Waveform embedding: Automatic horizon picking with unsupervised deep learning: Geophysics, no. 4, WA67-WA76.
33. Simonyan, K., and Zisserman, A., 2014, Very deep convolutional networks for large-scale image recognition: arXiv preprint arXiv:1409.1556.
34. Sun, J., Slang, S., Elboth, T., Greiner, T. L., McDonald, S., and Gelius, L. J., 2020, Attenuation of marine seismic interference noise employing a customized U-Net: Geophysical Prospecting, no. 3, 845-871.
35. Wang, Y., Wang, B., Tu, N., and Geng, J., 2020, Seismic trace interpolation for irregularly spatial sampled data using convolutional autoencoder: Geophysics, no. 2, V119-V130.
36. Wu, X., Liang, L., Shi, Y., and Fomel, S., 2019, FaultSeg3D: Using synthetic data sets to train an end-to-end convolutional neural network for 3D seismic fault segmentation: Geophysics, no. 3, IM35-IM45.
37. Ye, J. C., Han, Y., and Cha, E., 2018, Deep convolutional framelets: A general deep learning framework for inverse problems: SIAM Journal on Imaging Sciences, no. 2, 991-1048.
38. Yuan, S., Liu, J., Wang, S., Wang, T., and Shi, P., 2018, Seismic waveform classification and first-break picking using convolution neural networks: IEEE Geoscience and Remote Sensing Letters, no. 2, 272-276.
39. Zhao, T., Jayaram, V., Roy, A., and Marfurt, K. J., 2015, A comparison of classification techniques for seismic facies recognition: Interpretation, no. 4, SAE29-SAE58.
40. Zhao, T., 2018, Seismic facies classification using different deep convolutional neural networks: SEG Technical Program Expanded Abstracts 2018, 2046-2050.
Figure 1.
Architecture of the CAE. Color rectangles denote multichannel feature maps. The number of channels is shown at the bottom.
Figure 2.
Examples of 96 × 48 tiles: original input data x (top row), CAE-reconstructed data x̂ (middle row), and binary maps used in the loss function (4) (bottom row).
Figure 3.
Sum of the explained variance (solid lines) and its individual contributions (dotted lines) for varying number of PCA components.
Figure 4.
4x coarse-to-fine grid interpolation scheme. Left: sliding window example. Right: weight matrix.
Figure 5.
Results of the CAE facies interpretation of a vertical seismic section from the training dataset. Top: original network output (a). Middle: automatically filtered image (b). Bottom: manual interpretation (c).
Figure 6.
Results of the CAE facies interpretation of a vertical seismic section from the test dataset. Top left: original network output (a). Top right: automatically filtered image (b). Bottom: manual interpretation (c).
Model   Kernel size   Trainable parameters   Minimum training error
1       3 × 3         …                      … × 10^-3
2       5 × 5         …                      … × 10^-3
Table 1.
Parameters of the networks and the lowest training set errors using loss (4) with ..