Physics Guided Machine Learning algorithm for MAX-DOAS retrieval

Files

TR Number

Date

2023-01-18

Journal Title

Journal ISSN

Volume Title

Publisher

Virginia Tech

Abstract

Multi Axis Differential Optical Absorption Spectroscopy (MAX-DOAS) is a passive remote sensing technique that has been widely used to derive aerosol extinction coefficient profiles and trace gas concentrations. The ill-posed nature of the MAX-DOAS inversion problem makes it almost impossible to design an inversion algorithm providing a definite solution. A possible way to find a low-error inversion algorithm is incorporating the machine learning (ML) technique into the MAX-DOAS retrieval. This dissertation serves as the author's exploration of designing such an ML-based inversion algorithm. The inversion problem is formulated as a supervised learning problem and the ML models are trained on synthetic datasets simulated by radiative transfer models.newline By starting with a feasibility study, it is first shown that a ML model with appropriate architecture (CNN+LSTM) is capable of extracting aerosol extinction coefficient profile, single scattering albedo and asymmetry factor from one MAX-DOAS scan. Then more realistic atmosphere states were used for generating the training set. Due to the high time cost of radiative transfer simulations, a data augmentation strategy was put forward to increase the number of samples in the training set. A physics-guided machine learning (PGML) algorithm was designed to retrieve aerosol information and trace gas concentrations simultaneously. The model is named as PGML model because: (1) its prediction is based on the physical laws it has learnt from the radiative transfer simulations and (2) introduction of the physical constraints and the pseudo-inverse layer. The PGML model was tested on both a synthetic test set and real MAX-DOAS measurements from Pandora instruments. Evaluation on the synthetic dataset suggests that with similar data distribution, the PGML model is capable of retrieving aerosol extinction coefficient profile, trace gas concentration profile and the box-AMFs with good accuracy. Validation on real data was done via comparisons with inversion results given by other algorithms. Generally, moderate linear correlation were found between the inversion results. Limitation of current version of the PGML model and factors might lead to the discrepancies between inversion results given by the PGML model and other algorithms were discussed.

Description

Keywords

MAX-DOAS, Physics-guided machine learning, aerosol and trace gas retrieval

Citation