Microsoft Word - ACVE_2013_ML2PP_editorial_2014_01_13.doc ANNALS OF GEOPHYSICS, 56, FAST TRACK-1, 2013; 10.4401/AG-6341 1 MIPAS Level 2 Processor Prototype: from validation to operation MARC BERNAU1,*, HEIDRUN WEBER2, MICHAEL SCHMITT1, SVEN BARTHA1, AND ROLAND GESSNER1 1 Astrium GmbH {marc.bernau | michael.schmitt | sven.bartha | roland.gessner} (at)astrium.eads.net 2 ESA ESTEC heidrun.weber(at)esa.int I. INTRODUCTION n the research field of atmospheric chemistry a central question for acquired data sets is about validation. Have the data been validated to be useful for science? Has the data set been compared to other data sets? If deviations occur, which cause could be identified? Ultimately, two causes are pos- sible when the same scene is observed: either the acquired raw data set is erroneous (hardware prob- lem) or the data processing infers erroneous infor- mation (software problem). In order to make sure that the software works as expected, software vali- dation plays a key role in the overall data set valida- tion campaigns. This paper deals with operational software validation, which is an important compo- nent of the entire scientific validation chain. In the framework of the Michelson Interferometer for Passive Atmospheric Sounding (MIPAS) mis- sion, a sophisticated iterative algorithm scheme has been developed by many contributors, which can be used to infer the concentration of atmospheric trace gases from the measured spectra. Actually, this al- gorithm has been implemented in three instances for the operational phase of the mission with each fulfilling dedicated tasks: • a scientific core implementation for proof of feasibility and scientific validation named Op- timized Retrieval Model (ORM) coded accord- ing to an Algorithm Theoretical Baseline Doc- ument (ATBD) [Carli et al., 2013], • an industry standard implementation of the sci- entific algorithm including the framework around this algorithm for performance refer- ence and operational processor validation alias MIPAS Level 2 Processor Prototype (ML2PP), and • a ground segment implementation for opera- tional processing mirroring the ML2PP algo- rithm referred to as Instrument Processing Fa- cility (IPF). In this paper, Astrium’s ML2PP used for processing calibrated MIPAS spectra (included in the level 1B product) into atmospheric profiles (included in the level 2 product) is presented in terms of its • General processing scheme, • Role during the mission, • Software validation concept. * Corresponding author: Marc Bernau Dept. System Performance Verification & User Ground Seg- ments, Claude-Dornier-Strasse, 88039 Friedrichshafen, Germany I ANNALS OF GEOPHYSICS, 56, FAST TRACK-1, 2013; 10.4401/AG-6341 2 The outline of this paper is as follows. In section 2, a brief introduction of the MIPAS instrument aboard ENVISAT, which was operational from 2002 until 2012, is given. Then, the processing scheme from level 1B to level 2 is briefly described in section 3. Section 4 recalls the history of the algorithm devel- opment, and section 5 presents the software valida- tion concept. Section 6 concludes this paper. II. THE MIPAS MISSION MIPAS, being one of three atmospheric chemistry instruments aboard ESA's largest Earth Observation satellite ENVISAT, is a limb-sounding FTIR (Fourier transform infrared) spectrometer observing atmos- pheric data in the mid-infrared range from 4.15 µm to 14.6 µm with high spectral resolution. The meas- ured data set is very rich in information as each spectrum comprises of the order of 50000 spectral samples. MIPAS was designed to detect limb emission spec- tra in the Earth’s lower, middle and upper atmos- phere (cf. Figure 1), covering tangent heights from 5 to 150 km. During the operational phase, MIPAS gathered spectrally high resolved data in five spec- tral bands, which can be used for absorption spec- troscopy. After in-flight anomalies of the MIPAS moving mirror, starting in March 2004, the instru- ment operation had to be suspended. After an analysis period, however, MIPAS could be recovered starting from a 35% duty cycle in January 2005 to a full mission with an optimized spectral resolution (see also [Raspollini et al., 2013]). The MIPAS in-flight operational performance stabilized gradually, until the observed anomaly pattern that led to the mission interruption had practically dis- appeared. As a result of the modified spectral reso- lution, also the ground processing algorithms need- ed to be adapted accordingly. MIPAS was operated in two viewing ranges, a rearward viewing range (in anti-flight direction of the satellite) and sideways. Further, instrument cal- ibrations were used for regular radiometric calibra- tions (using offset calibrations and measurements from an internal blackbody) and line-of-sight cali- brations to correct for pointing errors. For further readings about the mission and the MIPAS instru- ment refer to [Raspollini et al., 2013]. III. PROCESSING The MIPAS data processing takes place sequentially as presented in Figure 2. The level 2 processing forms the final part of a processing chain, taking as input the level 1B (L1B) product, which contains calibrated spectra. It outputs atmospheric profiles of pressure, temperature and dedicated trace gases as a function of altitude. The number of different trace gases in the retrieval scheme has increased from ini- tially six species for the first ML2PP version (O3, H2O, HNO3, CH4, N2O, NO2) to ten species for ver- sion 6.00 (plus: F11, F12, N2O5, ClONO2) and to 15 species for ML2PP version 7.00 (plus: F22, F14, COF2, CCl4, HCN). The recently added species Spectral range: 685 - 2410 cm -1 (5 bands) 1020-1170 AB 1215-1500 B 1570-1750 C 1820-2410 D 685-970 A cm-1 Spectral range: 685 - 2410 cm -1 (5 bands) Spectral range: 685 - 2410 cm -1 (5 bands) 1020-1170 AB 1020-1170 AB 1215-1500 B 1215-1500 B 1570-1750 C 1570-1750 C 1820-2410 D 1820-2410 D 685-970 A cm-1685-970 A cm-1 Fourier Transform IR (FTIR) Limb Sounding Spectrometer day and night measurements Figure 1: MIPAS measurement scheme MIPAS Onboard Processing ENVISAT Instrument Source Packets Level 1A Processing Raw Interferograms Level 1A Level 0 Level 1B Processing Calibrated Spectra Level 1B Atmospheric Profiles Level 2 Level 2 Processing Space Segment Ground Segment L e v e l 2 P ro c e s s in g L e v e l 1 B P ro c e s s in g Processing Data Product Hardware Data Flow Legend Scope of ML2PP Figure 2: MIPAS data processing ANNALS OF GEOPHYSICS, 56, FAST TRACK-1, 2013; 10.4401/AG-6341 3 show weaker spectral signatures either due to low atmospheric volume mixing ratio or low emis- sion/absorption per molecule or both. The capabil- ity to retrieve such weak species demonstrates the quality of the Level 2 processing algorithm scheme and requires high accuracy of the implementation. The general processing approach is schematically shown in Figure 3. Besides the L1B product, several auxiliary files (pointing information, spectral data, cross-section data, initial guess data, and program settings) are required for the level 2 processing. Op- tionally, also European Centre for Medium-Range Weather Forecasts (ECMWF) files can be provided to the ML2PP. The level 2 algorithm works in a scan by scan fash- ion, where one scan consists of a set of calibrated spectra at different tangent altitudes (sweeps). For each scan at first the pressure and temperature pro- files (p,T) are retrieved from CO2 observations as- suming a fixed atmospheric CO2 profile. With the retrieved pressure and temperature profiles then kept fixed, the different trace gas retrievals are per- formed one after the other. The initial guess is the starting point for each of the retrievals, and plays a crucial role for its convergence. Depending on availability, the initial guess is an optimized com- position of climatological profiles, ECMWF profiles (for p,T retrieval only), or retrieved profiles from the same or the previous scan. The retrieval itself is based on a Levenberg- Marquardt algorithm, an iterative scheme that com- bines the advantages of the Newton-Gauss algo- rithm and the method of gradient descent for find- ing the minimum of a specific function. In the case of the MIPAS L2 processing, the function to be min- imized is the difference between the measured spec- trum and a simulated spectrum that is produced by the current set of atmospheric profiles in the itera- tion. If the defined convergence criteria are fulfilled, the retrieval is successfully terminated. Otherwise, a new set of atmospheric profiles is determined based on the difference of the spectra of the previous itera- tion. For detailed information on the algorithm refer to [Carli et al., 2013]. IV. LEVEL 2 ALGORITHM DEVELOPMENT In section 1, three level 2 processors have been in- troduced. The dedicated purposes and their role during the mission are summarized in this section. ORM The scientific processor to be used as scientific ref- erence was determined in a shoot-out of several ap- proaches to the level 2 processing scheme. Eventu- ally, the IFAC institute, Florence, Italy, has won this competition. It is interesting to note that all other independently developed processors are still being maintained, advanced, and tested on MIPAS data. This allows for inter-comparisons of all level 2 algo- rithms, which is extremely useful for validating the ORM itself on algorithm level (see also [Laeng et al., 2013]). Perform p,T Retrieval Compute Initial Guess for p,T Retrieval Receive/Store p,T and Altitude Profiles Preprocessing Read Level 1b Product & Auxiliary Files Compute Initial Guess for VMR Retrieval Species #n Perform VMR Retrieval Species #n Receive/Store VMR Profile Species #n Last Species? Last Scan? Write Level 2 Product Stop Start Select next Species Select next Scan yes yes no no Level 1B Product Auxiliary Files ECMWF Files Level 2 Product Processing Data Product File I/O Data Flow Legend Figure 3: Level 2 processing algorithm ANNALS OF GEOPHYSICS, 56, FAST TRACK-1, 2013; 10.4401/AG-6341 4 ML2PP The ML2PP is the industrial reference implementa- tion of the scientific core implementation ORM. The main purposes of developing this instance are to • Critically review the algorithm, • Implement the processor in a high level lan- guage following standard coding rules, • Implement the framework around the scientific core with read/write routines for the input and output files as they are used in the operational ground processing unit, • Develop a software validation plan and carry out the validation against the ORM with dedi- cated test procedures, • Prepare documentation on the software, the in- terfaces, the validation report, etc. The history of the ML2PP started with the devel- opment phase as early as the mid 1990's which re- sulted in the first validated delivery in 1998. When the first operational MIPAS data were downloaded and fed into the ML2PP, the processor converged without problems. This proved the quality of the algorithm. The ML2PP eventually turned into an operational processor when the version V6.00 became the des- ignated processor for offline reprocessing activities for the full mission in 2012, providing L2 data sets to the user community. Between these milestones lie the activities related to continuous improvement of the MIPAS processing in general and the L2-processing in particular. A major upgrade of the processing-scheme became necessary after a malfunction of the MIPAS instru- ment in 2004, which was ultimately solved by re- ducing the spectral resolution in 2005. The ML2PP history is still ongoing as version 7.01 is the desig- nated software tool for the reprocessing of all valid MIPAS L1b data into L2 products in 2014. By then, a rich collection of science and data quality improve- ments will have been implemented that have been elaborated by the MIPAS Quality Working Group (QWG) members and supporters while analyzing and processing MIPAS data during 10 years of ser- vice. IPF The IPF was operational in the ground segment, thus providing the operational level 2 data sets dur- ing the mission from 2002 to 2012. The IPF instance of the algorithm is validated against the ML2PP. Having three algorithms validated against each other has certain advantages, but this ultimately be- came a problem after adjusting the algorithm for the optimized MIPAS resolution, due to the instrument malfunction in 2004. The sequential adjustment from the ORM via ML2PP to the IPF actually de- layed the operational ground processing for years. In order to speed up the validation phase, it was discussed to drop either the ML2PP or the IPF from the validation chain. Due to the unexpected end of mission, it never came to that decision, and the IPF remained operational in the ground segment until end of mission. For the reprocessing campaign of the complete MIPAS data set the ML2PP was cho- sen as operational processor. V. VALIDATION CONCEPT The software validation concept of MIPAS level 2 processing is a two-step procedure. First, the ML2PP is validated against the scientific instance ORM. Secondly, the IPF is validated against ML2PP (cf. Figure 4). Having three independently coded instances of the same algorithm lowers the probabil- ity of coding errors, which is a true benefit for such a complex algorithm that is prone to cumulative er- rors. Although the same level of quality can also be reached with two instances instead of three, the choice of having a third implementation was mainly motivated by political and not scientific interest. A clear disadvantage associated with this two-step validation is the danger of having long adjustments of all software versions when algorithmic changes are made. ANNALS OF GEOPHYSICS, 56, FAST TRACK-1, 2013; 10.4401/AG-6341 5 The validation procedure itself is a process contain- ing several steps, and follows different comparison criteria. In order to compare software outputs it is necessary to compare intermediate results, especial- ly for algorithms containing so many calculations and iterations as for the ML2PP. Thus, the algo- rithms have been designed to write intermediate test data sets used for validation. As an example, the pressure, temperature and volume mixing ratios are being written to files after every iteration cycle for comparison of intermediate results. The driving question for the comparison of two files is the al- lowed margin for the numbers being compared. Ul- timately, a set of margins basically defines by how much a physical process may deviate from a clear mathematical formulation. The margins initially proposed have been deter- mined by running the ORM algorithm on different hardware platforms and comparing the results in terms of statistical evaluations. These statistics campaigns should lead to margins of the order of the hardware precision. In practice, such a precision is impossible to reach in different implementations, e.g. different programming languages (FORTRAN vs. C++), different numerical precision during cal- culations (floating point vs. double point variables), and different implementations of sub-routines (us- age of external libraries vs. self-coded routines). Moreover, the numerical precision is also affected by low-level calculations such as taking the loga- rithm of a number, which reduces the accuracy. Al- so, the sequence of adding, subtracting, dividing, and multiplying numbers can be important when very big and very small numbers are involved. The margins certainly had to be increased taking into account these fundamental differences. An ana- lytical approach was not feasible because of the al- gorithm’s complexity. Especially the large numbers of iterations during the processing of several scans in sequence let errors accumulate, and thus further degrade the numerical accuracy between both algo- rithms. The choice for margins was then solved by comparing the intermediate ML2PP results with ORM results for a robust test case where both algo- rithms converged after the same number of itera- tions to a very similar result. Margins have been de- fined for all parameters written to the intermediate files and to the level 2 product file. Ultimately, these margins are much higher than the initially proposed hardware precision, but are still small enough to state that the processing results of both different al- gorithms can be labeled as being ‘identical’. The margins for all values of the test outputs as well as for the level 2 output have been reported in a cor- responding ‘Test Definition and Procedure Docu- ment’. In order to provide an example of the select- ed margins, some results are shown in the conclu- sion section. All margins and thus the precision re- quirements were fulfilled by ORM vs. ML2PP and ML2PP vs. IPF. Thus, the operational software is considered validated. VI. RESULTS AND CONCLUSION This paper discussed the three ESA processors ORM, ML2PP and IPF for MIPAS L1b to L2 pro- cessing with focus on the ML2PP by presenting its • General processing scheme, ORM ABC ML2PP IPF ORM SDC Development Instance Baseline Reference Consolidated Subset of Functions Prototype Instance Operational Instance Level 2 Product Test data (TEPs) Level 2 Product Test data (TEPs) Margins fulfilled? yes validated no not validated Level 2 Product Test data (TEPs) Level 2 Product Test data (TEPs) Margins fulfilled? yes validated no not validated ML2PP Prototype Instance 1. Validation 2. Validation Figure 4: Validation concept ANNALS OF GEOPHYSICS, 56, FAST TRACK-1, 2013; 10.4401/AG-6341 6 • Role during the mission, • Software validation concept. The operational software validation is done by comparing ORM results with ML2PP results. For this comparison some margins were defined for de- ciding whether the outputs can be considered ‘iden- tical’. These comparisons are made for all values in the level 2 product and for intermediate results. As an example, the temperature and water vapour retrieval results of the scientific processor (ORM) and of ML2PP are plotted on the left hand side in figure 5. Both plot lines appear congruent, which is the expected result. On the right hand side, the nu- merical differences are plotted on a logarithmic ab- scissa. The allowed margins (vertical red lines) are usually percentage values of the maximum retrieval value. For water vapour retrieval, the allowed error is two percent of the maximum value (here: maximum is around 7.5 ppm). The error is well below that mar- gin by more than one magnitude, which means that a very good match of the outputs is achieved with relative errors below 0.1 percent (here: error is low- er than 0.0075 ppm). For temperature retrieval, two percent of the maxi- mum would mean a margin of a few Kelvin, which is higher than the MIPAS accuracy requirement of 0.1 K. Therefore, the plots show a stricter absolute margin of 0.1 K. Again, the error is well below that margin with accuracy better than 0.01 K. As a conclusion, software implementation valida- tion is an important component of the entire MIPAS level 2 product validation chain. The ML2PP has been performing this task for the MIPAS mission since mid 1990’s. Eventually, the ML2PP was cho- sen to provide the operational MIPAS L2 products since 2012. REFERENCES [Carli et al., 2013] B. Carli, M. Carlotti, S. Ceccherini, M. Höpfner, P. Raspollini, M. Ridolfi, L. Santurri, and L. Sgheri (2013). MIPAS Level 2 Algorithm Theoretical Baseline Document, issue 6.0 http://www2.fci.unibo.it/~ridolfi/mipas/docs/at bd/atbd_issue_6.0.pdf (December 19, 2013). [Raspollini et al., 2013] Raspollini, P., Carli, B., Carlotti, M., Ceccherini, S., Dehn, A., Dinelli, B. M., Dudhia, A., Flaud, J.-M., López-Puertas, M., Niro, F., Remedios, J. J., Ridolfi, M., Sembhi, H., Sgheri, L., and von Clarmann, T.: Ten years of MIPAS measurements with ESA Level 2 processor V6 - Part 1: Retrieval algorithm and diagnostics of the prod- ucts, Atmos. Meas. Tech., 6, 2419-2439, doi:10.5194/amt-6-2419-2013, 2013. [Laeng et al., 2013] Laeng, A, Hubert, D, Verhoelst, T, von Clarmann, T, Dinelli, B.M., Dudhia, A., Raspollini, P., Stiller, G., Grabowski, U., Keppens, A., Kiefer, M., Sofieva, V., Froideveaux, L., Walker, K.A., Lambert, J.-C., Zehner, C., The Ozone Climate Change Initiative: Comparison of four Level 2 Pro- cessors for the Michelson Interferometer for Passive Atmospheric Sounding (MIPAS), submitted to Re- mote Sensing of Environment, 2013. 180 200 220 240 260 280 0 20 40 60 80 a lt it u d e [ k m ] temperature [K] Temperature ORM ABC 3.0 ML2PP V7.00 0.0001 0.001 0.01 0.1 1 0 20 40 60 80 abs. error [K] a lt it u d e [ k m ] Temperature Error − (SNR=96.5 dB) error margin 0 2 4 6 8 0 20 40 60 80 [ppm] a lt it u d e [ k m ] H2O retrieval ORM ABC 3.0 ML2PP V7.00 0.0001 0.001 0.01 0.1 1 10 0 20 40 60 80 rel. error wrt max. ppm in % H2O Error − (SNR=69.6 dB) a lt it u d e [ k m ] error margin Figure 5: Comparison of processor outputs