|
|
Line 1: |
Line 1: |
| {{Refimprove|date=September 2011}}
| | Irwin Butts is what my wife enjoys to call me although I don't truly like being known as like that. Her husband and her live in Puerto Rico but she will have to move 1 working day or an additional. What I adore performing is taking part in baseball but I haven't made a dime with it. I am a meter reader but I strategy on changing it.<br><br>Also visit my web site; [http://immbooks.com/blogs/post/44599 home std test kit] |
| {{Confusing|reason=complicated jargon is used throughout, lacking explanation|date=October 2013}}
| |
| '''Data assimilation''' is the process by which observations are incorporated into a computer model of a real system. Applications of data assimilation arise in many fields of geosciences, perhaps most importantly in [[weather forecasting]] and [[hydrology]]. Data assimilation proceeds by ''analysis cycles''. In each analysis cycle, observations of the current (and possibly past) state of a system are combined with the results from a numerical [[mathematical model|model]] (the ''forecast'') to produce an ''analysis'', which is considered as 'the best' estimate of the current state of the system. This is called the ''analysis step''. Essentially, the analysis step tries to balance the uncertainty in the data and in the forecast. The model is then advanced in time and its result becomes the forecast in the next analysis cycle.
| |
| | |
| ==Data assimilation as statistical estimation==
| |
| In data assimilation applications, the analysis and forecasts are best thought of as [[probability distribution]]s. The analysis step is an application of [[Bayes' theorem]] and the overall assimilation procedure is an example of [[recursive Bayesian estimation]]. However, the probabilistic analysis is usually simplified to a computationally feasible form. Advancing the probability distribution in time would be done exactly in the general case by the [[Fokker-Planck equation]], but that is unrealistically expensive, so various approximations operating on simplified [[Representation (mathematics)|representations]] of the probability distributions are used instead. If the probability distributions are normal, they can be represented by their mean and covariance, which gives rise to the [[Kalman filter]]. However it is not feasible to maintain the covariance because of the large number of degrees of freedom in the state, so various approximations are used instead.
| |
| | |
| Many methods represent the probability distributions only by the mean and impute some covariance instead. In the basic form, such analysis step is known as optimal statistical interpolation. Adjusting the initial value of the mathematical model instead of changing the state directly at the analysis time is the essence of the variational methods, 3DVAR and 4DVAR. Nudging, also known as Newtonian relaxation or 4DDA, is essentially the same as proceeding in continuous time rather than in discrete analysis cycles (the [[Kalman-Bucy filter]]), again with imputing simplified covariance.
| |
| | |
| [[Ensemble Kalman filter]]s represent the probability distribution by an ensemble of simulations, and the covariance is approximated by [[sample covariance]].
| |
| | |
| ==Weather forecasting applications==
| |
| | |
| Data assimilation is a concept encompassing any method for combining observations of variables such as [[temperature]] and [[atmospheric pressure]] into models used in [[numerical weather prediction]] (NWP).
| |
| | |
| Two main types of data assimilation are used: three-dimensional (3DDA) which uses only those observations available at the time of analysis, and four-dimensional (4DDA) which also includes the later observations.
| |
| | |
| ===History===
| |
| | |
| Originally "subjective analysis" had been used in which NWP forecasts had been adjusted by meteorologists using their operational expertise. Then "objective analysis" (e.g. Cressman algorithm) was introduced for automated data assimilation. These objective methods used simple interpolation approaches, and thus were 3DDA methods.
| |
| | |
| Later, 4DDA methods, called "nudging", were developed, such as in the [[MM5 (weather model)|MM5]] model. They are based on the simple idea of Newtonian relaxation (the 2nd axiom of Newton). They introduce into the right part of dynamical equations of the model a term that is proportional to the difference of the calculated meteorological variable and the observed value. This term that has a negative sign keeps the calculated [[State space (controls)|state vector]] closer to the observations. Nudging can be interpreted as a variant of the [[Kalman-Bucy filter]] (a continuous time version of the [[Kalman filter]]) with the gain matrix prescribed rather than obtained from covariances.
| |
| | |
| A major development was achieved by L. Gandin (1963) who introduced the "statistical interpolation" (or "optimal interpolation") method, which developed earlier ideas of Kolmogorov. This is a 3DDA method and is a type of [[regression analysis]] which utilizes information about the spatial distributions of [[covariance]] functions of the errors of the "first guess" field (previous forecast) and "true field". These functions are never known. However, the different approximations were assumed.
| |
| | |
| The optimal interpolation algorithm is the reduced version of the [[Kalman filtering]] (KF) algorithm and in which the covariance matrices are not calculated from the dynamical equations but are pre-determined in advance.
| |
|
| |
| Attempts to introduce the KF algorithms as a 4DDA tool for NWP models came later. However, this was (and remains) a difficult task because the full version requires solution of the enormous number of additional equations (~N*N~10**12, where N=Nx*Ny*Nz is the size of the state vector, Nx~100, Ny~100, Nz~100 - the dimensions of the computational grid). To overcome this difficulty, approximate or suboptimal Kalman filters were developed. These include the [[Ensemble Kalman filter]] and the Reduced-Rank Kalman filters (RRSQRT) (see Todling and Cohn, 1994).
| |
| | |
| Another significant advance in the development of the 4DDA methods was utilizing the [[optimal control]] theory (variational approach) in the works of Le Dimet and Talagrand (1986), based on the previous works of G. Marchuk, who was the first to apply that theory in the environmental modeling. The significant advantage of the variational approaches is that the meteorological fields satisfy the dynamical equations of the NWP model and at the same time they minimize the functional, characterizing their difference from observations. Thus, the problem of constrained minimization is solved. The 3DDA variational methods were developed for the first time by Sasaki (1958).
| |
| | |
| As was shown by Lorenc (1986), all the above-mentioned 4DDA methods are in some limit equivalent, i.e. under some assumptions they minimize the same [[Loss function|cost function]]. However, in practical applications these assumptions are never fulfilled, the different methods perform differently and generally it is not clear what approach (Kalman filtering or variational) is better. The fundamental questions also arise in application of the advanced DA techniques such as convergence of the computational method to the global minimum of the functional to be minimised. For instance, cost function or the set in which the solution is sought can be not convex. The 4DDA method which is currently most successful<ref>http://ams.confex.com/ams/91Annual/webprogram/Paper181664.html</ref><ref>http://hfip.psu.edu/EDA2010/MZhang.pdf</ref> is hybrid incremental 4D-Var, where an ensemble is used to augment the climatological background error covariances at the start of the data assimilation time window, but the background error covariances are evolved during the time window by a simplified version of the NWP forecast model. This data assimilation method is used operationally at forecast centres such as the [[Met Office]].<ref>http://www.ecmwf.int/newsevents/meetings/annual_seminar/2011/presentations/Barker.pdf</ref><ref>http://www.metoffice.gov.uk/research/modelling-systems/unified-model/weather-forecasting</ref>
| |
| | |
| ===Cost function===
| |
| The process of creating the analysis in data assimilation often involves minimization of a [[Loss function|cost function]]. A typical cost function would be the sum of the squared deviations of the analysis values from the observations weighted by the accuracy of the observations, plus the sum of the squared deviations of the forecast fields and the analyzed fields weighted by the accuracy of the forecast. This has the effect of making sure that the analysis does not drift too far away from observations and forecasts that are known to usually be reliable.
| |
| | |
| ====3D-Var====
| |
| | |
| <math>J(\mathbf{x}) = (\mathbf{x}-\mathbf{x}_{b})^{\mathrm{T}}\mathbf{B}^{-1}(\mathbf{x}-\mathbf{x}_{b}) + (\mathbf{y}-\mathit{H}[\mathbf{x}])^{\mathrm{T}}\mathbf{R}^{-1}(\mathbf{y}-\mathit{H}[\mathbf{x}]),</math>
| |
| | |
| where <math>\mathbf{B}</math> denotes the background error covariance, <math>\mathbf{R}</math> the observational error covariance.
| |
| | |
| <math>\nabla J(\mathbf{x}) = 2\mathbf{B}^{-1}(\mathbf{x}-\mathbf{x}_{b}) - 2\mathit{H}^T\mathbf{R}^{-1}(\mathbf{y}-\mathit{H}[\mathbf{x}])</math>
| |
| | |
| ====4D-var====
| |
| | |
| <math>J(\mathbf{x}) = (\mathbf{x}-\mathbf{x}_{b})^{\mathrm{T}}\mathbf{B}^{-1}(\mathbf{x}-\mathbf{x}_{b}) + \sum_{i=0}^{n}(\mathbf{y}_{i}-\mathit{H}_{i}[\mathbf{x}_{i}])^{\mathrm{T}}\mathbf{R}_{i}^{-1}(\mathbf{y}_{i}-\mathit{H}_{i}[\mathbf{x}_{i}])</math>
| |
| | |
| provided that <math>\mathit{H}</math> is linear operator (matrix).
| |
| | |
| ===Future development===
| |
| | |
| Factors driving the rapid development of data assimilation methods for NWP models include:
| |
| | |
| * Utilizing the observations currently offers promising improvement in [[forecast skill]] at a variety of spatial scales (from global to highly local) and time scales.
| |
| * The number of different kinds of available observations ([[sodar]]s, [[radar]]s, [[satellite]]) is rapidly growing.
| |
| | |
| {{Expand section|date=June 2008}}
| |
| | |
| ==Other applications==
| |
| {{Expand section|date=June 2008}}
| |
| Data assimilation methods are currently also used in other environmental forecasting problems, e.g. in [[Hydrology|hydrological]] forecasting. Basically, the same types of data assimilation methods as those described above are in use there. An example of chemical data assimilation using [[Autochem]] can be found at [http://www.cdacentral.info/ CDACentral].
| |
| | |
| Given the abundance of spacecraft data for other planets in the solar system, data assimilation is now also applied beyond the Earth to obtain re-analyses of the atmospheric state of extraterrestrial planets. Mars is the only extraterrestrial planet to which data assimilation has been applied so far. Available spacecraft data include, in particular, retrievals of temperature and dust/water ice optical ticknesses from the Thermal Emission Spectrometer onboard NASA's [[Mars Global Surveyor]] and the Mars Climate Sounder onboard NASA's [[Mars Reconnaissance Orbiter]]. Two methods of data assimilation have been applied to these datasets: an Analysis Correction scheme <ref name="montabone">http://www.atm.ox.ac.uk/group/gpfd/research.html#marsgcm</ref> and two Ensemble Kalman Filter schemes,<ref name="hoffman">http://www.eps.jhu.edu/~mjhoffman/pages/research.html</ref><ref name="wrf">http://www.marsclimatecenter.com</ref> both using a global circulation model of the martian atmosphere as forward model. The Mars Analysis Correction Data Assimilation (MACDA) dataset is publicly available from the British Atmospheric Data Centre.<ref name="badc">http://badc.nerc.ac.uk/home/</ref>
| |
| | |
| Data assimilation is a part of the challenge for every forecasting problem.
| |
| | |
| Dealing with biased data is a serious challenge in data assimilation. Further development of methods to deal with biases will be of particular use. If there are several instruments observing the same variable then intercomparing them using [[probability distribution function]]s can be instructive. Such an analysis is available on line at [http://www.pdfcentral.info/ PDFCentral] designed for the validation of observations from the NASA [[Aura (satellite)|Aura satellite]].
| |
| | |
| Other uses include trajectory estimation for the [[Apollo program]], [[GPS]], and [[Autochem|atmospheric chemistry]].
| |
| | |
| ==References==
| |
| * [[Roger Daley|R. Daley]], Atmospheric data analysis, Cambridge University Press, 1991.
| |
| * [http://www.mmm.ucar.edu/mm5/ MM5 community model homepage]
| |
| * [http://www.ecmwf.int/newsevents/training/lecture_notes/LN_DA.html ECMWF Data Assimilation Lecture notes]
| |
| * Ide, K., P. Courtier, M. Ghil, and A. C. Lorenc (1997) [http://www.atmos.ucla.edu/~kayo/data/publication/ide_etal_jmsj97.pdf Unified Notation for Data Assimilation: Operational, Sequential and Variational] Journal of the Meteorologcial Society of Japan, vol. 75, No. 1B, pp. 181–189
| |
| * [http://www.meted.ucar.edu/nwp/pcu1/ic6/frameset.htm COMET module "Understanding Data Assimilation"]
| |
| * Geir Evensen, Data Assimilation. The Ensemble Kalman Filter. Springer, 2007
| |
| * John M. LEWIS; S. Lakshmivarahan, Sudarshan Dhall, "Dynamic Data Assimilation : A Least Squares Approach", Encyclopedia of Mathematics and its Applications 104, Cambridge University Press, 2006 (ISBN 978-0-521-85155-8 Hardback)
| |
| | |
| ==Footnotes (hyperlinks)==
| |
| {{Reflist|colwidth=30em}}
| |
| | |
| ==External links==
| |
| Examples of how variational assimilation is implemented weather forecasting at:
| |
| * [[ECMWF]] http://www.ecmwf.int/research/ifsdocs/ASSIMILATION/Chap1_Overview2.html
| |
| * the [[Met Office]] http://www.metoffice.gov.uk/science/creating/first_steps/data_assim.html?zoneid=79046
| |
| | |
| Other examples of assimilation:
| |
| * [http://www.CDAcentral.info/ CDACentral (an example analysis from Chemical Data Assimilation)]
| |
| * [http://www.PDFcentral.info/ PDFCentral (using PDFs to examine biases and representativeness)]
| |
| * [http://www.openda.org/joomla/index.php OpenDA – Open Source Data Assimilation package]
| |
| * [http://pdaf.awi.de PDAF – open-source Parallel Data Assimilation Framework]
| |
| * [http://www.data-assimilation.net/ SANGOMA New Data Assimilation techniques]
| |
| | |
| {{DEFAULTSORT:Data Assimilation}}
| |
| [[Category:Weather forecasting]]
| |
| [[Category:Numerical climate and weather models]]
| |
| [[Category:Estimation theory]]
| |
| [[Category:Control theory]]
| |
| [[Category:Bayesian statistics]]
| |
| [[Category:Climate and weather statistics]]
| |
| [[Category:Statistical forecasting]]
| |