Who is afraid of missing data in spectral analysis
System Identification, Volume # 14 | Part# 1
Authors
Piet M. T. Broersen
Identifier
10.3182/20060329-3-AU-2901.00109
Index Terms
ARMA model,autoregressive model,autocovariance estimation,missing observations,order selection,parametric model,spectral estimation
Abstract
The program ARMAsel automatically selects a single time series model for given stationary stochastic data. Three model types are candidates for selection. The type can be AR or autoregressive, MA or moving average and the combined ARMA type. The parameters of that selected model accurately represent the power spectral density and the autocorrelation function of the data. The reduced statistics ARMAsel-rs algorithm uses a long AR model as only input to compute models of the other types and to select the best. ARMAsel-mis is a new program that can handle missing data. It computes AR models with a numerically stable maximum likelihood algorithm and uses ARMAsel-rs to determine MA and ARMA models. The order and the type of the best candidate are automatically selected with dedicated missing data criteria, supposing that data are randomly missing. Missing less than 10 % of the data has not much influence on the accuracy, missing 50 % generally looses a factor two in the accuracy in comparison with the model estimated from the same number of contiguous observations. Low order time series models can be computed as long as the product of the remaining fraction and the remaining number of observations is greater than about 10.
References
[1] Anderson B. D. O. and J. B. Moore (1979). Optimal
Filtering, Englewood Cliffs, Prentice-Hall.
[2] Broersen, P. M. T. (1998). The quality of models for
ARMA processes. IEEE Trans. on Signal
Process., 46, pp. 1749-1752.
[3] Broersen, P. M. T. (2000). Autoregressive model
orders for Durbin's MA and ARMA estimators.
IEEE Trans. on Signal Process., 48, pp. 2454-
2457.
[4] Broersen, P. M. T. (2002). Automatic spectral analysis
with time series models. IEEE Trans. on Instrumentation
and Measurement, 51, pp. 211-216.
[5] Broersen, P. M. T. (2002). ARMASA toolbox, can be
found under spectral analysis at
www.mathworks.com/matlabcentral/fileexchange/
[6] Broersen, P. M. T. (2004). Automatic time series
identification, spectral analysis with Matlab
toolbox ARMASA. Proc. SYSID 2003, 13th IFAC
Symp. on System Identification, Rotterdam, pp.
1435-1441.
[7] Broersen, P. M. T. and R. Bos (2006). Time series
analysis if data are randomly missing. IEEE
Trans. on Instrumentation and Measurement, 55,
no. 1, accepted for publication.
[8] Broersen, P. M. T. and S. de Waele (2005). Automatic
identification of time series models from long
autoregressive models. IEEE Trans. on Instrumentation
and Measurement, 54, no. 5, pp. 1862-
1868.
[9] Broersen, P. M. T., S. de Waele and R. Bos (2004).
Autoregressive spectral analysis when
observations are missing. Automatica, 40, pp.
1495-1504.
[10] Jones, R. H. (1980). Maximum likelihood fitting of
ARMA models to time series with missing
observations. Technometrics, 22, pp. 389-395.
[11] Kay S. M. and S. L. Marple (1981). Spectrum analysis-a
modern perspective. Proc. IEEE, 69, pp. 1380-
1419.
[12] Little R. J. A. and D. B. Rubin (1987). Statistical
analysis with missing data. New York: Wiley.
[13] Lomb, N. R. (1976). Least squares frequency analysis
of unequally spaced data. Astrophysics and
Space Science, 39, pp. 447-462.
[14] Priestley, M. B. (1981). Spectral Analysis and Time
Series. London, U.K.: Academic Press.
[15] Scargle, J. D. (1989). Studies in astronomical time
series analysis III. Fourier transforms, auto-correlation
functions, and cross-correlation
functions of unevenly spaced data. The
Astrophysics Journal, 343, pp. 874-887.
[16] Waele, S. de (2003). Automatic Spectral Analysis, it
can be found under spectral analysis at
www.mathworks.com/matlabcentral/fileexchange
[17] Waele, S. de and P. M. T. Broersen (2000). The Burg
algorithm for segments. IEEE Transactions on
Signal Processing, 48, pp. 2876-2880.
