Week 0: Introductions to time series analysis and the AR(1) process

Time Series Analysis

The AR(1) process, Stationarity, ACF, PACF, Differencing, and Smoothing
Coursera
notes
Bayesian Statistics
Autoregressive Models
Time Series
Author

Oren Bochman

Published

Tuesday, October 22, 2024

Keywords

time series, notation, bibliography, R code

I decided to migrate some material that is auxiliary to the course:

  1. An overview of the course.
  2. A review of some mathematical and statistical results used in the course.
  3. A bibliography of books I found useful in the course.
  4. A Feynman notebook for the course, which now lives in a separate notebook.

Course Card

  • Course: Bayesian Statistics: Time Series
  • Offered by: University of California, Santa Cruz
  • Instructor: Raquel Prado
  • Certificate: Yes
  • Level: Graduate
  • Commitment: 4 weeks of study, 3-4 hours/week

Overview of the course

Setting the Bayesian layer aside, this course is very similar to a classic introductory time series course (AR, MA, ARMA, ARIMA, SARIMA, DLM, etc.).

One of the questions I had when I started this course was: what is the difference between a Bayesian approach to time series analysis and a classical one? The following is a summary of what I found.

Are we Being Bayesian?

The Bayesian approach shows up primarily in:

  1. Sections on Bayesian inference, where we do inference on the parameters of the models.
  2. Bayesian prediction: unlike an MLE prediction, it is a distribution of predictions rather than a point estimate, and is therefore useful for quantifying uncertainty (see the sketch after this list).
  3. Material on model selection: this again is where the Bayesian approach offers more powerful tools than the classical one.
  4. When we want to quantify the uncertainty in our model, we have four sources of uncertainty:
    1. Uncertainty about whether we are using the correct model (structure).
      • I consider this an epistemic uncertainty.
      • One could reduce it by collecting more data, then applying Bayesian model selection to choose the best model.
    2. Uncertainty due to the estimation of the model parameters. This is also epistemic: we can reduce it by collecting more data, which narrows the credible intervals for these parameters under the Bayesian approach.
    3. Uncertainty due to the random shocks \epsilon_t in the period being predicted. This is an aleatory uncertainty.
    4. Uncertainty in the forecasted values X_{t+h}. Items 2 and 3 can be quantified using a credible interval in the Bayesian approach, and as we predict further into the future the interval grows.
  5. Model selection is a big part of the Bayesian approach. We can use the DIC, WAIC, and LOO to compare models.
  • Professor Prado's book is very comprehensive; it covers plenty of additional models and references lots of recent research, including VAR and VARMA models, Kalman filters, SMC/particle filters, etc. These are useful for the continuous-control flavours of RL, but you will need to learn them on your own.
  • In the capstone project, the next course in the specialization, the teacher adds another layer of sophistication by introducing mixtures of time series models.
  • However, unlike some courses I took, here we dive deep enough and get sufficient examples to understand how to put all the bits together into more sophisticated time series models.
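To make point 2 concrete, here is a minimal R sketch (my own, not course code) of Bayesian forecasting for an AR(1) under the conditional likelihood and the reference prior: each posterior draw of (\phi, v) is propagated forward, so the predictive intervals capture both the epistemic parameter uncertainty and the aleatory shocks, and they widen with the horizon.

```r
set.seed(1)
n <- 200
y <- arima.sim(model = list(ar = 0.9), n = n)  # simulated AR(1) data

# Conditional likelihood: regress y_t on y_{t-1}
X <- y[1:(n - 1)]; Y <- y[2:n]
phi_hat <- sum(X * Y) / sum(X^2)   # least-squares / MLE estimate of phi
rss <- sum((Y - phi_hat * X)^2)    # residual sum of squares

h <- 20; S <- 2000                 # forecast horizon and number of posterior draws
paths <- matrix(0, S, h)
for (s in 1:S) {
  v   <- 1 / rgamma(1, (n - 2) / 2, rss / 2)    # draw v | y (reference prior)
  phi <- rnorm(1, phi_hat, sqrt(v / sum(X^2)))  # draw phi | v, y
  x <- y[n]
  for (t in 1:h) {
    x <- phi * x + rnorm(1, 0, sqrt(v))         # aleatory shock epsilon_t
    paths[s, t] <- x
  }
}
# 95% posterior predictive intervals widen as the horizon grows
t(apply(paths, 2, quantile, probs = c(0.025, 0.5, 0.975)))
```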

Mathematical Review

  • There is an issue with mathematics: most results and techniques are so rarely used that students soon forget all but a few very useful ones. Having a good memory is a great asset in mathematics, but it is rarely enough. I like to review some mathematical results from my undergraduate days every five years or so. This keeps many of the results fresh in my mind and also makes reading new mathematics easier. Fundamentals in mathematics can go a very long way: material from topology, determinants and solving linear equations, numerical methods for decomposing matrices, definitions of certain groups, and so on.

  • One reason this and other Bayesian courses and books can be challenging, and even overwhelming, is that they use lots of mathematics. This ranges from high school material like complex numbers and the quadratic formula, through intermediate results like finding the roots of characteristic polynomials, eigenvalues, Toeplitz matrices, and Jordan forms, to advanced topics like the Durbin-Levinson recursion and certain results from functional analysis.

Note that I have not even touched on probability and statistics in that list.

Rather than complain, I see this as an opportunity to review and learn some mathematics and statistics that can be useful to a data scientist. During my last stint in data science I was often able to write formulas, but more often than not I felt I lacked sufficient mathematical tools to manipulate them into the kind of results I wanted. Rather than learning lots of mathematics, I wanted to find the most practical and useful results for wrangling maths. When I was a physics undergraduate these might be trigonometric identities, completing the square, familiarity with many integrals, Taylor or Maclaurin series approximations, a few useful inequalities, and occasionally l'Hôpital's rule. Familiarity with some ODEs was also greatly beneficial, as these come up in many physical models. Later on, Hermitian and unitary matrices, Fourier expansions, spectral theory, and some results from functional analysis were useful.

For statistics, the variants of the law of large numbers and the central limit theorem, convergence theorems, manipulations of the normal distribution, and the linearity of expectation can get you a long way. But you have to remember lots of definitions, and there are many results and theorems that seem to be stepping stones to other results rather than of any practical use.

On the other hand, the conjugacy of certain distributions, as demonstrated by Herbert Lee and other instructors in this specialization, is often very challenging. Charts of convergence of distributions to other distributions under certain conditions are neat, but like most results in mathematics I never had a clear sense of where they might be used; the same goes for Hoeffding's inequality and the Markov inequality. Then there are certain results like the convergence of Markov chains, doubly stochastic matrices, and De Finetti's theorem in statistics.

I have found that the more I learn the more I can understand and appreciate the material.

  • The autoregressive process gives rise to Toeplitz matrices, which can be solved using the Durbin-Levinson recursion mentioned many times in the course.
  • The Durbin-Levinson recursion is an advanced topic not covered in the Numerical Analysis or Algebra courses I took.
  • To use it with time series we also need to understand the Yule-Walker equations.
  • AR(p) processes require some linear algebra concepts like eigenvalues, eigenvectors, and characteristic polynomials.
  • For the AR(p) we use the Wold decomposition theorem to get to the infinite-order moving average representation, and this is not a result I recall learning in my functional analysis course. We also use some complex numbers, Fourier analysis, and spectral density functions.

Below I summarize some of the extracurricular material I found useful in the course.

Complex Numbers (Review)

When we wish to find the roots of real valued polynomials we will often encounter complex numbers. In this course such polynomials arise naturally in the characteristic polynomials of AR(p) processes.

We will need the polar form of complex numbers to represent some variants of the AR(p) process.

Complex numbers z \in \mathbb{C} are numbers that can be expressed in the form z = a + bi, where a,b\in\mathbb{R} and i is the imaginary unit, defined as the square root of -1. Complex numbers can be added, subtracted, multiplied, and divided just like real numbers.

The complex conjugate of a complex number z = a + bi is denoted by \bar{z} = a - bi. The magnitude of z, denoted |z| = \sqrt{a^2 + b^2}, is sometimes called the modulus in this course. The argument of z is \text{arg}(z) = \tan^{-1}(b/a). The polar form of a complex number is z = r e^{i \theta}, where r = |z| and \theta = \text{arg}(z).


The polar form of a complex number is given by:

\begin{aligned} z &= |z| e^{i \theta} \\ &= r (\cos(\theta) + i \sin(\theta)) \end{aligned} \tag{1}

where:

  • |z| is the magnitude of the complex number, i.e. the distance from the origin to the point in the complex plane.
  • \theta is the angle of the complex number.

I think we will also need unit roots.
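As a quick base-R illustration (the AR(2) coefficients below are illustrative values of my own, not from the course): polyroot finds the complex roots of a characteristic polynomial, and Mod, Arg, and Conj give the modulus, argument, and conjugate used in the polar form.

```r
# Roots of the AR(2) characteristic polynomial 1 - 0.5 z + 0.3 z^2
z <- polyroot(c(1, -0.5, 0.3))
Mod(z)                     # modulus r = |z|
Arg(z)                     # argument theta
Mod(z) * exp(1i * Arg(z))  # reconstructs z = r e^{i theta}
Conj(z)                    # the roots come in a complex conjugate pair
all(Mod(z) > 1)            # stationarity: all roots outside the unit circle
```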

Eigenvalues, Eigenvectors, Characteristic Polynomials, and Unit Roots

The eigenvalues of a matrix are the roots of its characteristic polynomial. The characteristic polynomial of a matrix A is defined as:

\begin{aligned} \text{det}(A - \lambda I) = 0 \end{aligned}

where \lambda is an eigenvalue and I is the identity matrix. The eigenvectors of a matrix are the vectors that satisfy the equation:

\begin{aligned} A v = \lambda v \end{aligned}

where v is the eigenvector and \lambda is the eigenvalue. The eigenvalues and eigenvectors of a matrix are used in many applications in mathematics and physics, including the diagonalization of matrices, the solution of differential equations, and the analysis of dynamical systems.
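As a small cross-check in R (same illustrative AR(2) coefficients as above): the eigenvalues of the companion matrix of an AR(p) are the reciprocal roots of its characteristic polynomial, so either route can be used to check stationarity.

```r
phi <- c(0.5, -0.3)                 # illustrative AR(2) coefficients
A <- rbind(phi, c(1, 0))            # companion matrix of the AR(2)
ev <- eigen(A)$values               # eigenvalues = reciprocal roots
Mod(ev)
all(Mod(ev) < 1)                    # stationarity: reciprocal roots inside the unit circle
sort(Mod(1 / polyroot(c(1, -phi))))  # matches the characteristic polynomial roots
```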

Unit Roots

A unit root is a root of the characteristic polynomial of an autoregressive model that is equal to 1. The presence of a unit root indicates that the model is not stationary. A unit root test is a statistical test of the null hypothesis that the time series has a unit root against the alternative hypothesis that it is stationary, and it is an important tool in time series analysis.
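A hedged sketch in R, using adf.test (the augmented Dickey-Fuller test) from the tseries package, which I am assuming is installed; the course itself does not rely on it:

```r
library(tseries)  # assumed installed: install.packages("tseries")

set.seed(1)
rw <- cumsum(rnorm(200))                          # random walk: has a unit root
st <- arima.sim(model = list(ar = 0.5), n = 200)  # stationary AR(1)

adf.test(rw)  # large p-value: cannot reject the unit-root null
adf.test(st)  # small p-value: reject the null in favour of stationarity
```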

Spectral analysis (1898)

The power spectrum of a signal is the squared absolute value of its Fourier transform. When estimated from the discrete Fourier transform it is also called the periodogram, and it is usually computed with a fast Fourier transform (FFT) algorithm.
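In base R, for example, spec.pgram computes the raw periodogram via the FFT (simulated data, my own example):

```r
set.seed(1)
x <- arima.sim(model = list(ar = c(0.5, -0.3)), n = 512)  # quasi-periodic AR(2)
sp <- spec.pgram(x, taper = 0, plot = FALSE)              # raw periodogram via FFT
sp$freq[which.max(sp$spec)]                               # frequency of the spectral peak
```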

Yule-Walker Equations (1932)

The Yule-Walker equations are treated together with the Durbin-Levinson recursion below.

Durbin-Levinson recursion (Off-Course Reading)

Like me, you might be curious about the Durbin-Levinson recursion mentioned above. This is not covered in the course, and turned out to be an enigma wrapped in a mystery.

I present my findings in the note below; much of it is due to (Wikipedia contributors 2024b) and (Wikipedia contributors 2024a).

In (Yule 1927) and (Walker 1931), Yule and Walker proposed a method for estimating the parameters of an autoregressive model. The method is based on the Yule-Walker equations, a set of linear equations relating the AR parameters to the autocovariances of the process.

Due to the autoregressive nature of the model, the equations take a special form: the coefficient matrix is a Toeplitz matrix. However, at the time they probably had to use the numerically unstable Gauss-Jordan elimination to solve these equations, which is O(n^3) in time complexity.

A decade or two later, in (Levinson 1946) and (Durbin 1960), the authors came up with a weakly stable yet more efficient algorithm for solving these autocorrelated systems of equations, requiring only O(n^2) time. Their work was later refined in (Trench 1964) and (Zohar 1969) to just 3n^2 multiplications. A cursory search reveals that Toeplitz matrix inversion is still an area of active research, with papers covering parallel algorithms and stability studies. This is not surprising, as many of the more interesting deep learning models, including LLMs, are autoregressive.

So the Durbin-Levinson recursion is just an elegant bit of linear algebra for solving the Yule-Walker equations more efficiently.

Here is what I dug up:

Durbin-Levinson and the Yule-Walker equations (Off-Course Reading)

The Durbin-Levinson recursion is a method in linear algebra for computing the solution to an equation involving a Toeplitz matrix, AKA a diagonal-constant matrix, in which each descending diagonal is constant. The recursion runs in O(n^2) time rather than the O(n^3) time required by Gauss-Jordan elimination.

The recursion can be used to compute the coefficients of the autoregressive model of a stationary time series. It is based on the Yule-Walker equations and is used to compute the PACF of a time series.

The Yule-Walker equations can be stated as follows for an AR(p) process:

\gamma_m = \sum_{k=1}^p \phi_k \gamma_{m-k} + \sigma_\epsilon^2\delta_{m,0} \qquad \text{(Yule-Walker equations)} \tag{2}

where:

  • \gamma_m is the autocovariance function of the time series,
  • \phi_k are the AR coefficients,
  • \sigma_\epsilon^2 is the variance of the white noise process, and
  • \delta_{m,0} is the Kronecker delta function.

When m = 0 the equation gives the variance (note that \gamma_{-k} = \gamma_k for a stationary process):

\gamma_0 = \sum_{k=1}^p \phi_k \gamma_{-k} + \sigma_\epsilon^2 \qquad \text{(Yule-Walker equations for m=0)} \tag{3}

For m > 0, the equations can be written in matrix form:

\begin{bmatrix} \gamma_1 \\ \gamma_2 \\ \gamma_3 \\ \vdots \\ \gamma_p \end{bmatrix} = \begin{bmatrix} \gamma_0 & \gamma_{-1} & \gamma_{-2} & \cdots \\ \gamma_1 & \gamma_0 & \gamma_{-1} & \cdots \\ \gamma_2 & \gamma_1 & \gamma_0 & \cdots \\ \vdots & \vdots & \vdots & \ddots \\ \gamma_{p-1} & \gamma_{p-2} & \gamma_{p-3} & \cdots \end{bmatrix} \begin{bmatrix} \phi_{1} \\ \phi_{2} \\ \phi_{3} \\ \vdots \\ \phi_{p} \end{bmatrix}

and since this matrix is Toeplitz, we can use Durbin-Levinson recursion to efficiently solve the system for \phi_k \forall k.

Once \{\phi_m ; m=1,2, \dots ,p \} are known, we can return to the case m=0 and solve for \sigma_\epsilon^2 by substituting the \phi_k into Equation 3, the Yule-Walker equation for m=0.

Of course, the Durbin-Levinson recursion is not the last word on solving this system of equations; there are today numerous improvements that are both faster and more numerically stable.

The Yule-Walker equations are a set of p linear equations in the p unknowns \phi_1, \phi_2, \ldots, \phi_p that can be used to estimate the parameters of an autoregressive model of order p. They are derived by setting the sample autocovariances equal to the theoretical autocovariances of an AR(p) model and then solving for the unknown parameters. With the variance equation included, they read:

\begin{aligned} \gamma(0) & = \phi_1 \gamma(1) + \phi_2 \gamma(2) + \ldots + \phi_p \gamma(p) + \sigma_\epsilon^2 \\ \gamma(1) & = \phi_1 \gamma(0) + \phi_2 \gamma(1) + \ldots + \phi_p \gamma(p-1) \\ \gamma(2) & = \phi_1 \gamma(1) + \phi_2 \gamma(0) + \ldots + \phi_p \gamma(p-2) \\ \vdots \\ \gamma(p) & = \phi_1 \gamma(p-1) + \phi_2 \gamma(p-2) + \ldots + \phi_p \gamma(0) \\ \end{aligned}

where \gamma(k) is the sample autocovariance at lag k. The equations for lags 1 through p can be solved using matrix algebra to obtain the estimates of the AR parameters \phi_1, \phi_2, \ldots, \phi_p.
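Here is a minimal sketch of the recursion in R (my own reading of the algorithm; the function and variable names are mine). Given the autocovariances \gamma(0), \ldots, \gamma(p) it returns the AR coefficients, the PACF values \phi_{m,m}, and the innovation variance. In practice base R's ar.yw does the same job.

```r
# Durbin-Levinson recursion: solve the Yule-Walker equations in O(p^2).
# gamma is the vector (gamma(0), gamma(1), ..., gamma(p)).
durbin_levinson <- function(gamma) {
  p <- length(gamma) - 1
  phi <- matrix(0, p, p)   # phi[m, k]: k-th coefficient of the order-m fit
  v <- numeric(p + 1)      # v[m + 1]: prediction error variance at order m
  v[1] <- gamma[1]
  phi[1, 1] <- gamma[2] / gamma[1]
  v[2] <- v[1] * (1 - phi[1, 1]^2)
  if (p > 1) for (m in 2:p) {
    # reflection coefficient = PACF at lag m
    k <- (gamma[m + 1] - sum(phi[m - 1, 1:(m - 1)] * gamma[m:2])) / v[m]
    phi[m, m] <- k
    phi[m, 1:(m - 1)] <- phi[m - 1, 1:(m - 1)] - k * phi[m - 1, (m - 1):1]
    v[m + 1] <- v[m] * (1 - k^2)
  }
  list(phi = phi[p, ], pacf = diag(phi), sigma2 = v[p + 1])
}

# Check against base R's Yule-Walker fitter on a simulated AR(2):
set.seed(42)
x <- arima.sim(model = list(ar = c(0.75, -0.5)), n = 5000)
g <- acf(x, type = "covariance", lag.max = 2, plot = FALSE)$acf[, 1, 1]
durbin_levinson(g)$phi                   # roughly (0.75, -0.5)
ar.yw(x, order.max = 2, aic = FALSE)$ar  # base R agrees
```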

Wold’s theorem (extracurricular) circa 1938

In the 1920s, Yule and Eugen Slutsky were researching time series, and they came up with two different ways to represent one.

  • Yule’s researches led to the notion of the autoregressive scheme. \begin{aligned} Y_{t} & = \sum _{j=1}^{p} \phi _{j} Y_{t-j} + u_{t} \end{aligned} \tag{4}

  • Slutsky’s researches led to the notion of a moving average scheme. \begin{aligned} Y_{t} & =\sum _{j=0}^{q} \theta _{j} u_{t-j} \end{aligned} \tag{5}

We can use the two schemes together to get the ARMA(p,q) model:

\begin{aligned} Y_{t} & = \sum _{j=1}^{p} \phi _{j} Y_{t-j} + u_{t} + \sum _{j=1}^{q} \theta _{j} u_{t-j} \end{aligned} \tag{6}

The following is adapted from the Wikipedia article on Wold’s theorem at https://en.wikipedia.org/wiki/Wold%27s_theorem

Wold’s decomposition, AKA the Wold representation theorem, states that:

Every covariance-stationary time series Y_{t} can be written as the sum of two time series, one deterministic and one stochastic.

Formally:

\begin{aligned} Y_{t} & =\sum _{j=0}^{\infty } \underbrace{b_{j}\epsilon _{t-j}}_{\text{stochastic}} + \underbrace{\eta _{t}}_{\text{deterministic}} \end{aligned}

where:

  • {Y_{t}} is the time series being considered,
  • {\epsilon _{t}} is a white noise sequence, called the innovation process, which acts as an input to the linear filter {\{b_{j}\}},
  • {b} is the possibly infinite vector of moving average weights (coefficients or parameters),
  • {\eta _{t}} is a “deterministic” time series, in the sense that it is completely determined as a linear combination of its past values. It may include “deterministic terms” like sine/cosine waves of {t}, but as it is a stochastic process that is also covariance-stationary, it cannot be an arbitrary deterministic process that violates stationarity.

The moving average coefficients have these properties:

  1. Stable, that is, square summable: \sum _{j=1}^{\infty } |b_{j}|^{2} < \infty
  2. Causal (i.e. there are no terms with j < 0)
  3. Minimum delay
  4. Constant (b_j independent of t)
  5. It is conventional to define b_0=1

Any stationary process has this seemingly special representation. Not only is the existence of such a simple linear and exact representation remarkable, but even more so is the special nature of the moving average model.

This result is used, without being named, in the course when we are shown the AR(p) representation in terms of an infinite-order moving average.
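We can see this in R: base R's ARMAtoMA expands an ARMA model into its MA(\infty) weights, and for an AR(1) the Wold coefficients are simply b_j = \phi^j (illustrative \phi below):

```r
phi <- 0.8
ARMAtoMA(ar = phi, ma = 0, lag.max = 8)  # first 8 MA(infinity) weights of the AR(1)
phi^(1:8)                                # identical: geometric decay b_j = phi^j
```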

Kalman Filter (1960)

\begin{aligned} x_{t} & = F_{t} x_{t-1} + G_{t} u_{t} + w_{t} && \text{(transition equation)} \\ y_{t} & = H_{t} x_{t} + v_{t} && \text{(observation equation)} \end{aligned} \tag{7}

where:

  • x_{t} is the state vector at time t,
  • F_{t} is the state transition matrix,
  • G_{t} is the control input matrix,
  • u_{t} is the control vector,
  • w_{t} is the process noise vector,
  • y_{t} is the observation vector at time t,
  • H_{t} is the observation matrix,
  • v_{t} is the observation noise vector.

The Kalman filter is a recursive algorithm that estimates the state of a linear dynamic system from a series of noisy observations. It is based on a linear dynamical system model defined by two equations: the state transition equation, which describes how the state of the system evolves over time, and the observation equation, which describes how the observations are generated from the state. The filter uses these two equations to estimate the state of the system at each time step, based on the observations received up to that step. It could be implemented in real time in the 1960s and was used in the Apollo missions.

The Extended Kalman Filter (EKF) is an extension of the Kalman filter that can be used to estimate the state of a nonlinear dynamic system. The EKF linearizes the nonlinear system model at each time step and then applies the Kalman filter to the linearized system. The EKF is an approximation to the true nonlinear system, and its accuracy depends on how well the linearized system approximates the true system.
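The following is a minimal univariate sketch of Equation 7 in R (my own, not course code), for a local-level model with F_t = H_t = 1, no control term, and constant variances W (state) and V (observation):

```r
kalman_filter <- function(y, W, V, m0 = 0, C0 = 1e6) {
  n <- length(y)
  m <- numeric(n); C <- numeric(n)           # filtered means and variances
  for (t in 1:n) {
    a <- if (t == 1) m0 else m[t - 1]        # predicted state mean
    R <- (if (t == 1) C0 else C[t - 1]) + W  # predicted state variance
    K <- R / (R + V)                         # Kalman gain
    m[t] <- a + K * (y[t] - a)               # update with the forecast error
    C[t] <- (1 - K) * R
  }
  list(m = m, C = C)
}

set.seed(1)
x <- cumsum(rnorm(100, sd = 0.5))  # latent random-walk state
y <- x + rnorm(100, sd = 1)        # noisy observations
f <- kalman_filter(y, W = 0.25, V = 1)
head(f$m)                          # filtered estimates of x_t
```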

Box Jenkins Method (1970)

see Box Jenkins Method

A five-step process for identifying, selecting, and assessing ARMA (and similar) models.
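A compressed illustration of the cycle in base R, on simulated data (the ARMA(1,1) order below is what the ACF/PACF of this particular series suggests):

```r
set.seed(1)
x <- arima.sim(model = list(ar = 0.7, ma = 0.4), n = 500)

acf(x); pacf(x)                      # identification: inspect ACF / PACF
fit <- arima(x, order = c(1, 0, 1))  # estimation: fit the candidate ARMA(1,1)
fit                                  # coefficients and standard errors
Box.test(residuals(fit), lag = 10, type = "Ljung-Box")  # diagnostic checking
predict(fit, n.ahead = 5)            # forecasting with the accepted model
```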

Bayesian Time Series Bibliography

We start with some books from the course; I collected here both the recommended books and some others that I found useful.

Time Series: Modeling, Computation, and Inference

c.f. (Prado, Ferreira, and West 2023)

Time Series: Modeling, Computation, and Inference

  • Title: Time Series: Modeling, Computation, and Inference
  • ISBN: 9781032040042, 1032040041
  • Page count: 452
  • Published: September 2023
  • Format: Paperback
  • Publisher: CRC Press
  • Authors: Raquel Prado, Marco A. R. Ferreira, Mike West

(Prado, Ferreira, and West 2023) “Time Series: Modeling, Computation, and Inference” by course instructor Raquel Prado and coauthors. This book, now in its second edition, is a comprehensive introduction to time series analysis and covers a wide range of topics in time series modeling, computation, and inference. It is suitable for graduate students and researchers in statistics, computer science, and related fields.

While taking this course I found some of the material harder to follow than I expected. The book helped to clarify definitions and so on; however, it is rather comprehensive and mathematically advanced, unlike some other books on statistics.

The teacher frequently points out that many aspects of time series are beyond the scope of the course. Yet this book covers much more ground, such as unequally spaced time series and vector-valued time series.

For example, we look at EKG data, which the authors have been working on for years. In this course we treat it as a univariate time series, while in reality an EKG is usually sampled at 12 sites simultaneously, yielding a multivariate time series.

Once this course is done I will probably want to dive deeper into the subject and try to devote more time to other models in the book.


Bayesian Forecasting and Dynamic Models

c.f. (West and Harrison 2013)

Bayesian Forecasting and Dynamic Models

  • Title: Bayesian Forecasting and Dynamic Models
  • ISBN: 9781475770971, 1475770979
  • Page count: 682
  • Published: March 17, 2013
  • Format: Paperback
  • Publisher: Springer New York
  • Authors: Mike West, Jeff Harrison

(West and Harrison 2013) “Bayesian Forecasting and Dynamic Models” by Mike West and Jeff Harrison. This book is a classic text on Bayesian statistics and covers a wide range of topics in Bayesian forecasting and dynamic models. The following is the description from the publisher:

The use of dynamic models in the forecasting of time series data has a long history, with the development of autoregressive integrated moving average (ARIMA) models and state space models. However, the use of Bayesian methods in the development of dynamic models is a relatively recent development. This book provides a comprehensive introduction to the use of Bayesian methods in the development of dynamic models for forecasting time series data. The book covers a wide range of topics, including the use of dynamic models in the analysis of time series data, the use of Bayesian methods in the development of dynamic models, and the use of dynamic models in the forecasting of time series data.

  • Audience: The book is suitable for graduate students and researchers in statistics, computer science, and related fields.

Practical Time Series Analysis

c.f. (Nielsen 2019)

Practical Time Series Analysis

  • Title: Practical Time Series Analysis: Prediction with Statistics and Machine Learning
  • ISBN: 1492041602, 9781492041603
  • Page count: 504
  • Published: 2019
  • Format: Paperback
  • Publisher: O’Reilly Media, Inc.
  • Author: Aileen Nielsen

(Nielsen 2019) “Practical Time Series Analysis: Prediction with Statistics and Machine Learning” by Aileen Nielsen is a good resource for practitioners getting started with time series analysis. I also recommend any videos by Aileen Nielsen on the subject.

It is a practical guide covering a wide range of topics in time series modeling, computation, and inference, and is suitable for beginners in statistics, computer science, and related fields.

Time series data analysis is increasingly important due to the massive production of such data through the internet of things, the digitalization of healthcare, and the rise of smart cities. As continuous monitoring and data collection become more common, the need for competent time series analysis with both statistical and machine learning techniques will increase.

Covering innovations in time series data analysis and use cases from the real world, this practical guide will help you solve the most common data engineering and analysis challenges in time series, using both traditional statistical and modern machine learning techniques. Author Aileen Nielsen offers an accessible, well-rounded introduction to time series in both R and Python that will have data scientists, software engineers, and researchers up and running quickly.

You’ll get the guidance you need to confidently:

  • Find and wrangle time series data
  • Undertake exploratory time series data analysis
  • Store temporal data
  • Simulate time series data
  • Generate and select features for a time series
  • Measure error
  • Forecast and classify time series with machine or deep learning
  • Evaluate accuracy and performance

“Machine Learning: A Bayesian and Optimization Perspective” by Sergios Theodoridis.

c.f. (Theodoridis 2015)

Machine Learning: A Bayesian and Optimization Perspective

  • Title: Machine Learning: A Bayesian and Optimization Perspective
  • ISBN: 0128015225, 9780128015223
  • Page count: 1062
  • Published: 2015
  • Format: Hardcover
  • Publisher: Academic Press
  • Author: Sergios Theodoridis

I came across this book while looking into the Durbin-Levinson recursion and the Yule-Walker equations. So far I haven’t had time to read it, but it looks like a good book on machine learning. The following is the description from the publisher:

This tutorial text gives a unifying perspective on machine learning by covering both probabilistic and deterministic approaches -which are based on optimization techniques - together with the Bayesian inference approach, whose essence lies in the use of a hierarchy of probabilistic models. The book presents the major machine learning methods as they have been developed in different disciplines, such as statistics, statistical and adaptive signal processing and computer science. Focusing on the physical reasoning behind the mathematics, all the various methods and techniques are explained in depth, supported by examples and problems, giving an invaluable resource to the student and researcher for understanding and applying machine learning concepts.

The book builds carefully from the basic classical methods to the most recent trends, with chapters written to be as self-contained as possible, making the text suitable for different courses: pattern recognition, statistical/adaptive signal processing, statistical/Bayesian learning, as well as short courses on sparse modeling, deep learning, and probabilistic graphical models.

  • All major classical techniques: Mean/Least-Squares regression and filtering, Kalman filtering, stochastic approximation and online learning, Bayesian classification, decision trees, logistic regression and boosting methods.
  • The latest trends: Sparsity, convex analysis and optimization, online distributed algorithms, learning in RKH spaces, Bayesian inference, graphical and hidden Markov models, particle filtering, deep learning, dictionary learning and latent variables modeling.
  • Case studies - protein folding prediction, optical character recognition, text authorship identification, fMRI data analysis, change point detection, hyperspectral image unmixing, target localization, channel equalization and echo cancellation, show how the theory can be applied.
  • MATLAB code for all the main algorithms are available on an accompanying website, enabling the reader to experiment with the code.

Statistical Analysis in Climate Research

c.f.(Storch and Zwiers 2002)

Statistical Analysis in Climate Research

  • Title: Statistical Analysis in Climate Research
  • ISBN: 1139425099, 9781139425094
  • Page count: 484
  • Published: 2002
  • Format: Paperback
  • Publisher: Cambridge University Press
  • Authors: Hans von Storch, Francis W. Zwiers

I came across this book while looking into the Durbin-Levinson recursion and the Yule-Walker equations. So far I haven’t had time to read it, but it looks promising. Here is the description from the publisher:

Climatology is, to a large degree, the study of the statistics of our climate. The powerful tools of mathematical statistics therefore find wide application in climatological research. The purpose of this book is to help the climatologist understand the basic precepts of the statistician’s art and to provide some of the background needed to apply statistical methodology correctly and usefully. The book is self contained: introductory material, standard advanced techniques, and the specialised techniques used specifically by climatologists are all contained within this one source. There are a wealth of real-world examples drawn from the climate literature to demonstrate the need, power and pitfalls of statistical analysis in climate research. Suitable for graduate courses on statistics for climatic, atmospheric and oceanic science, this book will also be valuable as a reference source for researchers in climatology, meteorology, atmospheric science, and oceanography.

  • Hans von Storch is Director of the Institute of Hydrophysics of the GKSS Research Centre in Geesthacht, Germany and a Professor at the Meteorological Institute of the University of Hamburg.

  • Francis W. Zwiers is Chief of the Canadian Centre for Climate Modelling and Analysis, Atmospheric Environment Service, Victoria, Canada, and an Adjunct Professor at the Department of Mathematics and Statistics of the University of Victoria.


Bayesian Modeling and Computation in Python

c.f. (Martin, Kumar, and Lao 2021)

Bayesian Modeling and Computation in Python

This is a great resource for translating what we learned in the course to Python; I found the chapter on state space modeling and the Kalman filter particularly useful. The book is available online at Bayesian Modeling and Computation in Python, and is suitable for undergraduate students in statistics, computer science, and related fields.


Bayesian Data Analysis

c.f. (Gelman et al. 2013)

Bayesian Data Analysis

  • Title: Bayesian Data Analysis
  • ISBN: 1439840954, 9781439840955
  • Page count: 675
  • Published: 2013
  • Format: Hardcover
  • Publisher: Chapman and Hall/CRC
  • Authors: Andrew Gelman, John B. Carlin, Hal S. Stern, David B. Dunson, Aki Vehtari, Donald B. Rubin

(Gelman et al. 2013) “Bayesian Data Analysis” is probably the most famous book on Bayesian statistics: a classic text covering a wide range of topics in Bayesian data analysis. Although it is not a time series book, the authors have long been interested in the domain of political election prediction and have used time series data in their research; some of that is covered in the book’s examples.

  • Audience: The book is suitable for graduate students and researchers in statistics, computer science, and related fields.
  • An electronic version of the third edition is available at Bayesian Data Analysis

Introductory Time Series with R

c.f. (Cowpertwait and Metcalfe 2009)

Introductory Time Series with R

(Cowpertwait and Metcalfe 2009) “Introductory Time Series with R” by Cowpertwait and Metcalfe. The following is the description from the publisher:

Yearly global mean temperature and ocean levels, daily share prices, and the signals transmitted back to Earth by the Voyager space craft are all examples of sequential observations over time known as time series. This book gives you a step-by-step introduction to analysing time series using the open source software R. Each time series model is motivated with practical applications, and is defined in mathematical notation. Once the model has been introduced it is used to generate synthetic data, using R code, and these generated data are then used to estimate its parameters. This sequence enhances understanding of both the time series model and the R function used to fit the model to data. Finally, the model is used to analyse observed data taken from a practical application. By using R, the whole procedure can be reproduced by the reader.

All the data sets used in the book are available on the website at datasets

The book is written for undergraduate students of mathematics, economics, business and finance, geography, engineering and related disciplines, and postgraduate students who may need to analyse time series as part of their taught programme or their research.

  • Paul Cowpertwait is an associate professor in mathematical sciences (analytics) at Auckland University of Technology with a substantial research record in both the theory and applications of time series and stochastic models.

  • Andrew Metcalfe is an associate professor in the School of Mathematical Sciences at the University of Adelaide, and an author of six statistics text books and numerous research papers. Both authors have extensive experience of teaching time series to students at all levels.


Analysis of Integrated and Cointegrated Time Series with R

c.f. (Pfaff 2008)

Analysis of Integrated and Cointegrated Time Series with R

(Pfaff 2008) “Analysis of Integrated and Cointegrated Time Series with R” by Bernhard Pfaff. It’s been a long time since I read this book, and rather than do it an injustice I direct you to the review by Dirk Eddelbuettel in the Journal of Statistical Software, available at review, or the book’s website at Analysis of Integrated and Cointegrated Time Series with R.

The analysis of integrated and co-integrated time series can be considered as the main methodology employed in applied econometrics. This book not only introduces the reader to this topic but enables him to conduct the various unit root tests and co-integration methods on his own by utilizing the free statistical programming environment R. The book encompasses seasonal unit roots, fractional integration, coping with structural breaks, and multivariate time series models. The book is enriched by numerous programming examples to artificial and real data so that it is ideally suited as an accompanying text book to computer lab classes.

The second edition adds a discussion of vector auto-regressive, structural vector auto-regressive, and structural vector error-correction models.

Bayesian Analysis of Time Series by Lyle D. Broemeling

(Broemeling 2019)

  • covers pretty much the material in the course.
  • uses WinBUGS and R
  • models considered include
    • white noise
    • Wiener process (random walk)
    • AR(p)
    • ARMA(p,q)
    • ARIMA
    • Regression
    • Regression with MA and Seasonal effects
    • DLM
    • TAR

Bayesian Inference for Stochastic Processes by Lyle D. Broemeling

Bayesian Inference for Stochastic Processes by Lyle D. Broemeling
  • The code for R and WinBUGS is available at code
  • It is based on WinBUGS, which is a bit dated but still useful.
  • It covers a lot of the material in the course.

Dynamic Time Series Models using R-INLA: An Applied Perspective

(Ravishanker, Raman, and Soyer 2022) is a new book that covers the use of the R-INLA package for fitting dynamic time series models. The book is available online as a gitbook.

Dynamic Time Series Models using R-INLA: An Applied Perspective

This is a very interesting book covering a new approach to fitting time series models using the R-INLA package. INLA stands for Integrated Nested Laplace Approximation, a method for fitting Bayesian models that is faster than MCMC. The book covers a wide range of topics in time series modeling, computation, and inference, and is suitable for graduate students and researchers in statistics, computer science, and related fields.

Statistics for Spatio-Temporal Data

Statistics for Spatio-Temporal Data

(Cressie and Wikle 2011) is a book I came across when I tried to understand the NDLM. NDLMs have a two-level hierarchical form, and it seems possible to extend this formulation with non-normally distributed shocks and possibly nonlinear relations. In this book the authors take an interesting approach: they not only view the NDLM as a hierarchical model but also extend the time series model into a spatio-temporal model.

This book is a comprehensive introduction to the analysis of spatio-temporal data and covers a wide range of topics in spatio-temporal statistics. The book is suitable for graduate students and researchers in statistics, computer science, and related fields.

Bayesian Analysis of Stochastic Process Models

c.f. (Rios Insua, Ruggeri, and Wiper 2012)

Bayesian Analysis of Stochastic Process Models

David Rios Insua, Fabrizio Ruggeri, Michael P. Wiper

This book is a comprehensive introduction to the analysis of stochastic process models using Bayesian methods. The book covers a wide range of topics in stochastic process modeling, computation, and inference. The book is suitable for graduate students and researchers in statistics, computer science, and related fields.

There are also a number of other books on NDLMs that I’ve come across.

References

Broemeling, Lyle D. 2019. Bayesian Analysis of Time Series. CRC Press.
Cowpertwait, P. S. P., and A. V. Metcalfe. 2009. Introductory Time Series with R. Use R! Springer New York. https://books.google.co.il/books?id=QFiZGQmvRUQC.
Cressie, N., and C. K. Wikle. 2011. Statistics for Spatio-Temporal Data. CourseSmart Series. Wiley. https://books.google.co.il/books?id=-kOC6D0DiNYC.
Durbin, J. 1960. “The Fitting of Time-Series Models.” Revue de l’Institut International de Statistique / Review of the International Statistical Institute 28 (3): 233–44. http://www.jstor.org/stable/1401322.
Gelman, A., J. B. Carlin, H. S. Stern, D. B. Dunson, A. Vehtari, and D. B. Rubin. 2013. Bayesian Data Analysis, Third Edition. Chapman & Hall/CRC Texts in Statistical Science. Taylor & Francis. https://books.google.co.il/books?id=ZXL6AQAAQBAJ.
Levinson, Norman. 1946. “The Wiener (Root Mean Square) Error Criterion in Filter Design and Prediction.” Journal of Mathematics and Physics 25 (1-4): 261–78. https://doi.org/10.1002/sapm1946251261.
Martin, Osvaldo A., Ravin Kumar, and Junpeng Lao. 2021. Bayesian Modeling and Computation in Python. Boca Raton.
Nielsen, A. 2019. Practical Time Series Analysis: Prediction with Statistics and Machine Learning. O’Reilly Media. https://books.google.co.il/books?id=xNOwDwAAQBAJ.
Pfaff, B. 2008. Analysis of Integrated and Cointegrated Time Series with R. Use R! Springer New York. https://books.google.co.il/books?id=ca5MkRbF3fYC.
Prado, R., M. A. R. Ferreira, and M. West. 2023. Time Series: Modeling, Computation, and Inference. Chapman & Hall/CRC Texts in Statistical Science. CRC Press. https://books.google.co.il/books?id=pZ6lzgEACAAJ.
Ravishanker, N., B. Raman, and R. Soyer. 2022. Dynamic Time Series Models Using R-INLA: An Applied Perspective. CRC Press. https://books.google.co.il/books?id=e6h6EAAAQBAJ.
Rios Insua, David, Fabrizio Ruggeri, and Michael P Wiper. 2012. Bayesian Analysis of Stochastic Process Models. John Wiley & Sons.
Storch, H. von, and F. W. Zwiers. 2002. Statistical Analysis in Climate Research. Cambridge University Press. https://books.google.co.il/books?id=bs8hAwAAQBAJ.
Theodoridis, S. 2015. Machine Learning: A Bayesian and Optimization Perspective. Elsevier Science. https://books.google.co.il/books?id=hxQRogEACAAJ.
Trench, William F. 1964. “An Algorithm for the Inversion of Finite Toeplitz Matrices.” Journal of the Society for Industrial and Applied Mathematics 12 (3): 515–22. http://ramanujan.math.trinity.edu/wtrench/research/papers/TRENCH_RP_6.PDF.
Walker, Gilbert Thomas. 1931. “On Periodicity in Series of Related Terms.” Proceedings of the Royal Society of London. Series A, Containing Papers of a Mathematical and Physical Character 131 (818): 518–32. https://doi.org/10.1098/rspa.1931.0069.
West, M., and J. Harrison. 2013. Bayesian Forecasting and Dynamic Models. Springer Series in Statistics. Springer New York. https://books.google.co.il/books?id=NmfaBwAAQBAJ.
Wikipedia contributors. 2024a. “Autoregressive Model — Wikipedia, the Free Encyclopedia.” https://en.wikipedia.org/w/index.php?title=Autoregressive_model&oldid=1233171855#Estimation_of_AR_parameters.
———. 2024b. “Levinson Recursion — Wikipedia, the Free Encyclopedia.” https://en.wikipedia.org/w/index.php?title=Levinson_recursion&oldid=1229942891.
Yule, George Udny. 1927. “VII. On a Method of Investigating Periodicities in Disturbed Series, with Special Reference to Wolfer’s Sunspot Numbers.” Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character 226 (636-646): 267–98. https://doi.org/10.1098/rsta.1927.0007.
Zohar, Shalhav. 1969. “Toeplitz Matrix Inversion: The Algorithm of W. F. Trench.” J. ACM 16: 592–601. https://api.semanticscholar.org/CorpusID:3115290.

Reuse

CC SA BY-NC-ND

Citation

BibTeX citation:
@online{bochman2024,
  author = {Bochman, Oren},
  title = {Week 0: {Introductions} to Time Series Analysis and the
    {AR(1)} Process},
  date = {2024-10-22},
  url = {https://orenbochman.github.io/notes/bayesian-ts/module0.html},
  langid = {en}
}
For attribution, please cite this work as:
Bochman, Oren. 2024. “Week 0: Introductions to Time Series Analysis and the AR(1) Process.” October 22, 2024. https://orenbochman.github.io/notes/bayesian-ts/module0.html.