We propose a hidden Markov model for longitudinal multivariate continuous responses, accounting for missing data under the missing at random assumption. Maximum likelihood estimation of this model is carried out through the Expectation-Maximization algorithm. To address the problem of dimensionality reduction, we develop a greedy search algorithm based on the Bayesian Information Criterion. We illustrate the proposal through a dataset collected by the World Bank and UNESCO Institute for Statistics on the basis of which we dynamically cluster countries according to the selected variables observed during the period 2000-2017.

A hidden Markov model for variable selection with missing values

Francesco Bartolucci;Silvia Pandolfi
2021

Abstract

We propose a hidden Markov model for longitudinal multivariate continuous responses, accounting for missing data under the missing at random assumption. Maximum likelihood estimation of this model is carried out through the Expectation-Maximization algorithm. To address the problem of dimensionality reduction, we develop a greedy search algorithm based on the Bayesian Information Criterion. We illustrate the proposal through a dataset collected by the World Bank and UNESCO Institute for Statistics on the basis of which we dynamically cluster countries according to the selected variables observed during the period 2000-2017.
2021
9788891927361
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11391/1522035
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact