Chargement Évènements

Cet évènement est passé.

Distributional Matrix Completion via Nearest Neighbors in the Wasserstein Space

10 mars - 15 h 30 min

Campus St Priest (860 Rue Saint Priest 34095 Montpellier Cedex 5), bat. 5, Room: 02.124
Machine Learning in Montpellier, Theory & Practice
Jacob Feitelberg

We study the problem of distributional matrix completion: Given a sparsely observed matrix of empirical distributions, we seek to impute the true distributions associated with both observed and unobserved matrix entries. This is a generalization of traditional matrix completion where the observations per matrix entry are scalar-valued. To do so, we utilize tools from optimal transport to generalize the nearest neighbors method to the distributional setting. Under a suitable latent factor model on probability distributions, we establish that our method recovers the distributions in the Wasserstein metric. We demonstrate through simulations that our method (i) provides better distributional estimates for an entry compared to using observed samples for that entry alone, (ii) yields accurate estimates of distributional quantities such as standard deviation and value-at-risk, and (iii) inherently supports heteroscedastic distributions. In addition, we demonstrate our method on a real-world quarterly earnings predictions dataset. We also prove novel asymptotic results for Wasserstein barycenters over one-dimensional distributions.

Visio

Catégories :