Leveraging Relational Autocorrelation with Latent Group Models
Neville, J. and D. Jensen (2005). Leveraging relational autocorrelation with latent group models. Proceedings of the 5th IEEE International Conference on Data Mining.
- Abstract
The presence of autocorrelation provides strong motivation for using
relational techniques for learning and inference. Autocorrelation
is a statistical dependency between the values of the same variable
on related entities and is a nearly ubiquitous characteristic of
relational data sets. Recent research has explored the use of collective
inference techniques to exploit this phenomenon. These techniques
achieve significant performance gains by modeling observed
correlations among class labels of related instances, but the models
fail to capture a frequent cause of autocorrelation—the presence
of underlying groups that influence the attributes on a set of entities.
We propose a latent group model (LGM) for relational data,
which discovers and exploits the hidden structures responsible for
the observed autocorrelation among class labels. Modeling the latent
group structure improves model performance, increases inference
efficiency, and enhances our understanding of the datasets.
We evaluate performance on three relational classification tasks and
show that LGM outperforms models that ignore latent group structure,
particularly when there is little information with which to seed
inference.
- Text
- A PDF version of this paper is available.