A brief survey of machine learning methods for classification in networked data and an application to suspicion scoring
[pdf] (lncs version)
[pdf] (workshop version)
E.M. Airoldi et al. (Eds.): ICML 2006 Ws, LNCS 4503, pp. 172-175. Springer-Verlag.
Originally appeared as a poster at the Workshop on Statistical Network Learning at 23rd International Conference on Machine Learning (ICML 2006), Pittsburgh, 29 June, 2006.
Abstract
This paper surveys work from the field of machine learning on
the problem of within-network learning and inference. To give
motivation and context to the rest of the survey, we start by
presenting some (published) applications of within-network inference.
After a brief formulation of this problem and a discussion of
probabilistic inference in arbitrary networks, we survey machine
learning work applied to networked data, along with some important
predecessors---mostly from the statistics and pattern recognition
literature. We then describe an application of within-network
inference in the domain of suspicion scoring in social networks. We
close the paper with pointers to toolkits and benchmark data sets used
in machine learning research on classification in network data. We
hope that such a survey will be a useful resource to workshop
participants, and perhaps will be complemented by others.