Gaussian mixture models with equivalence constraints

Noam Shental, Aharon Bar-Hillel, Tomer Hertz, Daphna Weinshall

نتاج البحث: فصل من :كتاب / تقرير / مؤتمرفصلمراجعة النظراء

ملخص

Abstract Gaussian Mixture Models (GMMs) have been widely used to cluster data in an unsupervised manner via the Expectation Maximization (EM) algorithm. In this chapter we suggest a semi-supervised EM algorithm that incorporates equivalence constraints into a GMM. Equivalence constraints provide information about pairs of data points, indicating whether the points arise from the same source (a must-link constraint) or from different sources (a cannot-link constraint). These constraints allow the EM algorithm to converge to solutions that better reflect the class structure of the data. Moreover, in some learning scenarios equivalence constraints can be gathered automatically while they are a natural form of supervision in others. We present a closed form EM algorithm for handling must-link constraints, and a generalized EM algorithm using a Markov network for incorporating cannotlink constraints. Using publicly available data sets, we demonstrate that incorporating equivalence constraints leads to a considerable improvement in clustering performance. Our GMM-based clustering algorithm significantly outperforms two other available clustering methods that use equivalence con-Mixture models are a powerful tool for probabilistic modelling of data, which have been widely used in various research areas such as pattern recognition, machine learning, computer vision, and signal processing [13, 14, 18]. Such models provide a principled probabilistic approach to cluster data in an unsupervised manner [24, 25, 30, 31]. In addition, their ability to represent complex density functions has also made them an excellent choice in density estimation problems [20, 23].

اللغة الأصليةالإنجليزيّة
عنوان منشور المضيفConstrained Clustering
العنوان الفرعي لمنشور المضيفAdvances in Algorithms, Theory, and Applications
ناشرCRC Press
الصفحات33-58
عدد الصفحات26
رقم المعيار الدولي للكتب (الإلكتروني)9781584889977
رقم المعيار الدولي للكتب (المطبوع)9781584889960
حالة النشرنُشِر - 1 يناير 2008
منشور خارجيًانعم

ملاحظة ببليوغرافية

Publisher Copyright:
© 2008, CRC Press. All rights reserved.

بصمة

أدرس بدقة موضوعات البحث “Gaussian mixture models with equivalence constraints'. فهما يشكلان معًا بصمة فريدة.

قم بذكر هذا