- 1、本文档共8页,可阅读全部内容。
- 2、有哪些信誉好的足球投注网站(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
Computing gaussian mixture models with EM using equivalence constraints
Computing Gaussian Mixture Models with EM using
Side-Information
Noam Shental fenoam@cs.huji.ac.il
Aharon Bar-Hillel aharonbh@cs.huji.ac.il
Tomer Hertz tomboy@cs.huji.ac.il
Daphna Weinshall daphna@cs.huji.ac.il
School of Computer Science and Engineering and the Center for Neural Computation, The Hebrew University
of Jerusalem, 91904 Jerusalem, ISRAEL
Abstract
Estimation of Gaussian mixture models is an
efficient and popular technique for clustering
and density estimation. An EM procedure is
widely used to estimate the model parame-
ters. In this paper we show how side informa-
tion in the form of equivalence constraints can
be incorporated into this procedure, leading
to improved clustering results. Equivalence
constraints are prior knowledge concerning
pairs of data points, indicating if the points
arise from the same source (positive con-
straint) or from different sources (negative
constraint). Such constraints can be gath-
ered automatically in some learning prob-
lems, and are a natural form of supervision
in others. We present a closed form EM
procedure for handling positive constraints,
and a Generalized EM procedure using a
Markov net for the incorporation of negative
constraints. Using publicly available data
sets we demonstrate that such side informa-
tion may lead to considerable improvement
in clustering tasks, and that our algorithm is
preferable to another suggested method using
this type of side information.
Keywords: Learning from partial knowledge, semi-
supervised learning, Gaussian mixture models, clus-
tering.
1. Introduction
We are used to thinking about learning from labels as
supervised learning, and learning without labels as un-
supervised learning, where ’supervised’ implies a need
for human intervention. However, in unsupervised
learning we are not limited to using data statistics
only. Similarly supervised learning is not limited to
using labels. In this work we focus on semi-supervised
learning using side-information, which is not given
您可能关注的文档
- CAT809RTBI.pdf
- CAT809_810_cn.pdf
- CAT24C16WI-GT3;CAT24C16YI-GT3;CAT24C16LI-G;CAT24C16WI-G;CAT24C16YI-G;中文规格书,Datasheet资料.pdf
- cat4900m_sol.pdf
- catalogue_titanocene.pdf
- CAT9555_I2C.pdf
- CAT735B卡特彼勒铰接式自卸车用户手册.pdf
- Categorical perception of human female physical attractiveness and health.pdf
- Catalytic conversion of waste tyres into valuable hydrocarbons.pdf
- Cathodic reductive dissolution and surface adsorption behavior of ocean manganese nodules.pdf
文档评论(0)