数据降维和聚类中的若干问题研究
数据降维和聚类中的若干问题研究封面图

数据降维和聚类中的若干问题研究

赵建华, 著

出版社:中国统计出版社

年代:2011

定价:26.0

书籍简介:

对数据降维和聚类方法进行概率建模是数据挖掘和机器学习的一个重要研究领域。本书主要研究概率建模中的三个问题:模型的建立、估计及模型选择,介绍了在这些方面新近取得的一些研究成果。

书籍目录:

1 Introduction

1.1 PCA and Latent Variable Models

1.1.1 PCA

1.1.2 Latent Variable Models

1.1.3 FA and PPCA

1.2 Motivations and Contributions

1.3 Organization of the Book

2 ML Estimation for Factor Analysis: EM or non-EM

2.1 Introduction

2.2 FA Model and Three Estimation Algorithms

2.2.1 FA model

2.2.2 Lawley (1940)'s simple iteration algorithm

2.2.3 EM type algorithms

2.3 TheECME2 algorithm

2.3.1 The maximization in the first CM-step

2.3.2 The maximization in the second CM-step

2.3.3 Practical consideration

2.3.4 ECME2 vs. simpleiteration algorithm

2.4 The CMAlgorithm

2.4.1 The maximizationin the second CM-step

2.4.2 When will conditionlbe satisfied

2.4.3 Recursive computation ofthe matrix Bz

2.4.4 On the nature of stationary points

2.5 Simulations

2.5.1 Simulation Data

2.5.2 Performance Analysis

2.5.3 On different starting values

2.6 Conclusion and Future Work

2.7 Appendix

2.7.1 Proofs

2.7.2 Some Notes

3 Fast ML estimation for the Mixture of Factor Analyzers via an ECM Algorithm

3.1 Introduction

3.2 MFA model and an ECM algorithm

……

4 Mixture Model Selection:BIC or Hierarchical BIC

5 A Note on Variational Bayesian Factor Analysis

6 Bilinear Probabilistic Principal Component Analysis

7 Conclusions and discussions

References

内容摘要:

A central research area in data mining and machine learning is probabilis-tic modeling because it has a number of advantages over non-probabilistic methods. Given a probabilistic model, one could fit the model using max-imum likelihood (ML) method or Variational Bayesian (VB) method. In ML method, (1) many algorithms may converge very slowly and thus com- putationally efficient algorithms are often desirable; and (2) the choice of a suitable modelis difficult though many model selection criteria exist and thus criteria with higher accuracy are desired. In VB method, employing
different priors may yield different performances and thus studies on how to choose a suitable prior are important. In this book, three sub-topics were studied: Modeling, Estimation and Model selection for dimension reduc- ition and clustering.

书籍规格:

书籍详细信息
书名数据降维和聚类中的若干问题研究站内查询相似图书
9787503763076
如需购买下载《数据降维和聚类中的若干问题研究》pdf扫描版电子书或查询更多相关信息,请直接复制isbn,搜索即可全网搜索该ISBN
出版地北京出版单位中国统计出版社
版次1版印次1
定价(元)26.0语种英文
尺寸23 × 17装帧平装
页数印数

书籍信息归属:

数据降维和聚类中的若干问题研究是中国统计出版社于2011.8出版的中图分类号为 O21 的主题关于 概率-数学模型-研究-英文 的书籍。