- 1、本文档共31页,可阅读全部内容。
- 2、有哪些信誉好的足球投注网站(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
数据挖掘导论英文chap1_intro
(C) Vipin Kumar, CSci 5980 Data Mining, Spring 2004 (C) Vipin Kumar, Parallel Issues in Data Mining, VECPAR 2002 Data Mining: Introduction Why Mine Data? Commercial Viewpoint Lots of data is being collected and warehoused Web data, e-commerce purchases at department/grocery stores Bank/Credit Card transactions Computers have become cheaper and more powerful Competitive Pressure is Strong Provide better, customized services for an edge (e.g. in Customer Relationship Management) Why Mine Data? Scientific Viewpoint Data collected and stored at enormous speeds (GB/hour) remote sensors on a satellite telescopes scanning the skies microarrays generating gene expression data scientific simulations generating terabytes of data Traditional techniques infeasible for raw data Data mining may help scientists in classifying and segmenting data in Hypothesis Formation Mining Large Data Sets - Motivation There is often information “hidden” in the data that is not readily evident Human analysts may take weeks to discover useful information Much of the data is never analyzed at all What is Data Mining? Many Definitions Non-trivial extraction of implicit, previously unknown and potentially useful information from data Exploration analysis, by automatic or semi-automatic means, of large quantities of data in order to discover meaningful patterns What is (not) Data Mining? Origins of Data Mining Draws ideas from machine learning/AI, pattern recognition, statistics, and database systems Traditional Techniquesmay be unsuitable due to Enormity of data High dimensionality of data Heterogeneous, distributed nature of data Data Mining Tasks Prediction Methods Use some variables to predict unknown or future values of other variables. Description Methods Find human-interpretable patterns that describe the data. Data Mining Tasks... Classification [Predictive] Clustering [Descriptive] Association Rule Discovery [Descriptive] Sequential Pattern Discovery [Descriptive] Re
您可能关注的文档
最近下载
- 第六单元名著导读《简爱》作业设计 统编版语文九年级下册.pdf VIP
- 2025年高一物理寒假衔接讲练 (人教版)02 自由落体和竖直上抛(教师版).docx VIP
- 道德发展心理学.pdf VIP
- 福克斯特Scarlett 4i4 3rd Gen用户说明书.pdf
- 部编人教版小学语文5年级下册全册教学课件.pptx
- 人教版二年级口算题1000题大全.pdf
- 2025年高一物理寒假衔接讲练 (人教版)第02讲 小船渡河和关联速度(教师版).docx VIP
- 2025年高一物理寒假衔接讲练 (人教版)第02讲 共点力的平衡(教师版).docx VIP
- 2025年高一物理寒假衔接讲练 (人教版)第03讲 抛体运动的规律(教师版).docx VIP
- 酒店的薪酬管理制度.docx VIP
文档评论(0)