数据泛化 data cub computation and data generalization数据泛化 data cube computation and data generalization数据泛化 data cube computation and data generalization数据泛化 data cube computation and data generalization.ppt
- 1、本文档共85页,可阅读全部内容。
- 2、有哪些信誉好的足球投注网站(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
Data Mining: Concepts and Techniques Data Mining: Concepts and Techniques — Chapter 4 — Chapter 4: Data Cube Computation and Data Generalization Efficient Computation of Data Cubes Exploration and Discovery in Multidimensional Databases Attribute-Oriented Induction ─ An Alternative Data Generalization Method Efficient Computation of Data Cubes Computing full/iceberg cubes: 3 methodologies bottom-up: Multi-Way array aggregation (Zhao, Deshpande Naughton, SIGMOD’97) top-down: Bottom-up computation: BUC (Beyer Ramarkrishnan, SIGMOD’99) H-cubing technique (Han, Pei, Dong Wang: SIGMOD’01) Integrating Top-Down and Bottom-Up: Star-cubing algorithm (Xin, Han, Li Wah: VLDB’03) High-dimensional OLAP: A Minimal Cubing Approach (Li, et al. VLDB’04) Computing alternative kinds of cubes: Partial cube, closed cube, approximate cube, etc. Preliminary Tricks (Agarwal et al. VLDB’96) Sorting, hashing, and grouping operations are applied to the dimension attributes in order to reorder and cluster related tuples Aggregates may be computed from previously computed aggregates, rather than from the base fact table Smallest-child: computing a cuboid from the smallest, previously computed cuboid Cache-results: caching results of a cuboid from which other cuboids are computed to reduce disk I/Os Preliminary Tricks (Agarwal et al. VLDB’96) Amortize-scans: computing as many as possible cuboids at the same time to amortize disk reads Share-sorts: sharing sorting costs cross multiple cuboids when sort-based method is used Share-partitions: sharing the partitioning cost across multiple cuboids when hash-based algorithms are used Multi-Way Array Aggregation Array-based “bottom-up” algorithm Simultaneous aggregation on multiple dimensions Intermediate aggregate values are re-used for computing ancestor cuboids Cannot do Apriori pruning: No iceberg optimization Multi-way Array Aggregation for Cube Computation (MOLAP) Partition arrays into chunks Compressed sparse array addressing: (ch
您可能关注的文档
- 试析对招投标违法违规若干问研究及对策试析对招投标违法违规若干问题研究及对策试析对招投标违法违规若干问题研究及对策试析对招投标违法违规若干问题研究及对策.doc
- 视力和视力检测共章视力和视检测共章力检测共章.doc
- 试题组库--烃和卤代烃(有案)试题组库--烃和卤代烃(有答案)试题组库--烃和卤代烃(有答案)试题组库--烃和卤代烃(有答案).doc
- 视听新媒体蓝皮书解读:视听成新动力视听新媒体蓝皮书解读:视听+成新动力视听新媒体蓝皮书解读:视听+成新动力视听新媒体蓝皮书解读:视听+成新动力.doc
- 视频剪辑大赛策划书视频剪辑赛策划书大赛策划书.doc
- 视频解盘20130314视解盘20130314视频解盘20130314视频解盘20130314.ppt
- 适合学生看的电影适合学生看电影的电影.doc
- 适度是高中新课程教学的一个要原则 ----高中思想政治新课适度是高中新课程教学的一个重要原则 ----高中思想政治新课适度是高中新课程教学的一个重要原则 ----高中思想政治新课适度是高中新课程教学的一个重要原则 ----高中思想政治新课.ppt
- 适应新常态 把握新常态适新常态 把握新常态适应新常态 把握新常态适应新常态 把握新常态.doc
- 视听说sts4-2视听说ss4-2ts4-2.doc
最近下载
- 企业家代表发言稿.docx VIP
- 文言文阅读第一课(讲义)(学生版)-2025年高考语文一轮复习(新教材新高考).pdf VIP
- 总经理年度绩效考核表.xlsx
- 兰州理工大学2023-2024学年第1学期《高等数学(上)》期末考试试卷(A卷)附参考答案.pdf
- Unit2Reading学案-高一上学期英语牛津译林版(2020)必修第二册(有答案).docx VIP
- 2024年谈心谈话记录范文(四篇).docx VIP
- 青川县硅矿调查报告要点.doc
- 某某医院医疗质量与医疗安全奖惩考核制度.pdf VIP
- 人教版四年级数学上册第五单元《平行四边形和梯形》(大单元教学设计).docx VIP
- 癫痫患者的护理查房.pptx VIP
文档评论(0)