hi,小慕
课程

中国大学MOOC,为你提供一流的大学教育

hi,小mooc
期末考试会员
认证学习
数据仓库与数据挖掘
第8次开课
开课时间: 2025年02月24日 ~ 2025年07月15日
学时安排: 3-5小时每周
进行至第21周,共21周 已有 383 人参加
认证学习
认证成绩和证书
智能问答和解析
视频学习辅助
立即参加
课程详情
课程评价(68)
spContent=大数据时代亟需数据仓库与数据挖掘等技术集聚和挖掘数据资源,开发和释放数据蕴含的巨大价值,以数据竞争力支撑国家发展,以数据生产力推动社会进步。通过该课程的学习,你可以掌握数据仓库和数据挖掘的基础理论与相关工程技术,实现海量数据的采集、清理、存储、分析与挖掘。 In the era of big data, technologies such as data warehouse and data mining are urgently needed to gather and mine data resources, develop and release the huge value of data, support national development with data competitiveness, and promote social progress with data productivity. Through the study of this course, you can master the basic theories and related engineering techniques of data warehouse and data mining, and realize the collection, cleaning, storage, analysis and mining of massive data.
大数据时代亟需数据仓库与数据挖掘等技术集聚和挖掘数据资源,开发和释放数据蕴含的巨大价值,以数据竞争力支撑国家发展,以数据生产力推动社会进步。通过该课程的学习,你可以掌握数据仓库和数据挖掘的基础理论与相关工程技术,实现海量数据的采集、清理、存储、分析与挖掘。 In the era of big data, technologies such as data warehouse and data mining are urgently needed to gather and mine data resources, develop and release the huge value of data, support national development with data competitiveness, and promote social progress with data productivity. Through the study of this course, you can master the basic theories and related engineering techniques of data warehouse and data mining, and realize the collection, cleaning, storage, analysis and mining of massive data.
—— 课程团队
课程概述

《数据仓库与数据挖掘》在线课程注重理论联系实践,理论为经,应用为纬。立足数据,在统一框架内介绍数据仓库和数据挖掘技术,主要包括数据概念、数据仓库模型、知识类型,数据预处理、数据分类、数据回归、关联挖掘、数据聚类、异常检测、数据可视化等方法,以及大数据挖掘平台的设计与实现。通过学习,学生可以掌握海量数据仓库存储与挖掘的基本原理,利用数据预处理、关联规则挖掘、聚类分析、分类挖掘、异常检测等算法,研制软件工具,解决实际工程中海量数据的高效管理与深度利用问题。该课程为学生今后从事科学研究工作或从事各种数据利用工作提供必要的基础理论和基本技能。


The online course "Data Warehouse and Data Mining" focuses on the connection of theory with practice, with theory as warp and application as weft. Based on data, data warehouse and data mining technology is introduced within a unified framework, including data concepts, data warehouse models, knowledge types, data preprocessing, data classification, data regression, association mining, data clustering, anomaly detection, data visualization and so on, as well as the design and implementation of a big data mining platform. By learning the course, you can master the basic principles of massive data warehouse storage and mining, and further take advantage of data preprocessing, association rule mining, cluster analysis, classification mining, anomaly detection and other algorithms to develop software tools to solve the problems on efficient management and in-depth utilization of massive data in actual projects. This course provides the necessary basic theories and basic skills for students to engage in scientific research or engage in various data utilization tasks in the future.


课程大纲

1 Introduction

1. What Is Data Mining and Why Data Mining

2.Data Mining Process

3. Data to be Mined

4. Data Mining Tasks

5.Evaluation of Knowledge

Test 1

2 Data

Data Objects and Attribute Types

Basic Statistical Descriptions of Data

Measuring Data Similarity and Dissimilarity

Test 2

3 Data Preprocessing

Overview

Data Cleaning

Data Integration

Data Transformation

Data Reduction

Test 3

4 Association Rule Mining

Basic Concept

Frequent Itemset Generation

Rule Generation

Factors Affecting Complexity of Apriori

Compact Representation of Frequent Itemsets

Pattern Evaluation

Test 4

5 Classification

Classification: Basic Concepts

Decision Tree Induction

Bayes Classification Methods

Techniques to Improve Classification Accuracy: Ensemble Methods

Classification of Class-Imbalanced Data Sets

Model Evaluation and Selection

Test 5

6 Cluster Analysis

An Introduction

Partitioning Methods

Hierarchical Methods

Density- and Grid-Based Methods

Evaluation of Clustering

Test 6

7 Outlier Analysis

Outlier and Outlier Analysis

Outlier Detection Methods

Statistical Approaches

Proximity-Based Approaches

Clustering-Based and Classification–Based Approaches

Test 7

8 Data visualization

Introduction

Function of Data Visualization

Data Visualization Methods

Tools of Data Visualization

Test 8

9 Data warehouse

An Introduction

Test 9

10 Perspective

数据资源

10.2数据使用

10.3数据生态

Test 10

展开全部
参考资料
  1. 袁汉宁,王树良,程永,金福生,宋红,2015, 数据仓库与数据挖掘, 人民邮电出版社
  2. Shuliang Wang, Hanning Yuan, 2014, Spatial data mining: a perspective of big data, International Journal of Data Warehousing and Mining, 10(4):50-70
  3. Deren Li, Shuliang Wang*, Hanning Yuan*, Deyi Li, 2016, Software and Applications of Spatial Data Mining. WIREs Data Mining and Knowledge Discovery, 6(3): 84-114
  4. Li Deren, Shuliang Wang, Li Deyi, 2015, Spatial Data Mining: Theory and Application, Springer
  5. Han JiaweiKamber MichelinePei Jian, 2017, Data Mining : Concepts and Techniques (3rd Edition), Morgan Kaufmann
  6. William Inmon, 2005, Building the Data Warehouse (4th Edition), Wiley
北京理工大学
6 位授课老师
袁汉宁

袁汉宁

副教授

王树良

王树良

教授

耿晶

耿晶

预聘副研究员

推荐课程

【DeepSeek适用】小白玩转AI大模型应用开发

林粒粒

227人参加

小白玩转 Python 数据分析

林粒粒

93人参加

数据挖掘期末冲刺-3小时突击数据挖掘

L木子老师

331人参加
下载
下载

下载App