网课邦

当前位置:首页 > 网课答案 > 其他网课

其他网课

交通数据挖掘技术(Data Mining for Transportation)(东南大学) 中国大学MOOC答案2024版100分完整版

时间:2025-01-15 14:14:19 作者:暂无作者 浏览量:
内容页左侧

Week 1. Introduction to data mining Test 1

1、 问题:Which one is not the description of Data mining?
选项:
答案: 【Appropriate statistical analysis methods to analyze the data collected】

2、 问题:Which one describes the right process of knowledge discovery?
选项:
答案: 【Selection-Preprocessing-Transformation-Data mining-Interpretation/Evaluation】

3、 问题:Which one is not belong to the process of KDD?
选项:
答案: 【Data description】

4、 问题:Which one is not the right alternative name of data mining??
选项:
答案: 【Data harvesting】

5、 问题:Which one is not the nominal variables?
选项:
答案: 【Age】

6、 问题:Which one is wrong about classification and regression??
选项:
答案: 【We can construct classification models (functions) without some training examples.】

7、 问题:Which one is wrong about clustering and outliers?
选项:
答案: 【Clustering belongs to supervised learning.】

8、 问题:About data process, which one is wrong?
选项:
答案: 【When making data classification, we predict categorical labels excluding unordered one.】

9、 问题:Outlier mining?such as density based method belongs to supervised learning.
选项:
答案: 【错误】

10、 问题:Support vector machines can be used for classification and regression.
选项:
答案: 【正确】

Week 2. Data pre-processing Test 2

1、 问题:Which is not the reason we need to preprocess the data?
选项:
答案: 【to make result meet our hypothesis】

2、 问题:Which is not the major tasks in data preprocessing?
选项:
答案: 【Transition】

3、 问题:How to construct new feature space by PCA?
选项:
答案: 【New feature space by PCA is constructed by eliminating the weak components to reduce the size of the data.】

4、 问题:Which one is wrong about methods for discretization?
选项:
答案: 【Clustering analysis only belongs to top-down split.】

5、 问题:Which one is wrong about Equal-width (distance) partitioning and Equal-depth (frequency) partitioning?
选项:
答案: 【The interval of the former one is not equal.】

6、 问题:Which one is wrong way to normalize data?
选项:
答案: 【Simple scaling】

7、 问题:Which are the right way to fill in missing values?
选项:
答案: 【Smart mean;
Probable value;
Ignore】

8、 问题:Which are the right way to handle noise data?
选项:
答案: 【Regression;
Cluster;
WT;
Manual】

9、 问题:Which one is right about wavelet transforms?
选项:
答案: 【The DWT decomposes each segment of time series via the successive use of low-pass and high-pass filtering at appropriate levels.;
Wavelet transforms can be used for reducing data and smoothing data.】

10、 问题:Which are the common used ways to sampling?
选项:
答案: 【Simple random sample without replacement;
Simple random sample with replacement;
Stratified sample;
Cluster sample】

11、 问题:Discretization means dividing the range of a continuous attribute into intervals.
选项:
答案: 【正确】

Week 3. Instance based learning Test 3

1、 问题:What’s the difference between eager learner and lazy learner?
选项:
答案: 【Eager learners would generate a model for classification while lazy learner would not.】

2、 问题:How to choose the optimal value for K?
选项:
答案: 【Cross-validation can be used to determine a good value by using an independent dataset to validate the K values.;
Low values for K (like k=1 or k=2) can be noisy and subject to the effect of outliers.;
Historically, the optimal K for most datasets has been between 3-10.】

3、 问题:What’s the major components in KNN?
选项:
答案: 【How to measure similarity?;
How to choose “k”?;
How are class labels assigned?】

4、 问题:Which one of the following ways can be used to obtain attribute weight for Attribute-Weighted KNN?
选项:
答案: 【Prior knowledge / experience.;
PCA, FA (Factor analysis method).;
Information gain.;
Gradient descent, simplex methods and genetic algorithm.】

5、 问题:At learning stage KNN would find the K closest neighbors and then decide classify K identified nearest label.
选项:
答案: 【错误】