曹同学2023-04-16 10:55:49
可以解释一下什么是data mining吗,我没听懂,最好能有一个英文的定义
回答(1)
开开2023-04-17 11:24:36
同学你好,Data mining是反复地搜索数据集,直至出现显著的模式。这些数据本不存在相关性或者特定范式,但由于你不停的抽样或者搜索数据,就会偶然间突然出现一些数据会存在特定模式。这就是data mining bias,他并不存在经济原理也不符合逻辑,仅仅是由于过度搜索数据集从而偶然间出现了数据上的显著模式。
原版书中定义为:
Data-mining bias arises from repeatedly searching a dataset until a statistically significant pattern emerges. It is almost inevitable that some relationship will appear. Such patterns cannot be expected to have predictive value. Lack of an explicit economic rationale for a variable’s usefulness is a warning sign of a data-mining problem: no story, no future.6 Of course, the analyst must be wary of inventing the story after discovering
【点赞】哟~。加油,祝你顺利通过考试~
- 评论(0)
- 追问(0)
评论
0/1000
追答
0/1000
+上传图片

