被同学2020-11-21 15:43:19
2020 Mock Exam B - Morning Session 第7题: Wu’s recollection about the preparation of the textual data is most accurate with respect to: A numbers. B stop words. C lowercasing. 题目描述: Wu tells Quinn that she has heard a little about text mining for clues about an individual’s behavior and recalls that text preparation must be carried out by removing such items as HTML tags, punctuation, numbers, and stop words and eliminating the distinction between uppercase and lowercase words by lowercasing them all. 这道题不是问正确的做法吗, 为什么选A呢? A numbers. ——不是应该在去掉后增加注释才正确吗? B stop words、C lowercasing——这两步应该是“数据处理”过程里的动作,不是“数据准备”里的动作,是这个意思吧?
回答(1)
Kevin2020-11-23 09:28:37
同学你好!
A:numbers的处理的确像你说的;是在文本准备时(cleansing)
B和C是预处理时。
所以遇到这类问题,尽管A说的不够全,但还是抓主要矛盾。
致正在努力的你,望能解答你的疑惑~
如此次答疑能更好地帮助你理解该知识点,可以通过【点赞】来让我们知晓。你的反馈是我们进步的动力,祝你顺利通过考试~
- 评论(0)
- 追问(0)


评论
0/1000
追答
0/1000
+上传图片