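School of Computer Science, University of Chinese Academy of Sciences: Pattern Recognition and Machine Learning final exam questions and reference answers
School of Computer Science, University of Chinese Academy of Sciences: Advanced Artificial Intelligence 2016 exam questions
In recent years, with the growth of the Internet and the arrival of modern, inexpensive graphical user interfaces and large-capacity storage devices, the field of information retrieval (IR) has changed dramatically, leaving traditional IR textbooks outdated. Competitions are therefore needed to raise researchers' awareness of the technology. TREC is one of the most important competitions in information retrieval, and this course uses the 2017 TREC competition as its example.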
1. Suppose that a data warehouse consists of four dimensions, date, spectator, location, and game, and two measures, count and charge, where charge is the fare that a spectator pays when watching a game on a given date. Spectators may be students, adults, or seniors, with each category having its own charge rate. (a) Draw a star schema diagram for the data warehouse. (b) Starting with the base cuboid [date, spectator, location, game], what specific OLAP operations should one perform in order to list the total charge paid by student spectators in Los Angeles? (c) Bitmap indexing is a very useful optimization technique. Please present the pros and cons of using bitmap indexing in this given data warehouse.
1. Consider the data set shown in Table 1 (min_sup = 60%, min_conf = 70%). (a) Find all frequent itemsets using Apriori by treating each transaction ID as a market basket. (b) Use the results in part (a) to compute the confidence for the association rules {a, b} → {c} and {c} → {a, b}. Is confidence a symmetric measure? (c) List all of the strong association rules (with support s and confidence c) matching the following metarule, where X is a variable representing customers, and item_i denotes variables representing items (e.g. "A", "B", etc.):

Table 1. Example of market basket transactions.

TID   Items-bought
T1    {A, D, B, C}
T2    {D, A, C, E, B}
T3    {A, B, E}
T4    {A, B, D}
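Parts (a) and (b) can be checked mechanically. The following is a minimal Python sketch, not a reference answer: it runs a small level-wise Apriori over the four transactions of Table 1 and computes the two confidences. The function names (`support`, `apriori`, `confidence`) are hypothetical helpers introduced here, and the sketch assumes that the lowercase items a, b, c in the question refer to the uppercase items A, B, C in the table.

```python
from itertools import combinations

# Transactions copied from Table 1.
transactions = {
    "T1": {"A", "D", "B", "C"},
    "T2": {"D", "A", "C", "E", "B"},
    "T3": {"A", "B", "E"},
    "T4": {"A", "B", "D"},
}
MIN_SUP = 0.6  # min_sup = 60% from the question


def support(itemset):
    """Fraction of transactions that contain every item in itemset."""
    itemset = frozenset(itemset)
    hits = sum(1 for items in transactions.values() if itemset <= items)
    return hits / len(transactions)


def apriori(min_sup):
    """Level-wise search: frequent k-itemsets are built from frequent (k-1)-itemsets."""
    all_items = sorted(set().union(*transactions.values()))
    level = [frozenset([i]) for i in all_items if support([i]) >= min_sup]
    frequent = []
    k = 1
    while level:
        frequent.extend(level)
        k += 1
        previous = set(level)
        # Join step: union pairs of frequent (k-1)-itemsets that give a k-itemset.
        candidates = {a | b for a in level for b in level if len(a | b) == k}
        # Prune step: every (k-1)-subset must be frequent, then check support.
        level = [
            c for c in candidates
            if all(frozenset(s) in previous for s in combinations(c, k - 1))
            and support(c) >= min_sup
        ]
    return frequent


def confidence(antecedent, consequent):
    """conf(X -> Y) = support(X union Y) / support(X)."""
    return support(set(antecedent) | set(consequent)) / support(antecedent)


if __name__ == "__main__":
    for itemset in sorted(apriori(MIN_SUP), key=lambda s: (len(s), sorted(s))):
        print(sorted(itemset), support(itemset))
    print("conf({A,B} -> {C}) =", confidence({"A", "B"}, {"C"}))
    print("conf({C} -> {A,B}) =", confidence({"C"}, {"A", "B"}))
```

Because the two rules share the same numerator, support({A, B, C}), but divide by different antecedent supports, the two confidences generally differ, which is the point of the symmetry question in part (b).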
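Information retrieval is the main way users query and obtain information; it covers the methods and means of finding information. In the narrow sense, information retrieval refers only to information search: the process by which a user, according to their needs, applies a given method and uses retrieval tools to find the required information in a collection. In the broad sense, information retrieval is the process of processing, arranging, organizing, and storing information in a particular way, and then accurately locating the relevant information according to a user's specific needs. This is also called information storage and retrieval. In general, "information retrieval" means information retrieval in the broad sense. This course mainly covers the principal techniques of information retrieval.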
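Usable for both Professor Wei's and Professor Luo's classes; all of the exam questions come from this material. The answers have been corrected.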
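University of Chinese Academy of Sciences, 2017-2019: summary of Advanced Artificial Intelligence exam questions and answers. The topics covered by the questions and the corresponding answers are provided for reference.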
