This course is concerned with data mining, which is the application of the methods of statistics, data analysis and machine learning algorithms to the exploration and analysis of large data sets, with the aim of extracting new, previously unknown, and potentially useful information from the process of knowledge discovery in databases (KDD). Data mining is being applied in an increasing variety of areas, such as financial, economic, high-tech industrial, scientific and medical fields, as an essential component of decision assistance system. The objective of this course is to introduce statistical data mining concepts, describe methods in statistical data mining from sampling to decision trees, and provide decision support solutions.
統計資料採礦主要應用統計方法、資料分析和機器學習演算法,進行資料庫之知識挖掘,藉此從高維度大型資料庫中挖掘潛藏的有用資訊。目前已在金融、經濟、行銷、電子商務、數位資訊產業、高科技產業、生命科學和醫學等不同領域,逐漸提升地位成為決策輔助系統中的重要元件之一,提供管理階層決策輔助之用。以指導學生選定研究題目,閱讀文獻,提出研究計畫報告及研究成果報告為課程目標。This course is concerned with data mining, which is the application of the methods of statistics, data analysis and machine learning algorithms to the exploration and analysis of large data sets, with the aim of extracting new, previously unknown, and potentially useful information from the process of knowledge discovery in databases (KDD). Data mining is being applied in an increasing variety of areas, such as financial, economic, high-tech industrial, scientific and medical fields, as an essential component of decision assistance system. The objective of this course is to introduce statistical data mining concepts, describe methods in statistical data mining from sampling to decision trees, and provide decision support solutions.
Statistical data mining mainly applies statistical methods, data analysis and machine learning algorithms to conduct knowledge mining in databases, thereby mining useful information hidden in high-dimensional large databases. At present, it has gradually improved its status and become one of the important components in decision-making assistance systems in different fields such as finance, economics, marketing, e-commerce, digital information industry, high-tech industry, life sciences and medicine, providing management decision-making assistance. The course objectives are to guide students to select research topics, read literature, and propose research project reports and research results reports.
1. Tan, Steinbach and Kumar, Introduction to Data Mining, Addison Wesley, 2006. (歐亞)
2. 曾淑峰、林志弘、翁玉麟(2012年9月),資料採礦應用—以SAS Enterprise Miner為工具,梅霖文化事業有限公司 (ISBN: 978-986-6511-60-8)
3. Slaughter, S.J. and Delwiche, L.D., 蔡宏明、蔡秉諺譯(2011年11月),SAS Enterprise Guide實用工具書,梅霖文化事業有限公司 (ISBN: 978-986-6511-58-5)
4. 相關專業學術刊物
1. Tan, Steinbach and Kumar, Introduction to Data Mining, Addison Wesley, 2006. (Eurasia)
2. Zeng Shufeng, Lin Zhihong, and Weng Yulin (September 2012), Data Mining Application—using SAS Enterprise Miner as a tool, Meilin Cultural Industry Co., Ltd. (ISBN: 978-986-6511-60-8)
3. Slaughter, S.J. and Delwiche, L.D., translated by Cai Hongming and Cai Bingyan (November 2011), SAS Enterprise Guide practical reference book, Meilin Cultural Enterprise Co., Ltd. (ISBN: 978-986-6511-58-5)
4. Relevant professional academic journals
評分項目 Grading Method | 配分比例 Grading percentage | 說明 Description |
---|---|---|
作業--Homeworks (單元資料庫實作)作業--Homeworks (單元資料庫實作) Assignment--Homeworks (unit database implementation) |
40 | 單元資料庫實作 |
期中報告期中報告 interim report |
30 | 單元資料庫實作PPT暨上台報告 |
期末報告期末報告 Final report |
30 | 研究論文一篇 |