類別資料分析主要針對類別型(二元、多元分類和順序尺度)目標變數,以常用之敘述統計(次數或比率)和統計圖 (如圓餅圖和柱狀圖)為基礎,透過機率分配和抽樣分配,進階至母群體之參數推論、二維和三維交叉分析以及目標變數之決策樹和預測模型建立,配合課程內容使用統計軟體SAS Enterprise Guide(SAS EG)、Enterprise Miner(SAS EM)、和SPSS進行資料分析,從統計的角度連結資料採礦方法,如關聯分析(購物籃案例)、類神經網路(詐欺案例)、決策樹(信用狀態案例)和邏輯斯迴歸模型(信用狀態案例),藉此從海量資料和高維度大型資料庫中挖掘潛藏的有用資訊,提供管理階層決策輔助之用。
Categorical Data Analysis(CDA) mainly focus on the analysis of categorical response (or target) variables. Graphical bar charts and pie charts, frequency tables, two-way and three-way contingency tables are used to describe the association among the qualitative target and predictor variables. It is applicable to a wide variety of academic disciplines, from the natural and social sciences to the humanities, government and business. In addition, patterns in the data may be modeled in a way that accounts for randomness and uncertainty in the observations, and are then used to draw inferences about the process or population being studied. This course also introduces the methods in data mining through the statistical point of view. Students will learn the ability to analyze massive and complicated data and will be able to turn the raw data into valuable information using association rules, neural network, decision tree, and logistic regression model in both the software SAS Enterprise Guide and Enterprise Miner.Category data analysis mainly focuses on the target variables of category (binary, multivariate classification and sequence scale), and is based on commonly used descriptive statistics (number or ratio) and statistical diagrams (such as round and column diagrams). Through probability allocation and sampling allocation, parameter recommendations from the parent group, two-dimensional and three-dimensional cross-analysis, and decision trees and prediction models for target variables are established, and the statistical software SAS Enterprise Guide (SAS) is used in conjunction with the course content. EG), Enterprise Miner (SAS EM), and SPSS conduct data analysis, and link data from a statistical perspective to adopt mining methods, such as related analysis (shopping cases), category neural network (scam cases), decision trees (credit status cases) and logical reproduction models (credit status cases), thereby mining potential useful information from massive data and high-dimensional large databases, providing management-level decision-making assistance.
Categorical Data Analysis(CDA) mainly focuses on the analysis of category response (or target) variables. Graphical bar charts and pie charts, frequent tables, two-way and three-way contingency tables are used to describe the association among the quality target and predictor variables. It is applicable to a wide variety of academic disciplines, from the natural and social sciences to the humanities, government and business. In addition, patterns in the data may be modeled in a way that accounts for randomness and uncertainty in the observations, and are then used to draw inferences about the process or population being studied. This course also introduces the methods in data mining through the statistical point of view. Students will learn the ability to analyze massive and complicated data and will be able to turn the raw data into valuable information using association rules, neural network, decision tree, and logistic regression model in both the software SAS Enterprise Guide and Enterprise Miner.
Categorical data analysis that deals with qualitative or discrete quantitative data is one of the most important statistical tools nowadays. In recent years, this tool plays a fundamental role on analyzing polychotomous data, particularly in the social and health sciences. This course introduces statistical theories and models for analyzing categorical data. The main topics cover :
(1) likelihood-based inferences on measures of association for two-dimensional and three-dimensional contingency tables under different assumptions. (2) generalized linear (mixed) models with emphasis on binary (Poisson) regression and logit models. (3) Repeated categorical data modeling, such as generalized estimating equation approaches and quasi-likelihood methods. (4) Asymptotic results and other advanced topics.
Categorical data analysis that deals with quality or discrete quantitative data is one of the most important statistical tools nowadays. In recent years, this tool plays a fundamental role on analyzing polychotomous data, particularly in the social and health sciences. This course introduces statistical theories and models for analyzing category data. The main topics cover :
(1) likelihood-based inferences on measures of association for two-dimensional and three-dimensional contingency tables under different assumptions. (2) generalized linear (mixed) models with emphasis on binary (Poisson) regression and logit models. (3) Repeated category data modeling, such as generalized estimating equation approaches and quasi-likelihood methods. (4) Asymptotic results and other advanced topics.
1. *Agresti, A. and Franklin, C.(2009), Statistics—The Art and Science of Learning from Data, 2nd edition, Pearson Education, Inc. (東華書局/新月圖書代理
2. *曾淑峰、林志弘、翁玉麟(2012年9月),資料採礦應用—以SAS Enterprise Miner為工具,梅霖文化事業有限公司 (ISBN: 978-986-6511-60-8)
3. *Slaughter, S.J. and Delwiche, L.D., 蔡宏明、蔡秉諺譯(2011年11月),SAS Enterprise Guide實用工具書,梅霖文化事業有限公司 (ISBN: 978-986-6511-58-5)
4. 邱皓政*,量化研究與統計分析—SPSS資料分析範例,五南圖書股份有限公司,2010年10月五版.
1. *Agresti, A. and Franklin, C. (2009), Statistics—The Art and Science of Learning from Data, 2nd edition, Pearson Education, Inc. (Tonghua Book Bureau/Cresolution Agency
2. *Zeng Shufeng, Lin Zhihong, Weng Yulin (September 2012), data mining application—using SAS Enterprise Miner as a tool, Meilin Culture Industry Co., Ltd. (ISBN: 978-986-6511-60-8)
3. *Slaughter, S.J. and Delwiche, L.D., Cai Hongming and Cai Bing-san (November 2011), SAS Enterprise Guide practical tools book, Meilin Cultural Affairs Co., Ltd. (ISBN: 978-986-6511-58-5)
4. Qiu Haozheng*, Quantitative Research and Statistical Analysis—SPSS Data Analysis Example, Wunan Books Co., Ltd., October 5th Edition.
評分項目 Grading Method | 配分比例 Grading percentage | 說明 Description |
---|---|---|
平時考成績平時考成績 Normally, the exam results |
40 | 學習態度(包括出缺席)、作業成績、平時考成績、課堂討論與互動 |
期中考期中考 Midterm exam |
30 | 紙筆測驗+PPT口頭報告 |
期末考期末考 Final exam |
30 | 紙筆測驗+PPT書面報告 |