Home
統計學系
course information of 102 - 02 | 1766 Analysis of Categorical Data(類別資料分析)

Taught In English1766 - 類別資料分析 Analysis of Categorical Data


教育目標 Course Target

類別資料分析主要針對類別型(二元、多元分類和順序尺度)目標變數,以常用之敘述統計(次數或比率)和統計圖 (如圓餅圖和柱狀圖)為基礎,透過機率分配和抽樣分配,進階至母群體之參數推論、二維和三維交叉分析以及目標變數之決策樹和預測模型建立,配合課程內容使用統計軟體SAS Enterprise Guide(SAS EG)、Enterprise Miner(SAS EM)、和SPSS進行資料分析,從統計的角度連結資料採礦方法,如關聯分析(購物籃案例)、類神經網路(詐欺案例)、決策樹(信用狀態案例)和邏輯斯迴歸模型(信用狀態案例),藉此從海量資料和高維度大型資料庫中挖掘潛藏的有用資訊,提供管理階層決策輔助之用。 Categorical Data Analysis(CDA) mainly focus on the analysis of categorical response (or target) variables. Graphical bar charts and pie charts, frequency tables, two-way and three-way contingency tables are used to describe the association among the qualitative target and predictor variables. It is applicable to a wide variety of academic disciplines, from the natural and social sciences to the humanities, government and business. In addition, patterns in the data may be modeled in a way that accounts for randomness and uncertainty in the observations, and are then used to draw inferences about the process or population being studied. This course also introduces the methods in data mining through the statistical point of view. Students will learn the ability to analyze massive and complicated data and will be able to turn the raw data into valuable information using association rules, neural network, decision tree, and logistic regression model in both the software SAS Enterprise Guide and Enterprise Miner.Category data analysis mainly focuses on the target variables of category (binary, multivariate classification and sequence scale), and is based on commonly used descriptive statistics (number or ratio) and statistical diagrams (such as round and column diagrams). Through probability allocation and sampling allocation, parameter recommendations from the parent group, two-dimensional and three-dimensional cross-analysis, and decision trees and prediction models for target variables are established, and the statistical software SAS Enterprise Guide (SAS) is used in conjunction with the course content. EG), Enterprise Miner (SAS EM), and SPSS conduct data analysis, and link data from a statistical perspective to adopt mining methods, such as related analysis (shopping cases), category neural network (scam cases), decision trees (credit status cases) and logical reproduction models (credit status cases), thereby mining potential useful information from massive data and high-dimensional large databases, providing management-level decision-making assistance. Categorical Data Analysis(CDA) mainly focuses on the analysis of category response (or target) variables. Graphical bar charts and pie charts, frequent tables, two-way and three-way contingency tables are used to describe the association among the quality target and predictor variables. It is applicable to a wide variety of academic disciplines, from the natural and social sciences to the humanities, government and business. In addition, patterns in the data may be modeled in a way that accounts for randomness and uncertainty in the observations, and are then used to draw inferences about the process or population being studied. This course also introduces the methods in data mining through the statistical point of view. Students will learn the ability to analyze massive and complicated data and will be able to turn the raw data into valuable information using association rules, neural network, decision tree, and logistic regression model in both the software SAS Enterprise Guide and Enterprise Miner.


課程概述 Course Description

Categorical data analysis that deals with qualitative or discrete quantitative data is one of the most important statistical tools nowadays. In recent years, this tool plays a fundamental role on analyzing polychotomous data, particularly in the social and health sciences. This course introduces statistical theories and models for analyzing categorical data. The main topics cover : (1) likelihood-based inferences on measures of association for two-dimensional and three-dimensional contingency tables under different assumptions. (2) generalized linear (mixed) models with emphasis on binary (Poisson) regression and logit models. (3) Repeated categorical data modeling, such as generalized estimating equation approaches and quasi-likelihood methods. (4) Asymptotic results and other advanced topics.
Categorical data analysis that deals with quality or discrete quantitative data is one of the most important statistical tools nowadays. In recent years, this tool plays a fundamental role on analyzing polychotomous data, particularly in the social and health sciences. This course introduces statistical theories and models for analyzing category data. The main topics cover : (1) likelihood-based inferences on measures of association for two-dimensional and three-dimensional contingency tables under different assumptions. (2) generalized linear (mixed) models with emphasis on binary (Poisson) regression and logit models. (3) Repeated category data modeling, such as generalized estimating equation approaches and quasi-likelihood methods. (4) Asymptotic results and other advanced topics.


參考書目 Reference Books

1. *Agresti, A. and Franklin, C.(2009), Statistics—The Art and Science of Learning from Data, 2nd edition, Pearson Education, Inc. (東華書局/新月圖書代理
2. *曾淑峰、林志弘、翁玉麟(2012年9月),資料採礦應用—以SAS Enterprise Miner為工具,梅霖文化事業有限公司 (ISBN: 978-986-6511-60-8)
3. *Slaughter, S.J. and Delwiche, L.D., 蔡宏明、蔡秉諺譯(2011年11月),SAS Enterprise Guide實用工具書,梅霖文化事業有限公司 (ISBN: 978-986-6511-58-5)
4. 邱皓政*,量化研究與統計分析—SPSS資料分析範例,五南圖書股份有限公司,2010年10月五版.
1. *Agresti, A. and Franklin, C. (2009), Statistics—The Art and Science of Learning from Data, 2nd edition, Pearson Education, Inc. (Tonghua Book Bureau/Cresolution Agency
2. *Zeng Shufeng, Lin Zhihong, Weng Yulin (September 2012), data mining application—using SAS Enterprise Miner as a tool, Meilin Culture Industry Co., Ltd. (ISBN: 978-986-6511-60-8)
3. *Slaughter, S.J. and Delwiche, L.D., Cai Hongming and Cai Bing-san (November 2011), SAS Enterprise Guide practical tools book, Meilin Cultural Affairs Co., Ltd. (ISBN: 978-986-6511-58-5)
4. Qiu Haozheng*, Quantitative Research and Statistical Analysis—SPSS Data Analysis Example, Wunan Books Co., Ltd., October 5th Edition.


評分方式 Grading

評分項目 Grading Method 配分比例 Grading percentage 說明 Description
平時考成績平時考成績
Normally, the exam results
40 學習態度(包括出缺席)、作業成績、平時考成績、課堂討論與互動
期中考期中考
Midterm exam
30 紙筆測驗+PPT口頭報告
期末考期末考
Final exam
30 紙筆測驗+PPT書面報告

授課大綱 Course Plan

Click here to open the course plan. Course Plan
交換生/外籍生選課登記 - 請點選下方按鈕加入登記清單,再等候任課教師審核。
Add this class to your wishlist by click the button below.
請先登入才能進行選課登記 Please login first


相似課程 Related Course

很抱歉,沒有符合條件的課程。 Sorry , no courses found.

Course Information

Description

學分 Credit:0-3
上課時間 Course Time:Friday/2,3,4[M117]
授課教師 Teacher:林雅俐
修課班級 Class:統計系3,4
選課備註 Memo:A 群組;統計學下期40分以上方可修習
This Course is taught In English 授課大綱 Course Plan: Open

選課狀態 Attendance

There're now 10 person in the class.
目前選課人數為 10 人。

請先登入才能進行選課登記 Please login first