6458 - 類別資料分析 英授 Taught in English
Analysis of Categorical Data
教育目標 Course Target
在社會科學的分析中,我們雖然想方設法將許多現象予以數值化,以方便我們使用多元線性迴歸來進行統計分析。然而,更多社會現象是以類別的姿態呈現於真實世界,這種質性變項難以數值化,但卻是我們日常生活的常態。例如,你有多快樂?非常不快樂、不快樂、普通、快樂、非常快樂,許多研究基於方便可能將其視為連續變項,但正確來說,這是一個具有順序層級的類別變項,當它成為研究的依變項時,我們無法使用一般線性迴歸來估計。因此,處理類別資料的統計模型成為社會科學研究中重要的一個部份。本課程即在此基礎上,將帶領同學熟悉類別資料的分析。本課程除了進行統計理論的討論之外,也將透過揣摩期刊論文的操作、詮釋來幫助同學熟悉這些分析模型,更強調實際的資料分析,以期同學具有操作、分析資料、與詮釋資料的能力。
此課程的核心目標包含兩個部分,第一部分是引導學生具有閱讀與評論量化研究論文的能力;第二部分著重於量化資料的實際分析,以培養學生利用統計方法與軟體進行資料整理與比較的能力。台灣學界已經累積豐富具全國代表性的「次級資料」,這些大型資料庫將提供我們這門課程實作的分析素材,其中橫斷面資料是較容易掌握與入門的資料庫,因此本課程將以Stata統計軟體來示範分析「台灣社會變遷基本調查計畫七期二次家庭組」的資料。最後,期許修課同學完成一份具有質量的學期報告。
In social science analysis, we try our best to digitize many phenomena so that we can use multiple linear regression to conduct statistical analysis. However, more social phenomena are presented in the real world in the form of categories. Such qualitative variables are difficult to quantify, but they are the norm in our daily lives. For example, how happy are you? Very unhappy, unhappy, ordinary, happy, very happy, many studies may regard it as a continuous variable based on convenience, but to be correct, this is a categorical variable with an ordinal hierarchy. When it becomes the dependent variable of the study, we cannot use general linear regression to estimate it. Therefore, statistical models for processing categorical data have become an important part of social science research. On this basis, this course will lead students to become familiar with the analysis of category data. In addition to discussing statistical theory, this course will also help students become familiar with these analysis models by figuring out the operation and interpretation of journal articles. It also emphasizes actual data analysis, so that students can have the ability to operate, analyze, and interpret data.
The core goal of this course consists of two parts. The first part is to guide students to have the ability to read and comment on quantitative research papers; the second part focuses on the actual analysis of quantitative data to cultivate students' ability to use statistical methods and software to organize and compare data. The Taiwanese academic community has accumulated a wealth of nationally representative "secondary data." These large databases will provide us with analytical materials for the implementation of this course. Among them, cross-sectional data are easier to master and get started with. Therefore, this course will use Stata statistical software to demonstrate the analysis of data from the "Second Household Group of the Seventh Period of the Basic Survey on Social Change in Taiwan" using Stata statistical software. Finally, students taking the course are expected to complete a quality semester report.
課程概述 Course Description
Categorical data analysis that deals with qualitative or discrete quantitative data is one of the most important statistical tools nowadays. In recent years, this tool plays a fundamental role on analyzing polychotomous data, particularly in the social and health sciences. This course introduces statistical theories and models for analyzing categorical data. The main topics cover :
(1) likelihood-based inferences on measures of association for two-dimensional and three-dimensional contingency tables under different assumptions. (2) generalized linear (mixed) models with emphasis on binary (Poisson) regression and logit models. (3) Repeated categorical data modeling, such as generalized estimating equation approaches and quasi-likelihood methods. (4) Asymptotic results and other advanced topics.
Categorical data analysis that deals with qualitative or discrete quantitative data is one of the most important statistical tools nowadays. In recent years, this tool plays a fundamental role on analyzing polychotomous data, particularly in the social and health sciences. This course introduces statistical theories and models for analyzing categorical data. The main topics cover :
(1) likelihood-based inferences on measures of association for two-dimensional and three-dimensional contingency tables under different assumptions. (2) generalized linear (mixed) models with emphasis on binary (Poisson) regression and logit models. (3) Repeated categorical data modeling, such as generalized estimating equation approaches and quasi-likelihood methods. (4) Asymptotic results and other advanced topics.
參考書目 Reference Books
Long, John Scott. 1997. Regression Models for Categorical and Limited Dependent Variables. Thousand Oaks: Sage Publications.
[2002,鄭旭智、張育哲、潘倩玉、林克明譯,《類別與受限依變項的迴歸統計模式》。台北:弘智。]
Long, John Scott, and Jeremy Freese. 2014. Regression Models for Categorical Dependent Variables Using Stata, third edition. College Station, Tex.: Stata Press.
Long, John Scott. 1997. Regression Models for Categorical and Limited Dependent Variables. Thousand Oaks: Sage Publications.
[2002, translated by Zheng Xuzhi, Zhang Yuzhe, Pan Qianyu, and Lin Keming, "Regression Statistical Model of Category and Restricted Dependent Variables". Taipei: Hongzhi. ]
Long, John Scott, and Jeremy Freese. 2014. Regression Models for Categorical Dependent Variables Using Stata, third edition. College Station, Tex.: Stata Press.
評分方式 Grading
評分項目 Grading Method |
配分比例 Percentage |
說明 Description |
---|---|---|
1、 平日作業 1. Daily homework |
60 | (1) 四份實作作業(40%);(2) 導讀(依修課人數調整):(20%) |
2、 學期報告 2. Term report |
40 |
授課大綱 Course Plan
點擊下方連結查看詳細授課大綱
Click the link below to view the detailed course plan
相似課程 Related Courses
課程代碼 Course Code |
課程名稱 Course Name |
授課教師 Instructor |
時間地點 Time & Room |
學分 Credits |
操作 Actions |
---|---|---|---|---|---|
選修-1761
|
統計系2-4 蘇俊隆 | 一/5,6,9[M117] | 0-3 | 詳細資訊 Details |
課程資訊 Course Information
基本資料 Basic Information
- 課程代碼 Course Code: 6458
- 學分 Credit: 0-3
-
上課時間 Course Time:Wednesday/6,7,8[SS304]
-
授課教師 Teacher:巫麗雪
-
修課班級 Class:社會碩博1,2
交換生/外籍生選課登記
請點選上方按鈕加入登記清單,再等候任課教師審核。
Add this class to your wishlist by clicking the button above.