Introduction to Data Analysis††††††††††††††††††††††††††††††††††††††††††††††††††††††††††††††††††† Rev: 1/3/2012

Sociology 180B/280B

 

Draft Syllabus

 

Winter Quarter, 2013

Tuesdays and Thursdays

2:15-3:30

Building 200, Room 030

 

 

Lab/Section once a week for 75 minutes

time and place TBA

 

Michael J. Rosenfeld

Associate Professor

Department of Sociology

Building 120 room 124

mrosenfe@stanford.edu

The class website is my personal Stanford website

www.stanford.edu/~mrosenfe

Office Hours TBA

 

TAs

Andy Isaacson <andyi@stanford.edu>

Marion Coddou <mcoddou@stanford.edu>

 

Use Coursework (not Axess) to sign up for section meeting times, and use Coursework to submit homework.

 

 

 

 

Introduction:

††††††††††† This class will cover basic statistics including regression, how do statistical analysis, and how to find flaws and problems with statistical analyses.

††††††††††† In the process of learning about data analysis you will also learn about demography and stratification in the U.S., because the dataset is the Current Population Survey of March, 2000, which is a nationally representative survey of more than 60,000 households, with lots of information about race, gender, income, occupation, place of residence, and so on.You'll also learn how to use one of the most powerful and flexible tools for data analysis, the statistical software STATA.

 


Readings and Grading Policy

 

Books (available at Stanford Bookstore):

* Freedman, David, Robert Pisani, and Roger Purves. 2007. Statistics. Fourth Edition. W.W. Norton. $105, ISBN: 0393929728 (recommended). If you know a little about statistics already, or if you have taken one statistics class like Stats 60, you donít need to buy the Freedman, and you can ignore the Freedman reading assignments.

* Tufte, Edward. 2001. The Visual Display of Quantitative Information. Graphics Press. $28,  0961392142 (required).

 

Other readings will be linked from the class website.

 

 

Software Required (order online)

* Intercooled (IC) Stata, Version 12. You may purchase either a 1 year license for $98,or a perpetual license for $179. I recommend the perpetual license so that you can use this software in the future. The software comes with a small introduction to Stata book. Donít bother buying Stataís massive printed reference book collection. I will teach you the Stata commands that you need to know, and the Stata online help is very good.

http://www.stata.com/order/new/edu/gradplans/direct-ship-pricing/

 

 

Computer Use Policy:

* Computer use by students in class is strictly limited to following along with the data analysis examples being presented by the professor.


GRADING:

 

1) Undergraduates, Soc 180B:

Homework

4 homeworks, 15% each

Regular section participation

10%

Final exam (based on data analysis part of the course)

30%

 

 

2) Graduate Students Soc 280B

Homework

4 homeworks, 15% each

Regular section participation

10%

In-class presentation (data analysis of dataset of your own choosing) outline

10% (due date to be negotiated with professor Rosenfeld

In-class presentation (data analysis of dataset of your own choosing) actual presentation to class

20% (class presentation date to be negotiated with professor Rosenfeld)

 

 

Project and Reading Assignment Timeline

 

Week

CLASS

Class lecture Goals

READINGS; REQUIRED READINGS IN BOLD.

ASSIGNMENT

1

Jan 8

Introduction to the class

 

 

 

Jan 10

Basics of descriptive data analysis using STATA

Read my Intro to Stata (required)

Read Freedman Ch 4

Hand out HW#1

 

Section

Work on HW 1 and on using STATA

 

 

 

 

 

 

 

2

Jan 15

Observational Studies and their limitations

Freedman Ch 2

 

 

Jan 17

Error and bias

Freedman Ch 6

 

 

Section

Work on HW 1 and on using STATA

 

 

 

 

 

 

Monday, Jan 21, HW #1 due by midnight

3

Jan 22

Error and bias

Freedman Ch 6

Hand out HW#2

 

Jan 24

Probability sampling, Sample size and power, and standard errors

Freedman Ch 20

 

 

Section

Stata, and HW 2

 

 

 

 

 

 

 

4

Jan 29

More on sample size and power.

Freedman Ch 21

 

 

Jan 31

Statistics and hypothesis testing

 

 

 

Section:

Stata, and HW 2

 

 

 

 

 

 

Friday, Feb 1, HW#2 due by midnight

5

Feb 5

Introduction to regression with STATA

Freedman Chs 9, 10

Hand out HW#3

 

Feb 7

More on regression with STATA, interpreting coefficients

Freedman, Ch 11, 12

 

 

Section

Work on HW #3

 

 

 

 

 

 

 

6

Feb 12

Problems with and difficulties in using regression, Graphing.

Tufte, P. 1-87 (required)

 

 

Feb 14

Proper and improper presentation of data

Tufte, P. 90-190 (required)

 

 

Section

Work on HW #3

 

 

 

 

 

 

Friday, Feb 15, HW#3 due by midnight

7

Feb 19

Additivity, linearity, and regression fits

 

Hand out HW #4

 

Feb 21

Regression analysis: residuals and outliers

Readings by Jasso and Kahn and Udry, and Jassoís response posted on my website (all required)

 

 

Section

Work on STATA, discuss the issues in HW 4

 

 

 

 

 

 

 

8

Feb 26

Logistic regression

 

 

Feb 28

Logistic regression and the likelihood ratio test

 

 

Section

Work on STATA, discuss the issues in CPS HW #4

 

 

 

 

 

 

 

9

Mar 5

Polls, polling aggregation, and election prediction

 

 

 

Mar 7

Chisquare tests of independence

 

 

 

 

HW #4 due

Friday, March 8 by midnight

 

Section

Work on STATA, discuss the issues in HW 4

 

 

 

 

 

 

 

10

March 12

Soc 280B in-class presentations

 

 

March 14

Final Exam Review

 

 

no section meetings

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Final exam

at the regularly scheduled time and date