-----------------------------------------------------------------------------------------------------
. set mem 20m
Current memory allocation
                    current                                 memory usage
    settable          value     description                 (1M = 1024k)
    --------------------------------------------------------------------
    set maxvar         5000     max. variables allowed           1.733M
    set memory          20M     max. data space                 20.000M
    set matsize         400     max. RHS vars in models          1.254M
                                                            -----------
                                                                22.987M
. use cps_y2k_numeric.dta, clear
. describe
Contains data from cps_y2k_numeric.dta
obs: 133,710
vars: 42 8 May 2004 13:26
size: 9,894,540 (52.8% of memory free)
-------------------------------------------------------------------------------
storage display value
variable name type format label variable label
-------------------------------------------------------------------------------
phseq str5 %9s household sequence number p2
pernum byte %8.0g
age byte %8.0g p15
maritl byte %26.0g marlbl Marital Status p17
sex byte %8.0g sexnm p20
vet byte %22.0g vetnm veteran status p21
hga byte %8.0g Educational Attainment p22
race byte %11.0g racenm p25
reorigin byte %25.0g hisplbl Hispanic Origin p27
hrs1 byte %8.0g hours worked last week p76
clswkr byte %32.0g cwrknm sector of worker p109
grswk int %9.0g gross weekly wages p135
unmem byte %13.0g unnm labor union member p139
lfsr byte %28.0g lfsrnm labor force status p145
ernval float %9.0g main job last year earnings p228
ssval long %12.0g last year soc security payments
p291
pawval int %12.0g last year welfare payments p305
wgt2 int %9.0g rounded weight based on p50
ernval2 float %9.0g main job earnings, losses
recoded to zero
htype byte %37.0g htpnm household type h25
state byte %8.0g HG-ST60, or simply state of
residence h40
hpmsasz byte %8.0g metropolitan area size h56
hcccr byte %8.0g residence in central city h58
frelu18 byte %8.0g number of kids in fam under 18
f29
povll byte %8.0g ratio of fam income to poverty
level f38
fwsval float %9.0g family income f48
famwgt2 int %8.0g adjusted family weight f233
yrsed float %9.0g years of education, from hga
citizen byte %33.0g citnm citizenship p733
health byte %11.0g hlthnm self reported health status p800
occ int %8.0g occupation P 106
ptotr byte %8.0g total person income categories
P466
penatvty int %8.0g country of birth P 722,
Appendix H
pemntvty int %8.0g Mother's country of birth,
P725, appendix H
pefntvty int %8.0g Father's country of birth,
P728, appendix H
peinusyr byte %8.0g time of immigration, P 731
pxnatvty byte %8.0g allocation flag for country of
birth P 734
hgmsac int %8.0g metropolitan area code, h44,
appendix E
pppos2 byte %8.0g family sequence number within
each household p46
edlvl byte %16.0g edlabel 4 categories ed attainment
hispanic byte %12.0g smhisplbl
dichotomoy hispanic yes/no
new_race byte %18.0g new_race race and Hispanic combined
-------------------------------------------------------------------------------
Sorted by:
. *Do not use phseq for this class; it is a household sequence number.
. *Do not use pernum for this class either. Together with phseq, pernum
. *identifies household relationships, such as who is married to whom in the household.
. tabulate age
p15 | Freq. Percent Cum.
------------+-----------------------------------
0 | 1,713 1.28 1.28
1 | 1,932 1.44 2.73
2 | 1,950 1.46 4.18
3 | 1,939 1.45 5.63
4 | 1,965 1.47 7.10
5 | 1,998 1.49 8.60
6 | 2,059 1.54 10.14
7 | 2,176 1.63 11.77
8 | 2,163 1.62 13.38
9 | 2,243 1.68 15.06
10 | 2,202 1.65 16.71
11 | 2,083 1.56 18.27
12 | 2,035 1.52 19.79
13 | 2,047 1.53 21.32
14 | 1,979 1.48 22.80
15 | 2,046 1.53 24.33
16 | 1,965 1.47 25.80
17 | 1,998 1.49 27.29
18 | 1,847 1.38 28.67
19 | 1,826 1.37 30.04
20 | 1,722 1.29 31.33
21 | 1,687 1.26 32.59
22 | 1,638 1.23 33.81
23 | 1,622 1.21 35.03
24 | 1,662 1.24 36.27
25 | 1,666 1.25 37.52
26 | 1,640 1.23 38.74
27 | 1,726 1.29 40.03
28 | 1,801 1.35 41.38
29 | 1,995 1.49 42.87
30 | 1,907 1.43 44.30
31 | 1,991 1.49 45.79
32 | 1,890 1.41 47.20
33 | 1,898 1.42 48.62
34 | 2,024 1.51 50.13
35 | 2,134 1.60 51.73
36 | 2,123 1.59 53.32
37 | 2,099 1.57 54.89
38 | 2,064 1.54 56.43
39 | 2,228 1.67 58.10
40 | 2,190 1.64 59.74
41 | 2,115 1.58 61.32
42 | 2,137 1.60 62.92
43 | 2,091 1.56 64.48
44 | 2,114 1.58 66.06
45 | 2,118 1.58 67.64
46 | 1,939 1.45 69.10
47 | 1,957 1.46 70.56
48 | 1,827 1.37 71.93
49 | 1,767 1.32 73.25
50 | 1,865 1.39 74.64
51 | 1,802 1.35 75.99
52 | 1,825 1.36 77.35
53 | 1,695 1.27 78.62
54 | 1,301 0.97 79.59
55 | 1,323 0.99 80.58
56 | 1,324 0.99 81.57
57 | 1,304 0.98 82.55
58 | 1,128 0.84 83.39
59 | 1,129 0.84 84.24
60 | 1,154 0.86 85.10
61 | 1,051 0.79 85.89
62 | 1,073 0.80 86.69
63 | 938 0.70 87.39
64 | 952 0.71 88.10
65 | 1,014 0.76 88.86
66 | 869 0.65 89.51
67 | 926 0.69 90.20
68 | 908 0.68 90.88
69 | 904 0.68 91.56
70 | 913 0.68 92.24
71 | 885 0.66 92.90
72 | 770 0.58 93.48
73 | 797 0.60 94.08
74 | 814 0.61 94.68
75 | 796 0.60 95.28
76 | 704 0.53 95.81
77 | 646 0.48 96.29
78 | 687 0.51 96.80
79 | 602 0.45 97.25
80 | 514 0.38 97.64
81 | 476 0.36 97.99
82 | 425 0.32 98.31
83 | 427 0.32 98.63
84 | 325 0.24 98.87
85 | 306 0.23 99.10
86 | 248 0.19 99.29
87 | 209 0.16 99.44
88 | 172 0.13 99.57
89 | 155 0.12 99.69
90 | 416 0.31 100.00
------------+-----------------------------------
Total | 133,710 100.00
. *not very illuminating.
. summarize age
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
age | 133710 35.17964 22.21722 0 90
. summarize age [fweight=wgt2]
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
age | 2.732e+08 35.31592 22.1685 0 90
. *note the topcode max of 90
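. *A sketch, not run in this log: one way to flag the topcoded cases so they can be
. *examined or excluded later. The variable name age_top is hypothetical.
. *    generate byte age_top = (age == 90)
. *    summarize age if age_top == 0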
. tabulate maritl
Marital Status p17 | Freq. Percent Cum.
---------------------------+-----------------------------------
married, spouse present | 55,585 41.57 41.57
married, AF spouse present | 351 0.26 41.83
married, spouse absent | 1,355 1.01 42.85
widowed | 6,561 4.91 47.75
divorced | 9,523 7.12 54.88
separated | 2,097 1.57 56.44
never married | 58,238 43.56 100.00
---------------------------+-----------------------------------
Total | 133,710 100.00
. tabulate maritl [fweight=wgt2]
Marital Status p17 | Freq. Percent Cum.
---------------------------+-----------------------------------
married, spouse present | 112733750 41.26 41.26
married, AF spouse present | 658,497 0.24 41.50
married, spouse absent | 2,713,867 0.99 42.49
widowed | 13630062 4.99 47.48
divorced | 19629420 7.18 54.67
separated | 4,430,097 1.62 56.29
never married | 119438128 43.71 100.00
---------------------------+-----------------------------------
Total | 273233821 100.00
. tabulate sex
p20 | Freq. Percent Cum.
------------+-----------------------------------
male | 64,791 48.46 48.46
female | 68,919 51.54 100.00
------------+-----------------------------------
Total | 133,710 100.00
. tabulate sex [fweight=wgt2]
p20 | Freq. Percent Cum.
------------+-----------------------------------
male | 133187798 48.74 48.74
female | 140046023 51.26 100.00
------------+-----------------------------------
Total | 273233821 100.00
. tabulate vet
veteran status p21 | Freq. Percent Cum.
-----------------------+-----------------------------------
children or current AF | 30,904 23.11 23.11
Vietnam | 3,683 2.75 25.87
Korea | 1,716 1.28 27.15
WWII | 2,428 1.82 28.97
Other Service | 3,830 2.86 31.83
Non Veteran | 91,149 68.17 100.00
-----------------------+-----------------------------------
Total | 133,710 100.00
. tabulate vet [fweight=wgt2]
veteran status p21 | Freq. Percent Cum.
-----------------------+-----------------------------------
children or current AF | 60277572 22.06 22.06
Vietnam | 7,505,156 2.75 24.81
Korea | 3,469,644 1.27 26.08
WWII | 5,185,150 1.90 27.98
Other Service | 8,218,282 3.01 30.98
Non Veteran | 188578017 69.02 100.00
-----------------------+-----------------------------------
Total | 273233821 100.00
. tabulate hga
Educational |
Attainment |
p22 | Freq. Percent Cum.
------------+-----------------------------------
0 | 30,484 22.80 22.80
31 | 457 0.34 23.14
32 | 1,187 0.89 24.03
33 | 2,320 1.74 25.76
34 | 4,527 3.39 29.15
35 | 4,161 3.11 32.26
36 | 4,695 3.51 35.77
37 | 4,721 3.53 39.30
38 | 1,491 1.12 40.42
39 | 31,970 23.91 64.33
40 | 18,797 14.06 78.39
41 | 3,758 2.81 81.20
42 | 3,328 2.49 83.69
43 | 14,705 11.00 94.68
44 | 4,918 3.68 98.36
45 | 1,229 0.92 99.28
46 | 962 0.72 100.00
------------+-----------------------------------
Total | 133,710 100.00
. *hga is the CPS educational attainment variable. Its values (0, 31-46) are
. *category codes, not years, so I created yrsed, a more useful variable
. *for years of education.
. tabulate yrsed
years of |
education, |
from hga | Freq. Percent Cum.
------------+-----------------------------------
0 | 30,941 23.14 23.14
1.5 | 1,187 0.89 24.03
5.5 | 2,320 1.74 25.76
7.5 | 4,527 3.39 29.15
9 | 4,161 3.11 32.26
10 | 4,695 3.51 35.77
11 | 4,721 3.53 39.30
12 | 33,461 25.03 64.33
13 | 18,797 14.06 78.39
14 | 7,086 5.30 83.69
16 | 14,705 11.00 94.68
17 | 4,918 3.68 98.36
19 | 1,229 0.92 99.28
22 | 962 0.72 100.00
------------+-----------------------------------
Total | 133,710 100.00
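. *A sketch, not run in this log, of how yrsed could be rebuilt from hga in a do-file.
. *The mapping is an assumption inferred by matching frequencies in the two tables above
. *(for example, hga codes 38 and 39 together give the 33,461 cases at yrsed 12).
. *The name yrsed_chk is hypothetical; the assert fails if the inferred mapping is wrong.
. *    recode hga (0 31 = 0) (32 = 1.5) (33 = 5.5) (34 = 7.5) (35 = 9) (36 = 10) (37 = 11) ///
. *        (38 39 = 12) (40 = 13) (41 42 = 14) (43 = 16) (44 = 17) (45 = 19) (46 = 22), ///
. *        generate(yrsed_chk)
. *    assert yrsed_chk == yrsed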
. tabulate race
p25 | Freq. Percent Cum.
------------+-----------------------------------
White | 113,475 84.87 84.87
Black | 13,626 10.19 95.06
Amer Indian | 1,894 1.42 96.47
Asian | 4,715 3.53 100.00
------------+-----------------------------------
Total | 133,710 100.00
. tabulate reorigin
Hispanic Origin p27 | Freq. Percent Cum.
--------------------------+-----------------------------------
Mexican American | 6,447 4.82 4.82
Chicano | 384 0.29 5.11
Mexican | 8,155 6.10 11.21
Puerto Rican | 2,280 1.71 12.91
Cuban | 943 0.71 13.62
Central or South American | 3,487 2.61 16.23
Other Spanish | 1,863 1.39 17.62
Non Hispanic | 108,641 81.25 98.87
Don't Know | 471 0.35 99.22
N/A | 1,039 0.78 100.00
--------------------------+-----------------------------------
Total | 133,710 100.00
. tabulate hispanic
dichotomoy |
hispanic |
yes/no | Freq. Percent Cum.
-------------+-----------------------------------
Non Hispanic | 110,151 82.38 82.38
Hispanic | 23,559 17.62 100.00
-------------+-----------------------------------
Total | 133,710 100.00
. tabulate new_race
race and Hispanic |
combined | Freq. Percent Cum.
-------------------+-----------------------------------
Non Hispanic White | 90,988 68.05 68.05
Non Hispanic Black | 12,930 9.67 77.72
NH American Indian | 1,649 1.23 78.95
NH Asian | 4,584 3.43 82.38
Hispanic | 23,559 17.62 100.00
-------------------+-----------------------------------
Total | 133,710 100.00
. *race and reorigin are original CPS variables,
hispanic and new_race are variables I have added.
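. *A sketch, not run in this log: cross-tabulating the original and added variables shows
. *how they relate. Judging from the counts above, hispanic appears to treat "Don't Know"
. *and "N/A" on reorigin as Non Hispanic (108,641 + 471 + 1,039 = 110,151).
. *    tabulate reorigin hispanic
. *    tabulate race new_race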
. summarize hrs1, detail
hours worked last week p76
-------------------------------------------------------------
Percentiles Smallest
1% -1 -1
5% -1 -1
10% -1 -1 Obs 133710
25% -1 -1 Sum of Wgt. 133710
50% 0 Mean 18.1734
Largest Std. Dev. 22.08705
75% 40 99
90% 48 99 Variance 487.8376
95% 55 99 Skewness .6339475
99% 70 99 Kurtosis 2.090072
. *Here's a little surprise that the data dictionary does not predict:
. *at least a quarter of the sample has -1 for hours worked (the 25th percentile is -1),
. *clearly a 'not in universe' code.
. *So how many people in the sample had positive hours worked?
. summarize hrs1 if hrs1>0, detail
hours worked last week p76
-------------------------------------------------------------
Percentiles Smallest
1% 4 1
5% 14 1
10% 20 1 Obs 62726
25% 35 1 Sum of Wgt. 62726
50% 40 Mean 39.37833
Largest Std. Dev. 13.8791
75% 45 99
90% 55 99 Variance 192.6293
95% 60 99 Skewness .0870914
99% 80 99 Kurtosis 4.812558
. *62,726 of the 133,710 had positive hours worked
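. *A sketch, not run in this log: setting the -1 code to missing keeps it out of later
. *means and models. This assumes -1 is the only special code in hrs1;
. *the name hrs1_clean is hypothetical.
. *    generate hrs1_clean = hrs1 if hrs1 >= 0
. *    summarize hrs1_clean [fweight=wgt2]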
. tabulate clswkr
sector of worker p109 | Freq. Percent Cum.
---------------------------------+-----------------------------------
not in universe, children, or AF | 64,773 48.44 48.44
private | 51,618 38.60 87.05
federal govt | 1,763 1.32 88.37
state govt | 2,884 2.16 90.52
local govt | 5,235 3.92 94.44
self employed- incorporated | 2,122 1.59 96.02
self employed, not incorp | 5,042 3.77 99.80
unpaid | 71 0.05 99.85
never worked | 202 0.15 100.00
---------------------------------+-----------------------------------
Total | 133,710 100.00
. tabulate clswkr, nolabel
sector of |
worker p109 | Freq. Percent Cum.
------------+-----------------------------------
0 | 64,773 48.44 48.44
1 | 51,618 38.60 87.05
2 | 1,763 1.32 88.37
3 | 2,884 2.16 90.52
4 | 5,235 3.92 94.44
5 | 2,122 1.59 96.02
6 | 5,042 3.77 99.80
7 | 71 0.05 99.85
8 | 202 0.15 100.00
------------+-----------------------------------
Total | 133,710 100.00
. *If you're looking at type of worker (e.g. government vs. private vs. self-employed),
. *you need to drop the 48% of the sample that is 'not in universe' for this question.
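. *A sketch, not run in this log: per the nolabel table above, code 0 is the
. *'not in universe' category, so a simple condition restricts the table to classified workers.
. *    tabulate clswkr if clswkr > 0
. *    tabulate clswkr if clswkr > 0 [fweight=wgt2]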
. summarize grswk if grswk>0
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
grswk | 13422 601.9655 461.0122 1 2885
. summarize grswk if grswk>0, detail
gross weekly wages p135
-------------------------------------------------------------
Percentiles Smallest
1% 39 1
5% 99 2
10% 158 2 Obs 13422
25% 300 3 Sum of Wgt. 13422
50% 483 Mean 601.9655
Largest Std. Dev. 461.0122
75% 775 2885
90% 1154 2885 Variance 212532.2
95% 1442 2885 Skewness 1.964056
99% 2500 2885 Kurtosis 8.607142
. *Here's a problem with grswk: even though nearly half of the sample works,
. *only 13,422, or about 10% of the sample, actually reports weekly wages.
. *If you're going to use weekly wages, keep in mind that most
. *of the workers in the sample don't report weekly wages (I'm not sure why).
. summarize ernval2 if ernval2>0, detail
main job earnings, losses recoded to zero
-------------------------------------------------------------
Percentiles Smallest
1% 200 1
5% 1200 1
10% 3000 1 Obs 71370
25% 10000 1 Sum of Wgt. 71370
50% 21319.5 Mean 28801.05
Largest Std. Dev. 31102.15
75% 38000 362302
90% 60000 362302 Variance 9.67e+08
95% 76000 362302 Skewness 3.515875
99% 197387 362302 Kurtosis 22.47451
. *71,370 respondents reported positive earnings the previous year,
which seems about right given the
percent of the population that is in the labor force.
. tabulate unmem
labor union |
member p139 | Freq. Percent Cum.
--------------+-----------------------------------
not in univ | 120,249 89.93 89.93
Yes, in union | 1,883 1.41 91.34
non-union | 11,578 8.66 100.00
--------------+-----------------------------------
Total | 133,710 100.00
. *Like grswk, unmem (union membership) applies (either yes or no) to only 13,461
. *respondents in the sample (1,883 + 11,578), far fewer than the number who actually work.
. tabulate lsfr
variable lsfr not found
r(111);
. tabulate lfsr
labor force status p145 | Freq. Percent Cum.
-----------------------------+-----------------------------------
children or AF | 30,904 23.11 23.11
working | 62,726 46.91 70.02
with job, not at work | 2,340 1.75 71.77
unemployed, looking for work | 2,478 1.85 73.63
unemployed, on layoff | 469 0.35 73.98
Not in Labor Force | 34,793 26.02 100.00
-----------------------------+-----------------------------------
Total | 133,710 100.00
. *about half the sample is in the labor force
. summarize ernval, detail
main job last year earnings p228
-------------------------------------------------------------
Percentiles Smallest
1% 0 -9999
5% 0 -9999
10% 0 -9999 Obs 133710
25% 0 -9999 Sum of Wgt. 133710
50% 1700 Mean 15358.07
Largest Std. Dev. 26895.05
75% 23565 362302
90% 45000 362302 Variance 7.23e+08
95% 60000 362302 Skewness 3.951568
99% 117000 362302 Kurtosis 28.41035
. *ernval is the CPS variable for last year's earnings.
Notice that there are some very negative values.
ernval2 is my variable that simply recodes the negative earnings to zero.
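. *A sketch, not run in this log, of the recode described above, with a check against
. *the existing variable. The name ernval2_chk is hypothetical.
. *    generate ernval2_chk = cond(ernval < 0, 0, ernval)
. *    assert ernval2_chk == ernval2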
. summarize ssval if ssval >0, detail
last year soc security payments p291
-------------------------------------------------------------
Percentiles Smallest
1% 899 1
5% 2610 7
10% 3918 12 Obs 18389
25% 5766 33 Sum of Wgt. 18389
50% 8400 Mean 8731.546
Largest Std. Dev. 4443.982
75% 11346 50000
90% 13698 50000 Variance 1.97e+07
95% 15090 50000 Skewness 2.219075
99% 20364 50000 Kurtosis 19.57047
. summarize age if ssval>0
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
age | 18389 69.17195 13.61569 15 90
. *retired folks get social security.
. summarize pawval if pawval>0, detail
last year welfare payments p305
-------------------------------------------------------------
Percentiles Smallest
1% 26 1
5% 214 1
10% 450 1 Obs 1289
25% 1026 1 Sum of Wgt. 1289
50% 2664 Mean 3253.134
Largest Std. Dev. 2813.505
75% 4668 15600
90% 7000 19999 Variance 7915809
95% 8400 23292 Skewness 1.79416
99% 12648 25000 Kurtosis 9.428488
. summarize age if pawval>0
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
age | 1289 33.79829 12.82859 15 88
. *younger folks receive welfare
. tabulate htype
household type h25 | Freq. Percent Cum.
--------------------------------------+-----------------------------------
husband wife primary family | 87,770 65.64 65.64
husband wife primary, AF | 1,223 0.91 66.56
unmarried civilian male householder | 6,436 4.81 71.37
unmarried civilian female householder | 19,000 14.21 85.58
AF and unmarried | 15 0.01 85.59
civilian male nonfamily householder | 9,345 6.99 92.58
civilian female nonfamily householder | 9,797 7.33 99.91
nonfamily household, AF | 38 0.03 99.94
group quarters | 86 0.06 100.00
--------------------------------------+-----------------------------------
Total | 133,710 100.00
. tabulate state
HG-ST60, or |
simply |
state of |
residence |
h40 | Freq. Percent Cum.
------------+-----------------------------------
11 | 1,353 1.01 1.01
12 | 1,297 0.97 1.98
13 | 1,237 0.93 2.91
14 | 2,885 2.16 5.06
15 | 1,289 0.96 6.03
16 | 1,409 1.05 7.08
21 | 8,669 6.48 13.57
22 | 4,029 3.01 16.58
23 | 5,036 3.77 20.35
31 | 4,628 3.46 23.81
32 | 1,686 1.26 25.07
33 | 5,626 4.21 29.28
34 | 4,511 3.37 32.65
35 | 1,860 1.39 34.04
41 | 1,809 1.35 35.39
42 | 1,630 1.22 36.61
43 | 1,503 1.12 37.74
44 | 1,485 1.11 38.85
45 | 1,629 1.22 40.07
46 | 1,728 1.29 41.36
47 | 1,662 1.24 42.60
51 | 1,327 0.99 43.59
52 | 1,430 1.07 44.66
53 | 1,183 0.88 45.55
54 | 1,940 1.45 47.00
55 | 1,615 1.21 48.21
56 | 3,271 2.45 50.65
57 | 1,306 0.98 51.63
58 | 2,004 1.50 53.13
59 | 6,939 5.19 58.32
61 | 1,629 1.22 59.54
62 | 1,646 1.23 60.77
63 | 1,736 1.30 62.06
64 | 1,553 1.16 63.23
71 | 1,715 1.28 64.51
72 | 1,702 1.27 65.78
73 | 1,855 1.39 67.17
74 | 8,027 6.00 73.17
81 | 1,730 1.29 74.47
82 | 1,963 1.47 75.93
83 | 1,664 1.24 77.18
84 | 2,042 1.53 78.71
85 | 2,347 1.76 80.46
86 | 2,562 1.92 82.38
87 | 1,968 1.47 83.85
88 | 2,106 1.58 85.42
91 | 1,661 1.24 86.67
92 | 1,598 1.20 87.86
93 | 13,325 9.97 97.83
94 | 1,607 1.20 99.03
95 | 1,298 0.97 100.00
------------+-----------------------------------
Total | 133,710 100.00
. *notice that with the state variable, you'll have to look up the codes.
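. *A sketch, not run in this log, of attaching value labels once the codes are looked up.
. *The three codes shown are an assumption (they follow the Census division-based scheme);
. *verify every code against the CPS data dictionary. The label name stlbl is hypothetical.
. *    label define stlbl 11 "Maine" 21 "New York" 93 "California"
. *    label values state stlbl
. *    tabulate state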
. tabulate hpmsasz
metropolita |
n area size |
h56 | Freq. Percent Cum.
------------+-----------------------------------
0 | 34,809 26.03 26.03
2 | 7,852 5.87 31.91
3 | 10,970 8.20 40.11
4 | 14,228 10.64 50.75
5 | 21,561 16.13 66.88
6 | 7,927 5.93 72.80
7 | 36,363 27.20 100.00
------------+-----------------------------------
Total | 133,710 100.00
. tabulate hpmsasz
metropolitan |
area size h56 | Freq. Percent Cum.
-----------------+-----------------------------------
rural or unknown | 34,809 26.03 26.03
250K-499K | 7,852 5.87 31.91
500K-999K | 10,970 8.20 40.11
4 | 14,228 10.64 50.75
1M-2.5M | 21,561 16.13 66.88
2.5M-5M | 7,927 5.93 72.80
>5M | 36,363 27.20 100.00
-----------------+-----------------------------------
Total | 133,710 100.00
. tabulate hcccr
residence in |
central city |
h58 | Freq. Percent Cum.
-------------+-----------------------------------
central city | 32,481 24.29 24.29
suburbs | 51,468 38.49 62.78
rural | 29,658 22.18 84.97
unknown | 20,103 15.03 100.00
-------------+-----------------------------------
Total | 133,710 100.00
. tabulate frelu18
number of |
kids in fam |
under 18 |
f29 | Freq. Percent Cum.
------------+-----------------------------------
0 | 59,349 44.39 44.39
1 | 23,599 17.65 62.04
2 | 28,223 21.11 83.14
3 | 14,360 10.74 93.88
4 | 5,531 4.14 98.02
5 | 1,680 1.26 99.28
6 | 518 0.39 99.66
7 | 280 0.21 99.87
8 | 85 0.06 99.94
9 | 85 0.06 100.00
------------+-----------------------------------
Total | 133,710 100.00
. tabulate povll
ratio of |
fam income |
to poverty |
level f38 | Freq. Percent Cum.
------------+-----------------------------------
<.5 | 6,579 4.92 4.92
.5-.74 | 4,534 3.39 8.31
.75- .99 | 5,560 4.16 12.47
1- 1.24 | 6,259 4.68 17.15
1.25-1.49 | 6,727 5.03 22.18
1.5-1.74 | 6,592 4.93 27.11
1.75-1.99 | 6,452 4.83 31.94
2- 2.49 | 12,507 9.35 41.29
2.5- 2.99 | 11,601 8.68 49.97
3- 3.49 | 9,858 7.37 57.34
3.5- 3.99 | 8,967 6.71 64.05
4- 4.49 | 7,910 5.92 69.96
4.5-4.99 | 6,517 4.87 74.84
5+ | 33,647 25.16 100.00
------------+-----------------------------------
Total | 133,710 100.00
. *poverty level calculation includes family income and family size.
. summarize fwsval if fwsval>0
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
fwsval | 112499 52263.22 45610.18 1 513472
. summarize fwsval, detail
family income f48
-------------------------------------------------------------
Percentiles Smallest
1% 0 0
5% 0 0
10% 0 0 Obs 133710
25% 10963 0 Sum of Wgt. 133710
50% 34000 Mean 43972.47
Largest Std. Dev. 45987.47
75% 63000 513472
90% 96500 513472 Variance 2.11e+09
95% 123614 513472 Skewness 2.274276
99% 229639 513472 Kurtosis 11.88614
. summarize wgt2
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
wgt2 | 133710 2043.481 1233.998 0 15347
. *This is the weight variable you should be using
. summarize famwgt2
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
famwgt2 | 133710 2020.472 1052.381 98 12904
. * The family weight variable is really only applicable to analyses that take the family
. * as the basic unit of analysis, which is hard to do because I've made this an individual-level dataset.
. summarize wgt2 if wgt2>0
Variable | Obs Mean Std. Dev. Min Max
-------------+--------------------------------------------------------
wgt2 | 120800 2261.869 1091.568 80 15347
. *There are 12,910 individuals (133,710 - 120,800, nearly 10% of the sample) who have wgt2=0.
. *That doesn't seem like a good thing. I'm not sure why that is...
. tabulate age if wgt2==0
p15 | Freq. Percent Cum.
------------+-----------------------------------
0 | 207 1.60 1.60
1 | 241 1.87 3.47
2 | 236 1.83 5.30
3 | 231 1.79 7.09
4 | 286 2.22 9.30
5 | 290 2.25 11.55
6 | 282 2.18 13.73
7 | 278 2.15 15.89
8 | 252 1.95 17.84
9 | 287 2.22 20.06
10 | 278 2.15 22.22
11 | 244 1.89 24.11
12 | 223 1.73 25.83
13 | 244 1.89 27.72
14 | 231 1.79 29.51
15 | 230 1.78 31.29
16 | 205 1.59 32.88
17 | 235 1.82 34.70
18 | 197 1.53 36.23
19 | 207 1.60 37.83
20 | 196 1.52 39.35
21 | 184 1.43 40.77
22 | 206 1.60 42.37
23 | 222 1.72 44.09
24 | 219 1.70 45.79
25 | 228 1.77 47.55
26 | 240 1.86 49.41
27 | 214 1.66 51.07
28 | 223 1.73 52.80
29 | 217 1.68 54.48
30 | 238 1.84 56.32
31 | 232 1.80 58.12
32 | 238 1.84 59.96
33 | 225 1.74 61.70
34 | 240 1.86 63.56
35 | 245 1.90 65.46
36 | 238 1.84 67.30
37 | 233 1.80 69.11
38 | 208 1.61 70.72
39 | 239 1.85 72.57
40 | 206 1.60 74.17
41 | 192 1.49 75.65
42 | 196 1.52 77.17
43 | 182 1.41 78.58
44 | 168 1.30 79.88
45 | 181 1.40 81.29
46 | 129 1.00 82.29
47 | 125 0.97 83.25
48 | 126 0.98 84.23
49 | 115 0.89 85.12
50 | 142 1.10 86.22
51 | 126 0.98 87.20
52 | 142 1.10 88.30
53 | 88 0.68 88.98
54 | 89 0.69 89.67
55 | 86 0.67 90.33
56 | 85 0.66 90.99
57 | 76 0.59 91.58
58 | 67 0.52 92.10
59 | 58 0.45 92.55
60 | 66 0.51 93.06
61 | 66 0.51 93.57
62 | 52 0.40 93.97
63 | 50 0.39 94.36
64 | 51 0.40 94.76
65 | 60 0.46 95.22
66 | 41 0.32 95.54
67 | 51 0.40 95.93
68 | 46 0.36 96.29
69 | 44 0.34 96.63
70 | 34 0.26 96.89
71 | 45 0.35 97.24
72 | 44 0.34 97.58
73 | 36 0.28 97.86
74 | 41 0.32 98.18
75 | 36 0.28 98.46
76 | 19 0.15 98.61
77 | 23 0.18 98.78
78 | 19 0.15 98.93
79 | 14 0.11 99.04
80 | 20 0.15 99.19
81 | 14 0.11 99.30
82 | 14 0.11 99.41
83 | 12 0.09 99.50
84 | 13 0.10 99.60
85 | 17 0.13 99.74
86 | 6 0.05 99.78
87 | 6 0.05 99.83
88 | 3 0.02 99.85
89 | 5 0.04 99.89
90 | 14 0.11 100.00
------------+-----------------------------------
Total | 12,910 100.00
. *The folks with wgt2=0 span all ages.
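. *A sketch, not run in this log, of further checks on the zero-weight cases. Note that
. *observations with a zero frequency weight contribute nothing to any of the
. *[fweight=wgt2] results above, so they drop out of every weighted analysis.
. *    count if wgt2 == 0
. *    tabulate lfsr if wgt2 == 0
. *    tabulate state if wgt2 == 0
. *    summarize famwgt2 if wgt2 == 0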
. tabulate citizen
citizenship p733 | Freq. Percent Cum.
----------------------------------+-----------------------------------
native born in US | 116,220 86.92 86.92
native, born in territories | 1,090 0.82 87.73
native, born abroad of US parents | 976 0.73 88.46
foreign born, naturalized | 5,348 4.00 92.46
foreign born, non US citizen | 10,076 7.54 100.00
----------------------------------+-----------------------------------
Total | 133,710 100.00
. tabulate health
self |
reported |
health |
status p800 | Freq. Percent Cum.
------------+-----------------------------------
Excellent | 46,934 35.10 35.10
very good | 41,175 30.79 65.90
good | 30,730 22.98 88.88
fair | 10,356 7.75 96.62
poor | 4,515 3.38 100.00
------------+-----------------------------------
Total | 133,710 100.00
. *The following variables have hundreds of categories and I won't tabulate them here:
. *occ (occupation)
. * penatvty (person's place of birth)
. * pemntvty (mother's place of birth)
. * pefntvty (father's place of birth)
. * hgmsac (metropolitan area code)
. * ptotr is the total person income recode. It has many categories. Its one advantage over
. * ernval and ernval2 is that ptotr includes income other than wage and salary income.
. * pppos2 is a family sequence variable that you're better off not using.
. tabulate edlvl
4 categories ed |
attainment | Freq. Percent Cum.
-----------------+-----------------------------------
<12th grade | 52,552 39.30 39.30
12 grade, no dip | 1,491 1.12 40.42
HS diploma | 31,970 23.91 64.33
>HS | 47,697 35.67 100.00
-----------------+-----------------------------------
Total | 133,710 100.00