-----------------------------------------------------------------------------------------------------
. set mem 20m

Current memory allocation

current memory usage settable value description (1M = 1024k) -------------------------------------------------------------------- set maxvar 5000 max. variables allowed 1.733M set memory 20M max. data space 20.000M set matsize 400 max. RHS vars in models 1.254M ----------- 22.987M
. use cps_y2k_numeric.dta, clear

. describe

Contains data from cps_y2k_numeric.dta obs: 133,710 vars: 42 8 May 2004 13:26 size: 9,894,540 (52.8% of memory free) ------------------------------------------------------------------------------- storage display value variable name type format label variable label ------------------------------------------------------------------------------- phseq str5 %9s household sequence number p2 pernum byte %8.0g age byte %8.0g p15 maritl byte %26.0g marlbl Marital Status p17 sex byte %8.0g sexnm p20 vet byte %22.0g vetnm veteran status p21 hga byte %8.0g Educational Attainment p22 race byte %11.0g racenm p25 reorigin byte %25.0g hisplbl Hispanic Origin p27 hrs1 byte %8.0g hours worked last week p76 clswkr byte %32.0g cwrknm sector of worker p109 grswk int %9.0g gross weekly wages p135 unmem byte %13.0g unnm labor union member p139 lfsr byte %28.0g lfsrnm labor force status p145 ernval float %9.0g main job last year earnings p228 ssval long %12.0g last year soc security payments p291 pawval int %12.0g last year welfare payments p305 wgt2 int %9.0g rounded weight based on p50 ernval2 float %9.0g main job earnings, losses recoded to zero htype byte %37.0g htpnm household type h25 state byte %8.0g HG-ST60, or simply state of residence h40 hpmsasz byte %8.0g metropolitan area size h56 hcccr byte %8.0g residence in central city h58 frelu18 byte %8.0g number of kids in fam under 18 f29 povll byte %8.0g ratio of fam income to poverty level f38 fwsval float %9.0g family income f48 famwgt2 int %8.0g adjusted family weight f233 yrsed float %9.0g years of education, from hga citizen byte %33.0g citnm citizenship p733 health byte %11.0g hlthnm self reported health status p800 occ int %8.0g occupation P 106 ptotr byte %8.0g total person income categories P466 penatvty int %8.0g country of birth P 722, Appendix H pemntvty int %8.0g Mother's country of birth, P725, appendix H pefntvty int %8.0g Father's country of birth, P728, appendix H peinusyr byte %8.0g time of immigration, P 731 pxnatvty byte %8.0g allocation flag for country of birth P 734 hgmsac int %8.0g metropolitan area code, h44, appendix E pppos2 byte %8.0g family sequence number within each household p46 edlvl byte %16.0g edlabel 4 categories ed attainment hispanic byte %12.0g smhisplbl dichotomoy hispanic yes/no new_race byte %18.0g new_race race and Hispanic combined ------------------------------------------------------------------------------- Sorted by:

. *variable phseq should not be used for this class- it's a household sequence number . *variable pernum should not be used for this class.
Along with phseq, pernum allows for indentific
ation of
household relationships, like who's married to whom in the household.
. tabulate age

p15 | Freq. Percent Cum. ------------+----------------------------------- 0 | 1,713 1.28 1.28 1 | 1,932 1.44 2.73 2 | 1,950 1.46 4.18 3 | 1,939 1.45 5.63 4 | 1,965 1.47 7.10 5 | 1,998 1.49 8.60 6 | 2,059 1.54 10.14 7 | 2,176 1.63 11.77 8 | 2,163 1.62 13.38 9 | 2,243 1.68 15.06 10 | 2,202 1.65 16.71 11 | 2,083 1.56 18.27 12 | 2,035 1.52 19.79 13 | 2,047 1.53 21.32 14 | 1,979 1.48 22.80 15 | 2,046 1.53 24.33 16 | 1,965 1.47 25.80 17 | 1,998 1.49 27.29 18 | 1,847 1.38 28.67 19 | 1,826 1.37 30.04 20 | 1,722 1.29 31.33 21 | 1,687 1.26 32.59 22 | 1,638 1.23 33.81 23 | 1,622 1.21 35.03 24 | 1,662 1.24 36.27 25 | 1,666 1.25 37.52 26 | 1,640 1.23 38.74 27 | 1,726 1.29 40.03 28 | 1,801 1.35 41.38 29 | 1,995 1.49 42.87 30 | 1,907 1.43 44.30 31 | 1,991 1.49 45.79 32 | 1,890 1.41 47.20 33 | 1,898 1.42 48.62 34 | 2,024 1.51 50.13 35 | 2,134 1.60 51.73 36 | 2,123 1.59 53.32 37 | 2,099 1.57 54.89 38 | 2,064 1.54 56.43 39 | 2,228 1.67 58.10 40 | 2,190 1.64 59.74 41 | 2,115 1.58 61.32 42 | 2,137 1.60 62.92 43 | 2,091 1.56 64.48 44 | 2,114 1.58 66.06 45 | 2,118 1.58 67.64 46 | 1,939 1.45 69.10 47 | 1,957 1.46 70.56 48 | 1,827 1.37 71.93 49 | 1,767 1.32 73.25 50 | 1,865 1.39 74.64 51 | 1,802 1.35 75.99 52 | 1,825 1.36 77.35 53 | 1,695 1.27 78.62 54 | 1,301 0.97 79.59 55 | 1,323 0.99 80.58 56 | 1,324 0.99 81.57 57 | 1,304 0.98 82.55 58 | 1,128 0.84 83.39 59 | 1,129 0.84 84.24 60 | 1,154 0.86 85.10 61 | 1,051 0.79 85.89 62 | 1,073 0.80 86.69 63 | 938 0.70 87.39 64 | 952 0.71 88.10 65 | 1,014 0.76 88.86 66 | 869 0.65 89.51 67 | 926 0.69 90.20 68 | 908 0.68 90.88 69 | 904 0.68 91.56 70 | 913 0.68 92.24 71 | 885 0.66 92.90 72 | 770 0.58 93.48 73 | 797 0.60 94.08 74 | 814 0.61 94.68 75 | 796 0.60 95.28 76 | 704 0.53 95.81 77 | 646 0.48 96.29 78 | 687 0.51 96.80 79 | 602 0.45 97.25 80 | 514 0.38 97.64 81 | 476 0.36 97.99 82 | 425 0.32 98.31 83 | 427 0.32 98.63 84 | 325 0.24 98.87 85 | 306 0.23 99.10 86 | 248 0.19 99.29 87 | 209 0.16 99.44 88 | 172 0.13 99.57 89 | 155 0.12 99.69 90 | 416 0.31 100.00 ------------+----------------------------------- Total | 133,710 100.00

. *not very illuminating. . summarize age

Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- age | 133710 35.17964 22.21722 0 90

. summarize age [fweight=wgt2]

Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- age | 2.732e+08 35.31592 22.1685 0 90

. *note the topcode max of 90 . tabulate maritl

Marital Status p17 | Freq. Percent Cum. ---------------------------+----------------------------------- married, spouse present | 55,585 41.57 41.57 married, AF spouse present | 351 0.26 41.83 married, spouse absent | 1,355 1.01 42.85 widowed | 6,561 4.91 47.75 divorced | 9,523 7.12 54.88 separated | 2,097 1.57 56.44 never married | 58,238 43.56 100.00 ---------------------------+----------------------------------- Total | 133,710 100.00

. tabulate maritl [fweight=wgt2]

Marital Status p17 | Freq. Percent Cum. ---------------------------+----------------------------------- married, spouse present | 112733750 41.26 41.26 married, AF spouse present | 658,497 0.24 41.50 married, spouse absent | 2,713,867 0.99 42.49 widowed | 13630062 4.99 47.48 divorced | 19629420 7.18 54.67 separated | 4,430,097 1.62 56.29 never married | 119438128 43.71 100.00 ---------------------------+----------------------------------- Total | 273233821 100.00

. tabulate sex

p20 | Freq. Percent Cum. ------------+----------------------------------- male | 64,791 48.46 48.46 female | 68,919 51.54 100.00 ------------+----------------------------------- Total | 133,710 100.00

. tabulate sex [fweight=wgt2]

p20 | Freq. Percent Cum. ------------+----------------------------------- male | 133187798 48.74 48.74 female | 140046023 51.26 100.00 ------------+----------------------------------- Total | 273233821 100.00

. tabulate vet

veteran status p21 | Freq. Percent Cum. -----------------------+----------------------------------- children or current AF | 30,904 23.11 23.11 Vietnam | 3,683 2.75 25.87 Korea | 1,716 1.28 27.15 WWII | 2,428 1.82 28.97 Other Service | 3,830 2.86 31.83 Non Veteran | 91,149 68.17 100.00 -----------------------+----------------------------------- Total | 133,710 100.00

. tabulate vet [fweight=wgt2]

veteran status p21 | Freq. Percent Cum. -----------------------+----------------------------------- children or current AF | 60277572 22.06 22.06 Vietnam | 7,505,156 2.75 24.81 Korea | 3,469,644 1.27 26.08 WWII | 5,185,150 1.90 27.98 Other Service | 8,218,282 3.01 30.98 Non Veteran | 188578017 69.02 100.00 -----------------------+----------------------------------- Total | 273233821 100.00

. tabulate hga

Educational | Attainment | p22 | Freq. Percent Cum. ------------+----------------------------------- 0 | 30,484 22.80 22.80 31 | 457 0.34 23.14 32 | 1,187 0.89 24.03 33 | 2,320 1.74 25.76 34 | 4,527 3.39 29.15 35 | 4,161 3.11 32.26 36 | 4,695 3.51 35.77 37 | 4,721 3.53 39.30 38 | 1,491 1.12 40.42 39 | 31,970 23.91 64.33 40 | 18,797 14.06 78.39 41 | 3,758 2.81 81.20 42 | 3,328 2.49 83.69 43 | 14,705 11.00 94.68 44 | 4,918 3.68 98.36 45 | 1,229 0.92 99.28 46 | 962 0.72 100.00 ------------+----------------------------------- Total | 133,710 100.00

. *This is an educational attainment variable.
The categories are a bit weird, so I created a more u
seful variable
for years of education.
. tabulate yrsed

years of | education, | from hga | Freq. Percent Cum. ------------+----------------------------------- 0 | 30,941 23.14 23.14 1.5 | 1,187 0.89 24.03 5.5 | 2,320 1.74 25.76 7.5 | 4,527 3.39 29.15 9 | 4,161 3.11 32.26 10 | 4,695 3.51 35.77 11 | 4,721 3.53 39.30 12 | 33,461 25.03 64.33 13 | 18,797 14.06 78.39 14 | 7,086 5.30 83.69 16 | 14,705 11.00 94.68 17 | 4,918 3.68 98.36 19 | 1,229 0.92 99.28 22 | 962 0.72 100.00 ------------+----------------------------------- Total | 133,710 100.00

. tabulate race

p25 | Freq. Percent Cum. ------------+----------------------------------- White | 113,475 84.87 84.87 Black | 13,626 10.19 95.06 Amer Indian | 1,894 1.42 96.47 Asian | 4,715 3.53 100.00 ------------+----------------------------------- Total | 133,710 100.00

. tabulate reorigin

Hispanic Origin p27 | Freq. Percent Cum. --------------------------+----------------------------------- Mexican American | 6,447 4.82 4.82 Chicano | 384 0.29 5.11 Mexican | 8,155 6.10 11.21 Puerto Rican | 2,280 1.71 12.91 Cuban | 943 0.71 13.62 Central or South American | 3,487 2.61 16.23 Other Spanish | 1,863 1.39 17.62 Non Hispanic | 108,641 81.25 98.87 Don't Know | 471 0.35 99.22 N/A | 1,039 0.78 100.00 --------------------------+----------------------------------- Total | 133,710 100.00

. tabulate hispanic

dichotomoy | hispanic | yes/no | Freq. Percent Cum. -------------+----------------------------------- Non Hispanic | 110,151 82.38 82.38 Hispanic | 23,559 17.62 100.00 -------------+----------------------------------- Total | 133,710 100.00

. tabulate new_race

race and Hispanic | combined | Freq. Percent Cum. -------------------+----------------------------------- Non Hispanic White | 90,988 68.05 68.05 Non Hispanic Black | 12,930 9.67 77.72 NH American Indian | 1,649 1.23 78.95 NH Asian | 4,584 3.43 82.38 Hispanic | 23,559 17.62 100.00 -------------------+----------------------------------- Total | 133,710 100.00

. *race and reorigin are original CPS variables,
hispanic and new_race are variables I have added.
. summarize hrs1, detail

hours worked last week p76 ------------------------------------------------------------- Percentiles Smallest 1% -1 -1 5% -1 -1 10% -1 -1 Obs 133710 25% -1 -1 Sum of Wgt. 133710

50% 0 Mean 18.1734 Largest Std. Dev. 22.08705 75% 40 99 90% 48 99 Variance 487.8376 95% 55 99 Skewness .6339475 99% 70 99 Kurtosis 2.090072

. *here's a little surprise that the data dictionary does not predict:
a quarter of the sample has -1

for hours worked, clearly a 'not in universe' answer.
So how many people in the sample had positi
ve hours worked? . summarize hrs1 if hrs1>0, detail

hours worked last week p76 ------------------------------------------------------------- Percentiles Smallest 1% 4 1 5% 14 1 10% 20 1 Obs 62726 25% 35 1 Sum of Wgt. 62726

50% 40 Mean 39.37833 Largest Std. Dev. 13.8791 75% 45 99 90% 55 99 Variance 192.6293 95% 60 99 Skewness .0870914 99% 80 99 Kurtosis 4.812558

. *62,726 of the 133,710 had positive hours worked . tabulate clswkr

sector of worker p109 | Freq. Percent Cum. ---------------------------------+----------------------------------- not in universe, children, or AF | 64,773 48.44 48.44 private | 51,618 38.60 87.05 federal govt | 1,763 1.32 88.37 state govt | 2,884 2.16 90.52 local govt | 5,235 3.92 94.44 self employed- incorporated | 2,122 1.59 96.02 self employed, not incorp | 5,042 3.77 99.80 unpaid | 71 0.05 99.85 never worked | 202 0.15 100.00 ---------------------------------+----------------------------------- Total | 133,710 100.00

. tabulate clswkr, nolabel

sector of | worker p109 | Freq. Percent Cum. ------------+----------------------------------- 0 | 64,773 48.44 48.44 1 | 51,618 38.60 87.05 2 | 1,763 1.32 88.37 3 | 2,884 2.16 90.52 4 | 5,235 3.92 94.44 5 | 2,122 1.59 96.02 6 | 5,042 3.77 99.80 7 | 71 0.05 99.85 8 | 202 0.15 100.00 ------------+----------------------------------- Total | 133,710 100.00

. *If you're looking at type of worker (i.e. government vs private vs self employed),
you need to dis
card the half of the sample that is 'not in the universe' for this question. . summarize grswk if grswk>0

Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- grswk | 13422 601.9655 461.0122 1 2885

. summarize grswk if grswk>0, detail

gross weekly wages p135 ------------------------------------------------------------- Percentiles Smallest 1% 39 1 5% 99 2 10% 158 2 Obs 13422 25% 300 3 Sum of Wgt. 13422

50% 483 Mean 601.9655 Largest Std. Dev. 461.0122 75% 775 2885 90% 1154 2885 Variance 212532.2 95% 1442 2885 Skewness 1.964056 99% 2500 2885 Kurtosis 8.607142

. *Here's a problem with grswk: even though half of the sample works,
only 13,422 or about 10% of the
sample actually reports weekly wages.
If you're going to use weekly wages, keep in mind that most
of the workers in the sample don't report weekly wages (I'm not sure why). . summarize ernval2 if ernval2>0, detail

main job earnings, losses recoded to zero ------------------------------------------------------------- Percentiles Smallest 1% 200 1 5% 1200 1 10% 3000 1 Obs 71370 25% 10000 1 Sum of Wgt. 71370

50% 21319.5 Mean 28801.05 Largest Std. Dev. 31102.15 75% 38000 362302 90% 60000 362302 Variance 9.67e+08 95% 76000 362302 Skewness 3.515875 99% 197387 362302 Kurtosis 22.47451

. *71,370 respondents reported positive earnings the previous year,
which seems about right given the
percent of the population that is in the labor force. . tabulate unmem

labor union | member p139 | Freq. Percent Cum. --------------+----------------------------------- not in univ | 120,249 89.93 89.93 Yes, in union | 1,883 1.41 91.34 non-union | 11,578 8.66 100.00 --------------+----------------------------------- Total | 133,710 100.00

. *Like grswk, unmem (union membership) applies (either yes or no) to only 13,400 or so
respondents i
n the sample, far fewer than the number that actually work . tabulate lsfr variable lsfr not found
r(111);


. tabulate lfsr

labor force status p145 | Freq. Percent Cum. -----------------------------+----------------------------------- children or AF | 30,904 23.11 23.11 working | 62,726 46.91 70.02 with job, not at work | 2,340 1.75 71.77 unemployed, looking for work | 2,478 1.85 73.63 unemployed, on layoff | 469 0.35 73.98 Not in Labor Force | 34,793 26.02 100.00 -----------------------------+----------------------------------- Total | 133,710 100.00

. *about half the sample is in the labor force . summarize ernval, detail

main job last year earnings p228 ------------------------------------------------------------- Percentiles Smallest 1% 0 -9999 5% 0 -9999 10% 0 -9999 Obs 133710 25% 0 -9999 Sum of Wgt. 133710

50% 1700 Mean 15358.07 Largest Std. Dev. 26895.05 75% 23565 362302 90% 45000 362302 Variance 7.23e+08 95% 60000 362302 Skewness 3.951568 99% 117000 362302 Kurtosis 28.41035

. *ernval is the CPS variable for last year's earnings.
Notice that there are some very negative val
ues.
ernval2 is my variable that simply recodes the negative earnings to zero.
. summarize ssval if ssval >0, detail

last year soc security payments p291 ------------------------------------------------------------- Percentiles Smallest 1% 899 1 5% 2610 7 10% 3918 12 Obs 18389 25% 5766 33 Sum of Wgt. 18389

50% 8400 Mean 8731.546 Largest Std. Dev. 4443.982 75% 11346 50000 90% 13698 50000 Variance 1.97e+07 95% 15090 50000 Skewness 2.219075 99% 20364 50000 Kurtosis 19.57047

. summarize age if ssval>0

Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- age | 18389 69.17195 13.61569 15 90

. *retired folks get social security. . summarize pawval if pawval>0, detail

last year welfare payments p305 ------------------------------------------------------------- Percentiles Smallest 1% 26 1 5% 214 1 10% 450 1 Obs 1289 25% 1026 1 Sum of Wgt. 1289

50% 2664 Mean 3253.134 Largest Std. Dev. 2813.505 75% 4668 15600 90% 7000 19999 Variance 7915809 95% 8400 23292 Skewness 1.79416 99% 12648 25000 Kurtosis 9.428488

. summarize age if pawval>0

Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- age | 1289 33.79829 12.82859 15 88

. *younger folks receive welfare . tabulate htype

household type h25 | Freq. Percent Cum. --------------------------------------+----------------------------------- husband wife primary family | 87,770 65.64 65.64 husband wife primary, AF | 1,223 0.91 66.56 unmarried civilian male householder | 6,436 4.81 71.37 unmarried civilian female householder | 19,000 14.21 85.58 AF and unmarried | 15 0.01 85.59 civilian male nonfamily householder | 9,345 6.99 92.58 civilian female nonfamily householder | 9,797 7.33 99.91 nonfamily household, AF | 38 0.03 99.94 group quarters | 86 0.06 100.00 --------------------------------------+----------------------------------- Total | 133,710 100.00

. tabulate state

HG-ST60, or | simply | state of | residence | h40 | Freq. Percent Cum. ------------+----------------------------------- 11 | 1,353 1.01 1.01 12 | 1,297 0.97 1.98 13 | 1,237 0.93 2.91 14 | 2,885 2.16 5.06 15 | 1,289 0.96 6.03 16 | 1,409 1.05 7.08 21 | 8,669 6.48 13.57 22 | 4,029 3.01 16.58 23 | 5,036 3.77 20.35 31 | 4,628 3.46 23.81 32 | 1,686 1.26 25.07 33 | 5,626 4.21 29.28 34 | 4,511 3.37 32.65 35 | 1,860 1.39 34.04 41 | 1,809 1.35 35.39 42 | 1,630 1.22 36.61 43 | 1,503 1.12 37.74 44 | 1,485 1.11 38.85 45 | 1,629 1.22 40.07 46 | 1,728 1.29 41.36 47 | 1,662 1.24 42.60 51 | 1,327 0.99 43.59 52 | 1,430 1.07 44.66 53 | 1,183 0.88 45.55 54 | 1,940 1.45 47.00 55 | 1,615 1.21 48.21 56 | 3,271 2.45 50.65 57 | 1,306 0.98 51.63 58 | 2,004 1.50 53.13 59 | 6,939 5.19 58.32 61 | 1,629 1.22 59.54 62 | 1,646 1.23 60.77 63 | 1,736 1.30 62.06 64 | 1,553 1.16 63.23 71 | 1,715 1.28 64.51 72 | 1,702 1.27 65.78 73 | 1,855 1.39 67.17 74 | 8,027 6.00 73.17 81 | 1,730 1.29 74.47 82 | 1,963 1.47 75.93 83 | 1,664 1.24 77.18 84 | 2,042 1.53 78.71 85 | 2,347 1.76 80.46 86 | 2,562 1.92 82.38 87 | 1,968 1.47 83.85 88 | 2,106 1.58 85.42 91 | 1,661 1.24 86.67 92 | 1,598 1.20 87.86 93 | 13,325 9.97 97.83 94 | 1,607 1.20 99.03 95 | 1,298 0.97 100.00 ------------+----------------------------------- Total | 133,710 100.00

. *notice that with the state variable, you'll have to look up the codes. . tabulate hpmsasz

metropolita | n area size | h56 | Freq. Percent Cum. ------------+----------------------------------- 0 | 34,809 26.03 26.03 2 | 7,852 5.87 31.91 3 | 10,970 8.20 40.11 4 | 14,228 10.64 50.75 5 | 21,561 16.13 66.88 6 | 7,927 5.93 72.80 7 | 36,363 27.20 100.00 ------------+----------------------------------- Total | 133,710 100.00

. tabulate hpmsasz

metropolitan | area size h56 | Freq. Percent Cum. -----------------+----------------------------------- rural or unknown | 34,809 26.03 26.03 250K-499K | 7,852 5.87 31.91 500K-999K | 10,970 8.20 40.11 4 | 14,228 10.64 50.75 1M-2.5M | 21,561 16.13 66.88 2.5M-5M | 7,927 5.93 72.80 >5M | 36,363 27.20 100.00 -----------------+----------------------------------- Total | 133,710 100.00

. tabulate hcccr

residence in | central city | h58 | Freq. Percent Cum. -------------+----------------------------------- central city | 32,481 24.29 24.29 suburbs | 51,468 38.49 62.78 rural | 29,658 22.18 84.97 unknown | 20,103 15.03 100.00 -------------+----------------------------------- Total | 133,710 100.00

. tabulate frelu18

number of | kids in fam | under 18 | f29 | Freq. Percent Cum. ------------+----------------------------------- 0 | 59,349 44.39 44.39 1 | 23,599 17.65 62.04 2 | 28,223 21.11 83.14 3 | 14,360 10.74 93.88 4 | 5,531 4.14 98.02 5 | 1,680 1.26 99.28 6 | 518 0.39 99.66 7 | 280 0.21 99.87 8 | 85 0.06 99.94 9 | 85 0.06 100.00 ------------+----------------------------------- Total | 133,710 100.00



. tabulate povll

ratio of | fam income | to poverty | level f38 | Freq. Percent Cum. ------------+----------------------------------- <.5 | 6,579 4.92 4.92 .5-.74 | 4,534 3.39 8.31 .75- .99 | 5,560 4.16 12.47 1- 1.24 | 6,259 4.68 17.15 1.25-1.49 | 6,727 5.03 22.18 1.5-1.74 | 6,592 4.93 27.11 1.75-1.99 | 6,452 4.83 31.94 2- 2.49 | 12,507 9.35 41.29 2.5- 2.99 | 11,601 8.68 49.97 3- 3.49 | 9,858 7.37 57.34 3.5- 3.99 | 8,967 6.71 64.05 4- 4.49 | 7,910 5.92 69.96 4.5-4.99 | 6,517 4.87 74.84 5+ | 33,647 25.16 100.00 ------------+----------------------------------- Total | 133,710 100.00

. *poverty level calculation includes family income and family size. . summarize fwsval if fwsval>0

Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- fwsval | 112499 52263.22 45610.18 1 513472

. summarize fwsval, detail

family income f48 ------------------------------------------------------------- Percentiles Smallest 1% 0 0 5% 0 0 10% 0 0 Obs 133710 25% 10963 0 Sum of Wgt. 133710

50% 34000 Mean 43972.47 Largest Std. Dev. 45987.47 75% 63000 513472 90% 96500 513472 Variance 2.11e+09 95% 123614 513472 Skewness 2.274276 99% 229639 513472 Kurtosis 11.88614

. summarize wgt2

Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- wgt2 | 133710 2043.481 1233.998 0 15347

. *This is the weight variable you should be using . summarize famwgt2

Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- famwgt2 | 133710 2020.472 1052.381 98 12904

. * The family weight variable is really only applicable to analyses that take the family
as the basi
c unit of analysis, which is hard to do since I've made the dataset an individual level dataset. . summarize wgt2 if wgt2>0

Variable | Obs Mean Std. Dev. Min Max -------------+-------------------------------------------------------- wgt2 | 120800 2261.869 1091.568 80 15347

. *There are nearly 13,000 individuals who have wgt2=0. That doesn't seem like a good thing.
I'm no
t sure why that is... . tabulate age if wgt2==0

p15 | Freq. Percent Cum. ------------+----------------------------------- 0 | 207 1.60 1.60 1 | 241 1.87 3.47 2 | 236 1.83 5.30 3 | 231 1.79 7.09 4 | 286 2.22 9.30 5 | 290 2.25 11.55 6 | 282 2.18 13.73 7 | 278 2.15 15.89 8 | 252 1.95 17.84 9 | 287 2.22 20.06 10 | 278 2.15 22.22 11 | 244 1.89 24.11 12 | 223 1.73 25.83 13 | 244 1.89 27.72 14 | 231 1.79 29.51 15 | 230 1.78 31.29 16 | 205 1.59 32.88 17 | 235 1.82 34.70 18 | 197 1.53 36.23 19 | 207 1.60 37.83 20 | 196 1.52 39.35 21 | 184 1.43 40.77 22 | 206 1.60 42.37 23 | 222 1.72 44.09 24 | 219 1.70 45.79 25 | 228 1.77 47.55 26 | 240 1.86 49.41 27 | 214 1.66 51.07 28 | 223 1.73 52.80 29 | 217 1.68 54.48 30 | 238 1.84 56.32 31 | 232 1.80 58.12 32 | 238 1.84 59.96 33 | 225 1.74 61.70 34 | 240 1.86 63.56 35 | 245 1.90 65.46 36 | 238 1.84 67.30 37 | 233 1.80 69.11 38 | 208 1.61 70.72 39 | 239 1.85 72.57 40 | 206 1.60 74.17 41 | 192 1.49 75.65 42 | 196 1.52 77.17 43 | 182 1.41 78.58 44 | 168 1.30 79.88 45 | 181 1.40 81.29 46 | 129 1.00 82.29 47 | 125 0.97 83.25 48 | 126 0.98 84.23 49 | 115 0.89 85.12 50 | 142 1.10 86.22 51 | 126 0.98 87.20 52 | 142 1.10 88.30 53 | 88 0.68 88.98 54 | 89 0.69 89.67 55 | 86 0.67 90.33 56 | 85 0.66 90.99 57 | 76 0.59 91.58 58 | 67 0.52 92.10 59 | 58 0.45 92.55 60 | 66 0.51 93.06 61 | 66 0.51 93.57 62 | 52 0.40 93.97 63 | 50 0.39 94.36 64 | 51 0.40 94.76 65 | 60 0.46 95.22 66 | 41 0.32 95.54 67 | 51 0.40 95.93 68 | 46 0.36 96.29 69 | 44 0.34 96.63 70 | 34 0.26 96.89 71 | 45 0.35 97.24 72 | 44 0.34 97.58 73 | 36 0.28 97.86 74 | 41 0.32 98.18 75 | 36 0.28 98.46 76 | 19 0.15 98.61 77 | 23 0.18 98.78 78 | 19 0.15 98.93 79 | 14 0.11 99.04 80 | 20 0.15 99.19 81 | 14 0.11 99.30 82 | 14 0.11 99.41 83 | 12 0.09 99.50 84 | 13 0.10 99.60 85 | 17 0.13 99.74 86 | 6 0.05 99.78 87 | 6 0.05 99.83 88 | 3 0.02 99.85 89 | 5 0.04 99.89 90 | 14 0.11 100.00 ------------+----------------------------------- Total | 12,910 100.00

. *The folks with wgt2=0 have all ages.. . tabulate citizen

citizenship p733 | Freq. Percent Cum. ----------------------------------+----------------------------------- native born in US | 116,220 86.92 86.92 native, born in territories | 1,090 0.82 87.73 native, born abroad of US parents | 976 0.73 88.46 foreign born, naturalized | 5,348 4.00 92.46 foreign born, non US citizen | 10,076 7.54 100.00 ----------------------------------+----------------------------------- Total | 133,710 100.00

. tabulate health

self | reported | health | status p800 | Freq. Percent Cum. ------------+----------------------------------- Excellent | 46,934 35.10 35.10 very good | 41,175 30.79 65.90 good | 30,730 22.98 88.88 fair | 10,356 7.75 96.62 poor | 4,515 3.38 100.00 ------------+----------------------------------- Total | 133,710 100.00

. *The following variables have hundreds of categories and I won't tabulate them here: . *occ (occupation) . * penatvty (person's place of birth) . * pemntvty (mother's place of birth) . * pefntvty (father's place of birth) . * hgmsac (metropolitan area code) . * ptotr is total person income recode. It has many categories. The one use of ptotr in comparsion > to ernval and ernval2 is that ptotr includes income other than wage and salary income. . * pppos is a family sequence variable that you're better off not using. . tabulate edlvl

4 categories ed | attainment | Freq. Percent Cum. -----------------+----------------------------------- <12th grade | 52,552 39.30 39.30 12 grade, no dip | 1,491 1.12 40.42 HS diploma | 31,970 23.91 64.33 >HS | 47,697 35.67 100.00 -----------------+----------------------------------- Total | 133,710 100.00