--------------------------------------------------------------------------------------------- name: log: /Users/avatuccio/Desktop/Data Description.log log type: text opened on: 4 Dec 2024, 20:16:23 . do "/Volumes/middfiles/Classes/Fall24/ECON0211C/PUBLIC_HTML/STUDENTS/Meg and Ava Project/ResearchProjectRegMeg.do" . ** Meg Simon and Ava Tuccio ** . . use "/Volumes/middfiles/Classes/Fall24/ECON0211C/PUBLIC_HTML/STUDENTS/Meg and Ava Project/usa_00006.dta" . . ** missing variables . replace degfield = . if degfield==00 (0 real changes made) . replace degfieldd = . if degfieldd==00 (0 real changes made) . . gen earnings=incwage . replace earnings = . if incwage==999999 (0 real changes made) . . ** set our earnings variable to missing if respondent has no employee income, missing hours worked . replace earnings = . if incwage==0 (110,148 real changes made, 110,148 to missing) . . gene hrswork = uhrswork . replace uhrswork=. if uhrswork==00 (85,234 real changes made, 85,234 to missing) . replace uhrswork=. if uhrswork==99 (730 real changes made, 730 to missing) . replace hrswork =. if hrswork==0 (85,234 real changes made, 85,234 to missing) . . ** categorical variables (sex, region, race, education) . tab sex sex | Freq. Percent Cum. --------------+----------------------------------- male | 319,494 45.02 45.02 female | 390,211 54.98 100.00 --------------+----------------------------------- Total | 709,705 100.00 . numlabel, add . gene male=sex==1 . . tab region census region and division | Freq. Percent Cum. ----------------------------------------+----------------------------------- 11. new england division | 42,146 5.94 5.94 12. middle atlantic division | 101,426 14.29 20.23 21. east north central div | 90,876 12.80 33.03 22. west north central div | 39,990 5.63 38.67 31. south atlantic division | 148,214 20.88 59.55 32. east south central div | 33,479 4.72 64.27 33. west south central div | 77,728 10.95 75.22 41. mountain division | 52,814 7.44 82.66 42. pacific division | 123,032 17.34 100.00 ----------------------------------------+----------------------------------- Total | 709,705 100.00 . numlabel, add (no value label to be modified) . tab region census region and division | Freq. Percent Cum. ----------------------------------------+----------------------------------- 11. new england division | 42,146 5.94 5.94 12. middle atlantic division | 101,426 14.29 20.23 21. east north central div | 90,876 12.80 33.03 22. west north central div | 39,990 5.63 38.67 31. south atlantic division | 148,214 20.88 59.55 32. east south central div | 33,479 4.72 64.27 33. west south central div | 77,728 10.95 75.22 41. mountain division | 52,814 7.44 82.66 42. pacific division | 123,032 17.34 100.00 ----------------------------------------+----------------------------------- Total | 709,705 100.00 . gene midwest=region==21 | region==22 . gene northeast=region==11 | region==12 . gene south=region==31 | region==32 | region==33 . gene west=region==41 | region==42 . . gen white_nh=0 . replace white_nh=1 if race==1 & hispan==0 (480,266 real changes made) . gen black_nh=0 . replace black_nh=1 if race==2 & hispan==0 (41,762 real changes made) . gen other_nh=0 . replace other_nh=1 if race>2 & hispan==0 (117,048 real changes made) . gen hispanic=0 . replace hispanic=1 if hispan>0 (70,629 real changes made) . . **check that the proportions of the groups sum to one . sum white_nh black_nh other_nh hispanic Variable | Obs Mean Std. dev. Min Max -------------+--------------------------------------------------------- white_nh | 709,705 .6767122 .4677319 0 1 black_nh | 709,705 .0588442 .235333 0 1 other_nh | 709,705 .1649249 .371113 0 1 hispanic | 709,705 .0995188 .2993576 0 1 . . tab educ educational attainment | [general version] | Freq. Percent Cum. -----------------------------+----------------------------------- 10. 4 years of college | 442,164 62.30 62.30 11. 5+ years of college | 267,541 37.70 100.00 -----------------------------+----------------------------------- Total | 709,705 100.00 . gene post_college=educ==11 . . tab nchild number of own | children in the | household | Freq. Percent Cum. ----------------------+----------------------------------- 0. 0 children present | 392,821 55.35 55.35 1. 1 child present | 130,731 18.42 73.77 2. 2 | 129,452 18.24 92.01 3. 3 | 41,831 5.89 97.90 4. 4 | 10,996 1.55 99.45 5. 5 | 2,592 0.37 99.82 6. 6 | 817 0.12 99.93 7. 7 | 277 0.04 99.97 8. 8 | 114 0.02 99.99 9. 9+ | 74 0.01 100.00 ----------------------+----------------------------------- Total | 709,705 100.00 . gene child=nchild>=1 . . gene married=marst==1&2 . . sum earnings degfield hrswork male midwest northeast south white post_college Variable | Obs Mean Std. dev. Min Max -------------+--------------------------------------------------------- earnings | 599,557 98799.48 102548.9 4 870000 degfield | 709,705 43.8292 17.19298 11 64 hrswork | 624,471 40.40229 11.37595 1 99 male | 709,705 .4501786 .497512 0 1 midwest | 709,705 .1843949 .3878062 0 1 -------------+--------------------------------------------------------- northeast | 709,705 .2022981 .4017136 0 1 south | 709,705 .3655336 .4815798 0 1 white_nh | 709,705 .6767122 .4677319 0 1 post_college | 709,705 .3769749 .4846289 0 1 . . gene communications=degfield==19 . gene education=degfield==23 . gene biology=degfield==36 . gene english=degfield==33 . gene psychology=degfield==52 . gene socialsciences=degfield==55 . gene engineering=degfield==24 . gene finearts=degfield==60 . gene medical=degfield==61 . gene business=degfield==62 . gene other=degfield==11 | degfield==13 | degfield==14 | degfield==15 | degfield==20 | degfield==21 | degfield==22 | degfield > ==25 | degfield==26 | degfield==29 | degfield==32 | degfield==34 | degfield==35 | degfield==37 | degfield==38 | degfie > ld==40 | degfield==41 | degfield==48 | degfield==49 | degfield==50 | degfield==51 | degfield==53 | degfield==54 | degfield= > =56 | degfield==57 | degfield==59 | degfield== 64 . ** . . gen major_cat=1 if communications==1 (679,158 missing values generated) . replace major_cat=2 if education==1 (62,816 real changes made) . replace major_cat=3 if biology==1 (38,407 real changes made) . replace major_cat=4 if english==1 (20,468 real changes made) . replace major_cat=5 if psychology==1 (37,178 real changes made) . replace major_cat=6 if socialsciences==1 (50,179 real changes made) . replace major_cat=7 if engineering==1 (59,464 real changes made) . replace major_cat=8 if finearts==1 (31,674 real changes made) . replace major_cat=9 if medical==1 (58,047 real changes made) . replace major_cat=10 if business==1 (144,166 real changes made) . replace major_cat=11 if other==1 (176,759 real changes made) . . label define major 1 "communications" 2 "education" 3 "biology" 4 "english" 5 "psychology" 6 "socialsciences" 7 "engineering > " 8 "finearts" 9 "medical" 10 "business" 11 "other" , replace . label values major_cat major . . tab major_cat [w= perwt], sum(incwage) mean noobs (analytic weights assumed) | Summary of | wage and | salary | income major_cat | Mean ------------+------------ communica | 76148.461 education | 52772.74 biology | 98674.645 english | 70822.742 psycholog | 64824.904 socialsci | 97527.712 engineeri | 113782.16 finearts | 57167.768 medical | 73259.91 business | 89625.505 other | 81916.285 ------------+------------ Total | 82414.728 . tab major_cat if male==0 [w= perwt], sum(incwage) mean noobs (analytic weights assumed) | Summary of | wage and | salary | income major_cat | Mean ------------+------------ communica | 68073.859 education | 47985.809 biology | 78311.095 english | 61933.634 psycholog | 57013.3 socialsci | 75049.856 engineeri | 86681.456 finearts | 50448.595 medical | 68055.029 business | 69057.652 other | 61495.914 ------------+------------ Total | 64173.176 . tab major_cat if male==1 [w= perwt], sum(incwage) mean noobs (analytic weights assumed) | Summary of | wage and | salary | income major_cat | Mean ------------+------------ communica | 88916.561 education | 70370.507 biology | 125284.46 english | 88934.292 psycholog | 86835.798 socialsci | 119407.55 engineeri | 120907.4 finearts | 67385.851 medical | 97554.092 business | 109048.16 other | 98854.843 ------------+------------ Total | 104026.7 . . ** . gene lnearnings=ln(earnings) (110,148 missing values generated) . . . . . end of do-file . log close name: log: /Users/avatuccio/Desktop/Data Description.log log type: text closed on: 4 Dec 2024, 20:17:09 ------------------------------------------------------------------------------------------------------------------------------