The variables would be: The students academic performance Socio-economic of parents income, based on their residence, occupation and educational experiences. This week you will perform a basic linear regression. * Please be aware of the very strict data requirement for running linear regression: your DV and IV both have to be continuous variables. (Most variables at interval/ratio level are continuous variables.) This rule is solid for DV: if your DV is a nominal or ordinal variable, you CANNOT use it as the DV for regression, not even when it is converted to a dummy variable (b/c the regression is no longer linear). It is a necessity that your DV is a continuous variable with interval/ratio level of measurement. If your current DV wont work for regression test, please choose a continuous variable from GSS data set as your temporary DV for the week in order to practice regression analysis. Some examples of continuous variables from GSS 2012 data: tvhours, hrs1, etc. You dont have to include regression test in your final portfolio if your DV wont work for regressing test. Keep in mind, regression is also a form of significance test. Your porfolio only needs ONE significance test (we have learned: independent sample t-test, dependent sample t-test, Chi-square, and regression.) IV: male dummy variable (based on variable sex) and white dummy variable (based on variable race) See equation below. We use * to mark the variable that is statistically significant. If an IV is not continuous (like race, sex), you could make things work by creating dummy variables based on these variables. For example, based on variable sex, we can make a male dummy variable or a female dummy variable. Based on variable race, we can create a white dummy variable or black dummy variable, or other dummy variable. By custom, well name the dummy variable using the value we coded as 1. For example, if we denote male as 1, female as 0, well name this dummy as male dummy variable. If we denote white as 1, then well name this dummy as white dummy variable. This naming method helps readers/researchers remember/understand what dummy variables stand for in a study. Here is a youtube video which shows the essential steps of creating dummy variables: In this weeks forum discussion, you are required to run a linear regression using: 1. your DV (if your DV is not a continuous variable, pick one from the GSS 2012 data set as your temporary DV for the week so you can practice regression) 2. and two dummy variables created based on variable sex and race in the GSS 2012 data set. The proper way to interpret linear regression is writing the regression equation. Here is an example: DV: educ (highest year of school completed, a continuous variable at I/R level) IV: male dummy variable (based on variable sex) and white dummy variable (based on variable race) See equation below. We use * to mark the variable that is statistically significant. Educ=13.031-.054male+.696white* Here is the fun part: prediction of respondents highest year of school completed based on their race and sex. Based on this equation: A white male by average will have 13.673 years of education: 13.031-.054*1+.696*1=13.673 A nonwhite female by average will have 13.031 years of education: 13.031-.054*0+.696*0=13.031