0% found this document useful (0 votes)
102 views

Project #3 Hypothesis Testing Project

This project uses data on 200 births from North Carolina to test several hypotheses about birth outcomes. The hypotheses test things like the mean age of mothers, mean weeks of gestation, percentage of low birth weights, and whether smoking impacts birth weight. Frequency tables and summary statistics will be generated for relevant variables. Six hypotheses will then be tested, following four steps: stating the null and alternative hypotheses, reporting the test statistic and p-value, and stating the conclusion as to whether the null hypothesis is rejected. An additional hypothesis will be proposed and tested by the student.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
102 views

Project #3 Hypothesis Testing Project

This project uses data on 200 births from North Carolina to test several hypotheses about birth outcomes. The hypotheses test things like the mean age of mothers, mean weeks of gestation, percentage of low birth weights, and whether smoking impacts birth weight. Frequency tables and summary statistics will be generated for relevant variables. Six hypotheses will then be tested, following four steps: stating the null and alternative hypotheses, reporting the test statistic and p-value, and stating the conclusion as to whether the null hypothesis is rejected. An additional hypothesis will be proposed and tested by the student.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 2

Project #3 Hypothesis Testing Project

For this project, we will use a subset of the North Carolina birth data set. The data set
ncbirth200.sav is a random sample of 200 births from the data set ncbirth1450.sav. When
doing this assignment, make sure you are working with the data set with only 200 observations!
In this assignment you will test hypotheses relating to mage, weeks, tounces, low, and smoke.

Variable Label Description


plurality Number of children born of the pregnancy
sex Sex of child (1=Male, 2=Female)
mage Age of mother (years)
weeks Completed Weeks of Gestation (weeks)
marital Marital status (1=married, 2=not married)
racemom Race of Mother (0=Other Non-white, 1=White, 2=Black 3=American Indian,
4=Chinese, 5=Japanese, 6=Hawaiian, 7=Filipino, 8=Other Asian or Pacific
Islander)
hispmom Mother of Hispanic origin (C=Cuban, M=Mexican, N=Non-Hispanic,
O=Other and Unknown Hispanic, P=Puerto Rican, S=Central/South American,
U=Not Classifiable)
gained Weight gained during pregnancy (pounds)
smoke 0=mother did not smoke during pregnancy
1=mother did smoke during pregnancy
drink 0=mother did not consume alcohol during pregnancy
1=mother did consume alcohol during pregnancy
tounces Weight of child (ounces)
tgrams Weight of child (grams)
low 0=infant was not low birth weight
1=infant was low birth weight
Premie 0=infant was not premature
1=infant was premature
premature defined at 36 weeks or sooner

Begin the assignment by proving a frequency table for the percentage of low birth weights and a
frequency table for the percentage of smokers. Create a summary table (mean, median, standard
deviation, minimum, maximum) for the continuous variables of mage, weeks, and tounces.

1
With the information that you gather from this summary, test the following (you will need to do
the tests of proportion by hand, but tests of means can be done using the computer):

a. Determine if there is sufficient evidence to conclude the mean age of mothers giving birth
in North Carolina is over 25 years of age at the 0.05 level of significance.
b. Determine if there is sufficient evidence to conclude the mean weeks of gestation of
mothers giving birth in North Carolina is below 39 weeks.
c. Determine if there is sufficient evidence to conclude that the mean weight of babies born
to mothers in North Carolina is above 7 lbs. (Note that there are 16 ounces in a pound.)
d. Determine if there is sufficient evidence to conclude the percentage of low birth weight
children in North Carolina is above 6%.
e. Determine if there is sufficient evidence to conclude the percentage of mothers who
smoke in North Carolina is above 10%.
f. Construct a side-by-side boxplot for tounces for smokers and non-smokers. Comment on
whether you believe you will reject or fail to reject the null hypothesis. Determine if
there is sufficient evidence to conclude the mean tounces of smoking mothers is lower
than the mean birth weight for non-smoking mothers.

For each of the tests above, in your report, be sure to


1. Clearly state a null and alternative hypothesis
2. Give the value of the test statistic
3. Report the P-value
4. Clearly state your conclusion (i.e. ‘Reject the Null’ is not sufficient)

For d and e above, be sure to check the assumptions associated with a test of a proportion.

Lastly, propose and conduct your own test of hypotheses. You can test a single mean, a single
proportion or compare two means for two independent groups. Make sure your test follows the
four steps above.

You might also like