(a) Provide appropriate names and labels for the variables and present a screenshot of the Variable view. (2 marks)
(b) Create a fifth variable named HDI_level that groups the countries into two groups of High HDI (0.851 to 0.957) and Low HDI (0.459 to 0.674) and obtain a frequency of the new variable. (2 marks)
(c) What is the average mean years of schooling in the ten countries with high HDI? (2 marks)
(d) What is the average life expectancy for the ten countries with low HDI? (2 marks)
(e) Comment on the standard deviations for the average life expectancy in the two groups of countries. (2 marks)
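The logic behind parts (b) to (e) can be sketched outside SPSS as follows. This is a minimal illustration in Python, not the required SPSS workflow: the four rows of data, the function names `hdi_level` and `group_stats`, and the variable names are all hypothetical, and only the two HDI ranges come from part (b).

```python
from collections import Counter
from statistics import mean, stdev

# Hypothetical rows: (country, HDI, life expectancy in years).
# These values are made up for illustration; they are NOT the assignment data.
rows = [
    ("A", 0.957, 82.4),
    ("B", 0.851, 78.0),
    ("C", 0.674, 69.6),
    ("D", 0.459, 61.2),
]


def hdi_level(hdi):
    """Classify an HDI value using the two ranges stated in part (b)."""
    if 0.851 <= hdi <= 0.957:
        return "High"
    if 0.459 <= hdi <= 0.674:
        return "Low"
    return None  # value falls outside both stated ranges


# Frequency of the new grouping variable, as asked for in part (b).
freq = Counter(hdi_level(hdi) for _, hdi, _ in rows)


def group_stats(level):
    """Return (mean, sample standard deviation) of life expectancy for one group."""
    values = [life for _, hdi, life in rows if hdi_level(hdi) == level]
    return mean(values), stdev(values)


high_mean, high_sd = group_stats("High")
low_mean, low_sd = group_stats("Low")
```

Comparing `high_sd` with `low_sd` is the kind of comparison part (e) asks for: a larger standard deviation means life expectancy varies more between countries within that group.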
This section requires you to use SPSS to enter, manipulate and analyse data. Please present your work in Word, i.e. copy the relevant SPSS output and
You should include the syntax for all your SPSS commands as an appendix to your assessment submission.
To produce a syntax file, follow these steps:
Click File > New > Syntax, as shown in the picture below.
The data are available on Moodle in the Artefact 1 Assessment section. The data file contains information on 8,165 women aged between 15 and 49 years at the time of the survey. There are 11 variables, two of which (BMI and Age) are continuous, and the other nine (Age_group, Province, Residence, Education, Water_source, Wealth_Index, Breastfeeding_status, Anaemia, Marital_status and Occupation) are categorical.
A full description of the variables, including the coding scheme used, is shown in Table 2 below. (You can also click the "Variable View" tab in the Data Editor window in SPSS and then look at the "Label" and "Values" columns.)
1. Recode the variable 'Age_group' into a new variable called 'Age_group2' with three categories (15 to 24 years, 25 to 39 years and 40 to 49 years). Make sure that the recoded variable is clearly labelled. Obtain frequencies of the new variable Age_group2. What proportion of women are aged 25 to 39 years, and what percentage of women are in the age group 40 to 49 years? (4 marks)
2. Recode the variable Anaemia into a new variable, Anaemic, with two categories (Anaemic and Not Anaemic), and produce a frequency table of the recoded variable. (2 marks)
3. Recode the variable Marital_status into a new variable, Marital_status2, with two categories: one category should represent those who are married or living with a partner, and the other should represent those who are unmarried or without a partner. Produce a frequency table of the recoded variable. (2 marks)
4. Obtain and present the mean BMI and the 99% confidence interval for the mean BMI, to two decimal places, for each category of the Occupation variable. Based on the confidence intervals, does the mean BMI differ significantly by Occupation? Interpret the confidence interval for the BMI of women who are not working. (6 marks)
5. Conduct a statistical test to explore whether the mean BMI differs between women who are anaemic and those who are not. What is the appropriate test to carry out? Test at the 5% significance level. Make sure you state your hypotheses clearly, give the value of the test statistic and present your conclusions based on the result of the test. (7 marks)
6. Is there an association between a woman’s residential status (Residence) and her education level (Education)? What is the appropriate test to carry out? Carry out the appropriate test at the 1% significance level to answer this question, stating your null and alternative hypotheses clearly. If an association exists, what is the nature of this association? (6 marks)
7. Undertake a multiple linear regression analysis in SPSS using the enter method to identify the best model of the factors associated with BMI for women in the Democratic Republic of Congo. Your response variable is BMI (Body Mass Index) and the potential explanatory variables are all other variables available in the data set.
a. Identify a selection of explanatory variables for inclusion in the linear regression model. Your identification of explanatory variables should refer to literature sources on the factors or correlates of under-nutrition, over-nutrition and obesity, of which BMI is a measure. Your answer should be presented in a paragraph and should include in-text citations of the sources that support your arguments, following the APA 7th referencing style. A References list should also be provided. (Maximum 500 words). (10 marks)
b. Compute and present the frequencies of all dummy variables that are potential explanatory variables for the multiple linear regression model. (6 marks)
c. Run a multiple linear regression, then present and interpret the SPSS output from the multiple linear regression analysis. (5 marks)
d. Undertake a residual analysis to assess if your final model fulfills the assumptions of a linear regression model. (5 marks)
e. Discuss the results of the multiple linear regression analysis, stating whether or not they are consistent with previous studies. Where there are inconsistencies, explain their potential sources. (Maximum 500 words). You need to cite references and provide a References list using the APA 7th referencing style. (8 marks)
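As an aid to interpreting the confidence intervals requested in question 4, the arithmetic behind an interval for a mean can be sketched as below. This is a hedged illustration in Python rather than SPSS output: the function name `mean_ci_normal` and the sample values are hypothetical, and the sketch uses the normal critical value instead of the t distribution, so its result will be close to a t-based interval only when the group is large.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def mean_ci_normal(values, level=0.99):
    """Return (mean, lower, upper) for a confidence interval of the mean.

    Assumption: uses the normal critical value (about 2.576 for 99%),
    which approximates the t critical value only for large samples.
    """
    n = len(values)
    m = mean(values)
    se = stdev(values) / sqrt(n)               # standard error of the mean
    z = NormalDist().inv_cdf(0.5 + level / 2)  # two-sided critical value
    return m, m - z * se, m + z * se


# Hypothetical BMI values for one occupation category (not the assignment data).
sample_bmi = [20.0, 22.0, 24.0, 26.0]
m, lower, upper = mean_ci_normal(sample_bmi)
print(f"mean = {m:.2f}, 99% CI = ({lower:.2f}, {upper:.2f})")
```

The interval is the mean plus or minus the critical value times the standard error, so groups with fewer women or more variable BMI values produce wider intervals. Intervals for two categories that do not overlap suggest that their mean BMI values differ.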