SPSS Analysis and Interpretation Assignment #3: Chi-Square Analysis
The following is an explanation of the variables you will encounter when opening up the Heart_Defect_Type.sav SPSS data set used for question #1 of this assignment. Notice that there is only one column of data in this data set, and it is coded as shown in the description column in the variable table. In SPSS you can alternate how the raw data is displayed by clicking on the menu icon shown in the circle in the figure below. Click in this icon a few times so you know what is going on with the coding (i.e., atrial septal defect without pulmonary stenosis or pulmonary hypertension will be replaced with its coded value of “Y2” and so forth). Note this icon will also work with other analyses that use coding – t-tests, ANOVAs etc.)
Variable Name |
Description |
Heart_Defect_Type |
Type of Heart Defect Recorded Y2: Atrial septal defect without pulmonary stenosis or pulmonary hypertension Y3: Ventricular septal defect with valvular pulmonary stenosis Y4: Isolated pulmonary hypertension Y5: Transposed great vessels Y6: Ventricular septal defect without pulmonary hypertension Y7: Ventricular septal defect with pulmonary hypertension
|
The data set depicts the 6 types of heart defects reported at a certain hospital. There is some question concerning the number of occurrences of these conditions when compared to the occurrence rates for the general population.
1. Please open up the HEART_DEFECTS Data SP22 WK3.sav file using SPSS and generate the frequency statistics as well as a Bar Chart for the variable Heart_Defect_Type. Summarize the results (look at relative frequencies). Perform a Chi-Square Goodness of Fit test on the variable Heart_Defect_Type using the following expected values (proportions) from the general population
Y2: atrial septal defect without pulmonary stenosis or pulmonary hypertension (.17)
Y3: ventricular septal defect with valvular pulmonary stenosis (.09)
Y4: isolated pulmonary hypertension (.02)
Y5: transposed great vessels (.12)
Y6: ventricular septal defect without pulmonary hypertension (.46)
Y7: ventricular septal defect with pulmonary hypertension (.14)
Summarize your results. Does this particular hospital exhibit the same distribution of heart defects as the general population? In other words, does this hospital differ in the prevalence of these particular heart defects compared to all other hospitals that reported the incidence rates of these heart defects? Interpret the actual value of the Chi-Square significance level. Is there anything “interesting” about the significance value?
· Required Table: Frequency table for Heart_Defect_Type
· Bar Chart: Bar chart for Heart_defect_Type
· Produce an Observed/Hypothesized Frequency Grouped Bar Chart to help with discussion. (Note: you may have to stretch this figure in order to get all the labels)
The following is an explanation of the variables you will encounter when opening up the POWER_LINES Data SP22 Wk3.sav SPSS data set used for question #2 of this assignment. Both of the variables are coded in this data set and the procedure to alternate between the coded values of the variable and the variable names as described above.
Variable Name |
Description |
Leukemia |
Cancer type Yes: Participant has been diagnosed with Leukemia No: Participant has not been diagnosed with Leukemia |
Location |
Distance from low voltage power lines Near: <500 feet from low voltage power lines Far: >500 feet from low voltage power lines |
2. Please open up the POWER_LINES Data SP22 Wk3.sav file using SPSS. Perform a Chi-Square Test of Independence to see whether or not living near low-voltage power lines is associated with higher occurrences of Leukemia. Summarize your results by discussing the Chi-square table – what about the results made this significant or not significant. In other words what was the direction of the results (i.e., was there a higher incidence of Leukemia in certain groups as measured by distance from power lines?) and give a detailed interpretation of the significance value. Disclaimer: The data used for this question are based on the findings of several studies but is not necessarily indicative of overall results. Actual results show that there may be an association between leukemia and distance from low-voltage power lines, but these results have not been replicated in other studies – there are currently no strong statements either way.
3. Generate two simple hypothetical research studies of potential interest to you – one that would be analyzed using the Chi-Square Goodness of Fit Test and the other analyzed by the Chi-Square Test of Independence. Identify the variables in each and identify the characteristics of the research study that dictates the choice of Chi-Square test. You do not have to come up with hypothetical data nor is there any analyzing involved in this question.
When completing this assignment keep in mind that I am interested in whether you can successfully “decipher” and derive meaning from the results. Your answers may differ significantly from someone else’s answers – this does not mean that one person is correct and the other incorrect. There are multiple answers and multiple ways of answering the same question.
CS 340 Milestone One Guidelines and Rubric Overview: For this assignment, you will implement the fundamental operations of create, read, update,
Retail Transaction Programming Project Project Requirements: Develop a program to emulate a purchase transaction at a retail store. This
7COM1028 Secure Systems Programming Referral Coursework: Secure
Create a GUI program that:Accepts the following from a user:Item NameItem QuantityItem PriceAllows the user to create a file to store the sales receip
CS 340 Final Project Guidelines and Rubric Overview The final project will encompass developing a web service using a software stack and impleme