Hi6007 Statistics For Business Decisions-Independent Assessment Answers
Consumer Research, Inc., is an independent agency that conducts research on consumer attitudes and behaviours for a variety of firms. In one study, a client asked for an investigation of consumer characteristics that can be used to predict the amount charged by credit card users. Data were collected on annual income, household size, and annual credit card charges for a sample of 50 consumers. The following data are recorded for Consumer information.
Income ($1000s) |
Household Size |
Amount Charged ($) |
Income ($1000s) |
Household Size |
Amount Charged ($) |
54 |
3 |
4016 |
54 |
6 |
5573 |
30 |
2 |
3159 |
30 |
1 |
2583 |
32 |
4 |
5100 |
48 |
2 |
3866 |
50 |
5 |
4742 |
34 |
5 |
3586 |
31 |
2 |
1864 |
67 |
4 |
5037 |
55 |
2 |
4070 |
50 |
2 |
3605 |
37 |
1 |
2731 |
67 |
5 |
5345 |
40 |
2 |
3348 |
55 |
6 |
5370 |
66 |
4 |
4764 |
52 |
2 |
3890 |
51 |
3 |
4110 |
62 |
3 |
4705 |
25 |
3 |
4208 |
64 |
2 |
4157 |
48 |
4 |
4219 |
22 |
3 |
3579 |
27 |
1 |
2477 |
29 |
4 |
3890 |
33 |
2 |
2514 |
39 |
2 |
2972 |
65 |
3 |
4214 |
35 |
1 |
3121 |
63 |
4 |
4965 |
39 |
4 |
4183 |
42 |
6 |
4412 |
54 |
3 |
3720 |
21 |
2 |
2448 |
23 |
6 |
4127 |
44 |
1 |
2995 |
27 |
2 |
2921 |
37 |
5 |
4171 |
26 |
7 |
4603 |
62 |
6 |
5678 |
61 |
2 |
4273 |
21 |
3 |
3623 |
30 |
2 |
3067 |
55 |
7 |
5301 |
22 |
4 |
3074 |
42 |
2 |
3020 |
46 |
5 |
4820 |
41 |
7 |
4828 |
66 |
4 |
5149 |
Required:
- Use methods of descriptive statistics to summarize the data. Comment on the findings.
- Develop estimated regression equations, first using annual income as the in- dependent variable and then using household size as the independent variable. Which variable is the better predictor of annual credit card charges Discuss your findings.
- Develop an estimated regression equation with annual income and household size as the independent variables. Discuss your findings.
- What is the predicted annual credit card charge for a three-person household with an annual income of $40,000
- Discuss the need for other independent variables that could be added to the model. What additional variables might be helpful
The data set for group assignment you can find on Blackboard in the folder assignment.
- Draw a histogram for each one of the 11 variables
- Do descriptive statistics (mean, standard deviation, minimum, maximum) for each one of the 11 variabl
- For each correlation discuss the results:
- Are they are positive/negatively correlated
- Are they weak or strong correlations
- What is the significance value
- What does the significance value reveal about the data we have used
Required
- Copy –paste the result from your Excel file to a Word document.
- Copy-paste ALL the output from all the activities requested in Activity 01 to 03 in Excel and put the answers in the same Word document.
- Answer all discussion questions requested in Activity 01 to 03 and put the answers in the same Word document.
- Submit a soft copy of the Excel files used in Excel and the Assignment Word document online under Assignment final submission.
As part of a long-term study of individuals 65 years of age or older, sociologists and physicians at the Wentworth medical Center in upstate New York investigated the relationship between geographic location and depression. A sample of 60 individuals, all in reasonably good health, was selected; 20 individuals were residents of Florida, 20 were residents of New York, and 20 were residents of North Carolina. Each of the individuals sampled was given a standardized test to measure depression.
The data collected follow; higher test scores indicate higher levels of depression. These data are available on the website that accompanies this text in the file named medical1. A second part of the study considered the relationship between geographic location and depression for individuals 65 years of age or older who had a chronic health condition such as arthritis, hypertension, and/or heart ailment. A sample of 60 individuals with such conditions was identified. Again, 20 were residents of Florida, 20 were residents of New York, and 20 were residents of North Carolina. The levels of depression recorded for this study follow. These data are available on the website that accompanies this text in the file named medical2.
Required:
- Use descriptive statistics to summarize the data from the two studies. What are your preliminary observations about the depression scores
- Use analysis of variance on both data sets. State the hypotheses being tested in each case. What are your conclusions
- Use inferences about individual treatment means where appropriate. What are your conclusions.
Answer:
Consumer Research, Inc., is an independent agency that conducts research on consumer attitudes and behaviours for a variety of firms. In one study, a client asked for an investigation of consumer characteristics that can be used to predict the amount charged by credit card users. Data were collected on annual income, household size, and annual credit card charges for a sample of 50 consumers. The following data are recorded for Consumer information.
Solution
- Develop estimated regression equations, first using annual income as the in- dependent variable and then using household size as the independent variable. Which variable is the better predictor of annual credit card charges? Discuss your findings.
Solution
- Regression model using annual income as the in- dependent variable
|
df |
SS |
MS |
F |
Significance F |
Regression |
1 |
16991229 |
16991229 |
31.71892 |
9.1E-07 |
|
Coefficients |
Standard Error |
t Stat |
P-value |
Lower 95% |
Upper 95% |
Lower 95.0% |
Upper 95.0% |
Intercept |
2204.241 |
329.134 |
6.697091 |
0.000 |
1542.472 |
2866.009 |
1542.472 |
2866.009 |
Income ($1000s) |
40.46963 |
7.185716 |
5.631955 |
0.000 |
26.02178 |
54.91748 |
26.02178 |
54.91748 |
The above tables give the regression results. From the results we deduce that; the model is fit to predict the amount charged (p-value < 0.05). The value of R-Squared is 0.3979; this shows that only 39.79% of the variation in amount charged on credit card is explained by the income.
The coefficient of income is 40.47; this means that a unit increase in income would result to an increase in the amount charged on credit card by 40.47
The intercept coefficient is 2204.24; this means that holding other factors constant we would expect the amount charged on credit card to be $2204.24.
The regression model is thus;
- Regression model using household size as the independent variable
SUMMARY OUTPUT | |
Regression Statistics | |
Multiple R |
0.752854 |
R Square |
0.566789 |
Adjusted R Square |
0.557764 |
Standard Error |
620.8163 |
Observations |
50 |
|
Coefficients |
Standard Error |
t Stat |
P-value |
Lower 95% |
Upper 95% |
Lower 95.0% |
Upper 95.0% |
Intercept |
2581.644 |
195.2699 |
13.2209 |
0.000 |
2189.028 |
2974.261 |
2189.028 |
2974.261 |
Household Size |
404.1567 |
50.99978 |
7.924676 |
0.000 |
301.6148 |
506.6986 |
301.6148 |
506.6986 |
The above tables give the regression results. From the results we deduce that; the model is fit to predict the amount charged (p-value < 0.05). The value of R-Squared is 0.5668; this shows that only 56.68% of the variation in amount charged on credit card is explained by the household size.
The coefficient of household size is 404.16; this means that a unit increase in household size would result to an increase in the amount charged on credit card by 404.16.
The intercept coefficient is 2581.64; this means that holding other factors constant we would expect the amount charged on credit card to be $2581.64.
The regression model is thus;
Best model
Regression model using household size as the independent variable is the better predictor of annual credit card charges
- Develop an estimated regression equation with annual income and household size as the independent variables. Discuss your findings.
Solution
SUMMARY OUTPUT | |
Regression Statistics | |
Multiple R |
0.908502 |
R Square |
0.825376 |
Adjusted R Square |
0.817945 |
Standard Error |
398.3249 |
Observations |
50 |
|
Coefficients |
Standard Error |
t Stat |
P-value |
Lower 95% |
Upper 95% |
Lower 95.0% |
Upper 95.0% |
Intercept |
1305.034 |
197.771 |
6.598712 |
0.000 |
907.17 |
1702.898 |
907.17 |
1702.898 |
Income ($1000s) |
33.12196 |
3.970237 |
8.342563 |
0.000 |
25.13487 |
41.10904 |
25.13487 |
41.10904 |
Household Size |
356.3402 |
33.2204 |
10.72655 |
0.000 |
289.5094 |
423.171 |
289.5094 |
423.171 |
The above tables give the regression results. From the results we deduce that; the model is fit to predict the amount charged (p-value < 0.05). The value of R-Squared is 0.8254; this shows that only 82.54% of the variation in amount charged on credit card is explained by the household size and annual income.
The coefficient of income is 33.12; this means that a unit increase in income would result to an increase in the amount charged on credit card by 33.12.
The coefficient of household income is 356.34; this means that a unit increase in household income would result to an increase in the amount charged on credit card by 356.34.
The intercept coefficient is 1305.03; this means that holding other factors constant we would expect the amount charged on credit card to be $1305.03.
The regression model is thus What is the predicted annual credit card charge for a three-person household with an annual income of $40,000? the need for other independent variables that could be added to the model. What additional variables might be helpful?
Solution
Looking at the performance of the regression equation models, we observed that addition of the variables resulted to a better model. This means that the necessary independent variables need to be added to ensure that the model is towards becoming a perfect one.
- For each correlation discuss the results:
- Are they are positive/negatively correlated?
- Are they weak or strong correlations?
- What is the significance value?
- What does the significance value reveal about the data we have used?
Solution
Yes there are both positively and negatively correlated relationships. 23 correlations were positively correlated while 13 correlations were negatively variables.
32 correlations had a weak relationship while 4 correlations had a strong relationship.
Significance value also known as p-value is the probability of getting a result that is equal to or "more extreme" than what was actually observed, given that the null hypothesis is true.
The significance value reveals that 11 correlations were significant while the remaining 25 correlations were insignificant.
As part of a long-term study of individuals 65 years of age or older, sociologists and physicians at the Wentworth medical Center in upstate New York investigated the relationship between geographic location and depression. A sample of 60 individuals, all in reasonably good health, was selected; 20 individuals were residents of Florida, 20 were residents of New York, and 20 were residents of North Carolina. Each of the individuals sampled was given a standardized test to measure depression.
The data collected follow; higher test scores indicate higher levels of depression. These data are available on the website that accompanies this text in the file named medical1. A second part of the study considered the relationship between geographic location and depression for individuals 65 years of age or older who had a chronic health condition such as arthritis, hypertension, and/or heart ailment. A sample of 60 individuals with such conditions was identified. Again, 20 were residents of Florida, 20 were residents of New York, and 20 were residents of North Carolina. The levels of depression recorded for this study follow. These data are available on the website that accompanies this text in the file named medical2.
Florida |
New York |
North Carolina |
Florida |
New York |
North Carolina |
3 |
8 |
10 |
13 |
14 |
10 |
7 |
11 |
7 |
12 |
9 |
12 |
7 |
9 |
3 |
17 |
15 |
15 |
3 |
7 |
5 |
17 |
12 |
18 |
8 |
8 |
11 |
20 |
16 |
12 |
8 |
7 |
8 |
21 |
24 |
14 |
8 |
8 |
4 |
16 |
18 |
17 |
5 |
4 |
3 |
14 |
14 |
8 |
5 |
13 |
7 |
13 |
15 |
14 |
2 |
10 |
8 |
17 |
17 |
16 |
6 |
6 |
8 |
12 |
20 |
18 |
2 |
8 |
7 |
9 |
11 |
17 |
6 |
12 |
3 |
12 |
23 |
19 |
6 |
8 |
9 |
15 |
19 |
15 |
9 |
6 |
8 |
16 |
17 |
13 |
7 |
8 |
12 |
15 |
14 |
14 |
5 |
5 |
6 |
13 |
9 |
11 |
4 |
7 |
3 |
10 |
14 |
12 |
7 |
7 |
8 |
11 |
13 |
13 |
3 |
8 |
11 |
17 |
11 |
11 |
Required:
- Use descriptive statistics to summarize the data from the two studies. What are your preliminary observations about the depression scores?
Solution
Descriptive statistics (Quantitative data): | |||
|
|
|
|
Statistic |
Florida |
New York |
North Carolina |
Minimum |
2.000 |
4.000 |
3.000 |
Maximum |
21.000 |
24.000 |
19.000 |
Mean |
10.025 |
11.625 |
10.500 |
Standard deviation (n-1) |
5.260 |
4.913 |
4.512 |
Preliminary observations indicates that residents from Florida have lower depression scores when compared to the two other states. Residents of New York are the most depressed lot of people.
- Use analysis of variance on both data sets. State the hypotheses being tested in each case. What are your conclusions?
Solution
Groups |
Count |
Sum |
Average |
Variance |
Florida |
40 |
401 |
10.025 |
27.66603 |
New York |
40 |
465 |
11.625 |
24.13782 |
North Carolina |
40 |
420 |
10.5 |
20.35897 |
We conducted a one-way ANOVA to test whether the mean depression scores are equal across the three states. The hypothesis we sought to test is;
Ho: the mean depression scores are equal across the three states
H1: at least one of the states has a different mean depression score
Results showed that we had to reject the null hypothesis and conclude that the mean depression scores are equal across the three states
- Use inferences about individual treatment means where appropriate. What are your conclusions?
Solution
In this part, I tested for individual treatment means where I tested that the mean depression score is greater than 10 for all the three states.
The hypothesis are;
H0: µ = 10
H0: µ > 10
For each state
One-Sample Statistics | ||||
|
N |
Mean |
Std. Deviation |
Std. Error Mean |
Florida |
40 |
10.0250 |
5.25985 |
.83166 |
New York |
40 |
11.6250 |
4.91303 |
.77682 |
North Carolina |
40 |
10.5000 |
4.51209 |
.71342 |
One-Sample Test | ||||||
|
Test Value = 10 | |||||
t |
df |
Sig. (2-tailed) |
Mean Difference |
95% Confidence Interval of the Difference | ||
Lower |
Upper | |||||
Florida |
.030 |
39 |
.976 |
.02500 |
-1.6572 |
1.7072 |
New York |
2.092 |
39 |
.043 |
1.62500 |
.0537 |
3.1963 |
North Carolina |
.701 |
39 |
.488 |
.50000 |
-.9430 |
1.9430 |
Results revealed that only New York showed mean depression scores as being significantly greater than 10 (M = 11.625, p-value < 0.05). The other two states Florida and North Carolina had though had depression scores greater than 10, the values were not significantly greater than 10 (M = 10.0250, p-value > 0.05 and M = 10.500, p-value > 0.05 respectively).
Buy Hi6007 Statistics For Business Decisions-Independent Assessment Answers Online
Talk to our expert to get the help with Hi6007 Statistics For Business Decisions-Independent Assessment Answers to complete your assessment on time and boost your grades now
The main aim/motive of the management assignment help services is to get connect with a greater number of students, and effectively help, and support them in getting completing their assignments the students also get find this a wonderful opportunity where they could effectively learn more about their topics, as the experts also have the best team members with them in which all the members effectively support each other to get complete their diploma assignments. They complete the assessments of the students in an appropriate manner and deliver them back to the students before the due date of the assignment so that the students could timely submit this, and can score higher marks. The experts of the assignment help services at urgenthomework.com are so much skilled, capable, talented, and experienced in their field of programming homework help writing assignments, so, for this, they can effectively write the best economics assignment help services.