Analyze the assumptions associated with the independent-samples t-tests and the implications when assumptions are violated.

The accuracy of parametric statistical tests is largely based on the data distribution of the collected data. Parametric tests are based on distribution assumptions, such as normality, linearity, equality of variances, etc. These assumptions and others vary based on the statistical test; therefore, it is critical for quantitative researchers to evaluate the assumptions pertaining to their statistical analyses and identify actions taken if assumptions are grossly violated.

Review the Lumley et al. (2002) article, as well as Lessons 19–21 and 24 in the Green and Salkind (2017) text. Use the Walden Library databases to identify a research example using your doctoral research proposal and consider the role and importance of the assumptions underlying each parametric test.

Post a comparison of one-sample, paired-samples, and independent-samples t-tests within the context of quantitative doctoral business research. In your comparison, do the following:

Describe the research example related to your doctoral research proposal.
Describe a hypothetical example appropriate for each t-test, ensuring that the variables are appropriately identified.
Analyze the assumptions associated with the independent-samples t-tests and the implications when assumptions are violated.
Explain options researchers have when assumptions are violated.

If we were to take the average of the 35 students’ scores, what is the probability that their average would be higher than 1150?

For a study, 5 high school seniors’ names are chosen in a random drawing. Their SAT scores are: 1100, 1200, 900, 1250 and 950.

Mean is 1080

Standard deviation is 136.38

Answer the questions below:

Question #1:  Let’s assume that nationwide individual scores follow a Normal Distribution with the mean of 1100 and a standard deviation of 150. What percentage of individual students would you expect to get a score of 1150 or higher?

Question #2:  Let’s say that we sample 35 students from another school which is your typical American school (same individual mean and standard deviation as d above). If we were to take the average of the 35 students’ scores, what is the probability that their average would be higher than 1150?

 

 

How many ways are there to choose 3 people to be president, vice-president, and secretary out of 20 people?

Please watch and

for which one video is a demonstration of how to calculate factorial, nCr, and nPr on Microsoft Excel.

Then, please decide which counting rule or method applies for any of the following problems:

1. How many ways are there to choose 3 people out of 20 people?

2. How many ways are there to choose 3 people to be president, vice-president, and secretary out of 20 people?

3. A baseball team has a 25-man roster. A batting order has nine people. How many different batting orders are there?

4. Given the letters A through G, how many different 4-letter passwords can be made if no letter is used more than once?

Show the polynomial and linear trendline charts from Excel charting.

Unit 6 Dropbox Assignment Answers by (Insert your name here)

This template is only for the first part of the Assignment. See specific instructions in the Assignment for part 2.

In the summary tables below, insert only the answers. You will show work after the summary section.

Question 2

a)   Moving average forecast for year 13

Weighted moving average forecast for year 13

MAD for part a

MAD for part b

Recommended forecast method (justify):

Work

Show all your work for the questions below. Make sure to also attach your Excel file to avoid losing points

Question 1

Show the errors you calculated.

Question 2

Show the two forecasts and the errors Copy and paste the entire forecast table from Excel.

Question 3

Show the polynomial and linear trendline charts from Excel charting. Do not use multiple regression analysis for this question or you will lose points.

What is the probability of a score falling between a raw score of 70 and 80?

  1. Questions 3a through 3d are based on a distribution of scores with and the standard deviation = 6.38. Draw a small picture to help you see what is required.
  2. What is the probability of a score falling between a raw score of 70 and 80?
  3. What is the probability of a score falling above a raw score of 80?
  4. What is the probability of a score falling between a raw score of 81 and 83?
  5. What is the probability of a score falling below a raw score of 63?
  6. Jake needs to score in the top 10% in order to earn a physical fitness certificate. The class mean is 78 and the standard deviation is 5.5. What raw score does he need?
  7. What does statistical significance mean? How do you know if something is statistically significant? What is the difference between statistical significance and practical significance?
  8. How does understanding probability help you understand inferential statistics?
  9. When have you used probability in everyday life? How did you use it?
  10. When have you used probability in everyday life? How did you use it?

Describe the importance of ensuring the unit of analysis aligns with the doctoral research purpose.

Post an assessment of the impact of the unit-of-analysis selection in quantitative doctoral business research. In your assessment, do the following:

Describe the importance of ensuring the unit of analysis aligns with the doctoral research purpose.
Explain the broader implications of selecting the incorrect unit of analysis on the practice to business.
Analyze the relationship between sample size for the chosen unit of analysis and statistical power.
Justify how and why the unit of analysis for you proposed quantitative study is appropriate for your research question.

Explain what kind of relationship is shown by the scatterplot (negative or positive? Strong or weak?) (4pts)

1. Find a set of data that includes two variables you think may be related. In your report (Word document or Google doc), explain which variable you think would be the independent variable and which one you think would be the dependent variable and why (You can find a list of data sources at the end of this document). (6pts)
For example, if you found a data set that had crime rate and average income for 100 cities, you might expect average income to predict crime rate. The more affluent the community, the less likely for crime to occur. So average income would be the independent variable and crime rate would be the dependent variable (crime rate “depends on” average income).

2. Find a quote, statistic, or sentence from an article that would support the relationship you have decided to test. (2pts)
For example, “On the enforcement side, expansion and modernization of the police force, improved apprehension system, stricter implementation of punishments, changes in legal and justice systems, and general advancements in education, values and societal norms and conducts account for the lesser crime rate.” (Source:
3. Using Megastat, run the scatterplot, correlation, and regression analyses. Using cut and paste, put the scatterplot within your document/report. Explain what kind of relationship is shown by the scatterplot (negative or positive? Strong or weak?) (4pts)

What is the optimal production Schedule and optimal profit? Write it below.

BUS 365, Introduction to Business Analytics

Your assignment is to formulate the attached problem as a linear program with an objective to maximize profit. You would use Excel solver to obtain the answer.

1) Write the problem formulation in algebraic form. When you do this, you can use the variable names as X1, X2, X3, X4 and X5. This will make the typing easier. (40 points)

2) Create a model in Excel. You can start with the Golf Bag Problem Excel Sheet and modify it.

Solve it using Excel Solver. (40 points)

Show the screenshot of the optimal screen (note: copy as picture/paste, and size it properly)

3) What is the optimal production Schedule and optimal profit? Write it below. (10 points)

4) Of the five resource constraints, which are non-binding constraints? Write them below. Why are they non-binding? Explain very briefly in one sentence. (10 points)

Using the mean and standard deviation from the Descriptive Statistics, calculate the interval in which about 99.7% of the data should lie according to the EmpiricalRule

sing the Excel file STAT Assignment Data in the Stat Assignment folder in Blackboard, you will see a list of 200 values in column A. You will want to sort the values in column A by choosing the Data tab at the top and then sort the data in increasing order. These values were randomly generated from a Normal distribution with a mean of 100 and a standard deviation of 20. You will also see the Descriptive Statistics from this data starting in cell C1.1) Using the mean and standard deviation from the Descriptive Statistics, calculate the interval in which about 68% of the data should lie according to the Empirical Rule.2) From the sorted column A, find the percentage of data that actually lie in your interval calculated in 1) by counting the number of values that fit in your interval and dividing by 200.3) Using the mean and standard deviation from the Descriptive Statistics, calculate the interval in which about 95% of the data should lie according to the Empirical Rule.

4) From the sorted column A, find the percentage of data that actually lie (without rounding) in your interval calculated in 3) by counting the number of values that fit in your interval and dividing by 200.5) Using the mean and standard deviation from the Descriptive Statistics, calculate the interval in which about 99.7% of the data should lie according to the EmpiricalRule

How does this effect influence the dynamics of your model?

From further research,it is found that there is a linear relationship between the fraction of occupied carrying capacity and the % of potential customers that will go to the park. A certain percentage (x%) of all the potential customers will go to the park if there are very few people in the park. No customers (0%) will go to the park if there are already 400.000 people in the park since,then,the park is overcrowded.

1.Make a causal loop diagram for the opening of a national park.5 points

2.How do you think the dynamics will be based on your causal loop diagram? Formulate your dynamic hypothesis.2 points

3.Now transform the causal loop diagram into a stock and flow diagram.3 points

4.Give a short description of the detail of your model and how it can be recreated.3 points

5.Explain the dynamics of your model. Identify the elements of your model and describei) thekind of growth presentand ii)the influence of changes in model parameters.7 points

6.Does the model set into equilibrium after a while? If so, is this a static or dynamic equilibrium?2 points3.Influence of visitors to the parkIn the first discussion session with the park’s directive and park ranger,you present your model. The park ranger tells that,from experience,he knows that the people visiting the park, intentional or unintentional, damage the park and all its facilities. In turn, the carrying capacity of the park is reduced. They ask you to incorporate this dynamic into your model.

1.You come up with two different ways to incorporate this feature into your model. First,you try to add a feature where all the people damage the park’s nature or facilities. You suggest that thecarrying capacity gets lower by 5% of the total numberof visitors inthe previous year. Implement this strategy.4 points

2.Explain the dynamics of the model after you introduced the influence of people on the carrying capacity. What is the difference betweenthis and the previous model? And how does the model react to different values of your newly implemented feature? 3 points3.The second idea is that the plants, animals,and facilities in the park only get affected after a certain amount people are in the park. Implement this idea into your model and show what happens if the park only gets affected after 300.000 visitors. 5 points4.Comparethe difference between the first and second ideas. Which one do you think would be a better representation of the system? 2 points 4.Regaining StrengthAfter you implemented both models,you notice a critical mistake. In your model. the facilities in the park are never renewed,or nature neverrecoversfrom its damage.

System dynamics 2020Assignment 1

1.Implement a mechanism in your model of 3.1 and 3.3 such that the carrying capacity of the park regains itself. Make sure that you explain the formulas that you have used and why you have chosen these formulas.3 points

2.How does this effect influence the dynamics of your model?3 points

3.What would happen if we introduce a time delay in the model, either between the visitors and carrying capacity or between the number of visitors and nature’s recovery?Explainit;you do not need to model this.

2 points5.Finalizing your modelTo be able to finishyour work,one crucial thing is missingin your model. The model still lacks a guide for employees that have to useit. If the employees do not know how to use your model, they will stop using,and your work will be lost in a drawer forever!

1.Add by yourself one additional feature to the model. Explain why you think this feature is important for the model.4 points

2.Explain how this feature influencesthe dynamics. Does it havea significant impact on th

dynamics?3points

3.When you deliver the model to your client,it should always be clear what the domain of validity is for your parameters/model and what its limitations are. Lists these features for the employees of the national park services and give a short guide for the model