Part 1: Analysis with Galton’s original data set

Galton’s work on children and parents’ height was published in: Galton, F. (1886): “Regression towards mediocrity in hereditary stature”, Journal of the Anthropological Institute, 15: 246-63. In this first part of the project you are asked to reconstruct the original data from this original article and replicate his analysis.

  • Question 1.1. Find Galton’s original article (you can use www.jstor.org). You can also find it on LEARN. On Table I of his article, the data used is summarized. You need to create a STATA data set that contains the 928 observations that Galton collected. It is recommended that you first type the data in an excel file and then have STATA read that file. Some versions of the Galton data set are available online. You are advised NOT to use them. It is part of this project that you show that you understand how to make a data set from such a table. There are important conceptual issues that you will miss if you borrow the data from somewhere else.

(i) For those observations reported in Table I of Galton’s article as “below” or “above” the minimum and maximum height values, you need to assume some particular values. Please state these explicitly in a table (Table 1.1.a.) and provide a justification with one sentence.

(ii) Given your assumptions, what is the sample mean height and standard deviation for adult children and for parents, respectively? Report this in a table (Table 1.1.b.).
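One possible way to go from a frequency table to an observation-level data set is to repeat each (parent height, child height) cell as many times as its count. The sketch below is in Python rather than STATA, and the three cells in it are made-up placeholders, not figures from Galton's Table I; the values you assume for the "below" and "above" categories would enter the same way as any other cell.

```python
from statistics import mean, stdev

# Hypothetical cells: (parent_height, child_height, count).
# Placeholder numbers only, NOT values taken from Galton's Table I.
cells = [
    (68.5, 67.2, 3),
    (68.5, 68.2, 2),
    (70.5, 69.2, 1),
]

def expand(cells):
    """Repeat each cell `count` times so there is one row per child."""
    rows = []
    for parent, child, count in cells:
        rows.extend([(parent, child)] * count)
    return rows

rows = expand(cells)
parents = [p for p, _ in rows]
children = [c for _, c in rows]

# Sample size, then mean and sample standard deviation for each group.
print(len(rows))
print(round(mean(parents), 3), round(stdev(parents), 3))
print(round(mean(children), 3), round(stdev(children), 3))
```

Applied to the full table, the number of rows produced should equal the total of all cell counts, which is a quick check that no cell was mistyped.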

  • Question 1.2. For the rest of part 1, assume that there are 928 parents in the sample rather than 205. Define “tall parents” and “short parents”. Then divide your sample into two corresponding groups.

(i) Are children of “tall parents” as tall as their parents? And similarly, are children of “short parents” as short as their parents? Report your results in a table.

(ii) Does the assumption of having 928 parents rather than 205 matter for this exercise?

  • Question 1.3. Galton was the first to describe and explain the phenomenon of “regression towards the mean”. Being concerned about the height of the English aristocracy, he interpreted his results as “regression to mediocrity” (hence the name “regression”).

(i) Regress the height of adult children against the height of parents. Report your results in a table and interpret the estimated coefficients.

(ii) What can you say about the relationship between the height of parents and their children? How does it relate to the findings in question 1.2.? You can answer these questions with a short paragraph and a graph.

  • Question 1.4. Now regress the height of parents against the height of adult children. Report your results in a table. Explain in a short paragraph whether this regression is equivalent to the one in question 1.3.
  • Question 1.5. Taking your regression results from question 1.3., and using your definition of “tall parents” and “short parents” from question 1.2:

(i) Calculate the predicted height of adult children whose parents are “tall” after 1, 2, 3, …, Z generations. Similarly, what is your prediction for the height of adult children whose parents are “short” after 1, 2, 3, …, Z generations? Report your results in a table. Is there convergence in heights? If so, how many generations does it take?

(ii) How do you interpret the results? Did Galton do something wrong in his regression? You can answer this question with a short paragraph.
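The generation-by-generation prediction in question 1.5 amounts to applying the fitted line repeatedly, feeding each predicted child height back in as the next parent height. The following Python sketch illustrates that iteration. The intercept, slope, and starting heights are placeholder assumptions chosen for illustration, not Galton's estimates; substitute your own results from question 1.3.

```python
def predict_generations(start_height, intercept, slope, generations):
    """Apply child = intercept + slope * parent repeatedly."""
    heights = []
    h = start_height
    for _ in range(generations):
        h = intercept + slope * h
        heights.append(h)
    return heights

# Placeholder coefficients (assumptions, not estimates from the data).
intercept, slope = 24.0, 0.65

# Height at which the prediction equals the input; exists when slope != 1.
fixed_point = intercept / (1 - slope)

tall = predict_generations(72.0, intercept, slope, 10)   # hypothetical "tall" start
short = predict_generations(64.0, intercept, slope, 10)  # hypothetical "short" start

for g, (t, s) in enumerate(zip(tall, short), start=1):
    print(g, round(t, 2), round(s, 2))
print("fixed point:", round(fixed_point, 2))
```

With a slope between 0 and 1, the gap to the fixed point shrinks by the factor `slope` each generation, so both sequences approach the same height. Whether that mechanical extrapolation is a sensible reading of the regression is exactly what part (ii) asks you to discuss.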

 

Assignment 4 Final

Question #1: [RStudio Users and Excel Users]

The FINAL EXAM dataset provides some information about hospitals in 2011 and 2012. Download the FINAL EXAM data and then complete the descriptive table.

Answer the following questions.

  • In terms of hospital characteristics, what are the significant differences between 2011 and 2012?
  • In terms of socio-economic variables what are the significant differences between 2011 and 2012?

(To report the “Per Capita Hospital Beds to Population”, you need to divide total_hospital_beds by tot_population.)

  • Based on your findings, in which years did hospitals have better performance? How is hospital performance related to hospital characteristics and socio-economic characteristics?

Write at least three main differences between 2011 and 2012.

Table 1. Descriptive statistics for hospitals in 2011 and 2012

Columns: 2011 (N, Mean, St. Dev), 2012 (N, Mean, St. Dev), p-value

Hospital Characteristics
    Hospital beds
    Number of paid Employees
    Number of non-paid Employees
    Interns and Residents
    System Membership
    Total hospital cost ($)
    Total hospital revenues ($)
    Hospital net benefit ($)
    Available Medicare days
    Available Medicaid days
    Total Hospital Discharge
    Medicare discharge
    Medicaid discharge

Socio-Economic Variables
    Per Capita Hospital Beds to Population
    Percent of population under poverty
    Percent of Female population under poverty
    Percent of Male population under poverty
    Median Household Income ($)

(Hospital net benefit = Hospital revenues – Hospital costs.) Round all mean and standard deviation figures to the nearest whole number; round p-values to 2 decimal places.

(Per Capita Hospital Beds to Population = Total_hospital_beds/ Tot_population)
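The p-value column compares each variable between the two years. The assignment does not say which test to use; the Python sketch below assumes a two-sample Welch t-test and shows how the derived per capita variable and the test statistic could be computed. All numbers are hypothetical, not taken from the FINAL EXAM dataset.

```python
from math import sqrt
from statistics import mean, variance

def welch_t(sample_a, sample_b):
    """Return Welch's t statistic and its approximate degrees of freedom."""
    na, nb = len(sample_a), len(sample_b)
    va = variance(sample_a) / na  # squared standard error of mean A
    vb = variance(sample_b) / nb  # squared standard error of mean B
    t = (mean(sample_a) - mean(sample_b)) / sqrt(va + vb)
    df = (va + vb) ** 2 / (va ** 2 / (na - 1) + vb ** 2 / (nb - 1))
    return t, df

# Hypothetical (total_hospital_beds, tot_population) pairs, one per hospital.
rows_2011 = [(120, 60000), (300, 150000), (80, 20000), (200, 50000)]
rows_2012 = [(130, 61000), (310, 152000), (90, 20500), (150, 52000)]

# Per Capita Hospital Beds to Population = beds / population.
beds_per_capita_2011 = [beds / pop for beds, pop in rows_2011]
beds_per_capita_2012 = [beds / pop for beds, pop in rows_2012]

t_stat, df = welch_t(beds_per_capita_2011, beds_per_capita_2012)
print(round(t_stat, 3), round(df, 1))
```

Turning the statistic into a p-value requires the t distribution with `df` degrees of freedom, which R and Excel provide through their built-in t-test functions.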

 

Show an ability to set up a research design where you analyze variables on an issue relating to inequalities with the help of R. The course description gives an indication of suitable topics; feel free to choose any topic for the data report as long as it relates to inequality.

Data analysis report using R to investigate a topic in politics of social inequality

The paper should be a 3000 word data analysis report in a format that resembles a PNAS data report.

The idea of this data analysis report is to very quickly introduce a question and theory and then spend the majority of the paper explaining how the data itself should be investigated using R.

It should be like a 6 page methodology part of a paper that explains the issue shortly and how to go about with the methodological part of the data analysis, but stopping where the method part ends.

The important thing about this paper is to show an ability to set up a research design where you analyze variables on an issue relating to inequalities with the help of R.

The data for the analysis should come from an open source database such as the ESS (European social survey).

 

 

Statistics Practice Exercise

Questions

  1.       Calculate the coefficient of correlation by the concurrent deviation method.

 

Supply Price
112 106
125 102
126 102
118 102
118 104
121 98
125 96
125 97
131 95
135 90
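As a rough illustration of the concurrent deviation method, the Python sketch below records the direction of change between consecutive values in each series, counts the pairs that move the same way (c), and applies r = ±sqrt(±(2c − n)/n), where n is the number of deviation pairs. How to treat unchanged values differs between textbooks; this sketch assumes a pair is concurrent only when both series strictly rise or both strictly fall.

```python
from math import sqrt

def sign(v):
    """Return 1, 0 or -1 according to the sign of v."""
    return (v > 0) - (v < 0)

def concurrent_deviation(x, y):
    """Coefficient of correlation by the concurrent deviation method."""
    dx = [sign(b - a) for a, b in zip(x, x[1:])]
    dy = [sign(b - a) for a, b in zip(y, y[1:])]
    n = len(dx)  # number of deviation pairs (one fewer than observations)
    # Assumption: ties (no change) never count as concurrent.
    c = sum(1 for a, b in zip(dx, dy) if a == b and a != 0)
    inner = (2 * c - n) / n
    r = sqrt(abs(inner))
    return r if inner >= 0 else -r

supply = [112, 125, 126, 118, 118, 121, 125, 125, 131, 135]
price = [106, 102, 102, 102, 104, 98, 96, 97, 95, 90]

print(concurrent_deviation(supply, price))
```

If your course materials handle ties differently, only the line that computes `c` needs to change.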

 

 

  2.       The table shows the number of motor registrations in a certain territory for a term of 5 years and the sale of motor tyres by a firm in that territory for the same period.

 

Year Motor Registrations No of Tyres sold
1 600 1,250
2 630 1,100
3 720 1,300
4 750 1,350
5 800 1,500

 

(i) Find the regression equation to estimate the sale of tyres when the motor registration is known.
(ii) Estimate the sale of tyres when registration is 850.
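A minimal Python sketch of the least-squares line of tyres sold on motor registrations, using the five observations from the table, is given here as one way to check a hand calculation. The function and variable names are illustrative.

```python
registrations = [600, 630, 720, 750, 800]
tyres_sold = [1250, 1100, 1300, 1350, 1500]

def fit_line(x, y):
    """Least-squares intercept and slope for the regression of y on x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

intercept, slope = fit_line(registrations, tyres_sold)
estimate_850 = intercept + slope * 850

print(round(intercept, 2), round(slope, 4))
print(round(estimate_850, 2))
```

Note that 850 lies outside the observed range of registrations (600 to 800), so the estimate is an extrapolation.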

 

Using graphs to interpret data

In this assignment, you will be required to use the Heart Rate Dataset to complete the following:

Use the classification of variables from the Unit 1 assignment to match each variable to one appropriate type of graph.

  • Create graphs of those variables.
  • Give a brief written description of how the values of each variable are distributed in the sample. What does the graph tell us about the data in the heart rate sample?

Steps

Open the Heart Rate Dataset in Excel

Using the classification of variables from Unit 1 assignment as qualitative, quantitative discrete, or quantitative continuous, match each of the 3 variables to the most appropriate graph type.

Use the graphing functions in Excel to create an appropriate graph of the data for each variable. Remember to properly label and title your graphs to clearly identify what the graph is about.

Review these videos as needed:

Excel 2016: Creating a Pie Chart

(Tom Kleen, 2017)

Estimated time to complete: 9 minutes

Making a Simple Bar Graph

 

CCJ4938: Basic Statistics for Criminal Justice Essay 1: Sampling Distributions

Your submission should be approximately 2-3 (minimum 2) pages long.

Write an essay that addresses each of the following questions. If you wish to include equations or statistical symbols, you can do so in Word using the Equation or Symbol menus on the Insert tab.

  1. What is sampling? What is a representative sample? What can researchers do to increase the representativeness of their samples?
  2. What is sampling error? How do we approximate it in our analyses — that is, what do we calculate, and what elements go into that calculation? What can researchers do to reduce sampling error?
  3. What two pieces of information do we usually report when we present a mean? What does each indicate or mean? How are they calculated — that is, what pieces of information go into them?
  4. What does alpha represent? Where (generally) does it come from? Do we want a large alpha or a small one? Why?
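The calculations these questions refer to can be made concrete with a small example. The Python sketch below computes a sample mean, its standard error (sample standard deviation divided by the square root of the sample size), and an approximate 95% confidence interval. The sample values are hypothetical, and the critical value 1.96 assumes a normal approximation with alpha = 0.05; for small samples a t critical value would usually be used instead.

```python
from math import sqrt
from statistics import mean, stdev

def standard_error(sample):
    """Estimated standard error of the mean: s / sqrt(n)."""
    return stdev(sample) / sqrt(len(sample))

# Hypothetical sample values for illustration only.
sample = [4, 7, 5, 6, 8, 6]

m = mean(sample)
se = standard_error(sample)

z = 1.96  # two-tailed normal critical value for alpha = 0.05 (assumption)
ci_low, ci_high = m - z * se, m + z * se

print(m, round(se, 3))
print(round(ci_low, 2), round(ci_high, 2))
```

Because the sample size sits in the denominator, larger samples give a smaller standard error and therefore a narrower interval.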

 

Exercise #4

If you have chosen to work with Excel, run the three linear regression models and complete the following tables using the dataset from week 1’s exercise.

Medicare and Medicaid Discharge Ratios: Medicare Discharges ÷ Total Hospital Discharges; Medicaid Discharges ÷ Total Hospital Discharges

Model 1:

Run a linear model to predict the impact of number of hospital beds (use bed-tot) on hospital net-benefit in teaching hospitals.

  • Hospital Characteristics Coef. ST. ERR T Stat P-values Lower 95% Upper 95%
  • Hospital beds
  • R Square

(Limit all results to 2 decimal places max)

Model 2:

Run a linear model to predict the impact of number of hospital beds (use bed-tot) on hospital net-benefit in non-teaching hospitals.

  • Hospital Characteristics Coef. ST. ERR T Stat P-values Lower 95% Upper 95%
  • Hospital beds
  • R Square

(Limit all results to 2 decimal places max)

Use the results from model 1 and model 2 and compare the results between teaching and non-teaching hospitals.
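For readers who want to cross-check the Excel output for Models 1 and 2, the Python sketch below fits the same one-predictor regression separately for each hospital group and reports the coefficient, standard error, t statistic, and R Square. The rows and the field names `teaching`, `bed_tot`, and `net_benefit` are hypothetical stand-ins for the week 1 dataset.

```python
from math import sqrt

def simple_ols(x, y):
    """One-predictor least squares: slope, its standard error, t, and R^2."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((a - mx) ** 2 for a in x)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    sse = sum((b - (intercept + slope * a)) ** 2 for a, b in zip(x, y))
    sst = sum((b - my) ** 2 for b in y)
    se_slope = sqrt(sse / (n - 2) / sxx)
    return {"coef": slope, "std_err": se_slope,
            "t_stat": slope / se_slope, "r_square": 1 - sse / sst}

# Hypothetical rows; net_benefit is in millions of dollars.
hospitals = [
    {"teaching": 1, "bed_tot": 200, "net_benefit": 1.0},
    {"teaching": 1, "bed_tot": 350, "net_benefit": 2.5},
    {"teaching": 1, "bed_tot": 500, "net_benefit": 2.0},
    {"teaching": 1, "bed_tot": 650, "net_benefit": 4.0},
    {"teaching": 0, "bed_tot": 50, "net_benefit": 0.2},
    {"teaching": 0, "bed_tot": 100, "net_benefit": 0.1},
    {"teaching": 0, "bed_tot": 150, "net_benefit": 0.5},
    {"teaching": 0, "bed_tot": 200, "net_benefit": 0.6},
]

results = {}
for flag, label in [(1, "teaching"), (0, "non-teaching")]:
    group = [h for h in hospitals if h["teaching"] == flag]
    results[label] = simple_ols([h["bed_tot"] for h in group],
                                [h["net_benefit"] for h in group])
    print(label, {k: round(v, 2) for k, v in results[label].items()})
```

Comparing the two `coef` values and their standard errors side by side is one way to frame the teaching versus non-teaching comparison; adding the discharge ratios requires a multiple regression, which this one-predictor sketch does not cover.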

Model 3:

Now, include the Medicare and Medicaid discharge ratios in the first model. How do you evaluate the impact of having a higher share of Medicare and Medicaid patients on hospital net-benefit in teaching hospitals?

  • Hospital Characteristics Coef. ST. ERR T Stat P-values Lower 95% Upper 95%
  • Hospital beds
  • Medicare-discharge-ratio
  • Medicaid-discharge-ratio
  • R Square

(Limit all results to 2 decimal places max)

Model 4:

Now, include the Medicare and Medicaid discharge ratios in the second model. How do you evaluate the impact of having a higher share of Medicare and Medicaid patients on hospital net-benefit in non-teaching hospitals?

  • Hospital Characteristics Coef. ST. ERR T Stat P-values Lower 95% Upper 95%
  • Hospital beds
  • Medicare-discharge-ratio
  • Medicaid-discharge-ratio
  • R Square

(Limit all results to 2 decimal places max)

Based on your findings, recommend 3 policies to improve hospital performance. Make sure to use the final model for your recommendations.

 

Why there is a need for quantitative research in the health care service field

Discuss why there is a need for quantitative research in the health care service field. Provide an example of a researchable question that will be investigated with quantitative research approach.

In your response, explain which research methods and designs can be applied to the chosen research question. What challenges and biases may impact the study results?

 

Coding : Qualitative Assignment

Focus on the process of coding survey data for qualitative inquiry.

Discuss our findings/ categorizations/ results during our synchronous class session.

For this assignment we will be working with data from the UALR COVID-19 assessment that occurred in May 2020. The questions included in the database are:

Q.2.1 What is your current role at UA Little Rock?

Text Column & Coded Column provided

Q6.4 What have you appreciated about UA Little Rock’s response to COVID-19?

Q6.5 What are your biggest worries or concerns as you think about what’s coming up in the next few months?

 

You should initially read through the responses for both questions (6.4 & 6.5) and come up with initial or emergent themes/ categories for the responses – essentially first impressions of the ways to best interpret or categorize the data/ information.

You should then take a second pass and code the responses.

You should then take another cycle of coding by seeing if the categories/ themes/ etc. can be combined, condensed or expanded into other more broad or narrow themes.

You should pick one of the questions (6.4 or 6.5) and code it both holistically for total responses and by respondent type. You can use the same coding to expedite the process, but note whether the findings differ when you separate responses by type of respondent.

Upon completion of creating your coding you should create a frequency table with the created response categorizations/ themes, frequency counts (n), and percentages calculated.
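Once each response has been assigned a code, the frequency table is a simple tally. The Python sketch below shows one way to produce counts (n) and percentages; the theme labels are hypothetical examples, not categories derived from the UALR data.

```python
from collections import Counter

def frequency_table(codes):
    """Return (theme, n, percent) rows, most frequent theme first."""
    counts = Counter(codes)
    total = sum(counts.values())
    return [(theme, n, round(100 * n / total, 1))
            for theme, n in counts.most_common()]

# Hypothetical coded responses, one code per response.
coded_responses = [
    "communication", "flexibility", "communication",
    "safety measures", "communication", "flexibility",
]

table = frequency_table(coded_responses)
for theme, n, pct in table:
    print(f"{theme:<20}{n:>4}{pct:>8}")
```

To compare respondent types, the same function can be run once per group after filtering the coded responses by the role recorded in Q2.1.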

 

Statistical analysis

Prepare a research page, a statistical analysis page with bar graphs, and another page with information regarding childcare desert areas in Texas and the need for an increase in infant capacity.

Also complete the budget sheet, the statistical analysis sheet, and a forecast sheet.