Assignment 1: Title and Introduction to the Research Question

This is the first assignment for the Data Analysis Capstone from the Data Analysis and Interpretation course offered by Wesleyan University. You can see all the previous content here.

In this assignment, we have to write a title and an introduction to the research question.

Title

The relation between macro and micro-level health importance and tuberculosis treatment success rate.

Research question

How does the importance that individuals, governments and companies give to health influence the success rate of tuberculosis treatment?

Hypothesis

If a country spends more on health, has good air quality and easy access to water and sanitation, then the tuberculosis treatment success rate will be higher and the incidence of new cases of the disease will be lower.

However, the government is not the only one that needs to care about health. If a country has a high number of smokers, the tuberculosis treatment success rate will be lower and the incidence of new cases of the disease will be higher.

Motivation/Rationale

Tuberculosis (TB) remains a major global health problem. In 2012, an estimated 1.3 million people died of tuberculosis, and there were an estimated 8.6 million new cases of TB worldwide [1]. The number of TB deaths is unacceptably large given that most are preventable [2]. The purpose of this project is to determine which healthcare measures are related to tuberculosis treatment success.

Potential Implications

Because tuberculosis is a dangerous disease that is largely preventable, it would be useful to identify measures that countries could take to reduce it.

Dataset and variables

For this research, I decided to use the QOG Standard Dataset 2016 [3]. This dataset consists of approximately 2500 variables from more than 100 data sources. Initially, I plan to use variables from four different databases:

  • Environmental Performance Data (EPI) [4];
  • International Monetary Fund (IMF) [5];
  • Worldbank - World Development Indicators (WDI) [6];
  • World Economic Forum (WEF) [7].

The response variable is the Tuberculosis treatment success rate (% of new cases). Several explanatory variables can be used; initially I included these:

  • Health expenditure per capita, PPP (constant 2011 international dollar)
  • Water and Sanitation: Access to Drinking Water and Access to Sanitation
  • Air Quality: Household Air Quality, Air Pollution - Average Exposure to PM2.5 and Air Pollution
  • Smoking prevalence, females (% of adults)
  • Smoking prevalence, males (% of adults)
  • Business impact of tuberculosis
  • Tuberculosis case detection rate (%, all forms)
  • Incidence of tuberculosis (per 100,000 people)
  • GDP (PPP) (share of world total) (%)
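
As a first look at how such explanatory variables might relate to the response variable, a Pearson correlation could be computed for each one. The sketch below is only an illustration under stated assumptions: the column names (health_exp, smoking_male, tb_success) are hypothetical and are not the actual QOG variable codes, and the values are made-up toy numbers, not data from the dataset.

```python
from math import sqrt

# Toy rows, one per (imaginary) country. Values are invented for illustration
# and the column names are hypothetical, not real QOG variable codes.
rows = [
    {"health_exp": 250.0, "smoking_male": 45.0, "tb_success": 70.0},
    {"health_exp": 600.0, "smoking_male": 40.0, "tb_success": 74.0},
    {"health_exp": 1200.0, "smoking_male": 33.0, "tb_success": 80.0},
    {"health_exp": 2500.0, "smoking_male": 28.0, "tb_success": 85.0},
    {"health_exp": 4000.0, "smoking_male": 20.0, "tb_success": 88.0},
]


def pearson(xs, ys):
    """Return the Pearson correlation coefficient between two equal-length lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def correlations(data, response, explanatory):
    """Correlate each explanatory column with the response column."""
    ys = [row[response] for row in data]
    return {
        name: pearson([row[name] for row in data], ys)
        for name in explanatory
    }


result = correlations(rows, "tb_success", ["health_exp", "smoking_male"])
for name, r in result.items():
    print(f"{name}: r = {r:+.2f}")
```

With the real dataset, rows with missing values would need to be dropped first, and a correlation alone would not show that one variable causes the other.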

References

[1] Tuberculosis: Causes, Symptoms and Treatments

[2] Global Tuberculosis Report 2013

[3] QOG Standard Dataset 2016

[4] Environmental Performance Data

[5] International Monetary Fund

[6] Worldbank - World Development Indicators

[7] World Economic Forum