Cardiovascular-Diseases-Analysis

How to view the HTML file

In order to view the HTML file in your browser, please visit: https://htmlpreview.github.io/ and paste the link: https://github.com/i-a-m-s-k/Cardiovascular-Diseases-Analysis/blob/main/SI544_Final_Project.html

About the Dataset

The National Health Interview Survey (NHIS) has monitored the health of the nation since 1957. This is one of the data sets provided by the National Cardiovascular Disease (CVD) Surveillance System. This data set aims to provide a comprehensive picture of the public health burden of CVDs and associated risk factors in the United States. The data are organized by location (region) and indicator, and they include CVDs (e.g., heart failure) and risk factors (e.g., hypertension).

Reference: Data Provided by Centers for Disease Control and Prevention, National Center for Chronic Disease Prevention and Health Promotion, Division of Health Disease and Stroke Prevention (DHDSP), National Cardiovascular Disease Surveillance System. https://chronicdata.cdc.gov/Heart-Disease-Stroke-Prevention/Behavioral-Risk-Factor-Surveillance-System-BRFSS-N/ikwk-8git Accessed on: 1st December 2021

Introduction

The National Health Interview Survey (NHIS) has monitored the health of the nation since 1957. This is one of the data sets provided by the National Cardiovascular Disease (CVD) Surveillance System. The purpose of this project is to explore and analyze the Behavioral Risk Factor Surveillance System (BRFSS) data set provided by the Centers for Disease Control and Prevention (CDC) to gain insights into the public health burden of CVDs and associated risk factors in the United States.

Purpose

The main purpose of this project is to analyze the BRFSS data set and explore the relationships between various factors related to cardiovascular diseases and risk factors. The project aims to identify trends, patterns, and associations in the data to help understand the public health burden of CVDs and associated risk factors in the United States.

Analysis

The analysis begins by importing the BRFSS data set into R and cleaning the data by removing rows with missing values. The cleaned data set is then merged with another data set containing category scores in binary format, where 1 denotes "cardiovascular diseases" and 0 denotes "risk factors".

The analysis includes exploratory data analysis, which involves visualizing the data using plots and summarizing the data using descriptive statistics. The plots include bar plots, scatter plots, and box plots, which are used to identify relationships between various factors related to cardiovascular diseases and risk factors.

The analysis also includes inferential statistics, which involves hypothesis testing and confidence interval estimation. Hypothesis testing is used to test whether there is a significant difference between two groups, while confidence interval estimation is used to estimate the population parameter of interest.

Overall, the analysis provides insights into the public health burden of CVDs and associated risk factors in the United States and can be used to inform public health policies and interventions aimed at reducing the burden of CVDs and associated risk factors.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
SI544_Final_Project.Rmd		SI544_Final_Project.Rmd
SI544_Final_Project.html		SI544_Final_Project.html

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cardiovascular-Diseases-Analysis

How to view the HTML file

About the Dataset

Introduction

Purpose

Analysis

About

Releases

Packages

Languages

i-a-m-s-k/Cardiovascular-Diseases-Analysis

Folders and files

Latest commit

History

Repository files navigation

Cardiovascular-Diseases-Analysis

How to view the HTML file

About the Dataset

Introduction

Purpose

Analysis

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages