How do we determine if there is a correlation between the X variables and the y variables?

A correlation or simple linear regression analysis can determine if two numeric variables are significantly linearly related. A correlation analysis provides information on the strength and direction of the linear relationship between two variables, while a simple linear regression analysis estimates parameters in a linear equation that can be used to predict values of one variable based on the other.

Correlation

The Pearson correlation coefficient, r, can take on values between -1 and 1. The further away r is from zero, the stronger the linear relationship between the two variables. The sign of r corresponds to the direction of the relationship. If r is positive, then as one variable increases, the other tends to increase. If r is negative, then as one variable increases, the other tends to decrease. A perfect linear relationship [r=-1 or r=1] means that one of the variables can be perfectly explained by a linear function of the other.

Examples:

Linear Regression

A linear regression analysis produces estimates for the slope and intercept of the linear equation predicting an outcome variable, Y, based on values of a predictor variable, X. A general form of this equation is shown below:

The intercept, b0, is the predicted value of Y when X=0. The slope, b1, is the average change in Y for every one unit increase in X. Beyond giving you the strength and direction of the linear relationship between X and Y, the slope estimate allows an interpretation for how Y changes when X increases. This equation can also be used to predict values of Y for a value of X.

Examples:

Inference

Inferential tests can be run on both the correlation and slope estimates calculated from a random sample from a population. Both analyses are t-tests run on the null hypothesis that the two variables are not linearly related. If run on the same data, a correlation test and slope test provide the same test statistic and p-value.

Assumptions:

Random samples
Independent observations
The predictor variable and outcome variable are linearly related [assessed by visually checking a scatterplot].
The population of values for the outcome are normally distributed for each value of the predictor [assessed by confirming the normality of the residuals].
The variance of the distribution of the outcome is the same for all values of the predictor [assessed by visually checking a residual plot for a funneling pattern].

Hypotheses:

Ho: The two variables are not linearly related.
Ha: The two variables are linearly related.

Relevant Equations:

Degrees of freedom: df = n-2

Example 1: Hand calculation

These videos investigate the linear relationship between people’s heights and arm span measurements.

Correlation:

Regression:

Sample conclusion: Investigating the relationship between armspan and height, we find a large positive correlation [r=.95], indicating a strong positive linear relationship between the two variables. We calculated the equation for the line of best fit as Armspan=-1.27+1.01[Height]. This indicates that for a person who is zero inches tall, their predicted armspan would be -1.27 inches. This is not a possible value as the range of our data will fall much higher. For every 1 inch increase in height, armspan is predicted to increase by 1.01 inches.

Example 2: Performing analysis in Excel 2016 on
Some of this analysis requires you to have the add-in Data Analysis ToolPak in Excel enabled.

Dataset used in videos

Correlation matrix and p-value:
PDF directions corresponding to video

Creating scatterplots:
PDF directions corresponding to video

Linear model [first half of tutorial]:
PDF directions corresponding to video

Creating residual plots:
PDF directions corresponding to video

Sample conclusion: In evaluating the relationship between how happy someone is and how funny others rated them, the scatterplot indicates that there appears to be a moderately strong positive linear relationship between the two variables, which is supported by the correlation coefficient [r = .65]. A check of the assumptions using the residual plot did not indicate any problems with the data. The linear equation for predicting happy from funny was Happy=.04+0.46[Funny]. The y-intercept indicates that for a person whose funny rating was zero, their happiness is predicted to be .04. Funny rating does significantly predict happiness such that for every 1 point increase in funny rating the males are predicted to increase by .46 in happiness [t = 3.70, p = .002].

Example 3: Performing analysis in R

The following videos investigate the relationship between BMI and blood pressure for a sample of medical patients.

Dataset used in videos

Correlation:
R script file used in video

Regression:
R script file used in video

How do you determine if there is a correlation between variables?

Complete correlation between two variables is expressed by either + 1 or -1. When one variable increases as the other increases the correlation is positive; when one decreases as the other increases it is negative. Complete absence of correlation is represented by 0.

How do you find the correlation of X and Y?

Here are the steps to take in calculating the correlation coefficient:.

Determine your data sets. ... .

Calculate the standardized value for your x variables. ... .

Calculate the standardized value for your y variables. ... .

Multiply and find the sum. ... .

Divide the sum and determine the correlation coefficient..

How do we determine if there is a correlation between the X variables and the y variables?

How do you determine if there is a correlation between variables?

How do you find the correlation of X and Y?

Bài Viết Liên Quan

What type of direction if the independent variable is decreasing while the dependent is increasing?

Toplist mới

Top 6 bản thân em phải làm gì để phòng chống hiv/aids 2023

Top 7 số 20 gồm 2 và 0 đúng hay sai 2023

Top 7 địa lý lớp 6 bài 19: lớp đất và các nhân tố hình thành đất một số nhóm đất điển hình trang 178 2023

Top 7 h thực hiện chủ trương đường lối chính sách của đảng pháp luật của nhà nước 2023

Top 6 on tập tiếng việt lớp 7 học kĩ 2 violet 2023

Top 7 trung du và miền núi bắc bộ có thế mạnh nổi bật về luyện kim đen 2023

Top 8 mơ thấy có người to tình với mình 2023

Top 6 nước yến cao cấp yến sào thiên hoàng 2023

Top 7 ví dụ về quyết định hành chính nhà nước 2023

Bài mới nhất

Hồng yến khác yến trắng như thế nào năm 2024

Phương pháp học tập tích cực là gì năm 2024

Thị trường thanh toán trực tuyến việt nam năm 2024

Once in a while là gì năm 2024

U nang tử cung là gì năm 2024

Cây nụ tầm xuân là cây gì năm 2024

Tổ chức cũng cấp tín dụng tiếng anh là gì năm 2024

Nhân viên xuất nhập khẩu tiếng nhật là gì năm 2024

Đường huyết lúc đói là gì năm 2024

Lỗi server error 550 5.7.1 unable to relay năm 2024

Chủ Đề