Which of the following values for correlation would be correct for the following scatterplot?

We have a new and improved read on this topic. Click here to view

We have moved all content for this concept to for better organization. Please update your bookmarks accordingly.

To better organize out content, we have unpublished this concept. This page will be removed in future.

Plot points and estimate the line that best represents them

Practice

  • Preview
  • Assign Practice

Preview

Scatter Plots and Linear Correlation

Loading... 

Found a content error?
Tell us

Notes/Highlights

ColorHighlighted TextNotes
Show More

Image Attributions

ShowHide Details

Video Transcript

What is the most likely value of the product-moment correlation coefficient for the data shown in the diagram? (A) Zero, (B) negative 0.94, (C) negative 0.58, (D) 0.37, (E) 0.78.

Looking at our graph, we see that it consists of data where each data point has an 𝑥- as well as a 𝑦-value. Data sets consisting of two variables are called bivariate, and such sets can be described quantitatively by what’s called a product-moment correlation coefficient. Another name for this is the Pearson correlation coefficient. And the whole idea is to use a single number, this coefficient, to describe how well one of the variables in the data set correlates with the other. The correlation coefficient can take on values anywhere between negative one and one.

And actually, in both of these extreme cases, that coefficient value describes perfect correlation. A coefficient value of negative one would describe a downward-trending data set that perfectly follows the line of best fit. That is, all the points in the data set lie along the same line. A correlation coefficient of positive one means the same thing, but for a data set that follows a positively sloping best fit line. In between these values, there’s a correlation coefficient of zero suggesting that there is no correlation between the two variables and then all the possible values in between these values named so far.

Looking at the set of data in our diagram, if we were to draw a best fit line for this set of data, we might draw it in by hand like this. Clearly, there is an inverse or a negative correlation between the values of 𝑥 and the values of 𝑦; that is, as 𝑥 gets larger, 𝑦 gets smaller. This tells us that the correlation coefficient for this set of data lies somewhere below zero. If the line of best fit had a positive slope to it, the opposite would be true. But here we see there is indeed a negative or inverse correlation between 𝑥 and 𝑦.

Looking then at our five answer options, we can see that what we’ve learned so far eliminates several of them. Any positive correlation coefficients are out of consideration. That means options (D) and (E) can’t be our final choice. And we also know that option (A), which suggests that there is no correlation between the 𝑥- and 𝑦-variables in our data set, isn’t a valid answer either. This leaves us with answer choices (B) and (C). These are both negative values, and we see that one is closer to the extreme value of negative one than the other. As we consider this range of Pearson correlation coefficients, the difference between them comes down to how tightly clustered about the best fit line the data points in a data set are.

For example, if we looked at data sets represented by correlation coefficients of negative 0.9 and negative 0.2, then, respectively, they might look like this. The data points in the data set represented by a correlation coefficient of negative 0.9 are much more tightly clustered about the line of best fit compared with those in the other data set. As we look at the data shown in our given diagram, we can see that it’s neither extremely far away from the best fit line nor extremely close to it. This suggests that the correlation coefficient is not particularly close to negative one, neither is it particularly close to zero. For this reason, of our two answer options (B) and (C), we’ll choose the one that is closer to a Pearson correlation coefficient of negative 0.5. That’s option (C).

So, of these answer choices, we would say that negative 0.58 is the most likely value of the product-moment correlation coefficient for the given data set.

What does a correlation of 0 look like on a scatter plot?

When all the points on a scatterplot lie on a straight line, you have what is called a perfect correlation between the two variables (see below). A scatterplot in which the points do not have a linear trend (either positive or negative) is called a zero correlation or a near-zero correlation (see below).

What is the R value on a scatter graph?

What is r? Put simply, it is Pearson's correlation coefficient (r). Or in other words: R is a correlation coefficient that measures the strength of the relationship between two variables, as well as the direction on a scatterplot. The value of r is always between a negative one and a positive one (-1 and a +1).

What does a correlation value of 1 represent in a scatter diagram?

Properties of the Linear Correlation Coefficient If r = +1, there is a perfect positive linear relation between the two variables. If r = -1, there is a perfect negative linear relation between the two variables. The closer r is to +1, the stronger is the evidence of positive association between the two variables.

Bài Viết Liên Quan

Chủ đề