Examples of regression line in the following topics:
-
- Identify errors of prediction in a scatter plot with a regression line
- The best-fitting line is called a regression line.
- The vertical lines from the points to the regression line represent the errors of prediction.
- The sum of the squared errors of prediction shown in Table 2 is lower than it would be for any other regression line. The formula for a regression line is
- This makes the regression line:
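A minimal sketch of these ideas in Python, using numpy and made-up data (the values are purely illustrative): it fits the least-squares line, computes the errors of prediction as vertical distances to the line, and sums their squares.

```python
import numpy as np

# Hypothetical data: five (x, y) pairs, for illustration only.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 2.9, 4.2, 4.8, 6.1])

# Least-squares fit of a straight line: predicted y = slope * x + intercept.
slope, intercept = np.polyfit(x, y, deg=1)

predicted = slope * x + intercept   # points on the regression line
errors = y - predicted              # errors of prediction (vertical distances)
sse = np.sum(errors ** 2)           # sum of squared errors; the fitted line minimizes this

print(f"slope = {slope:.3f}, intercept = {intercept:.3f}, SSE = {sse:.4f}")
```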
-
- A graph of averages and the least-squares regression line are both good ways to summarize the data in a scatterplot.
- The regression line drawn through the points describes how the dependent variable $y$ changes with the independent variable $x$.
- A good line of regression makes the distances from the points to the line as small as possible.
- The points on a graph of averages do not usually line up in a straight line, making it different from the least-squares regression line.
- The graph of averages plots a typical $y$ value in each interval: some of the points fall above the least-squares regression line, and some of the points fall below that line.
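A rough sketch of the contrast between the two summaries, with simulated data and arbitrarily chosen unit-wide x intervals (both are assumptions for illustration): the averages scatter around the least-squares line rather than falling exactly on it.

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.uniform(0, 10, 200)
y = 2.0 * x + 1.0 + rng.normal(scale=2.0, size=x.size)   # simulated noisy linear data

# Least-squares regression line.
slope, intercept = np.polyfit(x, y, 1)

# Graph of averages: the mean y within each unit-wide x interval.
edges = np.arange(0, 11)
for lo, hi in zip(edges[:-1], edges[1:]):
    avg_y = y[(x >= lo) & (x < hi)].mean()
    center = (lo + hi) / 2
    print(f"x in [{lo},{hi}): average y = {avg_y:6.2f}, line predicts {slope * center + intercept:6.2f}")
```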
-
- $r^2$, when expressed as a percent, represents the percent of variation in the dependent variable $y$ that can be explained by variation in the independent variable $x$ using the regression (best fit) line.
- $1 - r^2$, when expressed as a percent, represents the percent of variation in $y$ that is NOT explained by variation in $x$ using the regression line.
- This can be seen as the scattering of the observed data points about the regression line.
- Approximately 44% of the variation (0.4397 is approximately 0.44) in the final exam grades can be explained by the variation in the grades on the third exam, using the best fit regression line.
- Therefore approximately 56% of the variation (1 - 0.44 = 0.56) in the final exam grades can NOT be explained by the variation in the grades on the third exam, using the best fit regression line.
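A small sketch of the arithmetic behind those percentages, assuming for illustration a correlation of about 0.663 between the third-exam and final-exam grades (a value consistent with the $r^2 \approx 0.44$ quoted above):

```python
r = 0.663                     # assumed correlation between third-exam and final-exam grades
r_squared = r ** 2            # about 0.44: share of variation in y explained by x via the line
unexplained = 1 - r_squared   # about 0.56: share of variation in y NOT explained by x

print(f"explained:   {r_squared:.0%}")
print(f"unexplained: {unexplained:.0%}")
```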
-
- In the regression line equation the constant $m$ is the slope of the line and $b$ is the $y$-intercept.
- A simple example is the equation for the regression line, which follows: $y = mx + b$.
- The constant $m$ is the slope of the line and $b$ is the $y$-intercept -- the value where the line crosses the $y$-axis.
- The case of one explanatory variable is called simple linear regression.
- For more than one explanatory variable, it is called multiple linear regression.
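A tiny sketch of what the two constants do, with arbitrary illustrative values for the slope and intercept:

```python
def regression_line(x, m=2.0, b=1.0):
    """Evaluate y = m*x + b for slope m and y-intercept b (illustrative values)."""
    return m * x + b

print(regression_line(0))   # 1.0: the y-intercept, where the line crosses the y-axis
print(regression_line(1))   # 3.0: each unit increase in x raises y by the slope, 2.0
```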
-
- The criterion for determining the least squares regression line is that the sum of the squared errors (SSE) is made as small as possible.
- The process of fitting the best-fit line is called linear regression.
- Any other potential line would have a higher SSE than the best fit line.
- Therefore, this best fit line is called the least squares regression line.
- Ordinary Least Squares (OLS) regression (or simply "regression") is a useful tool for examining the relationship between two or more interval/ratio variables, assuming there is a linear relationship between those variables.
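A sketch of the least-squares criterion on hypothetical data: the SSE of the fitted line is compared with the SSE of a slightly perturbed line, which always comes out larger.

```python
import numpy as np

def sse(x, y, slope, intercept):
    """Sum of squared errors of prediction for a candidate line."""
    return np.sum((y - (slope * x + intercept)) ** 2)

# Hypothetical data.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([1.8, 3.1, 4.1, 5.2, 5.9])

slope, intercept = np.polyfit(x, y, 1)      # ordinary least squares estimates

best = sse(x, y, slope, intercept)
worse = sse(x, y, slope + 0.1, intercept)   # any other line has a higher SSE
print(f"SSE of least squares line: {best:.4f}")
print(f"SSE of a perturbed line:   {worse:.4f}")
```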
-
- When working with multiple regression models, a number of assumptions must be made.
- These assumptions are similar to those of standard linear regression models.
- Under heteroscedasticity, error is not evenly distributed across the regression line.
- In effect, residuals appear tightly clustered for some predicted values and widely spread apart for others along the linear regression line, and the mean squared error for the model will be incorrect.
- Most experts recommend at least 10 to 20 times as many observations (cases, respondents) as independent variables; otherwise, the estimates of the regression line are probably unstable and unlikely to replicate if the study is repeated.
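As a sketch of two of these checks on simulated data (the data, the number of predictors, and the 10-to-20 rule of thumb are all taken as given for illustration), the snippet below reports the observation-to-predictor ratio and compares residual spread in the lower and upper halves of the predicted values; roughly equal spread is what an even distribution of error looks like.

```python
import numpy as np

rng = np.random.default_rng(1)
n, k = 100, 3                               # hypothetical: 100 observations, 3 predictors
X = rng.normal(size=(n, k))
y = X @ np.array([1.5, -0.7, 2.0]) + rng.normal(scale=1.0, size=n)

# Rule of thumb quoted above: at least 10 to 20 observations per independent variable.
print(f"observations per predictor: {n / k:.1f}")

# Fit by ordinary least squares and inspect the spread of residuals across predictions.
X1 = np.column_stack([np.ones(n), X])
coef, *_ = np.linalg.lstsq(X1, y, rcond=None)
predicted = X1 @ coef
residuals = y - predicted

lower = residuals[predicted <= np.median(predicted)]
upper = residuals[predicted > np.median(predicted)]
print(f"residual SD, lower half of predictions: {lower.std():.2f}")
print(f"residual SD, upper half of predictions: {upper.std():.2f}")
```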
-
- In this section we discuss uncertainty in the estimates of the slope and y-intercept for a regression line.
- However, in the case of regression, we will identify standard errors using statistical software.
- This video introduces consideration of the uncertainty associated with the parameter estimates in linear regression.
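One way to see those standard errors in software is SciPy's linregress, sketched below on hypothetical data (the intercept's standard error is exposed only in recent SciPy releases, roughly 1.6 and later):

```python
import numpy as np
from scipy import stats

# Hypothetical data.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0, 6.0])
y = np.array([2.2, 2.8, 4.1, 4.9, 6.2, 6.8])

result = stats.linregress(x, y)
print(f"slope     = {result.slope:.3f} +/- {result.stderr:.3f} (standard error)")
# intercept_stderr is available on the result object in recent SciPy versions.
print(f"intercept = {result.intercept:.3f} +/- {result.intercept_stderr:.3f}")
```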
-
- In statistics, linear regression can be used to fit a predictive model to an observed data set of $y$ and $x$ values.
- In statistics, simple linear regression is the least squares estimator of a linear regression model with a single explanatory variable.
- Simple linear regression fits a straight line through the set of $n$ points in such a way that makes the sum of squared residuals of the model (that is, vertical distances between the points of the data set and the fitted line) as small as possible.
- Linear regression was the first type of regression analysis to be studied rigorously, and to be used extensively in practical applications.
- Three lines: the red and blue lines have the same slope, while the red and green ones have the same y-intercept.
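The least-squares slope and intercept for a single explanatory variable have a closed form; a minimal sketch on hypothetical data:

```python
import numpy as np

# Hypothetical data.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([1.9, 3.2, 3.8, 5.1, 6.0])

# Closed-form least-squares estimates:
#   slope     = sum((x - mean(x)) * (y - mean(y))) / sum((x - mean(x)) ** 2)
#   intercept = mean(y) - slope * mean(x)
xm, ym = x.mean(), y.mean()
slope = np.sum((x - xm) * (y - ym)) / np.sum((x - xm) ** 2)
intercept = ym - slope * xm

residuals = y - (slope * x + intercept)   # vertical distances from the points to the line
print(slope, intercept, np.sum(residuals ** 2))
```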
-
- Regression analysis is widely used for prediction and forecasting.
- Here are the required conditions for the regression model:
- The mean response $\mu_y$ has a straight-line (i.e., "linear") relationship with $x$: $\mu_y = \alpha + \beta x$; the slope $\beta$ and intercept $\alpha$ are unknown parameters.
- Each of these four data sets has the same linear regression line and therefore the same correlation, 0.816.
- Looking at panels 2, 3, and 4, you can see that a straight line is probably not the best way to represent these three data sets.
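Those four data sets are the classic Anscombe's quartet; a sketch of the point, assuming seaborn is available along with its "anscombe" example dataset (fetched by load_dataset, with columns dataset, x, y):

```python
import seaborn as sns
from scipy import stats

df = sns.load_dataset("anscombe")   # seaborn's copy of Anscombe's quartet

for name, group in df.groupby("dataset"):
    fit = stats.linregress(group["x"], group["y"])
    # Each data set yields (nearly) the same line and r of about 0.816,
    # even though only the first is well described by a straight line.
    print(f"{name}: slope={fit.slope:.2f}, intercept={fit.intercept:.2f}, r={fit.rvalue:.3f}")
```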
-
- You use multiple regression when you have three or more measurement variables.
- When the purpose of multiple regression is prediction, the important result is an equation containing partial regression coefficients (slopes).
- When the purpose of multiple regression is understanding functional relationships, the important result is an equation containing standard partial regression coefficients, like this:
- Where $b'_1$ is the standard partial regression coefficient of $y$ on $X_1$.
- A graphical representation of a best fit line for simple linear regression.
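A sketch of both kinds of coefficients on simulated data (the variables and values are made up): ordinary partial regression coefficients come from an OLS fit of $y$ on $X_1$ and $X_2$, and the standard partial regression coefficients, such as $b'_1$ above, come from the same fit after z-scoring every variable.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 50
X = rng.normal(size=(n, 2))                 # two hypothetical measurement variables X1, X2
y = 3.0 * X[:, 0] - 1.0 * X[:, 1] + rng.normal(scale=0.5, size=n)

def ols(X, y):
    """Partial regression coefficients (intercept first) by ordinary least squares."""
    X1 = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(X1, y, rcond=None)
    return coef

b = ols(X, y)                               # partial regression coefficients (for prediction)

# Standard partial regression coefficients: z-score y and every X column, then refit.
Xz = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
yz = (y - y.mean()) / y.std(ddof=1)
b_std = ols(Xz, yz)                         # b_std[1] plays the role of b'_1

print("partial coefficients:         ", np.round(b, 3))
print("standard partial coefficients:", np.round(b_std, 3))
```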