We are independent & ad-supported. We may earn a commission for purchases made through our links.

Advertiser Disclosure

Our website is an independent, advertising-supported platform. We provide our content free of charge to our readers, and to keep it that way, we rely on revenue generated through advertisements and affiliate partnerships. This means that when you click on certain links on our site and make a purchase, we may earn a commission. Learn more.

How We Make Money

We sustain our operations through affiliate commissions and advertising. If you click on an affiliate link and make a purchase, we may receive a commission from the merchant at no additional cost to you. We also display advertisements on our website, which help generate revenue to support our work and keep our content free for readers. Our editorial team operates independently from our advertising and affiliate partnerships to ensure that our content remains unbiased and focused on providing you with the best information and recommendations based on thorough research and honest evaluations. To remain transparent, we’ve provided a list of our current affiliate partners here.

What is Simple Linear Regression?

Tricia Christensen
By
Updated May 17, 2024
Our promise to you
WiseGeek is dedicated to creating trustworthy, high-quality content that always prioritizes transparency, integrity, and inclusivity above all else. Our ensure that our content creation and review process includes rigorous fact-checking, evidence-based, and continual updates to ensure accuracy and reliability.

Our Promise to you

Founded in 2002, our company has been a trusted resource for readers seeking informative and engaging content. Our dedication to quality remains unwavering—and will never change. We follow a strict editorial policy, ensuring that our content is authored by highly qualified professionals and edited by subject matter experts. This guarantees that everything we publish is objective, accurate, and trustworthy.

Over the years, we've refined our approach to cover a wide range of topics, providing readers with reliable and practical advice to enhance their knowledge and skills. That's why millions of readers turn to us each year. Join us in celebrating the joy of learning, guided by standards you can trust.

Editorial Standards

At WiseGeek, we are committed to creating content that you can trust. Our editorial process is designed to ensure that every piece of content we publish is accurate, reliable, and informative.

Our team of experienced writers and editors follows a strict set of guidelines to ensure the highest quality content. We conduct thorough research, fact-check all information, and rely on credible sources to back up our claims. Our content is reviewed by subject matter experts to ensure accuracy and clarity.

We believe in transparency and maintain editorial independence from our advertisers. Our team does not receive direct compensation from advertisers, allowing us to create unbiased content that prioritizes your interests.

Simple linear regression applies to statistics and helps to describe (x,y) data that appears to have a linear relationship, allowing for some prediction of y if x is known. This data is often plotted on scatterplots and the formula for linear regression creates a line that best fits all the points, provided they truly have a linear correlation. It won’t fit exactly all the points, but it should be a line where the sum of the squares of the difference between actual data and expected data (residuals) creates the lowest number, which is often called the least squares line or line of best fit. The equation of the line for sample data and population data are the following: ŷ = b0 + b1x and Y = B0 + B1x.

Anyone familiar with algebra may note the similarity of this line to y = mx + b, and in fact the two are relatively identical, except the two terms on the right side of the equation are switched, so that B1 equals slope or m. The reason for this rearrangement is it then becomes elegantly easy to add additional terms with features such as exponents that might describe different nonlinear forms of relationship.

The formulas for getting a simple linear regression line are relatively complex and cumbersome, and most people do not spend much time writing these down because they take a long time to complete. Instead, various programs, such as for Excel® or for many types of scientific calculators, can easily compute a least squares line. The line is only appropriate for prediction if there is clear evidence of a strong correlation between the sets of (x,y) data. A calculator will generate a line, regardless of whether it makes any sense to use it.

At the same time a simple linear regression line equation is generated, people must look at level of correlation. This means evaluating r, the correlation coefficient, against a table of values to determine if linear correlation exists. Additionally, evaluating the data by plotting it as a scatterplot is a good way of getting a sense if data has a linear relationship.

What can then be done with a simple linear regression line, provided it has a linear correlation, is that values can be substituted into x, to get a predicted value for ŷ. This prediction has its limits. The data present, particularly if it’s just a sample, may have a linear correlation now, but might not later with additional sample material added.

Alternately, a whole sample can share a correlation while a whole population does not. Prediction is therefore limited, and going far beyond the available data values is called extrapolation, and is not encouraged. Moreover, should people know that if no linear correlation exists, the best estimate of x is the mean of all y data.

Essentially, simple linear regression is a useful statistical tool that can, with discretion, be used to predict ŷ values based on a x value. It is almost always taught with the idea of linear correlation since determining usefulness of a regression line requires analysis of r. Fortunately with many modern technical programs, people can graph scatterplots, add regression lines and determine correlation coefficient r with a couple of entries.

WiseGeek is dedicated to providing accurate and trustworthy information. We carefully select reputable sources and employ a rigorous fact-checking process to maintain the highest standards. To learn more about our commitment to accuracy, read our editorial process.
Tricia Christensen
By Tricia Christensen , Writer
With a Literature degree from Sonoma State University and years of experience as a WiseGeek contributor, Tricia Christensen is based in Northern California and brings a wealth of knowledge and passion to her writing. Her wide-ranging interests include reading, writing, medicine, art, film, history, politics, ethics, and religion, all of which she incorporates into her informative articles. Tricia is currently working on her first novel.

Discussion Comments

Tricia Christensen

Tricia Christensen

Writer

With a Literature degree from Sonoma State University and years of experience as a WiseGeek contributor, Tricia...
Learn more
WiseGeek, in your inbox

Our latest articles, guides, and more, delivered daily.

WiseGeek, in your inbox

Our latest articles, guides, and more, delivered daily.