Admin's other articles

4349 The World without Bankruptcy Laws

Bankruptcy is one of the natural states which a company may find itself in. Entrepreneurship is primarily about taking risks. When companies take risks, some of them succeed, whereas others fail. Hence failure is a natural part of the business. However, many critics of bankruptcy laws believe that there isn’t a need for an elaborate […]

4348 The Wirecard and Infosys Scandals are a Lesson on How NOT to Treat Whistleblowers

What is the Wirecard Scandal all about and Why it is a Wakeup Call for Whistleblowers Anyone who has been following financial and business news over the last couple of years would have heard about Wirecard, the embattled German payments firm that had to file for bankruptcy after serious and humungous frauds were uncovered leading […]

4347 Why the Digital Age Demands Decision Makers to be Like Elite Marines and Zen Monks

How Modern Decision Makers Have to Confront Present Shock and Information Overload We live in times when Information Overload is getting the better of cognitive abilities to absorb and process the needed data and information to make informed decisions. In addition, the Digital Age has also engendered the Present Shock of Virality and Instant Gratification […]

4346 Why Indian Firms Must Strive for Strategic Autonomy in Their Geoeconomic Strategies

Geopolitics, Economics, and Geoeconomics In the evolving global trading and economic system, firms and corporates are impacted as much by the economic policies of nations as they are by the geopolitical and foreign policies. In other words, any global firm wishing to do business in the international sphere has to be cognizant of both the […]

4345 Why Government Should Not Invest Public Money in Sports Stadiums Used by Professional Franchises

In the previous article, we have already come across some of the reasons why the government should not encourage funding of stadiums that are to be used by private franchises. We have already seen that the entire mechanism of government funding ends up being a regressive tax on the citizens of a particular city who […]

See More Article from Admin

It is a long established fact that a reader will be distracted by the readable content of a page when looking at its layout.

Visit Us

Our Partners

Search with tags

  • No tags available.

Regression analysis can be used to find out the relation between a set of variables statistically. This is done by identifying a curve or line that best fits the variables provided. Regression analysis is widely used in marketing research for trend analysis and for making predictions. In this article, we will be explaining simple linear regression only.

Case Based Explanation

Since it is inevitable to use numbers and perform some calculations to bring out the concept of regression, we will be demonstrating a case throughout the article to explain the statistical part in an easy way.

Suppose that after a few years of working in the industry, a person decides to go back to the university to get additional skills. Since education these days is expensive, the person would want to know whether education really increases the salary.

To start, we need to see how much wage is expected to increase with every additional year spent at the university. The intuitive way to go about it is to survey a sample of individuals and ask each of them how much they earn and how many years they have spent at school and then determine whether we can observe a pattern in their responses.

For the sake of simplicity and explanation, let’s say that we survey 10 individuals (In reality a much larger sample size is required to get reliable results). A random sample of 10 people will generate 10 data points. A scatter graph in excel is the best way to represent this.

Education is the independent variable depicted on the X axis and Wage is the dependent variable, to be plotted on the Y axis. The general pattern in the data set can be determined, i.e. relationship between wages and education can be obtained by the points on the scatter graph. For example, suppose that one person, referred to as P1, has 13 years of education and is earning $20 per hour. The next person, P2, has 20 years of education and paid $30 per hour.

Equation of a line is Υ = mX + b, where m is the slope and b is the intercept, i.e. where the line cuts the y axis. We have to find this line of best fit that will represent the general pattern in the sample.

In regression analysis, the line will be represented as Υ = β0+ β1X. We have simply changed the notation: β0 is the intercept and β1 is the slope of the gradient of the line. Software packages such as excel and MATLAB, can estimate the regression line.

So the equation now becomes:

Wages = β0 + β1Education

Situation 1

To determine whether there is a relation between wages and education, observe β1, the slope of the regression line. If β1 is positive, then there is a positive relation between wages and education. The more education a person attains, the higher the wage. This is clarified by the graph below:

Regression Analysis Situation 1

Situation 2

If the data from the survey looks like the graph below, a negative relation exists. The regression line is downward sloping from left to right. The trend here is that the more educated an individual, the less they earn in wages.

Regression Analysis Situation 2

Situation 3

A third scenario is when there is no relation between wages and education. In that case, the line would cut through the data as follows; the line of best fit is a horizontal line.

Regression Analysis Situation 3

Wages = 5 + 1Education

Suppose an individual has just finished high school and has 12 years of education. Substituting the value in the above equation, we get the hourly wage as:

Wages = 5 + 1×12 = 17

The next individual with 22 years of education, his expected wage would be:

Wages = 5 + 1×22 = 27, i.e. $27per hour.

Regression Analysis Situation 4

Thus we see that, for every additional 1 year of education, wages is expected to increase by $1 per hour. In case of a person with no education, β0 = 0, the equation reduces down to: wages = 5. This is the minimum wage since if a person has no education, he or she is expected to get at least $5 per hour.

Residuals

Referring to the equation of person P1 above with 12 years of education, the individual earns $17 per hour. However suppose in reality we find that the person actually earns $22 per hour! This does not imply that the regression equation is incorrect, but in fact can be attributed to a factor termed as residual. Thus residual is the difference between the actual wage and predicted wage.

So for P1, the residual is 22-17 = 5($). The regression model is the best guess at the hourly wage given the level of education. However, in real life many other factors in addition to education such as number of years of experience, IQ, networking ability, height, etc.

They were not accounted for and are contained in the residual term depicted by µ. So the revised equation would now be:

Υ = β0+ β1X + μ

Regression Analysis Situation 5

Summary

The main highlights of the article above are as follows:

  • The regression line is the line of best fit. It is the line that best represents the trend or relation in the given data
  • β1 is the slope of the line. The relation between the dependent and independent variable is:
    • Positive If: β1 > 0
    • Negative If: β1 < 0
    • No Relationship If: β1 = 0
  • The estimated regression can be used to make predictions for Υ given X.
  • Residual = Actual - Predicted
  • The residual term accounts for the error in the prediction. It contains all other factors (except X) that impact Y.

Article Written by

Admin

Leave a reply

Your email address will not be published. Required fields are marked *

Related Posts

Why Indian Firms Must Strive for Strategic Autonomy in Their Geoeconomic Strategies

Admin

The Wirecard and Infosys Scandals are a Lesson on How NOT to Treat Whistleblowers

Admin

Post Product Launch

Admin