When data curves, a line won't fit — here's what to use instead.
In 2020, researchers tracking COVID-19 case counts noticed something alarming: the numbers weren't growing by a fixed amount each day, they were roughly doubling. A straight line fit to the first week of data predicted 500 cases by day 14. The actual count was over 8,000. The line wasn't wrong because of bad data — it was wrong because the wrong model was chosen. Picking the right regression model is the difference between a useful prediction and a dangerous one.
You already know linear regression: fit a line to data that grows at a roughly constant rate. But many real situations don't grow that way. Data that curves upward faster and faster often follows an exponential model. Data that rises, peaks, and falls often follows a quadratic model. Data that grows quickly at first and then slows down often follows a power model. Your graphing calculator can find the best-fit equation for each of these, just as it finds a line of best fit.
The three models you need to know are:
In every case, , , and are constants your calculator finds by minimizing the total error between the model and the actual data points. The process on the calculator is identical for all three: enter data into lists, run the regression, read the equation, check the correlation.
The correlation coefficient measures how well a linear model fits data, and (called the coefficient of determination) measures how well any regression model fits. An value of 1 means the model explains every bit of variation in the data perfectly. An value of 0.94 means the model accounts for 94% of the variation — the remaining 6% is scatter the model doesn't capture. Higher is better, but context matters: a model with that makes no physical sense is still a bad model.
Here is a data set showing a city's population (in thousands) over several decades:
| Year (since 1960) | Population (thousands) | |---|---| | 0 | 12 | | 10 | 19 | | 20 | 30 | | 30 | 47 | | 40 | 74 | | 50 | 116 |
The data grows faster and faster, which suggests an exponential model. Enter the years in L1 and the populations in L2. Run ExpReg on your calculator.
The base means the population grows by about 4.6% per year. That interpretation comes directly from the model — it's not a separate calculation.
Now use the model to make a prediction. Estimate the population 60 years after 1960, meaning in 2020.
Not every curved data set is exponential. Consider this data showing the height of a ball (in feet) at various times after being thrown:
| Time (seconds) | Height (feet) | |---|---| | 0 | 4 | | 1 | 28 | | 2 | 36 | | 3 | 28 | | 4 | 4 |
This data rises and then falls, which is the signature of a quadratic model. Run QuadReg on your calculator.
Here is a visualization of all three model types so you can see how their shapes differ:
The red curve is exponential — it accelerates upward without bound. The blue curve is quadratic — it rises, peaks, and falls. The green curve is a power model — it grows quickly at first and then levels off relative to the exponential.
Choosing between models requires both visual judgment and comparison. If two models give similar values, consider the context. A population that's been growing steadily is more likely exponential than quadratic — quadratic growth implies the population will eventually start shrinking, which rarely happens on its own.
On Part II and Part III of the Algebra II Regents, regression problems almost always ask you to write the equation, use it to make a prediction, and state what a coefficient means in context. Leaving out any of these three parts costs points. A complete answer names the regression type, writes the full equation with rounded coefficients, performs the prediction calculation, and interprets the result in the units of the original problem.