Section 1.3 The derivative of a function at a point

Motivating Questions

How is the average rate of change of a function on a given interval defined, and what does this quantity measure?

How is the instantaneous rate of change of a function at a particular point defined? How is the instantaneous rate of change linked to average rate of change?

What is the derivative of a function at a given point? What does this derivative value measure? How do we interpret the derivative value graphically?

How are limits used formally in the computation of derivatives?

An idea that sits at the foundations of calculus is the instantaneous rate of change of a function. This rate of change is always considered with respect to change in the input variable, often at a particular fixed input value. This is a generalization of the notion of instantaneous velocity and essentially allows us to consider the question “how do we measure how fast a particular function is changing at a given point?” When the original function represents the position of a moving object, this instantaneous rate of change is precisely velocity, and might be measured in units such as feet per second. But in other contexts, instantaneous rate of change could measure the number of cells added to a bacterial culture per day, the number of additional gallons of gasoline consumed per mile for each additional mile per hour of a car's speed, or the number of dollars added to a mortgage payment for each percentage point increase in interest rate. Regardless of the presence of a physical or practical interpretation of a function, the instantaneous rate of change may also be interpreted geometrically in connection to the function's graph, and this connection is also foundational to many of the main ideas in calculus.

In what follows, we will introduce terminology and notation that make it easier to talk about the instantaneous rate of change of a function at a point. In addition, just as instantaneous velocity is defined in terms of average velocity, the more general instantaneous rate of change will be connected to the more general average rate of change. Recall that for a moving object with position function \(s\text{,}\) its average velocity on the time interval \(t = a\) to \(t = a+h\) is given by the quotient
\begin{equation*}
\frac{s(a+h)-s(a)}{h}.
\end{equation*}
In the same way, for an arbitrary function \(f\text{,}\) the average rate of change of \(f\) on the interval \([a,a+h]\) is the quotient \(\frac{f(a+h)-f(a)}{h}\text{.}\)

It is essential to understand how the average rate of change of \(f\) on an interval is connected to its graph.
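As a concrete illustration of the quotient \(\frac{f(a+h)-f(a)}{h}\text{,}\) the following minimal Python sketch computes the average rate of change of a function on \([a,a+h]\text{.}\) The helper name `average_rate_of_change` and the sample function \(x^2\) are illustrative assumptions, not part of the text.

```python
def average_rate_of_change(f, a, h):
    """Return (f(a+h) - f(a)) / h, the average rate of change of f on [a, a+h]."""
    if h == 0:
        raise ValueError("h must be nonzero: the quotient is undefined at h = 0")
    return (f(a + h) - f(a)) / h


def square(x):
    # Illustrative sample function (an assumption, not from the text).
    return x ** 2


# Average rate of change of x^2 on [1, 3]: (9 - 1) / 2
print(average_rate_of_change(square, 1, 2))  # 4.0
```

Any function of one variable can be passed in place of `square`.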

Preview Activity 1.3.1

Suppose that \(f\) is the function given by the graph below and that \(a\) and \(a+h\) are the input values as labeled on the \(x\)-axis. Use the graph in Figure 1.3.2 to answer the following questions.

Locate and label the points \((a,f(a))\) and \((a+h, f(a+h))\) on the graph.

Construct a right triangle whose hypotenuse is the line segment from \((a,f(a))\) to \((a+h,f(a+h))\text{.}\) What are the lengths of the respective legs of this triangle?

What is the slope of the line that connects the points \((a,f(a))\) and \((a+h, f(a+h))\text{?}\)

Write a meaningful sentence that explains how the average rate of change of the function on a given interval and the slope of a related line are connected.

Subsection 1.3.1 The Derivative of a Function at a Point

Just as we defined instantaneous velocity in terms of average velocity, we now define the instantaneous rate of change of a function at a point in terms of the average rate of change of the function \(f\) over related intervals. In addition, we give a special name to “the instantaneous rate of change of \(f\) at \(a\text{,}\)” calling this quantity “the derivative of \(f\) at \(a\text{,}\)” with this value being represented by the shorthand notation \(f'(a)\text{.}\) Specifically, we make the following definition.

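Definition 1.3.3

Let \(f\) be a function and \(x = a\) a value in the function's domain. We define the derivative of \(f\) with respect to \(x\) evaluated at \(x = a\text{,}\) denoted \(f'(a)\text{,}\) by the formula
\begin{equation*}
f'(a) = \lim_{h \to 0} \frac{f(a+h)-f(a)}{h},
\end{equation*}
provided this limit exists.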

Aloud, we read the symbol \(f'(a)\) as either “\(f\)-prime at \(a\)” or “the derivative of \(f\) evaluated at \(x = a\text{.}\)” Much of the next several chapters will be devoted to understanding, computing, applying, and interpreting derivatives. For now, we observe the following important things.

Note 1.3.4

The derivative of \(f\) at the value \(x = a\) is defined as the limit of the average rate of change of \(f\) on the interval \([a,a+h]\) as \(h \to 0\text{.}\) It is possible for this limit not to exist, so not every function has a derivative at every point.

We say that a function that has a derivative at \(x = a\) is differentiable at \(x = a\text{.}\)

The derivative is a generalization of the instantaneous velocity of a position function: when \(y = s(t)\) is a position function of a moving body, \(s'(a)\) tells us the instantaneous velocity of the body at time \(t=a\text{.}\)

Because the units on \(\frac{f(a+h)-f(a)}{h}\) are “units of \(f\) per unit of \(x\text{,}\)” the derivative has these very same units. For instance, if \(s\) measures position in feet and \(t\) measures time in seconds, the units on \(s'(a)\) are feet per second.

Because the quantity \(\frac{f(a+h)-f(a)}{h}\) represents the slope of the line through \((a,f(a))\) and \((a+h, f(a+h))\text{,}\) when we compute the derivative we are taking the limit of a collection of slopes of lines, and thus the derivative itself represents the slope of a particularly important line.
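To see how the limit of the average rate of change can fail to exist, consider this small Python sketch. The function \(|x|\) at \(a = 0\) is our own illustrative choice, not an example from the text, and `difference_quotient` is a hypothetical helper name.

```python
def difference_quotient(f, a, h):
    """Average rate of change of f on the interval between a and a + h."""
    return (f(a + h) - f(a)) / h


# Illustrative choice (an assumption, not from the text): f(x) = |x| at a = 0.
for h in [0.1, 0.01, 0.001]:
    right = difference_quotient(abs, 0, h)   # h > 0: interval [0, h]
    left = difference_quotient(abs, 0, -h)   # h < 0: interval [-h, 0]
    print(f"h = {h:<6} right: {right:+.1f}  left: {left:+.1f}")
```

The quotients taken from the right are all \(1\) while those taken from the left are all \(-1\text{,}\) so no single value is approached as \(h \to 0\text{,}\) and \(|x|\) has no derivative at \(x = 0\text{.}\)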

While all of the above ideas are important and we will add depth and perspective to them through additional time and study, for now it is most essential to recognize how the derivative of a function at a given value represents the slope of a certain line. Thus, we expand upon the last bullet item above.

As we move from an average rate of change to an instantaneous one, we can think of one point as “sliding towards” another. In particular, provided the function has a derivative at \(x = a\text{,}\) the point \((a+h,f(a+h))\) will approach \((a,f(a))\) as \(h \to 0\text{.}\) Because this process of taking a limit is a dynamic one, it can be helpful to use computing technology to visualize what the limit is accomplishing. While there are many different options, one of the best is a Java applet in which the user is able to control the point that is moving. For a helpful collection of examples, consider the work of David Austin of Grand Valley State University, and this particularly relevant example. For applets that have been built in GeoGebra, see Marc Renault's library via Shippensburg University, with this example being especially fitting for our work in this section. (You can even consider building your own examples; the fantastic program GeoGebra is available for free download and is easy to learn and use.)

In Figure 1.3.5, we provide a sequence of figures with several different lines through the points \((a, f(a))\) and \((a+h,f(a+h))\) that are generated by different values of \(h\text{.}\) These lines (shown in the first three figures in magenta) are often called secant lines to the curve \(y = f(x)\text{.}\) A secant line to a curve is simply a line that passes through two points that lie on the curve. For each such line, the slope of the secant line is \(m = \frac{f(a+h) - f(a)}{h}\text{,}\) where the value of \(h\) depends on the location of the point we choose. We can see in the diagram how, as \(h \to 0\text{,}\) the secant lines start to approach a single line that passes through the point \((a,f(a))\text{.}\) In the situation where the limit of the slopes of the secant lines exists, we say that the resulting value is the slope of the tangent line to the curve. This tangent line (shown in the right-most figure in green) to the graph of \(y = f(x)\) at the point \((a,f(a))\) is the line through \((a,f(a))\) whose slope is \(m = f'(a)\text{.}\)
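The way secant lines settle toward a single line can also be explored numerically. The sketch below is only an illustration under our own assumptions: the helper `secant_line`, the sample function \(x^3\text{,}\) and the point \(a = 1\) are not taken from the text.

```python
def secant_line(f, a, h):
    """Return (slope, intercept) of the line through (a, f(a)) and (a+h, f(a+h))."""
    slope = (f(a + h) - f(a)) / h
    intercept = f(a) - slope * a  # solve f(a) = slope * a + intercept
    return slope, intercept


def cube(x):
    # Illustrative sample function (an assumption, not from the text).
    return x ** 3


a = 1
for h in [1, 0.5, 0.1, 0.01, 0.001]:
    m, b = secant_line(cube, a, h)
    print(f"h = {h:<6} secant line: y = {m:.4f} x + ({b:.4f})")
```

For this sample function the secant slope simplifies to \(3 + 3h + h^2\text{,}\) so the printed slopes approach \(3\) and the intercepts approach \(-2\text{;}\) the secant lines approach the line \(y = 3x - 2\text{.}\)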

As we will see in subsequent study, the existence of the tangent line at \(x = a\) is connected to whether or not the function \(f\) looks like a straight line when viewed up close at \((a,f(a))\text{,}\) which can also be seen in Figure 1.3.6, where we combine the four graphs in Figure 1.3.5 into the single one on the left, and then we zoom in on the box centered at \((a,f(a))\text{,}\) with that view expanded on the right (with two of the secant lines omitted). Note how the tangent line sits relative to the curve \(y = f(x)\) at \((a,f(a))\) and how closely it resembles the curve near \(x = a\text{.}\)

Note 1.3.7

The instantaneous rate of change of \(f\) with respect to \(x\) at \(x = a\text{,}\) \(f'(a)\text{,}\) also measures the slope of the tangent line to the curve \(y = f(x)\) at \((a,f(a))\text{.}\)
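Since the tangent line passes through \((a,f(a))\) with slope \(f'(a)\text{,}\) the point-slope form of a line gives one way to write its equation, assuming \(f'(a)\) exists:
\begin{equation*}
y = f(a) + f'(a)(x - a).
\end{equation*}
Setting \(x = a\) recovers \(y = f(a)\text{,}\) confirming that the line passes through the point of tangency.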

The following example demonstrates several key ideas involving the derivative of a function.

Example 1.3.8 Using the limit definition of the derivative

For the function given by \(f(x) = x - x^2\text{,}\) use the limit definition of the derivative to compute \(f'(2)\text{.}\) In addition, discuss the meaning of this value and draw a labeled graph that supports your explanation.

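From the limit definition, we know that
\begin{equation*}
f'(2) = \lim_{h \to 0} \frac{f(2+h)-f(2)}{h}.
\end{equation*}
Now we use the rule for \(f\text{,}\) and observe that \(f(2) = 2 - 2^2 = -2\) and \(f(2+h) = (2+h) - (2+h)^2\text{.}\) Substituting these values into the limit definition, we have that
\begin{equation*}
f'(2) = \lim_{h \to 0} \frac{(2+h) - (2+h)^2 - (-2)}{h}.
\end{equation*}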

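With \(h\) in the denominator and our desire to let \(h \to 0\text{,}\) we have to wait to take the limit (that is, we wait to actually let \(h\) approach 0). Thus, we do additional algebra. Expanding and distributing in the numerator,
\begin{equation*}
f'(2) = \lim_{h \to 0} \frac{2 + h - 4 - 4h - h^2 + 2}{h}.
\end{equation*}
Combining like terms, we have
\begin{equation*}
f'(2) = \lim_{h \to 0} \frac{-3h - h^2}{h}.
\end{equation*}
Next, we observe that there is a common factor of \(h\) in both the numerator and denominator, which allows us to simplify and find that
\begin{equation*}
f'(2) = \lim_{h \to 0} (-3 - h).
\end{equation*}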

Finally, we are able to take the limit as \(h \to 0\text{,}\) and thus conclude that \(f'(2) = -3\text{.}\)

Now, we know that \(f'(2)\) represents the slope of the tangent line to the curve \(y = x - x^2\) at the point \((2,-2)\text{;}\) \(f'(2)\) is also the instantaneous rate of change of \(f\) at the point \((2,-2)\text{.}\) Graphing both the function and the line through \((2,-2)\) with slope \(m = f'(2) = -3\text{,}\) we indeed see that by calculating the derivative, we have found the slope of the tangent line at this point, as shown in Figure 1.3.9.

Figure 1.3.9 The tangent line to \(y = x - x^2\) at the point \((2,-2)\text{.}\)
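The algebraic result in this example can be checked numerically. The following Python sketch is a sanity check rather than a replacement for the limit computation; the helper names `difference_quotient` and `tangent_line` are our own.

```python
def f(x):
    return x - x ** 2


def difference_quotient(h, a=2):
    """Average rate of change of f on the interval between a and a + h."""
    return (f(a + h) - f(a)) / h


def tangent_line(x, a=2, slope=-3):
    """Line through (a, f(a)) with the slope found in the example."""
    return f(a) + slope * (x - a)


# Small positive and negative h values should give quotients near -3.
for h in [0.1, 0.01, 0.001, -0.001, -0.01, -0.1]:
    print(f"h = {h:>6}: difference quotient = {difference_quotient(h):.4f}")

print(tangent_line(2))  # the tangent line passes through (2, -2)
```

For this function the difference quotient simplifies to \(-3 - h\text{,}\) so the printed values move toward \(-3\) from both sides as \(h\) shrinks.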

The following activities will help you explore a variety of key ideas related to derivatives.

Activity 1.3.2

Consider the function \(f\) whose formula is \(\displaystyle f(x) = 3 - 2x\text{.}\)

What familiar type of function is \(f\text{?}\) What can you say about the slope of \(f\) at every value of \(x\text{?}\)

Compute the average rate of change of \(f\) on the intervals \([1,4]\text{,}\) \([3,7]\text{,}\) and \([5,5+h]\text{;}\) simplify each result as much as possible. What do you notice about these quantities?

Use the limit definition of the derivative to compute the exact instantaneous rate of change of \(f\) with respect to \(x\) at the value \(a = 1\text{.}\) That is, compute \(f'(1)\) using the limit definition. Show your work. Is your result surprising?

Without doing any additional computations, what are the values of \(f'(2)\text{,}\) \(f'(\pi)\text{,}\) and \(f'(-\sqrt{2})\text{?}\) Why?

Activity 1.3.3

A water balloon is tossed vertically in the air from a window. The balloon's height in feet at time \(t\) in seconds after being launched is given by \(s(t) = -16t^2 + 16t + 32\text{.}\) Use this function to respond to each of the following questions.

Sketch an accurate, labeled graph of \(s\) on the axes provided in Figure 1.3.10. You should be able to do this without using computing technology.

Compute the average rate of change of \(s\) on the time interval \([1,2]\text{.}\) Include units on your answer and write one sentence to explain the meaning of the value you found.

Use the limit definition to compute the instantaneous rate of change of \(s\) with respect to time, \(t\text{,}\) at the instant \(a = 1\text{.}\) Show your work using proper notation, include units on your answer, and write one sentence to explain the meaning of the value you found.

On your graph in (a), sketch two lines: one whose slope represents the average rate of change of \(s\) on \([1,2]\text{,}\) the other whose slope represents the instantaneous rate of change of \(s\) at the instant \(a=1\text{.}\) Label each line clearly.

For what values of \(a\) do you expect \(s'(a)\) to be positive? Why? Answer the same questions when “positive” is replaced by “negative” and “zero.”

Figure 1.3.10 Axes for plotting \(y = s(t)\) in Activity 1.3.3.

Activity 1.3.4

A rapidly growing city in Arizona has its population \(P\) at time \(t\text{,}\) where \(t\) is the number of decades after the year 2010, modeled by the formula \(P(t) = 25000 e^{t/5}\text{.}\) Use this function to respond to the following questions.

Sketch an accurate graph of \(P\) for \(t = 0\) to \(t = 5\) on the axes provided in Figure 1.3.12. Label the scale on the axes carefully.

Compute the average rate of change of \(P\) between 2030 and 2050. Include units on your answer and write one sentence to explain the meaning (in everyday language) of the value you found.

Use the limit definition to write an expression for the instantaneous rate of change of \(P\) with respect to time, \(t\text{,}\) at the instant \(a = 2\text{.}\) Explain why this limit is difficult to evaluate exactly.

Estimate the limit in (c) for the instantaneous rate of change of \(P\) at the instant \(a = 2\) by using several small \(h\) values. Once you have determined an accurate estimate of \(P'(2)\text{,}\) include units on your answer, and write one sentence (using everyday language) to explain the meaning of the value you found.

On your graph above, sketch two lines: one whose slope represents the average rate of change of \(P\) on \([2,4]\text{,}\) the other whose slope represents the instantaneous rate of change of \(P\) at the instant \(a=2\text{.}\)

In a carefully-worded sentence, describe the behavior of \(P'(a)\) as \(a\) increases in value. What does this reflect about the behavior of the given function \(P\text{?}\)

Figure 1.3.12 Axes for plotting \(y = P(t)\) in Activity 1.3.4.

Subsection 1.3.2 Summary

The average rate of change of a function \(f\) on the interval \([a,b]\) is \(\frac{f(b)-f(a)}{b-a}\text{.}\) The units on the average rate of change are units of \(f\) per unit of \(x\text{,}\) and the numerical value of the average rate of change represents the slope of the secant line between the points \((a,f(a))\) and \((b,f(b))\) on the graph of \(y = f(x)\text{.}\) If we view the interval as being \([a,a+h]\) instead of \([a,b]\text{,}\) the meaning is still the same, but the average rate of change is now computed by \(\frac{f(a+h)-f(a)}{h}\text{.}\)

The instantaneous rate of change with respect to \(x\) of a function \(f\) at a value \(x = a\) is denoted \(f'(a)\) (read “the derivative of \(f\) evaluated at \(a\)” or “\(f\)-prime at \(a\)”) and is defined by the formula
\begin{equation*}
f'(a) = \lim_{h \to 0} \frac{f(a+h)-f(a)}{h},
\end{equation*}
provided the limit exists. Note particularly that the instantaneous rate of change at \(x = a\) is the limit of the average rate of change on \([a,a+h]\) as \(h \to 0\text{.}\)

Provided the derivative \(f'(a)\) exists, its value tells us the instantaneous rate of change of \(f\) with respect to \(x\) at \(x = a\text{,}\) which geometrically is the slope of the tangent line to the curve \(y = f(x)\) at the point \((a,f(a))\text{.}\) We even say that \(f'(a)\) is the “slope of the curve” at the point \((a,f(a))\text{.}\)

Limits are the link between average rate of change and instantaneous rate of change: they allow us to move from the rate of change over an interval to the rate of change at a single point.

Consider the graph of \(y = f(x)\) provided in Figure 1.3.13.

On the graph of \(y = f(x)\text{,}\) sketch and label the following quantities:

the secant line to \(y = f(x)\) on the interval \([-3,-1]\) and the secant line to \(y = f(x)\) on the interval \([0,2]\text{.}\)

the tangent line to \(y = f(x)\) at \(x = -3\) and the tangent line to \(y = f(x)\) at \(x = 0\text{.}\)

What is the approximate value of the average rate of change of \(f\) on \([-3,-1]\text{?}\) On \([0,2]\text{?}\) How are these values related to your work in (a)?

What is the approximate value of the instantaneous rate of change of \(f\) at \(x = -3\text{?}\) At \(x = 0\text{?}\) How are these values related to your work in (a)?

Figure 1.3.13 Plot of \(y = f(x)\text{.}\)

7

For each of the following prompts, sketch a graph on the provided axes in Figure 1.3.14 of a function that has the stated properties.

\(y = f(x)\) such that

the average rate of change of \(f\) on \([-3,0]\) is \(-2\) and the average rate of change of \(f\) on \([1,3]\) is 0.5, and

the instantaneous rate of change of \(f\) at \(x = -1\) is \(-1\) and the instantaneous rate of change of \(f\) at \(x = 2\) is 1.

\(y = g(x)\) such that

\(\frac{g(3)-g(-2)}{5} = 0\) and \(\frac{g(1)-g(-1)}{2} = -1\text{,}\) and

\(g'(2) = 1\) and \(g'(-1) = 0\)

8

Suppose that the population, \(P\text{,}\) of China (in billions) can be approximated by the function \(P(t) = 1.15(1.014)^t\) where \(t\) is the number of years since the start of 1993.

According to the model, what was the total change in the population of China between January 1, 1993 and January 1, 2000? What will be the average rate of change of the population over this time period? Is this average rate of change greater or less than the instantaneous rate of change of the population on January 1, 2000? Explain and justify, being sure to include proper units on all your answers.

According to the model, what is the average rate of change of the population of China in the ten-year period starting on January 1, 2012?

Write an expression involving limits that, if evaluated, would give the exact instantaneous rate of change of the population on today's date. Then estimate the value of this limit (discuss how you chose to do so) and explain the meaning (including units) of the value you have found.

Find an equation for the tangent line to the function \(y = P(t)\) at the point where the \(t\)-value is given by today's date.

9

The goal of this problem is to compute the value of the derivative at a point for several different functions, where for each one we do so in three different ways, and then to compare the results to see that each produces the same value.

For each of the following functions, use the limit definition of the derivative to compute the value of \(f'(a)\) using three different approaches: strive to use the algebraic approach first (to compute the limit exactly), then test your result using numerical evidence (with small values of \(h\)), and finally plot the graph of \(y = f(x)\) near \((a,f(a))\) along with the appropriate tangent line to estimate the value of \(f'(a)\) visually. Compare your findings among all three approaches; if you are unable to complete the algebraic approach, still work numerically and graphically.