The derivative of a function measures its instantaneous rate of change at any point. Geometrically, is the slope of the tangent line to at .
Derivatives power every optimization problem, every physics equation involving rates, and every approximation method in applied math. If you understand derivatives deeply, the rest of calculus becomes much easier.
The formal definition is . This limit captures the idea of zooming in on a curve until it looks like a straight line, then measuring that line's slope.
The definition and what it means
The derivative is the limit of the difference quotient. The difference quotient is the slope of a secant line through two nearby points on the graph of .
As , the secant line becomes the tangent line, and its slope becomes the derivative. This is the geometric heart of the derivative: zoom in far enough on any smooth curve and it looks like a straight line. The derivative measures the slope of that local line.
If this limit exists at , we say is differentiable at . Differentiability implies continuity — if is differentiable at , then is also continuous at . The converse is false: is continuous at but not differentiable (it has a sharp corner where two lines meet at different slopes).
Places where differentiability fails: sharp corners (like ), cusps (like ), vertical tangent lines (like at ), and discontinuities. At all of these points, the limit of the difference quotient either doesn't exist or is infinite.
💡Explain it simply
Imagine zooming in on a curve with a magnifying glass. The more you zoom, the more the curve looks like a straight line. The slope of that 'zoomed-in line' is the derivative.
A sharp corner (like the tip of a V) can never be zoomed into a single straight line — from the left it looks like one slope, from the right a different slope. That's why corners don't have derivatives.
Computing a derivative from the definition
- Find for using the limit definition.
- .
- Expand .
- Subtract : .
- Divide by : .
- Take the limit: .
- So . This matches the power rule () plus the derivative of ().
Core differentiation rules
Power rule: for any real number . This works for negative and fractional exponents too: and .
Constant multiple rule: . Constants pass through the derivative.
Sum/difference rule: . Differentiate term by term.
Constant rule: . The derivative of any constant is zero.
Using the power rule on a polynomial
- Find .
- Apply the rules term by term: sum rule lets us differentiate each term separately, constant multiple rule pulls out the coefficients.
- First term: (power rule: bring down the , reduce exponent by ).
- Second term: .
- Third term: (since , power rule gives ).
- Fourth term: (constant rule).
- Combine: .
Derivatives of common functions
Exponentials: — the defining property of . No other function equals its own derivative. More generally, (the factor accounts for the base).
Logarithms: for . This is why appears everywhere in integration — it's the antiderivative of . More generally, .
Trigonometric: , (note the negative sign!), .
More trig: , , . Notice the pattern: the 'co-' functions (, , ) always pick up a negative sign.
These formulas are non-negotiable — they must be committed to memory. Every more complex derivative ultimately reduces to combinations of these through the product, quotient, and chain rules.
💡Explain it simply
Think of these as the 'vocabulary words' of calculus. Just like you can't write sentences without knowing words, you can't compute derivatives without knowing these basic building blocks. The rules (product, quotient, chain) are the grammar; these formulas are the words.
The pattern to notice: trig derivatives cycle — . And the 'co-' functions always introduce an extra negative sign.
Product rule
When two functions are multiplied: .
Think of it as: "derivative of the first times the second, plus the first times the derivative of the second."
Example: .
Tip: always check if you can simplify the expression first. Sometimes you don't need the product rule at all.
Product rule with exponential and polynomial
- Find .
- Identify the two functions: and .
- Find each derivative: and .
- Apply the product rule: .
- Factor out common terms: .
- Final answer: .
Quotient rule
For a ratio of functions: . The order matters — it's first, then subtract .
Memory aid: "low d-high minus high d-low, over the square of what's below." Here 'high' is the numerator , 'low' is the denominator , and 'd-' means 'derivative of.'
Alternative: you can often rewrite as and use the product rule with the chain rule instead. For simple denominators, sometimes pure algebra avoids the quotient rule entirely.
When to definitely use the quotient rule: when the denominator is a non-trivial function (like , , etc.) and rewriting would be messier than just applying the formula.
Quotient rule with trigonometric functions
- Find .
- Identify: , , , .
- Apply the formula: .
- Simplify: .
- Factor: .
Chain rule
The chain rule handles compositions: if , then .
In words: derivative of the outer function (evaluated at the inner function) times the derivative of the inner function.
Example: . The outer function is , the inner function is .
Nested chains: for , apply the chain rule twice: .
The chain rule is arguably the most important rule. It appears everywhere: in implicit differentiation, related rates, and integration by substitution.
💡Explain it simply
Imagine a machine with two gears connected together. The big gear turns the small gear. If you want to know how fast the final output changes, you need to know two things: how fast the big gear turns the small gear, and how fast the small gear turns on its own.
That's the chain rule. If you have a function inside another function (like of ), the rate of change of the whole thing is: how fast the outer part changes × how fast the inner part changes.
A real-world example: temperature affects ice cream sales, and ice cream sales affect your revenue. If temperature goes up by 1°, sales go up by 50 cones, and each cone gives you $3, then your revenue goes up by dollars per degree. You multiplied the two rates together — that's the chain rule.
Chain rule with a nested composition
- Find .
- Identify the outer and inner functions. Outer: where . Inner: .
- Derivative of the outer function: , evaluated at gives .
- Derivative of the inner function: .
- Multiply them (chain rule): .
- Final answer: .
Double chain rule
- Find .
- There are three nested layers: , then , then . We peel them off one at a time.
- Outermost layer: , evaluated at gives .
- Middle layer: , evaluated at gives .
- Innermost layer: .
- Multiply all three together: .
- Final answer: .
Inverse trigonometric derivatives
, valid for . The domain restriction comes from the original function: only takes inputs between and .
. Notice this is exactly the negative of 's derivative. This makes sense because (they always add to a right angle), so their derivatives must be negatives of each other.
. This is the most commonly used inverse trig derivative. It appears constantly in integration (the antiderivative of is ).
With the chain rule: . For example, .
Where these come from: use implicit differentiation. If , then . Differentiate: , so .
💡Explain it simply
Inverse trig functions answer questions like: 'What angle has a sine of ?' The derivative tells you how sensitive that angle is to small changes in the input value.
The square-root expressions (, ) come from the Pythagorean theorem. When you 'unwrap' the inverse function using a right triangle, the missing side gives you the derivative.
Inverse trig derivative with the chain rule
- Find .
- Use the chain rule: outer function is with .
- Derivative of outer: evaluated at gives .
- Derivative of inner: .
- Multiply: .
- Domain: requires , i.e., .
Implicit differentiation
When is defined implicitly by an equation like , differentiate both sides with respect to .
Every time you differentiate a term, attach a factor (this is the chain rule in action: is a function of ).
Example: differentiate to get , then solve .
Implicit differentiation is essential for curves that can't be written as , such as circles, ellipses, and other relations.
💡Explain it simply
Usually, you have a nice equation like where is alone on one side. But sometimes and are tangled together, like in (a circle). You can't easily solve for .
Implicit differentiation says: just differentiate everything with respect to anyway. Whenever you hit a , remember that secretly depends on , so slap on a (that's just the chain rule). Then solve for .
It's like a detective figuring out how fast a hidden variable changes by looking at the equation it's trapped in.
Implicit differentiation of a circle
- Find for .
- Differentiate both sides with respect to . Remember: is a function of , so we must use the chain rule on terms.
- Left side: .
- The came from the chain rule: .
- Right side: .
- So: .
- Solve for : , therefore .
- At the point on the circle: . This makes geometric sense — the tangent line at on a circle centered at the origin should slope downward to the right.
Logarithmic differentiation
For complicated products, quotients, or expressions where the variable appears in both the base and exponent (like ), take the natural log of both sides first, then differentiate implicitly.
Why it works: converts products into sums (), quotients into differences (), and powers into products (). These are all easier to differentiate.
The procedure: let , take of both sides, simplify using log properties, differentiate implicitly (), then solve for by multiplying both sides by .
This is the only way to differentiate variable-base, variable-exponent expressions like or . Neither the power rule nor the exponential rule applies when both the base and exponent depend on .
💡Explain it simply
Some expressions are like nested puzzles — the variable is both the base and the exponent, so no single rule works. Logarithmic differentiation is like taking the puzzle apart first. The logarithm untangles the messy structure into simple pieces that you know how to differentiate, and then you put the answer back together.
Logarithmic differentiation of
- Find for .
- Let . Take of both sides: .
- Differentiate both sides with respect to . Left side: (chain rule on ).
- Right side: product rule on : .
- So .
- Multiply both sides by : .
- Note: is only defined for , and the derivative is zero when , i.e., .
Higher-order derivatives
The second derivative is the derivative of . It measures how the rate of change itself is changing — the 'rate of the rate.'
Notation: , , or . For higher orders: , , etc. The notation reminds you that you're applying twice.
Physical meaning: if is position, then is velocity (how fast position changes) and is acceleration (how fast velocity changes). A car accelerating has positive ; a car braking has negative .
The second derivative tells you about concavity: means concave up (the curve bends upward, like a bowl), means concave down (bends downward, like a hill). This is central to the second derivative test for classifying extrema.
Higher-order derivatives appear in Taylor series: the th coefficient uses , letting you approximate any smooth function with a polynomial. Some functions (like and ) have patterns in their higher derivatives that make the Taylor series elegant.
💡Explain it simply
The first derivative tells you speed — how fast the function is changing. The second derivative tells you acceleration — is the change itself speeding up or slowing down?
When you drive a car: position → velocity (first derivative, how fast you're going) → acceleration (second derivative, are you pressing the gas or the brake?). The third derivative is sometimes called 'jerk' — how suddenly you slam the gas pedal.
Computing higher-order derivatives
- Find for .
- First derivative: .
- Second derivative: .
- Third derivative: .
- Fourth derivative: .
- Fifth and beyond: for . Any polynomial of degree has all derivatives beyond the th equal to zero.
Tangent lines and linearization
The tangent line to at is: . This is the best linear approximation near .
Linearization: for close to . This is incredibly useful for quick estimates.
Example: approximate . Use at : , .
Linearization gives . The actual value is , very close!
Differentials: gives the approximate change in for a small change in .
Interpreting the derivative graphically
on an interval means is increasing (going uphill) there. means is decreasing (going downhill). The magnitude tells you how steep the slope is.
Where , the tangent line is horizontal. These are critical points — candidates for local maxima, local minima, or inflection points.
The sign of changing from to at a critical point signals a local maximum (function goes up then down). From to signals a local minimum (down then up). If doesn't change sign, the critical point is neither (e.g., at — the function flattens momentarily but keeps going in the same direction).
Reading 's graph to understand : where is above the -axis, is climbing. Where is below, is falling. The -intercepts of correspond to the peaks and valleys (or flat inflection points) of . The higher is, the steeper is at that point.
You can also go the other way: given the graph of , sketch by estimating the slope at various points. Steep uphill = large positive ; flat = near zero; steep downhill = large negative .
💡Explain it simply
The derivative's graph is like a 'steepness report' for the original function. Wherever the report says positive, the original function is climbing uphill. Wherever it says negative, it's going downhill. Wherever the report crosses zero, the original function has a flat moment — a hilltop, a valley, or a brief rest.
If you've ever seen a trail elevation profile (height vs. distance), the derivative is the steepness at each point along the trail. Positive = uphill, negative = downhill, zero = flat.
Hyperbolic functions and their derivatives
The hyperbolic functions are built from exponentials: and .
They satisfy (compare with for trig).
Derivatives mirror trig but without the sign changes: and .
Also: where .
Inverse hyperbolics have logarithmic forms: , and their derivatives produce the expressions and that appear in integration.
Applications: catenary curves (hanging chains/cables obey ), special relativity (rapidity), and certain differential equations.
Differentiating a hyperbolic composition
- Find .
- This is a composition: outer function is with .
- Derivative of the outer: , evaluated at gives .
- Derivative of the inner: .
- By the chain rule: .
Common Mistakes to Avoid
- Forgetting the inner derivative in the chain rule. This is the single most common error. Always ask: is there a function inside another function?
- Applying the power rule to . The derivative of is , not . The power rule is for , not .
- In implicit differentiation, forgetting to multiply by whenever you differentiate a term.
- Using the quotient rule when simple algebra would work. For instance, is easier to differentiate directly.
- Confusing the derivative of with . The correct answer is by the chain rule.
- Getting signs wrong on trig derivatives: differentiates to (not ).
- Assuming differentiability implies smoothness everywhere. A function can be continuous but not differentiable at corners, cusps, or vertical tangents.