The derivative of a function measures its instantaneous rate of change at any point. Geometrically, is the slope of the tangent line to at .
Derivatives power every optimization problem, every physics equation involving rates, and every approximation method in applied math. If you understand derivatives deeply, the rest of calculus becomes much easier.
The formal definition is . This limit captures the idea of zooming in on a curve until it looks like a straight line, then measuring that line's slope.
The definition and what it means
The derivative is the limit of the difference quotient. The difference quotient is the slope of a secant line through two nearby points.
As , the secant line becomes the tangent line, and its slope becomes the derivative.
If this limit exists at , we say is differentiable at . If is differentiable at , then is also continuous at . The converse is false: is continuous at but not differentiable (it has a sharp corner).
Places where differentiability fails: sharp corners (like ), vertical tangent lines (like at ), and discontinuities.
Core differentiation rules
Power rule: for any real number . This works for negative and fractional exponents too: and .
Constant multiple rule: . Constants pass through the derivative.
Sum/difference rule: . Differentiate term by term.
Constant rule: . The derivative of any constant is zero.
Using the power rule on a polynomial
- Find .
- Apply the rules term by term: sum rule lets us differentiate each term separately, constant multiple rule pulls out the coefficients.
- First term: (power rule: bring down the , reduce exponent by ).
- Second term: .
- Third term: (since , power rule gives ).
- Fourth term: (constant rule).
- Combine: .
Derivatives of common functions
Exponentials: (the only function equal to its own derivative). More generally, .
Logarithms: . More generally, .
Trigonometric: , , .
More trig: , , .
Memorize these. They are the building blocks for every derivative you will ever compute.
Product rule
When two functions are multiplied: .
Think of it as: "derivative of the first times the second, plus the first times the derivative of the second."
Example: .
Tip: always check if you can simplify the expression first. Sometimes you don't need the product rule at all.
Product rule with exponential and polynomial
- Find .
- Identify the two functions: and .
- Find each derivative: and .
- Apply the product rule: .
- Factor out common terms: .
- Final answer: .
Quotient rule
For a ratio of functions: .
Memory aid: "low d-high minus high d-low, over the square of what's below."
Example: .
Alternative: you can often rewrite as and use the product rule instead. Use whichever feels easier.
Chain rule
The chain rule handles compositions: if , then .
In words: derivative of the outer function (evaluated at the inner function) times the derivative of the inner function.
Example: . The outer function is , the inner function is .
Nested chains: for , apply the chain rule twice: .
The chain rule is arguably the most important rule. It appears everywhere: in implicit differentiation, related rates, and integration by substitution.
💡Explain it simply
Imagine a machine with two gears connected together. The big gear turns the small gear. If you want to know how fast the final output changes, you need to know two things: how fast the big gear turns the small gear, and how fast the small gear turns on its own.
That's the chain rule. If you have a function inside another function (like of ), the rate of change of the whole thing is: how fast the outer part changes × how fast the inner part changes.
A real-world example: temperature affects ice cream sales, and ice cream sales affect your revenue. If temperature goes up by 1°, sales go up by 50 cones, and each cone gives you \50 \times 3 = 150
Chain rule with a nested composition
- Find .
- Identify the outer and inner functions. Outer: where . Inner: .
- Derivative of the outer function: , evaluated at gives .
- Derivative of the inner function: .
- Multiply them (chain rule): .
- Final answer: .
Double chain rule
- Find .
- There are three nested layers: , then , then . We peel them off one at a time.
- Outermost layer: , evaluated at gives .
- Middle layer: , evaluated at gives .
- Innermost layer: .
- Multiply all three together: .
- Final answer: .
Inverse trigonometric derivatives
, valid for .
.
. This one appears frequently in integration.
With the chain rule: .
Implicit differentiation
When is defined implicitly by an equation like , differentiate both sides with respect to .
Every time you differentiate a term, attach a factor (this is the chain rule in action: is a function of ).
Example: differentiate to get , then solve .
Implicit differentiation is essential for curves that can't be written as , such as circles, ellipses, and other relations.
💡Explain it simply
Usually, you have a nice equation like where is alone on one side. But sometimes and are tangled together, like in (a circle). You can't easily solve for .
Implicit differentiation says: just differentiate everything with respect to anyway. Whenever you hit a , remember that secretly depends on , so slap on a (that's just the chain rule). Then solve for .
It's like a detective figuring out how fast a hidden variable changes by looking at the equation it's trapped in.
Implicit differentiation of a circle
- Find for .
- Differentiate both sides with respect to . Remember: is a function of , so we must use the chain rule on terms.
- Left side: .
- The came from the chain rule: .
- Right side: .
- So: .
- Solve for : , therefore .
- At the point on the circle: . This makes geometric sense — the tangent line at on a circle centered at the origin should slope downward to the right.
Logarithmic differentiation
For complicated products, quotients, or expressions like , take the natural log of both sides first.
Example: find . Let , then .
Differentiate implicitly: .
Solve: .
This technique turns products into sums and powers into products, making differentiation much simpler.
Higher-order derivatives
The second derivative is the derivative of . It measures how the rate of change itself is changing.
Notation: , , or . For higher orders: , , etc.
Physical meaning: if is position, then is velocity and is acceleration.
The second derivative tells you about concavity: means concave up (like a bowl), means concave down (like an arch).
Higher-order derivatives appear in Taylor series: you need to build the polynomial approximation.
Tangent lines and linearization
The tangent line to at is: . This is the best linear approximation near .
Linearization: for close to . This is incredibly useful for quick estimates.
Example: approximate . Use at : , .
Linearization gives . The actual value is , very close!
Differentials: gives the approximate change in for a small change in .
Interpreting the derivative graphically
on an interval means is increasing there. means is decreasing.
Where , the tangent line is horizontal. These are critical points: potential maxima or minima.
The sign of changing from to at a critical point signals a local maximum. From to signals a local minimum.
If does not change sign, the critical point is neither a max nor a min (e.g., at ).
The graph of tells you the steepness and direction of . Where crosses zero, has a horizontal tangent.
Common Mistakes to Avoid
- Forgetting the inner derivative in the chain rule. This is the single most common error. Always ask: is there a function inside another function?
- Applying the power rule to . The derivative of is , not . The power rule is for , not .
- In implicit differentiation, forgetting to multiply by whenever you differentiate a term.
- Using the quotient rule when simple algebra would work. For instance, is easier to differentiate directly.
- Confusing the derivative of with . The correct answer is by the chain rule.
- Getting signs wrong on trig derivatives: differentiates to (not ).
- Assuming differentiability implies smoothness everywhere. A function can be continuous but not differentiable at corners, cusps, or vertical tangents.