Diploma Program MA1131 Mathematics 1A CALCULUS NOTES CRICOS Provider No: 00098G c©2020 UNSW Sydney iii Preface Please read carefully. These Notes form the basis for the calculus strand of MA1131. However, not all of the material in these Notes is included in the MA1131 calculus syllabus. In particular, any material marked [X] is non-assessable. A detailed syllabus is given, commencing on page (ix) of these Notes. In using these Notes, you should remember the following points: 1. Most courses at university present new material at a faster pace than you will have been accustomed to in high school, so it is essential that you start working right from the beginning of the session and continue to work steadily throughout the session. Make every effort to keep up with the lectures and to do problems relevant to the current lectures. 2. These Notes are not intended to be a substitute for attending lectures or tutorials. The lectures will expand on the material in the notes and help you to understand it. 3. These Notes may seem to contain a lot of material but not all of this material is equally important. One aim of the lectures will be to give you a clearer idea of the relative importance of the topics covered in the Notes. 4. Use the tutorials for the purpose for which they are intended, that is, to ask questions about both the theory and the problems being covered in the current lectures. 5. Some of the material in these Notes is more difficult than the rest. This extra material is marked with the symbol [H]. 6. Some of the problems are marked [V]. These have a video solution available from Moodle. 7. It is essential for you to do problems which are given at the end of each chapter. If you find that you do not have time to attempt all of the problems, you should at least attempt a representative selection of them. The problems set in tests and exams will be similar to the problems given in these notes. Further information on the problems and class tests is on page (xiii). 8. You will be expected to use the computer algebra package Maple in tests and understand Maple syntax and output for the end of semester examination. Note. We gratefully acknowledge the contributions of the School of Mathematics and Statistics towards the creation of this resource. Copyright is vested in The University of New South Wales, c©2020. c©2020 School of Mathematics and Statistics, UNSW Sydney iv c©2020 School of Mathematics and Statistics, UNSW Sydney vContents Preface iii Calculus . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Syllabus for MA1131 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ix Homework schedule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xi Tutorial schedule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xii Test schedule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . xiii Revision questions 1 1 Sets, inequalities and functions 11 1.1 Sets of numbers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12 1.2 Solving inequalities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14 1.3 Absolute values . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 17 1.4 Functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 19 1.5 Polynomials and rational functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23 1.6 The trigonometric functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25 1.7 The elementary functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 1.8 Implicitly defined functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 1.9 Continuous functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32 1.10 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33 Problems for Chapter 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35 2 Limits 39 2.1 Limits of functions at infinity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 2.1.1 Basic rules for limits . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 39 2.1.2 The pinching theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 41 2.1.3 Limits of the form f(x)/g(x) . . . . . . . . . . . . . . . . . . . . . . . . . . . 42 2.1.4 Limits of the form √ f(x)− √ g(x) . . . . . . . . . . . . . . . . . . . . . . . . 42 2.1.5 Indeterminate forms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43 2.2 The definition of lim x→∞ f(x) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44 2.3 Proving that lim x→∞ f(x) = L using the limit definition . . . . . . . . . . . . . . . . . . 47 2.4 Proofs of basic limit results [X] . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 49 2.5 Limits of functions at a point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51 2.5.1 Left-hand, right-hand and two-sided limits . . . . . . . . . . . . . . . . . . . . 51 c©2020 School of Mathematics and Statistics, UNSW Sydney vi 2.5.2 Limits and continuous functions . . . . . . . . . . . . . . . . . . . . . . . . . 53 2.5.3 Rules for limits at a point . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55 2.6 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 59 Problems for Chapter 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 60 3 Properties of continuous functions 63 3.1 Combining continuous functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 63 3.2 Continuity on intervals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 65 3.3 The intermediate value theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 66 3.4 The maximum-minimum theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 68 Problems for Chapter 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 72 4 Differentiable functions 75 4.1 Gradients of tangents and derivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . 77 4.2 Rules for differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 81 4.3 Proofs of differentiation rules . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 83 4.4 Implicit differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84 4.5 Differentiation, continuity and split functions . . . . . . . . . . . . . . . . . . . . . . 86 4.6 Derivatives and function approximation . . . . . . . . . . . . . . . . . . . . . . . . . 89 4.7 Derivatives and rates of change . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 90 4.8 Local maximum, local minimum and stationary points . . . . . . . . . . . . . . . . . 91 4.9 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 93 Problems for Chapter 4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94 5 The mean value theorem and applications 97 5.1 The mean value theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 97 5.2 Proof of the mean value theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99 5.3 Proving inequalities using the mean value theorem . . . . . . . . . . . . . . . . . . . 100 5.4 Error bounds . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 102 5.5 The sign of a derivative . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 103 5.6 The second derivative and applications . . . . . . . . . . . . . . . . . . . . . . . . . . 104 5.7 Critical points, maxima and minima . . . . . . . . . . . . . . . . . . . . . . . . . . . 107 5.8 Counting zeros . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 110 5.9 Antiderivatives . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 113 5.10 L’Hoˆpital’s rule . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 115 Problems for Chapter 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 118 6 Inverse functions 123 6.1 Some preliminary examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123 6.2 One-to-one functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 125 6.3 Inverse functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 127 6.4 The inverse function theorem . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 130 6.5 Applications to the trigonometric functions . . . . . . . . . . . . . . . . . . . . . . . 132 6.6 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 137 c©2020 School of Mathematics and Statistics, UNSW Sydney vii Problems for Chapter 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 139 7 Curve sketching 143 7.1 Curves defined by a Cartesian equation . . . . . . . . . . . . . . . . . . . . . . . . . 143 7.1.1 A checklist for sketching curves . . . . . . . . . . . . . . . . . . . . . . . . . . 144 7.1.2 Oblique asymptotes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 144 7.1.3 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 147 7.2 Parametrically defined curves . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 150 7.2.1 Parametrisation of conic sections . . . . . . . . . . . . . . . . . . . . . . . . . 151 7.2.2 Calculus and parametric curves . . . . . . . . . . . . . . . . . . . . . . . . . . 154 7.2.3 The cycloid and curve of fastest descent . . . . . . . . . . . . . . . . . . . . . 155 7.3 Curves defined by polar coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . 156 7.3.1 Polar coordinates . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 156 7.3.2 Basic sketches of polar curves . . . . . . . . . . . . . . . . . . . . . . . . . . . 158 7.3.3 Sketching polar curves using calculus . . . . . . . . . . . . . . . . . . . . . . . 161 7.4 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 164 Problems for Chapter 7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 166 8 Integration 171 8.1 Area and the Riemann Integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 171 8.1.1 Area of regions with curved boundaries . . . . . . . . . . . . . . . . . . . . . 171 8.1.2 Approximations of area using Riemann sums . . . . . . . . . . . . . . . . . . 173 8.1.3 The definition of area under the graph of a function and the Riemann integral178 8.2 Integration using Riemann sums . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 180 8.3 The Riemann integral and signed area . . . . . . . . . . . . . . . . . . . . . . . . . . 183 8.4 Basic properties of the Riemann integral . . . . . . . . . . . . . . . . . . . . . . . . . 184 8.5 The first fundamental theorem of calculus . . . . . . . . . . . . . . . . . . . . . . . . 186 8.6 The second fundamental theorem of calculus . . . . . . . . . . . . . . . . . . . . . . 190 8.7 Indefinite integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 194 8.8 Integration by substitution . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 196 8.9 Integration by parts . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 200 8.10 Improper integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 202 8.11 Comparison tests for improper integrals . . . . . . . . . . . . . . . . . . . . . . . . . 205 8.12 Functions defined by an integral . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 212 8.13 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 215 Problems for Chapter 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 216 9 The logarithmic and exponential functions 223 9.1 Powers and logarithms . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 224 9.2 The natural logarithm function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 225 9.3 The exponential function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 228 9.4 Exponentials and logarithms with other bases . . . . . . . . . . . . . . . . . . . . . . 230 9.5 Integration and the ln function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 231 9.6 Logarithmic differentiation . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233 9.7 Indeterminate forms with powers . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 233 c©2020 School of Mathematics and Statistics, UNSW Sydney viii Problems for Chapter 9 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 235 10 The hyperbolic functions 239 10.1 Hyperbolic sine and cosine functions . . . . . . . . . . . . . . . . . . . . . . . . . . . 240 10.2 Other hyperbolic functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 243 10.3 Hyperbolic identities . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 245 10.4 Hyperbolic derivatives and integrals . . . . . . . . . . . . . . . . . . . . . . . . . . . 246 10.5 The inverse hyperbolic functions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 248 10.6 Integration leading to the inverse hyperbolic functions . . . . . . . . . . . . . . . . . 251 10.7 A summary of important hyperbolic formulae . . . . . . . . . . . . . . . . . . . . . . 252 10.8 Appendix: The catenary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 253 10.9 Maple notes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 256 Problems for Chapter 10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 258 Answers to selected problems 261 Chapter 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 261 Chapter 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262 Chapter 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 262 Chapter 4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 263 Chapter 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 263 Chapter 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 264 Chapter 7 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 265 Chapter 8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 266 Chapter 9 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 268 Chapter 10 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 268 Past Class Tests 269 Index 283 c©2020 School of Mathematics and Statistics, UNSW Sydney ix CALCULUS SYLLABUS FOR MA1131 MATHEMATICS 1A The Calculus textbook is S.L. Salas & E. Hille and G.J. Etgen Calculus - One and Several Vari- ables, any recent edition, Wiley. References to the 10th and 9th editions are shown as SH10 and SH9. To improve your understanding of definitions, theorems and proofs, the following book is recommended: Introduction to Proofs in Mathematics, J. Franklin & A. Daoud, Prentice-Hall. In this syllabus the references to the textbook are not intended as a definition of what you will be expected to know. They are just a guide to finding relevant material. Some parts of the course are not covered in the textbook and some parts of the textbook (even in the sections mentioned in the references below) are not included in the course. The scope of the course is defined by the content of the lectures and problem sheets. The approximate lecture time for each section is given below. References to the 9th and 10th editions of Salas & Hille are shown as SH9 and SH10. SH10 SH9 1. Sets, inequalities and functions. (2 hours) N,Z,Q,R. Open and closed intervals. Inequalities. 1.2, 1.3 1.2, 1.3 Functions: sums, products, quotients, composites. Polynomials, rational functions, trig functions as examples of continuous functions. Implicitly defined functions. 1.6-1.7 1.6-1.7 2. Limits. (2 hours) Formal definition of limit as x→ a (a finite). 2.1, 2.2 2.1, 2.2 Formal definition of limit as x→∞. pp177-178 pp222-224 pp195-198 pp243-246 Limit rules. The pinching theorem. 2.3, 2.5 2.3, 2.5 3. Properties of continuous functions. (2 hours) Combinations of continuous functions. 2.4 2.4 Intermediate value and min-max theorems. 2.6, B1, B2 2.6, B1, B2 Relative and absolute maxima and minima. 4.3-4.5 4.3-4.5 4. Differentiable functions. (2 hours) Definition of derivative via tangents. 3.1 3.1 Derivatives of sums, products, quotients and composites. Rates of change. Higher derivatives. 3.2-3.5 3.2-3.5 Derivatives of polynomial, rational and trig functions. 3.5,3.6 3.5,3.6 Implicit differentiation, fractional powers. 3.7 3.7 5. The mean value theorem and applications. (2 hours) Mean value theorem and applications. 4.1, 4.2 4.1, 4.2 L’Hoˆpital’s rule. 11.5, 11.6, 10.5, 10.6 c©2020 School of Mathematics and Statistics, UNSW Sydney xSH10 SH9 6. Inverse functions. (2 hours) Domain, range, inverse functions, the inverse function theorem. 7.1, B3 7.1, B3 Inverse trig functions, their derivatives and graphs. 7.7 7.7 7. Curve sketching. (3 hours) Use of domain, range, intercepts, asymptotes, even or odd, calculus. 4.7, 4.8 4.7, 4.8 Parametrically defined curves. Relation between polar and Cartesian coordinates. 10.2 9.3 Sketching curves in polar coordinates. 10.3 9.4 8. Integration. (5 hours) Riemann sums, the definite integral and its algebraic properties. 5.1, B5 5.1, B5 Indefinite integrals, primitives and the two fundamental theorems of calculus. 5.2-5.5 5.2-5.5 Integration by substitution and by parts. 5.6, 8.2 5.6, 8.2 Integrals on unbounded domains, limit form of comparison test. 11.7 10.7 9. Logarithms and exponentials. (2 hours) ln as primitive of 1/x, basic properties, logarithmic differentiation. 7.2, 7.3 7.2, 7.3 Exponential function as inverse of ln, basic properties. ax, logs to other bases. 7.4-7.6 7.4-7.6 10. Hyperbolic functions (2 hours) Definitions, identities, derivatives, integrals and graphs. 7.8 7.8 Inverse hyperbolic functions. 7.9 7.9 Integrals involving hyperbolic or trig substitution. Review. (2 hour) c©2020 School of Mathematics and Statistics, UNSW Sydney xi CALCULUS PROBLEM SETS The Calculus problems are located at the end of each chapter of the Calculus Notes booklet. They are also available from the course module on the UNSW Moodle server. Some of the problems are very easy, some are less easy but still routine and some are quite hard. To help you decide which problems to try first, each problem is marked with an [R] or an [H]. The problems marked [R] form a basic set of problems which you should try first. Problems marked [H] are harder and can be left until you have done the problems marked [R]. Problems marked [V] have a video solution available on Moodle. You do need to make an attempt at the [H] problems because problems of this type will occur on tests and in the exam. If you have difficulty with the [H] problems, ask for help in your tutorial. Remember that working through a wide range of problems is the key to success in mathematics. WEEKLY CALCULUS SCHEDULE Solving problems and writing mathematics clearly are two separate skills that need to be devel- oped through practice. We recommend that you keep a workbook to practice writing solutions to mathematical problems. The following table gives the range of questions suitable for each week. In addition it suggests specific recommended problems to do before your classroom tutorials. The Online Tutorials will develop your problem solving skills, and give you examples of mathemat- ical writing. Because this overlaps with the skills developed through homework, there are fewer recommended homework in Online Tutorial weeks. WEEKLY CALCULUS HOMEWORK SCHEDULE Week Try to do up to Recommended Homework Chapter Problem Problems 1 1. Sets, inequalities. . . 19 4(e), 5(d), 10(h), 12, 13(e), 15, 17 2 2. Limits 13 1(e), 2(b), 3(b), 5, 10(b) 3 3. Properties of continuous. . . 9 3, 5, 8(a), 8(c), 9(a), 9(b) 4 4. Differentiable functions 16 2(d), 6(d), 7(b), 10(a), 15 5 5. The MVT and applications 24 1(b), 4(b), 7(a), 9(b), 13, 17(c), 18(d) 6 6. Inverse functions 17 1, 5, 8(b), 8(d), 8(f), 11(b) 8 7. Curve sketching 11 2(b), 7(c), 8(b) 9 7. Curve sketching 19 14(c), 16(b) 8. Integration 2 1 10 8. Integration 15 4(b), 11(b), 14(d), 15(a) 11 8. Integration 27 16(b), 17(d), 20(b), 22(a) 12 9. Logs and exponentials 10 3(b), 4(e), 5(a), 8(c), 9(e), 9(h) 12 10. Hyperbolic functions 12 2(b), 3(a), 7(c), 8 c©2020 School of Mathematics and Statistics, UNSW Sydney xii The main reason for having tutorials is to give you a chance to tackle and discuss problems which you find difficult or don’t fully understand. There are two kinds of tutorials: Online and Classroom. Calculus Online Tutorials are delivered using MapleTA. These are optional and can be completed from home. Calculus Classroom Tutorials are delivered in a classroom by a calculus tutor. The topics covered in a classroom tutorial are flexible, and you can (and should) ask your tutor to cover any topics you find difficult. You may also be asked to present solutions to homework questions to the rest of the class. c©2020 School of Mathematics and Statistics, UNSW Sydney xiii CALCULUS CLASS TESTS The tests will take place in class test times. The following table shows the week in which each test will be held and the topics covered. Topics covered Test Week chapter sections 1 6 1, 2, 3, 4 & 5 All 2 11 6 , 7 & 8 All It is important to note that: • The class tests do not cover the whole syllabus. • Questions in the exams may be very different from those in the class tests. c©2020 School of Mathematics and Statistics, UNSW Sydney xiv c©2020 School of Mathematics and Statistics, UNSW Sydney 1Revision questions Inequalities and Absolute Values 1. Sketch the set of points (x, y) which satisfy the following relations. a) 0 ≤ y ≤ 2x and 0 ≤ x ≤ 2 b) y/2 ≤ x ≤ 2 and 0 ≤ y ≤ 4 2. Solve a) x(x− 1) > 0 b) (x− 1)(x− 2) < 0 c) 2x2 + 3x− 2 ≥ 0 d) 1 x > −1 2 e) 1 1− x > 1 2 f) 4 x− 1 ≤ 1 3. Solve a) ∣∣x+ 1∣∣< 3 b) ∣∣x+ 2∣∣> 3 c) ∣∣3x+ 2∣∣≤ 1 d) ∣∣∣∣x− 1x+ 1 ∣∣∣∣ < 1 Trigonometry 4. Find the exact value of each of the following: a) cos ( π 12 ) b) sin ( 5π 12 ) c) tan ( 7π 12 ) d) sec ( 11π 12 ) 5. If A and B are acute with sin(A) = 3 5 and tan(B) = 12 5 find (without the use of a calculator): a) cos(A) b) tan(A) c) sin(B) d) cos(B) e) sin(A+B) f) cos(A−B) g) sin(2A) h) tan(2B) 6. If A and B are acute with sin(A) = 24 25 and cos(B) = 8 17 find (without finding A and B): a) cos(2A) b) sin(A−B) c) tan(A+B) 7. Find the period and amplitude for each of the following functions. a) y = 3 sin ( 2x− π 4 ) b) y = −2 cos (x 3 + π 2 ) c©2020 School of Mathematics and Statistics, UNSW Sydney 2 REVISION QUESTIONS 8. Express each of the following in terms of a single sine function in the form R sin(x±α), where R > 0 and α is acute. a) sin(x) + cos(x) b) 2 sin(x) + 2 √ 3 cos(x) c) √ 3 sin(x)− cos(x) d) √8 sin(x)−√8 cos(x) Functions 9. What is the (maximal) domain and range of the following functions? a) f(x) = √ 5− x2 b) f(x) = √x2 − 5 c) f(x) = √ x− 1 d) f(x) = 1√ x− 1 e) f(x) = (x− 8)−1/3 f) f(x) = √sinx g) f(x) = 1 + tan2 x h) f(x) = cos x if x < 0√ 1− x if 0 ≤ x ≤ 1 |x| if x > 1 10. Sketch the graph of each of the functions in Problem 9. 11. Sketch each of the following functions without using calculus. a) An odd function, f(x), defined on [−2, 2] such that f(x) = x2(1− x) when 0 ≤ x ≤ 2. b) An even function, f(x), defined on [−3, 3] such that f(x) = (x− 1)2(x− 2) when 0 ≤ x ≤ 3. 12. If f(x) = x+ 5 and g(x) = x2 − 3 find a) g(f(0)) b) g(f(x)) c) f(g(2)) d) f(g(x)) 13. If f(x) = x− 1 and g(x) = 1√ x− 1, give the explicit forms of a) f(x) + g(x) b) f(x)g(x) c) f(x) g(x) d) f(g(x)) e) g(f(x)) Limits of some Rational Functions 14. Find a) lim x→2 x− 2 x2 − 5x+ 6 b) limx→2 x2 − 5x+ 6 2x2 − 3x− 2 c) limλ→1 λ2 − 0.8λ− 0.2 λ− 1 d) lim x→1 1− x4 1− x e) limx→∞ 2x2 − 3x+ 7 3x2 + x− 1 f) limx→∞ 2x3 + 3x+ 2 −5x3 + 4x− 1 c©2020 School of Mathematics and Statistics, UNSW Sydney REVISION QUESTIONS 3 Simple Differentiation 15. Find the derivative of each of the following functions. a) f(x) = (2x+ 5)3 b) g(t) = √ t2 − 4 c) h(x) = 1 (2x+ 3)3/2 d) f(x) = sin3 x e) g(x) = cos(x3) f) h(x) = sec(2x2 + 3) g) f(x) = e−x2/2 h) g(x) = x2(2x− 1)4 i) h(θ) = θ tan θ j) f(x) = x cos 2x k) g(x) = x3 sinx l) h(x) = x lnx m) f(x) = x+ e x+ π n) g(x) = 2x2 + 3 3x− 2 o) h(t) = t√ t2 − 4 p) f(x) = sinx 2x+ 5 Tangents and Normals 16. Find the equation of the tangent and the equation of the normal to each of the following curves. a) y = 4x+ 1 x at the point (1, 5) b) y = x3 − 1 + 1 x2 at the point (1, 1) c) y = cos x 1− sinx at the point where x = π 6 Stationary Points 17. Locate and identify the stationary points for a) y = 2x3 − 9x2 + 12x− 3 b) y = x 1 + x2 c) y = e2x(1− x) d) y = xe−x e) y = xne−x for n ∈ Z, n ≥ 2 f) y = lnx x g) y = 4x3 − x4 h) y = x+ cos x 18. The slope of the curve y = f(x) is given by dy dx = x2(2x− 1)(x− 1) Determine the nature of the stationary points. 19. The slope of the curve y = f(x) is dy dx = 3(x− 1)2(x− 2)3(x− 3)4(x− 4) c©2020 School of Mathematics and Statistics, UNSW Sydney 4 REVISION QUESTIONS For what value or values of x does y have a) a local maximum? b) a local minimum? Integration 20. a) Use your answer to 15(i) to find a primitive function of θ sec2 θ. Hint: From tables ∫ tan θdθ = ln | sec θ|+ C. b) Use your answer to 15(j) to find a primitive function of x sin 2x. c) Use your answer to 15(l) to find a primitive function of lnx. 21. The curve y = f(x) has dy dx = 3x2 − 2x+ 1 and passes through the point (2, 3). Find f(x). 22. Find y where a) dy dx = x2 + 1 x2 for x 6= 0 b) dy dx = x+ 1√ x for x > 0 23. Without recourse to tables find a) ∫ ex dx b) ∫ 1 0 e3x dx c) ∫ π 0 sin(2x) dx d) ∫ cos(3x) dx e) ∫ (2x3 + 3x2 + 4x+ 5)dx f) ∫ 1 3x+ 1 dx g) ∫ −1 −2 1 2x− 3 dx h) ∫ (2x− 3)5 dx For all the above indefinite integrals, check your answers by differentiating. Integration by Substitution 24. Evaluate each of the following indefinite integrals by using the suggested substitution: c©2020 School of Mathematics and Statistics, UNSW Sydney REVISION QUESTIONS 5 a) ∫ x2 ( x3 + 1 )5 dx; u = x3 + 1 b) ∫ (t− 1) √ t2 − 2t+ 4 dt; u = t2 − 2t+ 4 c) ∫ (x+ 1) ex 2+2x+3 dx; u = x2 + 2x+ 3 d) ∫ x sin ( x2 + 1 ) dx; u = x2 + 1 e) ∫ esin 2x cos 2x dx; u = sin 2x f) ∫ e2x cos ( e2x ) dx; u = e2x g) ∫ dz z ln z ; u = ln z h) ∫ x+ 1 x2 + 2x− 1 dx; u = x 2 + 2x− 1 i) ∫ ex 1 + ex dx; u = 1 + ex j) ∫ x+ 1 (x2 + 2x− 1)5 dx; u = x 2 + 2x− 1 k) ∫ sin(lnx) dx x ; u = lnx 25. Evaluate each of the following definite integrals by using the suggested substitution: a) ∫ 4 0 xex 2+1 dx; u = x2 + 1 b) ∫ π/4 π/6 sec2 x tan x dx; u = tanx c) ∫ 1 0 3x (3x+ 1)2 dx; u = 3x+ 1 d) ∫ 20 5 t√ t− 4 dt; u = t− 4 Area and Volume 26. For each of the following functions, find the area between the curve y = f(x) and the x-axis over the given range of x values. a) f(x) = 2x2 − 1 from x = 1 to 2 b) f(x) = x3 − 3x2 + 4x from x = 0 to 2 c) f(x) = 2x2 + 1 x2 from x = 1 to 2 d) f(x) = e−x/3 from x = 0 to 3 e) f(x) = 2 cos x+ 3 from x = 0 to π f) f(x) = 1 x+ 1 from x = 0 to 2 27. For each of the following functions, find the volume of the solid formed when the curve y = f(x) over the given range of x is rotated about the x-axis. a) f(x) = x2 + 1 from x = 0 to 1 b) f(x) = x+ 2 x from x = 1 to 2 c) f(x) = e−x/4 from x = 0 to 2 d) f(x) = sec x from x = 0 to π 4 e) f(x) = 1 x+ 1 from x = 0 to 1 Logarithms 28. Simplify: c©2020 School of Mathematics and Statistics, UNSW Sydney 6 REVISION QUESTIONS a) log4 12− log4 3 b) log2 16 log2 8 c) log1/3 729 29. Solve for x: a) 22x+1 − (17)2x + 8 = 0 b) lnx = 3 ln 2 + 2 ln 3 c) logx 125 = −3 Remainder Theorem 30. Without division find the remainder when p(x) = x3 − 5x2 + 10x− 6 is divided by a) x− 2 b) x− 1 c) x+ 2 d) x+ 1 which (if any) of these is a factor of p(x)? Binomial Theorem 31. Use Pascal’s triangle to expand the following: a) (x+ y)5 b) (3x− 2y)4 c) (2x+ 3)6 32. Use the Binomial Theorem to find the following. a) The coefficient of x12 in the expansion of (2x3 − 3)7. b) The coefficient of x3 in the expansion of ( x2 − 2 x )3 . c) The term independent of x in the expansion of ( 2x2 + 1 x )9 . c©2020 School of Mathematics and Statistics, UNSW Sydney REVISION QUESTIONS 7 Answers for Revision Questions 1. Answer for both: the interior and boundary of the triangle with vertices at (0, 0), (2, 0), and (2, 4). 2. a) x < 0 or x > 1 b) 1 < x < 2 c) x ≤ −2 or x ≥ 1 2 d) x < −2 or x > 0 e) −1 < x < 1 f) x < 1 or x ≥ 5 3. a) −4 < x < 2 b) x < −5 or x > 1 c) −1 ≤ x ≤ −1 3 d) x > 0 4. a) 14 √ 2 ( 1 + √ 3 ) b) 14 √ 2 ( 1 + √ 3 ) c) − (2 +√3) d) −√2 (√3− 1) 5. a) 4 5 b) 3 4 c) 12 13 d) 5 13 e) 63 65 f) 56 65 g) 6 13 h) −120 119 6. a) −527 625 b) 87 425 c) −297 304 7. a) amplitude = 3, period = π b) amplitude = 2, period = 6π 8. a) √ 2 sin ( x+ π 4 ) b) 4 sin ( x+ π 3 ) c) 2 sin ( x− π 6 ) d) 4 sin ( x− π 4 ) 9. a) −√5 ≤ x ≤ √5; 0 ≤ y ≤ √5 b) x ≤ −√5 or x ≥ √5; y ≥ 0 c) x ≥ 1; y ≥ 0 d) x > 1; y > 0 e) x 6= 8; y 6= 0 f) {x : 2nπ ≤ x ≤ (2n + 1)π, n ∈ Z}; 0 ≤ y ≤ 1 g) {x : x 6= (2n + 1)π/2, n ∈ Z}; y ≥ 1 h) R; y ≥ −1 12. a) 22 b) x2 + 10x+ 22 c) 6 d) x2 + 2 13. a) x− 1 + 1√ x− 1 b) √ x− 1 c) (x− 1)3/2 d) 1√ x− 1 − 1 e) 1√ x− 2 14. a) −1 b) −1 5 c) 1.2 d) 4 e) 2 3 f) −2 5 15. a) 6(2x+ 5)2 b) t√ t2 − 4 c) − 3 (2x+ 3)5/2 d) 3 sin2 x cos x e) −3x2 sin(x3) f) 4x sec(2x2 + 3) tan(2x2 + 3) g) −xe−x2/2 h) 2x(6x− 1)(2x − 1)3 i) θ sec2 θ + tan θ j) −2x sin 2x+ cos 2x k) x2(x cos x+ 3 sin x) l) 1 + lnx m) π − e (x+ π)2 n) 6x2 − 8x− 9 (3x− 2)2 c©2020 School of Mathematics and Statistics, UNSW Sydney 8 REVISION QUESTIONS o) − 4 (t2 − 4)3/2 p) (2x+ 5) cos x− 2 sinx (2x+ 5)2 16. a) y = 3x+ 2, x+ 3y = 16 b) y = x, x+ y = 2 c) y −√3 = 2 ( x− π 6 ) , y −√3 = −1 2 ( x− π 6 ) 17. a) (1, 2) is a local maximum and (2, 1) is a local minimum b) (1, 12 ) is a local maximum and (−1,−12 ) is a local minimum c) (12 , e 2) is a local maximum d) (1, e−1) is a local maximum e) ( n, nn en ) is a local maximum and (0, 0) is a local minimum if n is even and a point of inflection if n is odd f) (e, e−1) is a local maximum g) (3, 27) is a local maximum and (0, 0) is a point of inflection h) (π2 + 2kπ, π 2 + 2kπ) k ∈ Z are points of inflection 18. There is a point of inflection for x = 0, a local maximum for x = 1 2 , and a local minimum for x = 1 19. a) x = 2 b) x = 4 20. a) θ tan θ − ln | sec θ| b) −1 2 x cos 2x+ 1 4 sin 2x c) x lnx− x 21. f(x) = x3 − x2 + x− 3 22. a) y = x− 1 x + C b) y = 2 3 x3/2 + 2 √ x+ C 23. a) ex+C b) 1 3 (e3−1) c) 0 d) 1 3 sin(3x)+C e) 1 2 x4+x3+2x2+5x+C f) 1 3 ln |3x+ 1|+C g) 1 2 ln ( 5 7 ) h) 1 12 (2x− 3)6 + C 24. a) 1 18 ( x3 + 1 )6 + C b) 1 3 ( t2 − 2t+ 4) 32 + C c) 1 2 ex 2+2x+3 + C d) −1 2 cos ( x2 + 1 ) + C e) 1 2 esin 2x + C f) 1 2 sin ( e2x ) + C g) ln | ln z|+ C h) 1 2 ln ∣∣x2 + 2x− 1∣∣+ C i) ln (1 + ex) + C j) − 1 8 (x2 + 2x− 1)4 + C k) − cos(lnx) + C 25. a) 1 2 ( e17 − e) b) 1 2 ln 3 c) 2 3 ln 2− 1 4 d) 66 c©2020 School of Mathematics and Statistics, UNSW Sydney REVISION QUESTIONS 9 26. a) 11 3 b) 4 c) 31 6 d) 3− 3 e e) 3π f) ln 3 27. a) 28π 15 b) 25π 3 c) 2π ( 1− 1 e ) d) π e) π 2 28. a) 1 b) 4 3 c) −6 29. a) −1, 3 b) 72 c) 1 5 30. a) 2 b) 0 c) −54 d) −22; x− 1 is a factor. 31. a) x5 + 5x4y + 10x3y2 + 10x2y3 + 5xy4 + y5 b) 81x4 − 216x3y + 216x2y2 − 96xy3 + 16y4 c) 64x6 + 576x5 + 2160x4 + 4320x3 + 4860x2 + 2916x + 729 32. a) −15120 b) −6 c) 672 c©2020 School of Mathematics and Statistics, UNSW Sydney 10 REVISION QUESTIONS c©2020 School of Mathematics and Statistics, UNSW Sydney 11 Chapter 1 Sets, inequalities and functions In the days of the Roman empire, the word ‘calculus’ denoted a pebble that was used for counting and gambling. As time progressed, the word ‘calculare’ came to mean ‘to compute.’ In the second half of the seventeenth century, two mathematicians, the Englishman Isaac Newton and the German Gottfried Leibniz, independently invented methods for • calculating gradients of tangents to curves, • calculating instantaneous acceleration and velocity, • calculating the area of regions with a curved boundary, • calculating the volume of solids with curved boundary, • calculating the length of a curve, • calculating the work done by a force, and • calculating the centre of mass of a general solid. These methods were developed by combining algebra, geometry and trigonometry with the limiting process and became known as the calculus. Calculus (as it is known today) has many applications to engineering, physics, chemistry, bi- ology, geology, surveying, sociology, economics and statistics. It includes two major branches: the differential calculus (introduced in Chapter 4) and the integral calculus (introduced in Chapter 8). These two branches are related by the fundamental theorem of calculus (see Theorem 8.5.1). The underlying tool used to develop these branches is the concept of the limit (introduced in Chapter 2), which is applied to functions (introduced in Chapter 1). The main goal of this chapter is to introduce functions. Although functions will be familiar to students from high school, important points that deserve students’ attention are emphasised here. Since the functions we study in this course take real numbers as inputs and give real numbers of outputs, we begin the chapter with a review of different sets that consist of real numbers. We also devote a little time to revising inequalities and absolute values, in preparation for our discussion of limits in Chapter 2. c©2020 School of Mathematics and Statistics, UNSW Sydney 12 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS 1.1 Sets of numbers (Ref: SH10 §1.2) A set is a collection of distinct objects. The objects in a set are called the elements or members of the set. Some commonly used sets of numbers are listed below. • The set N of natural numbers is given by N = {0, 1, 2, 3, 4, . . .}. • The set Z of integers is given by Z = {. . . ,−3,−2,−1, 0, 1, 2, 3, . . .}. • The set Q of rational numbers is the collection of all numbers of the form pq , where p and q are integers and q 6= 0. • The set R of real numbers may be represented as the collection of points lying on the number line. | | | | | | | −3 −2 −1 3210 If A is a set of numbers and the number x is a member of the set A, then we write x ∈ A. If x is not a member of A then we write x /∈ A. The next example illustrates this notation. Example 1.1.1. The following statements are true: 2 ∈ N, −2 /∈ N, 3 4 /∈ Z, 3 4 ∈ Q, √ 2 ∈ R, √ 2 /∈ Q. Many other sets of numbers can be described using N, Z, Q, R and inequalities. Example 1.1.2. (a) The set {n ∈ Z : n > 0} is the set of all numbers n belonging to the integers such that n is greater than 0. In other words, it is the set of positive integers. (Note that the colon ‘:’ above is read as ‘such that’.) (b) The set {x ∈ R : −2 ≤ x < 3} is represented by the following interval on the real line. | | | | | | | −3 −2 −1 3210 c©2020 School of Mathematics and Statistics, UNSW Sydney 1.1. SETS OF NUMBERS 13 (c) The set {x ∈ R : x /∈ Q} is the set of irrational numbers. Sets that are represented by intervals of the real line occur so frequently that we introduce a special notation for them. Suppose that a and b are real numbers and that a < b. Then the intervals (a, b), [a, b], (a, b] and [a, b) are given by • (a, b) = {x ∈ R : a < x < b} a b • [a, b] = {x ∈ R : a ≤ x ≤ b} a b • [a, b) = {x ∈ R : a ≤ x < b} a b • (a, b] = {x ∈ R : a < x ≤ b}. a b In each case, the numbers a and b are called the endpoints of the interval. An interval [a, b] that includes its endpoints is called a closed interval, while an interval (a, b) that excludes its endpoints is called an open interval. The intervals [a, b) and (a, b] are neither open nor closed. Interval notation can be extended to describe rays of the real line by using the symbol ∞ for infinity. Thus we have • [a,∞) = {x ∈ R : a ≤ x} a b | • (−∞, b) = {x ∈ R : x < b} a b | • (−∞,∞) = R. We end by introducing an important set relation. c©2020 School of Mathematics and Statistics, UNSW Sydney 14 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS Definition 1.1.3. Suppose that A and B are two sets. We say that A is a subset of B if x ∈ A implies that x ∈ B. If A is a subset of B then we also say that B contains the set A. The above definition is illustrated by the following examples: • {0, 1, 2, 3} is a subset of {0, 1, 2, 3, 4}, • N is a subset of Z, • [−1, 2] is a subset of (−3,∞), • R is a subset of R, and • [1, 4) is not a subset of (1, 4]. 1.2 Solving inequalities (Ref: SH10 §1.3) When solving an inequality, one must remember that if the inequality is multiplied by a negative number, the sign of the inequality reverses. Two types of inequalities deserve special attention: quadratic inequalities and inequalities with an unknown quantity in the denominator. Example 1.2.1. Solve the inequality x2 > 6− x. Solution. Rearranging the inequality gives x2 + x− 6 > 0. The values of x for which x2 + x− 6 is positive are the same as those for which the graph of y = x2 + x− 6 lies above the x-axis. To graph y = x2 + x− 6, we factorise the right-hand side to obtain y = (x− 2)(x+ 3). Hence y = 0 when x = 2 and when x = −3. We can now easily sketch its graph. c©2020 School of Mathematics and Statistics, UNSW Sydney 1.2. SOLVING INEQUALITIES 15 x y y = x2 + x− 6 −3 2 Since the graph lies above the x-axis when x < −3 and when x > 2, the solution to the inequality is x < −3 or x > 2. Example 1.2.2. Solve the inequality 1 3 < 1 x− 1 . Solution. We begin by noting that x 6= 1. To solve the inequality, we cannot simply multiply the inequality through by x − 1, since we do not know whether x− 1 is positive or negative. (Taking the reciprocal of both sides is similarly problematic.) Instead, we multiply through by the positive number (x− 1)2 to obtain 1 3 (x− 1)2 < x− 1. Multiplication by 3 and rearrangement gives (x− 1)2 − 3(x− 1) < 0 and by factorisation we obtain (x− 1)[(x− 1)− 3] = (x− 1)(x− 4) < 0. So proceding in a manner similar to the previous example, we seek to identify the values of x for which the graph of y = (x− 1)(x− 4) lies below the x-axis. c©2020 School of Mathematics and Statistics, UNSW Sydney 16 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS x y y = (x− 1)(x − 4) 1 4 Hence the solution to the inequality is given by 1 < x < 4. The next example is slightly more difficult, but uses techniques illustrated in the previous two examples. Example 1.2.3. Solve the inequality x+ 2 ≥ 4 x− 1 Solution. First observe that x 6= 1. Multiplying the inequality through by (x− 1)2 gives (x− 1)2(x+ 2) ≥ 4(x− 1). Hence (x− 1)2(x+ 2)− 4(x− 1) ≥ 0. Now (x− 1)2(x+ 2)− 4(x− 1) = (x− 1)[(x − 1)(x+ 2)− 4] = (x− 1)[x2 + x− 6] = (x− 1)(x − 2)(x+ 3). So the original inequality is equivalent to (x− 1)(x− 2)(x + 3) ≥ 0, x 6= 1. We now sketch the graph of y = (x− 1)(x− 2)(x+ 3) and see for which values of x the graph lies on or above the x-axis. c©2020 School of Mathematics and Statistics, UNSW Sydney 1.3. ABSOLUTE VALUES 17 x y y = (x− 1)(x− 2)(x+ 3) 1 2−3 Hence the solution to the inequality is −3 ≤ x < 1 or x ≥ 2. Remark 1.2.4. A variety of methods (some good and others bad) exist for solving inequalities where an unknown lies in the denominator. It is strongly recommended that students use the method shown in this section, or that shown by their first year calculus lecturer. 1.3 Absolute values (Ref: SH10 §§1.2, 1.3) The magnitude, or absolute value, of a real number x is denoted by |x| and defined by |x| = { x if x ≥ 0 −x if x < 0. (1.1) Absolute values of real numbers interact in the following way. Proposition 1.3.1. Suppose that x and y are real numbers. Then (i) | − x| = |x|, (ii) |xy| = |x||y|, (iii) ∣∣∣∣xy ∣∣∣∣ = |x||y| , provided that y 6= 0, and (iv) |x+ y| ≤ |x|+ |y|. c©2020 School of Mathematics and Statistics, UNSW Sydney 18 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS Property (iv) is known as the triangle inequality. Three other useful facts about absolute values are noted below. (F1) For every real number x, |x| = √ x2 and |x|2 = x2. (1.2) (F2) If a and x are real numbers then |x− a| is equal to the distance between x and a on the real number line. (F3) For any positive real number a, |y| < a if and only if − a < y < a and |y| > a if and only if y < −a or y > a. These facts give different methods for solving inequalities involving absolute values. Example 1.3.2. Solve the following inequalities: (a) |x− 3| ≤ 5 (b) |2x+ 5| > 8 (c) ∣∣∣∣x− 3x− 1 ∣∣∣∣ < 1. Solution. (a) An algebraic solution. Using (F3) we have |x− 3| ≤ 5 −5 ≤ x− 3 ≤ 5 −2 ≤ x ≤ 8. (a) A geometric solution. By (F2) the inequality |x− 3| ≤ 5 is equivalent to saying that ‘the distance between x and 3 is less than or equal to 5.’ The points x on the number line that have this property are shown below. | 3−2 8 55 Hence −2 ≤ x ≤ 8. (b) An algebraic solution. Using (F3) the inequality is equivalent to 2x+ 5 < −8 or 2x+ 5 > 8. Hence 2x < −13 or 2x > 3, which gives the solution x < −13 2 or x > 3 2 . c©2020 School of Mathematics and Statistics, UNSW Sydney 1.4. FUNCTIONS 19 (c) An algebraic solution. By Prop.1.3.1 (iii), we can write, for x 6= 1,∣∣∣∣x− 3x− 1 ∣∣∣∣ = |x− 3||x− 1| and since the denominator is positive, the inequality can be written as |x− 3| < |x− 1|. Squaring both (positive) sides),and using (F1), we have (x− 3)2 < (x− 1)2. Expanding and solving, we have x > 2. (c) A geometric solution. As above, the inequality is equivalent to |x− 3| < |x− 1|, x 6= 1. By (F2) the inequality is interpreted geometrically as ‘the distance from x to 3 is less than the distance from x to 1.’ In other words, x is closer to 3 than to 1. Points that have this property are shown on the numberline below. | | 1 2 3 Hence x > 2. 1.4 Functions (Ref: SH10 §1.5) A function f : A→ B is a rule which assigns to every element x belonging to a set A exactly one element f(x) belonging to a set B. The set A is called the domain of the function f and the set B is called the codomain of f . In this course, A and B are always sets of real numbers. Example 1.4.1. A function f : [0,∞)→ R is given by f(x) = √ x for all x in [0,∞). Informally, this means that f takes a number x from [0,∞) as input and gives the number √ x as output. More formally, we say that f maps an element x of [0,∞) to √x. We offer a few comments on terminology employed to describe f . • The expression f : [0,∞)→ R says that [0,∞) is the domain of f and that R is the codomain of f . We write Dom(f) = [0,∞) and Codom(f) = R. • The domain of f is the set of all inputs for the function. c©2020 School of Mathematics and Statistics, UNSW Sydney 20 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS • The codomain of f is a set that contains all the output values of the function. Since the output values are all real numbers, we say that f is a real-valued function. While all the outputs must lie in Codom(f), not every number in Codom(f) need be an output value. • The expression f(x) (read as ‘f of x’) is the value of f at the point x. That is, f(x) is the unique number in R that corresponds to the input x. To emphasise the point: f(x) is a real number, not a function. • The statement ‘f(x) = √ x for all x in [0,∞)’ can be abbreviated as f(x) = √ x ∀x ∈ [0,∞). The symbol ∀ means ‘for all.’ In high school, functions are often described by specifying a rule without specifying a domain. For example, the following sentence was taken from an HSC study guide: ‘Consider the function f(x) = x2.’ There are two problems with this sentence. First, neither f(x) nor f(x) = x2 is a function. As mentioned above, f(x) is a real number, and f(x) = x2 is an equation. (This may seem picky, but being able to distinguish between a function f and its value f(x) at the point x is crucial to understanding key concepts in both algebra and calculus.) Second, the quoted sentence does not specify the domain of the function. The next example gives two different functions that obey the same squaring rule. Example 1.4.2. Consider the functions f and g, defined by f : R→ R f(x) = x2 ∀x ∈ Dom(f) and g : [0,∞)→ R g(x) = x2 ∀x ∈ Dom(g). Even though they are both squaring functions, f and g are not the same function. For example, f can square negative numbers while g does not. On the other hand, g is invertible (this important and desirable property is discussed in Chapter 6) while f is not. Their graphs, shown below, are also noticeably different. x f(x) x g(x) If, for whatever reason, the domain of a function is not specified, but the function rule is, then the default domain, known as the maximal or natural domain, is the largest possible domain for which the rule makes sense. We give two examples. c©2020 School of Mathematics and Statistics, UNSW Sydney 1.4. FUNCTIONS 21 • If f(x) = √ x− 1 then the rule makes sense only if x− 1 ≥ 0, that is, if x ≥ 1. Therefore the maximal domain of the function f is [1,∞). • If f(x) = 2 (x− 2)(x− 8) then the rule only makes sense if x 6= 2 or x 6= 8. So the maximal domain is the set {x ∈ R : x 6= 2, x 6= 8}. Another set associated to a function is its range. Definition 1.4.3. Suppose that f : A→ B is a function. The range of f , denoted by Range(f), is defined by Range(f) = {f(x) ∈ B : x ∈ A}. In other words, the range of f is the set of all output values. Equivalently, if the rule of a function is described by the equation y = f(x), then the range of f is the set of all corresponding y-values. The range is always a subset of the codomain, but the range need not equal the codomain. The difference between range and codomain is illustrated by the following examples. Example 1.4.4. Suppose that f : R→ R is given by the rule f(x) = sinx ∀x ∈ R. Then Dom(f) = R and Codom(f) = R (this is obtained from the expression f : R → R). To determine the range, consider the graph of f is sketched below. x f(x) 1 −1 The set of all output values (equivalently, the set of all corresponding y-values) is [−1, 1]. Hence Range(f) = [−1, 1]. c©2020 School of Mathematics and Statistics, UNSW Sydney 22 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS Example 1.4.5. Suppose that f : [−1, 1]→ R is given by the rule f(x) = √ 1− x2 ∀x ∈ [−1, 1]. Then Dom(f) = [−1, 1] and Codom(f) = R (again, we are simply reading this from f : [−1, 1] → R). To determine the range, we observe that the graph of f is a semicircle. x f(x) 1 −1 1 The set of all outputs (or y-values) is [0, 1]. Hence Range(f) = [0, 1]. If f and g are two functions with the same domain, then one can combine f and g to form new functions. Definition 1.4.6. Suppose that f : A→ B and g : A→ B are real-valued functions. Then the functions f + g, f − g, f.g and f/g are defined by the rules (f + g)(x) = f(x) + g(x) ∀x ∈ A (f − g)(x) = f(x)− g(x) ∀x ∈ A (f.g)(x) = f(x)g(x) ∀x ∈ A (f/g)(x) = f(x) g(x) ∀x ∈ A, provided that g(x) 6= 0. Another way of constructing new functions is given below. Definition 1.4.7. Suppose that f : C → D and g : A→ B are functions such that Range(g) is a subset of Dom(f). Then the composition f ◦ g : A→ D is defined by the rule (f ◦ g)(x) = f(g(x)) ∀x ∈ A. Example 1.4.8. Suppose that f : R→ R and g : R→ R are given by the rules f(x) = x2 + 1 ∀x ∈ R and g(x) = x− 2 ∀x ∈ R. Find (f + g)(5) and (f ◦ g)(x). Is 2 in the domain of the function f/g? Give reasons. c©2020 School of Mathematics and Statistics, UNSW Sydney 1.5. POLYNOMIALS AND RATIONAL FUNCTIONS 23 Solution. First, (f + g)(5) = f(5) + g(5) = (52 + 1) + (5− 2) = 29. Second, if x in R then (f ◦ g)(x) = f(g(x)) = (x− 2)2 + 1 = x2 − 4x+ 5. Finally, g(2) = 0 so 2 is not in the domain of f/g. The final example illustrates the importance of bearing in mind domain and range when com- posing two functions. Example 1.4.9. Suppose that f : [0,∞)→ R and g : R→ R are given by f(x) = √ x ∀x ∈ [0,∞) and g(x) = sinx− 2 ∀x ∈ R. Find, if they exist, (f ◦ g)(x) and (g ◦ f)(x). Solution. Consider first f ◦ g. For this function to exist, we require that Range(g) is a subset of Dom(f). Now −1 ≤ sinx ≤ 1 ∀x ∈ R so −3 ≤ sinx− 2 ≤ −1 ∀x ∈ R; that is, Range(g) = [−3,−1]. Since Dom(f) = [0,∞), Range(g) is not a subset of Dom(f) and hence f ◦ g does not exist. (The point here is that, if (f ◦ g)(x) = f(g(x)) = √sinx− 2, then the expression under the square root sign is always negative.) Now consider g ◦f . Since f is a real-valued function and Dom(g) = R, it is clear that Range(f) is a subset of Dom(g). (The fact that Range(f) = [0,∞) is not needed because Dom(g) is so large.) Hence (g ◦ f)(x) = g(f(x)) = sin(√x)− 2 ∀x ∈ R. 1.5 Polynomials and rational functions (Ref: SH10 §1.6) In the next three sections we discuss some important classes of functions. Since these were intro- duced in high school, we give a brief survey only. c©2020 School of Mathematics and Statistics, UNSW Sydney 24 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS Definition 1.5.1. A function f : R→ R is called a polynomial if f(x) = anx n + an−1xn−1 + · · · + a2x2 + a1x+ a0 ∀x ∈ R, where n is a natural number, the constants a0, a1, . . . , an are real numbers and an 6= 0. We call n the degree of f , and the constants a0, a1, . . . , an are called the coefficients of f . The constant an is called the leading coefficient of f . Polynomials of small degree have specials names and properties. • If n = 0 then f(x) = a0. Hence f is a constant function and the graph of f is a horizontal straight line. • If n = 1 then f(x) = a1x+ a0. The graph of f is a straight line with gradient a1 and y-axis intercept a0. • If n = 2 then f is called a quadratic function and its graph is a parabola. • If n is 3, 4 or 5 then f is called a cubic, quartic or quintic, polynomial respectively. The general shape of the graph of a polynomial function of degree greater than 1 is indicated by the table below. Positive leading coefficient Negative leading coefficient Odd degree Even degree The number of turning points in the graph is always strictly less than the degree of the polynomial. Exactly how many turning points the graph has depends on the coefficients of the polynomial, and can be determined using calculus (see, for example, Chapters 4 and 5). By dividing two polynomials, one obtains a new type of function. c©2020 School of Mathematics and Statistics, UNSW Sydney 1.6. THE TRIGONOMETRIC FUNCTIONS 25 Definition 1.5.2. Suppose that p and q are polynomials. A function f is called a rational function if Dom(f) = {x ∈ R : q(x) 6= 0} and f(x) = p(x) q(x) , ∀x ∈ Dom(f). Hence the expressions f(x) = x x2 + 2 and f(x) = x3 − 4x2 + 1 x− 2 both give rise to rational functions, as does f(x) = x− 1 + 3 x2 + 3 (to see why, rewrite right-hand side with a common denominator of x2 + 3). The graphs of two simple rational functions are shown below. x y y = 1x x y y = 1x2 The graphs of more sophisticated rational functions are studied in Chapter 7. 1.6 The trigonometric functions (Ref: SH10 §1.6) The trigonometric functions will already be familiar to you. This section gives a brief summary of their definitions and some of their properties. Remark 1.6.1. In this course, angles are always measured in radians rather than degrees. (Radian measure is a more natural way of measuring angles; moreover, important formulae used in calculus, such as d dθ (sin θ) = cos θ, only make sense geometrically when θ is interpreted as an angle measured in radians rather than degrees.) We remind readers that 2π radians = 360 degrees. c©2020 School of Mathematics and Statistics, UNSW Sydney 26 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS Consider an angle θ and the corresponding point P (x, y) that lies on the unit circle centred at the origin, as shown below. 1 x y P (x, y) θ The functions cos : R→ R and sin : R→ R are defined by cos θ = x and sin θ = y. Other trig functions are defined in terms of the sine and cosine functions: tan θ = sin θ cos θ , provided that cos θ 6= 0 sec θ = 1 cos θ , provided that cos θ 6= 0 cosec θ = 1 sin θ , provided that sin θ 6= 0 cot θ = cos θ sin θ , provided that sin θ 6= 0. The graphs of sin, cos and tan are shown below. θ sin θ π 2 π 3π 2 2π−2π | | | | | 1 −1 | | θ cos θ π 2 π 3π 2 2π−2π | | | | | 1 −1 | | c©2020 School of Mathematics and Statistics, UNSW Sydney 1.6. THE TRIGONOMETRIC FUNCTIONS 27 θ tan θ π 2 π 3π 2 2π−2π | | | | | 1 −1 | | The sine and cosine functions are 2π-periodic, while the tangent function is π-periodic, which means that sin(θ + 2π) = sin θ, cos(θ + 2π) = cos θ and tan(θ + π) = tan θ whenever θ is in the domain of the respective function. Sine and cosine both have an amplitude of 1. It is not hard to obtain functions whose graphs have the same general shape as the sine (or cosine) curve with a period other than 2π and amplitude other than 1. For example, if A and ω are positive real numbers then the function f : R→ R, given by f(x) = A sin(ωx) ∀x ∈ R, has an amplitude of A and a period of 2πω . Its graph is shown below. x f(x) 2π ω−2πω | | | | | A −A | | The six trigonometric functions are related to one another by various identities and formulae: • complementary identities sin (π 2 − x ) = cos x cos (π 2 − x ) = sinx c©2020 School of Mathematics and Statistics, UNSW Sydney 28 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS • Pythagorean identities cos2 x+ sin2 x = 1 1 + tan2 x = sec2 x cot2 x+ 1 = cosec2 x • the sum and difference formulae sin(x± y) = sinx cos y ± cos x sin y cos(x± y) = cos x cos y ∓ sinx sin y tan(x± y) = tanx± tan y 1∓ tanx tan y • double-angle formulae sin(2x) = 2 sinx cos x cos(2x) = cos2 x− sin2 x tan(2x) = 2 tan x 1− tan2 x. The above formulae, especially the double-angle formulae for sin(2x) and cos(2x), should be learnt carefully. The inverse trigonometric functions will be discussed in Chapter 6. 1.7 The elementary functions (Ref: SH10 §1.6) Other important functions that have not yet been mentioned include the exponential and logarithm functions. Their graphs are shown below. x ex 1 (1, e)b x lnx 1 (e, 1) b These functions will be studied in greater depth in Chapter 9. Functions arising from the study of roots of polynomials are also important to mathematics, especially those functions f of the form f(x) = n √ x, where n is a positive integer. It is important c©2020 School of Mathematics and Statistics, UNSW Sydney 1.7. THE ELEMENTARY FUNCTIONS 29 that students are familiar with the graphs of the square root and cube root functions, shown below. x √ x (1, 1) b x 3 √ x (1, 1) (−1,−1) b b The elementary functions are all those functions that can be constructed by combining a finite number of polynomials, exponentials, logarithms, roots and trigonometric functions (including the inverse trigonometric functions) via function composition, addition, subtraction, multiplication and division. Hence the following expressions give rise to elementary functions: f(x) = esinx + x2 g(x) = lnx− tan x√ x h(x) = 3 √ x4 − 2x2 + 5. It also follows that every rational function is an elementary function. The absolute values function, given by abs : R→ R abs(x) = |x| ∀x ∈ R, is an elementary function by equation (1.2). It follows that if f is an elementary function then abs ◦ f is also an elementary function. It is standard to denote function abs ◦ f by |f |. Thus |f |(x) = (abs ◦ f)(x) = |f(x)| ∀x ∈ Dom(f). The graph of |f | is obtained by reflecting in the x-axis any part of the graph of f that lies below the x-axis. This is illustrated below. x f(x) x |f(x)| Most, if not all functions studied in high school were elementary functions. In this course we shall meet some useful functions that are not elementary functions. (These are defined by integrals; see Section 8.12.) c©2020 School of Mathematics and Statistics, UNSW Sydney 30 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS 1.8 Implicitly defined functions Many curves on the plane can be described as all those points (x, y) on the plane that satisfy some equation involving x and y. For example, consider the equation (x2 + y2 − 1)3 − x2y3 = 0. (1.3) The set of points (x, y) satisfying this equation are shown on the graph below. x y Note that the curve cannot be described by one function since some x-values have two corresponding y-values. However, the curve can be described by combining the graphs of two functions f : [−a, a]→ R and g : (−a, a)→ R, which are defined by the rules y = f(x) (x2 + y2 − 1)3 − x2y3 = 0 y ≥ b and y = g(x) (x2 + y2 − 1)3 − x2y3 = 0 y < b (where a ≈ 1.139 and b ≈ 0.545), as illustrated below. x y | b || a−a y = f(x) x y | b || a−a y = g(x) c©2020 School of Mathematics and Statistics, UNSW Sydney 1.8. IMPLICITLY DEFINED FUNCTIONS 31 We say that f and g are implicitly defined by equation (1.3). Example 1.8.1. The set of points (x, y) that satisfy the equation x2 + y2 = 1 describes a circle of radius 1 centred at the origin. The above equation could be used to implicitly define many different functions. We give two examples. • The function f : [0, 1] → R is defined by the rule y = f(x) x2 + y2 = 1 y ≥ 0. Its graph is shown below. x y 1 1 y = f(x) • The function g : (−1, 1)→ R is defined by the rule y = f(x) x2 + y2 = 1 y ≤ 0. Its graph is shown below. x y 1−1 −1 y = g(x) Many heavenly bodies (such as planets and comets) trace out paths that are roughly elliptic, parabolic or hyperbolic. (The fact that planets trace out elliptic paths around the sun was discov- ered by Johannes Kepler at the beginning of the sixteenth century, and was later used by Newton, in conjunction with calculus, to establish his universal law of gravitation.) Circles, ellipses, parabolas and hyperbolas are known as conic sections because they can be obtained by intersecting a cone with a plane. Equations that describe conic sections, accompanied by corresponding diagrams, are shown below. c©2020 School of Mathematics and Statistics, UNSW Sydney 32 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS x y r r −r −r Circle: x2 + y2 = r2 x y a b −a −b Ellipse: x2 a2 + y2 b2 = 1 x y y = bax a−a Hyperbola: x2 a2 − y 2 b2 = 1 x y y = bax b −b Hyperbola: y2 b2 − x 2 a2 = 1 Students should be able to recognise such curves. 1.9 Continuous functions The sine, cosine, exponential, logarithmic and polynomial functions are examples of continuous functions. Intuitively, a function f is continuous on its domain if small changes in the input variable x produce small changes in f(x) (assuming, of course, that x always remains within Dom(f)). The intuitive notion that a function is continuous if ‘its graph can be drawn without lifting the pencil off the page’ works well for some functions, but not for others. For example, the tangent function is a continuous function, even though its graph (part of which is shown below) cannot be drawn without lifting the pencil off the page. x tan x π 2 π−π2−π | | | | The break in the graph when, say, x = −π/2 or x = π/2, is a consequence of a break in the domain of the function rather than a discontinuity of the function. Similarly, the trigonometric functions c©2020 School of Mathematics and Statistics, UNSW Sydney 1.10. MAPLE NOTES 33 sec, cosec and cot are continuous, as are the rational functions. The notion of continuity shall be discussed in more precise terms in Chapters 2 and 3. Intuitively, a function f has a discontinuity at a point a if a ∈ Dom(f) and there is a break in the graph of f at a. It is important to realise that there are different types of discontinuities. We compare three examples. x f(x) bc b | a x g(x) bc b | a x h(x) b | a The functions f , g and h each have a discontinuity at a. • For f , the discontinuity can be easily removed by redefining the value of f(a) to ‘plug up the hole’ at a. This is an example of a removable discontinuity. • For g, the discontinuity is a result of a finite jump in the graph of g at the point a. The discontinuity for g, which is an example of a jump discontinuity, is worse than that for f but not nearly as bad as that for h. • For h, the discontinuity is a result of rapid oscillation on the right-hand side of a and an infinite jump on the left-hand side of a. This is an example of an essential discontinuity. While students are not expected to learn or use the terms removable, jump and essential disconti- nuity, it is expected that they will appreciate that some discontinuities are worse than others. 1.10 Maple notes Many chapters in these notes conclude with a summary of relevant Maple commands. Other sources of information are the First Year Computing Notes and the online Maple help command. While Maple can relieve you of the tedium of calculations, it is not a substitute for understanding the mathematics behind the calculations, which is what you need in order to interpret Maple output intelligently. The following Maple command is relevant to the material of this chapter: plot(f(x),x=a..b); draws a (two-dimensional) plot of the graph of y = f(x) for a ≤ x ≤ b. To find out about the many options available, use ?plot. Implicitly defined functions can be plotted using the smartplot command. For example, > # draw a rational function > f:=x^5-2*x^4+3*x^3-6*x^2-4*x+8: > g:=4*x^4-8*x^3-12*x^2+16*x+16: > plot(f/g, x=-10..10,y=-10..10,numpoints=1000,discont=true); c©2020 School of Mathematics and Statistics, UNSW Sydney 34 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS > # draw a hyperbola, via implicit definition > smartplot(x^2/9-y^2/4=1); > # use the Maple plot command implicitplot --- need more plotting tools > with(plots): > implicitplot(y^5+x^2*y^3+16,x=-10..10,y=-2..0); c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 1 35 Problems for Chapter 1 Questions marked with [R] are routine and with [H] harder. You should make sure that you can do the easier questions before you tackle the more difficult questions. Questions marked with [V] have a video solution available from Moodle. Problems 1.1 : Sets of numbers 1. [R] Express the following sets in words. Graph the sets on the number line (if possible). a) {x ∈ Z : −π < x < π} b) {x ∈ R : x2 − x− 1 < 0} c) {x ∈ Q : x2 = 2} 2. [R] Graph on the number line the following sets. a) [3,∞), (−∞, 3), (−∞,∞), (−3, 3] b) {x : |x− 2| < 5} c) {x : x2 + 4x− 5 > 0} 3. [R] Sketch the set of points (x, y) which satisfy the following relations. a) 0 ≤ y ≤ 2x and 0 ≤ x ≤ 2 b) y/2 ≤ x ≤ 2 and 0 ≤ y ≤ 4 Problems 1.2 : Solving inequalities and 1.3 : Absolute values 4. [R] Solve the following inequalities. a) x(x− 1) > 0 b) (x− 1)(x− 2) < 0 c) 1 x > −1 2 d) 1 1− x > 1 2 e) x ≥ 6 x− 1 5. [R] Solve the following inequalities. a) ∣∣x+ 1∣∣< 3 b) ∣∣x+ 2∣∣> 3 c) ∣∣3x+ 2∣∣< 1 d) ∣∣∣∣x− 1x+ 1 ∣∣∣∣ < 1 6. [R] [V] a) By expanding (x− y)2, prove that x2 + y2 ≥ 2xy for all real numbers x and y. b) Deduce that a+ b 2 ≥ √ ab for all non-negative real numbers a and b. When does equality hold? c) Use the result above to find the minimum value of y = x2 + 1 x2 . c©2020 School of Mathematics and Statistics, UNSW Sydney 36 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS 7. [R] True or false: a) If a > b then 1 a < 1 b . b) If a < b then a2 < b2. c) If 0 < a < b then a2 < b2. d) If a2 + b2 = 0 then a = b = 0. e) If −1 < a < b then a2 < b2. 8. [H] Prove that (x+ y)2 ≥ 4xy and hence deduce that 1 x2 + 1 y2 ≥ 4 x2 + y2 . 9. [H] [V] a) Prove that f(x) = 1 + x+ x2 is positive for all real numbers x. b) By considering cases (or otherwise) prove that 1 + x+ x2 + x3 + x4 is always positive. c) Generalise the above results. Problems 1.4 : Functions 10. [R] Determine the (maximal) domain and corresponding range for each function f described below. a) f(x) = √ 5− x2 b) f(x) = √x2 − 5 c) f(x) = (x− 8)−1/3 d) f(x) = √x− 1 e) f(x) = 1√ x− 1 f) f(x) = √ sinx g) f(x) = √ 1− 2 sinx h) f(x) = 1 + tan2 x i) f(x) = cos x if x < 0√ 1− x if 0 ≤ x ≤ 1 |x| if x > 1 11. [R] If f(x) = x+ 5 and g(x) = x2 − 3 then find a) (g ◦ f)(0) b) (g ◦ f)(x) c) (f ◦ g)(2) d) (f ◦ g)(x). 12. [R] If f(x) = x− 1 and g(x) = 1√ x− 1, then give the explicit forms of a) (f + g)(x) b) (fg)(x) c) ( f g ) (x) d) (f ◦ g)(x). Problems 1.5 : Polynomials and rational functions, 1.6 : The trigonometric functions and 1.7 : The elementary functions 13. [R] Draw neat sketches (preferably without using calculus) of the graphs given by the fol- lowing equations. c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 1 37 a) y = x2 − 5x+ 6 b) y = 2x3 − 16 c) y = 4 x− 3 d) y = 2ex−1 e) y = 1 x2 + 4 f) y = 3 sin 2x g) y = √ x− 1 14. [R] a) Sketch the graph of y = √ x+ 1 and use your graph to sketch (on the same diagram) y = 1√ x+ 1 . b) Repeat for y = x2 − 4x+ 3. 15. [R] Sketch the graph of y = x2 − 7x− 8 and hence sketch the graph of y = |x2 − 7x− 8|. 16. [R] What range of values will x2 + 4 take if −2 ≤ x ≤ 3? 17. [R] Use a graphical approach to solve |2x− 5| = x+ 2. 18. [R] a) Show that if p and q are polynomials then p ◦ q is again a polynomial. b) [H] Is the same true for rational functions? Problems 1.8 : Implicitly defined functions 19. [R] Sketch the graphs given by the following equations. a) x2 9 + y2 4 = 1 b) x2 9 − y 2 4 = 1 c) 4x2 + 9y2 = 36 d) y2 9 − x 2 4 = 1 c©2020 School of Mathematics and Statistics, UNSW Sydney 38 CHAPTER 1. SETS, INEQUALITIES AND FUNCTIONS c©2020 School of Mathematics and Statistics, UNSW Sydney 39 Chapter 2 Limits In many situations, it is desirable to know what the long term state of a system will be. For example, suppose that an initially unpolluted lake containing 109 litres of water has a river flowing through it at a rate of 106 litres per day. A factory is built next to the lake and discharges 104 litres of pollution per day. By making some simple assumptions, it can be shown that the amount P (t) of pollution in the lake after t days of the factory’s operation is given by P (t) = 109 101 ( 1− e−101t/105). (2.1) Environmental authorities want to know what the long term level of the pollution will be. To see whether the pollution in the lake eventually stabilises, and what the level of pollution will be, one studies the behaviour of P (t) as t→∞. Understanding the limiting behaviour of functions at infinity is important for practical and mathematical reasons. That is what we study in the next four sections. 2.1 Limits of functions at infinity (Ref: SH10 §4.7) In this section, we examine some techniques for calculating limits of the form lim x→∞ f(x). 2.1.1 Basic rules for limits There are a few basic things to bear in mind when working with limits. • If f(x) gets arbitrarily close to some real number L as x tends to infinity, then we write lim x→∞ f(x) = L. We can also write f(x)→ L as x→∞. • If f(x) gets arbitrarily large (that is, approaches ∞) as x tends to ∞ then we write f(x)→∞ as x→∞. We do not write lim x→∞ f(x) =∞ since∞ is not a real number. (The lim notation is only used when the limit exists and is a real number.) c©2020 School of Mathematics and Statistics, UNSW Sydney 40 CHAPTER 2. LIMITS • If f(x)→∞ as x→∞ then lim x→∞ 1 f(x) = 0. This is intuitively obvious and gives limits such as lim x→∞ 1 x = 0, lim x→∞ 5 4x6 = 0, lim x→∞ e −x = 0 (for the last example, recall that e−x = 1ex ). The following proposition shows that the arithmetic of limits behaves nicely in many situations. Proposition 2.1.1. Suppose that lim x→∞ f(x) and limx→∞ g(x) exist and are finite real numbers. Then (i) lim x→∞[f(x) + g(x)] = limx→∞ f(x) + limx→∞ g(x) (ii) lim x→∞[f(x)− g(x)] = limx→∞ f(x)− limx→∞ g(x) (iii) lim x→∞[f(x)g(x)] = [ limx→∞ f(x)][ limx→∞ g(x)] (iv) lim x→∞ f(x) g(x) = lim x→∞ f(x) lim x→∞ g(x) , provided that lim x→∞ g(x) 6= 0. Part of this proposition will be proved in Section 2.4. For now, we demonstrate how the proposition is used to calculate limits. Example 2.1.2. Calculate lim x→∞ 3− 1/x2 2 + e−x if the limit exists. Justify each step of your working with reference to Proposition 2.1.1. Solution. lim x→∞ 3− 1/x2 2 + e−x = lim x→∞(3− 1/x 2) lim x→∞(2 + e −x) (rule (iv)) = lim x→∞ 3− limx→∞ 1/x 2 lim x→∞ 2 + limx→∞ e −x (rules (i) and (ii)) = 3− 0 2 + 0 = 3 2 . In practice, you may use the rules of Proposition 2.1.1 without referring to them, and leave out some of the steps shown above. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.1. LIMITS OF FUNCTIONS AT INFINITY 41 2.1.2 The pinching theorem The idea behind the following theorem is intuitively obvious. Suppose, as illustrated in the diagram below, that the graph of a function h always lies above the graph of a function f . Suppose also that f and h have the same limit as x →∞. If the graph of a function g always lies between the graphs of h and f (so that g is ‘pinched’ between f and h) then g has the same limit also. x y | b L y = f(x) y = g(x) y = h(x) This idea is stated precisely below. Theorem 2.1.3 (The pinching theorem). Suppose that f , g and h are all defined on the interval (b,∞), where b ∈ R. If f(x) ≤ g(x) ≤ h(x) ∀x ∈ (b,∞) and lim x→∞ f(x) = limx→∞h(x) = L then lim x→∞ g(x) = L. One can modify the theorem to cover the case when x→ −∞. Example 2.1.4. Use the pinching theorem to find lim x→∞ sinx x . Solution. We begin with the basic inequality −1 ≤ sinx ≤ 1, which is valid for every real number x. If we restrict x to the interval (0,∞) then we have −1 x ≤ sinx x ≤ 1 x . Now lim x→∞− 1 x = lim x→∞ 1 x = 0, and so lim x→∞ sinx x = 0 by the pinching theorem. c©2020 School of Mathematics and Statistics, UNSW Sydney 42 CHAPTER 2. LIMITS 2.1.3 Limits of the form f(x)/g(x) Suppose that we want to calculate a limit of the form lim x→∞ f(x) g(x) where both f(x) and g(x) tend to infinity as x→∞. In this situation, we cannot apply the rules of Proposition 2.1.1 directly because the numerator and denominator don’t have finite limits. Instead, we find an equivalent form of f(x)g(x) for which Proposition 2.1.1 can be applied. The key idea is to divide both f and g by the fastest growing term appearing in the denominator g. Example 2.1.5. Evaluate lim x→∞ 4x2 − 5 2x2 + 3x , if it exists. Solution. There are two terms appearing in the denominator: 2x2 and 3x. As x → ∞, the term which grows fastest is the one involving x2. So we divide the numerator and denominator by x2 to evaluate the limit: 4x2 − 5 2x2 + 3x = 4− 5/x2 2 + 3/x → 4− 0 2 + 0 as x→∞. Therefore lim x→∞ 4x2 − 5 2x2 + 3x = 2. In general, we divide the numerator and denominator by the highest power of x in the denom- inator. Example 2.1.6. Find lim x→∞ 5x3 + 6x2 − 4 sinx cos 3x+ 5x− 2x3 . Example 2.1.7. Find lim x→∞ x2 + 3x√ 2x4 + 3− 4x . 2.1.4 Limits of the form √ f(x)−√g(x) Consider the limit of √ x+ 5−√x+ 2 as x→∞. Since both √x+ 5 and √x+ 2 tend to infinity (rather than a finite number) as x→∞, one cannot apply Proposition 2.1.1. As the next example shows, a simple algebraic trick is used to overcome this difficulty. Example 2.1.8. Evaluate lim x→∞ (√ x+ 5−√x+ 2), if it exists. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.1. LIMITS OF FUNCTIONS AT INFINITY 43 Solution. The algebraic trick is to multiply both the numerator and denominator by √ x+ 5 +√ x+ 2 and expand the numerator as a difference of squares: √ x+ 5−√x+ 2 = ( √ x+ 5−√x+ 2)(√x+ 5 +√x+ 2)√ x+ 5 + √ x+ 2 = (x+ 5)− (x+ 2)√ x+ 5 + √ x+ 2 = 3√ x+ 5 + √ x+ 2 → 0 as x→∞. One should not be fooled into thinking, on the basis of the above example, that all limits of this type are zero. Example 2.1.9. Show that lim x→∞ (√ x2 + x− x) = 1 2 . 2.1.5 Indeterminate forms We have already mentioned that the rules of Proposition 2.1.1 only apply when the limits involved are finite. In Section 2.1.3, we studied limits of the type lim x→∞ f(x) g(x) , when f(x)→∞ and g(x)→∞ as x→∞. We say that such a limit is of the form ∞∞ . We cannot say in advance whether or not a limit of the form ∞∞ exists, and if it does exist, what its value is. For example, while the following limits have the form ∞∞ , each displays very different limiting behaviour as x→∞: • x2x →∞ as x→∞ • x x2 → 0 as x→∞ • 2x2 x2 → 2 as x→∞ • x2 2x2 → 12 as x→∞. Because we cannot determine in advance what kind of limiting behaviour something of the form ∞ ∞ has, we say that ∞ ∞ is an indeterminate form. Other types of indeterminate forms (for example, limits of the form 00 and∞−∞) arise naturally in many applications. Techniques for evaluating such limits will be developed at various stages of the course. c©2020 School of Mathematics and Statistics, UNSW Sydney 44 CHAPTER 2. LIMITS 2.2 The definition of lim x→∞ f(x) Up to this point, we have treated limits in a rather intuitive way. It is now time to establish a rigorous basis for our treatment of limits. That is, we seek a definition for lim x→∞ f(x) = L that is mathemetically precise and agrees with our intuitive notion of limits. How do we do this? A good strategy is to think deeply about a few simple examples before considering the general case. Consider the real-valued functions f , g and h defined by f(x) = 1 x , g(x) = sinx and h(x) = 0.05 whenever x > 0. Their graphs are sketched below. x f(x) 0 x g(x) 0 2pi 4pi x h(x) 0 Intuitively, we see that f(x) tends to 0 as x tends to infinity, while neither g nor h have this property. What is it about f that distinguishes its limiting behaviour from that of g or h? The above graph shows that the distance between f(x) and its limit 0 is small whenever x is large. However, to get to the core difference between f and the other two functions, we must be more precise. • The distance between h(x) and 0 is also small, but its limit is not 0. The reason why f(x) tends to 0 as x → ∞ is that the distance between f(x) and 0 can be made as small as we like whenever x is large enough. • The distance between g(x) and 0 can also be made as small as we like, by taking x sufficiently close to an integer multiple of π. However the limit of g(x) is not 0. The difference between f and g is that the distance between f(x) and 0 remains small for all sufficiently large values of x. The same is not true for g. In summary, f(x) tends to 0 as x →∞ because the distance between f(x) and 0 can be made as small as we like for all values of x that are sufficiently large. The next step in crafting a good definition is to express phrases like ‘can be made as small as we like’ and ‘sufficiently large’ in concrete mathematical language. The following list is one attempt to do this. (As a reminder, f(x) = 1/x.) • The distance between f(x) and 0 is less than 1 whenever x > 1. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.2. THE DEFINITION OF LIM X→∞ F (X) 45 x y y = f(x) 1 0 −1 1 (1, 1) • The distance between f(x) and 0 is less than 0.2 whenever x > 5. x y y = f(x) 0.2 0 −0.2 5 (5, 0.2) • The distance between f(x) and 0 is less than 0.1 whenever x > 10. • The distance between f(x) and 0 is less than 0.01 whenever x > 100. • The distance between f(x) and 0 is less than 0.0001 whenever x > 10000. • etc. Of course, to express the fact that ‘the distance between f(x) and 0 can be made as small as we like’ in this way would require an infinite list. Instead, we say that, for every small positive number ǫ, there is real number M such that • the distance between f(x) and 0 is less than ǫ whenever x > M . x y y = f(x) ǫ 0−ǫ M (1ǫ , ǫ) By looking at the graph we can see that M = 1ǫ . (The symbol ǫ is the Greek letter ‘epsilon’ and is traditionally used by mathematicians in this context.) So far we have agreed that f(x)→ 0 as x→∞ precisely because c©2020 School of Mathematics and Statistics, UNSW Sydney 46 CHAPTER 2. LIMITS for every positive real number ǫ, there is a real numberM such that the distance between f(x) and 0 is less than ǫ whenever x > M . This statement forms a basis for defining what is meant by lim x→∞ f(x) = 0. A general definition for what is meant by lim x→∞ f(x) = L is now obtained by • replacing the limit 0 with the limit L, and then • replacing ‘the distance between f(x) and L’ with |f(x)− L| (see Section 1.3). We are now ready to give a general definition for the limit of a function at infinity. Definition 2.2.1. Suppose that L is a real number and f is a real-valued function defined on some interval (b,∞). We say that lim x→∞ f(x) = L if for every positive real number ǫ, there is a real number M such that if x > M then |f(x)− L| < ǫ. Remark 2.2.2. Suppose that lim x→∞ f(x) = L. Definition 2.2.1 can be interpreted geometrically in the following way. For every small number ǫ, you can find a marker M on the x-axis such that the distance between f(x) and the limit L is less than ǫ whenever x lies to the right of M . x y y = f(x) L+ ǫ L L− ǫ M | Remark 2.2.3. In Definition 2.2.1, the number M depends on ǫ. In general, the smaller the value of ǫ, the larger the value of M . We note, however, that for any particular value of ǫ the choice of M is not unique. For example, the M chosen for the above diagram is different from the M chosen for the diagram below, but both choices guarantee that |f(x)− L| < ǫ whenever x > M. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.3. PROVING THAT LIM X→∞ F (X) = L USING THE LIMIT DEFINITION 47 x y y = f(x) L+ ǫ L L− ǫ M | Remark 2.2.4. Students who find Definition 2.2.1 difficult to understand can take comfort in the fact that a rigorous formulation of limits evaded mathematicians for over 2000 years. The problems and paradoxes involving limiting processes tabled around 450 BC by the ancient Greek philosopher Zeno of Elea were only satisfactorily resolved in the nineteenth century, when a rigorous definition of the limit was given by Bolzano (in 1817) and Weierstrass (in the 1850s). 2.3 Proving that lim x→∞ f(x) = L using the limit definition In the previous section we gave a definition for lim x→∞ f(x) = L. In this section this definition is used to prove limits that were previously taken for granted. Example 2.3.1. Prove from Definition 2.2.1 that lim x→∞ 4 x2 = 0. Proof. Suppose that f(x) = 4/x2, L = 0 and ǫ > 0. We begin by calculating the distance between the function and its limit: |f(x)− L| = ∣∣∣∣ 4x2 − 0 ∣∣∣∣ = 4 x2 . (2.2) We need to find a condition on x such that |f(x)− L| < ǫ. By (2.2), we require that 4 x2 < ǫ. By rearrangement, we see that this is satisfied whenever x > 2√ ǫ . So far we have shown that |f(x)− L| < ǫ whenever x > 2√ ǫ . Hence if M = 2√ ǫ . (2.3) then |f(x)− L| < ǫ whenever x > M . This completes the proof. c©2020 School of Mathematics and Statistics, UNSW Sydney 48 CHAPTER 2. LIMITS The above proof can be streamlined by omitting some explanatory commentary. In the next example, explanatory comments (which are not part of the proof) are inserted inside square brack- ets. Example 2.3.2. Prove that lim x→∞ 5x x+ 3 = 5 by using Definition 2.2.1. Proof. Suppose that ǫ > 0. For convenience, suppose also that x > 0. [This is allowed since we are considering the behaviour of f(x) for large positive values of x.] Now |f(x)− L| = ∣∣∣∣ 5xx+ 3 − 5 ∣∣∣∣ = ∣∣∣∣5x− 5(x+ 3)x+ 3 ∣∣∣∣ = ∣∣∣∣ −15x+ 3 ∣∣∣∣ = 15 x+ 3 [since x > 0] < 15 x [to make algebra simpler later on]. In summary, |f(x)− L| < 15 x . [This inequality gives an upper bound for the distance between f(x) and L.] Hence |f(x)− L| < ǫ whenever 15 x < ǫ. This condition is equivalent to x > 15 ǫ . Hence if M = 15ǫ then |f(x)− L| < ǫ whenever x > M, as required. Before attempting one more proof, we reflect on the general strategy employed. Given ǫ, we need to find a number M such that |f(x)− L| < ǫ whenever x > M. The number M can be found by following the procedure below. 1. Find a good upper bound for |f(x)− L|. 2. Find a simple condition on x such that this upper bound is less than ǫ. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.4. PROOFS OF BASIC LIMIT RESULTS [X] 49 3. Use this condition to state an appropriate value for M (in terms of ǫ). Example 2.3.3. Prove that lim x→∞ sinx 9x2 + e3x = 0 by using Definition 2.2.1. Proof. Suppose that ǫ > 0. Now |f(x)− L| = ∣∣∣∣ sinx9x2 + e3x − 0 ∣∣∣∣ = | sinx| |9x2 + e3x| ≤ 1 9x2 + e3x [using the fact that −1 ≤ sinx ≤ 1] < 1 9x2 [decreasing the denominator increases the RHS]. In summary, |f(x)− L| < 1 9x2 . [This is our upper bound for |f(x)− L|.] Now 1 9x2 < ǫ whenever x > 1 3 √ ǫ . Set M = 1 3 √ ǫ . Then |f(x)− L| < ǫ whenever x > M , as required. Remark 2.3.4. As pointed out in Remark 2.2.3, the value of M is not unique. For instance, the upper bound for |f(x) − L| in our solution to Example 2.3.3 was 1/(9x2) and the corresponding value for M was 1/(3 √ ǫ). If instead we use the upper bound 1/e3x then the corresponding value for M is 13 ln 1 ǫ . 2.4 Proofs of basic limit results [X] By using Definition 2.2.1, one can now prove the limit rules stated in Section 2.1. The selection of proofs given in this section is designed to give students a taste of how such results are proved. Proof of Proposition 2.1.1 (i). Suppose that lim x→∞ f(x) = L1 and limx→∞ g(x) = L2. c©2020 School of Mathematics and Statistics, UNSW Sydney 50 CHAPTER 2. LIMITS We need to show, using Definition 2.2.1, that lim x→∞ ( f(x) + g(x) ) = L1 + L2. We begin by fixing a positive real number ǫ. Since lim x→∞ f(x) = L, there is a positive M1 such that |f(x)− L1| < ǫ 2 whenever x > M1. Similarly, there is a positive M2 such that |g(x) − L2| < ǫ 2 whenever x > M2. Hence, by the triangle inequality,∣∣(f(x) + g(x)) − (L1 + L2)∣∣ ≤ |f(x)− L1|+ |g(x) − L2| < ǫ 2 + ǫ 2 = ǫ whenever both x > M1 and x > M2. Now set M = max{M1,M2} so that∣∣(f(x) + g(x)) − (L1 + L2)∣∣ < ǫ whenever x > M. This proves that lim x→∞ ( f(x) + g(x) ) = L1 + L2. The other rules in Proposition 2.1.1 can be proved similarly, except that sometimes the desired inequalities are harder to obtain. We move on to prove the pinching theorem. Proof of Theorem 2.1.3. Suppose that f(x) ≤ g(x) ≤ h(x) ∀x ∈ (b,∞) and that lim x→∞ f(x) = limx→∞h(x) = L. Fix a positive real number ǫ. Since lim x→∞ f(x) = limx→∞h(x) = L, there are real numbers M1 and M2, both greater than b, such that |f(x)− L| < ǫ whenever x > M1 and |h(x) − L| < ǫ whenever x > M2. In other words, L− ǫ < f(x) < L+ ǫ whenever x > M1 and L− ǫ < h(x) < L+ ǫ whenever x > M2. Hence L− ǫ < f(x) ≤ g(x) ≤ h(x) < L+ ǫ whenever x > max{M1,M2}. This shows that |g(x) − L| < ǫ whenever x > max{M1,M2}, completing the proof. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.5. LIMITS OF FUNCTIONS AT A POINT 51 2.5 Limits of functions at a point (Ref: SH10 §§2.1–2.3, 2.5) So far we have only discussed limits of the form lim x→∞ f(x). In this section we study limits of the form lim x→a f(x), where a is a finite real number. 2.5.1 Left-hand, right-hand and two-sided limits Consider the function f whose graph is shown below. x f(x) | | | | 2 4 6 8 | | | | 1 2 3 4 b b b bc bc With reference to this graph, we will discuss the behaviour of f(x) when x is near the points 2, 4, 6 and 8. • As x approaches 2 from the left-hand side, f(x) approaches 3. In this situation, we write lim x→2− f(x) = 3 and say that ‘the limit of f(x) as x approaches 2 from the left is 3.’ (Note that f(2) = 1 6= 3. One should not confuse a limit with a function value.) As x approaches 8 from the left-hand side, f(x) approaches 1 and hence we write lim x→8− f(x) = 1. Both of these limits are called left-hand limits. • As x approaches 2 from the right-hand side, f(x) approaches 1. In this situation, we write lim x→2+ f(x) = 1 and say that ‘the limit of f(x) as x approaches 2 from the right is 1.’ As x approaches 6 from the right-hand side, f(x) approaches 2 and hence we write lim x→6+ f(x) = 2. Both of these limits are called right-hand limits. c©2020 School of Mathematics and Statistics, UNSW Sydney 52 CHAPTER 2. LIMITS • As x approaches 4 from the left-hand side, f(x) is positive and grows large without bound. In this case, lim x→4− f(x) does not exist because f(x) does not approach a real number. However, we can write f(x)→∞ as x→ 4−. • As x approaches 4 from the right-hand side, f(x) is negative and grows large without bound. In this case, lim x→4+ f(x) does not exist but we can we write f(x)→ −∞ as x→ 4+. Once an understanding of left- and right-hand limits is grasped, we can talk about two-sided limits. Definition 2.5.1. If the left-hand limit lim x→a− f(x) and the right-hand limit lim x→a+ f(x) both exist and equal the same real number L, then we say that the limit of f(x) as x→ a exists and is equal to L, and we write lim x→a f(x) = L. If any one of these conditions fails then we say that lim x→a f(x) does not exist. Recall the graph of the function f introduced at the beginning of this section. • The two-sided limit lim x→2 f(x) does not exist because lim x→2− f(x) 6= lim x→2+ f(x). • The two-sided limit lim x→4 f(x) does not exist because lim x→4− f(x) does not exist. • Since the left-hand and right-hand limits both exist at 6 and lim x→6− f(x) = lim x→6+ f(x) = 2, the two-sided limit lim x→6 f(x) exists and lim x→6 f(x) = 2. Note that f(6) = 4 and so lim x→6 f(x) 6= f(6). That is, the two-sided limit of a function at a point need not equal the value of the function at that point. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.5. LIMITS OF FUNCTIONS AT A POINT 53 • Since the left-hand and right-hand limits at 8 both exist and equal 1, the two-sided limit lim x→8 f(x) exists and lim x→8 f(x) = 1. In this case, the two-sided limit of f at 8 equals the value of the function at 8. That is, lim x→8 f(x) = f(8). The last two points emphasise an important feature of limits and functions. If lim x→a f(x) = L then f(x) approaches L as x tends to a. The function f may not be defined at a, or if it is defined at a, we can have the situation where f(a) = L or where f(a) 6= L. In other words, the value of f(a) does not determine the value of lim x→a f(x). As far as the limit is concerned, all that matters is how f behaves when it is very close to the point a. Remark 2.5.2. In this section we have not given a proper definition for limits of the form lim x→a− f(x) or lim x→a+ f(x). At this stage it is enough to know that such a rigorous definition exists and is similar to the definition for limits of the form lim x→∞ f(x) (see Definition 2.2.1). This definition can be used to prove the basic limit results presented in the remainder of this chapter. 2.5.2 Limits and continuous functions Using limits we can now give a definition for continuity. Definition 2.5.3. Suppose that f is defined on some open interval containing the point a. If lim x→a f(x) = f(a), then we say that f is continuous at a; otherwise, we say that f is discontinuous at a. To illustrate, consider the function f whose graph is sketch below. x f(x) | | | 2 4 6 b b bc bc The function f is continuous at 2, discontinuous at 4 and undefined at 6. (We do not say that f is discontinuous at 6 since 6 /∈ Dom(f).) We now extend our formal notion of continuity at a point to continuity on the real line. c©2020 School of Mathematics and Statistics, UNSW Sydney 54 CHAPTER 2. LIMITS Definition 2.5.4. If f : R → R is continuous at every point a in R then we say that f is continuous everywhere. If a function f : R→ R is continuous everywhere then it has the property that its graph can be drawn on the Cartesian plane as an unbroken curve. This coincides with intuitive notions about continuity gained in high school. As mentioned in Section 1.9, polynomials, rational functions, the trigonometric functions, ex- ponentials and logarithms are continuous at every point in the respective domains. Moreover, if f is continuous at a point a then |f | is also continuous at a (see Section 1.7 for the definition of |f |). The proof of these facts requires a rigorous definition of the limit at a point, and, as already mentioned, such a definition is not presented in this course. However, we will discuss aspects of the proof in Chapter 3. A consequence of the above definition is that both one-sided and two-sided limits of f at a point a are easy to evaluate provided that (a) a ∈ Dom(f) and (b) we know that f is a continuous at a. Hence, in view the facts mentioned above, • lim x→2 sin(x) = sin(2) • lim x→3+ (x2 − x+ 1) = lim x→3 (x2 − x+ 1) = 32 − 3 + 1 = 7 • lim x→4 |x− 10| = |4− 10| = 6. Remark 2.5.5. Continuity is a deep property for a function to have. Contrary to the impression that many students form from their study of functions at school, most functions are not continuous. Three interesting functions with discontinuities are mentioned below. • The function f : R→ R, defined by f(x) = { 1 if x ∈ Q 0 if x /∈ Q, is discontinuous at every point in its domain. • The function f : R→ R, defined by f(x) = { x if x ∈ Q −x if x /∈ Q, is continuous at 0 but discontinuous everywhere else. • It is not too difficult to construct a function f : R→ R which is continuous at every irrational point but discontinuous at every rational point. Hence, one should not assume that the graph of a general function f : R → R is continuous at even a majority of points in its domain. Continuous functions are quite special and deserve much attention. We will study them in greater depth in Chapter 3. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.5. LIMITS OF FUNCTIONS AT A POINT 55 2.5.3 Rules for limits at a point The following proposition, which is analogous to Proposition 2.1.1, shows that limits at a point have nice arithmetic properties. Proposition 2.5.6. Suppose that a ∈ R and that lim x→a f(x) and limx→a g(x) exist and are finite real numbers. Then (i) lim x→a(f(x) + g(x)) = limx→a f(x) + limx→a g(x) (ii) lim x→a(f(x)− g(x)) = limx→a f(x)− limx→a g(x) (iii) lim x→a f(x)g(x) = limx→a f(x) limx→a g(x) (iv) lim x→a f(x) g(x) = lim x→a f(x) lim x→a g(x) , provided that lim x→a g(x) 6= 0. A similar set of rules hold for left- and right-hand limits. Proposition 2.5.6 says that limits interact nicely with function addition, subtraction, multiplica- tion and division. The next proposition shows that limits interact nicely with function composition. Proposition 2.5.7. If lim x→a f(x) = L and g is continuous at L then lim x→a g(f(x)) = g ( lim x→a f(x) ) . If the functions f and g are continuous everywhere, then Proposition 2.5.7 implies that lim x→a g(f(x)) = g ( lim x→a f(x) ) for any point a in R. Example 2.5.8. Evaluate the limit lim x→ √ π/3 cos ( x2 + π6 ) , justifying each step of your calculation. Solution. Since the cosine function and polynomials are continuous everywhere, lim x→ √ π/3 cos ( x2 + π 6 ) = cos ( lim x→ √ π/3 x2 + π 6 ) (by Proposition 2.5.7) = cos (π 3 + π 6 ) (since polynomials are continuous) = cos (π 2 ) = 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 56 CHAPTER 2. LIMITS A version of the pinching theorem for two-sided limits is given below. Similar versions exist for left- and right-hand limits. x y | a L y = f(x) y = g(x) y = h(x) Theorem 2.5.9 (The pinching theorem). Let I be an open interval containing the point a. Suppose that f , g and h are all defined on I except possibly at the point a. If f(x) ≤ g(x) ≤ h(x) ∀x ∈ I and lim x→a f(x) = limx→ah(x) = L then lim x→a g(x) = L. Remark 2.5.10. The pinching theorem can be used to prove the well-known limit lim θ→0 sin θ θ = 1. See the problem set of Chapter 2 for details. We close this chapter with four examples involving limits of the form lim x→a f(x), where f is not necessarily continuous at the point a. Determining whether or not the limit exists, and if so what its value is, requires special analysis of the function f . Example 2.5.11. Determine whether or not the limit lim x→0 x sin(1/x) exists. If it exists, find its value. Solution. We begin with the inequality −1 ≤ sin θ ≤ 1 which is valid for all θ in R. In particular, if θ = 1/x then −1 ≤ sin(1/x) ≤ 1 whenever x 6= 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.5. LIMITS OF FUNCTIONS AT A POINT 57 Case 1: Suppose that x > 0. Then we have −x ≤ x sin(1/x) ≤ x. Since lim x→0+ −x = lim x→0+ x = 0, it follows from the pinching theorem that lim x→0+ x sin(1/x) = 0. Case 2: Suppose that x < 0. Then −x ≥ x sin(1/x) ≥ x. Since lim x→0− −x = lim x→0− x = 0, it follows from the pinching theorem that lim x→0− x sin(1/x) = 0. Conclusion: Since the left- and right-hand limits exist and are both equal to 0, the two-sided limit exists and lim x→0 x sin(1/x) = 0. This limit is illustrated graphically below. x y y = xy = −x y = x sin(1/x) Example 2.5.12. Suppose that f : R→ R is defined by the formula f(x) = x2 − 4 x− 2 if x 6= 2 0 if x = 2. Discuss the limiting behaviour of f(x) as x approaches 2. Solution. Since we are discussing limiting behaviour, the value of f at 2 is irrelevant. All that matters is how f(x) behaves near 2. So suppose that x 6= 2. Then f(x) = x2 − 4 x− 2 = (x− 2)(x + 2) x− 2 = x+ 2. The graph of f is now easy to sketch. c©2020 School of Mathematics and Statistics, UNSW Sydney 58 CHAPTER 2. LIMITS x f(x) bc b 2 |4 Since lim x→2− f(x) = lim x→2+ f(x) = 4, the two-sided limit exists and we have lim x→2 f(x) = 4. (Note that f is not continuous at 2 since lim x→2 f(x) 6= f(2).) Example 2.5.13. Suppose that f : R→ R is defined by the formula f(x) = |x2 − 9| x− 3 if x 6= 3 2 if x = 3. Discuss the limiting behaviour of f(x) as x approaches 3. Example 2.5.14. Suppose that f(x) = sin (π x ) , x 6= 0. Discuss the limiting behaviour of f(x) as x approaches 0. Solution. Since f is an odd function, we initially restrict our attention to positive values of x. Now sin θ = 0 whenever θ ∈ {π, 2π, 3π, . . .}. If θ = π/x then we have sin(π/x) = 0 whenever x = 1 1 , 1 2 , 1 3 , . . . . Similarly, sin θ = 1 whenever θ ∈ {π/2, 5π/2, 9π/2, 13π/2, . . .} so sin(π/x) = 1 whenever x = 2 1 , 2 5 , 2 9 , 2 13 . . . . Consequently, no matter how close x is to 0 on the right, we can always find two closer points x1 and x2 such that f(x1) = 0 and f(x2) = 1. We conclude that f has no right-hand limit at 0. Since f is odd, it follows that f has no left-hand limit at 0. Hence lim x→0+ f(x) does not exist. The graph of f near the origin is shown below. c©2020 School of Mathematics and Statistics, UNSW Sydney 2.6. MAPLE NOTES 59 x f(x) −1 1 The f(x) oscillates wildly between 1 and −1 as x tends to 0, and lim x→0+ f(x) does not exist. 2.6 Maple notes The following Maple command is relevant to the material of this chapter: limit(f(x),x=a); finds the (two-sided) limit of f(x) as x → a. One-sided limits can be specified using limit(f(x),x=a,right); or limit(f(x),x=a,left);. For example, > limit(sin(x)/x, x=0); 1 > limit(sin(x)/x, x=infinity); 0 > # the next limit needs methods from chapter 5 for hand calculations > limit((1-cos(x))/x^2, x=0, left); 1/2 > # Maple produces results that are not in strict agreement with our definitions as x goes to plus or minus infinity > limit(x^2, x=-infinity); ∞ > limit(sin(x), x=-infinity); −1 . . . 1 > limit(x^2*sin(x), x=-infinity); undefined c©2020 School of Mathematics and Statistics, UNSW Sydney 60 CHAPTER 2. LIMITS Problems for Chapter 2 Problems 2.1 : Limits of functions at infinity 1. [R] [V] Evaluate the following limits if they exist. a) lim x→∞ x2 − 1 x2 + 1 b) lim x→∞ 2x2 + x− 1 x2 + 4x− 3 c) lim x→∞ 2x2 + 5x− 1 x3 + x d) lim x→∞ x5 + 5x+ 1 x4 + 3 e) lim x→∞ 5x2 − 3x+ cos 7x 4 + sin 2x+ x2 f) lim x→∞ sinx 2. [R] Use the pinching theorem to find the following limits. a) lim x→∞ sinx x b) lim x→∞ cosx x2 3. [R] [V] a) Prove that lim x→∞( √ x+ 1−√x) = 0. b) Show that lim x→∞( √ x2 + x − x) = 1 2 . Problems 2.2 : The definition of lim x→∞ f(x) and 2.3 : Proving that lim x→∞ f(x) = L using the limit definition 4. [R] a) Write down the formal definition for the statement lim x→∞ f(x) = L. b) Evaluate lim x→∞ 1 2x2 . c) Verify from the formal definition that your answer in (b) is correct. 5. [R] a) Evaluate lim x→∞ x2 + 1 x2 . b) Find a real number M such that the distance between x2 + 1 x2 and its limit is less than 0.01 whenever x > M . c) Suppose that ǫ > 0. Find a real number M (expressed in terms of ǫ) such that the distance between x2 + 1 x2 and its limit is less than ǫ whenever x > M . c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 2 61 6. [R] [V] For each of the following, find the limit of f(x) as x tends to infinity and prove from the definition that your answer is correct. a) f(x) = 4x x+ 7 b) f(x) = x− 3 x2 + 3 c) f(x) = e−2x d) f(x) = sinx x e) f(x) = sin 3x x2 + 4 7. [R] [V] A parcel is dropped from an aeroplane. A simple model, taking into account gravity and air resistance, suggests that the parcel’s velocity v(t) (in metres per second) is given by v(t) = 50(1 − e−t/5), where t is the number of seconds since leaving the plane. a) Calculate the terminal velocity of the parcel (that is, find lim t→∞ v(t)). b) The parcel never attains its terminal velocity. How long does it take to come within 1 metre per second of its terminal velocity? Problems 2.5 : Limits of functions at a point 8. [R] Evaluate the following limits. a) lim x→3 2x+ 4 b) lim x→2 x2 − 4 x− 2 c) limx→1 x3 − 1 x− 1 d) limx→3 1 x − 13 x− 3 9. [R] [V] a) Find the left-hand limit lim x→2− |x− 2| x− 2 . b) Find the right-hand limit lim x→2+ |x− 2| x− 2 . c) Does lim x→2 |x− 2| x− 2 exist? 10. [R] By finding the left- and right-hand limits first, decide whether or not each of the following limits exist and if so find their values. a) lim x→0 x |x| b) limx→2 |x2 − 4| x− 2 c) limx→4 x− 4 |x− 4| d) limx→0 4 x 11. [R] [V] a) Use the pinching theorem to find lim x→0 x sin 1x . b) Repeat for lim x→0 x2 sin 1 2x . c©2020 School of Mathematics and Statistics, UNSW Sydney 62 CHAPTER 2. LIMITS 12. [R] [V] Suppose that θ is a (positive) angle measured in radians and consider the diagram below. θ O A B C D The curve segment CB is the arc of a circle of radius 1 centre O. a) Write down, in terms of θ, the length of arc CB and the lengths of the line segments CA and DB. b) By considering areas, deduce that sin θ cos θ ≤ θ ≤ tan θ whenever 0 < θ < π2 . c) Use the pinching theorem to show that lim θ→0+ θ sin θ = 1. d) Deduce that lim θ→0 sin θ θ = 1. 13. [H] Discuss the limiting behaviour of cos 1x as x→ 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 63 Chapter 3 Properties of continuous functions In the world of functions, there is a small civilised country inhabited by the race of continuous functions. Citizens of this country, such as the polynomials and exponentials, are studied (and perhaps even loved) by students in high schools throughout the human world. However, outside this small civilised country is a vast untamed universe of all sorts of discontinuous functions. From this perspective, the continuous functions are a very rare breed. However, continuous functions are far from endangered. They are among the most useful functions for modelling real-life phenomena, particularly when the quantity that is being modelled occurs in nature and changes smoothly with time. For example, • the distance from the earth to the sun at time t, • the height of Mount Everest at time t, • the volume of water in a dam at time t, • the mass of an earthworm at time t, and • the temperature at the back of a lecture theatre at time t can all be modelled by continuous functions of the variable t. In addition to their usefulness for modelling naturally occurring quantities, continuous functions have important mathematical characteristics. The combination of the usefulness of continuous functions and their rarity among the world of functions means that continuity is a deep property. In this chapter we begin to explore the ramifications of continuity. 3.1 Combining continuous functions (Ref: SH10 §2.4) If two continuous functions are combined via function addition, subtraction, multiplication, division or composition, when is the resulting function also continuous? The two next propositions answer this question. Proposition 3.1.1. Suppose that the functions f and g are continuous at a point a. Then f + g, f − g and fg are continuous at a. If g(a) 6= 0 then f/g is also continuous at a. c©2020 School of Mathematics and Statistics, UNSW Sydney 64 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS Proof. Suppose that f and g are continuous at a. Then, by the definition of continuity at a point (see Definition 2.5.3), lim x→a f(x) = f(a) and limx→a g(x) = g(a). Therefore lim x→a(f + g)(x) = limx→a(f(x) + g(x)) (by the definition of f + g) = lim x→a f(x) + limx→a g(x) (by Proposition 2.5.6) = f(a) + g(a) (since f and g are continuous) = (f + g)(a) (by the definition of f + g). Hence f + g is continuous at a. The proofs that the functions f − g, fg and f/g are continuous at a are similar. Proposition 3.1.2. Suppose that f is continuous at a and that g is continuous at f(a). Then g ◦f is continuous at a. Proof. Combine Definition 2.5.3 with Proposition 2.5.7. In Chapters 1 and 2 it was stated (without proof) that the polynomials, rational functions and trigonometric functions are continuous at every point in their respective domains. The next example shows that the continuity of these functions can be deduced from the continuity of • the constant functions • the sine function, and • the function f : R→ R given by f(x) = x. Example 3.1.3. Suppose that a is a point in R. By assuming that the constant functions, the sine function and the function f : R→ R, given by f(x) = x, are continuous everywhere, show that (a) every polynomial is continuous at a, (b) every rational function r is continuous at a (provided that a ∈ Dom(r)), and (c) the cosine and tangent functions are continuous at a (provided that a ∈ Dom(tan)). Solution. (a) Every polynomial can be constructed from constant functions and the function f by a finite number of function multiplications and additions. (For example, x3 − 4x2 + 5 = (x× x× x) + ((−4) × x× x) + 5 for all x in R.) Since the constant functions and f are continuous at a, it follows from Proposition 3.1.1 that every polynomial is continuous at a. (b) Every rational function r can be written in the form p/q, where p and q are polynomials. By the result of part (a), p and q are continuous a. It follows from Proposition 3.1.1 that p/q is also continuous at a, provided that q(a) 6= 0. (c) Recall that cos x = sin(π/2− x) ∀x ∈ R. That is, cos x = h(g(x)) where h(x) = sinx and g(x) = π/2 − x. Since the polynomial g is continuous at a and the sine function h is continuous everywhere, the cosine function must also be continuous at a by Proposition 3.1.2. c©2020 School of Mathematics and Statistics, UNSW Sydney 3.2. CONTINUITY ON INTERVALS 65 Finally, since tan x = sinx cos x , the tangent function is also continuous at a (provided that a ∈ Dom(tan)) by Proposition 3.1.1. The next example justifies an assertion made in Section 2.5.2. We continue working under the assumption that the constant functions and the function f : R → R, given by f(x) = x, are continuous everywhere. Example 3.1.4. Suppose that g : R → R is continuous at the point a and let |g| denote the function given by |g|(x) = |g(x)| ∀x ∈ R. Show that |g| is continuous at the point a. Remark 3.1.5. Throughout this section we have taken for granted that the constant functions, the sine function and the function f : R → R, given by f(x) = x, are continuous everywhere. To prove this, one needs a rigorous definition of the limit of a function at a point. Such a definition is similar in flavour to the definition for the limit of a function at infinity, but will not be presented in these notes. Example 3.1.6. Suppose that a and b are real numbers and consider the function f : R→ R given by f(x) = { eax+b if x ≥ 0 cos x if x < 0. For what values of a and b will the function f be continuous at 0? Solution. We require that lim x→0− f(x) = f(0) = lim x→0+ f(x). Now lim x→0− f(x) = lim x→0− cos x = 1, f(0) = eb and lim x→0+ f(x) = lim x→0+ eax+b = eb. So we require that eb = 1, which implies that b = 0. There is no restriction on a. 3.2 Continuity on intervals (Ref: SH10 §2.4) In section 2.5.2, a definition for continuity at a point was given. We now extend this definition to define what is meant by continuity on an interval. Definition 3.2.1. Suppose that f is a real-valued function defined on an open interval (a, b). We say that f is a continuous on (a, b) if f is continuous at every point in the interval (a, b). c©2020 School of Mathematics and Statistics, UNSW Sydney 66 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS Definition 3.2.2. Suppose that f is a real-valued function defined on a closed interval [a, b]. We say that (a) f is continuous at the endpoint a if lim x→a+ f(x) = f(a), (b) f is continuous at the endpoint b if lim x→b− f(x) = f(b), and (c) f is continuous on the closed interval [a, b] if f is continuous on the open interval (a, b) and at each of the endpoints a and b. To illustrate these definitions, consider the functions f , g and h, whose graphs are shown below. x f(x) | | a b b b bc x g(x) | | a b b b b bc x h(x) | | a b b b All three functions are defined on the interval [a, b]. We see that • f is continuous on the open interval (a, b) and at the endpoint b; • g is continuous at the endpoints a and b but not continuous on the open interval (a, b); and • h is continuous on the closed interval [a, b] (and, by implication, on the open interval (a, b) and at both endpoints a and b). 3.3 The intermediate value theorem (Ref: SH10 §§2.6, B.1) The idea of this section is very simple. Suppose that a function f is continuous on some closed interval [a, b] and that z is any real number that lies between f(a) and f(b), as illustrated below. x f(x) | | | | | | f(a) z f(b) a c b b b c©2020 School of Mathematics and Statistics, UNSW Sydney 3.3. THE INTERMEDIATE VALUE THEOREM 67 It is clear from the graph that one can find a real number c in [a, b] such that f(c) = z. The intermediate value theorem, which is stated below, says that this can always be done for any function that is continuous on [a, b]. Theorem 3.3.1 (The intermediate value theorem). Suppose that f is continuous on the closed interval [a, b]. If z lies between f(a) and f(b) then there is at least one real number c in [a, b] such that f(c) = z. The intermediate value theorem is proven using the least upper bound property of the real numbers. (It is this property that distinguishes the real numbers from the rational numbers.) We shall not examine the proof here. We note three important points about the theorem. First, there may be more than one number c in [a, b] that satisfies the conclusion of the intermediate value theorem. The diagram below illustrates this possibility. x f(x) | | | | |f(a) z f(b) a b b b c1 c2 c3 Second, the condition in Theorem 3.3.1 that f is continuous cannot be relaxed. For example, consider the function f : [a, b]→ R whose graph is shown below. x f(x) | | | | | f(a) z f(b) a b b b b bc Because of the discontinuity, z is not in the range of f , even though z lies between f(a) and f(b). That is, there is no real number c in [a, b] such that f(c) = z. Third, the intermediate value theorem is a theorem about continuous functions defined on the real numbers. There is no analogous theorem for continuous functions defined on the rational numbers. For example, consider the functions f : R→ R and g : Q→ Q, given by f(x) = x2 − 2 and g(x) = x2 − 2. c©2020 School of Mathematics and Statistics, UNSW Sydney 68 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS Both functions are continuous on their domains. Also, f(0) = g(0) = −2 and f(2) = g(2) = 2. However, while there is a number c in [0, 2] such that f(c) = 0, there is no number c in [0, 2] such that g(c) = 0 (since √ 2 /∈ Dom(g)). One of the chief applications of the intermediate value theorem is determining whether an equation has a solution. Moreover, if a solution exists, then the theorem helps us determine an approximate location of this solution. This is particularly helpful when the equation cannot be solved by simple algebra. Example 3.3.2. Show that the equation e2x = sinx+ 4 (3.1) has at least one positive solution. Solution. Consider the function f given by f(x) = e2x − sinx− 4. Now f(0) = e0 − sin 0− 4 = −3 while f(π/2) = eπ + 1− 4 ≈ 20.14. Since f is continuous on the closed interval [0, π/2] and 0 lies between f(0) and f(π/2), the intermediate value theorem implies that there is a real number c in [0, π/2] such that f(c) = 0. That is, e2c = sin c+ 4 for some c in [0, π/2]. Moreover, one can check that c 6= 0. Hence the equation has at least one positive solution c. Note that in equation (3.1), one cannot isolate x to find an explicit solution. However, now that it is known a solution exists in the interval [0, π/2], one could use Newton’s method to find a good approximation to this solution. 3.4 The maximum-minimum theorem (Ref: SH10 §§2.6, B.2) In many situations, it is desirable to know the maximum (or minimum) value of a quantity that varies over an interval of time. Examples include the maximum temperature in a 24 hour period or the minimum stock price over a seven day period. Depending on the quantity and the time interval, a maximum or minimum value may not be attained. The maximum-minimum theorem gives conditions under which a maximum and minimum is attained. To clarify discussion, we begin with a definition. c©2020 School of Mathematics and Statistics, UNSW Sydney 3.4. THE MAXIMUM-MINIMUM THEOREM 69 Definition 3.4.1. Suppose that f is defined on a closed interval [a, b]. (a) We say that a point c in [a, b] is an absolute minimum point for f on [a, b] if f(c) ≤ f(x) for all x in [a, b]. The corresponding value f(c) is called the absolute minimum value of f on [a, b]. If f has an absolute minimum point on [a, b] then we say that f attains its minimum on [a, b]. (b) We say that a point d in [a, b] is an absolute maximum point for f on [a, b] if f(x) ≤ f(d) for all x in [a, b]. The corresponding value f(d) is called the absolute maximum value of f on [a, b]. If f has an absolute maximum point on [a, b] then we say that f attains its maximum on [a, b]. An absolute maximum point and an absolute minimum point are sometimes referred to as a global maximum point and a global minimum point. Example 3.4.2. Consider the functions g and h, whose graphs are illustrated below. x g(x) | | | | | 1 2 3 4 5 | | | | 1 2 3 4 x h(x) | | | | | 1 2 3 4 5 | | | | 1 2 3 4 b b The absolute minimim and maximum points of g and h on [1, 5] are recorded in the following table. g h Absolute minimum points none 3 Absolute minimum value n.a. 1 Absolute maximum points 2, 4 none Absolute maximum value 4 n.a. The above example shows that a function f : [a, b] → R need not have an absolute maximum point (or an absolute minimum point) on a closed interval [a, b]. However, if f is continuous on [a, b] then such points always exist. Theorem 3.4.3 (The maximum-minimum theorem). If f is continuous on a closed interval [a, b] then f attains its minimum and maximum on [a, b]. That is, there exist points c and d in [a, b] such that f(c) ≤ f(x) ≤ f(d) for all x in [a, b]. c©2020 School of Mathematics and Statistics, UNSW Sydney 70 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS As with the intermediate value theorem, the least upper bound property of the real numbers is used to prove the maximum-minimum theorem. We omit the proof. Example 3.4.4. Suppose that the function f : R→ R is given by the rule f(x) = x2 − 10x+ 16. Find the absolute maximum and absolute minimum values of f on the interval [1, 10]. Solution. Since f(x) = (x− 2)(x− 8), the function f has zeros at 2 and 8. x f(x) | | 1 10 2 8 5 By symmetry, we know that the axis of the parabola is the line x = 5. Hence 5 is an absolute minimum point for f on [1, 10]. Since f(5) = −9, the absolute minimum value of f on [1, 10] is −9. It is clear from the sketch of f that at least one of the endpoints of the interval [1, 10] must be an absolute maximum point for f on [1, 10]. Now f(1) = 7 while f(10) = 16. So 10 is an absolute maximum point for f on [1, 10] and the corresponding maximum value of f is 16. Remark 3.4.5. While continuity on a closed interval guarantees the existence of at least one absolute maximum point and absolute minimum point, locating such points may not be easy. A systematic approach to finding maximum and minimum points will be developed in Chapters 4 and 5. Remark 3.4.6. All the hypotheses stated in the maximum-minimum theorem must be satisfied for the conclusion of the theorem to be valid. If the interval is closed but the function is not continuous, then an absolute maximum (or minimum) point may not exist (see Example 3.4.2). The next example shows that if the function is continuous on the interval but the interval is not closed then an absolute maximum (or minimum) point may not exist. It is expected that students will know the precise hypotheses of the maximum-minimum theorem (and other important theorems, such as the intermediate value theorem) and know how to apply the theorem correctly. Example 3.4.7. Consider the function f : R→ R given by f(x) = x+1. Since f is a polynomial, it is continuous on the open interval (0, 1). However, f does not have an absolute maximum point on (0, 1). To see why, consider the graph of f on (0, 1). x f(x) | 10 1 2 | c©2020 School of Mathematics and Statistics, UNSW Sydney 3.4. THE MAXIMUM-MINIMUM THEOREM 71 As x → 1−, f(x) → 2 but there is no real number d in (0, 1) such that f(d) = 2. By a similar observation, f has no absolute minimum point on (0, 1). The final result of this chapter is a corollary of maximum-minimum theorem. We begin with a definition. Definition 3.4.8. A function f is said to be bounded on an interval I if there is some positive number M such that |f(x)| ≤M for all x in I. In other words, f is bounded if the y-values of its graph lie between −M and M for some positive number M . The sine and cosine functions are obvious examples of functions that are bounded on the whole of the real line. Corollary 3.4.9. If f : [a, b]→ R is continuous on [a, b] then f is bounded on [a, b]. Proof. Suppose that f is continuous on [a, b]. Then the function |f | is also continuous on [a, b] (see Example 3.1.4). By the maximum-minimum theorem, |f | attains its maximum on [a, b]. If M denotes the absolute maximum value of |f | on [a, b] then |f(x)| ≤M ∀x ∈ [a, b]. Hence f is bounded. c©2020 School of Mathematics and Statistics, UNSW Sydney 72 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS Problems for Chapter 3 Problems 3.1 : Combining continuous functions and 3.2 : Continuity on intervals 1. [R] Suppose that f : R→ R is defined by f(x) = |x|. a) Show that f is continuous at 0. b) Is f continuous everywhere? Give brief reasons for your answer. 2. [R] Determine at which points each function f : R→ R is continuous. Give reasons. a) f(x) = { e2x if x < 0 cos x if x ≥ 0 b) f(x) = e−2x if x < 0 sinx+ 1 if 0 ≤ x ≤ π/2 2x− π if x > π/2 3. [R] Suppose that f(x) = { x2−16 x−4 if x 6= 4 k if x = 4, where k is a real number. For which values of k (if any) will f be continuous everywhere? 4. [H] Use the pinching theorem for limits to show that if f , g and h are three functions defined on an open interval I, such that • f(x) ≤ g(x) ≤ h(x) for all x ∈ I, • f(a) = g(a) = h(a) for some a ∈ I, and • f and h are continuous at a, then g is also continuous at a. Problems 3.3 : The intermediate value theorem 5. [R] Show that the function f , given by f(x) = x3− 5x+3, has a zero in each of the intervals [−3,−2], [0, 1] and [1, 2]. 6. [R] [V] Use the intermediate value theorem to show that the equation ex = 2cos x has at least one positive real solution. 7. [H] Suppose that f is continuous on [0, 1] and that Range(f) is a subset of [0, 1]. By using g(x) = f(x)− x, prove that there is a real number c in [0, 1] such that f(c) = c. c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 3 73 Problems 3.4 : The maximum-minimum theorem 8. [R] In each case, determine whether or not f attains a maximum on the given interval. Give reasons for your answer. a) f(x) = x2 − 4 on [−3, 5] b) f(x) = ∣∣∣∣sin(ex) + lnxx2 − 1 ∣∣∣∣ on [2, 4] c) f(x) = x2 − 4 on (−3, 5) d) f(x) = −(x2 − 4) on (−3, 5) 9. [H] [V] Suppose that f is a continuous function on R and that lim x→∞ f(x) = limx→−∞ f(x) = 0. a) Give an example of such a function which has both a maximum value and a minimum value. b) Give an example of such a function which has a minimum value but no maximum value. c©2020 School of Mathematics and Statistics, UNSW Sydney 74 CHAPTER 3. PROPERTIES OF CONTINUOUS FUNCTIONS c©2020 School of Mathematics and Statistics, UNSW Sydney 75 Chapter 4 Differentiable functions A cyclist accelerates from rest in such a way that her displacement s (in metres) from her starting position after t seconds is given by s = t2 ∀t ∈ [0, 3]. The corresponding displacement–time graph for the first three seconds is shown below. | | 1 4 9 | | 1 2 30 b b displacement (m) time (s) What is the cyclist’s instantaneous velocity when t = 1? At present, we do not have a formula for calculating instantaneous velocity. However, we can calculate the average velocity v of the cyclist over any time interval by using the formula v = ∆s ∆t , where ∆s denotes the change in displacement corresponding to a change ∆t in time. For example, the average velocity over the time interval [1, 1.5] is 2.5 metres per second, as calculated below: v[1,1.5] = ∆s ∆t = (1.5)2 − (1)2 1.5− 1 = 2.5. (4.1) Average velocities over small time intervals are recorded in the following table. c©2020 School of Mathematics and Statistics, UNSW Sydney 76 CHAPTER 4. DIFFERENTIABLE FUNCTIONS Time interval ∆t Average velocity [1, 1.5] 0.5 2.5 [1, 1.4] 0.4 2.4 [1, 1.3] 0.3 2.3 [1, 1.2] 0.2 2.2 [1, 1.1] 0.1 2.1 [1, 1.01] 0.01 2.01 [1, 1.001] 0.001 2.001 The table suggests that the average velocity approaches 2 meters per second as ∆t→ 0. Hence it appears that instantaneous velocity of the cyclist when t = 1 is 2 meters per second. One of the goals of this chapter is put these kinds of ideas on a rigorous footing in a general context (where they can be applied to understanding rates of change of any quantity, not just displacement). Hence we reframe these ideas in terms of functions and analytic geometry. More precisely, we shall now show that the problem of determining instantaneous velocity is equivalent to the problem of finding the gradient of a tangent line to the graph of a function. To begin, the problem of calculating average velocity is equivalent to the problem of calculating the gradient of a secant to the graph of a function. (A secant is a straight line that intersects a curve at least twice.) For example, compare the calculation of average velocity in (4.1) with the calculation of the gradient of the secant shown in the following diagram. t s b b | | 1 2.25 | | 1 1.5 s = t2 secant gradient of secant = ∆s∆t = (1.5)2−(1)2 1.5−1 = 2.5 In the same way, the above table of average velocities can be reinterpreted geometrically as a table of gradients corresponding to secants of the curve s = t2. Intersection points ∆t Gradient of secant t = 1, t = 1.5 0.5 2.5 t = 1, t = 1.4 0.4 2.4 t = 1, t = 1.3 0.3 2.3 t = 1, t = 1.2 0.2 2.2 t = 1, t = 1.1 0.1 2.1 c©2020 School of Mathematics and Statistics, UNSW Sydney 4.1. GRADIENTS OF TANGENTS AND DERIVATIVES 77 The secants used in the table are illustrated over the page in Figure 4.1. The figure suggests that as ∆t→ 0, the gradient of the corresponding secant approaches the gradient of the tangent line to the curve. Thus the problem of determining instantaneous velocity is equivalent to the problem of determining the gradient of a tangent to the graph of a function. It is to this general problem that we now turn. 4.1 Gradients of tangents and derivatives (Ref: SH10 §3.1) Suppose that a function f has a ‘smooth’ graph. In this section, we give a general approach to calculating the gradient of tangents to f . This leads the concept of a gradient function, also known as a derivative (since it is derived from f), whose values give the gradient of the tangent to f at any point. We begin with an example. Example 4.1.1. Suppose that f : R→ R is given by f(x) = x3. Find the gradient of the tangent to the graph of f when x = 2. Solution. The tangent to the graph of f passes through the point P (2, 23). We will approximate the gradient of the tangent with the gradient of a secant passing through the points P (2, 23) and Q(2 + h, (2 + h)3), where h is a very small real number. (See the illustration below for the case when h > 0.) x y | | |23 2 2 + h b b P (2, 23) Q(2 + h, (2 + h)3) secant tangent y = f(x) Using the ‘rise over run’ formula, the gradient of the secant is (2 + h)3 − 23 h . Now, as h approaches 0, the point Q moves along the curve towards P , and hence the gradient of c©2020 School of Mathematics and Statistics, UNSW Sydney 78 CHAPTER 4. DIFFERENTIABLE FUNCTIONS b b b b b b | | | | | | 1 1.21 1.44 1.69 1.96 2.25 | | | | | | 1 1.1 1.2 1.3 1.4 1.5 t s tangent s = t2 secants Figure 4.1: Using secants to determine the gradient of a tangent. c©2020 School of Mathematics and Statistics, UNSW Sydney 4.1. GRADIENTS OF TANGENTS AND DERIVATIVES 79 the secant approaches the gradient of the tangent. In other words, (gradient of tangent) = lim h→0 (2 + h)3 − 23 h = lim h→0 8 + 12h+ 6h2 + h3 − 8 h = lim h→0 12h + 6h2 + h3 h = lim h→0 12 + 6h+ h2 = 12. Thus the gradient of the tangent to f at the point 2 is 12. This gives a quantitative description of the rate of change of f at the point 2. There is nothing special about the point 2 in the previous example; we could calculate the gradient of the tangent to f at any point x. The calculation would go something like this: (gradient of tangent at x) = lim h→0 (x+ h)3 − x3 h = lim h→0 x3 + 3x2h+ 3xh2 + h3 − x3 h = lim h→0 3x2h+ 3xh2 h = lim h→0 3x2 + 3xh = 3x2. So the gradient of the tangent to f when x = 5 is 75 (since 3× 52 = 75), while the gradient of the tangent when x = −4 is 48. The gradient function of f , denoted by f ′ and given by f ′(x) = 3x2 ∀x ∈ R, is called the derivative of f . We can repeat the same procedure for functions other than f(x) = x3. The gradient function (or derivative) may be calculated using the limit given in the next definition. Definition 4.1.2. Suppose that f is defined on some open interval containing the point x. We say that f is differentiable at x if lim h→0 f(x+ h)− f(x) h exists. If the limit exists, we denote it by f ′(x). We call f ′(x) the derivative of f at x. Other notation for f ′(x) includes dfdx(x) and d dxf(x). If y = f(x) then the derivative is often denoted by y′ or dydx . The ratio f(x+ h)− f(x) h c©2020 School of Mathematics and Statistics, UNSW Sydney 80 CHAPTER 4. DIFFERENTIABLE FUNCTIONS is called the difference quotient for f at the point x. Remark 4.1.3. The notation dydx originates from the gradient formula ∆y ∆x , where ∆y represents a change in y corresponding to a small change ∆x in x. (The symbol ∆ represents the Greek letter ‘delta’, which is equivalent to our capital ‘D’.) Some calculus texts prefer the use of delta notation and write the difference quotient for a function f as f(x+∆x)− f(x) ∆x . Example 4.1.4. Suppose that f : [0,∞) → R is given by f(x) = √x. Find the equation of the tangent to f when x = 16. Solution. First we shall calculate the derivative of f (if it exists) at 16. Note that, in the working below, the limit as h approaches 0 is difficult to evaluate immediately, so we use a standard trick (see, for example, Subsection 2.1.4): f(16 + h)− f(16) h = √ 16 + h−√16 h = √ 16 + h−√16 h × √ 16 + h+ √ 16√ 16 + h+ √ 16 = 16 + h− 16 h( √ 16 + h+ √ 16) = h h( √ 16 + h+ √ 16) = 1√ 16 + h+ √ 16 → 1 8 as h→ 0. So the gradient of the tangent is 1/8. Using the point-gradient formula, the equation of the tangent is y − y1 = m(x− x1) y − 4 = 1 8 (x− 16) 8y − x− 16 = 0. Example 4.1.5. Suppose that f : R → R is given by f(x) = |x|. Determine whether or not f is differentiable at the point 0. Solution. We calculate the difference quotient of f at 0: f(0 + h)− f(0) h = |h| h = { h h if h > 0 −h h if h < 0. = { 1 if h > 0 −1 if h < 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 4.2. RULES FOR DIFFERENTIATION 81 Because of the ‘split formula,’ we must consider the left- and right-hand limits as h approaches 0 separately. Now lim h→0− f(0 + h)− f(0) h = −1 while lim h→0+ f(0 + h)− f(0) h = 1. Since the left- and right-hand limits are different, the two-sided limit doesn’t exist and hence f is not differentiable at 0. Note that this conclusion makes sense geometrically, since the graph of f has a ‘vertex’ at 0 and there is no unique tangent that touches the graph at this point. x y Example 4.1.6. Suppose that f : R→ R is defined by f(x) = { x sin 1x if x 6= 0 0 if x = 0. Determine whether or not f is differentiable at 0. 4.2 Rules for differentiation (Ref: SH10 §§3.2, 3.5, 3.6) So far we have calculated derivatives working directly from the difference quotient. In this section we see that there are fast ways of calculating derivatives without referring to the difference quotient and limits at all. This fast method is part of what is known as ‘the calculus.’ We begin with the derivatives of some standard functions, and then show how derivatives of other functions are obtained by applying some simple rules. The following table gives the derivatives for some standard functions. f(x) f ′(x) C, where C is a constant 0 xn, where n is a positive integer nxn−1 sinx cos x ex ex c©2020 School of Mathematics and Statistics, UNSW Sydney 82 CHAPTER 4. DIFFERENTIABLE FUNCTIONS With these basic functions, it is possible to construct many other functions via function addition, subtraction, multiplication and division. The next theorem describes what happens to the resulting derivatives. Theorem 4.2.1 (Rules for differentiation). Suppose that f and g are differentiable at x. Then f + g, f − g and fg are differentiable at x. If g(x) 6= 0 then fg is also differentiable at x. Moreover, (i) (f + g)′(x) = f ′(x) + g′(x), (ii) (C.f)′(x) = C.f ′(x), where C is a constant, (iii) (fg)′(x) = f ′(x)g(x) + f(x)g′(x), and (iv) ( f g )′ (x) = f ′(x)g(x) − f(x)g′(x) g(x)2 , provided that g(x) 6= 0. Rules (iii) and (iv) are usually called the product and quotient rules. A familiar way of express- ing these rules is d(uv) dx = v du dx + u dv dx and d dx (u v ) = v dudx − u dvdx v2 , where u and v are both functions of x. Given two functions f and g, a new function f ◦g may be constructed by function composition. The next theorem describes the derivative of this new function. Theorem 4.2.2 (The chain rule). Suppose that g is differentiable at the point x and f is differen- tiable at the point g(x). Then f ◦ g is differentiable at x and (f ◦ g)′(x) = f ′(g(x))g′(x). (4.2) You are probably more familiar with the chain rule expressed in the following way: if y = f(u) and u = g(x) then dy dx = dy du du dx . (4.3) While formula (4.3) looks simpler than (4.2), it does not specify the relationship between the variables or the points at which each derivative is to be evaluated. It should therefore be used with care. The next example illustrates how the chain rule formula (4.2) is applied. Example 4.2.3. Suppose that y = (sinx+ 1)7. Find the derivative of y at the point x. Solution. Suppose that f(x) = x7 and g(x) = sinx+1. Then f ′(x) = 7x6 and g′(x) = cos x. Then y = (f ◦ g)(x). By the chain rule, the derivative of y at x is given by (f ◦ g)′(x) = f ′(g(x))g′(x) = 7(g(x))6 × cos x = 7(sin x+ 1)6 cos x. c©2020 School of Mathematics and Statistics, UNSW Sydney 4.3. PROOFS OF DIFFERENTIATION RULES 83 As mentioned at the beginning of this section, derivatives of many functions can be obtained from those of a few basic functions by the application of simple rules. The next example illustrates this point. Example 4.2.4. Show that (a) d dx ( xm ) = mxm−1, where m is an integer; (b) d dx ( cosx ) = − sinx; and (a) d dx ( tanx ) = sec2 x. 4.3 Proofs of differentiation rules In Section 4.2 we stated two theorems and gave a table of derivatives. In this section we prove some of these results. We begin with the derivative of the sine function, using the standard result that lim θ→0 sin θ θ = 1. (4.4) For a geometric argument for this limit, see Question 12 from the problems for Chapter 2. Proposition 4.3.1. If f(x) = sinx then f ′(x) = cos x for all real x. Proof. The difference quotient for the sine function and the angle sum formula for sine gives sin(x+ h)− sinx h = sinx cos h+ cos x sinh− sinx h = sinx cos h− 1 h + cosx sinh h . (4.5) Now cos2 h+ sin2 h = 1 so cos h− 1 h = cos2 h− 1 h(cos h+ 1) = − sin2 h h(cos h+ 1) = −sinh h sinh cos h+ 1 → −1× 0 = 0 as h→ 0, by (4.4). If we combine this with (4.5) then lim h→0 sin(x+ h)− sinx h = sinx lim h→0 cos h− 1 h + cos x lim h→0 sinh h = sinx× 0 + cos x× 1 = cos x. Thus f ′(x) = cos x. c©2020 School of Mathematics and Statistics, UNSW Sydney 84 CHAPTER 4. DIFFERENTIABLE FUNCTIONS The derivative of ex will be dealt with in Chapter 9. The derivative of constant functions can be easily computed from Definition 4.1.2. Suppose that f is given by f(x) = xn, where n is a positive integer. The derivative of f can be computed using the same method as Example 4.1.1, with the assistance of the binomial theorem. Alternatively, one can use mathematical induction and the product rule for differentiation. Proof of the product rule for differentiation. Suppose that f and g are differentiable at the point x. The difference quotient of fg at x gives (fg)(x+ h)− (fg)(x) h = f(x+ h)g(x + h)− f(x)g(x) h = f(x+ h)g(x + h)− f(x)g(x+ h) + f(x)g(x+ h)− f(x)g(x) h = g(x+ h) f(x+ h)− f(x) h + f(x) g(x+ h)− g(x) h . (4.6) Now g is differentiable at x, so g is also continuous at x (see Theorem 4.5.1). Hence g(x+h)→ g(x) as h→ 0. Therefore (4.6) implies that (fg)(x + h)− (fg)(x) h → g(x)f ′(x) + f(x)g′(x) as h→ 0. Proofs of other differentiation rules are found in most undergraduate calculus textbooks. 4.4 Implicit differentiation (Ref: SH10 §3.7) In Section 1.8 we introduced functions that are defined by an equation relating the variables x and y. Using the chain rule, one can calculate the derivative of such functions without first expressing y explicitly as a function of x. We begin with a simple example to illustrate the principle. The point to keep in mind is that y is a function of x. Example 4.4.1. Suppose that y is a function of x, and that y and x are related by the formula x2 + y2 = 1. Calculate dydx . Solution. We begin with the equation x2 + y2 = 1 and differentiate both sides with respect to x to obtain d dx (x2) + d dx (y2) = d dx (1). (4.7) c©2020 School of Mathematics and Statistics, UNSW Sydney 4.4. IMPLICIT DIFFERENTIATION 85 Now clearly, ddx(x 2) = 2x and ddx(1) = 0. To simplify the central term of (4.7), we regard y as a function of x and suppose that g(y) = y2. The chain rule gives d dx (y2) = dg dx = dg dy dy dx = 2y dy dx . So continuing from (4.7) we have 2x+ 2y dy dx = 0. By isolating dydx we obtain dy dx = −2x 2y = −x y as required. (We note in passing that this answer coincides with the derivative obtained by first expressing y as a function of x and then differentiating. In particular, if y = (1− x2)1/2 then dy dx = 1 2 (1− x2)−1/2(−2x) = − x (1− x2)1/2 = − x y as above.) In general, if g is a function of y and y is a function of x then d dx g(y) = g′(y) dy dx (4.8) by the chain rule. The use of this formula is illustrated below. Example 4.4.2. Suppose that y is a function of x, implicitly related by the equation y4 + x3 − x2e3y = 4. (4.9) Find the equation of the tangent to the corresponding curve at the point where (x, y) = (2, 0). In Section 4.2 we saw that ddxx n = nxn−1 whenever n is an integer. A similar formula holds when n is any rational number. This can be easily proved using implicit differentiation. Proposition 4.4.3. Suppose that q is a rational number. Then d dx xq = qxq−1. c©2020 School of Mathematics and Statistics, UNSW Sydney 86 CHAPTER 4. DIFFERENTIABLE FUNCTIONS Proof. Since q is a rational number, there are integers m and n such that q = mn and n 6= 0. Now if y = xq then y = xm/n and taking the nth power of both sides gives yn = xm. Differentiating both sides with respect to x yields nyn−1 dy dx = mxm−1. Hence dy dx = mxm−1 nyn−1 = m n xm−1 xq(n−1) = qx(m−1)−qn+q = qxq−1 as required. 4.5 Differentiation, continuity and split functions (Ref: SH10 §3.1) Not every function that is continuous at a point a is differentiable at a. (Consider, for example, the function f , given by f(x) = |x|, at the point 0.) However, we do have the following result. Theorem 4.5.1. If f is differentiable at a then f is continuous at a. Proof. Suppose that f is differentiable at the point a. Then lim h→0 f(a+ h)− f(a) h exists. To show that f is continuous at a, we need to show that lim x→a f(x) = f(a), or equivalently, that lim h→0 f(a+ h) = f(a). Now lim h→0 f(a+ h)− f(a) = lim h→0 ( f(a+ h)− f(a) h × h ) = ( lim h→0 f(a+ h)− f(a) h ) × lim h→0 h (4.10) = f ′(a)× 0 = 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 4.5. DIFFERENTIATION, CONTINUITY AND SPLIT FUNCTIONS 87 (Note that we can separate the limits in (4.10) because we know that the limit of the difference quotient exists.) Hence lim h→0 f(a+ h) = f(a), proving the theorem. The contrapositive form of the theorem is stated below. Corollary 4.5.2. If f is not continuous at a then it is not differentiable at a. Remark 4.5.3. Differentiability is a much stronger property than continuity. While every differen- tiable function is continuous, there exist functions that are continuous everywhere but differentiable nowhere. In fact, functions that are differentiable everywhere are a very rare breed, even among the continuous functions. An example of a function that is continuous everywhere but differentiable nowhere is the Weierstrass function whose graph is shown below. x y The next function studied is an example of a ‘split function’. Whether or not the function is differentiable at the ‘split point’ can be determined by calculating left- and right-hand limits of the difference quotient. Example 4.5.4. Suppose that f : R→ R is defined by f(x) = { sinx if x ≥ 0 x2 + bx+ c if x < 0, where b and c are real numbers. Find all possible values of b and c such that f is (a) continuous at 0 and (b) differentiable at 0. Solution. (a) For f to be continuous at 0, we require that lim x→0− f(x) = f(0) = lim x→0+ f(x). Now f(0) = sin 0 = 0 while lim x→0− f(x) = c and lim x→0+ f(x) = 0. Hence f is continuous if and only if c = 0 and b ∈ R. (b) If f is differentiable at 0 then it is continuous at 0 and hence c = 0 by part (a). Also, for differentiability we require that lim h→0− f(0 + h)− f(0) h = lim h→0+ f(0 + h)− f(0) h . c©2020 School of Mathematics and Statistics, UNSW Sydney 88 CHAPTER 4. DIFFERENTIABLE FUNCTIONS Now, lim h→0+ f(0 + h)− f(0) h = lim h→0+ sinh− sin 0 h = lim h→0+ sinh h = 1 while lim h→0− f(0 + h)− f(0) h = lim h→0− h2 + bh− 0 h = lim h→0− h+ b = b. Hence we require that b = 1. So f is differentiable at 0 if and only if b = 1 and c = 0. The graph of the corresponding function is sketched below. x f(x) || | −1 1 2π An important application of split functions is the theory of splines. A spline is a function defined piecewise by polynomials. They are frequently used in computer graphics and elsewhere because of their simplicity and their capacity to approximate complex curves. The following is an example of a spline. Example 4.5.5. f(x) = { (x− 1)2 − 1 if − 2 ≤ x < 0 1− (x− 1)2 if 0 ≤ x ≤ 2. In this example, each of the functions p(x) = (x − 1)2 − 1 and q(x) = 1 − (x − 1)2 have had their domains restricted and then ‘glued’ together at the point 0. If the constituent functions are continuous and differentiable in some interval containing the point a, then the following theorem tells us when the corresponding split function is differentiable. Theorem 4.5.6. Suppose a is a fixed real number and the function f is defined by f(x) = { p(x) if x ≥ a q(x) if x < a, where p(x) and q(x) are continuous and differentiable in some interval containing a. Then if f is continuous at a and p′(a) = q′(a), then f is differentiable at x = a. This theorem in particular can be applied to splines since the constituent functions are polyno- mials. c©2020 School of Mathematics and Statistics, UNSW Sydney 4.6. DERIVATIVES AND FUNCTION APPROXIMATION 89 Example 4.5.7. f(x) = { sinx if x ≥ 0 x2 + x if x < 0. The functions sinx and x2 + x are continuous and differentiable everywhere. We saw in Example 4.5.4 that f is continuous at x = 0. Also, with p(x) = sinx and q(x) = x2 + x, we have p′(0) = cos 0 = 1 and q′(0) = 1 so f is differentiable at x = 0. 4.6 Derivatives and function approximation (Ref: SH10 §4.11) The last three sections of this chapter are devoted to some applications of the derivative. Suppose that a function f is differentiable at a. Consider its graph and the tangent to f at a, shown below. x y | |f(a) a b The equation of the tangent to f at a may be calculated using the point-gradient formula: y − f(a) = f ′(a)(x − a) y = f ′(a)(x − a) + f(a). Observe in the above diagram that the tangent line is close to the graph of f for all points that are close to a. In other words, f(x) ≈ f ′(a)(x− a) + f(a) (4.11) when x is close to a. Formula (4.11) reflects the fact that every differentiable function can be locally approximated by a linear function. We use this principle in the following example. Example 4.6.1. Estimate √ 9.001 without using a calculator. Solution. Suppose that f(x) = √ x. Since 9.001 is close to 9 and f is a continuous function, we could use the approximation √ 9.001 = f(9.001) ≈ f(9) = 3. (4.12) However, we can do better than this. We shall approximate f with a linear function at the point 9 using (4.11). Since f ′(9) = 1/6 we have f(x) ≈ (x− 9)/6 + 3 c©2020 School of Mathematics and Statistics, UNSW Sydney 90 CHAPTER 4. DIFFERENTIABLE FUNCTIONS whenever x is close to 9. When x = 9.001 this gives √ 9.001 = f(9.001) ≈ (9.001 − 9)/6 + 3 = 3 + 1/6000. (4.13) The table below gives the error involved in each approximation. Approximation Error Approximation (4.12) 1.667−4 Approximation (4.13) 4.637−9 Clearly (4.13) is the superior approximation. 4.7 Derivatives and rates of change (Ref: SH10 §3.4) Many physical processes involve quantities (such as temperature, volume, concentration, velocity) that change with time. If Q is a quantity that varies with time, then the derivative dQdt gives the rate of change of that quantity with respect to time. The chain rule is a major tool for answering questions about rates of quantities which are related. Example 4.7.1. A spherical balloon is being inflated and its radius is increasing at a constant rate of 6 mm/sec. At what rate is its volume increasing when the radius of the balloon is 20 mm? Solution. Suppose that V (t) is the volume of the balloon and r(t) is its radius at time t. We are told that drdt = 6 and we need to find dV dt when r = 20. By the chain rule, dV dt = dV dr dr dt . Now V = 43πr 3 so dVdr = 4πr 2. Hence dV dt = 4πr2 × 6. When r = 20, dV dt = 24π(20)2 = 9600π. So the volume is increasing at a rate of 9600π mm3/sec when the radius is 20 mm. The above example illustrates an approach to solving such problems. 1. Define variables for the quantities involved. 2. Write down what is known in terms of these variables and their derivatives. 3. Write down what you need to find in terms of the these variables and their derivatives. 4. Write down anything else you know that relates the variables (for example, a volume or area formula). 5. Use the chain rule (or implicit differentiation) to find the relevant derivative. c©2020 School of Mathematics and Statistics, UNSW Sydney 4.8. LOCAL MAXIMUM, LOCAL MINIMUM AND STATIONARY POINTS 91 Sometimes drawing a diagram helps in the first few steps. The next example follows this procedure. Example 4.7.2. A point P moves to the right along the positive x-axis at a constant rate of 5 cm per second, and a point Q moves up the positive y-axis at a constant rate of 10 cm per second. How fast is the distance between them changing when OP = 30 cm and OQ = 40 cm? 4.8 Local maximum, local minimum and stationary points (Ref: SH10 §4.3) In Example 3.4.4, we determined the absolute maximum and minimum points of a continuous function over a closed interval. While the existence of these extreme points is given by the max- min theorem, the theorem provides no systematic way of locating these points. In this section we begin to develop a systematic approach to locating maxima and minima. A complete approach will be presented in the next chapter. We begin by defining what is meant by a local minimum point c. This is a point such that for all x sufficiently close to c, f(c) ≤ f(x). A more precise statement is below. x cc− h c+ h | | | b Local minimum point Definition 4.8.1. Suppose that f is defined on some interval I. We say that a point c in I is a local mininum point if there is a positive number h such that f(c) ≤ f(x) whenever x ∈ (c − h, c + h) and x ∈ I. We say that a point d in (a, b) is a local maximum point if there is a positive number h such that f(x) ≤ f(d) whenever x ∈ (d− h, d + h) and x ∈ I. The following theorem will be familiar from high school. Theorem 4.8.2. Suppose that f is defined on (a, b) and has a local maximum or minimum point at c for some c in (a, b). If f is differentiable at c then f ′(c) = 0. Proof. Suppose that f is a local maximum at c. To show that f ′(c) = 0 we consider the difference quotient f(c+ h)− f(c) h . c©2020 School of Mathematics and Statistics, UNSW Sydney 92 CHAPTER 4. DIFFERENTIABLE FUNCTIONS Since f is a maximum at c, f(c+ h) ≤ f(c) for h sufficiently close to 0. So if h is sufficiently small and positive, f(c+ h)− f(c) h ≤ 0, while if h is sufficiently small and negative, f(c+ h)− f(c) h ≥ 0. That is, lim h→0+ f(c+ h)− f(c) h ≤ 0 while lim h→0− f(c+ h)− f(c) h ≥ 0. Hence lim h→0 f(c+ h)− f(c) h = 0 which is equivalent to saying that f ′(c) = 0. The proof when c is a local minimum is similar and will be omitted. The set of points where the derivative of a function is zero is important. In particular, tangents to the graph of the function at these points are horizontal lines. This motivates the following definition. Definition 4.8.3. If a function f is differentiable at a point c and f ′(c) = 0 then c is called a stationary point of f . Example 4.8.4. Suppose that f : R → R is given by f(x) = 4x5 − 5x4 − 40x3 − 2. Find all the stationary points of f . Solution. We begin by differentiating f and factorising the derivative: f ′(x) = 20x4 − 20x3 − 120x2 = 20x2(x2 − x− 6) = 20x2(x+ 2)(x− 3). Hence the solutions to f ′(x) = 0 are x = −2, 0, 3. So the points −2, 0 and 3 are the stationary points of f . The graph of the function f appearing in Example 4.8.4 is shown below. c©2020 School of Mathematics and Statistics, UNSW Sydney 4.9. MAPLE NOTES 93 x f(x) | | | | 200 −200 −2 3 It is clear that −2 is a local maximum point of f and that 3 is a local minimum point of f . However, 0 is neither a local minimum point nor local maximum point, and none of these points are global maximum points or global minimum points. So while the local maxima and minima of a function occur where f ′ is zero, it does not follow that any point c satisfying f ′(c) = 0 is a local maximum point or local minimum point of f . Tools for identifying whether a stationary point is a local maximum, local minimum or neither are developed in the next chapter using an important result called the mean value theorem. 4.9 Maple notes The Maple diff command is used to compute derivatives: diff(f(x),x); calculates df dx and diff(f(x),x$n); calculates dnf dxn . For example, > # Note Maple use of csc to denote function we usually call cosec > diff(sqrt(csc(x^12-53*x^5-1)),x); −12 √ csc (x12 − 53x5 − 1) cot (x12 − 53x5 − 1) (12x11 − 265x4) > diff(x^3*exp(5*x),x); 3x2e5 x + 5x3e5x > diff(%,x); 6xe5 x + 30x2e5x + 25x3e5x > f2ndDeriv:=diff(x^3*exp(5*x),x$2); f2ndDeriv := 6xe5 x + 30x2e5 x + 25x3e5x c©2020 School of Mathematics and Statistics, UNSW Sydney 94 CHAPTER 4. DIFFERENTIABLE FUNCTIONS Problems for Chapter 4 Problems 4.1 : Gradients of tangents and derivatives 1. [R] Using the definition of the derivative, show that: a) if f(x) = x2 then f ′(x) = 2x; b) if f(x) = x3 then f ′(x) = 3x2; c) if f(x) = 1x then f ′(x) = −1x2 ; d) [H] if f(x) = √ x then f ′(x) = 1 2 √ x . Problems 4.2 : Rules for differentiation 2. [R] Find the derivative in each case. a) f(x) = 5(x4 + 3x7) b) g(x) = (x4 − 2x)(4x2 + 2x+ 5) c) h(y) = y2 y3 + 8 d) f(x) = x(x2 − 4)1/2 e) f(t) = t/ √ t2 − 4 f) g(y) = sin 3y − 3 cos2 2y g) g(x) = x4e−x h) f(x) = (x2 + 1) ln √ x3 + 1 i) f(x) = ln(etan x) j) f(x) = ln(cos x) 3. [H] Suppose that f : R→ R is defined by f(x) = x|x| for all x in R. a) If it exists, evaluate lim h→0+ f(0 + h)− f(0) h . b) If it exists, evaluate lim h→0− f(0 + h)− f(0) h . c) State the value of f ′(0) or explain why f is not differentiable at 0. 4. [R] [V] Determine at which points each function f is (i) differentiable; (ii) continuous. a) f(x) = |x| b) f(x) = { sinx if x ≤ 0 x if x > 0 c) f(x) = x3 − 6x+ 4 x2 + 4x+ 4 5. [R] Sketch the graph of f , where f(x) = x1/3. Is f differentiable at 0? Give reasons. 6. [R] (An exercise on notation.) Suppose that f(x) = x+ cos 2x. Write down a) f(x+ 17π) b) f ′(x+ nπ) c) f(2− x2) d) f ′(2− x2) e) ddx f(2− x2). c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 4 95 Problems 4.4 : Implicit differentiation 7. [R] Find dy dx in terms of x and y if a) x3 + y3 = xy b) x2 −√xy + y2 = 6. 8. [R] Find dy dx for the curve x4 + y4 = 16. Sketch the graph of the curve. 9. [R] [V] Find the equation of the line tangent to the curve x3 + y3 = 3(x + y) at the point (1, 2). Problems 4.5 : Differentiation, continuity and split functions 10. [R] Suppose that a and b are real numbers. Find all values of a and b (if any) such that the functions f and g, given by a) f(x) = { ax+ b if x < 0 sinx if x ≥ 0 and b) g(x) = { ax+ b if x < 0 e2x if x ≥ 0 , are (i) continuous at 0 and (ii) differentiable at 0. 11. [H] The function f : R→ R is is defined by f(x) = {√ x sin √ x if x ≥ 0 ax+ b if x < 0, where a and b are real numbers. Find all values of a and b (if any) such that f is differentiable at 0. Problems 4.6 : Derivatives and function approximation 12. [R] [V] Suppose that f(x) = 3 √ x. a) Without using a calculator, give a rough estimate of f(8.01). b) i) Find the equation of the tangent to f at the point (8, 2). ii) Use your answer to part (i) to find a different approximation for f(8.01). c) Using a calculator, determine the error for the approximation in (a) and in (b). Which approximation is better? Problems 4.7 : Derivatives and rates of change 13. [R] At a certain instant the side length of an equilateral triangle is a cm and this length is increasing at r cm/sec. How fast is the area increasing? c©2020 School of Mathematics and Statistics, UNSW Sydney 96 CHAPTER 4. DIFFERENTIABLE FUNCTIONS 14. [R] [V] A 5 m ladder is leaning against a vertical wall. Suppose that the bottom of the ladder is being pulled away from the wall at a rate of 1 m/sec. How fast is the area of the triangle underneath the ladder changing at the instant that the top of the ladder is 4 m from the floor? 15. [R] A spherical balloon is to be filled with water so that there is a constant increase in the rate of its surface area of 3 cm2/sec. (The surface area A and volume V of a sphere of radius r is given by A = 4πr2 and V = 43πr 3.) a) Find the rate of increase in the radius when the radius is 3 cm. b) Find the volume when the volume is increasing at a rate of 10 cm3/sec. 16. [R] a) A container in the shape of a right circular cone, of semi-vertical angle tan−1(12), is placed vertex downwards with its axis vertical. θ θ = tan−1(12) Water is poured in at the rate of 10mm3 per sec. Find the rate at which the depth, hmm, is increasing when the depth of water in the cone is 50mm. b) [H] The cone is filled to a depth of 100mm and pouring is then stopped. A hole is then opened at the vertex of the cone and water flows out of the hole at the rate of 50π √ hmm3 per second, where h is the depth at time t. Show that it takes 200 seconds to empty the cone. c©2020 School of Mathematics and Statistics, UNSW Sydney 97 Chapter 5 The mean value theorem and applications The mean value theorem is one of the most important results for establishing the theoretical structure of calculus. Several results that you are familiar with from high school calculus are based on the mean value theorem. Applications of the mean value theorem include • identifying where a function is increasing or decreasing, • identifying different types of stationary points, • determining how many zeros a polynomial has, • evaluating limits which are indeterminate forms of type ∞∞ and 00 , • proving useful inequalities and • estimating errors in approximations. We begin the chapter by introducing the theorem. 5.1 The mean value theorem (Ref: SH10 §4.1) The basic idea behind the mean value theorem is straightforward. Consider a differentiable function f and choose an interval [a, b]. We construct a chord AB joining the points (a, f(a)) and (b, f(b)) as shown in Figure 5.1. Because the function is continuous and differentiable, there is a point c in (a, b) such that tangent to the graph of f at c lies parallel to the chord AB. Since the tangent and the chord are parallel, their gradients are equal. The gradient of the tangent at c is f ′(c) while the gradient of the chord is given by f(b)− f(a) b− a (using the ‘rise over run’ formula). Hence f(b)− f(a) b− a = f ′(c). c©2020 School of Mathematics and Statistics, UNSW Sydney 98 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS x f(x) | | | | | f(a) f(b) a c b A B C Figure 5.1: The mean value theorem – AB is parallel to the tangent at C. A precise statement of the theorem is given below. Theorem 5.1.1 (The mean value theorem). Suppose that f is continuous on [a, b] and differentiable on (a, b). Then there is at least one real number c in (a, b) such that f(b)− f(a) b− a = f ′(c). The proof of the theorem will be presented in the next section. For now we illustrate the theorem with a simple example. Example 5.1.2. Suppose that f : [1, 8] → R is given by f(x) = 3√x. Find a number c in (1, 8) that satisfies the conclusions of the mean value theorem for f on [1, 8]. Solution. Note that f is continuous on [1, 8] and differentiable on (1, 8). By the mean value theorem there is a real number c in (1, 8) such that f(8)− f(1) 8− 1 = f ′(c). Now f ′(x) = 13x −2/3 so the above equation becomes 2− 1 7 = 1 3 3 √ c2 . Rearranging to find c gives 3 √ c2 = 7 3 c = √ 343 27 . Since c ≈ 3.56 it is clear that c ∈ (1, 8). This is illustrated graphically below. c©2020 School of Mathematics and Statistics, UNSW Sydney 5.2. PROOF OF THE MEAN VALUE THEOREM 99 x 3 √ x | | | | | 1 2 1 c 8 b b b 5.2 Proof of the mean value theorem (Ref: SH10 §4.1) In this section we prove the mean value theorem. The proof is part of MATH1141 only, so MATH1131 students may want to jump straight to Section 5.3. To prove the mean value theorem, we begin by considering the special case when f(a) = f(b). This case is known as Rolle’s theorem and is illustrated in Figure 5.2. Theorem 5.2.1. Suppose that g is continuous on [a, b] and differentiable on (a, b). Suppose also that g(a) = g(b) = 0. Then there is a real number c in (a, b) such that g′(c) = 0. Proof. The proof considers three cases. Case 1: Suppose that g is the constant function given by g(x) = 0. Then g′(c) = 0 for every c in (a, b) (see Section 4.2). Case 2: Suppose that there is a point d in (a, b) such that g(d) > 0. By the max-min theorem (Theorem 3.4.3), g attains a maximum value at some point c in [a, b]. Moreover, c cannot be a or b since g(d) > g(a) = g(b) = 0. Hence c lies in (a, b), and since g is differentiable on (a, b), we have g′(c) = 0 (see Theorem 4.8.2). Case 3: Suppose that g(x) ≤ 0 for all x in [a, b] and that g is not constant on [a, b]. Then g attains a minimum at a point c in (a, b), and similarly to Case 2 one can show that g′(c) = 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 100 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS x g(x) a c b A B C Figure 5.2: Rolle’s theorem – the tangent at C is horizontal. From here it is not hard to see how to prove the mean value theorem. Consider the graph of the function f in Figure 5.1 and ‘subtract’ the chord AB. The result will look something like the graph of g in Figure 5.2, to which we can apply Rolle’s theorem and obtain a horizontal tangent at C. We then ‘add’ the chord back again to obtain the tangent to f of Figure 5.1. This rough geometric argument is made rigorous below. Proof of the mean value theorem. Suppose that f is continuous on [a, b] and differentiable on (a, b). We consider the function g given by g(x) = f(x)− [ f(b)− f(a) b− a (x− a) + f(a) ] . (The part in square brackets is taken from the equation of the chord AB in Figure 5.1.) One can check that g is continuous on [a, b], differentiable on (a, b) and that g(a) = g(b) = 0. By Rolle’s theorem there is a c in (a, b) such that g′(c) = 0; that is, such that f ′(c)− [ f(b)− f(a) b− a ] = 0. If we rearrange this equation then we obtain f ′(c) = f(b)− f(a) b− a as required. 5.3 Proving inequalities using the mean value theorem Our first application of the mean value theorem will be to prove some useful inequalities. We begin with an example. Example 5.3.1. By using the mean value theorem, show that lnx < x− 1 whenever x > 1. c©2020 School of Mathematics and Statistics, UNSW Sydney 5.3. PROVING INEQUALITIES USING THE MEAN VALUE THEOREM 101 Solution. Suppose that x > 1 and consider the closed interval [1, x]. We define a function f : [1, x] → R by f(t) = ln t. Now f is continuous on [1, x] and differentiable on (1, x). So we may apply the mean value theorem to f on the interval [1, x]. Now f ′(t) = 1/t so f(x)− f(1) x− 1 = 1 c for some c in (1, x) by the mean value theorem. That is, lnx x− 1 = 1 c for some c between 1 and x. Since c > 1 we have 1/c < 1 and hence lnx x− 1 < 1. Rearranging the inequality gives lnx < x− 1 as desired. The technique used in the above example is as follows. Consider a function f that is continuous on the interval [a, x] and differentable on the interval (a, x). The mean value theorem gives f(x)− f(a) x− a = f ′(c) (5.1) for some c between a and x. If f ′(c) < M then f(x)− f(a) x− a < M and hence f(x) < M(x− a) + f(a). (5.2) whenever x > a. Sometimes it is desirable for the inequality to go the other way. In this case we find m such that f ′(c) > m. Then, instead of (5.2), we obtain f(x) > m(x− a) + f(a). Often, the tricky part of the process is determining which function f will give the desired inequality. Example 5.3.2. Use the mean value theorem to show that √ x+ 4− 2 < x 4 whenever x > 0. Solution. Fix positive x. The first step is to identify a good choice of f . If we rewrite the inequality as √ x+ 4 < x 4 + 2 c©2020 School of Mathematics and Statistics, UNSW Sydney 102 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS then we have a similar form to inequality (5.2). This suggests that we should define f : [0, x]→ R by the formula f(t) = √ t+ 4. Then f is continuous on [0, x] and differentiable on (0, x). An application of the mean value theorem gives f(x)− f(0) x− 0 = 1 2 √ c+ 4 for some c in (0, x). Now the right-hand side is bounded above by 1 2 √ 4 , so we have √ x+ 4−√0 + 4 x− 0 < 1 2 √ 4 . Simplifying and rearranging gives √ x+ 4− 2 < x 4 as required. 5.4 Error bounds A second application of the mean value theorem is in calculating error bounds. Suppose that a calculation involves √ 26. Clearly √ 26 is close to √ 25, but the latter is easier to work with because it is equal to 5. If we use 5 instead of √ 26 then calculations are made easier with the tradeoff that we introduce an error. How bad is this error? Example 5.4.1. Use the mean value theorem to find an upper bound for the error involved if we approximate √ 26 by √ 25. Solution. The precise error in the approximation is given by √ 26− √ 25. We apply the mean value theorem to the function f , given by f(x) = √ x, on the interval [25, 26]. This gives √ 26−√25 26− 25 = 1 2 √ c for some c in (25, 26). Hence error = √ 26− √ 25 = √ 26−√25 26− 25 = 1 2 √ c < 1 2 √ 25 = 1 10 , since c > 25. Hence an upper bound for the error is 1/10. We illustrate the process with one more example. Example 5.4.2. Use the mean value theorem to find an upper bound for the error involved if we approximate sin 15π21 by sin 2π 3 . c©2020 School of Mathematics and Statistics, UNSW Sydney 5.5. THE SIGN OF A DERIVATIVE 103 5.5 The sign of a derivative (Ref: SH10 §§4.2, 4.3) Suppose that a function f is differentiable at 4 and that f ′(4) = −2. In high school, it was taught that, since the derivative of f at 4 is negative, the function f is decreasing at the point 4. This makes intuitive sense since the gradient of the tangent to f at the point 4 is −2 and so the tangent slopes downward. In this section, these ideas are given a rigorous foundation by using the mean value theorem. Definition 5.5.1. Suppose that a function f is defined on an interval I. We say that a) f is increasing on I if for every two points x1 and x2 in I, x1 < x2 implies that f(x1) < f(x2); b) f is decreasing on I if for every two points x1 and x2 in I, x1 < x2 implies that f(x1) > f(x2). Example 5.5.2. The function f : R→ R given by f(x) = x2 is increasing on [0,∞) and decreasing on (−∞, 0]. x f(x) decreasing increasing The following theorem should be familiar to you. It is proved using the mean value theorem. Theorem 5.5.3. Suppose that f is continuous on [a, b] and differentiable on (a, b). (i) If f ′(x) > 0 for all x in (a, b) then f is increasing on [a, b]. (ii) If f ′(x) < 0 for all x in (a, b) then f is decreasing on [a, b]. (iii) If f ′(x) = 0 for all x in (a, b) then f is constant on [a, b]. Proof. We only prove statement (i) since the proofs of statements (ii) and (iii) are similar. Suppose that f ′(x) > 0 for all x in (a, b) and choose two points x1 and x2 in [a, b] such that x1 < x2. By Definition 5.5.1, it suffices to show that f(x1) < f(x2). Since f is differentiable on I, it is continuous on [x1, x2] (see Theorem 4.5.1) and differentiable on (x1, x2). Hence, by the mean value theorem, there is a c in (x1, x2) such that f(x2)− f(x1) x2 − x1 = f ′(c). c©2020 School of Mathematics and Statistics, UNSW Sydney 104 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS But f ′(c) > 0 and x2 − x1 > 0. This means that f(x2) − f(x1) > 0 and hence f(x1) < f(x2) as required. Theorem 5.5.3 has several important applications. The first of these is classifying different types of stationary points. The basic idea is as follows. Suppose that f is differentiable on the open interval (1, 3) and that for some small positive number h we have the following: • f ′(x) > 0 for all x in (2− h, 2), • f ′(2) = 0, and • f ′(x) < 0 for all x in (2, 2 + h). Now f is differentiable on (2 − h, 2 + h) so it is also continuous on this interval. Moreover, f increases on the interval (2− h, 2), is stationary at 2 and then decreases on the interval (2, 2 + h). From these facts we conclude that the stationary point 2 is a local maximum point for f . Example 5.5.4. Find and classify all stationary points of the function f : R→ R whose derivative is given by f ′(x) = (x− 4)(x− 1)(x+ 5)2. Solution. The stationary points, which are found by solving the equation f ′(x) = 0, are 4, 1 and −5. To classify each stationary point, we examine the sign of f ′ on small intervals either side of each stationary point. This process is concisely documented in the following table. −5− −5 −5+ 1− 1 1+ 4− 4 4+ x− 4 − − − − − − − 0 + x− 1 − − − − 0 + + + + (x+ 5)2 + 0 + + + + + + + f ′(x) + 0 + + 0 − − 0 + Gradient upslope − upslope upslope − − upslope For example, the notation 4− is shorthand for those points x lying in a small interval to the immediate left of 4. For such a point x, the factor x − 4 is negative, while the factors x − 1 and (x+ 5)2 are positive. Hence f ′(x) is negative when x is slightly to the left of 4. Other columns of the table are filled by a similar process. In conclusion, 4 is a local minimum point, 1 is a local maximum point and −5 is neither a maximum nor minimum point. (In fact, −5 is called a horizontal point of inflexion since the tangent to f at −5 is horizontal and the concavity of the function changes about this point. See any standard undergraduate calculus text or any advanced high school calculus text for further details.) 5.6 The second derivative and applications (Ref: SH10 §4.3) In this section we give another method for classifying the stationary points of a function f . To do so, we use the second derivative of f , which is denoted by f ′′. c©2020 School of Mathematics and Statistics, UNSW Sydney 5.6. THE SECOND DERIVATIVE AND APPLICATIONS 105 Example 5.6.1. Find the second derivative of the function f : R→ R, given by the rule f(x) = 3x4 + 2x− sinx. Solution. The first derivative of f is given by f ′(x) = 12x3 + 2− cos x. To find the second derivative f ′′ of f , we simply differentiate f ′. Hence f ′′(x) = 36x2 + sinx. Remark 5.6.2. If y = f(x) then the first derivative of f is often written as dydx or y ′ while the second derivative of f is often written as d 2y dx2 or y′′. Remark 5.6.3. Even if a function is differentiable at a point a, the function may not have a second derivative at a. (Consider, for example, the function f : R→ R given by f(x) = x4/3, at the point 0.) A function which has a second derivative at a is called twice differentiable at a. If f is twice differentiable, the sign of f ′′ can often be used to determine whether a stationary point is a local maximum point or a local minimum point. Theorem 5.6.4 (The second derivative test). Suppose that a function f is twice differentiable on (a, b) and that c ∈ (a, b). (i) If f ′(c) = 0 and f ′′(c) > 0 then c is a local minimum point of f ; (ii) If f ′(c) = 0 and f ′′(c) < 0 then c is a local maximum point of f . The proof of the theorem uses the following result. Lemma 5.6.5. Suppose that g is differentiable at a point c. (i) If g′(c) > 0 then g(c − h) < g(c) < g(c + h) for all positive h sufficiently small. (ii) If g′(c) < 0 then g(c + h) < g(c) < g(c − h) for all positive h sufficiently small. Sketch proof. We give an intuitive argument for statement (i) and leave (ii) as an exercise. Suppose that g′(c) > 0. By the definition of the derivative, lim h→0 g(c + h)− g(c) h = g′(c). (5.3) That is, g(c+ h)− g(c) h ≈ g′(c) when h is close to 0. Now, if h > 0 then g(c + h)− g(c) ≈ hg′(c) > 0 c©2020 School of Mathematics and Statistics, UNSW Sydney 106 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS Hence g(c+ h)− g(c) > 0 (5.4) provided that h > 0 and that h is sufficiently small. On the other hand, the definition of the derivative also implies that lim h→0 g(c − h)− g(c) −h = g ′(c) (to see this, simply replace h with −h in (5.3)). Now, if h > 0 and h is close to 0 then g(c) − g(c− h) ≈ hg′(c) > 0. Hence g(c)− g(c − h) > 0 (5.5) provided that h > 0 and that h is sufficiently small. If we combine (5.4) and (5.5) then we have g(c − h) < g(c) < g(c + h) for all positive h sufficiently small. Proof of Theorem 5.6.4. We prove statement (i) and leave the proof of (ii) as an exercise. Suppose that f ′(c) = 0, f ′′(c) > 0 and that g = f ′. Then g′(c) > 0 and g(c − h) < g(c) < g(c + h) for all positive h sufficiently small, by Lemma 5.6.5. Since f ′(c) = g(c) = 0 we conclude that f ′(c− h) < 0 < f ′(c+ h) for all positive h sufficiently small. So f ′ is positive on a small interval immediately to the right of c, zero at x and negative on a small interval immediately to the left of c. By Theorem 5.5.3 we now conclude that f has a local minimum at the point c. Example 5.6.6. Find and classify the stationary points of the function f : R→ R given by f(x) = x3 − 6x2 + 9x− 5. Solution. First we calculate the first and second derivatives of f : f ′(x) = 3x2 − 12x+ 9 = 3(x− 1)(x − 3) f ′′(x) = 6x− 12. Hence f has stationary points at 1 and 3. Now f ′′(1) = −6 < 0, so f has a local maximum point at 1. For second stationary point, f ′′(3) = 6 > 0, so f has a local minimum stationary point at 3. (To illustrate these conclusions, we have included the graph of f , shown below.) c©2020 School of Mathematics and Statistics, UNSW Sydney 5.7. CRITICAL POINTS, MAXIMA AND MINIMA 107 x f(x) | | 1 3 −5 Remark 5.6.7. Suppose that f ′(c) = 0 and that f ′′(c) = 0. Then we cannot apply the second derivative test. In fact, the point c may be a local maximum, a local minimum or neither of these. The examples below illustrate various possibilities. x x4 x −x4 x x3 • If f(x) = x4 then f ′(0) = f ′′(0) = 0 and there is a local minimum at 0. • If f(x) = −x4 then f ′(0) = f ′′(0) = 0 and there is a local maximum at 0. • If f(x) = x3 then f ′(0) = f ′′(0) = 0 and there is neither a a local maximum and minimum at 0 (in fact, we have a horizontal point of inflexion at 0). Hence if f ′(c) = f ′′(c) = 0 then it is best to classify the stationary point c by examining the sign of the derivative on either side of c, as illustrated in Section 5.5. 5.7 Critical points, maxima and minima (Ref: SH10 §§4.3–4.5) By combining results from Chapters 3, 4 and 5, we now give a comprehensive presentation of how to locate global maxima and minima for any real-valued continuous function defined on a closed interval. Suppose that a function f : [a, b] → R is continuous on [a, b]. By the max-min theorem (Theorem 3.4.3), f attains a global maximum and a global minimum on [a, b]. Suppose that the global maximum occurs at the point c. Now either c is one of the endpoints a or b, or c lies in the open interval (a, b). In the latter case, f is either differentiable at c or it is not. If it is differentiable at c, then f ′(c) = 0 by Theorem 4.8.2. Thus we have identified that a global maximum point c must satisfy one of the three following properties: c©2020 School of Mathematics and Statistics, UNSW Sydney 108 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS • c is one of the endpoints a or b of the closed interval [a, b], • f is differentiable at c and f ′(c) = 0, or • f is not differentiable at c. The set of points satisfying these properties is useful so we give it a special name. Definition 5.7.1. Suppose that f is defined on [a, b]. We say that a point c in [a, b] is a critical point for f on [a, b] if c satisfies one of the following properties: (a) c is an endpoint a or b of the interval [a, b], (b) f is not differentiable at c, or (c) f is differentiable at c and f ′(c) = 0. Theorem 5.7.2. Suppose that f is continuous on [a, b]. Then f has a global maximum and global minimum on [a, b]. Moreover, the global maximum point and the global minimum point are both critical points for f on [a, b]. The proof of the theorem follows the same reasoning given at the beginning of this section. The theorem provides a systematic way of finding global minima and maxima of functions. Example 5.7.3. Suppose that f : R→ R is given by f(x) = 3(x− 1)2/3. Find the absolute maximum and minimum values of f on [0, 2]. Solution. The derivative of f is given by f ′(x) = 2(x− 1)−1/3 = 2 3 √ x− 1 . Clearly f is not differentiable at 1. Morover, its derivative is never zero. So the only critical points of f on [0, 2] are 1 and the endpoints 0 and 2. By Theorem 5.7.2, the global maximum and minimum points must be among the points 0, 1 and 2. But f(0) = 3, f(1) = 0 and f(2) = 3. Hence the global minimum value of f on [0, 2] is 0 while the global maximum value of f on [0, 2] is 3. (See Figure 5.3 (a) for a sketch of f that shows its critical points.) Example 5.7.4. Suppose that the function f : R→ R is given by the rule f(x) = |x2 − 3x− 4|. Find the absolute maximum and absolute minimum values of f on the interval [0, 5]. c©2020 School of Mathematics and Statistics, UNSW Sydney 5.7. CRITICAL POINTS, MAXIMA AND MINIMA 109 x f(x) b b b | 1 2 3 (a) x f(x) b b b b | | 4 3 2 4 5 (b) Figure 5.3: Graphs of functions and their critical points. x y | | −2 3 (a) x g(x) b b b b | | −2 3 |34 (b) Figure 5.4: Graphs for Example 5.7.5. c©2020 School of Mathematics and Statistics, UNSW Sydney 110 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS Example 5.7.5. Consider the parabolic arc {(x, y) ∈ R2 : y = x2 − x− 1,−2 ≤ x ≤ 3} (see Figure 5.4 (a)). Find where the arc is (i) closest to and (ii) furthest from the origin. Solution. If (x, y) is a point on the arc then its distance from the origin is given by√ x2 + y2. To find where the arc is closest to the origin, we want to minimise √ x2 + y2 subject to the con- straints y = x2− x− 1 and −2 ≤ x ≤ 3. In other words, we want to find a global minimum for the function f(x) = √ x2 + (x2 − x− 1)2 on the interval [−2, 3]. Because calculations involving square roots can get messy, we instead consider the function g, given by g(x) = x2 + (x2 − x− 1)2 on [−2, 3]. (Note that g(x) may be interpreted as the square of the distance between the origin and a point on the arc with coordinate (x, x2 − x− 1). The global minimum of both f and g will occur at the same point x.) We seek the critical points of g, so we compute its derivative: g′(x) = 2x+ 2(x2 − x− 1)(2x − 1) = 2(x+ 2x3 − 2x2 − 2x− x2 + x+ 1) = 2(2x3 − 3x2 − x2 + 1) = 2(x− 1)(2x2 − x− 1) = 2(x− 1)(x − 1)(2x + 1). Hence the critical points of g on [−2, 3] are −2, −1/2, 1 and 3. Now g(−2) = 29, g(−1/2) = 5/16, g(1) = 2, g(3) = 34 (see Figure 5.4 (b)). Therefore the arc is closest to the origin when x = −1/2 and furthest from the origin when x = 3. 5.8 Counting zeros In this section we combine the intermediate value theorem with results about the sign of derivatives to determine how many solutions there are to various equations. The main ideas are illustrated in the next example. Example 5.8.1. Determine how many real numbers satisfy the equation x3 − 6x2 − 15x+ 8 = 0. (5.6) c©2020 School of Mathematics and Statistics, UNSW Sydney 5.8. COUNTING ZEROS 111 Solution. Suppose that f(x) = x3 − 6x2 − 15x + 8 for all real x. Ideally, we would look for one root of f , factor this out and solve a quadratic equation to determine the remaining roots (if any). However, finding one root by inspection is not straightforward, so we need another approach. Note that f is differentiable (and hence continuous) everywhere. We begin by identifying intervals where f is monotonically decreasing and intervals where f is monotonically increasing. The derivative of f is given by f ′(x) = 3x2 − 12x− 15 = 3(x− 5)(x+ 1). So the stationary points of f are 5 and −1. Morover, the table below shows that f is increasing on the intervals (−∞,−1] and [5,∞), while it is decreasing on [−1, 5]. Interval (−∞,−1) (−1, 5) (5,∞) (x− 5) − − + (x+ 1) − + + f ′(x) + − + We evaluate f at the (finite) endpoints of these intervals and see that f(−1) = 16 and f(5) = −92. Since f is a cubic, we can now give a rough sketch of the graph of f . x f(x) b b (−1, 16) (5,−92) We now argue that f has exactly one root on each of the intervals mentioned above. • On the interval [−1, 5], the function f is decreasing. Hence f cannot have more than one root on [−1, 5]. On the other hand, since f(−1) > 0 and f(5) < 0 there is at least one c in [−1, 5] such that f(c) = 0 by the intermediate value theorem. Hence f has exactly one root on [−1, 5]. • On the interval (−∞,−1], the function f is increasing. Hence f cannot have more than one root on (−∞,−1]. On the other hand, since f(−1) > 0 and f(−5) < 0 there is at least one c in [−5,−1] such that f(c) = 0 by the intermediate value theorem. Hence f has exactly one root on (−∞,−1]. • On the interval [5,∞), the function f is increasing. Hence f cannot have more than one root on [5,∞). On the other hand, since f(5) < 0 and f(10) > 0 there is at least one c in [5, 10] such that f(c) = 0 by the intermediate value theorem. Hence f has exactly one root on [5,∞). c©2020 School of Mathematics and Statistics, UNSW Sydney 112 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS In conclusion, equation (5.6) has exactly three real solutions, one in each of the intervals [−5,−1], [−1, 5] and [5, 10]. Guided by our solution to Example 5.8.1, we now outline a general approach for solving problems of this type. Suppose that f is continuous and differentiable everywhere and that its derivative is also con- tinuous. To determine how many real solutions the equation f(x) = 0 has, one may follow the procedure below. 1. Calculate f ′ and solve f ′(x) = 0. 2. Determine the intervals where the derivative is positive and the intervals where it is negative. 3. By step 2 we know the intervals where the function f is monotonically increasing, and the intervals where f is monotonically decreasing. 4. Evaluate the function f at the endpoints of each interval. If f changes sign on the interval, there is exactly one root on that interval. If it does not change sign, there are no roots on that interval. (This step must be modified slightly for intervals of the form (−∞, a] or [b,∞).) Example 5.8.2. Determine how many real numbers satisfy the equation 2x3 − 9x2 + 12x− 1 = 0. (5.7) Solution. We follow the steps outlined above. Suppose that f(x) = 2x3 − 9x2 + 12x− 1. By differentiating f we obtain f ′(x) = 6x2 − 18x+ 12 = 6(x− 1)(x− 2). The stationary points of f are 1 and 2. By studying the sign of f ′, it is clear that f is increasing on the intervals (−∞, 1) and (2,∞) while it is decreasing on (1, 2). Now f(1) = 4 and f(2) = 3, as illustrated in the sketch below. x f(x) b b (1, 4) (2, 3) c©2020 School of Mathematics and Statistics, UNSW Sydney 5.9. ANTIDERIVATIVES 113 So it is clear that f has no zeros on [1, 2] or on [2,∞). On (−∞, 1), the function f has no more than one zero (since it is increasing on this interval). Moreover, f changes sign on the interval [0, 1] (since f(0) = −1 < 0 and f(1) > 0). Hence f has exactly one root on [0, 1] by the intermediate value theorem. In conclusion, equation (5.7) has exactly one real solution and this solution lies somewhere in the interval [0, 1]. 5.9 Antiderivatives We open with an example. Example 5.9.1. While filming a movie, a stunt man jumps out of a stationary helicopter 900 meters above the ground. The scriptwriter wants the man to make a 20 second mobile phone call before opening the parachute. Physical considerations (gravity and air resistance) suggest that the man’s velocity f(t) (in metres per second) without an open parachute is given by f(t) = 50(1 − e−t/5), where t is the number of seconds after jumping from the helicopter. How far will the man fall 20 seconds? Solution. Write F (t) for the distance fallen (in metres) after t seconds. We note that F (0) = 0. Since velocity is the rate of change of displacement, F satisfies the equation F ′(t) = f(t). (Any equation like this involving differentiation is called a differential equation). The differential equation may be rewritten as dF dt = 50− 50e−t/5. (5.8) One can check by differentiation that if F (t) = 50t+ 250e−t/5 + C, (5.9) where C is a real constant, then F satisfies (5.8). Let’s assume (for the moment) that all possible solutions to the differential equation (5.8) are of the form (5.9). Imposing the condition that F (0) = 0 allows us to evaluate the unknown constant C: 0 = F (0) = 50× 0 + 250e0 + C, whence C = −250. Therefore F (t) = 50t+ 250e−t/5 − 250. To complete the solution, F (20) = 50× 20 + 250e−4 − 250 ≈ 754.6. Hence the stunt man will fall about 755 metres in the first 20 seconds. To be certain of this solution, we need to show that every function F satisfying the differential equation (5.8) is of the form (5.9). This can be done using the mean value theorem. We begin by introducing the notion of an antiderivative. c©2020 School of Mathematics and Statistics, UNSW Sydney 114 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS Definition 5.9.2. Suppose that f is continuous on an open interval I. A function F is said to be an antiderivative (or a primitive) of f on I if F ′(x) = f(x) for all x in I. The process of finding an antiderivative of a function is called antidifferentiation. Example 5.9.3. (a) Suppose that f(t) = 50− 50e−t/5 ∀t ∈ R Then the function F , given by F (t) = 50t+ 250e−2t − 250 is an antiderivative of f on (−∞,∞). (b) Suppose that n is a positive integer and f(x) = xn. If F (x) = xn+1 n+ 1 and G(x) = xn+1 n+ 1 + 5 then F and G are both antiderivatives of f on the interval (−∞,∞). So f has more than one antiderivative on (−∞,∞). In fact, if C is any real number and H(x) = xn+1/(n+ 1) + C then H is an antiderivative of f on (−∞,∞). Thus the function f has infinitely many antiderivatives on (−∞,∞). Example 5.9.3 (b) illustrates the general principle that if F is an antiderivative of f on I then F + C is also an antiderivative of f on I, for any real constant C. The next theorem says that all antiderivatives are of this form. Theorem 5.9.4. Suppose that f is a continuous function on an open interval I and that F and G are two antiderivatives of f on I. Then there is a real constant C such that G(x) = F (x) + C for all x in I. Proof. Suppose that F and G are two antiderivatives of f on I and let H denote the function given by H(x) = G(x)− F (x) for all x in I. Then H is differentiable on I and H ′(x) = G′(x)− F ′(x) = f(x)− f(x) = 0 c©2020 School of Mathematics and Statistics, UNSW Sydney 5.10. L’HOˆPITAL’S RULE 115 for all x in I. Hence there is a constant C such that H(x) = C for all x in I (by Theorem 5.5.3 (c)). Therefore G(x) = F (x) +H(x) = F (x) + C for all x in I. Theorem 5.9.4 justifies the assumption made in the solution to Example 5.9.1. It also enables us to write down all possible antiderivatives of some well-known functions. In the table below, C is any real constant. Function Antiderivative xr, where r is rational and r 6= −1 1r+1xr+1 + C sinx − cos x+ C cos x sinx+ C eax 1ae ax + C f ′(x) f(x) ln |f(x)|+C 5.10 L’Hoˆpital’s rule (Ref: SH10 §§11.5, 11.6) Consider the limit lim x→0 ex − 1 5x+ x2 . Since both the numerator and denominator approach 0 as x → 0+, the limit is an example of an indeterminate form of type 00 . None of the limit rules of Chapter 2 can be easily applied to evaluate the limit. However, the mean value theorem gives another rule which helps in this situation. Suppose that f(x) = ex − 1 and g(x) = 5x+ x2. Instead of considering the quotient f(x)/g(x) (whose limit is difficult to calculate), we consider the quotient f ′(x)/g′(x) of derivatives (whose limit is easier to calculate): lim x→0 f ′(x) g′(x) = lim x→0 ex 5 + 2x = 1 5 . Since this second limit exists, l’Hoˆpital’s rule (which is stated below) implies that lim x→0 f(x) g(x) = lim x→0 f ′(x) g′(x) . Hence lim x→0 ex − 1 5x+ x2 = 1 5 . Theorem 5.10.1 (l’Hoˆpital’s rule). Suppose that f and g are both differentiable functions and a is a real number. Suppose also that either one of the two following conditions hold: c©2020 School of Mathematics and Statistics, UNSW Sydney 116 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS • f(x)→ 0 and g(x)→ 0 as x→ a; • f(x)→∞ and g(x)→∞ as x→ a. If lim x→a f ′(x) g′(x) exists then lim x→a f(x) g(x) = lim x→a f ′(x) g′(x) . Remark 5.10.2. The theorem also holds for • limits as x→∞ or x→ −∞; • one-sided limits (that is, as x→ a+ or as x→ a−). L’Hoˆpital’s rule is proved using the mean value theorem (see the sketch proof at the end of this section). Example 5.10.3. Calculate lim x→1 lnx 1− x . Solution. If f(x) = lnx and g(x) = 1−x then both f(x) and g(x) approach 0 as x→ 1. Moreover, f and g are both differentiable. So we look at the quotient of the derivatives: f ′(x) g′(x) = 1/x −1 = − 1 x → −1 as x→ 1. We conclude by l’Hoˆpital’s rule that lim x→1 f(x) g(x) = −1. (Note: for short we may write lim x→1 lnx 1− x = limx→1 1/x −1 (5.10) = lim x→1 −1 x = −1 provided that we check that our application of l’Hoˆpital’s rule in (5.10) is valid.) Sometimes l’Hoˆpital’s rule must be applied more than once to calculate a limit. The next example illustrates this. Example 5.10.4. Determine the limiting behaviour of x2 e2x as x→∞. c©2020 School of Mathematics and Statistics, UNSW Sydney 5.10. L’HOˆPITAL’S RULE 117 Solution. Observe that both the numerator and denominator approach ∞ as x → ∞. Differenti- ating the numerator and denominator gives 2x 2e2x . Again, as x→∞ we have an indeterminate form of type ∞∞ . If we differentiate the numerator and denominator once more we have 2 4e2x and this limit can be evaluated. Piecing this together, we have lim x→∞ x2 e2x = lim x→∞ 2x 2e2x = lim x→∞ 2 4e2x = 0 by l’Hoˆpital’s rule. As the next example shows, l’Hoˆpital’s rule can be used to solve limits that are indefinite forms of type 0×∞. Example 5.10.5. Determine the limiting behaviour of x lnx as x→ 0+. The next example warns against the improper application of l’Hoˆpital’s rule. Example 5.10.6. Determine the limiting behaviour of 2x− cos x 3x+ cos x as x→∞. We shall now sketch the proof of l’Hoˆpital’s rule for the case when x→ a+. Other cases are done similarly. (A complete proof uses a generalisation of the mean value theorem known as Cauchy’s mean value theorem.) Sketch proof of Theorem 5.10.1. Suppose that f and g are differentiable (and hence continuous) everywhere, that f(x)→ 0 and g(x)→ 0 as x→ a+, and that lim x→a+ f ′(x) g′(x) exists. We need to show that lim x→a+ f(x) g(x) = lim x→a+ f ′(x) g′(x) . (5.11) We begin by applying the mean value theorem to both functions f and g on the interval [a, x]. So there are real numbers c and d in (a, x) such that f(x)− f(a) x− a = f ′(c) and g(x)− g(a) x− a = g ′(d). (5.12) Since f is continuous and f(x)→ 0 as x→ a+, we have f(a) = 0. Similarly g(a) = 0. So by (5.12) we have f(x) = (x− a)f ′(c) and g(x) = (x− a)g′(d) for some c and d in (a, x). Hence f(x) g(x) = (x− a)f ′(c) (x− a)g(d) = f ′(c) g′(d) . (5.13) Now c→ a+ and d→ a+ as x→ a+. Hence (5.13) becomes (5.11) as required. c©2020 School of Mathematics and Statistics, UNSW Sydney 118 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS Problems for Chapter 5 Problems 5.1 : The mean value theorem 1. [R] Find a real number c which satisfies the conclusions of the mean value theorem for each function f on the given interval. a) f(x) = x3 on [1, 2] b) f(x) = √ x on [0, 2]. 2. [R] Suppose that f(x) = 1/x. Show that there is no real number c in [−1, 2] such that f ′(c) = f(2)− f(−1) 2− (−1) . Why does this not contradict the mean value theorem? 3. [R] Consider the function f given by f(x) = (x − 2)4 cos(x2 − 4x+ 4). Use the mean value theorem to show that f ′ has a zero on the interval [1, 3]. Problems 5.3 : Proving inequalities using the mean value theorem 4. [R] [V] By using the mean value theorem, show that a) ln(1 + x) < x whenever x > 0; b) − ln(1− x) < x/(1 − x) whenever 0 < x < 1; c) 1 + x < ex whenever x > 0. 5. [R] a) Use the mean value theorem to show that sin t < t whenever t > 0. b) Using the pinching theorem and part (a), evaluate the limit lim x→∞ sin 1 x . 6. [H] Prove that 1 + x 2 √ 1 + x < √ 1 + x < 1 + x 2 whenever x > 0. Problems 5.4 : Error bounds 7. [R] [V] Use the mean value theorem to find an upper bound for the error involved if we approximate a) √ 17 by √ 16 = 4; b) ( 1998 1000 )2 by 22 = 4; c) 1 1002 by 1 1000 . c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 5 119 Problems 5.5 : The sign of a derivative, 5.6 : The second derivative and applications and 5.7 : Critical points, maxima and minima 8. [R] The derivative of a function f : R → R is given by f ′(x) = 3(x + 1)(x − 1)2(x − 4)3. Locate all stationary points of f and identify any local maximum or minimum points of f . 9. [R] Find the maximum and minimum values for each function f over the given interval. a) f(x) = 3− x3 over [−2, 4] b) f(x) = 3− x4 over [−2, 4] c) f(x) = x3 − x4 over [−5, 5] d) f(x) = 2x(x+ 4)3 over [−2, 1] e) f(x) = |x2 − 3x+ 2| over [0, 3] 10. [R] Find the point on the straight line 2x+ 3y = 6 which is closest to the origin. 11. a) i) [R] Show that the polynomial p3, where p3(x) = 1 + x+ x2 2! + x3 3! , has at least one real root. ii) [H] Show that the polynomial p2, where p2(x) = 1 + x+ x2 2! , has no real roots and deduce that p3 has exactly one real root. 12. [R] A wire of length 100 cm is cut into two pieces of length x cm and y cm. The piece of length x cm is bent into the shape of a square and the piece of length y cm into the shape of a circle. Find x and y so that the sum of the areas enclosed by the shapes will be a) a minimum b) a maximum. Problems 5.8 : Counting zeros 13. [R] Show that x3 + x− 9 = 0 has only one real solution. 14. [R] [V] Suppose that p(x) = x3 − 12x2 + 45x − 51 whenever x ∈ R. How many real zeros does p have? Problems 5.9 : Antiderivatives 15. [R] a) Find a function f that has the following properties: f ′(t) = sin t+ t whenever t ∈ R, f(0) = 2. b) Are there any other functions with these properties? Explain your answer. 16. [R] [V] A particle moving along the x-axis has velocity 2t − t2 units per second after t seconds. Find c©2020 School of Mathematics and Statistics, UNSW Sydney 120 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS a) the distance from the starting point after three seconds; b) the total distance travelled after three seconds. Problems 5.10 : L’Hoˆpital’s rule 17. [R] Calculate the following limits. a) lim x→0 ex − 1 x(3 + x) b) lim x→1 xm − 1 xn − 1 , n 6= 0 c) limx→π/2 x− π/2 cos x d) lim x→0 ln (1 + x)− x x2 e) lim x→π/2 ( 1− sinx 1 + cos 2x ) f) lim x→0 tanx− x x3 18. [R] Determine the limiting behaviour in the following cases. a) x3 + 1 x4 + 1 as x→∞ b) e 5x x3 as x→∞ c) e5x x3 as x→ −∞ d) x sin(1/x) as x→∞ e) √ x4 + 1 3 √ x6 + 1 as x→∞ f) ln(x 3 + 1) ln(x2 + 1) as x→∞ 19. [H] Find the value of lim t→0 ( 1 ln(1 + t) + 1 ln(1− t) ) . 20. [H] Find (a, b) such that lim x→0 ax− 1 + ebx x2 = 1. 21. [R] Explain why l’Hoˆpital’s Rule cannot be used to find lim x→∞ 4x+ sinx 2x− sinx . Use another method to find this limit. 22. [R] [V] Show that the function f , given by f(x) = { e2x if x ≥ 0 2x+ 1 if x < 0, is differentiable at 0. 23. [R] a) Evaluate lim h→0+ cos √ h− 1 h . b) A function f is defined by f(x) = { cos √ x if x ≥ 0 ax+ b if x < 0, where a and b are real numbers. By using the limit calculated in (a), find all possible values of a and b such that f is differentiable at 0. c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 5 121 24. [H] [V] a) Use l’Hoˆpital’s rule to show that lim x→0+ x lnx = 0. b) By using part (a), or otherwise, show that lim x→0+ x2 lnx = 0. c) A function f is defined by f(x) = { x2 lnx if x > 0 ax+ b if x ≤ 0, where a and b are real numbers. Find all possible values of a and b such that f is differentiable at 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 122 CHAPTER 5. THE MEAN VALUE THEOREM AND APPLICATIONS c©2020 School of Mathematics and Statistics, UNSW Sydney 123 Chapter 6 Inverse functions We return again to the pollution example (see the introductions to Chapter 2). According to our model, the volume of pollution P (t) in the lake after t days of the factory’s operation is given by P (t) = 109 101 ( 1− e−101t/105). Environmental authorities, after hearing that the eventual amount of pollution in the lake will be 109/101 litres, determine that this level would be environmentally devastating. Recommendations are made that the amount of pollution should not exceed 8 × 106 litres. If factory operations do not change, in how many days will pollution levels in the lake exceed this amount? Note that the function P expresses pollution as a function of time. One approach to solving this problem is to find a function T that expresses time as a function of pollution. Such a function is called an inverse function of P . This chapter is devoted to a study of inverse functions. 6.1 Some preliminary examples The examples of this section give an introduction to some of the key concepts appearing in Chapter 6. (A rigorous treatment of these concepts will be given in later sections.) Example 6.1.1. Jack and Jill are playing a number game. Jack says, ‘A positive number, when squared, is equal to 16. What was the original number?’ Jill correctly answers that the number was 4. Jack goes again: ‘A positive number, when squared, is equal to 49. What was the original number?’ This time Jill answers ‘7.’ This game may be modelled using two functions. Jack’s function f takes a positive number and squares it. That is, f is given by f : (0,∞)→ (0,∞), f(x) = x2. The function g that Jill uses to answer Jack’s question is given by g : (0,∞)→ (0,∞), g(x) = √x. So f takes an initial number and squares it, while g takes the square root and recovers the original number. A way of writing down the fact that g undoes (or reverses) what f does to any positive number x is: g(f(x)) = x ∀x ∈ Dom(f). (6.1) c©2020 School of Mathematics and Statistics, UNSW Sydney 124 CHAPTER 6. INVERSE FUNCTIONS x f(x) output input (a) Example 6.1.1 x f(x) output input (b) Example 6.1.2 x f(x) output inputinput (c) Example 6.1.3 Figure 6.1: Correspondence between inputs and outputs In fact, f also undoes what g does to any positive number. This is easily verified by checking that f(g(x)) = x ∀x ∈ Dom(g). (6.2) Because f and g have this special relationship, they are called inverses of each other. The relationship between any function f and its inverse function g can also be viewed in the following way. If y = f(x) and we want to express x as a function of y, then we find that x = g(y). For the functions of Example 6.1.1, this is easily verified by the following calculation: y = f(x) ⇐⇒ y = x2 ⇐⇒ √y = x ⇐⇒ x = g(y). The next example shows that changing the domain of f gives a different inverse function g. Example 6.1.2. Jack changes the game slightly. He says, ‘A negative number, when squared, is equal to 36. What was the original number?’ Jill correctly answers that the number was −6. Again, we model the game with two functions. Jack’s function f is given by f : (−∞, 0)→ (0,∞), f(x) = x2, while Jill’s function g is given by g : (0,∞)→ (−∞, 0), g(x) = −√x. Note that g undoes what f does to any negative number, and vice-versa. That is, f and g satisfy equations (6.1) and (6.2) and are therefore inverse functions of each other. Each function f in Example 6.1.1 and Example 6.1.2 had the property that for every output there is a unique corresponding input (See Figure 6.1 (a) and (b)). However, the function f graphed in Figure 6.1 (c) does not have a one-to-one correspondence between its outputs and inputs. As we see in the next example, this function f has no inverse function. Example 6.1.3. Jack changes the game one more time. This time he says, ‘A real number, when squared, is equal to 64. What was the original number?’ This time Jill cannot give a definite answer to his question. The answer is either 8 or −8, but she has no way of telling which it is. The model for this situation is the following. Jack’s function f is now given by f : R→ [0,∞), f(x) = x2; c©2020 School of Mathematics and Statistics, UNSW Sydney 6.2. ONE-TO-ONE FUNCTIONS 125 but there is no function g that undoes what f does to a real number x. That is, there is no function g that satisfies the equation (6.1). Therefore f has no inverse function. The reason why this is the case is clear: the domain of f now includes the positive and negative real numbers, and therefore some (in fact most) outputs of f have two possible corresponding inputs. That is, there is no one-to-one correspondence between inputs and outputs of f . The ideas introduced through these examples are summarised below. • If f has an inverse function g, then g undoes (or reverses) what f does to elements of Dom(f). • Suppose f and g are inverse functions. Then y = f(x) if and only if x = g(y). • A function f has an inverse function only if there is a one-to-one correspondence between inputs and outputs of f . • The domain of a function f plays an important role in determining whether f has an inverse function g, and if so, exactly what the inverse function is. • If a function f does not have an inverse function, then we can sometimes modify the domain of f so that it does have an inverse function. (For example, if f(x) = x2 then f : R→ [0,∞) does not have an inverse function while the function f : (0,∞) → (0,∞), with domain restricted to the positive real numbers, does have an inverse function.) In the following sections we treat these ideas more rigorously. 6.2 One-to-one functions (Ref: SH10 §7.1) As was demonstrated in Section 6.1, the property that a function has a one-to-one correspondence between inputs and outputs is critical to our discussion. Definition 6.2.1. A function f is said to be one-to-one if f(x1) = f(x2) implies that x1 = x2 whenever x1, x2 ∈ Dom(f). In other words, a function is one-to-one if every output has a unique input. Equivalently, f is one-to-one if f(x1) 6= f(x2) whenever x1 6= x2, provided that both x1 and x2 belong to Dom(f). One-to-one functions are sometimes called injective functions. Example 6.2.2. Show that the function f : R→ R, given by f(x) = 2x− 3, is one-to-one. c©2020 School of Mathematics and Statistics, UNSW Sydney 126 CHAPTER 6. INVERSE FUNCTIONS Solution. Suppose that f(x1) = f(x2). Then 2x1 − 3 = 2x2 − 3. If we add 3 to both sides and afterwards divide by 2 then we obtain x1 = x2. Hence f is one-to-one. Example 6.2.3. Show that the function f : R→ R, given by f(x) = x2, is not one-to-one. Proof. Clearly f(−2) = f(2). However, −2 6= 2 and so f is not one-to-one. There are other ways of identifying one-to-one functions. The following method is a simple geometric test. Proposition 6.2.4 (The horizontal line test). Suppose that f is a real-valued function defined on some subset of R. Then the following statements are equivalent: (a) f is one-to-one; (b) every horizontal line in the Cartesian plane intersects the graph of f at most once. Example 6.2.5. Consider the functions graphed below. x f(x) A b x g(x) x h(x) • f is not one-to-one because the horizontal line passing through the point A cuts the graph of f more than once; • g is one-to-one (in fact, since g is increasing, every horizontal line can cut the graph of g graph no more than once); • h is also one-to-one (even though it is not always increasing). Although not every one-to-one function is increasing (or decreasing), it is true that every in- creasing function is one-to-one. Proposition 6.2.6. If a function f is either increasing or decreasing, then f is one-to-one. Sketch proof. If f is increasing and x1, x2 ∈ Dom(f), then x1 < x2 ⇐⇒ f(x1) < f(x2). (6.3) Now compare (6.3) with Definition 6.2.1. c©2020 School of Mathematics and Statistics, UNSW Sydney 6.3. INVERSE FUNCTIONS 127 Example 6.2.7. Show that the function f : R→ R, given by f(x) = 2x5 + x3 + x− 10, is one-to-one. Solution. The derivative of f is given by f ′(x) = 10x4 +3x2 +1. Since f ′(x) ≥ 1 for all real x, the function f is increasing and hence one-to-one. Remark 6.2.8. Not every function whose derivative is only positive (or only negative) on its domain is one-to-one. For example, on Dom(tan) d dx tanx = sec2 x ≥ 1, but tan is not one-to-one, nor is it increasing on its domain. The problem here is that the domain of tan has gaps. 6.3 Inverse functions (Ref: SH10 §7.1) We have already mentioned the term inverse function in Section 6.1. In this section we define precisely what is meant by an inverse function and explore some of the relationships between a function and its inverse. Our starting point is the following theorem. It says that if f is one-to-one, then there is a function g which undoes what f does to any input x (c.f. equation (6.1)). Theorem 6.3.1. Suppose that f is a one-to-one function. Then there is a unique function g satisfying g(f(x)) = x ∀x ∈ Dom(f) (6.4) and f(g(x)) = x ∀x ∈ Range(f). (6.5) Moreover, Dom(g) = Range(f), Range(g) = Dom(f) and g is one-to-one. Proof. Suppose that f is one-to-one and let D and R denote the domain and range of f respectively. Since f is one-to-one, for every y in R there is a unique x in D such that y = f(x). Now define a function g : R→ D by the following rule: if y ∈ R then g(y) is the unique x in D such that f(x) = y. It is left as an exercise to show that g is one-to-one and that equations (6.4) and (6.5) hold. The theorem allows us to define the term inverse function. Definition 6.3.2. Suppose that f is a one-to-one function. Then the inverse func- tion of f is the unique function g given by Theorem 6.3.1. The inverse function for f is often written as f−1. c©2020 School of Mathematics and Statistics, UNSW Sydney 128 CHAPTER 6. INVERSE FUNCTIONS Remark 6.3.3. If f−1 denotes the inverse function of a one-to-one function f , then equations (6.4) and (6.5) can be expressed as f−1(f(x)) = x ∀x ∈ Dom(f) and f(f−1(x)) = x ∀x ∈ Range(f). (Warning: This notation is potentially confusing if not understood correctly. The function f−1 is not the reciprocal function of f (that is, 1/f(x)). To denote the reciprocal of f(x) using index notation we write [f(x)]−1 rather than f−1(x).) Remark 6.3.4. If g is the inverse of a function f , then f is the inverse of g. This is due to the symmetry between f and g in the statements of Theorem 6.3.1. If f is a one-to-one function then equation (6.5) can often be used to find an explicit formula for its inverse g. Example 6.3.5. Suppose that f : R→ R is given by f(x) = 4− 13x3. (a) Explain why f has an inverse function. (b) Find the inverse function g of f . (c) Sketch f and g on the Cartesian plane. (d) Use the inverse function g to solve the equation f(x) = 13. Solution. (a) Since f ′(x) = −x2 ≤ 0 for all x in R (with f ′(x) = 0 only when x = 0), the function f is decreasing and is therefore one-to-one. Hence f has an inverse function. (b) Let g denote the inverse function of f . Then Dom(g) = Range(f) = R and Range(g) = Dom(f) = R. To find an explicit formula for g(x), suppose that y = g(x). If we apply f to both sides of the equation, and use (6.5), then f(y) = f(g(x)) = x. Using the definition of f , this simplifies to 4− 1 3 y3 = x, whereupon rearrangement gives y = 3 √ 12 − 3x. Therefore g : R→ R is given by g(x) = 3 √ 12− 3x. (6.6) (c) The graphs of f and g are shown below. Note that each graph is the reflection the other in the line y = x. c©2020 School of Mathematics and Statistics, UNSW Sydney 6.3. INVERSE FUNCTIONS 129 x y y = x y = f(x) y = g(x) (d) To solve the equation f(x) = 13, apply g to both sides. This gives g(f(x)) = g(13). By (6.4) and (6.6) this simplifies to x = 3 √ 12− 3× 13 = −3. Remark 6.3.6. As was illustrated in Example 6.3.5 (c), the graph of a one-to-one function f and its inverse g are reflections of each other in the line y = x. We give a proof of this fact below: (x, y) lies on the graph of f ⇐⇒ y = f(x) ⇐⇒ g(y) = g(f(x)) ⇐⇒ g(y) = x ⇐⇒ (y, x) lies on the graph of g. If a function f is not one-to-one, then we can sometimes restrict the domain of f so that it becomes one-to-one. The function f with restricted domain then has an inverse function. Example 6.3.7. Consider the function f : R→ R given by f(x) = 4− x2. Find a restriction of f such that f becomes one-to-one. Find the inverse function g of this restriction. Solution. A quick sketch of the graph of f (see Figure 6.2 (a)) shows that f is not one-to-one. If we restrict the domain of f to [0,∞), then the restricted function f : [0,∞)→ R passes the horizontal line test and is therefore one-to-one. Let g denote the inverse of f : [0,∞)→ R. Then g : (−∞, 4]→ [0,∞) c©2020 School of Mathematics and Statistics, UNSW Sydney 130 CHAPTER 6. INVERSE FUNCTIONS x y y = f(x) 4 −2 2 (a) x y y = f(x) y = g(x) 4 4 (b) x y y = f(x) y = g(x) 4 4 (c) Figure 6.2: Two different restrictions of the function f , and their corresponding inverses (see Example 6.3.7). since Dom(g) = Range(f) = (−∞, 4] and Range(g) = Dom(f) = [0,∞). We find an explicit formula for g: y = g(x) f(y) = x (by equation 6.5) 4− y2 = x y = √ 4− x (taking the positive root since Range(g) = [0,∞)). Hence g(x) = √ 4− x. The graphs of f and g are sketched in Figure 6.2 (b). Remark 6.3.8. In the solution to Example 6.3.7, one could have instead chosen the restriction f : (−∞, 0]→ R. The corresponding inverse g is given by g : (−∞, 4]→ (−∞, 0], g(x) = − √ 4− x2. The graphs of f and g are given in Figure 6.2 (c). Example 6.3.9. Consider the function f : R→ R, given by f(x) = −x3 + 3x2 + 24x − 13. Find all intervals I, each as large as possible, such that the function f : I → R (with domain restricted to I) has an inverse function. 6.4 The inverse function theorem (Ref: SH10 §§7.1, B.3) If a function f is one-to-one then its inverse function g is also one-to-one (see Theorem 6.3.1). What other properties of f does its inverse function g inherit? First, if f is continuous, then so is g. (A rough argument for this fact would go like this: if f is continuous then its graph can be drawn as an unbroken line. Now reflect this graph in the line y = x. The reflected graph is also an unbroken line. Moreover it corresponds to the graph of g. Hence g is continuous.) c©2020 School of Mathematics and Statistics, UNSW Sydney 6.4. THE INVERSE FUNCTION THEOREM 131 What about differentiability? If a one-to-one function f is differentiable then its inverse g is not always differentiable. For example, if f : R→ R is given by f(x) = x3, then f is differentiable everywhere. Its inverse function g is given by g : R→ R, g(x) = 3√x. Since g′(x) = 1 3 √ x2 , g is not differentiable at 0. So while f is differentiable everywhere, its inverse g is not. The problem with the above example occurs where the derivative of f is zero. When a horizontal tangent to f is reflected in the line y = x, it becomes a vertical tangent to g and hence g is not differentiable at this point. x y However, if f is a differentiable function whose derivative is never zero, its inverse g will also be differentiable. Theorem 6.4.1 (The inverse function theorem). Suppose that I is an open interval, f : I → R is differentiable and f ′(x) 6= 0 for all x in I. Then (i) f is one-to-one and has an inverse function g : Range(f)→ Dom(f), (ii) g is differentiable at all points in Range(f), and (iii) the derivative of g is given by the formula g′(x) = 1 f ′(g(x)) for all x in Range(f). Some parts of the inverse function theorem are easy to prove while others are hard. Statement (i) is true since f ′(x) 6= 0 for x in I implies that f is either increasing on I or decreasing on I (by the mean value theorem). Hence f is one-to-one. Statement (ii) is difficult to prove and involves a delicate limiting argument with the difference quotients of f and g. It is important that students understand the proof of statement (iii). Suppose (by statement (ii)) that g is differentiable. Beginning with the equation f(g(x)) = x (see 6.5), differentiation with respect to x gives f ′(g(x)).g′(x) = 1 c©2020 School of Mathematics and Statistics, UNSW Sydney 132 CHAPTER 6. INVERSE FUNCTIONS by the chain rule. Since f ′ is never zero on I, we can divide by f ′(g(x))) to obtain g′(x) = 1 f ′(g(x)) as required. In the next section the inverse function theorem will be applied to the trigonometric functions. 6.5 Applications to the trigonometric functions (Ref: SH10 §7.7) In elementary trigonometry, the sine function gives the ratio ba of two side lengths of a right-angle triangle corresponding to the angle θ. a b c θ However, if we know the ratio and instead want to find the corresponding angle, we use the inverse sine function. Thus relationship between the sine and inverse sine functions is given by sin θ = b c , sin−1 b c = θ. (6.7) This relationship works in elementary trigonometry where the angle θ is acute. However, if θ is allowed to be any real number then the relationship between sin and sin−1 is not so straightforward. This is because the function sin : R→ R is not one-to-one and therefore has no inverse (see Figure 6.3). In the next example, we define exactly what we mean by sin−1 and compute its derivative. Example 6.5.1 (The inverse sine function). Since the usual sine function is not a one-to-one function, we consider instead a restricted sine function sin : [−π2 , π2 ]→ [−1, 1] which is one-to-one (see Figure 6.3 (a)). This restricted function has an inverse sin−1 : [−1, 1]→ [−π2 , π2 ], which is graphed in Figure 6.3 (b). We note that sin−1 is an odd function, that is, sin−1(−x) = − sin−1 x ∀x ∈ [−π2 , π2 ]. We now examine the differentiability of sin−1. If I = (−π2 , π2 ) then the sine function is differen- tiable on I and d dx (sinx) = cos x 6= 0 c©2020 School of Mathematics and Statistics, UNSW Sydney 6.5. APPLICATIONS TO THE TRIGONOMETRIC FUNCTIONS 133 x sinx b b (π2 , 1) (−π2 ,−1) (a) x sin−1 x b b (1, π2 ) (−1,−π2 ) (b) Figure 6.3: Graphs of sin and sin−1 for all x in I. So by the inverse function theorem (Theorem 6.4.1), sin−1 is differentiable on (−1, 1). While the derivative of sin−1 could be computed using the formula of Theorem 6.4.1 (iii), we prefer to compute it directly by implicit differentiation. If y = sin−1 x then sin y = x (by applying sin to both sides) cos y dy dx = 1 (by implicit differentiation) dy dx = 1 cos y (since cos y > 0 when −π2 < y < π2 ). (6.8) We seek a simple expression for cos y in terms of x. By the trigonometric identity sin2 y+cos2 y = 1, and the fact that y = sin−1 x, we have cos y = √ 1− sin2 y = √ 1− (sin(sin−1 x))2 = √ 1− x2, (6.9) where the positive square root was taken since cos y > 0. Substituting (6.9) into (6.8) gives dy dx = 1√ 1− x2 and hence d dx (sin−1) = 1√ 1− x2 whenever −1 < x < 1. As a corollary, note that the derivative of sin−1 is positive, so sin−1 is an increasing function. The following table summarises the domain, range and derivatives of the restricted (and hence one-to-one) trigonometric functions and their inverses. The derivatives exist on the largest open intervals contained in the domain of each function. Proofs of the derivatives for the inverse trigono- metric functions are done in much the same way as illustrated in Example 6.5.1. It is expected that students will be able to derive these derivatives, using the method of Example 6.5.1, if required. The graphs of cos−1 and tan−1 are shown in Figure 6.4. c©2020 School of Mathematics and Statistics, UNSW Sydney 134 CHAPTER 6. INVERSE FUNCTIONS x cos−1 x b b b (1, 0) (−1, π) (0, π2 ) x tan−1 x π 2 −π2 b b Figure 6.4: Graphs of cos−1 and tan−1 Function Domain Range Derivative sin [−π2 , π2 ] [−1, 1] d dx (sinx) = cos x sin−1 [−1, 1] [−π2 , π2 ] d dx (sin−1 x) = 1√ 1− x2 cos [0, π] [−1, 1] d dx (cos x) = − sinx cos−1 [−1, 1] [0, π] d dx (cos−1 x) = − 1√ 1− x2 tan (−π2 , π2 ) (−∞,∞) d dx (tanx) = sec2 x tan−1 (−∞,∞) (−π2 , π2 ) d dx (tan−1 x) = 1 1 + x2 We note also that sin−1 and tan−1 are both odd functions, but that cos−1(−x) = π − cos−1 x ∀x ∈ [−1, 1]. The next example illustrates that one must be careful with the notation sin and sin−1. It is not always true that sin−1(sinx) = x. This is because sin−1 is not the inverse of sin : R→ R. It is only the inverse of sin : [−π2 , π2 ]→ R. In particular, sin−1(sinx) = x only when x ∈ [−π2 , π2 ]. Similar comments apply to the other trigonometric functions. Example 6.5.2. Evaluate the following numbers without using a calculator: (a) sin−1(sin 5π4 ), (b) sin(cos−1 35 ). Solution. (a) We cannot write sin−1(sin 5π4 ) = 5π 4 , since 5π 4 is not in the interval [−π2 , π2 ]. However, since sin−1 is an odd function, sin−1 ( sin 5π 4 ) = sin−1 ( − 1√ 2 ) = − sin−1 ( 1√ 2 ) = −π 4 . c©2020 School of Mathematics and Statistics, UNSW Sydney 6.5. APPLICATIONS TO THE TRIGONOMETRIC FUNCTIONS 135 (b) Let θ denote cos−1 35 . Then we seek sin θ. By the trigonometric identity sin2 θ + cos2 θ = 1, we have sin θ = ± √ 1− cos2 θ, which simplifies to sin θ = ± √ 1− ( 3 5 )2 = ±4 5 . Since θ = cos−1 35 > 0 we have sin θ > 0 and hence sin θ = + 4 5 , completing the question. An alternate solution to (b) is the following. Once again, let θ denote the angle cos−1 35 . This means that cos θ = 35 , whereupon we construct the right-angled triangle shown below. 4 5 θ A C B 3 By Pythagoras’ theorem, the length of side AB is 4. Hence sin θ = 45 . The final example uses theory from all chapters covered in the course so far. Example 6.5.3. A man and a woman sit on opposite edges of a canyon which is 50 metres wide and 100 metres deep. The man drops a small stone from rest into the canyon. The woman watches the stone fall to the bottom of the canyon through a telescope. At what time during the stone’s fall will the angle that the telescope makes with the horizontal change the fastest? (That is, when will the angular velocity of the telescope be greatest?) Assume that the air resistance experienced by the stone is negligible. Solution. Let x(t) denote the distance (in metres) that the stone has travelled exactly t seconds after the stone was dropped. Let θ(t) denote the angle that the telescope makes with the horizontal at t seconds. Let g denote acceleration due to gravity (g ≈ 9.8 ms−2). c©2020 School of Mathematics and Statistics, UNSW Sydney 136 CHAPTER 6. INVERSE FUNCTIONS r man woman stone 50 m 100 m x(t) θ(t) If v(t) denotes the velocity of the stone at time t, then we have dx dt = v, x(0) = 0, (6.10) dv dt = g, v(0) = 0. (6.11) From (6.11) we obtain v(t) = gt+C1. By imposing the initial condition that v(0) = 0 we see that C1 = 0. Hence v(t) = gt and substituting this into (6.10) gives dx dt = gt, x(0) = 0. It follows that x(t) = 12gt 2 + C2 and one easily shows that C2 = 0. To find out when the stone hits the canyon floor, we solve the equation 100 = 1 2 gt2 for t. Hence t = √ 200/g ≈ 4.5. So it takes about 4.5 seconds for the stone to hit the canyon floor. By examining the triangle in the above diagram, we see that tan θ = x(t) 50 = gt2 100 . To solve the problem, we need to determine the value of t when dθdt is a maximum. Now θ = tan−1 ( gt2 100 ) and so dθ dt = 1 1 + ( gt2 100 )2 × gt50 by the derivative of tan−1 and the chain rule. This simplifies to dθ dt = 200gt 104 + g2t4 . c©2020 School of Mathematics and Statistics, UNSW Sydney 6.6. MAPLE NOTES 137 For simplicity, denote dθdt by ω. We need to find when ω attains its maximum on the interval [0, √ 200/g] (that is, over the time interval when the stone is falling). To locate the critical points of ω, we calculate dωdt : dω dt = d2θ dt2 = (104 + g2t4)200g − 200gt(4g2t3) (104 + g2t4)2 = (104)200g − 600g3t4 (104 + g2t4)2 . The stationary points of ω occur when (104)200g − 600g3t4 = 0, that is, when t4 = 104 3g2 . Since t ≥ 0, we obtain t = 10/ 4 √ 3g2 ≈ 2.42. So the critical points of ω on [0, √ 200/g] are 0, 10/ 4 √ 3g2 and √ 200/g. The value of ω at each of these points is calculated below: ω(0) = 0, ω(10/ 4 √ 3g2) ≈ 1.10, ω( √ 200/g) ≈ 0.26. Clearly ω(t) attains its maximum when t = 10/ 4 √ 3g2. In conclusion, the angular velocity of the telescope is greatest 10/ 4 √ 3g2 (or about 2.26) seconds after the stone is dropped. At this time, the angular velocity is about 1.10 radians per second (that is, about 62.9 degrees per second). 6.6 Maple notes Maple knows about the inverse trigonometric functions. The functions sin−1, cos−1, tan−1, cosec−1, sec−1, cot−1 are written respectively as: arcsin, arccos, arctan, arccsc, arcsec and arccot. For example, > plot([arctan(x), arccsc(x)], x=-10..10, y=-2..2, linestyle=[dash,solid]); c©2020 School of Mathematics and Statistics, UNSW Sydney 138 CHAPTER 6. INVERSE FUNCTIONS > # Note the gap around zero in the graph of arccsc(x) --- please explain! c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 6 139 Problems for Chapter 6 Problems 6.1 : Some preliminary examples 1. [R] [V] Suppose that the functions f : [0,∞)→ [1,∞) and g : [1,∞) → [0,∞) are given by f(x) = √ 1 + x2 and g(x) = √ x2 − 1. a) By calculating (f ◦ g)(x) and (g ◦ f)(x), verify that g is the inverse function to f . b) What are the domains of f ◦ g and g ◦ f? 2. [R] a) Suppose that f : R→ R is given by f(x) = 3x+ 1. Find f−1(x). Sketch the graph of f and the graph of its inverse function, f−1, on the same diagram. b) The function g : (−∞, 0]→ R is defined by g(x) = x2 + 1. Write down the domain and range of the inverse function g−1 and find a formula for g−1(x). Find the derivative of g−1. Problems 6.2 : One-to-one functions and 6.3 : Inverse functions 3. [R] Show that the function f : R→ R, given by f(x) = x3 + 3x+ 1, has an inverse function whose domain is R. 4. [R] [V] Suppose that f : R→ R is given by f(x) = 4x+ cos x. a) Show that f has an inverse function g. b) By using the inverse function theorem, find g′(2π). 5. [R] Suppose that f : R→ R is defined by f(x) = x3 − 3x+ 1. a) Show that f : R→ R is not a one-to-one function. b) Find all possible intervals I of R, each as large as possible, such that the restricted function f : I → R has an inverse. What is the domain of each of corresponding inverse function? 6. [H] a) Can you find a quadratic function from R to R which is one-to-one? b) Can you find a cubic function from R to R which is not one-to-one? Problems 6.4 : The inverse function theorem 7. [H] For each function f : R→ R given below, find all possible intervals I of R, each as large as possible, such that the restricted function f : I → R is one-to-one. State the range of each restricted function f : I → R. What can you say about existence, domain of definition, continuity and differentiability of the corresponding inverse functions? c©2020 School of Mathematics and Statistics, UNSW Sydney 140 CHAPTER 6. INVERSE FUNCTIONS a) f(x) = x(x2 − 1)(x + 2) b) f(x) = (x+ 1)17 c) f(x) = |x| |x+ 1| Problems 6.5 : Applications to the trigonometric functions 8. [R] Simplify each expression without using a calculator. a) sin−1( √ 3/2) b) cos(cos−1(2/5)) c) sin−1(sin(5π/3)) d) cos−1(cos(−π/3)) e) cos(sin−1(3/5)) f) sin(tan−1(3/5)) g) sec−1(2) h) sin−1(sinx) when π2 ≤ x ≤ 3π2 9. [R] Sketch the graph of f : [1, 3]→ R, where f(x) = cos−1(x− 2). 10. [R] Show that a) d dx ( cos−1 x ) = −1√ 1− x2 b) d dx ( tan−1 x ) = 1 1 + x2 . 11. [R] Differentiate a) cos−1(2x) b) sin−1 √ x c) tan−1(2x− 3). 12. [R] Prove that sin−1 x+ cos−1 x is constant. For what values of x is this valid and what is the constant? 13. [H] [V] Suppose that f(x) = tan−1 x+ tan−1(1/x) whenever x 6= 0. a) Show that f ′(x) = 0 whenever x 6= 0. b) Hence evaluate f on the intervals (0,∞) and (−∞, 0). c) How do you account for this result geometrically? 14. [H] a) Draw the graph of cosec x. b) Show that cosec restricted to the interval (0, π2 ] has an inverse function. Sketch the graph of the inverse and calculate its derivative. 15. [H] A function f : R→ R is defined by f(x) = { x tan−1 ( 1√ x ) if x > 0 ax+ b if x ≤ 0, where a and b are real numbers. Find all values of a and b such that f is differentiable at 0. c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 6 141 16. [H] A lighthouse containing a revolving beacon is located 3 km from P , the nearest point on a straight shoreline. The beacon revolves with a constant rotation rate of 4 revolutions per minute and throws a spot of light onto the shoreline. How fast is the spot of light moving when it is (a) at P and (b) at a point on the shoreline 2 km from P? 17. [H] [V] A picture 2 metres high is hung on a wall with its bottom edge 6 metres above the eye of the viewer. How far from the wall should the viewer stand for the picture to subtend the largest possible vertical angle with her eye? c©2020 School of Mathematics and Statistics, UNSW Sydney 142 CHAPTER 6. INVERSE FUNCTIONS c©2020 School of Mathematics and Statistics, UNSW Sydney 143 Chapter 7 Curve sketching There are a variety of ways to describe curves that lie in the plane. In this chapter we study curves which are described by using • a Cartesian equation (for example, y = x2 or y = √1− x2), • a parameter, and • polar coordinates. Each section of this chapter is devoted to one of these methods. The use of Cartesian equations will already be familiar. However, many curves cannot be easily described by Cartesian equations. The following diagrams illustrate two curves that are better described by a parameter or with polar coordinates. x y An example of a parametric curve: x(t) = sin t cos t ln |t| y(t) = √ |t| cos t t ∈ [−1, 1], t 6= 0 x y An example of a polar curve: r = | cos(4θ)| 7.1 Curves defined by a Cartesian equation (Ref: SH10 §§4.7, 4.8) c©2020 School of Mathematics and Statistics, UNSW Sydney 144 CHAPTER 7. CURVE SKETCHING In this section we survey techniques for sketching curves that are described by a Cartesian equation of the form y = f(x). 7.1.1 A checklist for sketching curves Students will already be familiar with techniques for sketching the graph of a function f . For convenience we summarise some of the most useful here. • Identify the domain of f . (In particular, Dom(f) should not include any points that lead to division by 0 or the square root of a negative number.) • Identify any symmetries: f is odd if f(−x)= −f(x) ∀x ∈ Dom(f) f is even if f(−x) = f(x) ∀x ∈ Dom(f). • Find x- and y-axis intercepts. • Identify vertical asymptotes. • Examine the behaviour of f(x) as x → ±∞. (This will identify any other asymptotes that exist). • Use calculus to identify stationary points and other features, if necessary. 7.1.2 Oblique asymptotes Consider the graph of the function f that is sketched below. x y y = f(x) y = ax+ b It appears that the graph of f approaches the line y = ax+ b as x→∞. If this is indeed the case, we say that f is asymptotic to the line. Since this line is neither vertical, nor horizontal, we call it an oblique asymptote to the function f . c©2020 School of Mathematics and Statistics, UNSW Sydney 7.1. CURVES DEFINED BY A CARTESIAN EQUATION 145 Definition 7.1.1. Suppose that a and b are real numbers and that a 6= 0. We say that a straight line, given by the equation y = ax+ b, is an oblique asymptote for a function f if lim x→∞ ( f(x)− (ax+ b)) = 0. Example 7.1.2. Suppose that f is defined by the equation f(x) = x+ 1 + 2 x− 2 whenever x 6= 2. Identify all asymptotes to f and hence sketch its graph. Solution. Clearly there is a vertical asymptote when x = 2. To identify the other asymptotes (if any), we examine the behaviour of f(x) as x→∞ and as x→ −∞. Now as x → ∞, 2x−2 → 0 from above. Hence we conclude that f(x) approaches x + 1 from above. Therefore the line y = x+ 1 is an oblique asymptote to f . (The calculation lim x→∞ ( f(x)− (x+ 1)) = lim x→∞ 2 x− 2 = 0 demonstrates from Definition 7.1.1 that this is the case.) On the other hand, as x → −∞, 2x−2 → 0 from below. Therefore f(x) approaches x + 1 from below. The asymptotes for f help us construct the sketch (which is pictured below). To complete the sketch, we identify the axes intercepts. For the y-axis, we have f(0) = 0. For the x-axis, f(x) = 0 ⇒ (x+ 1)(x− 2) + 2 = 0 ⇒ x2 − x = 0 ⇒ x = 0, 1. Thus we have the following sketch: x y 1 x = 2 y = f(x) y = x+ 1 c©2020 School of Mathematics and Statistics, UNSW Sydney 146 CHAPTER 7. CURVE SKETCHING (Note that we have obtained sufficient information for sketching the graph without resorting to calculus.) If f is a rational function of the form p/q, where p and q are polynomials and deg(p) > deg(q), then the asymptotic behaviour of f may be determined by polynomial division. Example 7.1.3. Suppose that f is defined by the equation f(x) = x2 − 4 x+ 1 whenever x 6= −1. Identify all asymptotes to f and hence sketch its graph. Solution. Clearly there is a vertical asymptote when x = −1. It is also clear that f(x) → ∞ as x → ∞, but the precise asymptotic behaviour of f(x) as x → ∞ is still not clear. However, polynomial division gives x− 1 x+ 1 )x2 − 4 x2 + x −x− 4 −x− 1 −3. Hence f(x) = x− 1− 3 x+ 1 . (7.1) (See Remark 7.1.4 for an alternative method of establishing (7.1).) Since 3x+1 → 0 from above as x → ∞, we conclude that f(x) approaches x − 1 from below as x →∞. A similar observation shows that f(x) approaches x− 1 from above as x→ −∞. So the line y = x− 1 is an oblique asymptote to f at both ∞ and −∞. Finally, we calculate the axes intercepts: f(0) = −4 and f(x) = 0 ⇒ x2 − 4 = 0 ⇒ x = ±2. Hence we obtain the following sketch. c©2020 School of Mathematics and Statistics, UNSW Sydney 7.1. CURVES DEFINED BY A CARTESIAN EQUATION 147 x y x = −1 2−2 −4y = f(x) y = x− 1 (Again, we have obtained sufficient information to sketch the graph without the use of calculus.) Remark 7.1.4. We give a method for establishing equation (7.1) that uses an algebraic ‘trick’ rather than polynomial ‘long division’: f(x) = x2 − 4 x+ 1 = x2 − 1− 3 x+ 1 = (x− 1)(x+ 1)− 3 x+ 1 = (x− 1)− 3 x+ 1 . 7.1.3 Examples We complete this section with three examples. Example 7.1.5. Sketch the graph of f , where f is defined by f(x) = xe−x 2 . Solution. It is clear that Dom(f) = R. Moreover, f(−x) = −xe−x2 = −f(x) ∀x ∈ R, so f is an odd function. Therefore, it suffices to consider x ≥ 0; the rest of the graph can be obtained by symmetry. c©2020 School of Mathematics and Statistics, UNSW Sydney 148 CHAPTER 7. CURVE SKETCHING Since e−x 2 > 0, it is easy to see that the only axis intercept occurs at (0, 0). We look now at the behaviour of f(x) as x→∞. By l’Hoˆpital’s rule, lim x→∞ f(x) = limx→∞ x ex2 = lim x→∞ 1 2xex2 = 0. So y = 0 is an horizontal asymptote. Finally, we use calculus to determine the shape of the curve. By the product rule, f ′(x) = x (− 2xe−x2)+ e−x2(1) = (1− 2x2)e−x2 . Since e−x 2 > 0 for all x in R, we conclude that f ′(x) = 0 when x = 1√ 2 , while f ′(x) > 0 when 0 ≤ x < 1√ 2 and f ′(x) < 0 when x > 1√ 2 . So f is increasing on the interval (0, 1√ 2 ), stationary at 1√ 2 and decreasing on ( 1√ 2 ,∞). Now f(1/ √ 2) = 1√ 2 e−1/2 = 1√ 2e ≈ 0.429. Moreover, f ′(0) = 1, implying that the tangent line to the graph at the origin has a gradient of 1. This information can be used to sketch the graph when x ≥ 0, and the fact that f is odd allows us to deduce the shape of the graph when x < 0. x f(x) −2 21√ 2 − 1√ 2 − 1√ 2e 1√ 2e | | | | | | Example 7.1.6. Sketch the graph of f , where f is defined by f(x) = x2 − 4 + 3 x2 . The next function is important since it is later used to define the Si function (see Example 8.12.1). The Si function is used by electrical engineers for digital signal processing and by surveyors using GPS (Global Positioning System). Example 7.1.7. Sketch the graph of f , where f is defined by f(x) = sinx x if x 6= 0 1 if x = 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 7.1. CURVES DEFINED BY A CARTESIAN EQUATION 149 x y 1 √ 3− 4 √ 3 −4 2 √ 3− 4 y = f(x) y = x2 − 4 | Figure 7.1: Sketch for Example 7.1.6. c©2020 School of Mathematics and Statistics, UNSW Sydney 150 CHAPTER 7. CURVE SKETCHING 7.2 Parametrically defined curves (Ref: SH10 §§10.5, 10.6) Many moving bodies trace out a path that lies in a plane. An important problem is to describe the curve of the trajectory. We give an example. Example 7.2.1. Suppose that a stone is thrown horizontally from the deck of Sydney Harbour Bridge at 20 metres per second, and that the air resistance experienced by the stone is negligible. Let x(t) denote the horizontal distance (in metres) travelled by the stone t seconds after being thrown and let y(t) denote its height (in metres) above the water. The deck is 50 metres above sea level. water deck b stone 50m x(t) y(t) If we take the acceleration due to gravity to be 10 m/s2, then Newton’s laws of motion show that x(t) = 20t and y(t) = 50− 5t2 (7.2) when 0 < t < √ 10 (that is, after the stone is thrown and just before it hits the water). Sketch the trajectory of the stone when 0 < t < √ 10. Solution. We want to express y as a function of x. By (7.2) we have t = x(t) 20 and y(t) = 50 − 5t2. If we substitute the equation on the left into the equation on the right then y(t) = 50 − 5 ( x(t) 20 )2 which becomes y = 50− x 2 80 . So the trajectory is a parabola. It is easy to see that y = 50 when x = 0 and that x = 20 √ 10 when y = 0. We therefore obtain the following sketch. c©2020 School of Mathematics and Statistics, UNSW Sydney 7.2. PARAMETRICALLY DEFINED CURVES 151 x y 50 20 √ 10 Remark 7.2.2. The equation( x(t), y(t) ) = ( 20t, 50 − 5t2), t ∈ R is called a parametrisation of the parabola y = 50− x 2 80 . The variable t is called a parameter. Parametrisations of curves are important to study for several reasons. • They arise naturally with moving bodies, as illustrated in Example 7.2.1. In this context they also contain more information than a corresponding Cartesian equation (since a Cartesian equation only records the trajectory of the body, not the position of the body at any given time). • Many curves that cannot be described by an equation of the form y = f(x) can be described parametrically. • Even if a curve is the graph of a function, it may be much easier to describe the curve parametrically than to write down an equation describing the function. (This is the case with the cycloid, which will be studied in Example 7.2.7.) • Curves in three (or higher) dimensional space are most easily described using parameters. 7.2.1 Parametrisation of conic sections The trajectories of many heavenly bodies (such as planets and comets) can be modelled as conic sections. In this section we will give some standard parametrisations of conic sections. Example 7.2.3. Show that the curve given by the functions x(t) = a cos t, y(t) = b sin t, 0 ≤ t ≤ 2π (7.3) is an ellipse. Sketch the curve, showing how the point (x(t), y(t)) moves as t varies from 0 to 2π. Solution. Suppose that t ∈ [0, 2π]. From (7.3) we see that x(t)2 a2 = cos2 t, y(t)2 b2 = sin2 t. c©2020 School of Mathematics and Statistics, UNSW Sydney 152 CHAPTER 7. CURVE SKETCHING Since cos2 t+ sin2 t = 1, x(t)2 a2 + y(t)2 b2 = cos2 t+ sin2 t = 1. Hence the point (x(t), y(t)) lies on the ellipse x2 a2 + y2 b2 = 1. (Conversely, any point (x, y) that satisfies this equation can be expressed as x = a cos t, y = b sin t for some t in R.) In fact, it is easy to see that, as t moves from 0 to 2π, (x(t), y(t)) traces out the entire ellipse. This is illustrated in the diagram below. x(t) y(t) b −b a−a bb b b t = 0, 2π t = π2 t = π t = 3π2 0 < t < π2 π 2 < t < π π < t < 3π2 3π 2 < t < 2π The table below lists some commonly used parametrisations of conic sections. Conic section Cartesian equation Parametric equation Parabola 4ay = x2 x(t) = 2at y(t) = at2 Circle x2 + y2 = a2 x(t) = a cos t y(t) = a sin t Ellipse x2 a2 + y2 b2 = 1 x(t) = a cos t y(t) = b sin t Hyperbola x2 a2 − y 2 b2 = 1 x(t) = a sec t y(t) = b tan t c©2020 School of Mathematics and Statistics, UNSW Sydney 7.2. PARAMETRICALLY DEFINED CURVES 153 There are also other parametrisations for each of these curves. (We give a second parametrisation of the hyperbola in Chapter 10.) In Examples 7.2.1 and 7.2.3, we sketched a parametrically defined curve by first eliminating the parameter t. It is not always possible to do this, as the next example illustrates. Such curves can be sketched with the aid of a table of values. Example 7.2.4. Sketch the curve defined parametrically by( x(t), y(t) ) = (t2 cos t, t2 sin t), 0 ≤ t ≤ 3π. (7.4) Solution. We begin by noting that the above parametric equation is similar to the parametrisation( x(t), y(t) ) = (a cos t, a sin t) (7.5) of the circle. The only difference is that the points in (7.5) have a fixed distance a from the origin, whereas the points in (7.4) have a variable distance t2 from the origin. So intuitively, as t increases, the point (x(t), y(t)) in (7.4) rotates about origin, while at the same time getting further away from the origin. Therefore it is likely that the curve is a spiral. The following table of values helps verify this reasoning. t 0 π2 π 3π 2 2π 5π 2 3π x 0 0 −π2 0 4π2 0 −9π2 y 0 π 2 4 0 9π2 4 0 25π2 4 0 We plot these points on the Cartesian plane and interpolate between them to produce the following sketch. x(t) y(t) −π2 −9π24 4π2 25π2 4 −9π2 π2 4 b b b b b b b c©2020 School of Mathematics and Statistics, UNSW Sydney 154 CHAPTER 7. CURVE SKETCHING 7.2.2 Calculus and parametric curves Given a smooth curve defined using a parameter t, one would like to calculate its gradient at any point along the curve. The following proposition enables us to do this without having to eliminate t. Proposition 7.2.5. Suppose that x and y are both differentiable functions of t and that y is a function of x. If x′(t) 6= 0 then dy dx = y′(t) x′(t) . (7.6) Proof. Since y is a function of x, we write y = f(x). Hence y(t) = f(x(t)). Differentiating both sides with respect to t gives y′(t) = df dx (x(t))x′(t) by the chain rule. Since x′(t) 6= 0, df dx (x(t)) = y′(t) x′(t) which is equivalent to (7.6). Example 7.2.6. A curve is described parametrically by( x(t), y(t) ) = (t2 − 1, t3 + 2t), t ∈ R. Find the equation of the tangent to the curve at the point (0, 3). Solution. By Proposition 7.2.5, dy dx = y′(t) x′(t) = 3t2 + 2 2t (7.7) provided that t 6= 0. It is easy to see that t2 − 1 = 0 and t3 + 2t = 3 only when t = 1. By substituting this into (7.7), we find that the gradient of the curve at (0, 3) is 5/2. Hence the equation of the tangent at (0, 3) is given by y = 5x 2 + 3. c©2020 School of Mathematics and Statistics, UNSW Sydney 7.2. PARAMETRICALLY DEFINED CURVES 155 7.2.3 The cycloid and curve of fastest descent Suppose that a point A is higher than a point B. What curve, starting at A and ending at B, would enable a particle moving under the influence of gravity (and ignoring friction) to move from A to B in the shortest time? b b A B Curve of fastest descent. Such a curve is known as a curve of fastest descent or a brachistochrone (which, in Greek, means ‘shortest time’). In 1638 Galileo stated that the curve of fastest descent is the arc of a circle. This was shown to be false by Johann Bernoulli who found the correct description of the curve. In 1696 he issued a public challenge for others to find this description. Four mathematicians responded with a correct answer: Jakob Bernoulli (Johann’s brother), Gottfried Leibniz, Isaac Newton and Guillaume de l’Hoˆpital. The curve is the arc of a cycloid. Example 7.2.7 (The cycloid). A circle of radius r rolls along the x-axis, starting from the origin as shown below. Describe the locus (x(t), y(t)) of the point P on the edge of the circle that satisfies (x(0), y(0)) = (0, 0) x y P P P P The cycloid. Solution. The solution will be built up over three steps, of which the first two correspond to the diagrams below. x y θ P (x, y) t rt Step 1. x y t rt rt y = −r Step 2. P (x, y) c©2020 School of Mathematics and Statistics, UNSW Sydney 156 CHAPTER 7. CURVE SKETCHING Step 1. First we consider a circle of radius r centred at the origin. If the circle rotates about its centre by t radians in a clockwise direction (as shown) then x(t) = r cos θ = r cos(3π2 − t) = r ( cos 3π2 cos t+ sin 3π 2 sin t ) = −r sin t, y(t) = r sin θ = r sin(3π2 − t) = r ( sin 3π2 cos t− cos 3π2 sin t ) = −r cos t and the point P sweeps out a distance of rt. Step 2. We now consider what happens when the circle rolls along the line y = −r. If the circle rotates by t radians, then the centre of the circle moves to the right by rt (corresponding to the distance that P sweeps out – see Step 1). Hence x(t) = −r sin t+ rt. By Step 1 we still have y(t) = −r cos t. Step 3. We retain the same situation as in Step 2, except that we shift the circle up by r (so that the circle rolls along the line y = 0 rather than along the line y = −r. Thus y(t) = −r cos t+ r. By Step 2 we still have x(t) = −r sin t+ rt. Final solution. The locus of the curve is given by x(t) = r(t− sin t), y(t) = r(1− cos t) where t ≥ 0. Remark 7.2.8. The curve of fastest descent from A to B is the unique arc of an (inverted) cycloid whose tangent at A is vertical. 7.3 Curves defined by polar coordinates (Ref: SH10 §§10.2, 10.3) In this final section, we introduce a different coordinate system for the plane. Many equations in this system correspond to beautiful curves. 7.3.1 Polar coordinates Every point P in plane can be specified by • (x, y), where x is the horizontal distance of P from the origin, and y is the vertical distance, or • (r, θ), where r is the distance of P from the origin and θ is the angle (taken in the anticlockwise direction) between OP and the positive horizontal axis. c©2020 School of Mathematics and Statistics, UNSW Sydney 7.3. CURVES DEFINED BY POLAR COORDINATES 157 b P O x y b P O θ r The pair (x, y) is called the Cartesian coordinate of P and the pair (r, θ) are called a polar coordinate of P . (If P is the origin then r = 0 and θ is not defined.) A polar coordinate (r, θ) of P is related to the Cartesian coordinate (x, y) of P by the formulae x = r cos θ y = r sin θ r = √ x2 + y2 tan θ = y x , provided x 6= 0. (7.8) This is easily seen using trigonometry, Pythagoras’ theorem and the diagram below. b P (x, y) θ r r cos θ r sin θ When converting between the polar coordinates and Cartesian coordinates, it is best to draw a diagram. Example 7.3.1. Suppose that the polar coordinates of P and Q are given by (2, π4 ) and (3, 7π 6 ) respectively. Find the Cartesian coordinates for P and Q. Solution. First we plot these points on the Cartesian plane. x y P (2, π4 ) π 4 2 Q(3, 7π6 ) 13π 6 3 π 6 From the diagram, it is clear that the Cartesian coordinate of P is given by( 2 cos π 4 , 2 sin π 4 ) = ( 2√ 2 , 2√ 2 ) = ( √ 2, √ 2), while the Cartesian coordinate of Q is given by( −3 cos π 6 ,−3 sin π 6 ) = ( − √ 3,−3 2 ) . c©2020 School of Mathematics and Statistics, UNSW Sydney 158 CHAPTER 7. CURVE SKETCHING Example 7.3.2. Suppose that the Cartesian coordinate of R is given by (−2, 2√3). Find a polar coordinate for R. Solution. We begin with the following diagram. x yR(−2, 2√3) 2 2 √ 3 r θ α In the diagram, tanα = 2 √ 3 2 = √ 3, so α = π3 . Hence θ = π − α = 2π 3 . By Pythagoras’ theorem, r2 = 22 + (2 √ 3)2 = 16. Hence in polar coordinates, R can be written as (4, 2π3 ). Remark 7.3.3. Note that the point R has no unique polar coordinate. For example (4, 2π3 + 2π) and (4, 2π 3 − 2π) are also polar coordinates for R. In practice, we often choose the value for θ such that −π < θ ≤ π. 7.3.2 Basic sketches of polar curves Many curves can be described by an equation relating the polar variables r and θ. We begin with the simplest case: when either r or θ are constant. Example 7.3.4. Sketch the curves corresponding to the the equations (a) r = 3 (b) θ = 3π4 . Solution. (a) Since r = 3, we seek all points in the plane whose distance from the origin is 3. This is a circle of radius three centred at (0, 0). (b) This is the collection of all points P such that the angle between OP and the positive x-axis is 3π4 . Hence the curve is the ray sketched below with the origin excluded. c©2020 School of Mathematics and Statistics, UNSW Sydney 7.3. CURVES DEFINED BY POLAR COORDINATES 159 x y r = 3 3 3 x y θ = 3π4 3π 4 To sketch a curve, described by the polar equation r = f(θ), in the xy-plane, it may be helpful to first sketch r = f(θ) in the rθ-plane. Example 7.3.5. Sketch the curve whose polar equation is r = 2− cos θ. Solution. First we graph r against θ. θ r r = 2− cos θ | | | | | | | | | | | | 3 2 + 1√ 2 2 2− 1√ 2 1 π 4 π 2 3π 4 π 5π 4 3π 2 7π 4 2π b b b b b b b b b a b c d e f g h i The r-value of each point a, b, c, ..., i in the rθ-plane corresponds to the distance from the origin to each point a′, b′, ..., i′ in the xy-plane. x y θ = 0 θ = π4 θ = π2 θ = 3π4 θ = π θ = 5π4 θ = 3π2 θ = 7π4 b b b b b b b b r = 1 r = 2 r = 3 r = 2 a′ b′ c′ d′ e′ f ′ g′ h′ i′ c©2020 School of Mathematics and Statistics, UNSW Sydney 160 CHAPTER 7. CURVE SKETCHING Our task now is to interpolate between the points a′, b′, ..., i′ to obtain a sketch of the curve. Consider what happens to r as θ moves from 0 to 2π. • On the rθ-plane, as θ moves from 0 to π, r increases from 1 to 3. So on the xy-plane, if we travel along the curve in an anticlockwise direction from the positive to the negative x-axis, then the distance r from the origin to points on the curve increases from 1 to 3. • On the rθ-plane, as θ moves from π to 2π, r decreases from 3 to 1. So on the xy-plane, if we travel along the curve in an anticlockwise direction from the negative to the positive x-axis, then the distance r from the origin to points on the curve decreases from 3 to 1. These considerations lead to the final sketch. x y θ = π4θ = 3π 4 θ = 5π4 θ = 7π 4 r = 2− cos θ b b b b b b b b 1 2 −3 −2 We list a few more helpful tips for sketching a curve described by the polar equation r = f(θ). • If f is an even function (that is, f(−θ) = f(θ)) then the polar curve is symmetric about the x-axis. • If f(π − θ) = f(θ) then the polar curve is symmetric about the y-axis. • If f is 2π–periodic then it suffices to consider θ in the range 0 ≤ θ < 2π. • Sometimes it is possible to rewrite the equation r = f(θ) in terms of x and y only. The use of calculus will be discussed in the next subsection. Example 7.3.6. Sketch the curve that is described by the polar equation r = 2 sin θ, 0 ≤ θ ≤ π. Solution. Graphing r against θ, followed by y against x leads to the following two diagrams. c©2020 School of Mathematics and Statistics, UNSW Sydney 7.3. CURVES DEFINED BY POLAR COORDINATES 161 θ r r = 2 sin θ | | | | | | 2 2√ 2 π 4 π 2 3π 4 π b b b b b x y θ = pi 4 θ = 3pi 4 r = 2 sin θ b b b b r = 0 r = 2 r = √ 2r = √ 2 The curve looks as though it is a circle, an observation that can be confirmed by rewriting r = 2 sin θ in terms of x and y: r = 2 sin θ r2 = 2r sin θ (multiply by r) x2 + y2 = 2y (by (7.8)) x2 + y2 − 2y + 1 = 1 (completing the square) x2 + (y − 1)2 = 1. Therefore the polar curve is a circle with Cartesian centre (0, 1) and radius 1. Example 7.3.7. Sketch the curve that is described by the polar equation r = 4| sin(3θ)|. 7.3.3 Sketching polar curves using calculus Suppose that a curve can be expressed in polar form as r = f(θ). Since x = r cos θ and y = r sin θ, the curve can also be written in parametric form (with parameter θ) as x(θ) = f(θ) cos θ and y(θ) = f(θ) sin θ. Hence, dy dx = y′(θ) x′(θ) = f(θ) cos θ + f ′(θ) sin θ −f(θ) sin θ + f ′(θ) cos θ by the product rule. Since r = f(θ), this is usually written as dy dx = r cos θ + drdθ sin θ −r sin θ + drdθ cos θ . (Rather than memorising this formula, we recommend that students understand how it is obtained.) Knowing the value of derivative of the curve at various points allows for greater accuracy in sketching. We illustrate with one example. Example 7.3.8. Sketch the curve described by the polar equation r = 1− sin θ. c©2020 School of Mathematics and Statistics, UNSW Sydney 162 CHAPTER 7. CURVE SKETCHING Solution. First we graph r against θ and then give a preliminary sketch of y against x. θ r r = 1− sin θ | | | | 2 1 + 1√ 2 1 1− 1√ 2 | | | | | | | | π 4 π 2 3π 4 π 5π 4 3π 2 7π 4 2π b b b b b b b b b x y θ = π4 θ = 5π4 θ = 3π4 θ = 7π4 r = 1− sin θ 1−1 −2 b b b b b b b b To obtain a clearer idea of the behaviour of the curve, we calculate dydx . We have x = r cos θ = (1− sin θ) cos θ and y = r sin θ = (1 − sin θ) sin θ. Hence x′(θ) = cos θ(− cos θ) + (1− sin θ)(− sin θ) = − cos2 θ − sin θ + sin2 θ = 2 sin2 θ − sin θ − 1 = (2 sin θ + 1)(sin θ − 1) and y′(θ) = cos θ − 2 sin θ cos θ = cos θ(1− 2 sin θ). Thus dy dx = y′(θ) x′(θ) = cos θ(1− 2 sin θ) 2 sin2 θ − sin θ − 1 . To locate horizontal and vertical tangents, we examine where x′(θ) = 0 and y′(θ) = 0. Solving y′(θ) = 0 gives cos θ = 0 or sin θ = 1 2 , that is, θ = π 2 , 3π 2 , π 6 , 5π 6 . Solving x′(θ) = 0 gives sin θ = 1 or sin θ = −1 2 , c©2020 School of Mathematics and Statistics, UNSW Sydney 7.3. CURVES DEFINED BY POLAR COORDINATES 163 that is θ = π 2 , 7π 6 , 11π 6 . So horizontal tangents occur when θ = 3π 2 , π 6 , 5π 6 and vertical tangents occur when θ = 7π 6 , 11π 6 . (Understanding how the derivative behaves when θ is close to π2 is harder since x′(π2 ) = y ′(π2 ) = 0. A full analysis is given in Remark 7.3.9.) Finally, we observe that dydx = −1 when θ = 0 and that dydx = 1 when θ = π. Hence we obtain the following sketch. (The abbreviations HT and VT, which appear in the diagram, stand for ‘horizontal tangent’ and ‘vertical tangent.’) x y θ = π6 θ = 7π6 θ = 5π6 θ = 11π6 HTHT HT VT VT gradient of tangent is −1gradient of tangent is 1 r = 1− sin θ 1−1 −2 Remark 7.3.9. [X] In the above example, the curve has a ‘vertical cusp’ at the origin. This is seen by analysing dydx when θ is close to π 2 . By l’Hoˆpital’s rule we see that lim θ→pi 2 − x′(θ) y′(θ) = lim θ→pi 2 − 2 sin2 θ − sin θ − 1 cos θ − 2 sin θ cos θ = lim θ→pi 2 − 4 sin θ cos θ − cos θ − sin θ − 2(cos θ cos θ − sin θ sin θ) = lim θ→pi 2 − 0 1 = 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 164 CHAPTER 7. CURVE SKETCHING Hence dy dx = y′(θ) x′(θ) →∞ as θ → π 2 − . By symmetry, dy dx = y′(θ) x′(θ) → −∞ as θ → π 2 + . Therefore the derivative has an infinite discontinuity when θ = π2 . Such a point is called a ‘cusp point.’ Remark 7.3.10. In most cases, a reasonable sketch of a polar curve can be made without appealing to calculus. As seen in the example above, while employing calculus gives a more accurate sketch, it also dramatically increases the amount of work and time required. 7.4 Maple notes It is easy to draw parametric curves using Maple. Maple also knows about polar coordinates, but you should be aware that Maple allows r < 0. For example, > # Draw a hyperbola via parametric specification > plot([3*sec(t), 2*tan(t), t=-Pi/2..3*Pi/2], x=-10..10, y=-8..8, numpoints=100); > # What is the reason for the straight lines through the origin in graph above? > plot([sin(5*theta), theta, theta=-Pi..Pi], coords=polar); c©2020 School of Mathematics and Statistics, UNSW Sydney 7.4. MAPLE NOTES 165 > plot([2+cos(5*t), t, t=-Pi..Pi], coords=polar); c©2020 School of Mathematics and Statistics, UNSW Sydney 166 CHAPTER 7. CURVE SKETCHING Problems for Chapter 7 Problems 7.1 : Curves defined by a Cartesian equation 1. [R] Find the maximal domain and range of the function f , given by f(x) = √ 5 + 4x− x2, and sketch its graph. 2. [R] Write down the period of each of the following functions f (where possible). Determine which are odd or even. Sketch the graph of each function. a) f(x) = sin 3x b) f(x) = 1 + sin(2x/3) c) f(x) = x sinx d) f(x) = tan 3x e) f(x) = cos2 x f) f(x) = sinx+ cosx 3. [R] Suppose that f is an odd function (not everywhere zero). Determine whether each function g below is odd, even or neither. a) g(x) = x2f(x) b) g(x) = x3f(x) c) g(x) = x2 + f(x) d) g(x) = x3 + f(x) e) g(x) = sin(f(x)) f) g(x) = f(cos x) 4. [R] For each function f , identify any vertical and oblique asymptotes and hence sketch the graph. (Do not use calculus.) a) f(x) = x+ 2 + 1 x− 3 b) f(x) = x2 − 2 x+ 1 c) [H] f(x) = x3 − 7x+ 8 x2 + x− 6 5. [R] Sketch the following curves, showing their main features. a) y = x2 + 1 x2 b) [V] y = x− 1 x− 2 c) y = e −x2/2 d) y = xe−x e) y2 = x(x− 4)2 f) y = x 2 x− 2 g) y = x2 − 1 x2 − 2x h) y = x cos −1 x 6. [H] (Longer rather than difficult) Suppose that y = 3x2 − 10x+ 3 3x2 + 10x+ 3 . a) Find the values of x for which y ≥ 0. b) Find the asymptotes. c) Find the turning points. d) Find the domain and range. e) Sketch the graph. Problems 7.2 : Parametrically defined curves 7. [R] Sketch the curves given by the following parametric equations. Also find, where possible, a Cartesian equation for the curve. c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 7 167 a) x = 4cos t, y = 5 sin t b) x = 3 sec t, y = 2 tan t c) x = t3, y = t2 d) x = et cos t, y = et sin t . 8. [R] For each of the curves given in parametric form by a) { x = 1− t y = 1 + t b) { x = 3t+ 2 y = t4 − 1 c) { x = cos t y = sin t, i) find the points on the curve corresponding to t = −1, 0, 1, and 2; ii) find any point on the curve where y = 0; iii) find dy dx as a function of t. 9. [R] [V] a) Find the equation of the normal to the curve x = t t+ 1 , y = t t− 1 at the point P when t = 2. b) Eliminate t from the above equations and find the gradient of the normal at P using the Cartesian form. 10. [H] Consider a fixed circle of radius 1 centred at the origin and a smaller circle of radius 14 initially centred at (34 , 0). The smaller circle rolls (without slipping) around the inside rim of the larger circle such that the centre Q of the smaller circle moves in an anticlockwise direction. A point P , fixed on the rim of the smaller circle and initially with coordinates (1, 0), traces out a curve as the smaller circle moves inside the larger circle. x y O Q P Initial configuration x y O θ Q P Configuration after motion has begun The goal of this question is to find the Cartesian form of the trajectory of P . Let θ denote the angle (in radians) between OQ and the positive x-axis, as shown in the above diagram. a) Explain why ⇀ OQ = 34 (cos θ, sin θ). c©2020 School of Mathematics and Statistics, UNSW Sydney 168 CHAPTER 7. CURVE SKETCHING b) [X] Explain why ⇀ QP = 14 (cos(−3θ), sin(−3θ)). c) Show that ⇀ OP = (cos3 θ, sin3 θ). (You may find techniques from MA1131 Algebra useful here.) d) Hence the trajectory of P is given by x = cos3 θ, y = sin3 θ, 0 ≤ θ ≤ 2π. By using an appropriate trigonometric identity, eliminate θ to find the cartesian equation of the trajectory of P . e) Sketch the curve corresponding to this equation. (This curve is called an astroid after the Greek word for ‘star’.) 11. [R] [V] In the 1960s two French car engineers, Paul de Casteljau and Pierre Bezier, inde- pendently discovered a remarkable new approach to parameterising curves. Let’s introduce their approach by finding a quadratic curve determined by the control points A(1, 1), B(2, 2) and C(3, 1) with respecitve coordinate vectors a,b and c. a) The linear de Casteljau Bezier curve determined by A and B is given parametrically as p(t) = (1− t)a+ tb. This is called the linear interpolation of the vectors a and b. Find a parametric vector equation for p(t) and evaluate p(0) and p(1). b) Similarly, the linear de Casteljau Bezier determined by B and C is q(t) = (1− t)b+ tc. Find a Cartesian equation for this line, and evaluate q(1/2). c) The quadratic de Casteljau Bezier curve r(t) determined A,B and C, is the linear inter- polation of the vectors p(t) and q(t): r(t) = (1− t)p(t) + tq(t). Find an explicit expression for r(t) and find the polynomials p0, p1 and p2 such that r(t) = p0(t)a+ p1(t)b+ p2(t)c. Problems 7.3 : Curves defined by polar coordinates 12. [R] The following points are given in polar coordinate form. Plot them on a diagram and find their Cartesian coordinates. a) (3, 0) b) (6, 7π/6) c) (2, 7π/4) 13. [R] Convert these Cartesian coordinates into polar forms with r ≥ 0 and −π < θ ≤ π. a) (−3, 0) b) (−1,−1) c) (−2, 2√3) d) (0, 1) e) (−2√3, 2) f) (−2√3,−2) c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 7 169 14. [R] Sketch the graph corresponding to each polar equation. a) r = 4 b) θ = 2 c) r = 3θ, for θ ≥ 0. 15. [R] a) Express r = 6 sin θ, where 0 ≤ θ ≤ π, in Cartesian form and hence draw its graph. b) Repeat this for r = 2cos θ, where −π/2 ≤ θ ≤ π/2. 16. [R] Sketch the graph corresponding to each polar equation. a) r = 2 + sin θ b) r = 3 + cos θ c) [V] r = 2− 2 cos θ d) r = 2| cos θ| e) r = 3| sin 6θ| f) r = | tan θ2 | (−π < θ < π) 17. [H] [V] The hyperbolic spiral is described by the equation rθ = a whenever θ > 0, where a is a positive constant. Using the fact that lim θ→0 sin θ θ = 1, show that the line y = a is a horizontal asymptote to the spiral. Sketch the spiral. 18. [H] Show that r = 5 3− 2 cos θ is the polar equation of an ellipse by finding the Cartesian equation of the curve (and completing the square). 19. [H] Find dydx for the curves in Q16a,b,c. c©2020 School of Mathematics and Statistics, UNSW Sydney 170 CHAPTER 7. CURVE SKETCHING c©2020 School of Mathematics and Statistics, UNSW Sydney 171 Chapter 8 Integration The problem of understanding and calculating the area of regions in the plane has a long history. Some of the main contributors to this field are Archimedes (the great Greek mathematician of antiquity), Isaac Barrow (Isaac Newton’s mentor), Isaac Newton (one of the greatest mathemati- cians and physicists of all time), Gottfried Leibniz (Newton’s contemporary), Bernhard Riemann (a German mathematician of the nineteenth century) and Henri Lebesgue (a French mathemati- cian of the early twentieth century). In this chapter we see a method for calculating the area of regions with curved boundaries known as Riemann integration. We then give a startling result, known as the fundamental theorem of calculus, which connects the problem of calculating areas with antidifferentiation. This theorem was known to Barrow and its implications were developed by Newton, Leibniz and their disciples. A consequence of this theorem and other related results, is that the problem of calculating area is much the same as that of calculating mass, volume, work and probability. The unifying feature is a body of theory known as the integral calculus. 8.1 Area and the Riemann Integral (Ref: SH10 §§5.1, 5.2, B.5) In this section we examine the problem of defining the area of regions with curved boundaries. The main technique introduced is called integration, which leads to defining ‘the area under the graph of a function’ via the Riemann integral. 8.1.1 Area of regions with curved boundaries We all know how to calculate the area of a rectangle or a triangle. The area of more exotic polygons can be computed by partitioning the polygon into triangles and then summing the areas of these triangles (see Figure 8.1 (a)). Such calculations are based on an intuitive understanding of area that takes for granted that • the area of a rectangle is the product of its length and height, • areas of congruent regions are equal, and • the area of a whole region is the sum of the areas of its parts. c©2020 School of Mathematics and Statistics, UNSW Sydney 172 CHAPTER 8. INTEGRATION (a) Partition of a polygon b h (b) A = 1 2 bh b b1 b2 h (c) A = 1 2 bh Figure 8.1: Calculating the area of polygons When precisely formulated, these points could be taken together as a definition of area. With this definition, one can derive a formula for the area of a right-angled triangle (see Figure 8.1 (b)), then for an arbitrary triangle (see Figure 8.1 (c)) and then calculate the area of an arbitrary polygon (as in Figure 8.1 (a)). When it comes to calculating the area of a region with curved boundaries (see, for example, Figures 8.2 and 8.3), a more sophisticated definition of area is needed. Before asking what shape this definition might take, it is worthwhile asking the question, What is area? First, we recognise that the area of a region is a non-negative real number. Also, our intuition of area informs us that it should satisfy the following properties (see Figure 8.2 for a diagram illustrating each property). We assume throughout that the regions considered are bounded. (A1) If Ω is a region of the plane then area(Ω) ≥ 0. (A2) If one region Ω1 is contained in another region Ω2, then area(Ω1) ≤ area(Ω2). (A3) If the area of a region Ω is partitioned into two smaller disjoint regions Ω1 and Ω2, then area(Ω) = area(Ω1) + area(Ω2). (A4) If Ω1 and Ω2 are congruent regions then area(Ω1) = area(Ω2). (A5) If Ω is a rectangle of length a and height b then area(Ω) = ab. These properties are axioms that any definition of area should satisfy. Rather than give a definition for the area of any bounded region of the plane, we shall only give a definition for ‘the area under the graph of a function.’ The following example gives some justification for this decision. c©2020 School of Mathematics and Statistics, UNSW Sydney 8.1. AREA AND THE RIEMANN INTEGRAL 173 Ω (a) (A1) Ω2 Ω1 (b) (A2) Ω Ω2Ω1 (c) (A3) Ω1 Ω2 (d) (A4) Ω a b (e) (A5) Figure 8.2: Properties of area Example 8.1.1. Consider the region Ω of Figure 8.3. By partitioning Ω with straight lines as shown, one can see by axiom (A3) that area(Ω) = area(Ω1) + area(Ω2) + area(Ω3). By rotating and translating each subregion, we see by axiom (A4) that each of area(Ω1), area(Ω2) and area(Ω3) is equal to the area under the graph of a function. This procedure can be done for any region in the plane with a ‘reasonable’ boundary. Hence the problem of defining the area of a region with a curved boundary can be reduced to defining what is meant by ‘the area under the graph of a function.’ 8.1.2 Approximations of area using Riemann sums To guide us towards a definition for ‘the area under the graph of a function,’ we study the following example. Consider the function f : R→ R, given by the rule f(x) = x2. Let Ω denote the region bounded by the graph of f , the x-axis and the lines x = 0 and x = 1. The region Ω is shaded in Figure 8.4 (a). The region Ω may be ‘approximated’ with rectangles as in Figure 8.4 (b) and (c). Since the regions Ω1 and Ω2 are composed of disjoint rectangles, their areas are easily calculated using axioms (A3) and (A5) (see Subsection 8.1.1). Moreover, however we eventually choose to define area(Ω), it is clear by axiom (A2) that we want area(Ω1) ≤ area(Ω) ≤ area(Ω2). This gives a lower and upper bound for area(Ω). The regions Ω1 and Ω2 each consist of ten rectangles. By changing the number of rectangles used, we obtain different upper and lower bounds for area(Ω). Example 8.1.2. Find different upper and lower bounds for the area of the shaded region Ω of Figure 8.4 (a). c©2020 School of Mathematics and Statistics, UNSW Sydney 174 CHAPTER 8. INTEGRATION Ω (a) Ω1 Ω2 Ω3 (b) x y f1 x y f2 x y f3 (c) Figure 8.3: Partitioning of a region with curved boundary x f(x) 1 Ω (a) x f(x) 1 Ω1 (b) x f(x) 1 Ω2 (c) Figure 8.4: Area under the graph of f(x) = x2. c©2020 School of Mathematics and Statistics, UNSW Sydney 8.1. AREA AND THE RIEMANN INTEGRAL 175 x y 1 n 2 n 3 n 4 n · · · k−1 n k n · · · n−1 n n n y = f(x) Rk · · · · · · Figure 8.5: An upper Riemann sum for Example 8.1.2. The following formula will help our solution: n∑ k=1 k2 = 12 + 22 + 32 + · · ·+ n2 = 1 6 n(n+ 1)(2n + 1). (8.1) (This may be proved using mathematical induction. If students need to use such a formula in an exam, they will be given it). Solution to Example 8.1.2. Let A denote the (as yet undefined) area of the region Ω. Rather than calculating an upper bound for A by using 10 rectangles (as in Figure 8.4 (c)), we use n rectangles, where n is a positive integer (see Figure 8.5). The bases of the rectangles are constructed by subdividing the interval [0, 1] into n subintervals [0, 1n ], [ 1 n , 2 n ], [ 2 n , 3 n ], . . . , [ n−1 n , 1]. The set Pn, given by Pn = {0, 1n , 2n , 3n , . . . , n−1n , 1}, that divides the interval [0, 1] into these subintervals is called a partition of [0, 1]. Let Rk denote the area of the kth rectangle in Figure 8.5. Then Rk = width× height = 1 n × f( kn) = 1 n × ( k n )2 = 1 n3 × k2. c©2020 School of Mathematics and Statistics, UNSW Sydney 176 CHAPTER 8. INTEGRATION If SPn(f) denotes the total area of the shaded region in Figure 8.5, then SPn(f) = n∑ k=1 Rk = n∑ k=1 1 n3 × k2 = 1 n3 n∑ k=1 k2 = 1 n3 ( 12 + 22 + 32 + · · ·+ (n− 1)2 + n2) = 1 n3 × 1 6 n(n+ 1)(2n + 1) (by formula (8.1)) = 1 6 ( 1 + 1 n )( 2 + 1 n ) = 1 3 + 1 2n + 1 6n2 . The quantity SPn(f) is called the upper Riemann sum of f with respect to the partition Pn. Axiom (A2) imples that A ≤ SPn(f) = 1 3 + 1 2n + 1 6n2 , and by taking different values of n we obtain different upper bounds for A. To compute a lower bound corresponding to n rectangles, consider Figure 8.6. The area Rk of the kth rectangle is given by Rk = 1 n × f(k−1n ) = 1 n3 (k − 1)2. The sum of all the areas of the rectangles is called the lower Riemann sum for the function f over the partition Pn and is denoted by SPn(f). We see that SPn(f) = n∑ k=1 Rk = n∑ k=1 1 n3 × (k − 1)2 = 1 n3 n∑ k=1 (k − 1)2 = 1 n3 ( 12 + 22 + 32 + · · · + (n− 1)2) = 1 n3 × 1 6 (n− 1)n(2n − 1) (by formula (8.1)) = 1 6 ( 1− 1 n )( 2− 1 n ) = 1 3 − 1 2n + 1 6n2 . c©2020 School of Mathematics and Statistics, UNSW Sydney 8.1. AREA AND THE RIEMANN INTEGRAL 177 x y 1 n 2 n 3 n 4 n · · · k−1 n k n · · · n−1 n n n y = f(x) Rk · · · · · · Figure 8.6: A lower Riemann sum for Example 8.1.2. By axiom (A2), A ≥ SPn(f) = 1 3 − 1 2n + 1 6n2 . Hence for each positive integer n, the inequality 1 3 − 1 2n + 1 6n2 ≤ A ≤ 1 3 + 1 2n + 1 6n2 (8.2) gives an upper and lower bound for A. From inequality (8.2) we see that, however the area A of Ω is eventually defined, it must be squeezed between 1 3 − 1 2n + 1 6n2 and 1 3 + 1 2n + 1 6n2 for every value of n. But as n→∞, 1 3 − 1 2n + 1 6n2 → 1 3 and 1 3 + 1 2n + 1 6n2 → 1 3 . (8.3) So there is only one number A that satisfies (8.2) for every positive integer n, namely A = 13 . Our definition of the area under the graph of a function must agree with this calculation. Remark 8.1.3. The process of calculating upper and lower Riemann sums and taking a limit as in (8.3) is called integration. Guided by the ideas introduced in this subsection, we are now ready to give a definition for the area under the graph of a function. c©2020 School of Mathematics and Statistics, UNSW Sydney 178 CHAPTER 8. INTEGRATION x f(x) a0 a1 a2 a3 ak−1 ak an−2 an−1 an· · · · · · · · · · · · Figure 8.7: An example of an upper Riemann sum 8.1.3 The definition of area under the graph of a function and the Riemann integral Suppose that f is a bounded function on [a, b] and that f(x) ≥ 0 for all x in [a, b]. In this subsection we define what is meant by ‘the area under the graph of f from a to b’. This is done by constructing upper and lower Riemann sums with respect to partitions of [a, b]. Definition 8.1.4. A finite set P of points in R is said to be a partition of [a, b] if P = {a0, a1, a2, . . . , an} and a = a0 < a1 < a2 < . . . < an = b. Suppose that P is a partition of [a, b]. In a manner similar to Example 8.1.2, P is used to construct rectangles that approximate the region under the graph of f (see Figure 8.7). As illustrated in the diagram, the points of P need not be evenly spaced. The area of the kth rectangle in Figure 8.7 is width× height = (ak − ak−1)× fk, where fk is the maximum value of the function f on the subinterval [ak, ak−1]. The upper Riemann sum SP(f) for f with respect to the partition P is defined by the formula SP(f) = n∑ k=1 (ak − ak−1)fk. (8.4) c©2020 School of Mathematics and Statistics, UNSW Sydney 8.1. AREA AND THE RIEMANN INTEGRAL 179 Hence the upper Riemann sum corresponds to the total area of the rectangles in Figure 8.4. Like- wise, the lower Riemann sum SP(f) for f with respect to the partition P is defined by the formula SP(f) = n∑ k=1 (ak − ak−1)fk, (8.5) where f k is the minimum value of f on the subinterval [ak, ak−1]. We are now ready to give a definition of the area under the graph of a function f . Definition 8.1.5 (Definition of the area under the graph of a function). Suppose that a function f is bounded on [a, b] and that f(x) ≥ 0 for all x in [a, b]. If there exists a unique real number A such that SP(f) ≤ A ≤ SP(f) for every partition P of [a, b], (8.6) then we say that A is the area under the graph of f from a to b. It may not seem obvious now but, as we shall see in Section 8.2, not every bounded function f has an area under its graph in the sense of Definition 8.1.5. The following definition gives a special name to those functions that do have a well-defined area under their graph. At this point, we lift the restriction that f(x) ≥ 0 for all x in [a, b]. Definition 8.1.6 (Definition of the Riemann integral). Suppose that a function f is bounded on [a, b]. If there exists a unique real number I such that SP(f) ≤ I ≤ SP(f) for every partition P of [a, b], (8.7) then we say that f is Riemann integrable on the interval [a, b]. If f is Riemann integrable, then the unique real number I satisfying (8.7) is called the definite integral of f from a to b and we write I = ∫ b a f(x) dx. The function f is called the integrand of the definite integral, while the points a and b are called the limits of the definite integral. Remark 8.1.7. The notation ∫ b a f(x) dx is due to Leibniz. It evolved from a slightly different way of writing down lower and upper Riemann sums. For example, SP(f) may be written as SP(f) = n∑ k=1 f(xk)∆xk, where ∆xk = ak − ak−1 and f attains its maximum value on [ak−1, ak] at the point xk. When taking a limit, as in (8.3), ∆xk was replaced with dx and the symbol ∑ was replaced with an elongated ‘S’ (‘S’ stands for ‘sum’). c©2020 School of Mathematics and Statistics, UNSW Sydney 180 CHAPTER 8. INTEGRATION f(x) x C a b (a) x f(x) 1 (b) f(x) x1 2 (c) f(x) x1 1 2 | (d) Figure 8.8: Diagrams for examples in Section 8.2. Remark 8.1.8. If f is Riemann integrable on [a, b], then the real number ∫ b a f(x) dx is the area under the graph of f from a to b, provided that f(x) ≥ 0 for all x in [a, b]. A geometric interpretation of the real number ∫ b a f(x) dx when f takes negative values will be discussed in Section 8.3. Remark 8.1.9. [X] If f is bounded but not continuous, then f may not attain a maximum or minimum value on the closed interval [ak, ak−1]. This technical difficulty can be overcome by defining the upper and lower Riemann sums in terms of suprema and infima rather than maxima and minima. The concept of a supremum and infimum is introduced only in higher Mathematics courses. Remark 8.1.10. One should think about why Definition 8.1.5 is a good definition for the area under the graph of a function. The following questions need to be answered: • If the area under the graph of f is well-defined by Definition 8.1.5, would it agree with any other reasonable definition for the area under the graph of f? • Is Definition 8.1.5 consistent with the axioms of area listed in Subsection 8.1.1? The answer to each of these questions is ‘yes,’ but we leave it to the reader to ponder why this might be so. Some of the results given over the next few sections do confirm that our definition of area under a graph is consistent with the axioms of area. 8.2 Integration using Riemann sums (Ref: SH10 §5.2) In this section we calculate the area under the graph of functions by computing Riemann sums. Our notation for the upper and lower Riemann sums follows that of equations (8.4) and (8.5). Example 8.2.1. Suppose that f(x) = C where C is a positive constant. Show that f is Riemann integrable on [a, b] and calculate the area under the graph of f from a to b (see Figure 8.8 (a)). Solution. Suppose that P is a partition of [a, b] and that P = {a0, a1, . . . , an}. Since f is constant, c©2020 School of Mathematics and Statistics, UNSW Sydney 8.2. INTEGRATION USING RIEMANN SUMS 181 x f(x) a b | | b x g(x) a b | | b b b bc Figure 8.9: Piecewise continuous functions fk = fk = C for every k between 1 and n. Therefore SP (f) = n∑ k=1 fk(ak − ak−1) = n∑ k=1 C(ak − ak−1) = C(a1 − a0 + a2 − a1 + a3 − a2 + · · ·+ an−1 − an−2 + an − an−1) = C(ak − a0) = C(b− a). The same calculation gives SP(f) = C(b− a). Since P is an arbitrary partition of [a, b], we have SP(f) = C(b− a) = SP for every partition P of [a, b]. Hence there is a unique number A satisfying (8.7), namely A = C(b− a). Therefore f is Riemann integrable, ∫ b a f(x) dx = C(b− a) and the area under the graph of f is C(b− a). Remark 8.2.2. The area under the graph of f is consistent with axiom (A5). There are many other kinds of functions which are Riemann integrable. One of the largest classes consists of those functions that are bounded and piecewise continuous. Definition 8.2.3. A function f : [a, b] → R is said to be piecewise continuous if it is continuous on [a, b] at all except perhaps a finite number of points. Figure 8.9 illustrates the graphs of two functions f and g that are piecewise continuous on [a, b]. The function f is bounded on [a, b] while the function g is not. According to the next theorem, the area of the shaded region under the graph of f is well-defined. Theorem 8.2.4. If f is bounded and piecewise continuous on [a, b] then f is integrable on [a, b]. c©2020 School of Mathematics and Statistics, UNSW Sydney 182 CHAPTER 8. INTEGRATION Since the proof of this theorem is difficult, we omit it. In practice, it often hard to prove from first principles that a function f is Riemann integrable. However, if we know in advance that f is integrable on [a, b] then it is much easier to calculate its definite integral from a to b. All one needs to do is show that lim n→∞SPn(f) = limn→∞SPn(f) for some sequence of partitions Pn of [a, b]. If I denotes this common limit, then∫ b a f(x) dx = I. Example 8.2.5. If f(x) = x2, find the area under the graph of f from 0 to 1 (see Figure 8.8 (b)). Solution. We first note that f is integrable on [0, 1] since it is bounded and continuous on [0, 1]. Suppose that Pn = {0, 1n , 2n , 3n , . . . , n−1n , nn .} Then SPn(f) = 1 3 + 1 2n + 1 6n2 and SPn(f) = 1 3 − 1 2n + 1 6n2 (see Example 8.1.2 for these calculations). Since lim n→∞SPn(f) = 1 3 = lim n→∞SPn(f), we conclude that ∫ 1 0 f(x) dx = 1 3 . So the area underneath the graph of f from 0 to 1 is 13 . Example 8.2.6. Suppose that f(x) = 3− x. Calculate ∫ 21 f(x) dx (see Figure 8.8 (c)). Our final example shows that some functions are not Riemann integrable. For such a function f , the area under the graph of f is not defined according to Riemann integration. Example 8.2.7. Suppose that f : [0, 1]→ R is defined by f(x) = { 2 if x ∈ Q 1 if x /∈ Q. (see Figure 8.8 (d)). Show that f is not Riemann integrable. Solution. Suppose that Pn is an arbitrary partition of [0, 1]. Then fk = 2 while fk = 1 for every k between 1 and n. Hence SPn(f) = 1 and SPn(f) = 2. (Exercise: show that this is the case by following a method similar to Example 8.2.1). Therefore there is no unique real number I satisfying (8.7). We conclude that f is not Riemann integrable. c©2020 School of Mathematics and Statistics, UNSW Sydney 8.3. THE RIEMANN INTEGRAL AND SIGNED AREA 183 x f(x) a b c Ω1 Ω2 (a) x f(x) a b c (b) Figure 8.10: Signed area and the definite integral Remark 8.2.8. A more sophisticated definition of the integral, known as the Lebesgue integral, was developed at the beginning of the twentieth century by the French mathematician Henri Lebesgue. Lebesgue’s definition of the integral allows us to integrate some functions which are not Riemann integrable. For example, if f is the function of Example 8.2.7 then • f is not Riemann integrable and the Riemann integral ∫ 1 0 f(x) dx is not defined; • f is Lebesgue integrable and the Lebesgue integral ∫ 1 0 f(x) dx is equal to 1. The Lebesgue integral is studied in some third year mathematics courses. 8.3 The Riemann integral and signed area (Ref: SH10 §5.5) The definite integral ∫ b a f(x) dx can only be interpreted as the area under the graph of f from a to b if f(x) ≥ 0 for all x in [a, b]. How should we interpret the definite integral if the integrand f has negative values on [a, b]? Consider the function f graphed in Figure 8.10 (a). The upper sum from a to b, illusrated in Figure 8.10 (b), approximates area(Ω1). However, since f(x) < 0 when b < x < c, each term in the upper sum from b to c will be negative. Hence the upper sum from b to c approximates −area(Ω2). By increasing the number of subdivisions of [a, c] and decreasing the size of each subdivison, these approximations get better and better. It follows that∫ c a f(x) dx = area(Ω1)− area(Ω2). We call this quantity the signed area under the graph of f from a to c. Thus there are two ways to calculate the (unsigned or absolute) area of the entire shaded region Ω of Figure 8.10 (a): c©2020 School of Mathematics and Statistics, UNSW Sydney 184 CHAPTER 8. INTEGRATION • area(Ω) = ∫ c a |f(x)| dx; or • area(Ω) = ∫ b a f(x) dx− ∫ c b f(x) dx. This kind of calculation will be illustrated in Section 8.6. 8.4 Basic properties of the Riemann integral (Ref: SH10 §5.8) The following proposition gives a list of some of the basic properties of the Riemann integral. Proposition 8.4.1 (Basic properties of the Riemann integral). Suppose that f and g are integrable functions over [a, b]. (i) If α and β are real numbers then αf + βg is integrable and ∫ b a (αf + βg)(x) dx = α ∫ b a f(x) dx+ β ∫ b a g(x) dx. (ii) If a < c < b then ∫ b a f(x) dx = ∫ c a f(x) dx+ ∫ b c f(x) dx. (iii) If f(x) ≥ 0 for all x in [a, b] then ∫ b a f(x) dx ≥ 0. (iv) If f(x) ≤ g(x) for all x in [a, b] then ∫ b a f(x) dx ≤ ∫ b a g(x) dx. (v) If m ≤ f(x) ≤M for all x in [a, b] then m(b− a) ≤ ∫ b a f(x) dx ≤M(b− a). (vi) If |f | is integrable on [a, b] then ∣∣∣∣ ∫ b a f(x) dx ∣∣∣∣ ≤ ∫ b a |f(x)| dx. Each of these properties can be interpreted geometrically (see Figure 8.11). For example, under the assumption that f(x) ≥ 0, property (iv) says that if the graph of f lies beneath the graph of g then the area under the graph of f will be less than the area under the graph of g. Except for (i), each of these properties is easy to prove. c©2020 School of Mathematics and Statistics, UNSW Sydney 8.4. BASIC PROPERTIES OF THE RIEMANN INTEGRAL 185 x f(x) a c b (a) Prop 8.4.1(ii) x y a b g f (b) Prop 8.4.1 (iv) x y a b M m f (c) Prop 8.4.1 (v) Figure 8.11: Geometric interpretation of Proposition 8.4.1. Proof. We omit the proof of (i) and (ii). (iii) If f(x) ≥ 0 for all x in [a, b], then each term in lower Riemann sum (8.5) is non-negative for any partition P of [a, b]. Hence 0 ≤ SP(f) ≤ ∫ b a f(x) dx by Definition 8.1.6. (iv) Suppose that h = g − f . Then h(x) ≥ 0 for all x in [a, b] and by applying (iii) and (i) we have 0 ≤ ∫ b a h(x) dx = ∫ b a ( g(x)− f(x)) dx = ∫ b a g(x) dx− ∫ b a f(x) dx. Rearranging the inequality gives the result. (v) By (iv) we have ∫ b a mdx ≤ ∫ b a f(x) dx ≤ ∫ b a M dx. By Example 8.2.1, the integral on the left is m(b − a) and the integral on the right is M(b − a), completing the proof. (vi) Now −|f(x)| ≤ f(x) ≤ |f(x)| ∀x ∈ [a, b], so ∫ b a −|f(x)| dx ≤ ∫ b a f(x) dx ≤ ∫ b a |f(x)| dx by (iv). By (i), ∫ b a −|f(x)| dx = − ∫ b a |f(x)| dx and so − ∫ b a |f(x)| dx ≤ ∫ b a f(x) dx ≤ ∫ b a |f(x)| dx which is equivalent to ∣∣∣∣ ∫ b a f(x) dx ∣∣∣∣ ≤ ∫ b a |f(x)| dx by a property of the absolute value function. c©2020 School of Mathematics and Statistics, UNSW Sydney 186 CHAPTER 8. INTEGRATION So far the definite integral ∫ b a f(x) dx has only been defined when a < b. We now extend this definition. Definition 8.4.2. Suppose that b < a and that f in integrable on [b, a]. Then we define ∫ b a f(x) dx to be the real number − ∫ a b f dx and we define ∫ a a f(x) dx to be 0. This definition gives a more general version of Proposition 8.4.1 (ii). We omit the proof. Proposition 8.4.3. Suppose that a, b and c are real numbers and that f is integrable over some interval containing a, b and c. Then∫ b a f(x) dx = ∫ c a f(x) dx+ ∫ b c f(x) dx. 8.5 The first fundamental theorem of calculus In this section we present a surprising theoretical result that indicates a strong connection between integration (calculating areas) and differentiation (calculating gradients of tangents). This result has enormous ramifications for the problem of calculating area. We begin by studying the area under the graph of a function. Suppose that f is a continuous function on the interval [a, b]. Since f is integrable on [a, b] (see Theorem 8.2.4) we can define the function F : [a, b]→ R by the formula F (x) = ∫ x a f(t) dt ∀x ∈ [a, b]. (8.8) For x in [a, b], the number F (x) represents the (signed) area captured underneath the graph of x on the interval [a, x] (see Figure 8.12 (a)). Thus F is an ‘area function’. (Note that the limit x in (8.8) is a different symbol from the ‘dummy variable’ t. It would be incorrect and confusing to write F (x) = ∫ x a f(x) dx for formula (8.8).) c©2020 School of Mathematics and Statistics, UNSW Sydney 8.5. THE FIRST FUNDAMENTAL THEOREM OF CALCULUS 187 y = f(t) F (x) t y | | | a x b (a) y = f(t) t y | | | | a x x+ h b (b) Figure 8.12: The area function F We investigate some of the properties of F . First, it is easy to see that if f(t) ≥ 0 for all t in [a, b] then the shaded area under f increases as x increases. That is, if f is a non-negative function then F is an increasing function. Second, we ask whether F continuous on [a, b] and differentiable on (a, b). Since differentiability on (a, b) implies continuity on (a, b), we address the question of differentiability first. Consider the difference quotient F (x+ h)− F (x) h when x ∈ (a, b) and h is small. Now, if h is a small positive number then Figure 8.12 (b) shows that F (x+ h)− F (x) = (area under f on the interval [x, x+ h]) ≈ f(x)× h, since f(t) ≈ f(x) for all t in [x, x+ h] by the continuity of f . Dividing both sides by h gives F (x+ h)− F (x) h ≈ f(x). As h gets smaller, this approximation gets better and so we expect that lim h→0+ F (x+ h)− F (x) h = f(x). A similar argument may be repeated when h is a small negative number to obtain the corresponding left-hand limit. Therefore it seems true that F ′(x) = lim h→0 F (x+ h)− F (x) h = f(x). This intuitive argument can be made rigorous (see the proof at the end of this section). Hence we conclude that F is continuous and differentable on (a, b) and that F ′(x) = f(x) for all x in (a, b). The fact that a continuous function f and its ‘area function’ F are related by the differential equation F ′(x) = f(x) has far-reaching consequences. This result is so important that it has come to be known as the fundamental theorem of calculus. c©2020 School of Mathematics and Statistics, UNSW Sydney 188 CHAPTER 8. INTEGRATION Theorem 8.5.1 (The first fundamental theorem of calculus). If f is continuous function defined on [a, b] then the function F : [a, b]→ R, defined by F (x) = ∫ x a f(t) dt, (8.9) is continuous on [a, b], differentiable on (a, b) and has derivative F ′ given by F ′(x) = f(x) for all x in (a, b). The first fundamental theorem of calculus is a deep mathematical result which relates differen- tiation to integration. From the statement of the theorem we note the following points. • The fact that F satisfies the equation F ′ = f on (a, b) says that F is an antiderivative of f on (a, b). Hence every continuous function f has an antiderivative F , given by formula (8.9). • Since any two antiderivatives of f differ by a constant (see Theorem 5.9.4), every antiderivative of f is of the form F + constant, where F is given by the integral formula (8.9). Hence the process of integration and of antidifferentiation are essentially the same. (This is surprising from a geometric point of view, where integration is used to calculate areas while differentiation is used to calculate gradients of tangents.) • The above point suggests that integration and differentiation are inverse processes. If one takes a function f , integrates it and differentiates the result, then one obtains f again. That is, differentiation undoes what integration does to f . This is precisely expressed by the formula f(x) = d dx (∫ x a f(t) dt ) . Whether or not the converse statement is true (that is, that integration undoes what differ- entiation does to a function f) will be discussed in the next section. Thus the first fundamental theorem of calculus suggests that there may be a different method for calculating the area under the graph of f ; rather than integrating via limits of Riemann sums, try integrating via antidifferentiation. This approach is investigated in the next section. We end this section by presenting a proof of the first fundamental theorem of calculus. The proof of the differentiability of F is based on the intuitive argument given immediately prior to the statement of Theorem 8.5.1. Proof of Theorem 8.5.1. We begin by showing that F is differentiable on (a, b). Suppose that x ∈ (a, b) and consider a small positive number h. By Proposition 8.4.1 (ii) one can show that F (x+ h)− F (x) = ∫ x+h x f(t) dt. (8.10) c©2020 School of Mathematics and Statistics, UNSW Sydney 8.5. THE FIRST FUNDAMENTAL THEOREM OF CALCULUS 189 x y x x+ h Mh mh y = f(x) Figure 8.13: Diagram for the proof of Theorem 8.5.1. Since f is continuous on [a, b], it attains a minimum valuemh and a maximum valueMh on [x, x+h]; that is, mh ≤ f(t) ≤Mh ∀t ∈ [x, x+ h]. By Proposition 8.4.1 (v) (see also Figure 8.13), mhh ≤ ∫ x+h x f(t) dt ≤Mhh and hence mhh ≤ F (x+ h)− F (x) ≤Mhh. by (8.10). Now h > 0 so mh ≤ F (x+ h)− F (x) h ≤Mh. Since f is continuous on [x, x+ h], lim h→0+ mh = f(x) = lim h→0+ Mh and hence lim h→0+ F (x+ h)− F (x) h = f(x) by the pinching theorem (Theorem 2.5.9). In a similar manner, one can show that lim h→0− F (x+ h)− F (x) h = f(x). Hence F is differentiable on (a, b) and F ′(x) = f(x) for all x in (a, b). To complete the proof, we need to show that F is continuous on [a, b]. Since F is differentiable on (a, b), it is also continuous on (a, b) by Theorem 4.5.1. So it suffices to show that F is continuous at the endpoints of [a, b]. c©2020 School of Mathematics and Statistics, UNSW Sydney 190 CHAPTER 8. INTEGRATION We will show that F is continuous at a. Since f is continuous on [a, b] there is a positive constant M such that |f(x)| ≤ M for all x in [a, b] (see Corollary 3.4.9). Suppose that x ∈ (a, b). By using the properties of Proposition 8.4.1, |F (x)− F (a)| = ∣∣∣∣ ∫ x a f(t) dt− ∫ a a f(t) dt ∣∣∣∣ = ∣∣∣∣ ∫ x a f(t) dt ∣∣∣∣ (Definition 8.4.2) ≤ ∫ x a |f(t)| dt (Proposition 8.4.1 (vi)) ≤ ∫ x a M dt (f is bounded) =M |x− a| (Example 8.2.1). Hence |F (x) − F (a)| ≤M |x− a| → 0 as x→ a+, or in other words, lim x→a+ F (x) = F (a). So F is continuous at a. The proof of continuity at the endpoint b is similar. 8.6 The second fundamental theorem of calculus (Ref: SH10 §5.4) The first fundamental theorem of calculus says that the processes of integration and antidifferen- tiation are essentially the same. The second fundamental theorem of calculus gives a fast way of calculating integrals by exploiting this similarity. Theorem 8.6.1 (The second fundamental theorem of calculus). Suppose that f is a continuous function on [a, b]. If F is an antiderivative of f on [a, b] then ∫ b a f(t) dt = F (b)− F (a). (8.11) The second fundamental theorem of calculus is an astonishing result. To calculate the signed area under the graph of a continuous function f , all one needs to know is the value of any an- tiderivative of f at the endpoints a and b. To illustrate the power of this result, compare the lengthy calculation of Riemann sums in Example 8.1.2 with the example below. Example 8.6.2. Suppose that f(x) = x2 for all real numbers x. Calculate the area of the region bounded by the x-axis, the graph of f and lines x = 0 and x = 1. Solution. Essentially we are being asked to evaluate ∫ 1 0 f(t) dt c©2020 School of Mathematics and Statistics, UNSW Sydney 8.6. THE SECOND FUNDAMENTAL THEOREM OF CALCULUS 191 where f(t) = t2 (see Figure 8.4 (a)). An antiderivative F of f is given by F (t) = t 3 3 . Hence, by the second fundamental theorem of calculus, ∫ 1 0 f(t) dt = F (1) − F (0) = 1 3 3 − 0 3 3 = 1 3 . So the area of the region is 1/3. Remark 8.6.3. The expression F (b) − F (a) appearing in (8.11) is used so frequently that it is often abbreviated to F (x) ∣∣∣b a or [ F (x) ]b a . Thus the calculation of Example 8.6.2 may be written as ∫ 1 0 t2 dt = [ t3 3 ]1 0 = 1 3 − 0 = 1 3 . When calculating areas, it is important to bear in mind that the integral in (8.11) represents the signed area. Example 8.6.4. Calculate the area of the region bounded by the x-axis, the lines x = 0 and x = 4 and the function f given by f(x) = (x− 2)(x− 4). y = f(x) x y 2 4 8 Solution. The above sketch shows that part of the region lies beneath the x-axis. So the area we c©2020 School of Mathematics and Statistics, UNSW Sydney 192 CHAPTER 8. INTEGRATION seek corresponds to the integral∫ 4 0 |f(x)| dx = ∫ 2 0 f(x) dx+ ∫ 4 2 (−f(x)) dx = ∫ 2 0 x2 − 6x+ 8 dx− ∫ 4 2 x2 − 6x+ 8 dx = [ x3 3 − 3x2 + 8x ]2 0 − [ x3 3 − 3x2 + 8x ]4 2 = [( 8 3 − 12 + 16 ) − 0 ] − [( 64 3 − 48 + 32 ) − ( 8 3 − 12 + 16 )] = 8. So the area is 8 square units. Note that this is a different problem to: Evaluate ∫ 4 0 f(x) dx, whose solution is: ∫ 4 0 f(x) dx = ∫ 4 0 x2 − 6x+ 8 dx = [ x3 3 − 3x2 + 8x ]4 0 = 16 3 . We now present a proof of the second fundamental theorem of calculus. The tools used in the proof are the first fundamental theorem of calculus and a corollary of the mean value theorem. Proof of Theorem 8.6.1. Suppose that f is continuous on [a, b] and that F is an antiderivative of f on [a, b]. Define a function G : [a, b]→ R by the formula G(x) = ∫ x a f(t) dt ∀x ∈ [a, b]. By the first fundamental theorem of calculus (see Theorem 8.5.1), G is continuous on [a, b] and G′(x) = f(x) for all x in (a, b). Hence G is also an antiderivative of f on [a, b]. By Theorem 5.9.4, there is a constant C such that G(x) = F (x) + C (8.12) for all x in [a, b]. Note that G(a) = ∫ a a f(t) dt = 0. So if x = a then equation (8.12) becomes 0 = F (a) + C. c©2020 School of Mathematics and Statistics, UNSW Sydney 8.6. THE SECOND FUNDAMENTAL THEOREM OF CALCULUS 193 Hence C = −F (a). Therefore ∫ b a f(t) dt = G(b) = F (b) + C (by (8.12)) = F (b)− F (a) as required. We return to the question of whether integration and differentiation are inverse processes. Suppose that f is continuous on [a, b]. The first fundamental theorem of calculus states that if one integrates a function f and differentiates the result, then one obtains f again. This is expressed by the formula f(x) = d dx (∫ x a f(t) dt ) ∀x ∈ (a, b). To be able to say that integration and differentiation are inverses operations of each other, we must ask whether the converse is true. That is, if one differentiates a function f and integrates the result, then does one obtain f again? In precise terms, we would like the formula f(x) = ∫ x a f ′(t) dt ∀x ∈ (a, b) (8.13) to hold. Unfortunately, this converse is false. The following corollary to the second fundamental theorem of calculus says that one obtains f again only when f(a) = 0. Otherwise formula (8.13) is ‘out by a constant.’ Corollary 8.6.5. Suppose that f is continuous on [a, b] and has a continuous derivative on (a, b). Then ∫ x a f ′(t) dt = f(x)− f(a) for all x in [a, b] Proof. Apply the second fundamental theorem of calculus to the function f ′ on the interval [a, x]. The second fundamental theorem of calculus allows a fruitful interaction between Riemann sums and antidifferentiation. Example 8.6.6. Suppose that f(x) = 1 1 + x2 . By considering the lower Riemann sum of f with respect to the partition {0, 1n , 2n , . . . , n−1n , nn} of [0, 1], show that π 4 = lim n→∞ n∑ k=1 n n2 + k2 = lim n→∞ ( n n2 + 12 + n n2 + 22 + n n2 + 32 + · · ·+ n n2 + n2 ) . c©2020 School of Mathematics and Statistics, UNSW Sydney 194 CHAPTER 8. INTEGRATION Solution. Let Pn denote the above partition of [0, 1]. Since f is a decreasing function on [0, 1], the lower Riemann sum is given by SPn(f) = n∑ k=1 1 n 1 1 + (k/n)2 = n∑ k=1 n n2 + k2 . Since f is bounded and continuous, it is integrable on [0, 1] and hence lim n→∞SPn(f) = ∫ 1 0 1 1 + x2 dx (see Remark 8.6.7). But by the second fundamental theorem of calculus,∫ 1 0 1 1 + x2 dx = [ tan−1(x) ]1 0 = π 4 . Hence π 4 = lim n→∞ n∑ k=1 n n2 + k2 as desired. Remark 8.6.7. The solution to Example 8.6.6 uses the following result. Suppose that f is Riemann integrable on [a, b] and that {Pn}∞n=1 is a sequence of partitions of [a, b]. Let sn denote the maximum size of the subintervals generated by Pn. If lim n→0 sn = 0 then lim n→∞SPn(f) = ∫ b a f(x) dx = lim n→∞SPn(f). 8.7 Indefinite integrals (Ref: SH10 §5.6) The fundamental theorems of calculus say that integration and antidifferentiation are, in a certain sense, the same process. In particular, if a F is an antiderivative of f then∫ x a f(t) dt = F (x) + C for some suitable constant C. If we have no particular interest in the interval [a, x] and merely want to indicate that F is an antiderivative of f , then we write∫ f(t) dt = F (t) + C. (8.14) An integral expressed in this way, without limits, is called an indefinite integral. The constant C in (8.14) is called the constant of integration. The table of antiderivatives given in Section 5.9 is equivalent to the following table of indefinite integrals. c©2020 School of Mathematics and Statistics, UNSW Sydney 8.7. INDEFINITE INTEGRALS 195 Indefinite integrals∫ xr dx = 1 r + 1 xr+1 + C, where r is a rational number and r 6= −1∫ sinx dx = − cos x+ C∫ cos x dx = sinx+ C∫ eax dx = 1 a eax + C∫ f ′(x) f(x) dx = ln |f(x)|+ C Since differentiation and indefinite integration (or antidifferentiation) are inverse processes, they have many analogous properties. For example, differentiation behaves linearly, which means that (f + g)′(x) = f ′(x) + g′(x) and (αf)′(x) = α× f ′(x) for every real number α and every differentiable function f and g. The next proposition says that indefinite integration is also behaves linearly. (We have already shown in Proposition 8.4.1 that definite integration is linear.) Proposition 8.7.1. Suppose that f and g are integrable functions and that α is a real constant. Then ∫ (f + g)(x) dx = ∫ f(x) dx+ ∫ g(x) dx (8.15) and ∫ (αf)(x) dx = α ∫ f(x) dx. (8.16) Proof. We prove (8.15) only and leave the proof of 8.16 as an exercise. Suppose that F , G and H are antiderivatives of f , g and f + g respectively. Then H ′(x) = (f + g)(x) = f(x) + g(x) = F ′(x) +G′(x) by the linearity of differentiation. By expressing the equation H ′ = F ′ +G′ in terms of indefinite integrals we obtain (8.15) as desired. Example 8.7.2. Find ∫ (10x4 + 8 sin x− 9ex) dx. Proof. By Proposition 8.7.1,∫ (10x4 + 8 sinx− 9ex) dx = 10 ∫ x4 dx+ 8 ∫ sinx dx− 9 ∫ ex dx = 10 ( 1 5 x5 + C1 ) + 8 (− cos x+ C2)− 9 (ex + C3) = 2x5 − 8 cos x− 9ex + C c©2020 School of Mathematics and Statistics, UNSW Sydney 196 CHAPTER 8. INTEGRATION where C1, C2 and C3 are constants of integration and C = 10C1 + 8C2 − 9C3. Remark 8.7.3. It is standard practice to combine of all constants of integration and simply write ‘+C’. Thus the calculation in Example 8.7.2 may be written as∫ (10x4 + 8 sin x− 9ex) dx = 10 ∫ x4 dx+ 8 ∫ sinx dx− 9 ∫ ex dx = 10× 1 5 x5 + 8(− cos x)− 9ex + C = 2x5 − 8 cos x− 9ex + C. 8.8 Integration by substitution (Ref: SH10 §5.7) When differentiating the composition of two functions, we apply the chain rule. By reversing this process with integration, we obtain a technique known as integration by substitution. The basic idea is the following. Suppose that F is an antiderivative for f . Then∫ f(g(x)).g′(x) dx = F (g(x)) + C, (8.17) as can be easily verified by differentiating the right-hand side via the chain rule. On the other hand, ∫ f(u) du = F (u) +C. (8.18) If we make the substitution u = g(x) then the right-hand sides of (8.17) and (8.18) are the same. This suggests the substitution du = g′(x) dx, so that the left-hand side of (8.17) is transformed into the left-hand side of (8.18). These mechanical substitutions can be used to transform the complicated integral into a simpler one. Example 8.8.1. Calculate the indefinite integral∫ (2x− 3) cos(x2 − 3x+ 4) dx by the method of substitution. Solution. We begin with the substitution u = x2 − 3x+ 4. Since du dx = 2x− 3, we also make the substitution du = (2x− 3) dx. c©2020 School of Mathematics and Statistics, UNSW Sydney 8.8. INTEGRATION BY SUBSTITUTION 197 Then ∫ (2x− 3) cos(x2 − 3x+ 4) dx = ∫ cos(u) du = sinu+ C = sin(x2 − 3x+ 4) + C. We can verify our answer by differentiating sin(x2 − 3x+ 4) via the chain rule. A similar technique can be used to evaluate definite integrals. Example 8.8.2. Evaluate the definite integral∫ 2 0 3x2 √ 1 + x3 dx. Solution. If u = 1 + x3 then dudx = 3x 2. Hence we make the following substitutions: u = 1 + x3 du = 3x2 dx u = 1 (when x = 0) u = 9 (when x = 2). Thus ∫ 2 0 3x2 √ 1 + x3 dx = ∫ 9 1 √ u du = [ 2 3 u3/2 ]9 1 = 2 3 (93/2 − 13/2) = 52 3 . In general, the reversal of the chain rule via integration gives rise to the change of variables formula ∫ b a f(g(x))g′(x) dx = ∫ g(b) g(a) f(u) du, (8.19) The substitutions u = g(x), du = g′(x) dx, are only to be understood within this context. Precise conditions under which formula (8.19) holds is given by the following theorem. Theorem 8.8.3 (Change of variables formula). Suppose that g is a differentiable function such that g′ is continuous on [a, b]. If f is continuous on any interval I containing g(a) and g(b) then the change of variables formula (8.19) holds. c©2020 School of Mathematics and Statistics, UNSW Sydney 198 CHAPTER 8. INTEGRATION Proof. Since f is continuous on I it has an antiderivative F : I → R by the first fundamental theorem of calculus. By the chain rule (Theorem 4.2.2), (F ◦ g)′(x) = F ′(g(x))g′(x) = f(g(x))g′(x). So by two applications of the second fundamental theorem of calculus,∫ b a f(g(x))g′(x) dx = (F ◦ g)(b)− (F ◦ g)(a) = F (g(b)) − F (g(a)) = ∫ g(b) g(a) f(u) du, completing the proof. Finding a fruitful substitution is not always easy (and sometimes not even possible). In principle, one looks for a function g in the integrand whose derivative g′ is also in the integrand. Once such a function g is identified, try the substitution u = g(x). As illustrated below, it may be necessary to manipulate the integrand to implement this strategy. Example 8.8.4. Evaluate ∫ e √ x √ x dx. Solution. Note that d dx (√ x ) = 1 2 √ x and so if g(x) = √ x then g′(x) appears in the integrand (up to the constant factor 1/2). So the substitution we use is u = √ x, du = 1 2 √ x dx. Hence ∫ e √ x √ x dx = 2 ∫ e √ x 2 √ x dx = 2 ∫ eu du = 2eu + C = 2e √ x + C. There are occasions when we make a substitution of the form x = g(u) rather than u = g(x). Example 8.8.5. Evaluate the integral I, where I = ∫ 9 4 dx 2 + √ x . c©2020 School of Mathematics and Statistics, UNSW Sydney 8.8. INTEGRATION BY SUBSTITUTION 199 x f(x) −a a | | Figure 8.14: Integration with an odd function f over [−a, a] Solution. If the integral were ∫ 9 4 dx 2 + x , then it would be easy to evaluate. The difficulty with I clearly lies with the square root appearing in the integrand. To remove the square root, we use the substitution x = u2. This gives x = u2 dx = 2u du u = 2 when x = 4 u = 3 when x = 9. (Note that we are implicitly assuming that u > 0. It would be incorrect to take the limits in the variable u to be −2 and 3, for example.) Hence I = ∫ 3 2 2u du 2 + u = 2 ∫ 3 2 u 2 + u du = 2 ∫ 3 2 2 + u− 2 2 + u du = 2 ∫ 3 2 1− 2 2 + u du = 2 [ u− 2 ln |2 + u| ]3 2 = 2 + 4 ln ( 4 5 ) . If an integrand possesses symmetry, then the corresponding definite integral may be easy to evaluate. For example, if f is an odd function and we integrate f over a balanced interval [−a, a], then the ‘negative area’ cancels out the ‘positive area’ and consequently the integral is zero (see Figure 8.14). This result is proved using integration by substitution. Proposition 8.8.6. Suppose that f is a continuous function and a is a real number. c©2020 School of Mathematics and Statistics, UNSW Sydney 200 CHAPTER 8. INTEGRATION (i) If f is even then ∫ a −a f(x) dx = 2 ∫ a 0 f(x) dx. (ii) If f is odd then ∫ a −a f(x) dx = 0. (iii) If f is periodic with period T then∫ a+T a f(x) dx = ∫ T 0 f(x) dx. Proof. We prove (ii) only and leave the proofs of (i) and (iii) as an exercise. Suppose that f is an odd function. By breaking up the integral and using a substitution we have∫ a −a f(x) dx = ∫ 0 −a f(x) dx+ ∫ a 0 f(x) dx = − ∫ 0 a f(−u) du+ ∫ a 0 f(x) dx (substituting u = −x) = ∫ a 0 f(−u) du+ ∫ a 0 f(x) dx = − ∫ a 0 f(u) du+ ∫ a 0 f(x) dx (since f(−u) = f(u)) = 0, completing the proof of (ii). Example 8.8.7. Find ∫ pi 17 +2π pi 17 sin5 x dx. Solution. If f(x) = sin5 x then f is both odd and periodic with period 2π. Hence ∫ pi 17 +2π pi 17 sin5 x dx = ∫ 2π 0 sin5 x dx (by Proposition 8.8.6 (iii) when a = π17 ) = ∫ π −π sin5 x dx (by Proposition 8.8.6 (iii) when a = −π) = 0 (by Proposition 8.8.6 (ii)). 8.9 Integration by parts (Ref: SH10 §8.2) c©2020 School of Mathematics and Statistics, UNSW Sydney 8.9. INTEGRATION BY PARTS 201 Suppose that u and v are two functions of a variable x. The product rule for differentiation gives d dx (uv) = v du dx + u dv dx . By integrating both sides with respect to x we obtain uv = ∫ v du dx dx+ ∫ u dv dx dx. Rearranging this equation gives the integration by parts formula∫ u dv dx dx = uv − ∫ v du dx dx. (8.20) For definite integrals the formula becomes ∫ b a u dv dx dx = [ uv ]b a − ∫ b a v du dx dx. (8.21) by the second fundamental theorem of calculus. Integration by parts allows one to evaluate the integral of a product of two functions. Its success depends on choosing u and v such that∫ b a v du dx dx is easier to calculate than ∫ b a u dv dx dx. Once u and v are chosen, simply apply formula (8.20) or (8.21). Remark 8.9.1. The shorthand version∫ uv′ = uv − ∫ vu′ of formulae (8.20) and (8.21) is easier to remember. Example 8.9.2. Evaluate the integral ∫ xe2x dx. Solution. We set u = x v = 12e 2x u′ = 1 v′ = e2x. Then integration by parts (see formula (8.20)) gives∫ xe2x dx = 12xe 2x − ∫ 1 2e 2x × 1 dx = 12xe 2x − 14e2x +C, completing the solution. c©2020 School of Mathematics and Statistics, UNSW Sydney 202 CHAPTER 8. INTEGRATION (Note that if we had instead set u = e2x v = 12x 2 u′ = 2e2x v′ = x. then integration by parts gives ∫ xe2x dx = 12x 2e2x − ∫ x2e2x dx and the integral on the right-hand side is harder to evaluate than the one we started with.) Sometimes integration by parts must be applied several times to evaluate the integral. Example 8.9.3. Find the integral I, where I = ∫ π 0 x2 sinx dx. Example 8.9.4. Evaluate the indefinite integral I, where I = ∫ ex sinx dx Integration by parts can also be used to evaluate some integrals of the form ∫ f(x) dx. The trick is to rewrite the integral as ∫ f(x)× 1 dx. Example 8.9.5. Evaluate the integral ∫ cos−1 x dx. 8.10 Improper integrals (Ref: SH10 §11.7) So far we have defined definite integrals for bounded functions over finite intervals. In this section we look at integrals over infinite intervals. Such integrals have many applications in probability, statistics, physics and engineering. To give meaning to the improper integral ,∫ ∞ a f(x) dx, we examine the behaviour of ∫ R a f(x) dx as R→∞, as illustrated below. x f(x) a R | | c©2020 School of Mathematics and Statistics, UNSW Sydney 8.10. IMPROPER INTEGRALS 203 Definition 8.10.1. (a) Suppose that there is a real number L such that∫ R a f(x) dx→ L as R → ∞. Then f is said to be integrable over [a,∞) and the integral∫∞ a f(x) dx is said to be convergent. Morever, we write∫ ∞ a f(x) dx = L. (b) Suppose that ∫ R a f(x) dx does not have a limit as R → ∞. Then f we say that f not integrable over (a,∞) and we say that the integral ∫∞a f(x) dx is divergent. One can write down a similar definition for improper integrals of the form ∫ b −∞ f(x) dx. Example 8.10.2. Evaluate the following improper integrals or show that they diverge: (a) ∫ ∞ 0 1 x2 + 1 dx; (b) ∫ ∞ 1 1√ x dx; (c) ∫ 0 −∞ e2x dx. Solution. (a) If R is a real number then∫ R 0 1 x2 + 1 dx = [ tan−1 ]R 0 = tan−1R− 0 → π 2 as R→∞. Hence the improper integral converges and∫ ∞ 0 1 x2 + 1 dx = π 2 . (b) If R is a real number then ∫ R 1 x−1/2 dx = [ 2x1/2 ]R 1 = 2 √ R− 2 →∞ as R→∞. Hence the improper integral diverges. c©2020 School of Mathematics and Statistics, UNSW Sydney 204 CHAPTER 8. INTEGRATION We now consider improper integrals whose interval of integration is the entire real line. Definition 8.10.3. We say f is integrable over (−∞,∞) if f is integrable over both (−∞, 0) and (0,∞). In this case we write∫ ∞ −∞ f(x) dx = ∫ 0 −∞ f(x) dx+ ∫ ∞ 0 f(x) dx. If f is not integrable on either of the intervals (−∞, 0) or (0,∞), then we say that the improper integral ∫ ∞ −∞ f(x) dx diverges. The following example shows that the improper integral ∫ ∞ −∞ f(x) dx may diverge even if lim R→∞ ∫ R −R f(x) dx exists. Example 8.10.4. Consider the improper integral∫ ∞ −∞ x dx. If R is a positive number then ∫ R −R x dx = 0 since the function f , given by f(x) = x, is odd (see Proposition 8.8.6). Hence lim R→∞ ∫ R −R x dx = 0. However ∫ R 0 x dx = [ 1 2x 2 ]R 0 = R2 2 →∞ as R→∞. Hence ∫ ∞ 0 x dx diverges, and by Definition 8.10.3 ∫ ∞ −∞ x dx diverges also. The following proposition determines the convergence or divergence of improper integrals whose integrand is a power of x. Such integrals will be used frequently in Section 8.11 c©2020 School of Mathematics and Statistics, UNSW Sydney 8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 205 Proposition 8.10.5 (Convergence and divergence of p-integrals). The improper integral∫ ∞ 1 1 xp dx is convergent if p > 1 and divergent if p ≤ 1. Proof. If p 6= 1 then ∫ R 1 x−p dx = [ x1−p 1− p ]R 1 = R1−p − 1 1− p → { 1 p−1 when 1− p < 0 ∞ when 1− p > 0 as R→∞. Hence the integral converges when 1− p < 0 (that is, when p > 1) and diverges when 1− p > 0 (that is, when p < 1). In the case when p = 1, we have ∫ R 1 1 x dx = [ lnx ]R 1 = lnR− ln 1→∞ as R→∞. Hence the integral diverges when p = 1. Remark 8.10.6. While you may be familiar with the fact that the ln function is unbounded, the proof that lnR→∞ as R→∞ requires the use of Riemann sums and is given later in §9.2. 8.11 Comparison tests for improper integrals (Ref: SH10 §11.7) Some of the most important improper integrals, such as∫ ∞ 0 e−x 2 dx have integrands without any antiderivative in the elementary functions. In these cases, determining convergence via a computation like ∫ ∞ a f(x) dx = lim R→∞ ∫ R a f(x) dx = lim R→∞ F (R)− F (a), (where F is an antiderivative of f) is not a viable strategy. Instead, we compare the integral of f with another integral ∫∞ a g(x) dx whose behaviour is known. The basic idea is as follows. Suppose that 0 ≤ f(x) ≤ g(x) whenever x > a, as illustrated below. c©2020 School of Mathematics and Statistics, UNSW Sydney 206 CHAPTER 8. INTEGRATION g f x y a Obviously the area under the graph of g is greater than that under the graph of f . Hence if the area under g is finite then the area under f is also finite. If the area under f is infinite then so too is the area under g. The next theorem expresses this conclusion in terms of improper integrals. Theorem 8.11.1 (The comparison test). Suppose that f and g are integrable functions and that 0 ≤ f(x) ≤ g(x) whenever x > a. (i) If ∫ ∞ a g(x) dx converges then ∫ ∞ a f(x) dx converges. (ii) If ∫ ∞ a f(x) dx diverges then ∫ ∞ a g(x) dx diverges. Proof. We begin by proving (i). Suppose that f and g are integrable functions, that 0 ≤ f(x) ≤ g(x) for all x in [a,∞) and that ∫∞0 g(x) dx converges. Then 0 ≤ ∫ R a f(x) dx ≤ ∫ R a g(x) dx whenever R > 0 by Proposition 8.4.1. Since the integral∫ R a f(x) dx increases as R increases and is bounded above by ∫∞ 0 g(x) dx, it has a limit as R → ∞. Hence∫∞ a f(x) dx is also convergent. The proof of (ii) is left as an exercise. When applying the comparison test, we often compare an improper integral I with∫ ∞ 1 1 xp dx, since the behaviour of this second integral is known (see Proposition 8.10.5). The value of p is chosen by analysing the dominant terms appearing in the integrand of I. The next example illustrates this procedure. Example 8.11.2. Determine whether or not the following improper integrals converge. c©2020 School of Mathematics and Statistics, UNSW Sydney 8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 207 (a) ∫ ∞ 1 1 x3 + 1 dx (b) ∫ ∞ 2 x+ 2 x3/2 − 1 dx. Solution. (a) When x is large, the dominant term in the denominator of 1 x3 + 1 is x3. So intuitively, 1 x3 + 1 ≈ 1 x3 for large x. Consequently we aim to compare∫ ∞ 1 1 x3 + 1 dx with ∫ ∞ 1 1 x3 dx. To make this comparison rigorous, we seek an appropriate inequality. Note that x3 + 1 > x3 for all x > 1 and so 1 x3 + 1 < 1 x3 for all x > 1. Since ∫ ∞ 1 1 x3 dx converges (see Proposition 8.10.5), ∫ ∞ 1 1 x3 + 1 dx also converges by the comparison test. There are many examples where the ‘dominant term analysis’ is straightforward but it is difficult to obtain an appropriate inequality for successful use of the comparison test. This difficulty can sometimes be overcome by multiplying one of the integrands by a ‘fudge factor.’ Example 8.11.3. Determine whether or not the the improper integral∫ ∞ 2 1 x3/2 − 1 dx. converges. Solution. Dominant term analysis suggests that 1 x3/2 − 1 ≈ 1 x3/2 when x is large. So we compare∫ ∞ 2 1 x3/2 − 1 dx with ∫ ∞ 2 1 x3/2 dx. c©2020 School of Mathematics and Statistics, UNSW Sydney 208 CHAPTER 8. INTEGRATION Now the integral to the right converges (by Proposition 8.10.5) but it is clear that 1 x3/2 − 1 > 1 x3/2 when x > 2, so we cannot immediately apply the comparison test. However, by multiplying the integrand on the right with the ‘fudge factor’ 2, we see that 1 x3/2 − 1 < 2 x3/2 (8.22) when x > 2. (To see why, observe that when x > 2, x3/2 − 1 > x3/2 − 12x3/2 = 12x3/2 whence (8.22) follows.) Since ∫ ∞ 2 2 x3/2 dx converges, ∫ ∞ 2 1 x3/2 − 1 dx. must also. Later we give a method for determining the convergence of this integral that does not rely on the use of fudge factors. Dominant term analysis is not always straightforward, and at times may not even be useful. One must develop an intuition of how functions decay at infinity and of what comparisons to use. Example 8.11.4. Determine whether or not the following improper integrals converge. (a) ∫ ∞ 1 e− √ x dx (b) ∫ ∞ 1 sinx+ 2√ x+ 1 dx. Solution. (a) Dominant term analysis does not apply since there is only one term appearing in the integrand. Instead, we recognise that the decay of the exponential e− √ x is very rapid; it is certainly c©2020 School of Mathematics and Statistics, UNSW Sydney 8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 209 much faster than the decay of 1x2 . This can be verified by the following limit calculation: lim x→∞ e− √ x 1 x2 = lim x→∞ x2 ex 1/2 = lim x→∞ 2x 1 2x −1/2ex1/2 (by l’Hoˆpital’s rule) = lim x→∞ 4x3/2 ex1/2 = lim x→∞ 4.32x 1/2 1 2x −1/2ex1/2 (by l’Hoˆpital’s rule) = lim x→∞ 12x ex 1/2 ... (two more applications of l’Hoˆpital’s rule) = lim x→∞ 24 ex 1/2 = 0. Hence we conclude that e− √ x < 1 x2 whenever x is sufficiently large. Since ∫ ∞ 1 1 x2 dx converges, ∫ ∞ 1 e− √ x dx also converges by the comparison test. (b) While the numerator of sinx+ 2√ x+ 1 has no dominant term, intuitively we see that, since the numerator is bounded and the dominant term of the denominator is √ x, we should compare the integrand with 1√ x . Since ∫ ∞ 1 1√ x dx diverges, we seek an inequality of the form f(x) ≤ sinx+ 2√ x+ 1 where f(x) is 1/ √ x with an appropriate fudge factor. c©2020 School of Mathematics and Statistics, UNSW Sydney 210 CHAPTER 8. INTEGRATION Now, −1 ≤ sinx, and so sinx+ 2√ x+ 1 ≥ 1√ x+ 1 ≥ 1 2 √ x for all x ≥ 1. Since ∫ ∞ 1 1 2 √ x dx diverges (see Proposition 8.10.5), ∫ ∞ 1 sinx+ 2√ x+ 1 dx also diverges by the comparison test. The next theorem provides an alternative approach for constructing a comparison of integrals. It is particularly useful if dominant term analysis is straightforward but one wants to avoid the use of fudge factors. Theorem 8.11.5 (The limit form of the comparison test). Suppose that f and g are non-negative and bounded on [a,∞). If lim x→∞ f(x) g(x) = L and 0 < L <∞ then either∫ ∞ a f(x) dx and ∫ ∞ a g(x) dx both converge or ∫ ∞ a f(x) dx and ∫ ∞ a g(x) dx both diverge. The hypothesis lim x→∞ f(x) g(x) = L indicates that f and g exhibit similar behaviour at infinity. The proof of the theorem will be given at the end of this section. Example 8.11.6. Determine whether or not the following improper integrals converge. (a) ∫ ∞ 1 1√ 1 + x+ x2 dx (b) ∫ ∞ 2 x2 + 3x 5x4 − 2 dx. Solution. (a) Suppose that f(x) = 1√ 1 + x+ x2 . We need to find a suitable function g to compare f with. As x gets large, the dominant term under the square root is x2. Therefore it seems that 1√ 1 + x+ x2 ≈ 1√ x2 = 1 x c©2020 School of Mathematics and Statistics, UNSW Sydney 8.11. COMPARISON TESTS FOR IMPROPER INTEGRALS 211 when x is large. So define g by g(x) = 1x . The following calculation verifies that our choice of g is suitable: lim x→∞ f(x) g(x) = lim x→∞ x√ 1 + x+ x2 = lim x→∞ √ x2 1 + x+ x2 = lim x→∞ √ 1 1 x2 + 1x + 1 = 1. Since this limit is a positive real number, we can apply the limit form of the comparison test. By Proposition 8.10.5, ∫∞ 1 g(x) dx diverges. Hence ∫∞ 1 f(x) dx also diverges. Remark 8.11.7. As a general rule, students will find the limit form of the comparison test easier to use than the inequality form of the comparison test when the integrand is a rational function, or is a ratio of roots of polynomials. Proof of Theorem 8.11.1 [X] . Suppose that f and g are non-negative and bounded on [a,∞). (It follows, for later use, that if a < M < R and R is increasing then the integrals∫ R M f(x) dx and ∫ R M g(x) dx, are also increasing.) Suppose also that lim x→∞ f(x) g(x) = L where 0 < L < ∞. Choose a positive number ǫ such that 0 < ǫ < L. By Definition 2.2.1, there is a positive real number M such that ∣∣∣∣f(x)g(x) − L ∣∣∣∣ < ǫ whenever x > M . Hence −ǫ < f(x) g(x) − L < ǫ ∀x > M and rearranging gives 0 < (L− ǫ)g(x) < f(x) < (L+ ǫ)g(x) ∀x > M. By integrating with respect to x we obtain 0 < (L− ǫ) ∫ R M g(x) dx < ∫ R M f(x) dx < (L+ ǫ) ∫ R M g(x) dx whenever R > M . By letting R approach infinity it is seen that • if ∫∞M g(x) dx is convergent then ∫ RM f(x) is also convergent (since ∫ RM f(x) increases as R increases and it is bounded above by (L+ ǫ) ∫∞ M g(x) dx); • if ∫∞M f(x) dx is convergent then ∫ RM g(x) is also convergent (since ∫ RM g(x) is increases as R increases and is bounded above by 1L−ǫ ∫∞ M f(x) dx); • if ∫∞M f(x) dx is divergent then ∫ RM g(x) dx→∞ as R→∞; and c©2020 School of Mathematics and Statistics, UNSW Sydney 212 CHAPTER 8. INTEGRATION • if ∫ ∞ M g(x) dx is divergent then ∫ R M f(x) dx→∞ as R→∞. Hence either ∫ ∞ M f(x) dx and ∫ ∞ M g(x) dx both converge or ∫ ∞ M f(x) dx and ∫ ∞ M g(x) dx both diverge. The proof of the theorem now easily follows by first observing that∫ R a f(x) dx = ∫ M a f(x) dx+ ∫ R M f(x) dx and ∫ R a g(x) dx = ∫ M a g(x) dx+ ∫ R M g(x) dx whenever a < M < R, and then letting R approach infinity. 8.12 Functions defined by an integral Many important functions are defined by an integral. If f is a continuous function then we may define a function F by F (x) = ∫ x a f(t) dt. We note the following points. • The derivative of F may be calculated by the first fundamental theorem of calculus. • The value of F (x) may be approximated by computing Riemann sums. • Symmetries of f can help in understanding the behaviour of F (see, for example, Proposition 8.8.6). • The behaviour of F (x) as x→∞ or as x→ −∞ may be ascertained by studying an improper integral of f . We give one example. Others may be found in the problem set for Chapter 8 and at the beginning of Chapter 9. Consider the function f : R→ R given by f(t) = { sin t t if t 6= 0 1 if t = 0. Since lim t→0 sin t t = 1, the function f is continuous everywhere. We can now define the function Si : R→ R by Si(x) = ∫ x 0 f(t) dt ∀x ∈ R. The name Si comes from ‘sine integral.’ This function is used in electrical engineering for signal processing and in surveying for the Global Positioning System (GPS). Its graph is plotted in Figure 8.15. c©2020 School of Mathematics and Statistics, UNSW Sydney 8.12. FUNCTIONS DEFINED BY AN INTEGRAL 213 x Si(x) π 2 −π2 10π−10π || | | Figure 8.15: The Si function. Example 8.12.1. Consider the functions f and Si defined above. (a) Find the value of Si(0). (b) Show that Si is an odd function. (c) Find and classify the critical points of Si on (0, 3π). (d) A function G : R→ R is defined by the formula G(x) = ∫ x2 0 f(t) dt. Calculate G′(x). Solution. (a) Si(0) = ∫ 0 0 f(t) dt = 0. (b) We need to show that Si(−x) = − Si(x). Now Si(−x) = ∫ −x 0 sin t t dt = ∫ x 0 sin(−u) −u (−du) (using the substitution t = −u) = − ∫ x 0 − sinu −u du = − ∫ x 0 sinu u du = − Si(x). Hence Si is an odd function. (c) By the first fundamental theorem of calculus, Si′(x) = sinx x c©2020 School of Mathematics and Statistics, UNSW Sydney 214 CHAPTER 8. INTEGRATION for all x in (0, 3π). Critical points occur when Si′(x) = 0, that is, when x = π or when x = 2π. To classify the critical points, we calculate the second derivative: Si′′(x) = x cos x− sinx x2 . Now Si′′(π) = − 1π and Si′′(2π) = 12π . Therefore Si has a local maximum point when x = π and a local minimum point when x = 2π. (d) Note that G(x) = Si(x2), so to calculate G′(x) we use the chain rule: G′(x) = d dx ( Si(x2) ) = 2x Si′(x2) = 2x× f(x2) = 2x sin(x 2) x2 = 2 sin(x2) x . (If this calculation is too dense, then consider y = G(x), u = x2 so that y = ∫ u 0 f(t) dt and dy du = f(u) by the first fundamental theorem of calculus. Then by the chain rule, dy dx = dy du du dx = f(u)× 2x = 2xf(x2) = 2x sin(x 2) x2 = 2 sin(x2) x . Hence G′(x) = 2 sin(x 2) x .) c©2020 School of Mathematics and Statistics, UNSW Sydney 8.13. MAPLE NOTES 215 8.13 Maple notes The command int(f(x),x); will cause Maple to attempt to find the indefinite integral ∫ f(x) dx, and the command int(f(x),x=a..b); is used to find definite integrals ∫ b a f(x) dx. If Maple is unable to find a primitive, then numerical integration can be carried out by combining the int and evalf commands. For example, > int(1/(x*log(x)^2),x); − (ln (x))−1 > int(1/(x*log(x)^2),x=2..infinity); (ln (2))−1 > int( exp(-x^2)*ln(x), x=0..1 );∫ 1 0 e −x2 ln (x) dx > evalf(%); −0.9059404763 c©2020 School of Mathematics and Statistics, UNSW Sydney 216 CHAPTER 8. INTEGRATION Problems for Chapter 8 Problems 8.1 : Area and the Riemann Integral and 8.2 : Integration using Riemann sums 1. [R] [V] a) By taking the partition Pn = { 0, 1 n , 2 n , . . . , 1 } of the interval [0,1], calculate the lower sum SPn(f) and the upper sum SPn(f) for each function f . i) f(x) = 1 ii) f(x) = x iii) f(x) = x2 [You may need n∑ k=1 k2 = 16n(n+ 1)(2n + 1).] iv) f(x) = x3 [You may need n∑ k=1 k3 = 14n 2(n+ 1)2.] v) f(x) = { 1 if x ∈ Q 0 if x /∈ Q b) By taking the limit as n→∞ for each sum SPn(f) and SPn(f) calculated in (a), either calculate ∫ 1 0 f(x) dx, or show that f is not Riemann integrable. 2. [R] An electrical signal S(t) has its amplitude |S(t)| tested (sampled) every 110 of a second. It is desired to estimate the energy over a period of half a second, given exactly by (∫ 1 2 0 |S(t)|2 dt ) 1 2 . The results of the measurement are shown in the following table: t .1 .2 .3 .4 .5 |S(t)| 60 50 50 45 55 e(t) 5 3 7 4 10 a) Using the above data for S(t), set up an appropriate Riemann sum and compute an approximate value for the energy. b) It is known that the signal varies by an amount of at most ±e(t), as shown above, in each 110 second period. Calculate upper and lower bounds for the energy. c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 8 217 Problems 8.3 : The Riemann integral and signed area and 8.4 : Basic properties of the Riemann integral 3. [R] Find the area of the region bounded by the line y = x and the parabola y = x2 − 2. 4. [R] Find a) ∫ 9 4 x3 − x x3/2 dx b) ∫ 2 −4 |x| dx. 5. [H] Find a function f which satisfies the integral equation∫ x 0 tf(t) dt = ∫ 0 x (t2 + 1)f(t) dt+ x. 6. [R] Explain why ∫ 1 −1 1 x2 dx = [ −1 x ]1 −1 = −1− 1 = −2 is not valid. 7. [H] a) Suppose that f is a continuous increasing (and hence invertible) function on [a, b]. If c = f(a), d = f(b) and a, b, c, d ≥ 0, then explain why∫ d c f−1(t) dt = bd− ac− ∫ b a f(x) dx. b) Use this to find ∫ 1 1/2 sin−1 x dx. 8. [H] Suppose that U ′(x) = u(x). a) Find V ′(x) if V (x) = (a− x)U(x) + ∫ x 0 U(t) dt where a is a constant. b) Hence show that ∫ a 0 U(x) dx = aU(0) + ∫ a 0 (a− x)u(x) dx. Problems 8.5 : The first fundamental theorem of calculus 9. [H] Suppose that f(t) = ⌊t⌋ and F (x) = ∫ x 0 f(t) dt, where ⌊t⌋ is the greatest integer less than or equal to t. Use a graph of f to sketch F on the interval [−1, 3]. Is F continuous? Where is F differentiable? 10. [H] Suppose that f(t) = sin(t2). Sketch the graph of f on the interval [0, 3]. Use this to sketch the graph of F on the interval [0, 3], where F (x) = ∫ x 0 f(t) dt. Indicate where F has local maxima and minima. c©2020 School of Mathematics and Statistics, UNSW Sydney 218 CHAPTER 8. INTEGRATION 11. [R] [V] Find F ′(x) for each function F : R→ R given below. a) F (x) = ∫ x 0 sin(t2) dt b) F (x) = ∫ x3 0 sin(t2) dt c) F (x) = ∫ 1 x3 sin(t2) dt d) F (x) = ∫ x3 x sin(t2) dt 12. [R] Find d dx ∫ 4 x (5− 4t)5 dt. Problems 8.6 : The second fundamental theorem of calculus 13. [R] a) Suppose that f(x) = 1 x . By considering the lower Riemann sum for f with respect to the partition { n n , n+ 1 n , n+ 2 n , . . . , 2n n } of [1, 2], show that ln 2 = lim n→∞ ( 1 n+ 1 + 1 n+ 2 + · · ·+ 1 2n ) . b) Suppose that f(x) = 1√ 1− x2 . i) Show that f is increasing on the interval [0, 12 ]. ii) Find the upper Riemann sum for f with respect to the partition{ 0 2n , 1 2n , 2 2n , 3 2n , . . . , n 2n } of [0, 12 ]. iii) Hence evaluate lim n→∞ ( 1√ 4n2 − 12 + 1√ 4n2 − 22 + 1√ 4n2 − 32 + · · · + 1√ 4n2 − n2 ) . Problems 8.7 : Indefinite integrals 14. [R] Evaluate the following integrals by inspection. a) ∫ x ex 2 dx b) ∫ sin √ x√ x dx c) ∫ 1 0 2x(1 + x2)3 dx d) ∫ a −a x2 √ a3 − x3 dx (a > 0) e) ∫ π/2 0 cos3 x sin x dx f) [H] ∫ 0 −1 √ t2 + t4 dt c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 8 219 Problems 8.8 : Integration by substitution 15. [R] Use a substitution to evaluate the following integrals. a) ∫ dx 1 + √ x b) ∫ x(5x− 1)19 dx c) [V] ∫ 1− x (1 + x)3 dx d) [V] ∫ 4 0 dx 5 + √ x Problems 8.9 : Integration by parts 16. [R] Use integration by parts to evaluate the following integrals. a) ∫ 1 0 x e5x dx b) ∫ x2 cos x dx c) ∫ ln x dx d) ∫ 0.5 0 sin−1 x dx e) ∫ e 1 x7 ln x dx f) ∫ π 0 x2 cos 2x dx g) [V] ∫ ex cos x dx h) ∫ tan−1 x dx i) [H] ∫ π/4 0 sec3 θ dθ Problems 8.10 : Improper integrals 17. [R] [V] Evaluate the following improper integrals or show that they diverge. a) ∫ ∞ 0 e−5xdx b) ∫ 1 −∞ e−0.01xdx c) ∫ ∞ 0 dx 4 + x2 d) ∫ ∞ −∞ x3 e−x 4 dx e) ∫ ∞ 2 dx (x− 1)3/2 f) ∫ ∞ e dx x ln x 18. [H] Prove that ∫ ∞ 0 xne−xdx = n! whenever n = 0, 1, 2, . . . . 19. [H] a) Find lim R→∞ ∫ R −R x 1 + x2 dx. b) Find lim R→∞ ∫ 2R −R x 1 + x2 dx. c) Does ∫ ∞ −∞ x 1 + x2 dx converge? Explain. Problems 8.11 : Comparison tests for improper integrals 20. [R] Use the inequality form of the comparison test to determine whether or not the following improper integrals converge. c©2020 School of Mathematics and Statistics, UNSW Sydney 220 CHAPTER 8. INTEGRATION a) [V] ∫ ∞ 1 1√ 1 + x4 dx b) ∫ ∞ 2 1 3 √ x2 − x dx c) ∫ ∞ 2 1 lnx dx 21. [R] Use the limit form of the comparison test to determine whether or not the following improper integrals converge. a) ∫ ∞ 2 x 2x3 − 1 dx b) ∫ ∞ 1 2x− 1 x2 + 2 dx c) ∫ ∞ 2 1√ x6 − 1 dx 22. [R] Use a comparison test to determine whether or not the following improper integrals converge. a) ∫ ∞ 1 3x+ sinx+ 2 2x3 − x+ 8 dx b) ∫ ∞ 4 4x3 − x+ 5 x4 − x2 + 1 dx c) [H] ∫ ∞ 2 ln t t3/2 dt 23. [H] Find all real numbers s such that the improper integral∫ ∞ 1 xs 1 + x dx is convergent. 24. [H] Find all real numbers p such that ∫ ∞ 2 1 x(lnx)p dx converges. 25. [H] For which pairs of numbers (a, b) does the improper integral ∫ ∞ 1 xb (1 + x2)a dx converge? Problems 8.12 : Functions defined by an integral 26. [R] [V] Given a positive real number x, let π(x) denote the number of primes less than or equal to x. The function Li with domain (1,∞) is given by Li(x) = ∫ x 2 1 ln t dt and is known as the ‘logarithmic integral function’. It has the property that Li(x) π(x) ≈ 1 when x is sufficiently large. a) Evaluate π(10), π(20) and π(3.14159). b) Suppose that x > 0. What does π(x) x represent? c) Find ddx Li(x) and Li(2). d) By applying the mean value theorem to Li on the interval [2, 106], find a lower bound for Li(106). c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 8 221 e) If x is large then π(x) x ≈ π(x) x Li(x) π(x) = Li(x) x . Using this approximation and your answer to part (d), find an approximate lower bound for π(106) 106 . Note: There are 78, 498 primes less than one million so the actual value of π(106) 106 is 0.078498. 27. [R] The function erf : R→ R is defined by the formula erf(x) = 2√ π ∫ x 0 e−t 2 dt. The function erf is an error function and can be used to calculate the probability that a measurement has an error in a given range of values. a) Calculate erf ′(x). b) Explain why erf is an increasing function on R. c) [H] Show that erf is an odd function. d) i) By calculating Riemann sums with respect to the partition {0, 14 , 12 , 34 , 1}, find upper and lower bounds for erf(1). ii) Explain why e−t2 < e−t whenever t > 1. iii) Hence show that ∫ ∞ 1 e−t 2 dt converges and find an upper bound for this improper integral. iv) Using your answers to (i) and (iii), find an upper bound for lim x→∞ erf(x). (In fact, lim x→∞ erf(x) = 1 but this is not so easy to prove.) e) Sketch the graph of erf. f) Explain why erf has an inverse function erf−1 and sketch its graph. c©2020 School of Mathematics and Statistics, UNSW Sydney 222 CHAPTER 8. INTEGRATION c©2020 School of Mathematics and Statistics, UNSW Sydney 223 Chapter 9 The logarithmic and exponential functions The volume of a colony of bacteria on an agar plate is 1 cubic millimetre and doubles every day. Let V (t) denote the volume of bacteria after t days. It is clear that V (0) = 1, V (1) = 2, V (2) = 4, V (3) = 8, V (4) = 16, and so on. In general V (t) = 2t if t is a nonnegative integer. The corresponding points are plotted in Figure 9.1 (a). If one wanted to determine the volume of the colony part-way through a day, it would be natural to use the formula V (t) = 2t, where t is a nonnegative real number. This formula makes sense when t is a rational number. If t = p/q where p and q are integers and q > 0, then 2t is defined to be the unique positive qth root of 2p. However, if t is an irrational number, this definition does not apply. So what do we mean by 2 √ 3 or 2π? We have not yet seen a definition for such numbers. That this shortcoming in our current definition of 2t is a serious problem can be illustrated graphically. The graph corresponding to V (t) = 2t, where t is confined to the nonnegative rational numbers, has an infinite number of ‘gaps’ (see Figure 9.1 (b)). Clearly this is not a satisfactory state of affairs. One of the aims of this chapter is to rectify this deficiency and show that, when t is an irrational number, 2t can be defined in such a way that the graph of V : [0,∞) → R is continuous (see in Figure 9.1 (c)). Our plan of attack is the following. 1. In Section 9.2, we define the function ln as an integral (by the fundamental theorem of calculus). 2. In Section 9.3, we define the function exp as the inverse of ln (by the inverse function theorem). 3. In Section 9.4, we define the number bt, where b > 0 and t is an irrational number, by a formula involving both ln and exp. c©2020 School of Mathematics and Statistics, UNSW Sydney 224 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS t V (t) b b b b b | | 4 16 (a) t V (t) | | 4 16 (b) t V (t) | | 4 16 (c) Figure 9.1: Growth in volume of bacteria on an agar plate. We shall see that when the definition of bt (for rational t) is combined with the definition of bt (for irrational t), the resulting function f : R→ R, given by f(t) = bt ∀t ∈ R, is continuous. 9.1 Powers and logarithms The content of this section is revision, and is placed here primarily as reference. It summarises a few known definitions and facts about powers and logarithms. Definition 9.1.1. Suppose that b is a positive real number. If p and q are integers and q > 0 then bp/q is defined to be the unique positive qth root of bp. The following is an elementary definition of the logarithm to the base b. Definition 9.1.2. Suppose that b is a positive real number not equal to one, c is a rational number and a = bc. Then logb a is defined by the formula c = logb a. Note that, according to Definition 9.1.2, one cannot write down logb a for any positive number a (even if a is rational); logb a is only defined if a = b c for some rational number c. (This short- coming in the domain of the function logb will be rectified later in the chapter when Definition 9.1.2 is superseded by a superior definition.) c©2020 School of Mathematics and Statistics, UNSW Sydney 9.2. THE NATURAL LOGARITHM FUNCTION 225 The elementary properties of logarithms are summarised below: if x > 0, y > 0 and b > 0 with b 6= 1 then logb 1 = 0, logb b = 1, logb(xy) = logb x+ logb y, logb ( x y ) = logb x− logb y, logb(b r) = r, logb(x r) = r logb x. (9.1) Remark 9.1.3. In the properties listed above, it is implicitly assumed that r is a rational number, and that x and y are of the form bc for some rational number c. One of the aims of this chapter is to remove these awkward assumptions. 9.2 The natural logarithm function (Ref: SH10 §7.2) The ln function will already be familiar to you from high school. However, it was probably never defined rigorously. In order to address the problems identified in the introduction to Chapter 9, it will be necessary to start from scratch. Fortunately, by the fundamental theorem of calculus, this is not difficult. Definition 9.2.1. The function ln : (0,∞)→ R is defined by the formula ln(x) = ∫ x 1 1 t dt. We read ln as either ‘ell en’ or as ‘log’. Since the function ln is defined by an integral, lnx is simply the area of the shaded region shown below in the case when x > 1. t 1/t 1 x One can also immediately see, by the first fundamental theorem of calculus, that d dx ( lnx ) = 1 x . This and other properties of ln are stated in the proposition below. c©2020 School of Mathematics and Statistics, UNSW Sydney 226 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS Proposition 9.2.2. The function ln has the following properties: (i) ln is differentiable on (0,∞) and d dx ( lnx ) = 1 x ; (ii) lnx > 0 if x > 1, ln 1 = 0 and lnx < 0 if 0 < x < 1; (iii) lnx→ −∞ as x→ 0+ and lnx→∞ as x→∞; (iv) ln(xy) = ln(x) + ln(y) for all positive real numbers x and y; (v) ln ( x y ) = ln(x)− ln(y) for all positive real numbers x and y; and (vi) ln(xr) = r ln(x) whenever r is a rational number and x is a positive real number. Proof. (i) Apply the first fundamental theorem of calculus (Theorem 8.5.1) to the definition of ln. (ii) This follows from Definition 8.4.2 and the fact that 1t > 0 when t > 0. (iii) We need to show that the improper integral ∫∞ 1 1 t dt diverges to infinity. The diagram below shows that ∫ 2 1 dt t ≥ 1× 1 2 , ∫ 4 2 dt t ≥ 2× 1 4 , ∫ 8 4 dt t ≥ 4× 1 8 and so on. t y 1 2 4 8 y = 1 t Hence ∫ 2n 1 dt t ≥ 1 2 + 1 2 + · · ·+ 1 2︸ ︷︷ ︸ n terms = n 2 →∞ as n→∞. Hence the integral diverges and therefore lnx→∞ as x→∞. This argument can be adapted to show that∫ 1 2−n dt t → −∞ as n→∞. Hence lnx→ −∞ as x→ 0+. (iv) Suppose that y is some fixed positive real number and that x ∈ (0,∞). Then, by the chain rule and part (i), d dx ( ln(xy) ) = y ln′(xy) = y × 1 xy = 1 x = d dx ( ln(x) ) . c©2020 School of Mathematics and Statistics, UNSW Sydney 9.2. THE NATURAL LOGARITHM FUNCTION 227 Hence ln(xy) = ln(x) + C for some constant C. When x = 1 we obtain ln(y) = ln(1) + C = 0 + C = C by part (ii). Hence ln(xy) = ln(x) + C = ln(x) + ln(y) as required. (v) The proof uses the same technique as the proof of (vi) and is left to the reader. (vi) This proof also uses the same technique as the proof of (iv). The fact that ln(1) = 0 and that ln satisfies properties (iv), (v) and (vi) suggests that ln coincides with one of the logarithm functions logb for some base b (with the advantage that Dom(ln) = (0,∞)). We now aim to identify the base. The function ln is increasing and continuous with Range(ln) = R (see Proposition 9.2.2 (i) and (ii)). Hence, by the intermediate value theorem, there is exactly one real number x that satisfies the equation ln(x) = 1. We call this solution e (after Leonard Euler, who was probably the greatest mathematician of the eighteenth century). Definition 9.2.3. The real number e is defined to be the unique number x that satisfies the equation ∫ x 1 1 t dt = 1. Thus ln(e) = 1. One can show that e is irrational and that e ≈ 2.71828. From Proposition 9.2.2, we see now that ln(er) = r for every rational number r. This equation shows that ln coincides with the logarithm to the base e. Since the number e arises naturally in many contexts, ln is usually called the natural logarithm. By using the properties of ln given by Propoistion 9.2.2, one can draw the graph of ln. c©2020 School of Mathematics and Statistics, UNSW Sydney 228 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS x lnx 1 e 1 We note that, although ln(x) → ∞ as x → ∞, we have ddx ln(x) = 1x → 0 as x → ∞. This means the graph of ln becomes flatter towards infinity and hence approaches infinity very slowly. 9.3 The exponential function (Ref: SH10 §7.4) In Section 9.2 it was shown that ddx(lnx) > 0 for all x > 0. Therefore, by the inverse function theorem (Theorem 6.4.1), ln has an inverse function. Definition 9.3.1. The function exp : R → (0,∞) is defined to be the inverse function of ln : (0,∞)→ R. By reflecting the graph of ln in the line y = x, one obtains the graph of exp. x y 1 (e, 1) b (1, e) b 1 y = lnx y = expx Proposition 9.3.2. The function exp : R→ (0,∞) has the following properties: c©2020 School of Mathematics and Statistics, UNSW Sydney 9.3. THE EXPONENTIAL FUNCTION 229 (i) exp(ln x) = x for all x in (0,∞) and ln(exp x) = x for all x in R; (ii) exp(1) = e and exp(0) = 1; (iii) expx→∞ as x→∞ and expx→ 0 as x→ −∞. (iv) exp is differentiable on R and ddx(expx) = expx for all x in R; (v) exp(x+ y) = exp(x) exp(y) for all x and y in R; and (vi) exp(rx) = (expx)r for every real number x and every rational number r. Proof. The proof of (i) follows immediately from Definition 9.3.1 and the proof of (ii) and (iii) then follow from (i) and the continuity of ln and exp. That exp is differentiable on R follows from the inverse function theorem. To complete the proof of (iii), we differentiate both sides of the equation ln(expx) = x with respect to x to obtain 1 expx × d dx (exp x) = 1 by the chain rule. Rearranging gives ddx(expx) = expx as required. If x and y are real numbers then exp(x+ y) = exp ( ln(exp x) + ln(exp y) ) (by part (i)) = exp ( ln(exp x× exp y) ) (by Proposition 9.2.2 (iv)) = expx× exp y (by part (i)). This completes the proof of (iv) To prove (v), suppose that r is a rational number and x is a real number. Then exp(rx) = exp ( r ln(exp x) ) (by part (i)) = exp ( ln ( (exp x)r )) (by Proposition 9.2.2 (v)) (exp x)r (by part (i)). This completes the proof of the proposition. Note that by Proposition 9.3.2 (ii) and (v) we have exp(r) = (exp 1)r = er (9.2) for every rational number r. This inspires the following definition. Definition 9.3.3. Suppose that x is an irrational number. Then we define the real number ex by the formula ex = expx. c©2020 School of Mathematics and Statistics, UNSW Sydney 230 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS Thus Definition 9.3.3 extends Definition 9.1.1, in the case when b = e, to include irrational powers. Moreover, since expx = ex and exp is a continuous function, the function f : R→ R, given by f(x) = ex, is also continuous. We have thus made significant progress towards the problem outlined at the beginning of the chapter. Finally, since expx = ex, we call exp the exponential function. 9.4 Exponentials and logarithms with other bases (Ref: SH10 §7.5) At the end of the last section, we gave a definition of ex, where x is an irrational number. Moreover, if f : R → R is given by f(x) = ex, then f is a continuous and differentiable function. In this section, we do the same for bx, where b is any positive real number. To begin, consider a rational number r and any positive real number b. Then br = exp ( ln(br) ) = exp ( r ln b ) = er ln b (9.3) by the definitions and properties established in the last two sections. This inspires the following definition. Definition 9.4.1. Suppose that b is a positive real number and x is an irrational number. We define the number bx by the formula bx = ex ln b. By combining equation (9.3) and Definition 9.4.1 we see that bx = ex ln b ∀x ∈ R. (9.4) Since exp and ln are continuous functions, the function f : R→ R, given by f(x) = bx ∀x ∈ R, is also continuous (see Proposition 3.1.2). This solves the problem articulated at the beginning of the chapter. Thus 2 √ 3 = e √ 3 ln 2 ≃ 1.492106 . . . . Moreover, we can discard our old definition of logb (see Definition 9.1.2), together with its technical difficulties (see Remark 9.1.3), and replace it with a simpler and more powerful definition. Definition 9.4.2. Suppose that b is a positive real number, b 6= 1. Then the function logb : (0,∞)→ R is defined to be the inverse of the function f : R→ (0,∞), where f(x) = bx for all x in R. c©2020 School of Mathematics and Statistics, UNSW Sydney 9.5. INTEGRATION AND THE LN FUNCTION 231 We leave it as an exercise to show that the function f of Definition 9.4.2 is one-to-one and hence invertible. The graphs of various exponential functions and their corresponding logarithmic functions are shown below. x y 1 1 y = log8 x y = log3 x y = log2 x y = 2x y = 3xy = 8x The following proposition gives a formula for logb in terms of ln. Its proof is left as an exercise. Proposition 9.4.3. Suppose that b is a positive real number, b 6= 1. Then logb x = lnx ln b ∀x ∈ (0,∞). (9.5) Using equations (9.4) and (9.5), one can easily establish familiar properties for the exponentials and logarithms to the base b, with the advantage that we no longer need the technical restrictions of Remark 9.1.3. 9.5 Integration and the ln function We have already seen that d dx (ln x) = 1 x provided that x > 0. It follows from the chain rule that d dx ( ln(f(x)) ) = 1 f(x) × f ′(x) = f ′(x) f(x) , provided that f is differentiable and f(x) > 0. Hence ∫ f ′(x) f(x) dx = ln(f(x)) +C. c©2020 School of Mathematics and Statistics, UNSW Sydney 232 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS It is not difficult to remove the restriction that f(x) > 0. To take a simple example, suppose that x 6= 0. Then d dx ( ln |x|) = { d dx ( lnx ) if x > 0 d dx ( ln(−x)) if x < 0 = { 1 x if x > 0 1 −x × (−1) if x < 0 = 1 x . Thus ∫ 1 x dx = ln |x|+ C provided x is restricted to an interval not containing zero. By generalising this calculation we obtain the formula ∫ f ′(x) f(x) dx = ln |f(x)|+ C, provided that f is never zero over the interval of integration. Example 9.5.1. Find (a) ∫ tan x dx (b) ∫ sec x dx. Solution. (a) By using the fact that tan x = sinxcos x we have∫ tanx dx = − ∫ − sinx cos x dx = − ln | cos x|+ C. (The indefinite integral of cot can be found similarly.) (b) This example involves a special trick: ∫ sec x dx = ∫ secx tan x+ sec x tan x+ sec x dx = ∫ tanx sec x+ sec2 x sec x+ tanx dx = ln | sec x+ tanx|+ C. (A similar trick can be used to find the indefinite integral of cosec.) c©2020 School of Mathematics and Statistics, UNSW Sydney 9.6. LOGARITHMIC DIFFERENTIATION 233 9.6 Logarithmic differentiation (Ref: SH10 §7.3) Logarithms are powerful because they transform powers into products, products into sums and quotients into differences. In this section, we illustrate the use of the logarithm for differentiating functions that are defined by (combinations of) powers, products or quotients. Example 9.6.1. Find dydx in each case: (a) y = 10x (b) y = ( (3x2 + 4)(x+ 2) x3 + 5x )3/5 (c) y = (sinx)cos x. Solution. (a) If we take ln of both sides of the equation y = 10x then ln y = ln 10x = x ln 10. By differentiating with respect to x, we obtain 1 y dy dx = ln 10 and hence dy dx = (ln 10) × y = (ln 10)10x. Alternatively: ddx(10 x) = ddx(e x ln 10) = ln 10ex ln 10 = (ln 10)10x. Remark 9.6.2. Recall that one can only take the logarithm of a positive number. Hence our solution to Example 9.6.1 (b) and (c) is only valid when y > 0. 9.7 Indeterminate forms with powers (Ref: SH10 §11.6) Consider the limits lim x→0+ xx and lim x→∞x 1/x. (9.6) The first limit is of the form 00 while the second is of the form ∞0. Both of these forms are examples of indeterminate forms and hence we cannot say what the corresponding limits are in each case without further calculation. Since each limit involves a power, it is natural to first take the logarithm of the limit and then bring l’Hoˆpital’s rule into play. Example 9.7.1. Evaluate the limit lim x→0+ x2x. c©2020 School of Mathematics and Statistics, UNSW Sydney 234 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS Solution. The limit is an indeterminate form of the type 00. By taking the natural logarithm, we can transform the limit into an indeterminate form of the type ∞∞ : lim x→0+ x2x = lim x→0+ exp ( lnx2x ) (since ln and exp are inverses) = lim x→0+ exp ( 2x ln x ) = exp ( lim x→0+ 2x lnx ) (since exp is continuous) = exp ( lim x→0+ lnx 1/(2x) ) . We can now apply l’Hoˆpital’s rule to the problem. By differentiating the numerator and denomi- nator and then simplifying we obtain lim x→0+ x2x = exp ( lim x→0+ 1/x −1/(2x2) ) = exp ( lim x→0+ −2x ) = exp(0) = 1. Each of the limits in (9.6) can be evaluated by using the ideas in this example. Remark 9.7.2. The limit of Example 9.7.1 is of the form 00 and has the value 1. The limit lim x→0+ ( e− 1 x )x is also of the form 00. However, its value is not 1: lim x→0+ ( e− 1 x )x = lim x→0+ e−1 = 1 e . The fact that limits of the type 00 can have different values explains why 00 is an indeterminate form. c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 9 235 Problems for Chapter 9 Problems 9.1 : Powers and logarithms and 9.2 : The natural logarithm function 1. [R] a) Write down the definition of lnx, where x > 0. b) Explain why d dx lnx = 1 x whenever x > 0. c) Suppose that r is a rational number and that x and y are positive real numbers. i) By first differentiating ln(xy) with respect to x, show that ln(xy) = lnx+ ln y. ii) Use the same technique to show that ln ( x y ) = lnx− ln y and ln(xr) = r lnx. 2. [R] a) Prove, using upper and lower Riemann sums and the definition of lnx, that ln 2 < 1 < ln 4, and hence that 2 < e < 4. b) [H] Use Maple and the method of part (a) to prove that 52 < e < 3. How many partition points do you need? 3. [R] Find the derivatives of a) f(x) = ln √ x3 + 1 b) g(x) = e|x| c) h(x) = ln(ln(ln x)) d) q(x) = eln(x 5+6) Problems 9.3 : The exponential function and 9.5 : Integration and the ln function 4. [R] Find a) ∫ e2x 1 + e2x dx b) ∫ e1/x x2 dx c) ∫ 3xdx d) ∫ e √ x 8 √ x dx e) ∫ ln x x dx f) ∫ cot x dx. (Hint for part (f): express cot in terms of sin and cos.) 5. [R] Sketch the curves a) y = ln(1 + ex) b) y = (ex + x) (ex − x) . c©2020 School of Mathematics and Statistics, UNSW Sydney 236 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS 6. [R] a) Sketch the curve y = lnx x , noting any turning points and asymptotes. b) By using (a) or otherwise, prove that πe < eπ. 7. [R] [V] x y y = 1x 1 t 1+t 1 1 + 1t a) From the graph, explain why 1 1 + t ≤ ln ( 1 + 1 t ) ≤ 1 t whenever t ≥ 0. b) Deduce that lim t→∞ ln ( 1 + 1 t )t = 1 and hence find the value of lim t→∞ ( 1 + 1 t )t . Problems 9.6 : Logarithmic differentiation 8. [R] Use logarithmic differentiation to find dy dx if a) y = 3x b) y = ( x3 − 3 1 + x2 )1/5 c) y = (sinx)sinx d) y = sin(xsinx). Problems 9.7 : Indeterminate forms with powers 9. [R] Calculate the following limits: a) [V] lim x→∞ lnx xa , a > 0 b) lim x→0+ xa lnx, a > 0 c) lim x→0+ xx d) lim x→0+ x2/ ln x e) lim x→∞ x 1/x f) lim x→∞ a 1/x, a > 0 g) lim x→∞ ( 1 + a x )x h) lim x→∞ x 100 e−x i) lim x→∞ p(x) e −x ,where p is any polynomial. c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 9 237 10. [H] Prove that the functions f : (−1,∞)→ R and g : (−1,∞)→ R, given by f(x) = ln(1 + x)− ( x− x 2 2 ) and g(x) = ( x− x 2 2 + x3 3 ) − ln(1 + x), are increasing on (0,∞). Deduce that x− x 2 2 < ln(1 + x) < x− x 2 2 + x3 3 whenever x > 0. c©2020 School of Mathematics and Statistics, UNSW Sydney 238 CHAPTER 9. THE LOGARITHMIC AND EXPONENTIAL FUNCTIONS c©2020 School of Mathematics and Statistics, UNSW Sydney 239 Chapter 10 The hyperbolic functions A telecommunications company wishes to suspend a cable from a sequence of telegraph poles in a cost effective manner. There are at least two competing considerations. First, it is desirable to minimise the length of cable used. Second, it is desirable to minimise the amount of tension in the cable (since an increase in tension requires an increase in cable strength). The configuration in Figure 10.1 (a) requires a short length of strong cable while that in Figure 10.1 (b) reduces the tension in the cable at the cost of increasing its length. Is there an optimal configuration for this problem? In order to solve this problem, one needs an equation which describes the curve of a suspended cable. Such a curve is called a catenary, after the Latin word for ‘chain.’ Galileo claimed that every catenary was a parabola, but this was later disproved in 1669. Two decades later, the Swiss mathematician Jakob Bernoulli issued a public challenge to find the general equation that describes a catenary. Three mathematicians Gottfried Leibniz, Christiaan Huygens and Johann Bernoulli (Jakob’s brother) independently derived the correct equation: y = a 2 ( ex/a + e−x/a ) (10.1) (see Figure 10.1 (c)). In this chapter we study the function described by equation (10.1) and other functions related to it. Such functions are important to mathematics, engineering, physics (especially the theory of relativity) and architecture. We note from the outset that a large number of new formulae will be introduced in this chapter. However, students do not need to memorise all of them. Those which should be memorised are listed in Section 10.7. (a) (b) x y a (c) Figure 10.1: Catenaries. c©2020 School of Mathematics and Statistics, UNSW Sydney 240 CHAPTER 10. THE HYPERBOLIC FUNCTIONS 10.1 Hyperbolic sine and cosine functions (Ref: SH10 §7.8) We mentioned in the introduction that the equation describing a freely suspended chain is given by y = f(x), where f(x) = a 2 ( ex/a + e−x/a ) . To begin we study the simplest case: when a = 1. Definition 10.1.1. The hyperbolic cosine function cosh : R → R is defined by the formula coshx = 12(e x + e−x) ∀x ∈ R. The reasoning behind the name ‘hyperbolic cosine’ will gradually become apparent. Since cosh is a linear combination of exponential functions, it is differentiable and d dx ( coshx ) = 1 2 d dx ( ex + e−x ) = 1 2 (ex − e−x). The derivative of cosh is important in its own right and is given a special name. Definition 10.1.2. The hyperbolic sine function sinh : R → R is defined by the formula sinhx = 12(e x − e−x) ∀x ∈ R. Some people pronounce sinh as ‘shine.’ One can easily show that the derivative of sinh is cosh. Thus we have d dx (sinhx) = coshx and d dx (cosh x) = sinhx. (10.2) We now consider the graphs of these functions. Example 10.1.3. Sketch the graph of sinh. Solution. We begin by identifying a few key features of the sinh function. • Since sinh(−x) = 12(e−x − ex) = −12(ex − e−x) = − sinhx, we see that sinh is an odd function. As a corollary, sinh(0) = 0. • Since d dx (sinhx) = cosh x = 12(e x + e−x) > 0, sinh is an increasing function. c©2020 School of Mathematics and Statistics, UNSW Sydney 10.1. HYPERBOLIC SINE AND COSINE FUNCTIONS 241 x y y = sinhx y = 12e x (a) x y y = cosh x y = 12e x 1 (b) Figure 10.2: Graphs of sinh and cosh. • Since lim x→∞ e −x = 0, sinhx gets arbitrarily close to 12e x as x→∞. From these observations we obtain a sketch of the graph of sinh (see Figure 10.2). Example 10.1.4. Sketch the graph of cosh. Solution. Some properties of the cosh function are listed below: • cosh is an even function; • cosh(0) = 1; • cosh is decreasing on (−∞, 0), stationary at 0 and increasing on (0,∞); • coshx ≥ 1 for all x in R; • coshx gets arbitrarily close to 12ex as x→∞. The reader should verify each of these properties by appealing to Definition 10.1.1. From these observations, we obtain a sketch of the graph of cosh (see Figure 10.2). The functions cosh and sinh are related by the following important identity. Proposition 10.1.5. If x is a real number then cosh2 x− sinh2 x = 1. Proof. If x ∈ R then cosh2 x− sinh2 x = ( 1 2(e x + e−x) )2 − ( 1 2 (e x − e−x) )2 = 1 4 × 2ex × 2e−x difference of two squares = 1, completing the proof. c©2020 School of Mathematics and Statistics, UNSW Sydney 242 CHAPTER 10. THE HYPERBOLIC FUNCTIONS We are now in a position to appreciate why these functions are called ‘hyperbolic cosine’ and ‘hyperbolic sine.’ First, we examine their similarity to the trigonometric cosine and sine functions. The sine and cosine functions have the following properties: d dx (cos x) = − sinx d dx (sinx) = cos x cos is an even function sin is an odd function cos2 x+sin2 x = 1. The functions sinh and cosh have analogous properties (with the occasional adjustment of a negative sign): d dx (cosh x) = sinhx d dx (sinhx) = coshx cosh is an even function sinh is an odd function cosh2 x− sinh2 x = 1. The next example explains the origin of the name hyperbolic in ‘hyperbolic cosine.’ Example 10.1.6. Sketch the curve defined by the parametric equation{ x(t) = cosh t y(t) = sinh t, (10.3) where t ∈ R. Solution. The parameter t in equation (10.3) can be eliminated by Proposition 10.1.5 to obtain 1 = cosh2 t− sinh2 t = ( x(t) )2 − (y(t))2, or simply x2 − y2 = 1. This equation describes an hyperbola. Since cosh t > 0 for every real number t, it follows that x > 0 and hence the curve is the branch of the hyperbola that lies in the right-half plane (see the black curve in Figure 10.3 (b)). Its asymptotes are the lines y = x and y = −x. (The other branch of the hyperbola is parameterised by{ x(t) = − cosh t y(t) = sinh t where t ∈ R, and is shown in Figure 10.3 (b) in gray.) Hence we use trigonometric sine and cosine to paramaterise a circle or ellipse, and hyperbolic sine and cosine to parameterise an hyperbola. c©2020 School of Mathematics and Statistics, UNSW Sydney 10.2. OTHER HYPERBOLIC FUNCTIONS 243 x y 1 Figure 10.3: Sketch for Example 10.1.6. 10.2 Other hyperbolic functions (Ref: SH10 §7.9) Other hyperbolic functions are defined in analogy to the trigonometric functions. Thus we have tanhx = sinhx cosh x , coth x = cosh x sinhx , sech x = 1 cosh x , cosech x = 1 sinhx . Some people pronounce tanh as ‘than.’ The derivative of each of these functions is easy to compute. Example 10.2.1. Compute the derivative of tanh. Solution. By the definition of tanh, d dx (tanh x) = d dx ( sinhx coshx ) = cosh x ddx(sinhx)− sinhx ddx(cosh x) cosh2 x (by the quotient rule) = cosh2 x− sinh2 x cosh2 x (by (10.2)) = 1 cosh2 x (by Proposition 10.1.5) = sech2 x. A complete list of derivatives will be given in Section 10.4. c©2020 School of Mathematics and Statistics, UNSW Sydney 244 CHAPTER 10. THE HYPERBOLIC FUNCTIONS x tanhx 1 −1 Figure 10.4: The graph of tanh. Example 10.2.2. Sketch the graph of the function tanh. Solution. Since tanhx = sinhxcosh x and coshx > 0 for all x in R, we see that Dom(tanh) = R. Some properties of tanh are listed below. • By using the fact that cosh is even and sinh is odd, we see that tanh(−x) = sinh(−x) cosh(−x) = − sinhx coshx = − tanh(x). Hence tanh is an odd function. It follows that tanh(0) = 0. • Since ddx(tanhx) = sech2 x > 0 for all x in R, the function tanh is increasing everywhere. • By writting tanhx in terms of exponentials we find that tanhx = sinhx cosh x = ex − e−x ex + e−x = 1− e−2x 1 + e−x → 1 as x→∞. • Finally, the slope of the graph of tanh at the origin is d dx (tanhx) ∣∣∣ x=0 = sech2 0 = 1. These features are shown in the following sketch in Figure 10.4. c©2020 School of Mathematics and Statistics, UNSW Sydney 10.3. HYPERBOLIC IDENTITIES 245 10.3 Hyperbolic identities (Ref: SH10 §7.8) It has already been observed that some identities involving cosh and sinh are almost analogous to those for cos and sin. This analogy can also be seen in the following larger list of identities. There are the ‘difference of squares’ identities cosh2 x− sinh2 x = 1, 1− tanh2 x = sech2 x, coth2 x− 1 = cosech2 x, the ‘sum and difference’ formulae sinh(x± y) = sinhx cosh y ± cosh x sinh y, cosh(x± y) = coshx cosh y ± sinhx sinh y, tanh(x± y) = tanhx± tanh y 1± tanhx tanh y , and the ‘double-angle’ formulae sinh(2x) = 2 sinhx coshx, cosh(2x) = cosh2 x+ sinh2 x, tanh(2x) = 2 tanh x 1 + tanh2 x . The following remark gives an easy method for remembering these identities. Remark 10.3.1. There is a convenient mnemonic for these hyperbolic identities, provided that you know the corresponding trigonometric identities. Simply replace cos with cosh and sin with i sinh. For example, cos2 x+ sin2 x = 1 −→ cosh2 x+ i2 sinh2 x = 1 −→ cosh2 x− sinh2 x = 1, and sin(x+ y) = sinx cos y + cos x sin y −→ i sinh(x+ y) = i sinhx cosh y + i cosh x sinh y −→ sinh(x+ y) = sinhx cosh y + cosh x sinh y. Since tan = sincos we replace tan with i sinh cosh = i tanh. Similarly sec is replaced with sech. Thus 1 + tan2 x = sec2 x −→ 1 + i2 tanh2 x = sech2 x −→ 1− tanh2 x = sech2 x. We emphasise that Remark 10.3.1 provides a way to remember each formula, not to prove it. Proofs are constructed using techniques that are illustrated in the following example. c©2020 School of Mathematics and Statistics, UNSW Sydney 246 CHAPTER 10. THE HYPERBOLIC FUNCTIONS Example 10.3.2. (a) Prove the identity cosh(x+ y) = cosh x cosh y + sinhx sinh y. (b) Hence show that cosh(x− y) = cosh x cosh y − sinhx sinh y. (c) By using the result of (a), prove the double-angle formula for cosh. (d) By assuming that sinh(x+ y) = sinhx cosh y + cosh x sinh y, (10.4) use the result of (a) to show that tanh(x+ y) = tanhx+ tanh y 1 + tanhx tanh y . Solution. (a) By the definition of cosh, LHS = 12 ( ex+y + e−(x+y) ) while RHS = 12 ( ex + e−x ) 1 2 ( ey + e−y ) + 12 ( ex − e−x)12(ey − e−y) = 14 ( ex+y + ex−y + e−x+y + e−x−y ) + 14 ( ex+y − ex−y − e−x+y + e−x−y) = 12e x+y + 12e −x−y = LHS. This proves the identity. Remark 10.3.3. Note that in Example 10.3.2, each proof started by simplifying either the left- hand side of the equation, or the right-hand side of the equation. It would be wrong to begin the proof of (a) by cosh(x+ y) = coshx cosh y + sinhx sinh y 1 2 ( ex+y + e−(x+y) ) = 12 ( ex + e−x ) 1 2 ( ey + e−y ) + 12 ( ex − e−x)12(ey − e−y) · · · = · · · Such a method is flawed and can be used to ‘prove’ erroneous statements such as ‘1 = 0’. 10.4 Hyperbolic derivatives and integrals (Ref: SH10 §§7.8, 7.9) Below is a list of derivatives for the hyperbolic functions: d dx ( sinhx ) = cosh x d dx ( cosh x ) = sinhx d dx ( tanhx ) = sech2 x d dx ( coth x ) = − cosech2 x d dx ( sech x ) = − sech x tanhx d dx ( cosech x ) = − cosech x coth x. c©2020 School of Mathematics and Statistics, UNSW Sydney 10.4. HYPERBOLIC DERIVATIVES AND INTEGRALS 247 These derivatives can be established by using the definition of each function, as in Example 10.2.1. The derivatives of sinh, cosh and tanh are the most important to remember. Corresponding to these derivatives are the indefinite integrals∫ sinhx dx = coshx+ C, ∫ sech2 x dx = tanhx+ C and so on. Example 10.4.1. Calculate the derivative of the function f given by f(x) = ln(sech(2x)). Solution. First we have d dx ( sech 2x ) = −2 sech(2x) tanh(2x). Hence the chain rules gives f ′(x) = −2 sech(2x) tanh(2x) sech(2x) = −2 tanh(2x). Example 10.4.2. Find the following integrals: (a) ∫ 1 5 ln 3 0 sinh 5x dx (b) ∫ x coshx dx (c) ∫ sech2( √ x)√ x dx (d) ∫ ex sinhx dx. Solution. (a) We have ∫ 1 5 ln 3 0 sinh 5x dx = [ 1 5 cosh 5x ] 1 5 ln 3 0 = 110 [ e5x + e−5x ] 1 5 ln 3 0 = 110 ( eln 3 + e− ln 3 − e0 − e0 ) = 110 ( 3 + 13 − 1− 1 ) = 215 . c©2020 School of Mathematics and Statistics, UNSW Sydney 248 CHAPTER 10. THE HYPERBOLIC FUNCTIONS x y (a) sinh and sinh−1 x y 1 1 (b) cosh and cosh−1 x y −1 1 (c) tanh and tanh−1 Figure 10.5: Graphs of the hyperbolic functions (in gray) and their inverses (in black). 10.5 The inverse hyperbolic functions (Ref: SH10 §7.9) The functions sinh : R → R and tanh : R → (−1, 1) are both increasing on R and hence have well-defined inverses. However, cosh : R → [1,∞) is not one-to-one on R. To obtain an inverse, one must restrict the domain of cosh. The following definition gives the conventional restriction. Definition 10.5.1. The function cosh−1 : [1,∞) → [0,∞) is defined to be the inverse function of the restricted hyperbolic cosine function cosh : [0,∞)→ [1,∞). Graphs of the functions sinh−1, cosh−1 and tanh−1 are illustrated in Figure 10.5. Expressions involving the hyperbolic functions and their inverses can be simplified using the identities of Section 10.3. It is important to note that cosh−1(coshx) = x only when x ≥ 0. Example 10.5.2. Simplify (a) sinh ( cosh−1 43 ) , (b) cosh−1(cosh(−7)), and (c) cosh(2 sinh−1 3). Solution. (a) The identity cosh2 t− sinh2 t = 1 can be rearranged to give sinh2 t = cosh2 t− 1. By substituting cosh−1 43 for t, we obtain sinh2 ( cosh−1 43 ) = ( 4 3 )2 − 1 = 79 . c©2020 School of Mathematics and Statistics, UNSW Sydney 10.5. THE INVERSE HYPERBOLIC FUNCTIONS 249 We need to decide whether to take the positive or negative square root. Since cosh−1 is a nonneg- ative function, we have cosh−1 43 > 0 and hence sinh(cosh−1 43 ) > 0. Therefore sinh ( cosh−1 43 ) = + √ 7 3 . (b) We cannot write cosh−1(cosh(−7)) = −7 because cosh−1 is only the inverse function of cosh with domain restricted to [0,∞). However, since cosh is an even function, cosh−1(cosh(−7)) = cosh−1(cosh(7)) = 7. (c) We use the double-angle formula cosh(2t) = 1 + 2 sinh2 t for cosh: cosh(2 sinh−1 3) = 1 + 2 ( sinh(sinh−1 3) )2 = 1 + 2× 32 = 19. Each of the functions sinh, cosh and tanh can be expressed in terms of the exponential function. It is not surprising, then, that their inverses can be expressed in terms of the natural logarithm. Proposition 10.5.3. The following identities hold: sinh−1 x = ln(x+ √ x2 + 1) ∀x ∈ R, cosh−1 x = ln(x+ √ x2 − 1) ∀x ∈ [1,∞), tanh−1 x = 1 2 ln ( 1 + x 1− x ) ∀x ∈ (−1, 1). Proof. We prove the formula for sinh−1 only; the others can be proved similarly. Suppose that y = sinh−1 x. Then sinh y = x, which means that 1 2(e y − e−y) = x. If we multiply this equation through by 2ey then we obtain e2y − 1 = 2xey, which is a quadratic equation in ey. Further rearrangement gives (ey)2 − 2x(ey) = 1, c©2020 School of Mathematics and Statistics, UNSW Sydney 250 CHAPTER 10. THE HYPERBOLIC FUNCTIONS to which we complete the square to obtain (ey − x)2 = 1 + x2. Since ey > 0, we take the positive square root. Thus ey = x+ √ x2 + 1. (Note that this may also be obtained by using the quadratic formula rather than completing the square). Taking the natural logarithm of both sides of this equation gives y = ln(x+ √ x2 + 1) as required. By the inverse function theorem, the functions sinh−1 : R→ R, cosh−1 : (1,∞)→ (0,∞) and tanh−1 : (−1, 1)→ R are differentiable on their respective domains. A formula for each derivative is given in the next proposition. Proposition 10.5.4. The derivatives of sinh−1, cosh−1 and tanh−1 are given by d dx ( sinh−1 x ) = 1√ x2 + 1 , d dx ( cosh−1 x ) = 1√ x2 − 1 , d dx ( tanh−1 x ) = 1 1− x2 . Proof. We prove the formula for ddx(sinh −1 x) only; the others can be proved similarly. Suppose that y = sinh−1 x, Then sinh y = x and differentiating with respect to x gives cosh y dy dx = 1. Hence dy dx = 1 cosh y = 1 cosh(sinh−1 x) . (10.5) The identity cosh2 t− sinh2 t = 1 gives cosh t = √ 1 + sinh2 t (where we have taken the positive square root since cosh t > 0 for all t). Hence cosh(sinh−1 x) = √ 1 + x2. If we substitute this into (10.5) then we obtain d dx ( sinh−1 x ) = 1√ 1 + x2 as required. Remark 10.5.5. Students should be familiar with the proofs of Propositions 10.5.3 and 10.5.4. c©2020 School of Mathematics and Statistics, UNSW Sydney 10.6. INTEGRATION LEADING TO THE INVERSE HYPERBOLIC FUNCTIONS 251 10.6 Integration leading to the inverse hyperbolic functions When propositions 10.5.3 and 10.5.4 for sinh−1 are restated in terms of indefinite integration we obtain ∫ dx√ x2 + 1 = sinh−1 x+ C = ln(x+ √ x2 + 1) + C. A more general version may be obtained by observing that, when a > 0, d dx ( sinh−1 x a ) = 1 a 1√( x a )2 + 1 = 1√ x2 + a2 . In this manner, one is able to prove the formulae∫ dx√ x2 + a2 = sinh−1 x a + C = ln(x+ √ x2 + a2) + (C − ln a), a > 0, ∫ dx√ x2 − a2 = cosh −1 x a + C = ln(x+ √ x2 − a2) + (C − ln a), x ≥ a > 0, ∫ dx a2 − x2 = { 1 a tanh −1 x a + C, |x| < a 1 a coth −1 x a + C, |x| > a > 0 = 1 2a ln ∣∣∣∣a+ xa− x ∣∣∣∣+ C, x2 6= a2. These formulae are included in the table of standard integrals that is issued at the final examination. Example 10.6.1. Find (a) ∫ dx√ 1 + 9x2 (b) ∫ dx√ x2 − 2x+ 10. Solution. (a) A little algebraic manipulation shows that 1√ 1 + 9x2 = 1 3 √ (1/3)2 + x2 . Hence ∫ dx√ 1 + 9x2 = 1 3 ∫ dx√ (1/3)2 + x2 = 1 3 sinh−1 x 1/3 + C = 1 3 sinh−1(3x) + C. c©2020 School of Mathematics and Statistics, UNSW Sydney 252 CHAPTER 10. THE HYPERBOLIC FUNCTIONS Alternatively, one could approach this problem by using the substitution u = 3x. (b) The technique we apply to this example is known as ‘completing the square’: x2 − 2x+ 10 = x2 − 2x+ 1 + 9 = (x− 1)2 + 32 = u2 + 32, where u = x− 1. Hence ∫ dx√ x2 − 2x+ 10 = ∫ du√ u2 + 32 = sinh−1 u 3 + C = sinh−1 x− 1 3 + C. 10.7 A summary of important hyperbolic formulae A large number of formulae was introduced in this chapter. Below is a list of formulae, with the more important appearing earlier in the list, that students are expected to know. • The definitions of sinh and cosh: sinhx = 12(e x − e−x) coshx = 12 (ex + e−x) • The derivatives of sin and cosh: d dx (sinhx) = cosh x d dx (cosh x) = sinhx • The definition of tanh: tanhx = sinhx coshx • The hyperbolic identity cosh2 x− sinh2 x = 1 • The derivative of sinh−1: d dx (sinh−1 x) = 1√ 1 + x2 • The derivative of tanh: d dx (tanh x) = sech2 x. Other formulae need not be memorised, though students will do well to note the following points. • Derivatives for the inverse hyperbolic functions can be read from the table of standard inte- grals. • Identities which express the inverse hyperbolic functions in terms of the natural logarithm (see Proposition 10.5.3) can be read from the table of standard integrals. c©2020 School of Mathematics and Statistics, UNSW Sydney 10.8. APPENDIX: THE CATENARY 253 • The definition of coth, sech and cosech are in exact analogy with the definition of cot, sec and cosec. • The hyperbolic identities listed in Section 10.3 are easily remembered from their trigonometric counterparts (see Remark 10.3.1). • It is not expected that students will memorise the derivatives of coth, sech and cosech. How- ever, they can be easily derived from first principles provided that the derivatives of sinh and cosh are known. A table of standard integrals will be issued at the final examination. 10.8 Appendix: The catenary We end this chapter showing that the cosh function describes the curve of a suspended cable. Suppose that a cable of uniform mass is suspended from two fixed points. To describe the equation of the resulting curve in terms of x and y, we fix a coordinate system such that vertex of the curve passes through the point A(0, y0) as in Figure 10.6. (The ordinate y0 can be any real number; its choice only fixes the coordinate system and does not alter the shape of the catenary.) Consider a point P (x, y) that lies on the curve and suppose that • T is the tension in the cable at P (x, y), • H is the tension in the cable at A(0, y0), • ℓ(x) is the length of the cable from A(0, y0) to P (x, y), and • w is the weight per unit length of the cable. (see Figure 10.6). Thus the gravitational force acting on the cable along the segment AP is given by w ℓ(x). We note that the quantities T , θ and ℓ depend on x but that w and H do not. We also make the natural assumption that the catenary is described by a differentiable function. Theorem 10.8.1. There is a choice of coordinate system such that the catenary in Figure 10.6 is described by the equation y = H w cosh (wx H ) . To prove the theorem, it suffices to show that if the catenary is described by y = f(x), where f is a differentiable function, then f(x) = H w cosh (wx H ) for a particular choice of y0. We begin with a general formula that relates a function f to its arc length function ℓ by a differential equation. Lemma 10.8.2. Suppose that f is a differentiable function. Then the arc length function ℓ is also differentiable and ℓ′(x) = √ 1 + ( f ′(x) )2 (10.6) c©2020 School of Mathematics and Statistics, UNSW Sydney 254 CHAPTER 10. THE HYPERBOLIC FUNCTIONS x y b b H w ℓ(x) T T cos θ T sin θ θP A suspended cable Figure 10.6: Forces acting on a suspended cable. Sketch proof. Suppose that h is a small nonzero real number and consider the diagram below. t f(t) b b ( x, f(x) ) ( x+ h, f(x+ h) ) h f(x+ h)− f(x)ℓ(x+ h)− ℓ(x) By Pythagoras’ theorem, ℓ(x+ h)− ℓ(x) ≈ √ h2 + ( f(x+ h)− f(x))2. If we divide both sides by h then ℓ(x+ h)− ℓ(x) h ≈ √ 1 + ( f(x+ h)− f(x) h )2 . As h→ 0 the approximation gets better and we obtain ℓ′(x) = √ 1 + ( f ′(x) )2 as required. c©2020 School of Mathematics and Statistics, UNSW Sydney 10.8. APPENDIX: THE CATENARY 255 The next lemma uses facts specific to the catenary. Lemma 10.8.3. If the equation of the catenary in Figure 10.6 is given by y = f(x), then f satisfies the following conditions: f(0) = y0, (10.7) f ′(0) = 0, (10.8) f ′′(x) = w H √ 1 + ( f ′(x) )2 . (10.9) Proof. The first two conditions are a result of our choice of coordinate system and the assumption that f is differentiable. It remains to establish the third condition. Since the cable is stationary, the forces w ℓ(x) and T sin θ that act vertically cancel each other out. That is, T sin θ = w ℓ(x). Similarly, the horizontal forces H and T cos θ also cancel each other out and so T cos θ = H. Using these equations, and the fact that f ′(x) = tan θ at the point P (x, y), gives f ′(x) = tan θ = T sin θ T cos θ = w ℓ(x) H . So far we have f ′(x) = w ℓ(x) H . Since w and H are constants, differentiating this equation with respect to x gives f ′′(x) = w H ℓ′(x). Therefore f ′′(x) = w H √ 1 + ( f ′(x) )2 . by equation (10.6). Proof of Theorem 10.8.1. It remains to find all functions f that satisfy the conditions of Lemma 10.8.3. If we let u denote f ′(x) and b denote w/H, then equation (10.9) becomes du dx = b √ 1 + u2. Hence dx du = 1 b √ 1 + u2 . Every solution to this differential equation is of the form x = 1 b sinh−1 u+C1. c©2020 School of Mathematics and Statistics, UNSW Sydney 256 CHAPTER 10. THE HYPERBOLIC FUNCTIONS Hence we have f ′(x) = u = sinh(bx− bC1). Enforcing condition (10.8) shows that either b = 0 or C1 = 0. Now if b = 0 then f ′(x) = 0 and f is simply a constant function (that is, the cable is pulled tight and exhibits no sag). On the other hand, if b 6= 0 then C1 = 0 and hence f ′(x) = sinh(bx). Therefore f(x) = cosh(bx) b + C2 and enforcing the condition (10.7) yields C2 = 1 b − y0. If we choose y0 to be 1/b then C2 = 0 and the function f that describes the catenary is given by f(x) = cosh(bx) b = H w cosh (wx H ) , proving the theorem. 10.9 Maple notes Maple knows about the hyperbolic functions sinh, cosh, tanh, csch, sech, and coth, and inverse hyperbolic functions arcsinh, arccosh, arctanh, arccsch, arcsech, and arccoth. For example, > int(arctanh(x),x); x arctanh (x) + 1 2 ln ( 1− x2) > # which is closely analogous to > int(arctan(x),x); x arctan (x)− 1 2 ln ( 1 + x2 ) > # BUT the following result does not follow the analogy between trig and hyperbolic functions > int(sech(x),x); arctan (sinh (x)) > plot(arccsch(x),x=-6..6, y=-6..6, discont=true, numpoints=100); c©2020 School of Mathematics and Statistics, UNSW Sydney 10.9. MAPLE NOTES 257 c©2020 School of Mathematics and Statistics, UNSW Sydney 258 CHAPTER 10. THE HYPERBOLIC FUNCTIONS Problems for Chapter 10 Problems 10.1 : Hyperbolic sine and cosine functions and 10.2 : Other hyperbolic functions 1. [R] [V] Define sinhx and coshx. Hence show that a) d dx (cosh 6x) = 6 sinh 6x; b) ln(sinh x) < x− ln 2 whenever x > 0. 2. [R] By expressing the following hyperbolic functions in terms of sinhx and coshx, find the derivative of each function f given below. a) f(x) = tanhx b) f(x) = sech x c) f(x) = coth x 3. [R] In each case, find f ′(x). a) f(x) = sinh(3x2) b) f(x) = cosh( 1x) c) f(x) = sinh(ln x) Problems 10.3 : Hyperbolic identities 4. [R] a) Given the formula sinh(A+B) = sinhA coshB+coshA sinhB, find a formula for sinh 2x. By differentiation or otherwise, find a formula for cosh 2x. b) [H] Using the results of part (a), express sinh 3x as a cubic polynomial in sinhx. Hence, or otherwise, find ∫ sinh3 x dx. 5. [R] Show that coshx+ sinhx = ex. Deduce that (cosh x+ sinhx)n = coshnx+ sinhnx. 6. [H] [V] Consider the hyperbola x2 − y2 = 1, where x ≥ 1. A(t) x y (cosh t, sinh t) 0 1 c©2020 School of Mathematics and Statistics, UNSW Sydney PROBLEMS FOR CHAPTER 10 259 a) Using the definitions of cosh and sinh, prove that, for every real number t, the point (cosh t, sinh t) lies on the hyperbola. b) When t > 0, let A(t) denote the shaded region in the diagram. Explain why A(t) = 1 2 cosh t sinh t− ∫ cosh t 1 √ x2 − 1 dx. c) By first calculating A′(t), prove that A(t) = t 2 . Problems 10.4 : Hyperbolic derivatives and integrals 7. [R] Evaluate the following integrals. a) ∫ cosh(4x) dx b) ∫ 1 3 ln 2 0 sinh 3x dx c) ∫ cosh2 x dx d) ∫ sinh( √ x)√ x dx Problems 10.5 : The inverse hyperbolic functions 8. [R] Simplify cosh(sinh−1(3/4)), cosh−1(cosh(−3)) and sinh(tanh−1(5/13)). 9. [R] Show that a) d dx ( cosh−1 x ) = 1√ x2 − 1 , for x > 1 b) d dx ( tanh−1 x ) = 1 1− x2 . 10. [R] Show that a) cosh−1 x = ln(x+ √ x2 − 1) ∀x ∈ [1,∞) b) [V] tanh−1 x = 1 2 ln ( 1 + x 1− x ) ∀x ∈ (−1, 1). 11. [R] Find dy dx if a) y = sinh−1(2x) b) y = tanh−1(1/x) c) y = cosh−1(sec x) whenever 0 < x < π/2. Problems 10.6 : Integration leading to the inverse hyperbolic functions 12. [R] Find a) [V] ∫ dx√ 1 + 4x2 b) ∫ 1/2 0 dx 1− x2 c) ∫ dx√ x2 + 4x+ 13 . c©2020 School of Mathematics and Statistics, UNSW Sydney 260 CHAPTER 10. THE HYPERBOLIC FUNCTIONS c©2020 School of Mathematics and Statistics, UNSW Sydney 261 Answers to selected problems Chapter 1 1. a) The set of integers between −π and π. c) The empty set. 3. Answer for both: the interior and boundary of the triangle with vertices at (0, 0), (2, 0) and (2, 4). 4. a) x < 0 or x > 1 b) 1 < x < 2 c) x < −2 or x > 0 d) −1 < x < 1 e) −2 ≤ x < 1 or x ≥ 3 5. a) −4 < x < 2 b) x < −5 or x > 1 c) −1 < x < −1/3 d) x > 0 6. c) From (a) we have x2 + 1 x2 ≥ 2 with equality if and only if x = ±1. 7. a) F b) F c) T d) T e) F 8. Hint: (x2 + y2)2 ≥ 4x2y2. 10. a) −√5 ≤ x ≤ √5; 0 ≤ y ≤ √5 b) x ≤ −√5 or x ≥ √5; y ≥ 0 c) x 6= 8; y 6= 0 d) [1,∞); [0,∞) e) (1, ∞); (0, ∞) f) {x ∈ R : 2nπ ≤ x ≤ (2n+ 1)π; n ∈ Z}; [0, 1] g) The union of the intervals [− 7pi6 + 2kπ, pi6 + 2kπ] where k ∈ Z; 0 ≤ y ≤ √ 3 h) {x ∈ R : x 6= (2n+ 1)π/2, n an integer}; [1, ∞) i) R; [−1, ∞) 11. a) 22 b) x2 + 10x+ 22 c) 6 d) x2 + 2 12. a) x− 1 + 1/√x− 1 b) √x− 1 c) (x − 1)3/2 d) (1/√x− 1)− 1 16. [4, 13] 17. x = 1, 7 c©2020 School of Mathematics and Statistics, UNSW Sydney 262 CHAPTER 3 18. a) If p(x) = a0+a1x+ · · ·+anxn then p(q(x)) = a0+a1q(x)+a2(q(x))2+ · · ·+an(q(x))n. Products and sums of polynomials are again polynomials. b) Yes. Chapter 2 1. a) 1 b) 2 c) 0 d) Doesn’t exist (→∞). e) 5 f) Doesn’t exist. 2. a) 0 b) 0 4. b) 0 5. a) 1 b) M = 10 (best possible) c) M = 1/ √ ǫ will do. 6. a) 4 b) 0 c) 0 d) 0 e) 0 7. a) 50 metres per second b) 5 ln 50 ≈ 19.56 seconds after leaving the plane. 8. a) 10 b) 4 c) 3 d) −1/9 9. a) −1 b) 1 c) No 10. a) Doesn’t exist. b) Doesn’t exist. c) Doesn’t exist. d) Doesn’t exist. 11. a) 0 b) 0 12. a) |CB| = θ, |CA| = sin θ, |DB| = tan θ. 13. Neither the left-hand nor right-hand limits exist due to wild oscillatory behaviour. Chapter 3 1. b) Yes 2. a) Continuous everywhere. b) Continuous everywhere except at π/2. 3. k = 8 5. Use the intermediate value theorem. 8. a) Yes b) Yes c) No d) Yes c©2020 School of Mathematics and Statistics, UNSW Sydney ANSWERS 263 Chapter 4 2. a) 5(4x3 + 21x6) b) (4x3−2)(4x2+2x+4)+(x4−2x)(8x+2) c) (16y − y4)/(y3 + 8)2 d) (2x2 − 4)/(x2 − 4)1/2 e) −4/(t2 − 4)3/2 f) 3 cos 3y + 12 cos 2y sin 2y g) (4x3 − x4)e−x h) x ln(x3 + 1) + 3x2(x2 + 1)/2(x3 + 1) i) sec2 x j) − tanx 3. a) 0 b) 0 c) f ′(0) = 0 4. a) i) x 6= 0 ii) all x b) i) all x ii) all x c) i) x 6= −2 ii) x 6= −2 6. a) x+ 17π + cos 2x b) 1− 2 sin 2x c) 2−x2+cos 2(2−x2) d) 1− 2 sin 2(2− x2) e) −2x(1− 2 sin 2(2− x2)) 7. a) dy dx = 3x2 − y x− 3y2 b) dy dx = (y − 4x√xy)/(4y√xy − x) 9. y = 2 10. a) (i) b = 0 (ii) a = 1, b = 0 b) (i) b = 1 (ii) a = 2, b = 1. 11. a = 1, b = 0 12. a) f(8.01) ≈ f(8) = 2 b) i) y = (x− 8)/12 + 2 ii) f(8.01) ≈ (8.01− 8)/12 + 2 = 2 + 11200 c) The approximation in (b) is much better. 13. √ 3 ar/2 14. 7/8 15. a) 18pi b) 32000pi81 cm 3 16. a) dhdt = 2 125pi when h = 50. Chapter 5 1. √ 7 3 b) 1 2 5. b) 0 7. a) By the Mean Value Theorem, for some c with 16 < c < 17, √ 17−√16 = 1 2 √ c < 1 2 √ 16 = 0.125. b) 0.008 c©2020 School of Mathematics and Statistics, UNSW Sydney 264 CHAPTER 6 c) 2× 10−6. 8. −1, 1 and 4 are stationary points; 4 is a local minimum point;−1 is a local maximum point. 9. a) 11,−61 b) 3,−253 c) 27/256,−750 d) 250,−54 e) 2, 0 10. (12/13, 18/13) 11. 12. a) (400)/(4 + π), 100π/(4 + π) b) 0, 100 14. Three real zeros 15. a) f(t) = − cos t+ t2/2 + 3 b) No 16. a) 0 b) 8/3 17. a) 13 b) m n c) −1 d) − 12 e) 14 f) 13 18. a) → 0 b) →∞ c) → 0 d) → 1 e) → 1 f) → 32 19. Combine the two fractions and apply l’Hoˆpital twice only. You will need to simplify the quotient obtained after the first application of l’Hoˆpital. Maple can confirm your answer. 20. (a, b) = (−√2,√2) or (√2,−√2) 23. a) −1/2 b) a = −1/2, b = 1 24. c) a = b = 0 Chapter 6 2. a) f−1(x) = 1 3 (x− 1) b) g−1(x) = −√x− 1, Dom(g−1) = [1,∞), Range(g−1) = (−∞, 0], (g−1)′(x) = −1 2 √ x−1 4. b) 1/3 5. b) The restriction of f to (−∞,−1] has an inverse with domain (−∞, 3], the restriction of f to [−1, 1] has an inverse with domain [−1, 3], and the restriction of f to [1,∞) has an inverse with domain [−1,∞). 6. a) No b) Yes c©2020 School of Mathematics and Statistics, UNSW Sydney ANSWERS 265 7. a) The graph is symmetric about x = − 12 , which surely gives a local maximum of f(x). There will be four (maximal) intervals where f will have an inverse. Try this exercise on Maple. The commands plot, diff and solve should suffice. b) f is one-to-one; f−1(x) = x1/17 − 1 is not differentiable when x = 0. c) I can be one of four intervals. 8. a) π/3 b) 2/5 c) −π/3 d) π/3 e) 4/5 f) 3/ √ 34 g) π/3 h) π − x 11. a) −2/√1− 4x2 b) 1/(2√x− x2 ) c) 2/(4x2 − 12x+ 10) 12. Differentiate; −1 ≤ x ≤ 1; π/2. 13. b) f(x) = π/2 when x > 0 and f(x) = −π/2 when x < 0. 14. b) The derivative of the inverse is −1/x√x2 − 1 when x > 1. 15. a = π/2, b = 0 16. a) 24π km/min b) 104π/3 km/min 17. √ 48 metres Chapter 7 1. [−1, 5], [0, 3], upper half of circle. 2. a) period 2π/3, odd b) period 3π, neither c) not periodic, even d) period π/3, odd e) period π, even f) 2π 3. odd, even, neither, odd, odd, even. 4. The asymptotes are a) x = 3, y = x+ 2 b) x = −1, y = x− 1 c) x = −3, x = 2, y = x− 1. 6. a) x ≥ 3, − 13 < x ≤ 13 b) x = − 13 , x = −3, y = 1 c) (1,− 14 ), (−1,−4) d) Domain: x 6= 3,− 13 , Range: (−∞,−4], [− 14 ,∞). 7. a) x2 16 + y2 25 = 1, ellipse b) x2 9 − y 2 4 = 1, hyperbola c) y = x2/3 d) spiral 8. a) ii) (2, 0) iii) −1 b) ii) (5, 0), (−1, 0) iii) 4t3/3 c) ii) (1, 0), (−1, 0) iii) − cot t 9. a) 3x− 27y + 52 = 0 b) 19 c©2020 School of Mathematics and Statistics, UNSW Sydney 266 CHAPTER 8 10. b) Hint: the length of one particular arc of the larger circle equals the length of one arc on the smaller circle. d) x2/3 + y2/3 = 1 11. a) p(t) = a + t(b − a),p(0) = a,p(1) = b b) y = 4 − x,q(1/2) is the coodinate vector for the midpoint of B and C c) p0(t) = (1 − t)2, p1(t) = 2t(1− t), p2(t) = t2 12. a) (3, 0) b) (−3√3, −3) c) (√2,−√2) 13. a) (3, π) b) ( √ 2,−3π/4) c) (4, 2π/3) d) (1, π/2) e) (4, 5π/6) f) (4,−5π/6) 14. a) Circle, centre (0,0), radius 4 b) A ray in the second quadrant c) A spiral of Archimedes 15. a) Circle, centre (0,3), radius 3 b) Circle, centre (1,0), radius 1 16. The following sketches are a guide to shape only. a) x y b) x y c) x y d) x y e) x y f) x y 18. (x− 2)2 9 + y2 5 = 1 Chapter 8 1. a) i) SPn(f) = SPn(f) = 1 ii) SPn(f) = 1 2 ( 1− 1n ) , SPn(f) = 1 2 ( 1 + 1n ) iii) SPn(f) = 1 6 ( 1− 1n ) ( 2− 1n ) , SPn(f) = 1 6 ( 1 + 1n ) ( 2 + 1n ) v) SPn(f) = 1, SPn(f) = 0 b) i) 1 ii) 12 iii) 1 3 iv) 1 4 (v) Not Riemann integrable 2. a) √ 1365 = 36.95 b) √ 1690.9 = 41.12 and the lower bound is √ 1078.9 = 32.85 3. 4.5 4. a) 82.4 b) 10 c©2020 School of Mathematics and Statistics, UNSW Sydney ANSWERS 267 5. f(x) = 1 x2 + x+ 1 6. 1x is not differentiable on all of [−1, 1] so the FTC doesn’t apply. 7. a) Draw a picture! b) 5π/12−√3/2 9. F is continuous everywhere, but not differentiable at the integers. 11. a) sinx2 b) 3x2 sinx6 c) −3x2 sinx6 d) 3x2 sinx6 − sinx2 12. −(5− 4x)5 13. biii) pi6 . 14. a) 12 e x2 + C b) −2 cos√x+ C c) 15/4 d) 4 √ 2 a9/2 / 9 e) 1/4 f) (2 √ 2− 1)/3 15. a) 2 √ x− 2 ln(1 +√x) + C b) 125 ( 1 21 (5x− 1)21 + 120 (5x− 1)20 ) + C c) x/(x + 1)2 + C d) 4− 10 ln(7/5) 16. a) 4e 5+1 25 b) x2 sinx+ 2x cosx− 2 sinx+ C c) x(ln(x) − 1) + C d) pi12 + √ 3 2 − 1 e) 7e 8+1 64 f) pi2 g) e x 2 (cos x+ sinx) h) x tan −1 x− ln√1 + x2 + C i) √ 2 2 + 1 2 ln(1 + √ 2) 17. a) 1/5 b) diverges c) π/4 d) 0 e) 2 f) diverges 19. a) 0 b) ln 2 c) No 20. a) convergent b) divergent c) divergent 21. a) convergent b) divergent c) convergent 22. a) convergent b) divergent c) convergent 23. s < 0 24. p > 1 25. The integral converges whenever 2a− b > 1. 26. a) 4, 8, 2 c) Li′(x) = 1ln x > 0 so Li is an increasing function; Li(2) = 0. d) Li(106) ≥ 106−26 ln 10 . e) pi(10 6) x ' 0.07238. 27. a) 2√ pi e−x 2 c©2020 School of Mathematics and Statistics, UNSW Sydney 268 CHAPTER 10 d) (i) 0.749 < erf(1) < 0.928 (iii) 1/e (iv) 1.344 Chapter 9 2. a) A partition into 7 equal parts will suffice 3. a) 3x2/2(x3 + 1) b) ex for x > 0, −e−x for x < 0 c) 1(ln(ln x))(ln x)x d) 5x 4 (where x > −61/5) 4. a) 12 ln(1 + e 2x) b) −e1/x c) 3x/ ln 3 d) e √ x 4 e) (ln x) 2 2 f) ln | sinx| 7. b) e 8. a) 3 x ln 3 b) ( x3 − 3 x2 + 1 )1/5( 3x2 5(x3 − 3) − 2x 5(1 + x2) ) c) (sinx)sin x cosx (1+ln(sinx)) d) cos(xsin x)xsin x ( cosx lnx+ sinx x ) 9. a) 0 b) 0 c) 1 d) e2 e) 1 f) 1 g) ea h) 0 i) 0 Chapter 10 2. a) sech2 x b) −sechx tanhx c) −cosech2x 3. a) 6x cosh(3x2) b) − sinh(1/x)x2 c) 1 2 + 1 2x2 4. a) sinh 2x = 2 coshx sinhx ; cosh 2x = cosh2 x+ sinh2 x b) 14 ( 1 3 cosh 3x− 3 coshx) or 13 cosh3 x− coshx 7. a) sinh 4x4 b) 1 12 c) (2x+ sinh 2x)/4 d) 2 cosh √ x 8. 5/4, 3, 5/12 11. a) 2/ √ 1 + 4x2 b) 11−x2 for |x| > 1 c) secx 12. a) 12 sinh −1 2x b) tanh−1 12 = 1 2 ln 3 c) sinh −1 (x+2 3 ) c©2020 School of Mathematics and Statistics, UNSW Sydney 269 Past class tests The following selection of past MATH1131 class tests can be used as a guide to the degree of difficulty of calculus class tests in this course. However, due to variations in timing and frequency, the material examined in each class test differs from the tests given here. Thus students must consult the Information booklet for MA1131, or page xiii of these notes, to ascertain the precise topics that may be examined in each calculus class test. c©2020 School of Mathematics and Statistics, UNSW Sydney UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131 Mathematics 1A Calculus S1 2008 TEST 1 VERSION 5a This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) Solve |2− 3x| ≤ 1. 2. (2 marks) Find the (maximal) domain and the range of the function f(x) = 1√ 3− x. 3. (2 marks) Sketch the graph y = x2 − 3x− 10, and hence sketch the graph y = 1 x2 − 3x− 10. 4. (2 marks) For f(x) = |x2 − 9| x− 3 and a = 3, discuss the limiting behaviour of f(x) as x→ a +, as x→ a− and as x→ a. 5. (2 marks) Determine the limiting behaviour of f(x) = 2x+ 3x2 + e−x 2x2 + cos x as x→∞. UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131 Mathematics 1A Calculus S2 2008 TEST 1 VERSION 2b This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) Let f(x) = x2 + 4, and g(x) = 1√ x+ 1 . Give the explicit forms of (f ◦ g)(x) and (g ◦ f)(x). 2. (2 marks) Find the limiting value of f(x) = x2 − 5x+ 6 2x2 − 5x+ 2 as x tends to 2. 3. (2 marks) For f(x) = |x2 − 4x+ 3| x− 1 and a = 1, discuss the limiting behaviour of f(x) as x → a +, as x→ a− and as x→ a. 4. (2 marks) Solve |2− 3x| ≤ 1. 5. (2 marks) Find the (maximal) domain and the range of the function f(x) = ln(x2 − 5). UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131/1141 Calculus S1 2009 TEST 1 VERSION 8a This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) Sketch the graph y = √ x+ 2, and hence sketch the graph y = 1√ x+ 2 . 2. (2 marks) Solve |3x+ 2| ≥ 1. 3. (2 marks) Find the (maximal) domain and range of the function f(x) = 1√ 9− x2 . 4. (2 marks) Determine the limiting behaviour of f(x) = e−x + 3x2 − 2 4x2 + 3x+ sinx as x→∞. 5. (2 marks) For f(x) = x− 2 |x2 − 4| and a = 2, discuss the limiting behaviour of f(x) as x→ a +, as x→ a− and as x→ a. UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131 Mathematics 1A Calculus S1 2009 TEST 1 VERSION 6a This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) Solve 1 x+ 1 ≤ −1 2 . 2. (2 marks) Find the (maximal) domain and the range of the function f(x) = √ 2− e−x. 3. (2 marks) Let f(x) = 3x+ 4, and g(x) = 1√ x− 2. Give the explicit forms of (f ◦ g)(x) and (g ◦ f)(x). 4. (2 marks) Find the limiting value of f(x) = 2x2 − x− 6 3x2 − 2x− 8 as x tends to 2. 5. (2 marks) For f(x) = |x2 + 3x− 18| x− 3 and a = 3, discuss the limiting behaviour of f(x) as x→ a +, as x→ a− and as x→ a. UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131 Mathematics 1A Calculus S2 2009 TEST 1 VERSION 1a This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) Sketch the set of points in the (x, y) plane satisfying 0 < x < 3y and 0 < y < 2. 2. (2 marks) Solve ∣∣∣∣3x+ 12 ∣∣∣∣ ≤ 2. 3. (2 marks) Find the (maximal) domain and the range of the function f(x) = 1√ x− 1 . 4. (2 marks) For f(x) = |x2 − 4x+ 3| 3− x and a = 3, discuss the limiting behaviour of f(x) as x → a +, as x→ a− and as x→ a. 5. (2 marks) Consider the function f(x) = − cos x on the interval (−π/2, π/2). Determine whether f attains a maximum value on the interval. Give reasons for your answer. UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131/MATH1141 Calculus S1 2010 TEST 1 VERSION 7b This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) Sketch the graph y = √ x− 1, and hence sketch the graph y = 1√ x− 1. 2. (2 marks) For f(x) = |x2 + x− 2| x− 1 and a = 1, discuss the limiting behaviour of f(x) as x → a +, as x→ a− and as x→ a. 3. (2 marks) Let p(x) = x3− 3x2− 4x+2. Use the Intermediate Value Theorem to show that p has a root between −2 and 0. 4. (2 marks) Solve 1 x+ 1 > −1 2 . 5. (2 marks) Find the (maximal) domain and the range of the function f(x) = √ 3 + x. UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131 Mathematics 1A Calculus S1 2008 TEST 2 VERSION 8a This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) Show that the function f given by f(x) = x3−2x2−3x+3 has a zero in each of the intervals [0, 1] and [2, 3]. 2. (2 marks) Using the definition of the derivative, show that if f(x) = −2x2 + x then f ′(x) = −4x+ 1. 3. (2 marks) The length L of a rectangle is decreasing at the rate of 2 cm per second, and the width W is increasing at the rate of 4 cm per second. Find the rate of change of the area when L=13 cm and W=10 cm. 4. (2 marks) State the Mean Value Theorem. Find a point which satisfies the conclusions of the Mean Value Theorem for f(x) = x3 − 2x2 + 5 on the interval [0, 2]. 5. (2 marks) Find lim x→0 x sinx 1− cos x. UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131 Mathematics 1A Calculus S2 2008 TEST 2 VERSION 4b This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) State the Mean Value Theorem and find a point which satisfies the conclusions of the Mean Value Theorem for f(x) = √ x− 1 on the interval [1, 3]. 2. (2 marks) Find lim x→0 1− cos 3x x2 . 3. (2 marks) Show that the function f given by f(x) = x3−3x2−2x+5 has a zero in each of the intervals [−2,−1] and [1, 2]. 4. (2 marks) Using the definition of the derivative, show that if f(x) = −x3 then f ′(x) = −3x2. 5. (2 marks) Find the equation of the line tangent to x2 + y3 − x2y = 1 at (1, 1). UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131 Mathematics 1A Calculus S1 2009 TEST 2 VERSION 8b This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) Carefully state the Mean Value Theorem. Find a point which satisfies the conclusions of the Mean Value Theorem for f(x) = x3 − x2 + 3 on the interval [0, 1]. 2. (2 marks) Find lim x→1 3x3 − 5x2 + x+ 1 x2 − 2x+ 1 . 3. (2 marks) Determine the values of x at which the function f(x) = { x3 for x < 1 (x− 1)3 + 2 for x ≥ 1 is continuous. Give reasons for your answer. 4. (2 marks) Using the definition of the derivative, show that if f(x) = −x3 then f ′(x) = −3x2. UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131/1141 Calculus S1 2009 TEST 2 VERSION 1a This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) The function f(x) = x− 2 x2 − 3x+ 2 is not defined for x = 2. Find a value to be given to f(2) that will make f continuous at 2. 2. (3 marks) Determine all real values of a and b such that the function f(x) = ax+ b for x ≤ 1tan πx 4 for 1 < x < 2 is differentiable at x = 1. 3. (3 marks) Let f(x) = |x2 − 2x|. (i) Giving reasons for your answer, find all critical points of f on the interval [0, 5]. (ii) Find the absolute maximum and absolute minimum values of f(x) on the given interval. 4. (2 marks) Find lim x→1 2x4 − 3x3 + x (x− 1)2 . UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131 Mathematics 1A Calculus S2 2009 TEST 2 VERSION 3a This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (2 marks) Find the equation of the line tangent to x+ lnx = y + 2 ln y at (1, 1). 2. (2 marks) A ladder of length 3 metres is leaning against a vertical wall. The foot of the ladder is pulled away from the wall at the rate of 0.5 metres per second. How fast is the top of the ladder moving down the wall when the foot is 1 metre away from the wall? (Leave your answer in surds.) 3. (2 marks) Carefully state the Mean Value Theorem and find a point which satisfies the conclusions of the Mean Value Theorem for f(x) = √ x− 1 on the interval [1, 3]. 4. (3 marks) Let f(x) = (x− 1)2/3. (i) Giving reasons for your answer, find all critical points of f on the interval [0, 2]. (ii) Find the absolute maximum and absolute minimum values of f(x) on tthe given interval. 5. (1 mark) Find lim x→0 tanx x . UNIVERSITY OF NEW SOUTH WALES SCHOOL OF MATHEMATICS AND STATISTICS MATH1131/MATH1141 Calculus S1 2010 TEST 1 VERSION 2a This sheet must be filled in and stapled to the front of your answers Student’s Family Name Initials Student Number Tutorial Code Tutor’s Name Mark Note: The use of a calculator is NOT permitted in this test QUESTIONS (Time allowed: 20 minutes) 1. (3 marks) Determine all real values of a and b such that the function f(x) = { aex + b for x < 0, sinx for x ≥ 0 is differentiable at x = 0. 2. (1 mark) Find lim x→0 tanx e3x − 1 . 3. (2 marks) Find the equation of the line tangent to x3 + y3 − x− y2 = 0 at (1, 1). 4. (3 marks) Determine how many real numbers satisfy the equation x3−6x2+1 = 0. Give reasons for your answer, naming any theorems you use. (Hint: find the stationary points of the polynomial function.) 5. (1 mark) Differentiate tan−1(4x+ 1). 282 c©2020 School of Mathematics and Statistics, UNSW Sydney 283 Index absolute maximum point, 69 absolute maximum value, 69 absolute minimum point, 69 absolute minimum value, 69 absolute value, 17 antiderivative, 114 antidifferentiation, 114, 193 area, 172 absolute, 183 signed, 183 under the graph of f , 179 unsigned, 183 astroid, 168 asymptote, 144 oblique, 144 vertical, 144 Barrow, Isaac, 171 Bernoulli, Jakob, 155, 239 Bernoulli, Johann, 155, 239 Bolzano, 47 bounded function, 71 brachistochrone, 155 Cartesian coordinate, 157 catenary, 239, 253 chain rule, 82, 90 change of variables formula, 197 circle, 31, 152 closed interval, 13 codomain, 19 composition (of functions), 22 conic sections, 31 contains, 14 continuity, 53, 87 and differentiability, 87 at an end point, 66 everywhere, 54 on an interval, 65, 66 continuous, 86 continuous functions, 32 and limits, 53 combination of, 63 piecewise, 181 critical point, 108 cubic, 24 curve of fastest descent, 155 cusp point, 164 cycloid, 155 decreasing function, 103 definite integral, see integral(s), definite derivative(s), 77, 79 definition of, 79 difference of squares identities, 245 difference quotient, 80 differentiability, 87 and continuity, 87 local extrema, 91 differentiable, 86 at a point, 79 local approximation, 89 maximum point, 91 mean value theorem, 97 minimum point, 91 twice, 105 differential calculus, 11 differential equation, 113 differentiation, 81 chain rule, 82 product rule, 84 quotient rule, 82 discontinuity, 33 distance, 18 domain, 19 maximal, 20 c©2020 School of Mathematics and Statistics, UNSW Sydney 284 natural, 20 dominant term analysis, 206, 210 double-angle formulae, 245 dummy variable, 186 e, 227 elementary functions, 29 elements (of set), 12 ellipse, 31, 152 endpoints critical points, 108 endpoints of intervals, 13 equations solutions, 110 erf, 221 error bounds, 102 error function, 221 even function, 144 exp, 228 exponential function, 231 fastest growing term, 42 function(s), 11, 19 composition of, 22, 82 decreasing, 103 error function, 221 even, 144 exponential, 230 implicitly defined, 31 increasing, 103 inverse function, 127 limit at infinity, 46 local approximation, 89 natural logarithm, 225 odd, 144 quadratic, 24 range of, 21 rational, 25 real-valued, 20 sine integral, 212 value of, 20 fundamental theorem of calculus, 11, 212 first fundamental theorem, 188 second fundamental theorem, 190 Galileo, 155, 239 horizontal line test, 126 horizontal point of inflexion, 104 Huygens, Christiaan, 239 hyperbola, 31, 152, 242 hyperbolic functions, 243 double-angle formulae, 245 hyperbolic cosine, 240 hyperbolic sine, 240 sum and difference formulae, 245 improper integral p-integrals, 205 convergence of, 203 indeterminate form(s), 43, 115, 233, 234 inequalities, 14 and absolute value, 18 triangle inequality, 18 inequality, 14 infinity, symbol for, 13 injective, 125 integers, 12 integral calculus, 11, 171 integral(s), 179 definite limits of, 179 improper p-integrals, 202 comparison test, 206 convergence of, 202 limit comparison test, 210 indefinite, 194 over (a,∞), 203 Riemann sum, 176, 179 table of, 253 integrand, 179 integration, 177 and differentiation, 193 by parts, 200 by substitution, 196 constant of, 194 dummy variable, 186 intermediate value theorem, 67 number of solutions, 112 intervals, 13 closed, 13 open, 13 inverse function(s), 127 hyperbolic functions, 248 inverses of, 248 restriction, 130 trigonometric functions, 132, 133 irrational numbers, 13 c©2020 School of Mathematics and Statistics, UNSW Sydney 285 l’Hoˆpital’s rule, 115, 233 l’Hoˆpital, Guillaume, 155 least upper bound property, 67, 70 Lebesgue, Henri, 171, 183 left-hand limits, 51 Leibniz, Gottfried, 11, 155, 171, 239 Leonard Euler, 227 limit(s), 11, 39 at infinity, 46 composition of function, 55 left-hand, 51 of continuous functions, 53 of the form lim x→∞ f(x), 39 pinching theorem, 50, 56 right-hand, 51 two-sided, 52 ln, 225 logarithmic functions, 231 logarithms, 225 maximum global, 107 local, 91, 105 second derivative test, 105 maximum-minimum theorem, 68, 69 mean value theorem, 97, 98 members (of set), 12 minimum global, 107 local, 105 second derivative test, 105 natural domain, 20 natural logarithm, 227, 249 natural numbers, 12 Newton, Isaac, 11, 155, 171 oblique asymptote, 144, 145 odd function, 144 one-to-one correspondence, 124, 125 one-to-one function, 125 open interval, 13 parabola(s), 31, 152 parametric curves, 150 conic sections, 151 parameter, 151 parametrisation, 151 polar curves, 158 partition, 175, 178 piecewise continuous, 181 pinching theorem, 41, 50, 56 polar coordinate, 157 polar curves, 158 polynomial, 24 coefficients of, 24 degree of, 24 leading coefficient, 24 polynomial division, 146 primitive, 114 product rule, 84 quadratic function, 24 quartic, 24 quintic, 24 radians, 25 range of a function, 21 rates of change, 90 rational function, 25 rational numbers, 12, 67 real numbers, 12, 67 Riemann integrable, 179 Riemann integral, 177 area under the graph, 177 basic properties of, 184 Riemann sum upper, 176, 178 Riemann sums, 193, 212 lower, 176, 179 Riemann, Bernhard, 171 right-hand limits, 51 Rolle’s theorem, 99 secant, 76 second derivative, 104, 107 set(s), 12 sine integral, 212 stationary point, 92 classification of, 104 subset, 14 sum and difference formulae, 245 tangent to a curve, 77 triangle inequality, 18 trigonometric functions, 25 inverse functions, 133 vertical asymptotes, 144 Weierstrass, 47 Zeno of Elea, 47 c©2020 School of Mathematics and Statistics, UNSW Sydney
欢迎咨询51作业君