082-Engineering-Mathematics-Anthony-Croft-Robert-Davison-Martin-Hargreaves-James-Flint-Edisi-5-2017
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
ENGINEERING
MATHEMATICS
A Foundation for Electronic, Electrical,
Communications and Systems Engineers
FIFTH EDITION
Anthony Croft • Robert Davison
Martin Hargreaves • James Flint
EngineeringMathematics
At Pearson, we have a simple mission: to help people
make more of their lives through learning.
We combine innovative learning technology with trusted
content and educational expertise to provide engaging
and effective learning experiences that serve people
wherever and whenever they are learning.
From classroom to boardroom, our curriculum materials, digital
learning tools and testing programmes help to educate millions
of people worldwide – more than any other private enterprise.
Every day our work helps learning flourish, and
wherever learning flourishes, so do people.
To learn more, please visit us at www.pearson.com/uk
FifthEdition
EngineeringMathematics
AFoundationforElectronic,Electrical,
CommunicationsandSystemsEngineers
AnthonyCroft
LoughboroughUniversity
RobertDavison
MartinHargreaves
CharteredPhysicist
JamesFlint
LoughboroughUniversity
Harlow, England • London • New York • Boston • San Francisco • Toronto • Sydney
Dubai • Singapore • Hong Kong • Tokyo • Seoul • Taipei • New Delhi
Cape Town • São Paulo • Mexico City • Madrid • Amsterdam • Munich • Paris • Milan
PEARSONEDUCATIONLIMITED
EdinburghGate
HarlowCM20 2JE
United Kingdom
Tel: +44 (0)1279 623623
Web: www.pearson.com/uk
First edition publishedunder the Addison-Wesleyimprint 1992 (print)
Second edition publishedunderthe Addison-Wesleyimprint 1996 (print)
Third edition publishedunder the PrenticeHall imprint 2001 (print)
Fourthedition published2013 (print and electronic)
Fiftheditionpublished2017(printandelectronic)
© Addison-WesleyPublishersLimited 1992, 1996 (print)
© Pearson EducationLimited 2001 (print)
© Pearson EducationLimited 2013, 2017 (print and electronic)
The rights of AnthonyCroft, Robert Davison,Martin Hargreavesand James Flint
to be identified as authors of this work havebeen asserted by them in
accordancewith the Copyright,Designsand PatentsAct 1988.
The print publicationis protected by copyright.Prior to any prohibitedreproduction,
storage in a retrievalsystem,distributionor transmissionin any form or by any means,
electronic,mechanical,recordingor otherwise,permissionshould be obtained from
the publisheror, whereapplicable,alicence permittingrestrictedcopying in the
United Kingdom shouldbe obtained from the CopyrightLicensingAgency Ltd,
Barnard’sInn, 86 FetterLane, LondonEC4A 1EN.
The ePublicationis protectedby copyright and must not be copied,reproduced,transferred,
distributed,leased,licensedor publiclyperformed or used in any way except as specifically
permittedin writing by the publishers,as allowed underthe termsand conditionsunderwhich
it was purchased,or as strictly permittedby applicable copyrightlaw.Any unauthorised
distributionor use of this text may be a direct infringementof the authors’and the publisher’s
rights and those responsiblemay be liable in law accordingly.
PearsonEducation is not responsiblefor the contentof third-party internetsites.
ISBN: 978-1-292-14665-2 (print)
978-1-292-14667-6 (PDF)
978-1-292-14666-9 (ePub)
BritishLibraryCataloguing-in-PublicationData
A catalogue recordfor the print edition is available from the British Library
LibraryofCongressCataloging-in-PublicationData
Names: Croft, Tony,1957– author.
Title: Engineering mathematics:afoundation for electronic,electrical,
communicationsand systemsengineers/AnthonyCroft, Loughborough
University,Robert Davison,De Montfort University,Martin Hargreaves,
De Montfort University,James Flint,LoughboroughUniversity.
Description:Fifth edition.| Harlow, England ; NewYork : Pearson,2017. ‖
Revised edition of: Engineeringmathematics : a foundation for electronic,
electrical,communications,and systemsengineers/AnthonyCroft, Robert
Davison,Martin Hargreaves.3rd editon.2001.| Includesindex.
Identifiers:LCCN2017011081| ISBN 9781292146652 (Print) | ISBN 9781292146676
(PDF) | ISBN 9781292146669 (ePub)
Subjects: LCSH: Engineeringmathematics.| Electrical
engineering–Mathematics.| Electronics–Mathematics.
Classification:LCC TA330 .C76 2017 | DDC 510–dc23
LC record available at https://lccn.loc.gov/2017011081
A catalog record for the print edition is available from the Library of Congress
10987654321
21 20 19 18 17
Print edition typesetin 10/12 Times Roman by iEnerziger Aptara ® , Ltd.
Printedin Slovakia by Neografia
NOTE THATANYPAGECROSS REFERENCES REFER TO THE PRINTEDITION
ToKate,TomandHarvey -- A.C.
ToKathy-- R.D.
Tomyfatherandmother --M.H.
ToSuzanne,AlexandraandDominic --J.F.
Contents
Preface
Acknowledgements
xvii
xix
Chapter1 Reviewofalgebraictechniques 1
1.1 Introduction 1
1.2 Laws of indices 2
1.3 Number bases 11
1.4 Polynomial equations 20
1.5 Algebraicfractions 26
1.6 Solution of inequalities 33
1.7 Partial fractions 39
1.8 Summation notation 46
Review exercises 1 50
Chapter2 Engineeringfunctions 54
2.1 Introduction 54
2.2 Numbers and intervals 55
2.3 Basic concepts of functions 56
2.4 Review of some common engineering functions and techniques 70
Review exercises 2 113
Chapter3 Thetrigonometricfunctions 115
3.1 Introduction 115
3.2 Degreesand radians 116
3.3 The trigonometric ratios 116
3.4 The sine, cosine and tangent functions 120
3.5 The sincxfunction 123
3.6 Trigonometric identities 125
3.7 Modellingwavesusing sint and cost 131
3.8 Trigonometric equations 144
Review exercises 3 150
viii Contents
Chapter4 Coordinatesystems 154
4.1 Introduction 154
4.2 Cartesiancoordinate system – two dimensions 154
4.3 Cartesiancoordinate system – three dimensions 157
4.4 Polar coordinates 159
4.5 Some simple polar curves 163
4.6 Cylindricalpolar coordinates 166
4.7 Spherical polar coordinates 170
Review exercises 4 173
Chapter5 Discretemathematics 175
5.1 Introduction 175
5.2 Set theory 175
5.3 Logic 183
5.4 Boolean algebra 185
Review exercises 5 197
Chapter6 Sequencesandseries 200
6.1 Introduction 200
6.2 Sequences 201
6.3 Series 209
6.4 The binomial theorem 214
6.5 Power series 218
6.6 Sequences arising from the iterativesolution
of non-linear equations 219
Review exercises 6 222
Chapter7 Vectors 224
7.1 Introduction 224
7.2 Vectors and scalars: basic concepts 224
7.3 Cartesiancomponents 232
7.4 Scalar fields and vector fields 240
7.5 The scalar product 241
7.6 The vector product 246
7.7 Vectors ofndimensions 253
Review exercises 7 255
Chapter8 Matrixalgebra 257
8.1 Introduction 257
8.2 Basic definitions 258
Contents ix
8.3 Addition,subtraction and multiplication 259
8.4 Using matrices in the translationand rotation of vectors 267
8.5 Some special matrices 271
8.6 The inverse of a 2 × 2 matrix 274
8.7 Determinants 278
8.8 The inverse of a 3 × 3 matrix 281
8.9 Applicationto the solution of simultaneous equations 283
8.10 Gaussian elimination 286
8.11 Eigenvaluesand eigenvectors 294
8.12 Analysis of electricalnetworks 307
8.13 Iterative techniquesfor the solution of simultaneous equations 312
8.14 Computer solutions of matrix problems 319
Review exercises 8 321
Chapter9 Complexnumbers 324
9.1 Introduction 324
9.2 Complex numbers 325
9.3 Operations with complex numbers 328
9.4 Graphical representationof complex numbers 332
9.5 Polar form of a complex number 333
9.6 Vectors and complex numbers 336
9.7 The exponential form of a complex number 337
9.8 Phasors 340
9.9 De Moivre’s theorem 344
9.10 Loci and regions of the complex plane 351
Review exercises 9 354
Chapter10 Differentiation 356
10.1 Introduction 356
10.2 Graphical approach to differentiation 357
10.3 Limits and continuity 358
10.4 Rate of change at a specific point 362
10.5 Rate of change at a general point 364
10.6 Existence of derivatives 370
10.7 Common derivatives 372
10.8 Differentiationas a linear operator 375
Review exercises 10 385
Chapter11 Techniquesofdifferentiation 386
11.1 Introduction 386
x Contents
11.2 Rules of differentiation 386
11.3 Parametric, implicit and logarithmicdifferentiation 393
11.4 Higher derivatives 400
Review exercises 11 404
Chapter12 Applicationsofdifferentiation 406
12.1 Introduction 406
12.2 Maximum points and minimum points 406
12.3 Points of inflexion 415
12.4 The Newton–Raphson method for solving equations 418
12.5 Differentiationof vectors 423
Review exercises 12 427
Chapter13 Integration 428
13.1 Introduction 428
13.2 Elementary integration 429
13.3 Definite and indefinite integrals 442
Review exercises 13 453
Chapter14 Techniquesofintegration 457
14.1 Introduction 457
14.2 Integration by parts 457
14.3 Integration by substitution 463
14.4 Integration using partialfractions 466
Review exercises 14 468
Chapter15 Applicationsofintegration 471
15.1 Introduction 471
15.2 Average value of a function 471
15.3 Root mean square value of a function 475
Review exercises 15 479
Chapter16 Furthertopicsinintegration 480
16.1 Introduction 480
16.2 Orthogonal functions 480
16.3 Improper integrals 483
16.4 Integral propertiesof the delta function 489
16.5 Integration of piecewise continuous functions 491
16.6 Integration of vectors 493
Review exercises 16 494
Contents xi
Chapter17 Numericalintegration 496
17.1 Introduction 496
17.2 Trapezium rule 496
17.3 Simpson’s rule 500
Review exercises 17 505
Chapter18 Taylorpolynomials,TaylorseriesandMaclaurinseries 507
18.1 Introduction 507
18.2 Linearizationusing first-order Taylor polynomials 508
18.3 Second-order Taylor polynomials 513
18.4 Taylor polynomials of the nth order 517
18.5 Taylor’s formula and the remainder term 521
18.6 Taylor and Maclaurinseries 524
Review exercises 18 532
Chapter19 OrdinarydifferentialequationsI 534
19.1 Introduction 534
19.2 Basic definitions 535
19.3 First-order equations: simple equationsand separation
of variables 540
19.4 First-order linear equations:use of an integratingfactor 547
19.5 Second-order linear equations 558
19.6 Constant coefficient equations 560
19.7 Series solution of differentialequations 584
19.8 Bessel’s equationand Bessel functions 587
Review exercises 19 601
Chapter20 OrdinarydifferentialequationsII 603
20.1 Introduction 603
20.2 Analogue simulation 603
20.3 Higher order equations 606
20.4 State-space models 609
20.5 Numerical methods 615
20.6 Euler’s method 616
20.7 Improved Euler method 620
20.8 Runge–Kuttamethod of order 4 623
Review exercises 20 626
Chapter21 TheLaplacetransform 627
21.1 Introduction 627
21.2 Definitionof the Laplace transform 628
xii Contents
21.3 Laplace transforms of some common functions 629
21.4 Properties of the Laplace transform 631
21.5 Laplace transform of derivativesand integrals 635
21.6 Inverse Laplacetransforms 638
21.7 Using partialfractions to find the inverse Laplacetransform 641
21.8 Finding the inverse Laplace transform using complex numbers 643
21.9 The convolution theorem 647
21.10Solving linear constant coefficient differential
equations using the Laplace transform 649
21.11Transfer functions 659
21.12Poles, zeros and the s plane 668
21.13Laplace transforms of some special functions 675
Review exercises 21 678
Chapter22 Differenceequationsandtheztransform 681
22.1 Introduction 681
22.2 Basic definitions 682
22.3 Rewriting difference equations 686
22.4 Block diagramrepresentationof difference equations 688
22.5 Design of a discrete-time controller 693
22.6 Numerical solution of difference equations 695
22.7 Definitionof the z transform 698
22.8 Sampling a continuous signal 702
22.9 The relationshipbetweenthe z transform and the
Laplace transform 704
22.10Properties of the z transform 709
22.11Inversion of z transforms 715
22.12The z transform and difference equations 718
Review exercises 22 720
Chapter23 Fourierseries 722
23.1 Introduction 722
23.2 Periodic waveforms 723
23.3 Odd and even functions 726
23.4 Orthogonality relationsand other useful identities 732
23.5 Fourier series 733
23.6 Half-range series 745
23.7 Parseval’s theorem 748
23.8 Complex notation 749
23.9 Frequency response of a linear system 751
Review exercises 23 755
Contents xiii
Chapter24 TheFouriertransform 757
24.1 Introduction 757
24.2 The Fourier transform – definitions 758
24.3 Some propertiesof the Fourier transform 761
24.4 Spectra 766
24.5 The t−ω dualityprinciple 768
24.6 Fourier transforms of some special functions 770
24.7 The relationship betweenthe Fourier transform
and the Laplace transform 772
24.8 Convolution and correlation 774
24.9 The discrete Fourier transform 783
24.10Derivationof the d.f.t. 787
24.11Using the d.f.t.to estimate a Fourier transform 790
24.12Matrixrepresentationof the d.f.t. 792
24.13Some propertiesof the d.f.t. 793
24.14The discrete cosine transform 795
24.15Discrete convolution and correlation 801
Review exercises 24 821
Chapter25 Functionsofseveralvariables 823
25.1 Introduction 823
25.2 Functions of more than one variable 823
25.3 Partial derivatives 825
25.4 Higher order derivatives 829
25.5 Partial differentialequations 832
25.6 Taylor polynomials and Taylor series in two variables 835
25.7 Maximum and minimum points of a function of two variables 841
Review exercises 25 846
Chapter26 Vectorcalculus 849
26.1 Introduction 849
26.2 Partial differentiationof vectors 849
26.3 The gradient of a scalar field 851
26.4 The divergence of a vector field 856
26.5 The curl of a vector field 859
26.6 Combining the operators grad, div and curl 861
26.7 Vector calculus and electromagnetism 864
Review exercises 26 865
xiv Contents
Chapter27 Lineintegralsandmultipleintegrals 867
27.1 Introduction 867
27.2 Line integrals 867
27.3 Evaluation of line integralsin two dimensions 871
27.4 Evaluation of line integralsin three dimensions 873
27.5 Conservative fields and potentialfunctions 875
27.6 Double and triple integrals 880
27.7 Some simple volume and surface integrals 889
27.8 The divergence theorem and Stokes’ theorem 895
27.9 Maxwell’s equationsin integral form 899
Review exercises 27 901
Chapter28 Probability 903
28.1 Introduction 903
28.2 Introducing probability 904
28.3 Mutuallyexclusive events: the additionlaw of probability 909
28.4 Complementary events 913
28.5 Concepts from communication theory 915
28.6 Conditional probability:the multiplicationlaw 919
28.7 Independent events 925
Review exercises 28 930
Chapter29 Statisticsandprobabilitydistributions 933
29.1 Introduction 933
29.2 Random variables 934
29.3 Probability distributions–discrete variable 935
29.4 Probability density functions – continuous variable 936
29.5 Mean value 938
29.6 Standard deviation 941
29.7 Expected value of a random variable 943
29.8 Standard deviationof a random variable 946
29.9 Permutations and combinations 948
29.10The binomial distribution 953
29.11The Poisson distribution 957
29.12The uniform distribution 961
29.13The exponentialdistribution 962
29.14The normal distribution 963
29.15Reliabilityengineering 970
Review exercises 29 977
Contents xv
AppendixI Representing a continuous function and a sequence
as a sum of weighted impulses 979
AppendixII The Greek alphabet 981
AppendixIII SI units and prefixes 982
( ) n
AppendixIV The binomial expansion of
982
n−N
n
Index 983
This page intentionally left blank
Preface
Audience
This book has been written to serve the mathematical needs of students engaged in a
firstcourseinengineeringatdegreelevel.Itisprimarilyaimedatstudentsofelectronic,
electrical,communicationsandsystemsengineering.Systemsengineeringtypicallyencompasses
areas such as manufacturing, control and production engineering. The textbook
will also be useful for engineers who wish to engage in self-study and continuing
education.
Motivation
Engineers are called upon to analyse a variety of engineering systems, which can be
anything from a few electronic components connected together through to a complete
factory.Theanalysisofthesesystemsbenefitsfromtheintelligentapplicationofmathematics.Indeed,
many cannot beanalysed withouttheuseofmathematics.Mathematics
is the language of engineering. It is essential to understand how mathematics works in
order to master the complex relationships present in modern engineering systems and
products.
Aims
Therearetwomainaimsofthebook.Firstly,wewishtoprovideanaccessible,readable
introductiontoengineeringmathematicsatdegreelevel.Thesecondaimistoencourage
the integration of engineering and mathematics.
Content
The first three chapters include a review of some important functions and techniques
thatthereadermayhavemetinpreviouscourses.Thismaterialensuresthatthebookis
self-contained and provides a convenient reference.
Traditional topics in algebra, trigonometry and calculus have been covered. Also included
are chapters on set theory, sequences and series, Boolean algebra, logic, difference
equations and theztransform. The importance of signal processing techniques is
reflectedbyathoroughtreatmentofintegraltransformmethods.ThustheLaplace,zand
Fourier transformshave been given extensive coverage.
Inthelightoffeedbackfromreaders,newtopicsandnewexampleshavebeenadded
in the fifth edition. Recognizing that motivation comes from seeing the applicability
of mathematics we have focused mainly on the enhancement of the range of applied
examples. These include topics on the discrete cosine transform, image processing, applicationsinmusictechnology,communicationsengineeringandfrequencymodulation.
xviii Preface
Style
The style of the book is to develop and illustrate mathematical concepts through examples.
We have tried throughout to adopt an informal approach and to describe mathematical
processes using everyday language. Mathematical ideas are often developed
by examples rather than by using abstract proof, which has been kept to a minimum.
This reflects the authors’ experience that engineering students learn better from practical
examples, rather than from formal abstract development. We have included many
engineering examples and have tried to make them as free-standing as possible to keep
thenecessaryengineeringprerequisitestoaminimum.Theengineeringexamples,which
havebeencarefullyselectedtoberelevant,informativeandmodern,rangefromshortillustrativeexamplesthroughtocompletesectionswhichcanberegardedascasestudies.
A further benefit is the development of the link between mathematics and the physical
world. An appreciation of this link is essential if engineers are to take fulladvantage of
engineeringmathematics.Theengineeringexamplesmakethebookmorecolourfuland,
moreimportantly,theyhelpdeveloptheabilitytoseeanengineeringproblemandtranslateitintoamathematicalformsothatasolutioncanbeobtained.Thisisoneofthemost
difficult skills that an engineer needs to acquire. The ability to manipulate mathematical
equations is by itself insufficient. It is sometimes necessary to derive the equations
corresponding to an engineering problem. Interpretation of mathematical solutions in
terms of the physical variables is also essential. Engineers cannot afford to get lost in
mathematical symbolism.
Format
Importantresultsarehighlightedforeasyreference.Exercisesandsolutionsareprovided
at the end of most sections; it is essential to attempt these as the only way to develop
competence and understanding is through practice. A further set of review exercises is
providedattheendofeachchapter.Inadditionsomesectionsincludeexercisesthatare
intended to be carried out on a computer using a technical computing language such as
MATLAB ® ,GNUOctave,MathematicaorPython ® .TheMATLAB ® commandsyntax
is supported in several software packages as well as MATLAB ® itself and will be used
throughout the book.
Supplements
A comprehensive Solutions Manual is obtainable free of charge to lecturers using this
textbook. Itisalsoavailable fordownload via the web atwww.pearsoned.co.uk/croft.
Finallywehopeyouwillcometoshareourenthusiasmforengineeringmathematics
and enjoy the book.
AnthonyCroft
RobertDavison
MartinHargreaves
JamesFlint
March2017
Acknowledgements
Wearegratefultothefollowingforpermissiontoreproducecopyrightmaterial:
Tables
Table29.7fromBiometrikaTablesforStatisticians,Vol.1,NewYork:Holt,Rinehart&
Winston(Hays,W.L. and Winkler,R.L. 1970) Table 1,© Cambridge UniversityPress.
Text
General Displayed Text on page xviii from https://www.mathworks.com/products/
matlab.html,MATLAB ® isaregisteredtrademarkofTheMathWorks,Inc.;GeneralDisplayed
Text xviii from Mathematica, https://www.wolfram.com/mathematica/, © Wolfram;
General Displayed Text xviii from https://www.python.org/, Python ® and the
Python logos are trademarks or registered trademarks of the Python Software Foundation,
used by Pearson Education Ltd with permission from the Foundation; General
Displayed Text on page 291 from http://www.blu-raydisc.com/en/, Blu-ray Disc is
a trademark owned by Blu-ray Disc Association (BDA); General Displayed Text on
page 291 from http://wimaxforum.org/home, WiMAX ® is a registered trademarks of
the WiMAX Forum. This work is produced by Pearson Education and is not endorsed
by any trademark owner referenced inthispublication.
This page intentionally left blank
1 Reviewofalgebraic
techniques
Contents 1.1 Introduction 1
1.2 Lawsofindices 2
1.3 Numberbases 11
1.4 Polynomialequations 20
1.5 Algebraicfractions 26
1.6 Solutionofinequalities 33
1.7 Partialfractions 39
1.8 Summationnotation 46
Reviewexercises1 50
1.1 INTRODUCTION
This chapter introduces some algebraic techniques which are commonly used in engineering
mathematics. For some readers this may be revision. Section 1.2 examines the
laws of indices. These laws are used throughout engineering mathematics. Section 1.3
looks at number bases. Section 1.4 looks at methods of solving polynomial equations.
Section 1.5 examines algebraic fractions, while Section 1.6 examines the solution of
inequalities. Section 1.7 looks at partial fractions. The chapter closes with a study of
summation notation.
Computers are used extensively in all engineering disciplines to perform calculations.
Some of the examples provided in this book make use of the technical computing
language MATLAB ® , which is commonly used in both an academic and industrial
setting.
Because MATLAB ® and many other similar languages are designed to compute not
just with single numbers but with entire sequences of numbers at the same time, data
is entered in the form of arrays. These are multi-dimensional objects. Two particular
typesofarrayarevectorsandmatriceswhicharestudiedindetailinChapters7and8.
2 Chapter 1 Review of algebraic techniques
Apart from being able to perform basic mathematical operations with vectors and
matrices, MATLAB ® has, in addition, a vast range of built-in computational functions
whicharestraightforwardtousebutneverthelessareverypowerful.Manyofthesehighlevelfunctionsareaccessiblebypassingdatatothemintheformofvectorsandmatrices.
A small number of these special functions are used and explained in this text. However,
to get the most out of a technical computing language it is necessary to develop
a good understanding of what the software can do and to make regular reference to the
manual.
1.2 LAWSOFINDICES
Considertheproduct6 ×6×6×6×6.Thismaybewrittenmorecompactlyas6 5 .We
call5theindexorpower. Thebaseis6.Similarly,y ×y×y×ymaybewrittenasy 4 .
Here the base isyand the index is4.
Example1.1 Write the following using index notation:
(a) (−2)(−2)(−2) (b) 4.4.4.5.5 (c)
Solution (a) (−2)(−2)(−2) may bewritten as (−2) 3 .
(b) 4.4.4.5.5 may be writtenas 4 3 5 2 .
yyy
xxxx
(d) aa(−a)(−a)
bb(−b)
(c)
yyy y3
may bewritten as
xxxx x 4.
(d) Note that (−a)(−a) = aa since the product of two negative quantities is positive.
Soaa(−a)(−a) =aaaa =a 4 . Alsobb(−b) = −bbb = −b 3 . Hence
aa(−a)(−a)
bb(−b)
= a4
−b = 3 −a4 b 3
Example1.2 Evaluate
(a) 7 3 (b) (−3) 3 (c) 2 3 (−3) 4
Solution (a) 7 3 = 7.7.7 = 343
(b) (−3) 3 = (−3)(−3)(−3) = −27
(c) 2 3 (−3) 4 = 8(81) = 648
Most scientific calculators have an x y button to enable easy calculation of expressions
ofasimilarform tothoseinExample 1.2.
1.2.1 Multiplyingexpressionsinvolvingindices
Consider the product (6 2 )(6 3 ). We maywrite thisas
(6 2 )(6 3 ) = (6.6)(6.6.6) = 6 5
1.2 Laws of indices 3
So
6 2 6 3 =6 5
This illustratesthefirst lawof indiceswhich is
a m a n =a m+n
When expressions with the same base aremultiplied, the indices areadded.
Example1.3 Simplifyeach of the following expressions:
(a) 3 9 3 10 (b) 4 3 4 4 4 6 (c) x 3 x 6 (d) y 4 y 2 y 3
Solution (a) 3 9 3 10 = 3 9+10 = 3 19
(b) 4 3 4 4 4 6 = 4 3+4+6 = 4 13
(c) x 3 x 6 =x 3+6 =x 9
(d) y 4 y 2 y 3 =y 4+2+3 =y 9
Engineeringapplication1.1
Powerdissipationinaresistor
The resistor is one of the three fundamental electronic components. The other two
are the capacitor and the inductor, which we will meet later. The role of the resistor
is to reduce the current flow within the branch of a circuit for a given voltage. As
current flows through the resistor, electrical energy is converted into heat. Because
the energy is lost from the circuit and is effectively wasted, it is termed dissipated
energy. The rate of energy dissipationisknown as the power,P, and isgiven by
P=I 2 R (1.1)
whereI isthecurrentflowingthroughtheresistorandRistheresistancevalue.Note
that the current is raised to the power 2. Note that power, P, is measured in watts;
current,I, ismeasured inamps;and resistance,R, is measured inohms.
Thereisanalternativeformulaforpowerdissipationinaresistorthatusesthevoltage,V,
across the resistor. To obtain this alternative formula we need to use Ohm’s
law,whichstatesthatthevoltageacrossaresistor,V,andthecurrentpassingthrough
it, are relatedby the formula
V=IR (1.2)
From Equation (1.2) wesee that
I = V R
(1.3)
Combining Equations (1.1) and (1.3) gives
( ) V 2
P = R = V ·V V2
·R =
R R R R
➔
4 Chapter 1 Review of algebraic techniques
Note that in this formula forP, the voltage is raised to the power 2. Note an important
consequence of this formula is that doubling the voltage, while keeping the
resistance fixed, results in the power dissipation increasing by a factor of 4, that is
2 2 .Alsotreblingthevoltage,forafixedvalueofresistance,resultsinthepowerdissipation
increasing by a factor of 9,thatis3 2 .
Similar considerations can be applied to Equation 1.1. For a fixed value of resistance,
doubling the current results in the power dissipation increasing by a factor of
4,andtreblingthecurrentresultsinthepowerdissipationincreasingbyafactorof9.
Consider the product 3(3 3 ). Now
3(3 3 ) = 3(3.3.3) = 3 4
Also, using the first law of indices we see that 3 1 3 3 = 3 4 . This suggests that 3 is the
same as 3 1 . This illustratesthe general rule:
a=a 1
Raising a number tothe power 1 leaves the number unchanged.
Example1.4 Simplify (a) 5 6 5 (b) x 3 xx 2
Solution (a) 5 6 5 = 5 6+1 = 5 7 (b) x 3 xx 2 =x 3+1+2 =x 6
1.2.2 Dividingexpressionsinvolvingindices
Consider the expression 45
4 3:
4 5
4 = 4.4.4.4.4
3 4.4.4
= 4.4 by cancelling 4s
= 4 2
This serves toillustratethesecond lawof indiceswhich is
a m
a n =am−n
When expressions with the same baseare divided, the indices aresubtracted.
Example1.5 Simplify
(a) 59
5 7 (b) (−2)16
(−2) 13
(c) x9
x 5
(d) y6
y
Solution (a)
(b)
5 9
5 = 7 59−7 = 5 2
(−2) 16
(−2) = 13 (−2)16−13 = (−2) 3
1.2 Laws of indices 5
(c)
(d)
x 9
x 5 =x9−5 =x 4
y 6
y =y6−1 =y 5
Consider the expression 23
23. Usingthe second law of indices we may write
2 3
2 = 3 23−3 = 2 0
But, clearly, 23
2 3 = 1,and so 20 = 1.This illustratesthe general rule:
a 0 =1
Any expression raised tothe power 0 is1.
1.2.3 Negativeindices
Consider the expression 43
45. We can write thisas
4 3
4 = 4.4.4
5 4.4.4.4.4 = 1
4.4 = 1 4 2
Alternatively, using the second law of indices we have
4 3
4 5 = 43−5 = 4 −2
So wesee that
4 −2 = 1 4 2
Thus we are able to interpret negative indices. The sign of an index changes when the
expression is inverted. Ingeneral wecan state
a −m = 1
a m a m = 1
a −m
Example1.6 Evaluate the following:
(a) 3 −2 2
(b) (c) 3 −1 (d) (−3) −2 (e) 6−3
4 −3 6 −2
Solution (a) 3 −2 = 1 3 = 1 2 9
2
(b)
4 = −3 2(43 ) = 2(64) = 128
(c) 3 −1 = 1 3 = 1 1 3
(d) (−3) −2 = 1
(−3) = 1 2 9
6 −3
(e)
6 = −2 6−3−(−2) = 6 −1 = 1 6 = 1 1 6
6 Chapter 1 Review of algebraic techniques
Example1.7 Write the following expressions using only positive indices:
(a) x −4 (b) 3x −4 (c) x−2
y −2
(d) 3x −2 y −3
Solution (a) x −4 = 1 x 4
(b) 3x −4 = 3 x 4
(c)
x −2
y −2 =x−2 y 2 = y2
x 2
(d) 3x −2 y −3 = 3
x 2 y 3
Engineeringapplication1.2
Powerdensityofasignaltransmittedbyaradioantenna
Aradioantennaisadevicethatisusedtoconvertelectricalenergyintoelectromagnetic
radiation,whichis thentransmitted todistant points.
An idealtheoreticalpointsourceradioantenna whichradiates the same power in
alldirectionsistermedanisotropicantenna.Whenittransmitsaradiowave,thewave
spreads out equally in all directions, providing there are no obstacles to block the
expansionofthewave.Thepowergeneratedbytheantennaisuniformlydistributed
on the surfaceofanexpanding sphereofarea,A, given by
A = 4πr 2
whereristhe distance fromthe generatingantenna tothe wave front.
The power density,S, provides an indication of how much of the signal can potentially
be received by another antenna placed at a distance r. The actual power
received depends on the effective area or aperture of the antenna, which is usually
expressed inunitsofm 2 .
Electromagneticfieldexposurelimitsforhumansaresometimesspecifiedinterms
of a power density. The closer a person is to the transmitter, the higher the power
densitywill be. So a safe distanceneeds tobedetermined.
Thepowerdensityistheratioofthepowertransmitted,P t
,totheareaoverwhich
itisspread
S =
power transmitted
area
= P t
4πr 2 = P t
4π r−2 W m −2
Note that r in this equation has a negative index. This type of relationship is
knownasaninversesquarelawandisfoundcommonlyinscienceandengineering.
Note that if the distance,r, is doubled, then the area,A, increases by a factor of
4 (i.e. 2 2 ). If the distance is trebled, the area increases by a factor of 9 (i.e. 3 2 ) and
so on. This means that as the distance from the antenna doubles, the power density,
S, decreases to a quarter of its previous value; if the distance trebles then the power
density isonly a ninth ofits previous value.
1.2 Laws of indices 7
1.2.4 Multipleindices
Consider the expression (4 3 ) 2 . This may be writtenas
(4 3 ) 2 =4 3 .4 3 =4 3+3 =4 6
This illustratesthethirdlaw ofindiceswhich is
(a m ) n =a mn
Note that the indicesmandnhave been multiplied.
Example1.8 Writethe following expressions usingasingleindex:
(a) (3 2 ) 4 (b) (7 −2 ) 3 (c) (x 2 ) −3 (d) (x −2 ) −3
Solution (a) (3 2 ) 4 = 3 2×4 = 3 8
(b) (7 −2 ) 3 = 7 −2×3 = 7 −6
(c) (x 2 ) −3 =x 2×(−3) =x −6
(d) (x −2 ) −3 =x −2×−3 =x 6
Consider the expression (2 4 5 2 ) 3 . We see that
(2 4 5 2 ) 3 = (2 4 5 2 )(2 4 5 2 )(2 4 5 2 )
=2 4 2 4 2 4 5 2 5 2 5 2
= 2 12 5 6
This illustratesageneralization of the thirdlaw of indices which is
(a m b n ) k =a mk b nk
Example1.9 Remove the brackets from
(a) (2x 2 ) 3 (b) (−3y 4 ) 2 (c) (x −2 y) 3
Solution (a) (2x 2 ) 3 = (2 1 x 2 ) 3 = 2 3 x 6 = 8x 6
(b) (−3y 4 ) 2 = (−3) 2 y 8 = 9y 8
(c) (x −2 y) 3 =x −6 y 3
Engineeringapplication1.3
Radarscattering
It has already been shown in Engineering application 1.2 that the power density of
an isotropic transmitter of radio waves is
S = P t
4π r−2 W m −2
➔
8 Chapter 1 Review of algebraic techniques
It is possible to use radio waves to detect distant objects. The technique involves
transmittingaradiosignal,whichisthenreflectedbackwhenitstrikesatarget.This
weakreflectedsignalisthenpickedupbyareceivingantenna,thusallowinganumber
ofpropertiesofthetargettobededuced,suchasitsangularpositionanddistancefrom
the transmitter. This system is known as radar, which was originally an acronym
standing for RAdioDetectionAndRanging.
When the wave hits the target it produces a quantity of reflected power. The
power depends upon the object’s radar cross-section (RCS), normally denoted by
theGreeklower caselettersigma, σ,andhaving unitsofm 2 .Thepower reflected at
the object,P r
, isgiven by
P r
=Sσ= P t σ
4π r−2 W
SomemilitaryaircraftusespecialtechniquestominimizetheRCSinordertoreduce
the amount of power they reflect and hence minimize the chance of being detected.
If the reflected power at the target is assumed to spread spherically, when it
returns tothe transmitter position itwill have the power density,S r
, given by
S r
=
power reflected attarget
area
= P r
4π r−2 W m −2
Substituting forthe reflected power,P r
, gives
power reflected attarget
S r
=
area
= P t σ W m −2
(4π) 2r−4
=
(
Pt σ
4π r−2 )
4π
r −2 =
P t σ ( ) r
−2 2
4π×4π
Notethattheproductofthetwor −2 termshasbeencalculatedusingthethirdlawof
indices.
Thisexampleillustratesoneofthemainchallengeswithradardesignwhichisthat
thepowerdensityreturnedbyadistantobjectisverymuchsmallerthanthetransmitted
power, even for targets with a large RCS. For theoretical isotropic antennas, the
received power density depends upon the factorr −4 . This factor diminishes rapidly
forlarge values ofr, thatis, as the object being detected gets further away.
In practice, the transmit antennas used are not isotropic but directive and often
scantheareaofinterest.Theyalsomakeuseofreceiveantennaswithalargeeffective
area which can produce a viable signalfrom the smallreflected power densities.
1.2.5 Fractionalindices
Thethirdlawofindicesstatesthat (a m ) n =a mn .Ifwetakea = 2,m = 1 2 andn=2we
obtain
(2 1/2 ) 2 = 2 1 = 2
So when 2 1/2 is squared, the result is 2. Thus, 2 1/2 is a square root of 2. Each positive
number has two square roots and so
2 1/2 = √ 2 = ±1.4142 ...
1.2 Laws of indices 9
Similarly
(2 1/3 ) 3 = 2 1 = 2
so that2 1/3 isacube rootof 2:
√
2 1/3 = 3 2=1.2599...
Ingeneral 2 1/n isannthrootof 2.The general lawstates
x 1/n isannthrootofx
Example1.10 Write the following usingasingle positive index:
(a) (3 −2 ) 1/4 (b) x 2/3 x 5/3 (c) yy −2/5 (d) √ k 3
Solution (a) (3 −2 ) 1/4 = 3 −2×1 4 = 3 −1/2 = 1
3 1/2
(b) x 2/3 x 5/3 =x 2/3+5/3 =x 7/3
(c) yy −2/5 =y 1 y −2/5 =y 1−2/5 =y 3/5
(d) √ k 3 = (k 3 ) 1/2 =k 3×1 2 =k
3/2
Example1.11 Evaluate
(a) 8 1/3 (b) 8 2/3 (c) 8 −1/3 (d) 8 −2/3 (e) 8 4/3
Solution We notethat 8 maybewritten as2 3 .
(a) 8 1/3 = (2 3 ) 1/3 = 2 1 = 2
(b) 8 2/3 = (8 1/3 ) 2 = 2 2 = 4
(c) 8 −1/3 = 1
8 1/3 = 1 2
(d) 8 −2/3 = 1
8 2/3 = 1 4
(e) 8 4/3 = (8 1/3 ) 4 = 2 4 = 16
Engineeringapplication1.4
Skindepthinaradialconductor
When an alternating current signal travels along a conductor, such as a copper wire,
most of the current is found near the surface of the conductor. Nearer to the centre
of the conductor, the current diminishes. The depth of penetration of the signal,
termed the skin depth, into the conductor depends on the frequency of the signal.
Skin depth, illustrated in Figure 1.1, is defined as the depth at which the current
➔
10 Chapter 1 Review of algebraic techniques
densityhasdecayedtoapproximately37%ofthatattheedge.Skindepthisimportant
because it affects the resistance of wires and other conductors: the smaller the skin
depth, the higher the effective resistance and the greater the loss due toheating.
At low frequencies, such as those found in the domestic mains supply, the skin
depthissolargethatoftenitcan beneglected; however, invery large-diameter conductors
and smaller conductors at microwave frequencies it becomes important and
has tobe taken into account.
The skin depth, δ, isgiven by
( ) 2 1/2
δ =
ωµσ
where µisamaterialconstantknownasthepermeabilityoftheconductor, ω isthe
angular frequency of the signal and σ isthe conductivity of the conductor.
d
Figure1.1
Cross-sectionofaradialconductor
illustratingaskindepth δ.
EXERCISES1.2
1 Evaluate
(a) 2 3 (b) 3 2 5 13
(c)
5 12
19 −11
(d)
19 −13 (e) (2 1/4 ) 8 (f) (−4) −2
(g) 4 −1/2 (h) (9 1/3 ) 3/2 √ √
(i) 32 2
√
(j) 0.01 (k) 81 3/4
2 Useascientificcalculator to evaluate
(a) 10 1.2 (b) 6 −0.7 (c) 6 2.5
(d) (3 −1 4 2 ) 0.8
3 Expresseachofthe followingexpressions usinga
single positiveindex:
(a) x 4 x 7 (b) x 2 (−x)
(c)
x 2
x
(d)
x −2
x −1
(e) (x −2 ) 4 (f) (x −2.5 x −3.5 ) 2
4 Simplify asmuch aspossible
(a)
(c)
x 1/2
x 1/3 (b) (16x 4 ) 0.25
( ) 1/3
27 2xy 2
y 3 (d)
(2xy) 2
(e) √ a 2 b 6 c 4 (f) (64t 3 ) 2/3
1.3 Number bases 11
Solutions
1 (a) 8 (b) 9 (c) 5 (d) 361
1 1
(e) 4 (f) (g) (h) 3
16 2
(i) 8 (j) 0.1 (k) 27
2 (a) 15.8489 (b) 0.2853
(c) 88.1816 (d) 3.8159
3 (a) x 11 (b) −x 3 (c) x
1 1 1
(d) (e)
x x 8 (f)
x 12
4 (a) x 1/6 (b) 2x (c)
(d)
1
2x
3
y
(e) ab 3 c 2 (f) 16t 2
1.3 NUMBERBASES
The decimal system of numbers in common use is based on the 10 digits 0, 1, 2, 3, 4,
5, 6, 7, 8 and 9. However, other number systems have important applications in computerscienceandelectronicengineering.Inthissectionweremindthereaderofwhatis
meant by a number in the decimal system, and show how we can use powers or indices
with bases of 2 and 16 to represent numbers in the binary and hexadecimal systems
respectively.Wefollowthisbyanexplanationofanalternativebinaryrepresentationof
a number known as binarycoded decimal.
1.3.1 Thedecimalsystem
Thenumbersthatwecommonlyuseineverydaylifearebasedon10.Forexample,253
can bewrittenas
253=200+50+3
= 2(100) + 5(10) + 3(1)
= 2(10 2 ) + 5(10 1 ) + 3(10 0 )
In this form it is clear why we refer to this as a ‘base 10’ number. When we use 10 as a
basewesaywearewritinginthedecimalsystem.Notethatinthedecimalsystemthere
are10digits:0,1,2,3,4,5,6,7,8,and9.Youmayrecallthephrase‘hundreds,tensand
units’ and as we have seen these are simply powers of 10. To avoid possible confusion
with numbers using other bases, we denote numbers in base 10 with a small subscript,
forexample, 5192 10
:
5192 10
= 5000 +100 +90 +2
= 5(1000) + 1(100) + 9(10) + 2(1)
= 5(10 3 ) + 1(10 2 ) + 9(10 1 ) + 2(10 0 )
Notethat,inthepreviousline,aswemovefromrighttoleft,thepowersof10increase.
1.3.2 Thebinarysystem
Abinarysystemusesthenumber2foritsbase.Abinarysystemhasonlytwodigits,0
and 1, and these are called binary digits or simply bits. Binary numbers are based on
powersof2.Inacomputer,binarynumbersareusuallystoredingroupsof8bitswhich
wecall abyte.
12 Chapter 1 Review of algebraic techniques
Convertingfrombinarytodecimal
Considerthebinarynumber110101 2
.Asthebaseis2thismeansthataswemovefrom
right toleft the position of each digitrepresents an increasing power of2asfollows:
110101 2
= 1(2 5 ) +1(2 4 ) +0(2 3 ) +1(2 2 ) +0(2 1 ) +1(2 0 )
= 1(32) +1(16) +0(8) +1(4) +0(2) +1(1)
=32+16+4+1
= 53 10
Hence 110101 2
and 53 10
areequivalent.
Example1.12 Convert the following todecimal: (a) 1111 2
(b)101010 2
Solution (a) 1111 2
= 1(2 3 ) +1(2 2 ) +1(2 1 ) +1(2 0 )
= 1(8) +1(4) +1(2) +1(1)
=8+4+2+1
= 15 10
(b) 101010 2
= 1(2 5 ) +0(2 4 ) +1(2 3 ) +0(2 2 ) +1(2 1 ) +0(2 0 )
=1(32)+0+1(8)+0+1(2)+0
=32+8+2
= 42 10
Convertingdecimaltobinary
Wenowlookatsomeexamplesofconvertingnumbersinbase10tonumbersinbase2,
that is from decimal to binary. We make use of Table 1.1, which shows various powers
of 2,when converting from decimal tobinary. Table 1.1 may be extended as necessary.
Table1.1
Powers of2.
2 0 1 2 4 16 2 8 256
2 1 2 2 5 32 2 9 512
2 2 4 2 6 64 2 10 1024
2 3 8 2 7 128 2 11 2048
Example1.13 Convert 83 10
toabinary number.
Solution We need to express 83 10
as the sum of a set of numbers, each of which is a power of 2.
From Table 1.1 we see that 64 is the highest number in the table that does not exceed
the given number of 83. We write
83=64+19
Wenowfocusonthe19.FromTable1.1,16isthehighestnumberthatdoesnotexceed
19. So wewrite
19=16+3
1.3 Number bases 13
giving
83=64+16+3
We now focus on the 3 and again usingTable 1.1 we may write
83=64+16+2+1
=2 6 +2 4 +2 1 +2 0
= 1(2 6 ) +0(2 5 ) +1(2 4 ) +0(2 3 ) +0(2 2 ) +1(2 1 ) +1(2 0 )
= 1010011 2
Example1.14 Express 200 10
as a binary number.
Solution From Table 1.1 we note that 128 is the highest number that does not exceed 200 so we
write
200 = 128 +72
UsingTable 1.1 repeatedly we may write
200 = 128 +72
=128+64+8
=2 7 +2 6 +2 3
= 1(2 7 ) +1(2 6 ) +0(2 5 ) +0(2 4 ) +1(2 3 ) +0(2 2 ) +0(2 1 ) +0(2 0 )
= 11001000 2
Anotherwaytoconvertdecimalnumberstobinarynumbersistodivideby2repeatedly
and note the remainder. We rework the previous two examples using thismethod.
Example1.15 Convert the following decimal numbers tobinary: (a)83 (b) 200
Solution (a) We divide by 2 repeatedly and note the remainder.
Remainder
83÷2=41r1 1
41÷2=20r1 1
20÷2=10r0 0
10÷2= 5r0 0
5÷2= 2r1 1
2÷2= 1r0 0
1÷2= 0r1 1
To obtain the binary number we write out the remainder, working from the bottom
one tothe topone. This gives
as before.
83 10
= 1010011 2
14 Chapter 1 Review of algebraic techniques
(b) We repeat the process by repeatedly dividing 200 by 2 and noting the remainder.
Remainder
200÷2=100r0 0
100÷2= 50r0 0
50÷2= 25r0 0
25÷2= 12r1 1
12÷2= 6r0 0
6÷2= 3r0 0
3÷2= 1r1 1
1÷2= 0r1 1
Readingtheremaindercolumnfromthebottomtothetopgivestherequiredbinary
number:
200 10
= 11001000 2
1.3.3 Hexadecimalsystem
We now consider the number system which uses 16 as a base. This system is termed
hexadecimal (or simply hex). There are 16 digits in the hexadecimal system: 0, 1, 2,
3, 4, 5, 6, 7, 8, 9, A, B, C, D, E, and F. Notice that conventional decimal digits are
insufficient to represent hexadecimal numbers and so additional ‘digits’, A, B, C, D, E,
andF,areincluded.Table1.2showstheequivalencebetweendecimalandhexadecimal
digits. Hexadecimal numbers are based on powers of16.
Table1.2
Hexadecimal numbers.
Decimal Hexadecimal Decimal Hexadecimal
0 0 8 8
1 1 9 9
2 2 10 A
3 3 11 B
4 4 12 C
5 5 13 D
6 6 14 E
7 7 15 F
Convertingfromhexadecimaltodecimal
The following example illustrates how to convert from hexadecimal to decimal. We
use the fact that as we move from right to left, the position of each digit represents an
increasing power of 16.
1.3 Number bases 15
Example1.16 Convert the following hexadecimal numbers todecimal numbers: (a)93A (b)F9B3
Solution (a) Noting thathexadecimal numbers use base16 we have
93A 16
= 9(16 2 ) +3(16 1 ) +A(16 0 )
= 9(256) +3(16) +10(1)
= 2362 10
(b) F9B3 16
= F(16 3 ) +9(16 2 ) +B(16 1 ) +3(16 0 )
= 15(4096) +9(256) +11(16) +3(1)
= 63923 10
Convertingfromdecimaltohexadecimal
Table 1.3 provides powers of 16 which help in the conversion from decimal to hexadecimal.
Table1.3
16 0 1
16 1 16
16 2 256
16 3 4096
16 4 65536
The following example illustrateshow toconvert from decimal tohexadecimal.
Example1.17 Convert 14397 toahexadecimal number.
Solution We need to express 14397 as the sum of multiples of powers of 16. From Table 1.3 we
see that the highest number that does not exceed 14397 is 4096. We express 14397 as
amultipleof4096withanappropriateremainder. Dividing14397by4096weobtain3
with a remainder of 2109. So we may write
14397 = 3(4096) +2109
Wenowfocuson2109andapplythesameprocessasabove.FromTable1.3thehighest
number thatdoes not exceed 2109 is 256:
2109 = 8(256) +61
Finally, 61 = 3(16) +13. So wehave
14397 = 3(4096) +8(256) +3(16) +13
= 3(16 3 ) +8(16 2 ) +3(16 1 ) +13(16 0 )
FromTable 1.2 wesee that13 10
isDinhexadecimal, sowe have
14397 10
= 383D 16
As with base 2 we can convert decimal numbers by repeated division and noting the
remainder. The previous example isreworked toillustratethis.
16 Chapter 1 Review of algebraic techniques
Example1.18 Convert 14397 tohexadecimal.
Solution We divide repeatedly by 16, noting the remainder.
Remainder
14397÷16=899r13 13
899÷16= 56r3 3
56÷16= 3r8 8
3÷16= 0r3 3
Recall that 13 inhexadecimal isD.Reading up the Remainder column wehave
as before.
14397 10
= 383D 16
Electronic engineers need to be familiar with the decimal, binary and hexadecimal systemsandbeabletoconvertbetweenthem.Theequivalentrepresentationsofthedecimal
numbers 0--15 are provided inTable 1.4.
Table1.4
Decimal Binary Hex Decimal Binary Hex
0 0000 0 8 1000 8
1 0001 1 9 1001 9
2 0010 2 10 1010 A
3 0011 3 11 1011 B
4 0100 4 12 1100 C
5 0101 5 13 1101 D
6 0110 6 14 1110 E
7 0111 7 15 1111 F
Convertingfrombinarytohexadecimal
There is a straightforward way of converting a binary number into a hexadecimal number.Thedigitsofthebinarynumberaregroupedintofours,orquartets,(fromtherighthand
side) and each quartetisconverted toits hex equivalent using Table 1.4.
Example1.19 Convert 1101011100111 2
into hexadecimal.
Solution Working from the right, the binary number is grouped into fours, with additional zeros
being added as necessary tothe final grouping.
0001 1010 1110 0111
1.3 Number bases 17
Table1.4isusedtoexpresseachgroupoffourasitshexequivalent.Forexample,0111 =
7 16
, and continuing inthis waywe obtain
1AE7
Thus 110101110 0111 2
= 1AE7 16
.
1.3.4 Binarycodeddecimal
We have seen in Section 1.3.2 that decimal numbers can be expressed in an equivalent
binary form where the position of each binary digit, moving from the right to the left,
represents an increasing power of 2. There is an alternative way of expressing numbers
using the binary digits 1 and 0 that is often used in electronic engineering because for
some applications it is more straightforward to build the necessary hardware. This systemiscalledbinary
coded decimal(b.c.d.).
First of all, recall how the decimal digits 0,1,2,...,9 are expressed in their usual
binary form. Note that the largest decimal digit 9 is 1001 in binary, and so we need
at most four digits to store the binary representations of 0,1,...,9. Expressing each
decimal digitas a four-digitbinary number we obtain Table 1.5.
Table1.5
Decimal digits and their four-digit
binaryrepresentations.
0 0000 5 0101
1 0001 6 0110
2 0010 7 0111
3 0011 8 1000
4 0100 9 1001
A four-digit binary number is referred to as a nibble. To express a multi-digit decimal
number, such as 347, in b.c.d. each decimal digit in turn is converted into its binary
representation as shown. Note thatanibble isused foreach decimal digit.
3 4 7
↓ ↓ ↓
0011 0100 0111
Recall from Section 1.3.2 that a byte is a group of 8 bits (binary digits). Computers
usually store numbers in 8-bit bytes so there are two common ways of encoding b.c.d.
The first is to use a whole byte for each nibble, with the first 4 bits always set to 0. So,
forexample, 347 10
can be storedas
00000011 00000100 00000111
Alternatively, eachbytecanbeusedtostoretwonibbles,inwhichcase347 10
wouldbe
storedas
00000011 01000111
Ruleshavebeendevelopedforperformingcalculationsinb.c.d.butthesearebeyondthe
scope of thisbook.
18 Chapter 1 Review of algebraic techniques
Engineeringapplication1.5
Seven-segmentdisplays
The number displays found on music systems, video and other electronic equipment
commonly employ one or more seven-segment indicators. A single sevensegment
indicator is shown in Figure 1.2(a). The individual segments are typically
illuminatedwithalight-emittingdiode(LED)orsimilaropticaldeviceandareeither
onoroff.ThesegmentsareilluminatedaccordingtothetableshowninFigure1.2(b),
where 1 indicates thatthe segment is turned on and 0 indicates thatitisturned off.
b.c.d.
number a b c d e f g
0000 1
1 1 1 1 1 0
0001
0 1 1 0 0 0 0
0010
1 1 0 1 1 0 1
a
0011
0100
1 1 1 1 0 0 1
0 1 1 0 0 1 1
f
g
b
0101
0110
1 0 1 1 0 1 1
1 0 1 1 1 1 1
e
d
c
0111
1000
1001
1 1 1 0 0 0 0
1 1 1 1 1 1 1
1 1 1 1 0 1 1
(a)
(b)
Figure1.2
(a) Seven-segment LED display. (b)Seven-segment coding.
The numbers in the microprocessor system driving the display are typically
stored in binary format, known as, binary coded decimal (b.c.d.). As an example
we consider displaying binary number 11101010 2
as a decimal number on sevensegment
displays. This represents the decimal number 234, which requires three
seven-segment displays.
Themicroprocessorfirstdividestheinputnumberby100andinthiscaseobtains
the result 2 with a remainder of 34. This can be done directly on the binary number
itself via a series of operations within the assembly language of the microprocessor
without first converting to a decimal number. The result 2 = 0010 2
is then decoded
usingFigure1.2(b),givingthebitpattern1101101whichispassedtothe‘hundreds’
display.
Theremainderof34isthendividedby10giving3withafinalremainderof4.The
number 3 = 0011 2
and so this can be outputted to the ‘tens’ display as the pattern
1111001. Finally, 4 = 0100 2
, which ispassed tothe display as the pattern 0110011.
1.3 Number bases 19
The display shows
a
a
a
f
g
b
f
g
b
f
g
b
e
d
c
e
d
c
e
d
c
Notice that prior to decoding for display, by successive division by 100 and 10
the number has been converted into separate b.c.d. digits. Integrated circuits are
available which convert b.c.d. directly into the bit patterns for display. Hence the
output bit pattern of the microprocessor may be chosen to be b.c.d. In this case it
has the advantage that fewer pins are required on the microprocessor to operate the
display.
EXERCISES1.3
1 Convert the following decimal numbersto binary
numbers:(a) 19 (b)36 (c) 100 (d)796
(e)5000
2 Convert the following binarynumbers to decimal
numbers:(a) 111 (b)10101 (c) 111001
(d)1110001 (e) 11111111
3 What isthe highestdecimal number that can be
written in binary formusingamaximum of(a)2
binarydigits (b)3binarydigits (c)4binary digits
(d)5binary digits? Canyou spotapattern? (e)Write
aformulaforthe highest decimal number that can be
written usingN binary digits.
4 Writethe decimal number 0.5 in binary.
5 Convert the following hexadecimal numbersto
decimal numbers:(a)91 (b)6C (c)A1B (d)F9D4
(e)ABCD
6 Convert the following decimal numbersto
hexadecimal numbers:(a)160 (b)396 (c)5010
(d)25000 (e)1000000
7 Calculate the highestdecimal number that can be
represented byahexadecimal number with (a)1digit
(b)2digits (c)3digits (d)4digits (e)N digits
8 Expressthedecimalnumber375asbothapurebinary
number and anumber in b.c.d.
9 Convert (a)1111111 2 (b)101010111 2 into
hexadecimal.
Solutions
1 (a) 19 10 =10011 2 (b) 100100 (c) 1100100
(d) 1100011100 (e) 1001110001000
2 (a)111 2 = 7 (b)21 (c) 57 (d)113 (e)255
3 (a)3 (b)7 (c) 15 (d)31 (e) 2 N −1
4 Thebinarysystem is basedonpowers of2. The
examples in the text can be extended to the case of
negative powers of2justasin the decimal system
numbersafterthe decimal place represent negative
powers of10.So, forexample, the binarynumber
11.101 2 isconverted to decimal asfollows:
11.101 2 =1×2 1 + 1×2 0 + 1×2 −1
+0×2 −2 +1×2 −3
=2+1+ 1 2 + 1 8
= 3 5 8
20 Chapter 1 Review of algebraic techniques
Inthe same way the binary equivalent ofthe decimal
number 0.5is 0.1.
5 (a) 91 16 =145 10 (b)6C =108 (c)2587 (d)63956
(e) 43981
6 (a) 160 10 = A0 (b)18C (c) 1392 (d)61A8
(e) F4240
7 (a)15 (b)255 (c)4095 (d)65535 (e)16 N −1
8 (a)101110111 2 (b)001101110101 bcd
9 (a)7F (b)157
1.4 POLYNOMIALEQUATIONS
Apolynomial equationhas the form
P(x) =a n
x n +a n−1
x n−1 +a n−2
x n−2 + ··· +a 2
x 2 +a 1
x +a 0
=0 (1.4)
where n is a positive whole number, a n
, a n−1
,...,a 0
are constants and x is a
variable. The constants a n
, a n−1
,...,a 2
, a 1
, a 0
are called the coefficients of the
polynomial.
The roots of an equation are those values ofxwhich satisfyP(x) = 0. So ifx =x 1
is a
rootthenP(x 1
) = 0.
Examples of polynomial equations are
7x 2 +4x−1=0 (1.5)
2x−3=0 (1.6)
x 3 −20=0 (1.7)
Thedegreeofanequationisthevalueofthehighestpower.Equation(1.5)hasdegree2,
Equation (1.6) has degree 1 and Equation (1.7) has degree 3. A polynomial equation of
degreenhasnroots.
Therearesomespecialnamesforpolynomialequationsoflowdegree(seeTable1.6).
Table1.6
Equation Degree Name
ax+b=0 1 Linear
ax 2 +bx +c = 0 2 Quadratic
ax 3 +bx 2 +cx+d=0 3 Cubic
ax 4 +bx 3 +cx 2 +dx +e =0 4 Quartic
1.4.1 Quadraticequations
Wenowfocusattentiononquadraticequations.Thestandardformofaquadraticequation
isax 2 +bx +c = 0.We look atthreemethods of solvingquadratic equations:
(1) factorization,
(2) use of a formula,
(3) completing the square.
Example 1.20 illustratessolution by factorization.
1.4 Polynomial equations 21
Example1.20 Solve
6x 2 +11x−10=0
Solution The left-hand side(l.h.s.)isfactorized:
(3x −2)(2x +5) =0
So either
3x−2=0 or 2x+5=0
Hence
x = 2 3 ,−5 2
When roots cannotbefound byfactorizationwecan make use ofaformula.
Theformulaforfindingthe roots ofax 2 +bx +c = 0 is
x = −b ± √ b 2 −4ac
2a
Example1.21 Usethe quadraticformulatosolve
3x 2 −x−6=0
Solution Comparing3x 2 −x−6 withax 2 +bx +cwesee thata = 3,b= −1 andc = −6.So
x = −(−1) ± √ (−1) 2 −4(3)(−6)
2(3)
= 1 ± √ 73
6
= −1.2573,1.5907
Engineeringapplication1.6
Currentusedbyanelectricvehicle
Personal transport systems that make use of electrical power are becoming increasinglycommon.Oneofthefactorsbehindthischangeisthattheirusecanreduceroadsidepollutioninanurbanenvironment.Electricalvehicleshavealsobecomethebase
forself-drivingcars when combinedwith electricalcontroland navigation systems.
The motor in an operational electric vehicle has to do work to overcome wind,
inertia, friction, road resistance and in order to climb inclines. The energy supply
in the form of electrical power comes from the on-board battery pack. Due to its
internal construction the battery pack has a total internal resistance,R, which serves
toreduce the power available tothe motor.
➔
22 Chapter 1 Review of algebraic techniques
Asimplified circuitdiagram of a vehicle isshown inFigure 1.3.
Internal
resistance
R
I
Terminals
V
Motor and gearbox
Drive
wheels
Battery pack
Figure1.3
Electric vehicle wiringdiagram.
The total power delivered by the battery pack is
power = voltage ×current =VI
This is shared between loss due to the internal resistance and the power, P, to the
motor. The power loss due to the internal resistance isI 2 R (see Engineering application
1.1).So the equation for the power inthe circuitis
VI=I 2 R+P
This can be rewritten into the formof a quadratic equation
RI 2 −VI+P=0
whichcan besolved tocalculate the currentinthe wireforaparticular power delivered
to the motor. It is important to know the current in order to specify the size of
the fuses,the motor controller and the wirediameters used inthe vehicle.
Consider the case where the power output is 2 kW. If the circuit parameters are
V = 150 volts, R = 1.6 ,we have
1.6I 2 −150I +2000 = 0
The solutions tothe quadratic equation are
I = −b ± √ b 2 −4ac
2a
= 77.7 A, 16.1 A
= −(−150) ± √ (−150) 2 − (4 ×1.6 ×2000)
2×1.6
Therelevantsolutiondependsontheelectricalcharacteristicsofthemotorusedinthe
circuit. In practice, the larger of the two currents would correspond to a substantial
loss inthe internal resistance and would beavoided by the correctchoice of motor.
The technical computing language MATLAB ® has the function roots which
finds the solutions of a polynomial equation. In this example we would type
roots ([1.6 -150 2000]) at the command line to obtain the results calculated
above.
1.4 Polynomial equations 23
We now introduce the method of completing the square. The idea behind completing
the square is to absorb both thex 2 and thexterm into a single squared term. Note that
thisispossible since
and so
x 2 +2kx+k 2 =(x+k) 2
x 2 +2kx=(x+k) 2 −k 2
and finally
x 2 +2kx+A=(x+k) 2 +A−k 2
Thex 2 and thexterms are both contained in the (x +k) 2 term. The coefficient ofxon
the l.h.s.
(
is 2k. The squared
)
term on the right-hand side (r.h.s.) has the form (x +k) 2 ,
coefficient ofx 2
thatis x + . The following example illustratesthe idea.
2
Example1.22 Solve the following quadraticequationsbycompletingthe square:
(a) x 2 +8x+2=0
(b) 2x 2 −4x+1=0
Solution (a) Bycomparingx 2 +8x +2withx 2 +2kx +Aweseek = 4.Thusthesquaredterm
mustbe (x +4) 2 .Now
and so
Therefore
(x+4) 2 =x 2 +8x+16
x 2 +8x=(x+4) 2 −16
x 2 +8x+2=(x+4) 2 −16+2
=(x+4) 2 −14
At this stage we have completed the square. Finally, solvingx 2 + 8x + 2 = 0 we
have
x 2 +8x+2=0
(x+4) 2 −14=0
(x+4) 2 =14
x+4=± √ 14
x=−4± √ 14 = −7.7417,−0.2583
(b) 2x 2 −4x +1 = 0maybeexpressedasx 2 −2x +0.5 = 0.Comparingx 2 −2x +0.5
with x 2 + 2kx +A we see that k = −1. Thus the required squared term must be
(x −1) 2 .Now
and so
(x−1) 2 =x 2 −2x+1
x 2 −2x=(x−1) 2 −1
24 Chapter 1 Review of algebraic techniques
and
x 2 −2x+0.5=(x−1) 2 −1+0.5
=(x−1) 2 −0.5
Finally, solvingx 2 −2x +0.5 = 0 we have
(x−1) 2 −0.5=0
(x −1) 2 = 0.5
x−1=± √ 0.5
x=1± √ 0.5 = 0.2929,1.7071
1.4.2 Polynomialequationsofhigherdegree
Example1.23 Verifythatx = 1 andx = 2 areroots of
P(x)=x 4 −2x 3 −x+2=0
Solution P(x) =x 4 −2x 3 −x +2
P(1)=1−2−1+2=0
P(2)=2 4 −2(2 3 )−2+2=16−16−2+2=0
SinceP(1) = 0 andP(2) = 0, thenx = 1 andx = 2 are roots of the given polynomial
equation and are sometimes referred to as real roots. Further knowledge is required to
find the two remainingroots,whichareknown as complex roots. This topiciscovered
inChapter9.
Example1.24 Solve the equation
P(x)=x 3 +2x 2 −37x+52=0
Solution AsseeninExample1.21aformulacanbeusedtosolvequadraticequations.Forhigher
degree polynomial equations such simple formulae do not always exist. However, if
one of the roots can be found by inspection we can proceed as follows. By inspection
P(4) = 4 3 +2(4) 2 −37(4) +52 = 0sothatx = 4isaroot.Hencex−4isafactorof
P(x). ThereforeP(x) can bewritten as
P(x)=x 3 +2x 2 −37x+52=(x−4)(x 2 +αx+β)
where α and β mustnow befound.Expanding the r.h.s.gives
Hence
P(x)=x 3 +αx 2 +βx−4x 2 −4αx−4β
x 3 +2x 2 −37x+52=x 3 +(α−4)x 2 +(β−4α)x−4β
Bycomparingconstanttermson the l.h.s. and r.h.s.wesee that
52 = −4β
so that
β=−13
Bycomparing coefficients ofx 2 we see that
2=α−4
Therefore,
α = 6
1.4 Polynomial equations 25
Hence,P(x) = (x −4)(x 2 +6x −13).Thequadraticequationx 2 +6x −13 = 0canbe
solved using the formula
x = −6 ± √ 36 −4(−13)
2
= −6 ± √ 88
2
= 1.690,−7.690
We conclude thatP(x) = 0 has roots atx = 4,x = 1.690 andx = −7.690.
EXERCISES1.4
1 Calculatethe rootsofthe following linearequations:
(a) 4x−12=0
(b) 5t+20=0
(c) t+10=2t
y
(d)
2 −1=3
(e) 0.5t−6=0
(f) 2x+3=5x−6
3x
(g)
2 −17=0
x
(h)
2 + x 3 = 1
(i) 2x−1= x 2 +2
(j) 2(y+1)=6
(k) 3(2y −1) =2(y +2)
3
(l)
2 (t+3)=2 (4t −1)
3
2 Solvethe followingquadraticequations by
factorization:
(a) t 2 −5t+6=0
(b) x 2 +x−12=0
(c) t 2 =10t −25
(d) x 2 +4x−21=0
(e) x 2 −9x+18=0
(f) x 2 =1
(g) y 2 −10y+9=0
(h) 2z 2 −z−1=0
(i) 2x 2 +3x−2=0
(j) 3t 2 +4t+1=0
(k) 4y 2 +12y+5=0
(l) 4r 2 −9r+2=0
(m)6d 2 −d−2=0
(n) 6x 2 −13x+2=0
3 Complete the square forthe followingquadratic
equations andhence findtheir roots:
(a) x 2 +2x−8=0
(b) x 2 −6x−5=0
(c) x 2 +4x−6=0
(d) x 2 −14x−10=0
(e) x 2 +5x−49=0
4 Solve the following quadraticequationsusingthe
quadraticformula:
(a) x 2 +x−1=0
(b) t 2 −3t−2=0
26 Chapter 1 Review of algebraic techniques
(c) h 2 +5h+1=0
(d) 0.5x 2 +3x−2=0
(e) 2k 2 −k−3=0
(f) −y 2 +3y+1=0
(g) 3r 2 =7r+2
(h) x 2 −70=0
(i) 4s 2 −2=s
(j) 2t 2 +5t+2=0
(k) 3x 2 = 50
5 Calculate the rootsofthe followingpolynomial
equations:
(a) x 3 −6x 2 +11x−6=0givenx=1isaroot
(b) t 3 −2t 2 −5t+6=0givent=3isaroot
(c) v 3 −v 2 −30v+72=0givenv=4isaroot
(d) 2y 3 +3y 2 −11y +3 = 0 giveny = 1.5 is aroot
(e) 2x 3 +3x 2 −7x−5=0givenx=− 5 2 isaroot.
6 Checkthat the given values are roots ofthe following
polynomial equations:
(a) x 2 +x−2=0
(b) 2t 3 −3t 2 −3t+2=0
(c) y 3 +y 2 +y+1=0
x=−2,1
(d) v 4 +4v 3 +6v 2 +3v=0
t=−1,0.5
y=−1
v=−1,0
Solutions
1 (a) 3 (b) −4 (c) 10 (d) 8
(e) 12 (f) 3 (g)
(i) 2 (j) 2 (k)
34
3
7
4
(h)
(l)
2 (a) 2,3 (b) −4,3 (c) 5
6
5
31
7
(d) −7,3 (e) 3,6 (f) −1,1
(g) 1,9 (h) −0.5,1 (i) −2,0.5
(j) −1, − 1 3
(m) − 1 2 , 2 3
(k) −2.5, −0.5 (l) 0.25,2
(n)
1
6 ,2
3 (a) (x+1) 2 −9=0,x=−4,2
(b) (x −3) 2 −14 = 0,x = −0.7417,6.7417
(c) (x +2) 2 −10 = 0,x = −5.1623,1.1623
(d) (x −7) 2 −59 = 0,x = −0.6811,14.6811
(e)
(
x + 5 ) 2 221 − = 0,x = −9.9330,4.9330
2 4
4 (a) −1.6180,0.6180
(b) −0.5616,3.5616
(c) −4.7913,−0.2087
(d) −6.6056,0.6056
(e) −1,1.5
(f) −0.3028,3.3028
(g) −0.2573,2.5907
(h) −8.3666,8.3666
(i) −0.5931,0.8431
(j) −2, −0.5
(k) −4.0825,4.0825
5 (a) 1,2, 3 (b) −2,1,3
(c) −6,3,4
(e) −2.5, −0.6180,1.6180
(d) −3.3028,0.3028,1.5
1.5 ALGEBRAICFRACTIONS
An algebraicfraction has the form
algebraicfraction = numerator polynomial expression
=
denominator polynomial expression
For example,
3t+1
t 2 +t+4 , x 3
x 2 +1
are all algebraic fractions.
and
y 2 +1
y 2 +2y+3
1.5.1 Properandimproperfractions
1.5 Algebraic fractions 27
Whenpresentedwithafraction,wecannotethedegreeofthenumerator,sayn,andthe
degree of the denominator, sayd.
A fraction is proper ifd > n, that is the degree of the denominator is greater than
the degree of the numerator. Ifd nthen the fraction isimproper.
Example1.25 Classify the following fractions as either proper or improper. In each case, state the
degree of both numerator and denominator.
x 2 +9x−6
(a)
3x 3 +x 2 +100
(b) t3 +t 2 +9t−6
t 5 +9
(v +1)(v −6)
(c)
v 2 +3v+6
(z +2) 3
(d)
5z 2 +10z +16
Solution (a) The degree of the numerator,n, is 2. The degree of the denominator,d, is 3. Since
d >nthe fraction isproper.
(b) Heren = 3 andd = 5.The fraction isproper sinced >n.
(c) Heren = 2 andd = 2,sod =nand the fraction isimproper.
(d) Heren = 3 andd = 2,sod <nand the fraction isimproper.
1.5.2 Equivalentfractions
Consider the numerical fractions 1 2 and 2 . These fractions have the same value. Similarly,
2 3 , 6 20
4
and
9 30 all have the same value. The algebraic fractions x y , 2x xt
and
2y yt all
have the same value. Fractions with the same value arecalledequivalent fractions.
The value of a fraction remains unchanged if both numerator and denominator are
multiplied or divided by the same quantity. This fact can be used to write a fraction in
many equivalent forms.Consider forexample the fractions
(a) 2 x
(b)
2(x +1)
x(x +1)
(c) 2xt
x 2 t
These are all equivalent fractions. Fraction (b) can be obtained by multiplying both numerator
and denominator of fraction (a) by (x +1), so they are equivalent. Fraction (a)
canbeobtainedbydividingnumeratoranddenominatoroffraction(c)byxt andsothey
arealsoequivalent.
Example1.26 Show that
x +1
x +7
areequivalent.
and
x 2 +4x+3
x 2 +10x+21
28 Chapter 1 Review of algebraic techniques
Solution We factorize the numerator and denominator of the second fraction:
x 2 +4x+3 (x +1)(x +3)
=
x 2 +10x+21 (x +7)(x +3)
Dividingbothnumeratoranddenominatorby (x +3)resultsin x +1
x +7 .Sothetwogiven
fractions are equivalent.
Dividingbothnumeratoranddenominatorbyx+3isoftenreferredtoas‘cancelling
x+3’.
1.5.3 Expressingafractioninitssimplestform
Consider the numerical fraction 6 . To simplify this we factorize both numerator and
10
denominator and then cancel any common factors.Thus
6
10 = 2 ×3
2 ×5 = 3 5
Thefractions 6
10 and 3 5 haveidenticalvaluesbut 3 5 isinasimplerformthan 6
10 .Itisimportanttostressthatonlyfactorswhicharecommontobothnumeratoranddenominator
can becancelled.
Example1.27 Simplify
(a)
(b)
6x
18x 2
12x 3 y 2
4x 2 yz
Solution (a) Notethat18canbefactorizedto6×3andso6isafactorcommontobothnumerator
and denominator. Alsox 2 isx×x and soxis also a common factor. Cancelling the
common factors,6andx, produces
6x
18x 2 =
6x
(6)(3)(x)(x) = 1 3x
(b) The common factors are4,x 2 andy. Cancelling these factors gives
12x 3 y 2
4x 2 yz = 3xy
z
Example1.28 Simplify (a)
4
6x+4
(b) 6t3 +3t 2 +6t
3t 2 +3t
Solution (a) Factorizingbothnumeratoranddenominatorandcancellingcommonfactorsyields
4
6x+4 = (2)(2)
2(3x +2) = 2
3x+2
1.5 Algebraic fractions 29
(b) Factorizing and cancelling common factors yields
6t 3 +3t 2 +6t
3t 2 +3t
= 3t(2t2 +t+2)
3t(t +1)
= 2t2 +t+2
t +1
Note thatthe common factor, 3t, has been cancelled.
4t+8
Example1.29 Simplify(a) (b) 2y2 −y−1
t 2 +3t+2 y 2 −2y+1
Solution The numerator and denominator arefactorized and common factors are cancelled.
4t+8
(a)
t 2 +3t+2 = 4(t +2)
(t +2)(t +1) = 4
t +1
(b)
The common factor,t +2,has been cancelled.
2y 2 −y−1 (2y +1)(y −1)
= = 2y+1
y 2 −2y+1 (y −1) 2 y −1
The common factor,y−1,has been cancelled.
1.5.4 Multiplicationanddivisionofalgebraicfractions
Tomultiplytwoalgebraicfractionstogether,wemultiplytheirnumeratorstogether,and
multiplytheirdenominatorstogether, thatis
a
b × c d = a ×c
b ×d = ac
bd
Division isperformed by inverting the second fraction and then multiplying, thatis
a
b ÷ c d = a b × d c = ad
bc
Before multiplying or dividing fractions it is advisable to express each fraction in its
simplest form.
Example1.30 Simplify
x 2 +5x+6
2x−2
× x2 −x
x 2 +3x+2
Solution Factorizing numerators and denominators produces
x 2 +5x+6
2x−2
× x2 −x (x +2)(x +3) x(x −1)
= ×
x 2 +3x+2 2(x −1) (x +1)(x +2)
=
(x +2)(x +3)x(x −1)
2(x −1)(x +1)(x +2)
Commonfactors (x +2)and (x −1)canbecancelledfromnumeratoranddenominator
togive
(x +2)(x +3)x(x −1) (x +3)x
=
2(x −1)(x +1)(x +2) 2(x +1)
30 Chapter 1 Review of algebraic techniques
Hence
x 2 +5x+6
2x−2
× x2 −x x(x +3)
=
x 2 +3x+2 2(x +1)
Example1.31 Simplify
x 2 +8x+7
x 2 −6x
÷ x +7
x 3 +x 2
Solution The second fraction isinverted togive
x 2 +8x+7
x 2 −6x
× x3 +x 2
x +7
Factorizing numerators and denominators yields
(x +1)(x +7)
x(x −6)
× x2 (x+1)
(x+7)
= (x +1)(x +7)x2 (x +1)
x(x −6)(x +7)
Common factors ofxand (x +7) arecancelled leaving
(x +1)x(x +1)
x −6
which maybe written as
x(x +1) 2
x −6
1.5.5 Additionandsubtractionofalgebraicfractions
Themethodofaddingandsubtractingalgebraicfractionsisidenticaltothatfornumerical
fractions.
Each fraction is written in its simplest form. The denominators of the fractions are
then examined and the lowest common denominator (l.c.d.) is found. This is the simplest
expression that has the given denominators as factors. All fractions are then written
in an equivalent form with the l.c.d. as denominator. Finally the numerators are
added/subtractedand placedover the l.c.d. Consider the following examples.
Example1.32 Expressasasinglefraction
2
x +1 + 4
x +2
Solution Bothfractionsarealreadyintheirsimplestform.Thel.c.d.ofthedenominators, (x+1)
and (x +2), is found. This is (x +1)(x +2). Note that this is the simplest expression
that has bothx+1 andx+2 as factors.
Each fraction iswritten inan equivalent form with the l.c.d. as denominator. So
2
x +1 iswritten as 2(x +2)
(x +1)(x +2)
1.5 Algebraic fractions 31
and
4
x +2 iswritten as 4(x +1)
(x +1)(x +2)
Finally the numerators areadded. Hence we have
2
x +1 + 4
x +2 = 2(x +2)
(x +1)(x +2) + 4(x +1)
(x +1)(x +2)
= 2(x+2)+4(x+1)
(x +1)(x +2)
6x+8
=
(x +1)(x +2)
= 6x+8
x 2 +3x+2
Example1.33 Expressasasingle fraction
x 2 +3x+2
x 2 −1
− 2
2x+6
Solution Each fraction iswritten initssimplest form:
x 2 +3x+2
x 2 −1
=
(x +1)(x +2)
(x −1)(x +1) = x +2
x −1
2
2x+6 = 2
2(x +3) = 1
x +3
Thel.c.d.is (x −1)(x +3).Eachfractioniswritteninanequivalent formwithl.c.d.as
denominator:
So
x +2 (x +2)(x +3)
=
x −1 (x −1)(x +3) , 1
x +3 =
x 2 +3x+2
x 2 −1
x −1
(x −1)(x +3)
− 2
2x+6 = x +2
x −1 − 1
x +3
(x +2)(x +3)
=
(x −1)(x +3) − (x−1)
(x −1)(x +3)
= (x+2)(x+3)−(x−1)
(x −1)(x +3)
= x2 +5x+6−x+1
(x −1)(x +3)
= x2 +4x+7
(x −1)(x +3)
32 Chapter 1 Review of algebraic techniques
Engineeringapplication1.7
Resistorsinparallel
When carrying out circuit analysis it is often helpful to reduce the complexity of a
circuit by calculating an equivalent single resistance for several resistors connected
togetherinparallel.Thissimplifiedversionoftheoriginalcircuitthenbecomesmuch
easier to understand. Figure 1.4 shows the simplest case of two resistors connected
together inparallel.
R 1
R 2
Figure1.4
Two resistorsin parallel.
The equivalent resistance,R E
, ofthis simple network isfound from the formula
1
R E
= 1 R 1
+ 1 R 2
Bycombining the fractions on the r.h.s.we see
1
R E
= R 2 +R 1
R 1
R 2
and hence
R E
= R 1 R 2
R 1
+R 2
Consider the case whenR 1
andR 2
are equal and have valueR. The equivalent resistance
then becomes
R E
=
RR
R +R = R2
2R = R 2
So
R E
= R 2 = 0.5R
Thereforetheeffectofputtingtwoequalresistorsinparallelistoproduceanoverall
equivalent resistance which ishalf thatof a singleresistor.
EXERCISES1.5
1 Classifyeachfraction aseitherproperorimproper.
(a)
(d)
x +2
x 2 +2
x 2 +2
x +2
(b)
(e)
2
x +2
x 2 +2
x 2 +1
(c)
(f)
2 +x
2
x 2 +1
x 2 +2
2 Classifyeachofthe following algebraic fractions as
properorimproper.
3t+1 10v 2 +4v−6 6−4t+t 3
(a)
t 2 (b)
−1 3v 2 (c)
+v−1 6t 2 +1
(d)
9t+1
t +1
(e)
100f 2 +1
f 3 −1
(f)
(x +1)(x +2)
(x +3) 3
1.6 Solution of inequalities 33
(g)
(h)
(j)
(y +1)(y +2)(y +3)
(y +4) 3
(z +1) 10
(2z +1) 10 (i)
3k 2 +2k−1
k 3 +k 2 −4k+1
(q +1) 10
(q 2 +1) 6
3 Expresseachfraction in itssimplestform.
(a)
y 3 +2y
2y−y 2
(c) t2 +7t+12
t 2 +5t+4
x 2 +2x+1
(e)
x 2 −2x+1
4 Simplify the following:
(a)
(b)
x +1
x +3 × x +3
x +2
4
x 2 −1 × x +1
6
(b)
(d)
5x 2 +5
10x −10
x 2 −1
x 3 −2x 2 +x
(c)
(d)
(e)
x 2 +3x
x 3 +2x 2 × x2 +4x+4
4x
4xt +4t
xt 2 −t 2 × 4x2 −4
8x+8
x 2 +2x−15
x 2 +4x−5 × x2 +3x−4
x 2 −4x+3
5 Express asasingle fraction
(a)
(b)
(c)
(d)
(e)
3
x +6 + 2
x +1
4
x +2 − 2
(x +2) 2
2x+1
x 2 +x+1 + 4
x −1
x 2 +3x−18
x 2 +7x+6 − 2x2 +7x−4
x 2 +9x+20
3(x +1) 2(x −1)
x 2 +
+4x+4 x 2 −4
Solutions
1 (a) proper (b) proper (c) improper
(d) improper (e) improper (f) improper
2 (a) proper (b) improper (c) improper
(d) improper (e) proper
(f) proper
(g) improper (h) improper (i) proper
(j) proper
y 2 +2
3 (a)
2 −y
x +1
(d)
x(x −1)
(b)
(e)
x 2 +1
(c) t +3
2x−2 t +1
x 2 +2x+1
x 2 −2x+1
4 (a)
(c)
5 (a)
(c)
(d)
x +1
x +2
(x +2)(x +3)
4x 2
5x+15
(x +1)(x +6)
(b)
(d)
6x 2 +3x+3
(x−1)(x 2 +x+1)
−x 2 +x−14
(x +1)(x +5)
(b)
(e)
2
3(x −1)
2(x +1)
t
4x+6
(x +2) 2
(e)
5x 2 −x−10
(x+2) 2 (x−2)
x +4
x −1
1.6 SOLUTIONOFINEQUALITIES
Aninequality isany expression involving one ofthe symbols >, , <, .
a >bmeansaisgreaterthanb
a <bmeansaislessthanb
a bmeansaisgreaterthanorequal tob
a bmeansaislessthanorequal tob
Just as with an equation, when we add or subtract the same quantity to both sides of an
inequalitythe inequalitystill remains.Mathematically wehave
34 Chapter 1 Review of algebraic techniques
Ifa>bthen
a +k >b +k
a −k >b −k
addingkto both sides
subtractingkfromboth sides
We can make similarstatements fora b,a <banda b.
When multiplying or dividing both sides of an inequality extra care must be taken.
Suppose we wish to multiply or divide an inequality by a quantityk. Ifkis positive the
inequality remains the same;ifkisnegative then the inequality isreversed.
Ifa>bthen
ka>kb
a
k > b k
⎫
⎬
⎭
ifkispositive
ka<kb
a
k < b k
⎫
⎬
⎭
ifkisnegative
Notethatwhenkisnegativetheinequalitychangesfrom >to <.Similarstatementscan
bemadefora b,a <banda b.Whenaskedtosolveaninequalityweneedtostate
all the values of the variable forwhich the inequality istrue.
Example1.34 Solve the following inequalities:
(a) 3t+1>t+7
(b) 2−3z6+z
Solution (a) 3t +1>t +7
2t +1>7
2t >6
t >3
subtractingt fromboth sides
subtracting 1 fromboth sides
dividing both sides by2
Hence all values oft greaterthan3satisfy the inequality.
(b) 2−3z 6+z
−3z 4 +z
−4z 4
z −1
subtracting 2 fromboth sides
subtractingzfrom bothsides
dividing both sides by −4,rememberingtoreverse
the inequality
Hence all values ofzgreaterthanorequal to −1 satisfy the inequality.
We often have inequalities of the form α β > 0, α < 0, αβ > 0and αβ < 0tosolve.It
β
isuseful tonote thatif
α
β >0theneitherα>0andβ>0orα<0andβ<0
α
β <0theneitherα>0andβ<0orα<0andβ>0
αβ>0theneitherα>0andβ>0orα<0andβ<0
αβ<0theneitherα>0andβ<0orα<0andβ>0
The following examples illustratethis.
1.6 Solution of inequalities 35
Example1.35 Solve the following inequalities:
(a)
x +1 2t+3
>0 (b)
2x−6 t +2 1
x +1
Solution (a) Consider the fraction . For the fraction to be positive requires either of the
following:
2x−6
(i) x+1>0and2x−6>0.
(ii) x+1<0and2x−6<0.
We consider both cases.
Case(i)
x+1>0andsox>−1.
2x−6>0andsox>3.
Both of these inequalities are true only whenx > 3. Hence the fraction is positive
whenx > 3.
Case(ii)
x+1<0andsox<−1.
2x−6<0andsox<3.
(b)
Bothoftheseinequalitiesaretrueonlywhenx < −1.Hencethefractionispositive
whenx < −1.
x +1
Insummary,
2x−6 >0whenx>3orx<−1.
2t+3
t +2 1
2t+3
t +2 −10
t +1
t +2 0
Wenowconsiderthefraction t +1
t +2 .Forthefractiontobenegativeorzerorequires
either ofthe following:
(i) t+10andt+2>0.
(ii)t+1 0andt+2<0.
We consider each case inturn.
Case(i) t+10andsot −1.
t+2>0andsot>−2.
Hencetheinequalityistruewhent isgreaterthan −2andlessthanorequalto −1.
We write thisas −2 <t −1.
Case(ii) t+10andsot −1.
t+2<0andsot<−2.
36 Chapter 1 Review of algebraic techniques
Itisimpossibletosatisfybotht −1andt < −2andsothiscaseyieldsnovalues
oft.
Insummary, 2t+3
t +2
1when−2<t −1.
Example1.36 Solve the following inequalities:
(a)x 2 >4 (b)x 2 <4
Solution (a) x 2 >4
x 2 −4>0
(x−2)(x+2)>0
For the product (x −2)(x +2) tobepositive requires either
(i) x−2>0andx+2>0
or
(ii) x−2<0andx+2<0.
We examine eachcase inturn.
Case(i)
Case(ii)
x−2>0andsox>2.
x+2>0andsox>−2.
Bothofthesearetrueonly whenx > 2.
x−2<0andsox<2.
x+2<0andsox<−2.
Bothofthesearetrueonly whenx < −2.
Insummary,x 2 > 4 whenx > 2 orx < −2.
(b) x 2 <4
x 2 −4<0
(x−2)(x+2)<0
For the product (x −2)(x +2) tobenegative requires either
(i) x−2>0andx+2<0
or
(ii) x−2<0andx+2>0.
We examine eachcase inturn.
Case(i)
x−2>0andsox>2.
x+2<0andsox<−2.
No values ofxarepossible.
1.6 Solution of inequalities 37
Case(ii)
x−2<0andsox<2.
x+2>0andsox>−2.
Here we havex < 2 andx > −2. This is usually written as −2 < x < 2. Thus all
values ofxbetween −2 and 2 will ensure thatx 2 < 4.
Insummary,x 2 < 4 when −2 <x<2.
The previous example illustratesageneral rule.
Ifx 2 >kthenx> √ korx<− √ k.
Ifx 2 <kthen − √ k<x< √ k.
Example1.37 Solve the following inequalities:
(a)x 2 +x−6>0
(b)x 2 +8x+1<0
Solution (a) x 2 +x−6> 0
(x−2)(x+3)>0
For the product (x −2)(x +3) tobepositive requires either
(i) x−2>0andx+3>0
or
(ii) x−2<0andx+3<0.
Case(i)
x−2>0andsox>2.
x+3>0andsox>−3.
Bothofthese inequalitiesare satisfied only whenx > 2.
Case(ii)
x−2<0andsox<2.
x+3<0andsox<−3.
Bothofthese inequalitiesare satisfied only whenx < −3.
Insummary,x 2 +x−6 > 0 when eitherx > 2 orx < −3.
(b) The quadratic expression x 2 + 8x + 1 does not factorize and so the technique of
completingthe squareisused.
Hence
x 2 +8x+1=(x+4) 2 −15
(x+4) 2 −15<0
(x+4) 2 <15
38 Chapter 1 Review of algebraic techniques
Using the resultafter Example 1.36 wemay write
− √ 15<x+4< √ 15
− √ 15−4<x< √ 15−4
−7.873 <x<−0.127
EXERCISES1.6
1 Solve the following inequalities:
y
(a) 2x > 6 (b)
4 > 0.6
(c) 3t<12 (d) z+14
(e) 3v−24 (f) 6−k −1
6−2v
(g) <1
3
(h)m 2 2
(i)x 2 <9 (j) v 2 +110
(k) x 2 +10<6 (l) 2k 2 −31
(m) 10−2v 2 6 (n) 5+4k 2 >21
(o) (v −2) 2 25 (p) (3t +1) 2 >16
2 Solve the following inequalities:
(a) x 2 −6x+8>0
(b) x 2 +6x+80
(c) 2t 2 +3t−2<0
(d) y 2 −2y−240
(e) h 2 +6h+91
(f) r 2 +6r+70
(g) x 2 +4x−6<0
(h) 4t 2 +4t+912
(i)
x +4
x −5 >1 2t−3
(j)
t +6 6
(k)
3v+12
6−2v 0 (l) x 2
x +1 > 0
x
(m)
x 2 <0
+1
3y+1
(n)
y −2 2
(o) k 3 >0 (p)x 3 >8
t 2 +6t+9
(q) < 0
t +5
(r) (x +1)(x −2)(x +3) >0
Solutions
1 (a) x>3 (b)y>2.4 (c)t<4
(d) z3 (e) v2 (f)k7
(g) v> 3 √ (h)m 2orm− √ 2
2
(i) −3<x<3
(j) −3v3
(k) nosolution
(l) √ k 2ork− √ √ 2
(m)v 2orv− √ 2
(n) k>2ork<−2
(o) −3v7
(p) t>1ort<− 5 3
2 (a) x>4orx<2
(b) −4x −2
(c) −2<t< 1 2
(d) y6ory −4
(e) −4h −2
(f) √ r 2−3orr− √ 2 −3
(g) − √ 10−2<x< √ 10−2
(h) − 3 2 t 1 2
(i) x>5
(j) t − 39 4 ort>−6
(k) −4v<3
(l) x>−1withx≠0
(m)x<0 (n) −5y<2
(o) k>0 (p) x>2
(q) t<−5
(r) x>2or−3<x<−1
1.7 Partial fractions 39
1.7 PARTIALFRACTIONS
Givenasetoffractions,wecanaddthemtogethertoformasinglefraction.Forexample,
inExample 1.32 wesaw
2
x +1 + 4
x +2 = 2(x+2)+4(x+1)
(x +1)(x +2)
= 6x+8
x 2 +3x+2
Alternatively,ifwearegivenasinglefraction,wecanbreakitdownintothesumofeasier
fractions. These simple fractions, which when added together form the given fraction,
6x+8
arecalledpartialfractions.Thepartialfractionsof
x 2 +3x+2 are 2
x +1 and 4
x +2 .
Whenexpressingagivenfractionasasumofpartialfractionsitisimportanttoclassifythefractionasproperorimproper.Thedenominatoristhenfactorizedintoaproduct
of factors which can be linear and/or quadratic. Linear factors are those of the form
x
ax + b, for example 2x − 1, + 6. Repeated linear factors are those of the form
2
(ax +b) 2 , (ax +b) 3 andsoon,forexample (3x −2) 2 and (2x +1) 3 arerepeatedlinear
factors.Quadraticfactorsarethoseoftheformax 2 +bx+c,forexample2x 2 −6x+1.
1.7.1 Linearfactors
We can calculate the partial fractions of proper fractions whose denominator can be
factorized into linear factors.The following steps areused:
(1) Factorize the denominator.
(2) Eachfactorofthedenominatorproducesapartialfraction.Afactorax+bproduces
a partialfraction of the form A whereAisan unknown constant.
ax+b
(3) Evaluate the unknown constants of the partial fractions. This is done by evaluation
using a specific value ofxor by equating coefficients.
A linear factor ax +bin the denominator produces a partial fraction of the form
A
ax+b .
Example1.38 Express
6x+8
x 2 +3x+2
as itspartialfractions.
Solution The denominator isfactorized as
x 2 +3x+2=(x+1)(x+2)
40 Chapter 1 Review of algebraic techniques
Thelinearfactor,x +1,producesapartialfractionoftheform A
x +1 .Thelinearfactor,
x + 2, produces a partial fraction of the form B . A and B are unknown constants
x +2
whose values have tobefound. So we have
6x+8
x 2 +3x+2 = 6x+8
(x +1)(x +2) = A
x +1 + B
x +2
Multiplying both sides of Equation (1.8)by (x +1) and (x +2) weobtain
(1.8)
6x+8=A(x+2)+B(x+1) (1.9)
We now evaluateAandB. There are two techniques which enable us to do this: evaluation
using a specific value of xand equating coefficients. Each isillustratedinturn.
Evaluationusingaspecificvalueofx
We examine Equation (1.9). We will substitute a specific value ofxinto this equation.
Although any value can be substituted for x we will choose a value which simplifies
the equation as much as possible. We note that substitutingx = −2 will simplify the
r.h.s.oftheequationsincethetermA(x +2)willthenbezero.Similarly,substitutingin
x = −1 will simplify the r.h.s. because the termB(x +1) will then be zero. Sox = −1
andx = −2 are two convenient values to substitute into Equation (1.9). We substitute
each inturn.
Evaluating Equation (1.9) withx = −1 gives
−6+8=A(−1+2)
2 =A
Evaluating Equation (1.9) withx = −2 gives
−4 =B(−1)
B = 4
SubstitutingA = 2,B = 4into Equation (1.8) yields
6x+8
x 2 +3x+2 = 2
x +1 + 4
x +2
2
Thus the required partialfractions are
x +1 and 4
x +2 .
The constantsAandBcould have been found by equating coefficients.
Equatingcoefficients
Equation (1.9) may be writtenas
6x+8=(A+B)x+2A+B
Equating the coefficients ofxon both sides gives
6=A+B
Equating the constant termson both sides gives
8=2A+B
1.7 Partial fractions 41
Thus we have two simultaneous equations in A and B, which may be solved to give
A = 2 andB = 4 as before.
1.7.2 Repeatedlinearfactor
We now examine proper fractions whose denominators factorize into linear factors,
where one or more of the linear factors isrepeated.
Arepeated linear factor, (ax +b) 2 , produces two partialfractions of the form
A
ax+b +
B
(ax +b) 2
Arepeated linear factor, (ax +b) 2 , leadstopartialfractions
A
ax+b +
B
(ax +b) 2
Example1.39 Express
2x+5
x 2 +2x+1
as partialfractions.
Solution Thedenominatorisfactorizedtogive (x+1) 2 .Herewehaveacaseofarepeatedfactor.
A
This repeated factor generates partialfractions
x +1 + B
(x +1) 2.
Thus
2x+5
x 2 +2x+1 = 2x+5
(x +1) 2 = A
x +1 +
Multiplying by (x +1) 2 gives
2x+5=A(x+1)+B=Ax+A+B
B
(x +1) 2
Equating coefficients ofxgivesA = 2.Evaluation withx = −1 givesB = 3.So
2x+5
x 2 +2x+1 = 2
x +1 + 3
(x +1) 2
Example1.40 Express
14x 2 +13x
(4x 2 +4x +1)(x −1)
as partialfractions.
Solution Thedenominatorisfactorizedto (2x+1) 2 (x−1).Therepeatedfactor, (2x+1) 2 ,produces
partialfractions of the form
A
2x+1 + B
(2x +1) 2
42 Chapter 1 Review of algebraic techniques
The factor, (x −1), produces a partialfraction ofthe form C
x −1 . So
14x 2 +13x
(4x 2 +4x +1)(x −1) = 14x 2 +13x
(2x +1) 2 (x −1) = A
2x+1 + B
(2x +1) + C
2 x −1
Multiplying both sides by (2x +1) 2 (x −1) gives
14x 2 +13x =A(2x +1)(x −1) +B(x −1) +C(2x +1) 2 (1.10)
The unknown constantsA,B andC can now befound.
Evaluating Equation (1.10) withx = 1 gives
27 =C(3) 2
from which
C = 3
Evaluating Equation (1.10) withx = −0.5 gives
14(−0.5) 2 +13(−0.5) =B(−0.5 −1)
from which
B = 2
Finally, comparing the coefficients ofx 2 on both sides of Equation (1.10) wehave
14=2A+4C
Since wealready haveC = 3 then
A = 1
Hence we see that
14x 2 +13x
(4x 2 +4x +1)(x −1) = 1
2x+1 + 2
(2x +1) 2 + 3
x −1
1.7.3 Quadraticfactors
We now look at proper fractions whose denominator contains a quadratic factor, that is
a factorofthe formax 2 +bx +c.
Aquadraticfactor,ax 2 +bx +c, producesapartialfraction ofthe form
Ax+B
ax 2 +bx+c
Example1.41 Notingthatx 3 +2x 2 −11x −52 = (x −4)(x 2 +6x +13), express
3x 2 +11x +14
x 3 +2x 2 −11x−52
as partialfractions.
1.7 Partial fractions 43
Solution Thedenominatorhasalreadybeenfactorized.Thelinearfactor,x−4,producesapartial
fraction of the form A
x −4 .
The quadratic factor,x 2 +6x +13, will not factorize further into two linear factors.
Thus this factor generates a partialfraction of the form
Bx+C
x 2 +6x+13 . Hence
3x 2 +11x +14
(x −4)(x 2 +6x +13) = A
x −4 + Bx+C
x 2 +6x+13
Multiplying by (x −4) and (x 2 +6x +13) produces
3x 2 +11x +14 =A(x 2 +6x +13) + (Bx +C)(x −4) (1.11)
The constantsA,BandC can now be found.
Puttingx = 4 into Equation (1.11) gives
106 =A(53)
A = 2
Equating the coefficients ofx 2 gives
3=A+B
B = 1
Equating the constant termon both sides gives
Hence
14 =A(13) −4C
C = 3
3x 2 +11x +14
x 3 +2x 2 −11x−52 = 2
x −4 + x +3
x 2 +6x+13
1.7.4 Improperfractions
The techniques of calculating partial fractions in Sections 1.7.1 to 1.7.3 have all been
applied to proper fractions. We now look at the calculation of partial fractions of improper
fractions. The techniques described in Sections 1.7.1 to 1.7.3 are all applicable
to improper fractions. However, when calculating the partial fractions of an improper
fraction, an extra term needs to be included. The extra term is a polynomial of degree
n − d, where n is the degree of the numerator and d is the degree of the denominator.
A polynomial of degree 0 is a constant, a polynomial of degree 1 has the form
Ax +B, a polynomial of degree 2 has the form Ax 2 +Bx +C, and so on. For example,ifthenumeratorhasdegree3andthedenominatorhasdegree2,thepartialfractions
will include a polynomial of degree n −d = 3 − 2 = 1, that is a term of the form
Ax +B. If the numerator and denominator are of the same degree, the fraction is improper.
The partial fractions will include a polynomial of degree n −d = 0, that is a
constantterm.
44 Chapter 1 Review of algebraic techniques
Letthedegreeofthenumeratorbenandthedegreeofthedenominatorbed.Ifn d
thenthefractionisimproper.Improperfractionshavepartialfractionsinadditionto
thosegeneratedbythefactorsofthedenominator.Theseadditionalpartialfractions
take the form ofapolynomial of degreen−d.
Example1.42 Express as partialfractions
4x 3 +10x+4
2x 2 +x
Solution Thedegreeofthenumeratoris3,thatisn = 3.Thedegreeofthedenominatoris2,that
isd = 2.Thus,the fraction isimproper.
Nown −d = 1andthisisameasureoftheextenttowhichthefractionisimproper.
Thepartialfractionswillincludeapolynomialofdegree1,thatisAx +B,inadditionto
the partialfractions generated by the factors of the denominator.
The denominator factorizes tox(2x +1). These factors generate partial fractions of
the form C x + D
2x+1 . Hence
4x 3 +10x+4
2x 2 +x
= 4x3 +10x+4
x(2x +1)
Multiplying byxand 2x +1 yields
=Ax+B+ C x +
D
2x+1
4x 3 +10x +4 = (Ax +B)x(2x +1) +C(2x +1) +Dx (1.12)
The constantsA,B,C andDcan now beevaluated.
Puttingx = 0 into Equation (1.12) gives
4 =C
Puttingx = −0.5 into Equation (1.12) gives
−1.5 = − D 2
D = 3
Equating coefficients ofx 3 gives
4=2A
A = 2
Equating coefficients ofxgives
Hence
10=B+2C+D
B=−1
4x 3 +10x+4
2x 2 +x
=2x−1+ 4 x + 3
2x+1
1.7 Partial fractions 45
EXERCISES1.7
1 Calculate the partial fractionsofthe following
fractions:
6x+14 7−2x
(a)
x 2 (b)
+4x+3 x 2 −x−2
3x+6 8 −x
(c)
2x 2 (d)
+3x 6x 2 −x−1
(e)
13x 2 +11x +2
(x +1)(2x +1)(3x +1)
2 Calculatethe partialfractionsofthe following
fractions:
2x+7 4x−5
(a)
x 2 (b)
+6x+9 x 2 −2x+1
(c)
(d)
3x 2 +8x+6
(x 2 +2x +1)(x +2)
3x 2 −3x−2
(x 2 −1)(x −1)
(e)
3x 2 +7x+6
x 3 +2x 2
3 Expressthe following aspartialfractions:
(a)
x 2 +x+2
(x 2 +1)(x +1)
(b)
(c)
(d)
(e)
5x 2 +11x+5
(2x +3)(x 2 +5x +5)
4x 2 +5
(x 2 +1)(x 2 +2)
18x 2 +7x +44
(2x −3)(2x 2 +5x +7)
2x
(x 2 −x+1)(x 2 +x+1)
4 Express the following fractions aspartialfractions:
(a)
(c)
(d)
(e)
x 2 +7x+13
x +4
x 2 +8x+2
x 2 +6x+1
x 3 −2x 2 +3x−3
x 2 −2x+1
2x 3 +2x 2 −2x−1
x 2 +x
(b)
12x−4
2x−1
Solutions
1 (a)
(c)
(e)
2 (a)
(c)
(d)
(e)
3 (a)
2
x +3 + 4
x +1
2
x − 1
2x+3
(b)
(d)
2
x +1 + 1
2x+1 − 1
3x+1
2
x +3 + 1
(x +3) 2 (b)
1
x +1 + 1
(x +1) 2 + 2
x +2
2
x −1 − 1
(x −1) 2 + 1
x +1
2
x + 3 x 2 + 1
x +2
1
x +1 + 1
x 2 +1
1
x −2 − 3
x +1
3
2x−1 − 5
3x+1
4
x −1 − 1
(x −1) 2
(b)
(c)
(d)
(e)
2x
x 2 +5x+5 + 1
2x+3
1
x 2 +1 + 3
x 2 +2
5
2x−3 + 4x−3
2x 2 +5x+7
1
x 2 −x+1 − 1
x 2 +x+1
4 (a) x+3+ 1
x +4
(c) 1+ 2x+1
x 2 +6x+1
(d) x+ 2
x −1 − 1
(x −1) 2
(e) 2x − 1 x − 1
x +1
(b) 6+ 2
2x−1
46 Chapter 1 Review of algebraic techniques
TechnicalComputingExercises1.7
1 Useatechnical computing language suchas
MATLAB ® to verifythe solutionsto the problemsin
Exercises1.7. InMATLAB ® ,the functionresidue
calculates the partialfraction expansion. Forexample,
exercise1(a)wouldbesolvedbytypingthefollowing:
b = [6 14];
a=[143];
[r,p,k] = residue(b, a)
Notice how the coefficients ofthe numerator are input
in the formb = [6 14];thisisknown asarow
vector. The concept ofavector willbe discussed in
later chapters. Fornow itisadequate to treat thisasa
horizontal list ofnumbers which are passed to
MATLAB ® in a specific order.
Similarly, the coefficients ofthe denominator are
input bya = [1 4 3].
Each vectoris arranged with the coefficient ofthe
highestpower ofxfirst.
Theresult is:
r =
p =
4.0000
2.0000
-1
-3
k =
[]
Examining the solution we notethat the output for
bothrand pisarranged asavertical list.Thisway of
representing the outputis known asacolumn vector.
We note that the numbersreturnedin columnvector p
have anegative sign.Thisis because the result
calculated containsthepoles ofthe partial fraction
expansion. These are values ofthe variable which
make the denominator ofthe fractionzero. The
significance ofthiswillbecome clear later in the text
but fornow itisadequate to note the difference in
sign from what might have been expected.
1.8 SUMMATIONNOTATION
Inengineeringweoftenwanttomeasurethevalueofavariable,suchascurrent,voltage
or pressure.
Suppose we make three measurements of a variablex. We can label these measurementsx
1
,x 2
andx 3
. Inthis context, the numbers 1,2,3arecalledsubscripts.
Inmathematics,theGreeklettersigma,written ∑ ,standsfora‘sum’.Forexample,
the sumx 1
+x 2
+x 3
is written
3∑
k=1
x k
Notethatthesubscriptkrangesfrom1to3.Askrangesfrom1to3,x k
becomesx 1
then
x 2
and thenx 3
and the sigmasign tells us toadd up these quantities.
Ingeneral,
N∑
x k
=x 1
+x 2
+···+x N
k=1
This notation is often used to express some of the fundamental equations of electrical
circuitanalysis. Sometimes ‘Summation Notation’ isknown as ‘Sigma Notation’.
1.8 Summation notation 47
Engineeringapplication1.8
Kirchhoff’scurrentlaw
Kirchhoff’scurrentlaw,oftenabbreviatedtoKCL,providesoneofthefundamental
equationsforanalysingelectricalcircuits.Thelawstatesthatthesumofthecurrents
flowing out of any junction, or node, in a circuit must equal the sum of the currents
flowing into it.
This principle is intuitive as it has a direct analogy with fluid flow in connected
water pipes. Currents flowing into a junction are considered positive; those flowing
out of a junction are negative. It is then valid to say that the sum of the currents
at a junction is zero. If there are N currents at the junction, denoted I 1
,I 2
,...,I N
,
then
I 1
+I 2
+I 3
+···+I N−1
+I N
=0
This can be expressed using the summation notation as
N∑
I k
=0
k=1
HereI k
means ‘the current,I, in branchk’. The first equation can be produced from
the summation notation by first substitutingk = 1, thenk = 2, right up tok = N.
Theexpressionbelowthesummationsymboltellsyouwheretostartandthevariable
to be substituted, and the number above the summation symbol indicates where to
stopcounting.Summationnotationisaverycompactandprecisewayofexpressing
KCL forany number ofcurrents atanode.
Consider the node shown inFigure 1.5.
Branch 1 Branch 3
1 A
2 A
3 A 2 A Figure1.5
Acircuit node with four separate branches.The currents are
Branch 2 Branch 4 given in amperes(oramps,A).
It can be seen that the total current flowing into the node is 1 + 3 = 4 amps. The
current flowing outof the node is2 +2 = 4 amps. Clearly,
Total current flowing into node =total currentflowing outof node
Alternatively, using the summation formof KCL wehave
4∑
I k
=I 1
+I 2
+I 3
+I 4
=0=1+3−2−2
k=1
Notethatforcurrentsflowingoutofthenodeanegativesignisusedandforcurrents
flowing into the node a positive sign is used. This is equivalent to considering the
currents separately as inward and outward flowing currents and equating the two.
Suppose for a moment that we did not know the current in branch 4 and,
furthermore, itwas notlabelled with anarrow toshow the direction of current flow.
➔
48 Chapter 1 Review of algebraic techniques
This situation is likely to occur in a circuit problem in electronics. There are
two options for labelling the current flow direction, and these are summarized in
Figure 1.6.
1 A
2 A
1 A
2 A
3 A I 4 3 A I 4 Figure1.6
Two different ways ofdefining the
Branch 4
Branch 4 currentdirection in Branch4.
1+3−2−I 4
=0 1+3−2+I 4
=0
I 4
=2 I 4
=−2
Note that the two solutions are both correct butI 4
= −2 has a negative sign, which
simply indicates that the current flows in the opposite direction to the arrow drawn
ontheright-handdiagram.Itdoesnotmatterwhichwayroundthearrowismarked,
as long as weobserve the sign.
Engineeringapplication1.9
Kirchhoff’svoltagelaw
Kirchhoff’s voltage law, often abbreviated to KVL, provides another of the fundamental
equations for analysing electrical circuits. The law states that the sum of the
voltages around a closed loop equals zero. It is often written down in the form of a
summation, as follows:
N∑
V k
=0
k=1
For the circuit shown in Figure 1.7 there are three possible loops to which we
could apply KVL.
V d
V a
V c
V b
V e
Figure1.7
A simple circuitto illustrate Kirchhoff’svoltage law.
In this example an ideal voltage source and resistors are used, although any components
could be substituted as KVL applies universally. Note that we ‘walk around’
1.8 Summation notation 49
thecircuitwhenwritingdowntheequations.Ifthearrowisinthedirectionoftravel
then it is given a positive sign; if it opposes the direction of travel it is given a
negative sign.
The equations are
V b
V a
V d
V d
V c
V e
V c
V a
V b
V e
V a
−V b
−V c
=0 V a
−V b
−V d
−V e
=0 V c
−V d
−V e
=0
If the equations are solved and the voltage has a negative sign it indicates that the
polarityisoppositetothedirectionofthevoltagearrowdrawnonthediagram.KVL
and KCL are the fundamental circuit laws that allow networks of electronic components
to be mathematically analysed. Although they are simple in concept they are
very powerful techniques.
EXERCISES1.8
1 Writeoutfullywhatismeantbyeachofthefollowing
expressions:
(a) ∑ 4
k=1 x k
(c) ∑ 7
k=1 x k
(b) ∑ 4
i=1 x i
(d) ∑ 3
k=1 x 2 k
(e) ∑ 4
j=1 (x j −2) 3 (f) ∑ 3
n=0 (2n +1) 2
4 Determine the current,I,at eachofthe following
circuitnodes:
(a)
3 A
I
1 A
(b)
3 A
I
1 A
2 Writeoutfully
(a) ∑ 4
k=1 (−1) k k (b) ∑ 5
k=1 (−1) k+1 k 2
3 Writethe followingsums more concisely byusing
sigma notation:
(c)
3 A
I
1 A
(d)
I 1
I 2
I 3
I
(a) 1 3 +2 3 +3 3 +···+10 3
1
(b)
1 − 1 2 + 1 3 − 1 4 +···− 1 12
(c) 1+ 1 3 + 1 5 + 1 7
50 Chapter 1 Review of algebraic techniques
5 FindV a in each ofthe followingcircuits:
(a)
6 FindV a andV b usingKVL.
1 V
1 V
2 V
3 V
V a
1 V
V a
V b
(b)
−1 V
2 V
V a
Solutions
1 (a) x 1 +x 2 +x 3 +x 4 (b)x 1 +x 2 +x 3 +x 4
(c) x 1 +x 2 +x 3 +x 4 +x 5 +x 6 +x 7
(d) x 2 1 +x2 2 +x2 3
(e) (x 1 −2) 3 + (x 2 −2) 3 + (x 3 −2) 3 + (x 4 −2) 3
(f) 1+3 2 +5 2 +7 2
2 (a) −1+2−3+4
(b) 1−4+9−16+25
∑
3 (a) 10
k=1 k 3
(b)
(c)
∑ 12 (−1) k+1
k=1 k
∑ 3n=0 1
2n+1 or ∑ 4 1
n=1 2n−1
4 Allsolved usingKCL
(a) 3−1−I=0 · · · I = 2
(b) 3+1−I=0 · · · I = 4
(c) 3+1+I=0 · · · I=−4
(d) I 1 +I 2 +I 3 −I=0, · · · I=I 1 +I 2 +I 3
orI= ∑ 3
k=1 I k
5 Both solved usingKVL
(a) 2−1−V a =0 · · ·Va =1
(b) 2+(−1) +V a =0 · · ·Va =−1
6 3−1−V a =0 · · ·Va =2
3−1−1−V b =0 · ·Vb =1
orV a −1−V b =0 · · · by substitution forV a ,
V b =1
REVIEWEXERCISES1
1 Simplify eachofthe followingasfar aspossible: 2 Simplify asfar aspossible:
(a) 7 6 7 4 6 3
(b)
6 −2 (c) (3 4 ) −2 (a) x 7 x −3 (b) (x 2 ) 4 (c)
√
(d) 3 4 6 2 (e) (3 2/3 4 1/3 ) 6 (f) 10−3
10 −4 (d) (y −2 ) −1 (e) y 1/3 yy 2
(√ ) −1 x
x
Review exercises 1 51
3 Removethe brackets and simplify:
( ) −1
(a) (2x 2 y) 3 (b) (6a 2 b 3√ c) 2 (c)
y −2
2
(d) (x 2 y −1 ) 0.5
( )
3 −1 x −2 −2
(e)
y −3
4 Expressthe following astheir partialfractions:
(a)
(c)
(e)
3x+11
(x −3)(x +7)
6x 2 −2
2x 2 −x
4x−11
2x 2 +15x+7
(b)
(d)
−3−x
x 2 −x
2x 2 −x−7
(x +1)(x −1)(x +2)
5 Convert the following intoasingle fraction:
(a)
(b)
(c)
1
x + 3
x +2 + 6
8x+4
1
s + 2 s 2 + 3s+4
8s+6
6
s + 10
s 2 − s +1
(s +2)(s +3) + s −1
(s +4)(s +3)
6 Expressthe following aspartialfractions:
(a)
(c)
(e)
(f)
(g)
(i)
(k)
(m)
(o)
(q)
(s)
5x
(x +1)(2x −3)
y +3
y 2 +3y+2
2z 2 +15z +30
(z +2)(z +3)(z +6)
(b)
(d)
24x 2 +33x +11
(2x +1)(3x +2)(4x +3)
3x+2
x 2 +5x+6
1
t 2 +3t+2
s +3
(s +1) 2 (h) 2k2 +k+1
k 3 −k
x 3
x 2 +1
s 2
s 2 +1
6d 2 +15d+8
(d +1) 2 (d +2)
−y−1
(y 2 +1)(y −1)
t 2 +t−2
(t −2) 2 (t +1)
x 3 +4x 2 +7x+5
x 2 +3x+2
(j)
(l)
t +5
(t +3) 2
8x−15
4x 2 −12x+9
(n) 2x2 +x+3
x 2 +2x+1
(p)
s 2 −8s−5
(s 2 +s+1)(s−4)
(r) 2s3 +3s 2 −s−4
s 2 +s−1
7 Solve the following quadraticequationsusingthe
quadraticformula:
(a) x 2 +10x+2=0
(b) y 2 −6y−3=0
(c) 2t 2 +2t−9=0
(d) 3z 2 −9z−1=0
(e) 5v 2 +v−6=0
8 Solve the quadratic equationsin Question7by
completing the square.
9 Solve
x 3 −4x 2 −25x+28=0
givenx = 7 is aroot.
10 Solve the following inequalities:
(a) 6t−14
(c) 1−2v<v+4
(e) (x−2) 2 36
x −3
(g)
x +1 0
x
(i)
2 3 x
(b) −63r6
(d) 2 x −2
3
(f)x 2 −2x−3<0
(h)x2 −8x+50
(j) x2 −2x−3
x −5
11 Express eachfraction in its simplestform.
3ab 2
(a)
12ab
3x 2 +3x
(d)
6x 2 −3x
(b) 6x2 y 2 z
3xy 3 (c)
z
xyz −2x 2 y 2 z
(e)
x 2 y 2 −2x 3 y 3
12 Express eachfraction in its simplestform.
x 2 +2x−15
(a)
x 2 −2x−3
2x 2 +7x−4
(c)
2x 2 −3x+1
(e)
x 3 −2x 2 +x−2
x 3 +x 2 +x+1
> 0
9t+6
12−3t
y 2 +4y−12
(b)
y 2 +13y+42
3x 2 t +3xt −3t
(d)
4x 2 z+4xz−4z
13 Express asasingle fraction in itssimplestform.
x +1
(a)
x +6 × x +6
x +2
3x−6
(b)
xy+2y × xy+3y
4x−8
(c)
(d)
(e)
x 2 −1
4
÷ x −1
6
x 2 −9x
x +1 ÷ x −9
x 3 +x 2
x 2 −5x−6
x 2 +x−42 ÷ x2 −1
x 2 +6x−7
52 Chapter 1 Review of algebraic techniques
14 Expressasasingle fraction:
(a)
(b)
5
x +6 + 3
x +1
3x
2x−1 − 4
x +5
(c)
x +1
x 2 −5x−6 + 5x
x +3
(d) x+1+ 2
x −3
(e) 2x−3+ 1
x +1 −
x
x 2 +1
Solutions
1 (a) 7 10 (b) 6 5 (c) 3 −8
(d) 3 2 6 (e) 3 4 4 2 (f) 10
2 (a) x 4 (b) x 8 (c) √ x
(d) y 2 (e) y 10/3
3 (a) 8x 6 y 3 (b) 36a 4 b 6 c (c) 2y 2
(d) xy −0.5 9x 4
(e)
y 6
2
4 (a)
x −3 + 1
x +7
3
(b)
x − 4
x −1
(c) 3+ 2 x − 1
2x−1
2
(d)
x +1 − 1
x −1 + 1
x +2
3
(e)
x +7 − 2
2x+1
5 (a)
(b)
(c)
6 (a)
(b)
(c)
(d)
(e)
(f)
19x 2 +22x +4
2x(x +2)(2x +1)
3s 3 +12s 2 +22s +12
2s 2 (4s +3)
2(3s 4 +30s 3 +120s 2 +202s +120)
s 2 (s +2)(s +3)(s +4)
1
x +1 + 3
2x−3
7
x +3 − 4
x +2
2
y +1 − 1
y +2
1
t +1 − 1
t +2
2
z +2 − 1
z +3 + 1
z +6
1
2x+1 + 3
3x+2 − 2
4x+3
1
(g)
s +1 + 2
(s +1) 2
1
(h)
k +1 + 2
k −1 − 1 k
(i) x−
x
x 2 +1
1
(j)
t +3 + 2
(t +3) 2
(k) 1− 1
s 2 +1
4
(l)
2x−3 − 3
(2x −3) 2
4
(m)
d +1 − 1
(d +1) 2 + 2
d +2
(n) 2− 3
x +1 + 4
(x +1) 2
(o)
y
y 2 +1 − 1
y −1
2s+1
(p)
s 2 +s+1 − 1
s −4
11
(q)
9(t −2) + 4
3(t −2) 2 − 2
9(t +1)
3
(r) 2s+1−
s 2 +s−1
(s) x+1+ 1
x +1 + 1
x +2
7 (a) −9.7958,−0.2042
(b) −0.4641,6.4641
(c) −2.6794,1.6794
(d) −0.1073,3.1073
(e) −1.2,1
8 (a) (x+5) 2 −23=0
(b) (y−3) 2 −12=0
[ (
(c) 2 t + 1 ) ] 2
− 19 = 0
2 4
Review exercises 1 53
[ (
(d) 3 z − 3 ) ] 2
− 31 = 0
2 12
[ (
(e) 5 v + 1 ) ] 2
− 121 = 0
10 100
9 x=−4,1,7
Solutionssame asforQuestion 7
10 (a)t 5 (b) −2r2
6
(c) v>−1 (d)x8
(e) x −4orx8
(f) −1<x<3
(g) x<−1orx3
(h) 4− √ 11x4+ √ 11
(i) √ 0<x 6,x − √ 6
(j) x>5or−1<x<3
b 2x
11 (a) (b) (c)
4 y
(d)
x +1
2x−1
(e)
z
xy
3t+2
4 −t
x +5
12 (a)
x +1
3t
(d)
4z
13 (a)
(c)
14 (a)
(b)
(c)
(d)
(e)
x +1
x +2
3(x +1)
2
y −2
(b)
y +7
x −2
(e)
x +1
(b)
3(x +3)
4(x +2)
8x+23
(x +1)(x +6)
3x 2 +7x+4
(2x −1)(x +5)
5x 2 −29x+3
(x −6)(x +3)
x 2 −2x−1
x −3
2x 4 −x 3 −x 2 −2x−2
(x +1)(x 2 +1)
(c)
x +4
x −1
(d) x 3 (e) 1
2 Engineeringfunctions
Contents 2.1 Introduction 54
2.2 Numbersandintervals 55
2.3 Basicconceptsoffunctions 56
2.4 Reviewofsomecommonengineeringfunctionsandtechniques 70
Reviewexercises2 113
2.1 INTRODUCTION
The study of functions is central to engineering mathematics. Functions can be used to
describe the way quantities change: for example, the variation in the voltage across an
electroniccomponentwithtime,thevariationinpositionofanelectricmotorwithtime
and the variation inthe strengthof a signalwith both position and time.
Inthischapterweintroduceseveralconceptsassociatedwithfunctionsbeforegoing
ontocatalogue anumber ofengineering functions inSection2.4.Muchofthematerial
ofSection2.4willalreadybefamiliartothereaderandsothissectionshouldbetreated
asareferencesectiontobedippedintowhenevernecessary.Anumberofmathematical
methods are also included in Section 2.4, most of which will be familiar but they have
been collected together inorder tomake the book complete.
When trying to understand a mathematical function it is always useful to sketch a
graphinordertoobtainanideaofitsbehaviour.Thereaderisencouragedtosketchsuch
graphs whenever a new function is met. Graphics calculators are now readily available
andtheymakethistaskrelativelyeasy.Ifyoupossesssuchacalculatorthenitwouldbe
useful to make use of it whenever a new function is introduced. Software packages are
alsoavailabletoallowsuchplotstobecarriedoutonacomputer.Thesecanbeusefulfor
plotting more complicated functions and ones that depend on more than one variable.
We examine functions of more than one variable inChapter 25.
Throughoutthebookwemakeuseofthetermmathematicalmodel.Whendoingso
wemeananidealizationofanengineeringsystemoraphysicalsituationsothatitcanbe
describedbymathematicalequations.Toreflectanengineeringsystemveryaccurately,
a sophisticated model, consisting of many interrelated equations, may be needed. Although
accurate, such a model may be cumbersome to use. Accuracy can be sacrificed
2.2 Numbers and intervals 55
in order to achieve a simple, easy-to-use model. A judgement is made as to when the
right blend of accuracy and conciseness is achieved. For example, the most common
mathematicalmodelforaresistorusesOhm’slawwhichstatesthatthevoltageacrossa
resistor equals the current through the resistor multiplied by the resistance value of the
resistor,thatisV =IR.However,thismodelisbasedonanumberofsimplifications.It
ignores any variation in current density across the cross-section of the resistor and assumes
a singlecurrent value isacceptable. Italsoignores the factthatifalargeenough
voltageisplacedacrosstheresistorthentheresistorwillbreakdown.Inmostcasesitis
worthaccepting these simplifications inorder toobtain a concise model.
Havingobtainedamathematicalmodel,itisthenusedtopredicttheeffectofchanging
elements or conditions within the actual system. Using the model to examine these
effects isoften cheaper, safer and more convenient than usingthe actual system.
2.2 NUMBERSANDINTERVALS
Numberscanbegroupedintovariousclasses,orsets.Theintegersarethesetofnumbers
{...,−3,−2,−1,0,1,2,3,...}
denotedby Z.Thenaturalnumbersare {0,1,2,3,...}andthissetisdenotedby N.The
positive integers, denoted by N + , are given by {1,2,3,...}. Note that some numbers
occur inmore than one set,thatisthe setsoverlap.
A rational number has the form p/q, where p andqare integers withq ≠ 0. For
example,5/2,7/118, −1/9and3/1areallrationalnumbers.Thesetofrationalnumbers
isdenotedby Q.Whenrationalnumbersareexpressedasadecimalfractiontheyeither
terminate orrecur infinitely.
}
5
can be expressed as2.5 These decimal fractions terminate,
2
1
can be expressed as0.125 thatisthey are of finitelength.
8
}
1
can be expressed as0.111111... These areinfinitely
9
1
can be expressed as 0.090909... recurring decimal fractions.
11
A number which cannot be expressed in the form p/q is called irrational. When
writtenasadecimalfraction,anirrationalnumberisinfiniteinlengthandnon-recurring.
The numbers π and √ 2 areboth irrational.
It is useful to introduce the factorial notation. We write 3! to represent the product
3 ×2×1. The expression 3! is read as ‘factorial 3’. Similarly 4! is a shorthand way of
writing4 ×3×2×1.Ingeneral, forany positive integer,n, we can write
n! =n(n −1)(n −2)(n −3)...(3)(2)(1)
Itisusefultorepresentnumbersbypointsontherealline.Figure2.1illustratessome
rational and irrational numbers marked on the real line. Numbers which can be represented
by points on the real line are known as real numbers. The set of real numbers
is denoted by R. This set comprises all the rational and all the irrational numbers. In
Chapter9weshallmeetcomplexnumberswhichcannotberepresentedaspointsonthe
–2 –– 3
5
2
–1 0 1 √ 2 2 2
– 3 p 4
Figure2.1
Both rationalandirrational numbers are representedonthe real line.
56 Chapter 2 Engineering functions
–6 –4 –1 0 2 3 4
Figure2.2
The intervals (−6,−4),[−1,2], (3,4]depicted onthe real line.
real line. The real line extends indefinitely to the left and to the right so that any real
number can be represented.
Sometimesweareinterestedinonlyasmallsection,orinterval,oftherealline.We
write [1, 3] to denote all the real numbers between 1 and 3 inclusive, that is 1 and 3 are
includedintheinterval.Thustheinterval[1,3]consistsofallrealnumbersx,suchthat
1 x3. The square brackets, [ ], are used to denote that the end-points are included
in the interval and such an interval is said to be closed. The interval (1, 3) consists of
all real numbers x, such that 1 < x < 3. In this case the end-points are not included
and the interval is said to be open. Brackets, (), denote open intervals. An interval may
be open at one end and closed at the other. For example, (1, 3] is open at the left and
closed at the right. It consists of all real numbersx, such that 1 < x 3, and is known
as a semi-open interval. Open and closed intervals can be represented on the real line.
A closed end-point is denoted by •; an open end-point is denoted by ◦. The intervals
(−6,−4), [−1,2] and (3, 4] are illustratedinFigure 2.2.
An upper bound of a set of numbers is any number which is greater than or equal
toeverynumberinthegivenset.So,forexample,7isanupperboundfortheset[3,6].
Clearly, 7 isgreater than every number inthe interval[3, 6].
Alowerboundofasetofnumbersisanynumberwhichislessthanorequaltoevery
number inthe given set.For example, 3 isalower bound forthe set (3.7,5).
Note that upper and lower bounds are not unique. Both 3 and 10 are upper bounds
for (1, 2). Both −1 and −3 are lower bounds for[0, 6].
TechnicalcomputinglanguagessuchasMATLAB ® usuallyhavefunctionsthatautomaticallygenerateasetofnumberswithinaparticularinterval.InMATLAB
® wecould
generate a set of time values,t, by typing:
t= 0:0.1:1
Thisgeneratesasetofrealnumbersfromtheinterval[0,1]storedinarowvectort,each
individual number being separated by an increment of 0.1. The values oft generated
are:
0 0.1000 0.2000 0.3000 0.4000 0.5000 0.6000 0.7000
0.8000 0.9000 1.0000
2.3 BASICCONCEPTSOFFUNCTIONS
Loosely speaking, we can think of a function as a rule which, when given an input,
producesasingleoutput.Ifmorethanoneoutputisproduced,theruleisnotafunction.
Considerthefunctiongivenbytherule:‘doubletheinput’.If3istheinputthen6isthe
output. Ifxisthe input then 2x isthe output, as shown inFigure 2.3.
If the doubling function has the symbol f wewrite
f:x→2x
or more compactly,
f(x)=2x
2.3 Basic concepts of functions 57
f
f
double
double
3 6 x
2x
the input
the input
Figure2.3
Thefunction: ‘double the input’.
Thelastformisoftenwrittensimplyas f = 2x.If f (x)isafunctionofx,thenthevalue
of the function whenx = 3,forexample, iswrittenas f (x = 3) or simplyas f (3).
Example2.1 Given f (x) = 2x +1 find
(a) f(3)
(c) f(−1)
(e) f(2α)
(g) f(t+1)
(b) f(0)
(d) f(α)
(f) f(t)
Solution (a) f(3) = 2(3) +1 = 7
(b) f(0)=2(0)+1=1
(c) f(−1) =2(−1) +1 = −1
(d) f(α)isthevalueof f(x)whenxhasavalueof α,hence f(α) = 2α +1
(e) f(2α)=2(2α)+1=4α+1
(f) f(t)=2t+1
(g) f(t+1)=2(t+1)+1=2t+3
Observe from Example 2.1 that it is the rule that is important and not the letter being
used. Both f (t) = 2t + 1 and f (x) = 2x + 1 instruct us to double the input and then
add 1.
2.3.1 Argumentofafunction
Theinputtoafunctionisoftencalledtheargument.InExample2.1(d)theargumentis
α, whileinExample 2.1(e) the argument is2α.
Example2.2 Given f (x) = x , write down
5
(a) f(5x)
(b) f(−x)
(c) f(x+2) (d) f(x 2 )
Solution (a) f(5x) = 5x
5 =x (b) f(−x)=−x 5
(c) f(x+2)= x +2
5
Example2.3 Giveny(t) =t 2 +t, write down
( t
(a) y(t +2) (b) y
2)
(d) f(x 2 )= x2
5
58 Chapter 2 Engineering functions
Solution (a) y(t +2) = (t +2) 2 + (t +2) =t 2 +5t+6
( ( ) t t 2 ( t
(b) y = + =
2)
2 2)
t2 4 + t 2
2.3.2 Graphofafunction
A function may be represented in graphical form. The function f (x) = 2x is shown in
Figure2.4.Notethatthefunctionvaluesareplottedverticallyandthexvalueshorizontally.Thehorizontalaxisisthencalledthexaxis.Theverticalaxisiscommonlyreferred
toastheyaxis, sothat weoften write
y=f(x)=2x
f (x)
2
–2
1 x
Sincexandycan have a number of possible values, they are called variables:xis the
independent variable and y is the dependent variable. Knowing a value of the independent
variable,x, allows us to calculate the corresponding value of the dependent
variable,y. To show this dependence we often writey(x). The set of values thatxis allowedtotakeiscalledthedomainofthefunction.Adomainisoftenanintervalonthe
x axis. For example, if
–4
Figure2.4
Thefunction: f (x) = 2x.
f(x)=3x+1 −5x10 (2.1)
thedomainofthefunction, f,istheclosedinterval[−5,10].Ifthedomainofafunction
isnot explicitly given itis taken tobe the largestset possible. For example,
g(x)=x 2 −4 (2.2)
hasadomainof (−∞,∞)sincegisdefinedforeveryvalueofxandthedomainhasnot
been given otherwise. The set of values that the function takes on is called the range.
The range of f (x) in Equation (2.1) is [−14,31]; the range ofg(x) in Equation (2.2) is
[−4,∞).
Wenowconsiderplottingthefunction f (t) =t 2 for0 t < 100inatechnicalcomputing
language. Firstwegenerate a number set as shown inSection 2.2.
t= 0:1:100
Then we produce a graph of the function by using the MATLAB ® plot command to
give the following
plot(t, t.^2)
Example2.4 Consider the function, f,given by the rule:‘squarethe input’. This can bewritten as
f(x)=x 2
Theruleandthegraphof f areshowninFigure2.5.Thedomainof f is (−∞,∞)and
the range is[0,∞).
2.3 Basic concepts of functions 59
square
x x
the input
2
f
f(x)
9
4
–2 0 3 x
Figure2.5
Thefunction: ‘square the input’.
Manyvariablesofinteresttoengineers,forexamplevoltage,resistanceandcurrent,can
be related by means of functions. We try to choose an appropriate letter for a particular
variable; so, forexample,t is usedfor time andPforpower.
Engineeringapplication2.1
Functiontomodelthepowerdissipationinaresistor
Recall from Engineering application 1.1 that the power, P, dissipated by a resistor
depends on the current,I, flowing through the resistance,R. The relationship is
given by
P=I 2 R
The power dissipated in the resistor depends on the square of the current passing
throughit.InthiscaseI istheindependent variableandPisthedependent variable,
assumingRremainsconstant.Thefunctionisgivenbytherule:‘squaretheinputand
multiply by the constantR’, and the input to the function isI. The output from the
function isP. This isillustratedinFigure 2.6,for the casesR = 4 andR = 2.
I
square the
input and
multiply
by R
Figure2.6
The function:P =I 2 R.
P
P
16
8
4
2
R = 4
R = 2
–1 2 I
This model for a resistor only approximates the behaviour of the device. In practice,changesinthetemperatureoftheresistorleadtoslightchangesintheresistance
value. If the current through the resistor is excessively high then the resistor overheatsandispermanentlydamaged.Itnolongerhasthecorrectresistancevalue.The
amount of power that a resistor can handle depends on the materials that have been
usedinitsconstruction.Agoodcircuitdesignerwouldcalculatetheamountofpower
tobedissipatedandthenallowasuitablesafetymargintoensurethattheresistorcannot
beoverloaded.
60 Chapter 2 Engineering functions
2.3.3 One-to-many
Some rules relating input to output are not functions. Consider the rule: ‘take plus or
minus the square rootof the input’, that is
x→± √ x
Now, for example, if 4 is the input, the output is ± √ 4 which can be 2 or −2. Thus a
single input has produced more than one output. The rule is said to be one-to-many,
meaning that one input has produced many outputs. Rules with this property are not
functions. For a ruletobe a function there mustbe a single output forany given input.
Bydefiningarulemorespecifically,itmaybecomeafunction.Forexample,consider
therule:‘takethepositivesquarerootoftheinput’.Thisruleisafunctionbecausethere
is a single output for a given input. Note that the domain of this function is [0,∞) and
the range isalso[0,∞).
2.3.4 Many-to-oneandone-to-onefunctions
Consideragainthefunction f (x) =x 2 giveninExample2.4.Theinputs2and −2both
producethesameoutput,4,andthefunctionissaidtobemany-to-one.Thismeansthat
many inputs produce the same output. A many-to-one function can be recognized from
itsgraph.Ifahorizontallineintersectsthegraphinmorethanoneplace,thefunctionis
many-to-one. Figure 2.7 illustrates a many-to-one function,g(x). The inputsx 1
,x 2
,x 3
andx 4
all produce the same output.
Afunctionisone-to-oneifdifferentinputsalwaysproducedifferentoutputs.Ahorizontallinewillintersectthegraphofaone-to-onefunctioninonlyoneplace.Figure2.8
illustratesaone-to-one function,h(x).
Bothone-to-onefunctionsandmany-to-onefunctionsaresupportedintechnicalcomputinglanguages.Forexample,inMATLAB
® thefunction f (x) =x 2 canbedefinedby
using the command:
f = @(x) x^2;
Itis now possible totype:
f(3)
or
f(-3)
g(x)
h(x)
x 1 x 2 x 3 x 4 x
Figure2.7
Theinputsx 1 ,x 2 ,x 3 andx 4 allproduce the same
output,thereforeg(x) isamany-to-one function.
Figure2.8
Eachinput producesadifferent outputandso
h(x)is aone-to-onefunction.
x
2.3 Basic concepts of functions 61
both of which have the same result:
ans=9
affirming that f (x) =x 2 isamany-to-one function.
Noticethatthevariableusedbythefunctionisdefinedinbracketsafterthe@sign.This
indicatesthattheinputtothefunctionisxandthecommandcreatesafunctionhandle,
f. Giving the function a handle enables ittobe used elsewhere inthe program.
More complicated functions are usually created in a separate file and saved on the
computer’s internal storage devices. They can be easily reused to create sophisticated
programs. In MATLAB ® these files are saved with the file name extension .m and are
often termedm-files.
2.3.5 Parametricdefinitionofafunction
Functions are often expressed in the formy(x). For every value ofxthe corresponding
valueofycanbefoundandthepointwithcoordinates (x,y)canthenbeplotted.Sometimes
it is useful to expressxandycoordinates in terms of a third variable known as a
parameter.Commonlyweuset or θ todenoteaparameter.Thusthecoordinates (x,y)
of the points on a curve can be expressed inthe form
x=f(t)
y=g(t)
For example, given the parametric equations
x =t 2 y=2t 0t5
we can calculatexandyfor various values of the parametert. Plotting the points (x,y)
produces partof a curve known as a parabola.
2.3.6 Compositionoffunctions
Considerthefunctiony(x) = 2x 2 .Wecanthinkofy(x)asbeingcomposedoftwofunctions.Onefunctionisdescribedbytherule:‘squaretheinput’,whiletheotherfunction
isdescribed by the rule:‘double the input’. This isshown inFigure 2.9.
y(x)
8
x
g
square
the input
Figure2.9
Thefunction:y(x) =h(g(x)).
h
x 2 double
2x
the input
2
2
–2 1 x
Mathematically, ifh(x) = 2x andg(x) =x 2 then
y(x) = 2x 2 = 2(g(x)) =h(g(x))
The form h(g(x)) is known as a composition of the functions h and g. Note that the
compositionh(g(x)) isdifferent fromg(h(x)) as Example 2.5 illustrates.
62 Chapter 2 Engineering functions
Example2.5 If f (t) = 2t +3 andg(t) = t +1 writeexpressions for the compositions
2
(a) f(g(t))
(b) g(f(t))
( ) t +1
Solution (a) f(g(t)) = f
2
The ruledescribing the function f is: ‘double the inputand then add 3’. Hence,
So
( ) ( ) t +1 t +1
f = 2 +3=t+4
2 2
f(g(t))=t+4
(b) g(f(t)) =g(2t +3)
The ruleforgis: ‘add 1 tothe inputand then divide everything by 2’. So,
Hence
g(2t +3) = 2t+3+1
2
g(f(t))=t+2
Clearly f (g(t)) ≠g(f(t)).
=t+2
2.3.7 Inverseofafunction
Considerafunction f (x).Itcanbethoughtofasacceptinganinputx,andproducingan
output f (x). Suppose now that this output becomes the input to the functiong(x), and
the output fromg(x) isx, thatis
g(f(x))=x
We can think ofg(x) as undoing the work of f (x). Figure 2.10 illustrates this situation.
Theng(x) is the inverse of f, and is written as f −1 (x). Since f −1 (x) undoes the work
of f(x)wehave
f (f −1 (x)) = f −1 (f(x))=x
x
f
f (x)
g
g( f (x))=x
Figure2.10
The functiongisthe inverse of f.
2.3 Basic concepts of functions 63
Example2.6 If f (x) = 5x verifythat the inverse of f isgiven by f −1 (x) = x 5 .
Solution The function f receives an input of x, and produces an output of 5x. Hence when the
inverse function, f −1 , receives an input of 5x, itproduces anoutput ofx, thatis
f −1 (5x) =x
We introduce a new variable,z, given by
so
z=5x
Then
x = z 5
f −1 (z)=x= z 5
Writing f −1 withxas the argument gives
f −1 (x) = x 5
Example2.7 If f (x) = 2x +1,find f −1 (x).
Solution The function f receives an input of x and produces an output of 2x + 1. So when the
inverse function, f −1 , receives aninputof2x +1 itproducesanoutput ofx, thatis
f −1 (2x+1)=x
We introduce a new variable,z, defined by
z=2x+1
Rearranginggives
So
x = z −1
2
f −1 (z)=x= z −1
2
Writing f −1 withxas the argument gives
f −1 (x) = x −1
2
64 Chapter 2 Engineering functions
Example2.8 Giveng(x) = x −1 find the inverse ofg.
2
Solution We knowg(x) = x −1 ( ) x −1
2 , and sog−1 =x.Lety= x −1 so that
2 2
But,
and so
g −1 (y) =x
x=2y+1
g −1 (y)=2y+1
Using the same independent variable as forthe functiong, we obtain
g −1 (x)=2x+1
WenotethattheinversesofthefunctionsinExamples2.7and2.8arethemselvesfunctions.
They are called inverse functions. The inverse of f (x) = 2x + 1 is f −1 (x) =
x −1 x −1
, and the inverse ofg(x) = isg −1 (x) = 2x +1. This illustrates the important
point that if f (x) andg(x) are two functions and f (x) is the inverse ofg(x), then
2 2
g(x) is the inverse of f (x). It is important to point out that not all functions possess an
inverse function. Consider f (x) =x 2 , for −∞ <x<∞.
The function, f, is given by the rule: ‘square the input’. Since both a positive and
negative value of x will yield the output x 2 , the inverse rule is given by: ‘take plus or
minus the square root of the input’. As discussed earlier, this is a one-to-many rule and
soisnotafunction.Clearlynotallfunctionshaveaninversefunction.Infact,onlyoneto-onefunctionshaveaninversefunction.Supposewerestrictthedomainof
f (x) =x 2
suchthatx 0.Then f isaone-to-onefunctionandsothereisaninversefunction.The
inverse function is f −1 (x) given by
Clearly,
f −1 (x) = + √ x
f −1 (f(x))=f −1 (x 2 )=x
wherexisthepositivesquarerootofx 2 .Restrictingthedomainofamany-to-onefunction
so that a one-to-one function results is a common technique of ensuring an inverse
function can be found.
2.3.8 Continuousandpiecewisecontinuousfunctions
Wenowintroduceinaninformalwaytheconceptofcontinuousandpiecewisecontinuousfunctions.AmorerigoroustreatmentfollowsinChapter10afterwehavediscussed
limits.Figure2.11showsagraphof f (x) = 1 x .Notethatthereisabreak,ordiscontinuity,
inthe graph atx = 0.The function f (x) = 1 issaidtobe discontinuousatx = 0.
x
2.3 Basic concepts of functions 65
f (x)
f(t)
3
2
x
1
Figure2.11
Thefunction f (x) = 1 has a discontinuity at
x
x=0.
1 3 t
Figure2.12
Thefunction f (t)isapiecewise continuous
functionwith adiscontinuity att = 1.
If the graph ofafunction, f (x), containsabreak, then f (x) isdiscontinuous.
Afunctionwhosegraph has no breaksisacontinuousfunction.
Sometimes a function is defined by different rules on different intervals of the domain.
For example,consider
g(t)
6
f(t)=
{
2 0t<1
t 1t3
Thedomainis[0,3]buttheruleon[0,1)isdifferenttothaton[1,3].Thegraphof f (t)
is shown in Figure 2.12. Recall the convention of using • to denote that the end-point
isincludedand ◦todenotetheend-pointisexcluded.Notethat f (t)hasadiscontinuity
att = 1. Each component, or piece, of the graph is continuous and f (t) is said to be
piecewisecontinuous.
2
Apiecewisecontinuousfunctionhasafinitenumberofdiscontinuitiesinanygiven
interval.
1 3 t
Figure2.13
The functiong(t)is a
continuous function
on (0,3).
Notallfunctionsdefineddifferentlyondifferentintervalsarediscontinuous.Forexample,
{
2 0<t<1
g(t) =
2t 1t<3
isacontinuousfunction onthe interval (0,3), asshown inFigure 2.13.
2.3.9 Periodicfunctions
Aperiodicfunctionisafunctionwhichhasadefinitepatternwhichisrepeatedatregular
intervals. More formally wesay a function, f (t),isperiodicif
f(t)=f(t+T)
forall values oft. Theconstant,T, isknown asthe period ofthe function.
66 Chapter 2 Engineering functions
Example2.9 Figure2.14illustratesaperiodicwaveform.Itisoftenreferredtoasatriangularwaveform
because of its shape. The form of the function is repeated every two seconds, that
is
f(t)=f(t+2)
and so the function is periodic. The period is 2 seconds, that isT = 2. Note that this
function iscontinuous.
f(t)
–2 –1 0 1 2 3 4 t
Figure2.14
The triangularwaveform is aperiodic function.
Engineeringapplication2.2
Saw-toothwaveform
Figure 2.15 illustrates asaw-tooth voltage waveform. It is called a saw-tooth waveform
because its shape is similar to that of the teeth on a saw. It has many uses in
electronicengineering.Oneusewouldbetoprovideasignaltosweepabeamofelectrons
across a cathode ray tube in a uniform way and then quickly move the beam
backtothestartagain.Thistechniqueisusedinananalogueoscilloscopeandforms
a signalfor the time base.
The form of the function isrepeated every threeseconds, that is
v(t) = v(t +3)
v(t)
1
–3 0 3
6 9 t
Figure2.15
Thesaw-tooth waveform isaperiodicfunction.
2.3 Basic concepts of functions 67
Technical computing languages often have a range of built-in functions for producing
waveforms. Sometimes specialist functions are provided in a separate softwarepackage.InMATLAB
® ,thesesoftwarepackagesareknownastoolboxes.The
signalprocessingtoolboxhasafunctionforgeneratingsaw-toothwaves.Thiscanbe
accessed by typing, forexample:
t=(-2*pi:0.1:2*pi);
plot(t, sawtooth(t));
This will plot two periods of a saw-tooth wave. The first line generates a set of time
values −2π ≤t < 2πinavectorformwithaspacingof0.1betweeneachpoint.The
secondlineplotst againsttheresultofpassingthevectort tothesawtoothfunction.
The sawtooth function always produces a wave with a period of 2π. It highlights
the need to read the manual pages carefully before using a function to understand
how itwill behave.
Engineeringapplication2.3
Squarewaveform
Periodic functions may be piecewise continuous. Consider the functiong(t) defined
by
{
1 0t<1
g(t) = period = 2
0 1t<2
Thefunctiong(t)isperiodicwithperiod2.Agraphofg(t)isshowninFigure2.16.
This function is commonly referred to as a square waveform by engineers. In Figure
2.16 the open and closed end-points have been shown for mathematical correctness.Note,however,thatengineerstendtoomitthesewhensketchingfunctionswith
discontinuities and usually they use a vertical line to show the discontinuity. This
reflectsthefactthatnopracticalwaveformcaneverchangeitslevelinstantaneously:
evenveryfastrisingwaveformsstillhaveafiniterisetime.Thefunctionhasdiscontinuities
att = ...,−3,−2,−1,0,1,2,3,4,5,....
g(t)
1
–5 –4 –3 –2 –1 0
1 2 3 4 5 t
Figure2.16
The functiong(t)is both piecewise continuous and periodic.
The square waveform is often used in electronic engineering, particularly in digital
electronic systems. One example is the clock signal that is generated to ensure that
all of the digital electronic circuits switch around the same time and so remain in
synchronisation.
68 Chapter 2 Engineering functions
EXERCISES2.3
1 Representthe followingintervals onthe real line:
(a) [1,3] (b) [2,4)
(c) (0,3.5) (d) [−2,0)
(e) (−1,1] (f) 2x<4
(g) 0<x<2 (h) −3x −1
(i) 0x<3
2 Describe the rule associated with the following
functions,sketch their graphs and state their domains
and ranges:
(a) f(x) =2x 2
(b)f(x)=x 2 −1
(c)g(t)=3t−4
(d)y(x) =x 3
0x
0t
(e)f(t)=0.5t+2 −2t10
(f)z(x)=3x−2
3 Iff(x)=5x+4,find
(a) f(3)
(b) f(−3)
(c) f(α)
(d) f(x+1)
(e) f(3α)
(f) f(x 2 )
4 Ifg(t) =5t 2 −4,find
(a) g(0)
(b) g(2)
(c) g(−3)
(d) g(x)
(e) g(2t −1)
3x8
5 Thereactance,X C ,offered by acapacitor is given by
X C = 1 ,where f isthe frequency ofthe applied
2πfC
alternating current, andC is the capacitance ofthe
capacitor. IfC = 10 −6 F,findX C when f = 50 Hz.
6 Classifythe functions in Question2asone-to-oneor
many-to-one.
7 Find the inverse ofthe followingfunctions:
(a) f(x)=x+4
(b) g(t) =3t +1
(c) y(x) =x 3
(d) h(t) = t −8
3
(e) f(t) = t −1
3
(f) h(x) =x 3 −1
(g) k(v)=7−v
(h) m(n) = 3 1 (1 −2n)
8 Givenf(t) =2t,g(t) =t −1andh(t) =t 2 write
expressions for
(a) f(g(t)) (b) f(h(t))
(c) g(h(t)) (d) g(f(t))
(e) h(g(t)) (f) h(f(t))
(g) f(f(t)) (h) g(g(t))
(i) h(h(t)) (j) f(g(h(t)))
(k) g(f(h(t))) (l) h(g(f(t)))
9 Givenf(t) =t 2 +1,g(t) =3t +2andh(t) = 1 t ,
write expressions for
(a) f(g(t)) (b) f(h(t))
(c) g(h(t)) (d) h(f(t))
(e) f(g(h(t)))
10 Given f(t) =2t,g(t) =2t +1,h(t) =1−3t,write
expressions forthe following:
(a) f −1 (t) (b)g −1 (t) (c)h −1 (t)
11 Givena(x) = 3x −2,b(x) = 2 x ,c(x)=1+1 x write
expressions for
(a)a −1 (x) (b)b −1 (x) (c)c −1 (x)
12 Givenf(t) =2t +3,g(t) =3tandh(t) = f(g(t))
write expressions for
(a) h(t)
(b) f −1 (t)
(c) g −1 (t)
(d) h −1 (t)
(e) g −1 (f −1 (t))
Whatdoyou noticeabout(d)and(e)?
13 Sketch the following functions:
{
t 0t3
(a) f(t) =
3 3<t4
{
2 −x 0x<1
(b) g(x) =
2 1x3
{
1 −t 0t1
(c) a(t) =
t−1 1<t2
⎧
⎪⎨
2 0 x1
(d) b(x) = 1 1<x2
⎪⎩
3−x 2<x3
2.3 Basic concepts of functions 69
14 Sketch
f(t)=
{
t 0t<2
5−2t 2t<3
Isthe function piecewise continuous orcontinuous?
State,ifthey exist, the position ofany discontinuities.
15 Thefunctionh(t)is defined by
{
2 −t 0t<2
h(t) =
2t−4 2t3
andh(t)has period 3.Sketchh(t)onthe interval
[0,6].
16 The functiong(t)is defined by
{
1 0 t1
g(t) =
2−t 1<t<2
andg(t)hasperiod 2.Sketchg(t)onthe interval
[−1,4].State any pointsofdiscontinuity.
Solutions
2 (a) Square the input and thenmultiply by2; domain
(−∞, ∞),range [0, ∞)
(b) Square the input, then subtract1; domain [0,∞),
range [−1,∞)
(c) Multiply input by3and subtract4; domain
[0, ∞),range [−4,∞)
(d) Cube the input;domain (−∞, ∞),range
(−∞,∞)
(e) Multiply input by0.5and then add2; domain
[−2,10], range [1,7]
(f) Multiply input by3and then subtract2; domain
[3,8],range [7,22]
3 (a) 19 (b) −11 (c) 5α +4
(d) 5x+9 (e) 15α+4 (f) 5x 2 +4
4 (a) −4 (b) 16 (c) 41
(d) 5x 2 −4 (e) 20t 2 −20t +1
5 3183ohms
8 (a) 2(t −1) (b) 2t 2 (c) t 2 −1
(d) 2t −1 (e) (t −1) 2 (f) 4t 2
(g) 4t (h) t −2 (i) t 4
(j) 2(t 2 −1) (k) 2t 2 −1 (l) (2t −1) 2
9 (a) 9t 2 1
+12t +5 (b)
t 2 +1
3
(c)
t +2 (d) 1
t 2 +1
9
(e)
t 2 + 12 +5
t
t
10 (a) (b) t −1
2 2
x +2 2
11 (a) (b)
3 x
12 (a) 6t +3 (b) t −3
2
(d) t −3 (e) t −3
6 6
13 SeeFigureS.1.
1 −t
(c)
3
1
(c)
x −1
t
(c)
3
6 (a) many-to-one (b) one-to-one
(c) one-to-one
(e) one-to-one
7 (a)f −1 (x)=x−4
(b) g −1 (t) = t −1
3
(c) y −1 (x) =x 1/3
(d) h −1 (t) =3t +8
(e) f −1 (t)=3t+1
(f) h −1 (x) = (x +1) 1/3
(g) k −1 (v) =7−v
(h) m −1 (n) = 1−3n
2
(d) one-to-one
(f) one-to-one
f
3
2
1
(a)
0
a
1
1
2 3 4 t
(c)
0 1 2 t
FigureS.1
(b)
(d)
g
2
1
0 1 2 3 x
b
2
1
0 1 2 3 x
70 Chapter 2 Engineering functions
14 Piecewise continuous; discontinuity att = 2. See
FigureS.2.
f
2
1
16 Discontinuitiesatt = 0,2.See FigureS.4.
g
1
–1 0 1 2 3 t
FigureS.2
15 See FigureS.3.
h
2
1
–1 0 1 2 3 4
FigureS.4
t
0 1 2 3 4 5 6
t
FigureS.3
2.4 REVIEWOFSOMECOMMONENGINEERINGFUNCTIONS
ANDTECHNIQUES
This section provides a catalogue of the more common engineering functions. The important
properties and definitions are included together with some techniques. It is intendedthatreaderswillrefertothissectionforrevisionpurposesandastheneedarises
throughout the rest of the book.
2.4.1 Polynomialfunctions
Apolynomial expressionhas the form
a n
x n +a n−1
x n−1 +a n−2
x n−2 +···+a 2
x 2 +a 1
x+a 0
wherenis a non-negative integer,a n
,a n−1
,...,a 1
,a 0
are constants andxis a variable.Apolynomial
function,P(x), has the form
P(x) =a n
x n +a n−1
x n−1 +a n−2
x n−2 + ··· +a 2
x 2 +a 1
x +a 0
(2.3)
Examples of polynomial functions include
P 1
(x)=3x 2 −x+2 (2.4)
P 2
(z) =7z 4 +z 2 −1 (2.5)
P 3
(t) =3t +9 (2.6)
P 4
(t) =6 (2.7)
2.4 Review of some common engineering functions and techniques 71
wherex,zandt are independent variables. It is common practice to contract the term
polynomial expression to polynomial. By convention, a polynomial is usually written
with the powers either increasing or decreasing. For example,
3x+9x 2 −x 3 +2
would be writtenas either
−x 3 +9x 2 +3x+2 or 2+3x+9x 2 −x 3
Thedegreeofapolynomialorpolynomialfunctionisthevalueofthehighestpower.
Equation(2.4)hasdegree2,Equation(2.5)hasdegree4,Equation(2.6)hasdegree1and
Equation(2.7)hasdegree0.Equation(2.3)hasdegreen.Polynomialswithlowdegrees
have special names (seeTable 2.1).
Table2.1
Polynomial Degree Name
ax 4 +bx 3 +cx 2 +dx +e 4 Quartic
ax 3 +bx 2 +cx +d 3 Cubic
ax 2 +bx +c 2 Quadratic
ax +b 1 Linear
a 0 Constant
Typical graphs of some polynomial functions are shown inFigure 2.17.
P(x)
Degree 2
Degree 3
x
Degree 0
Degree 1
Figure2.17
Some typical polynomials.
Engineeringapplication2.4
Ohm’slaw
RecallfromEngineeringapplication1.1thatthecurrentflowingthrougharesistoris
related tothe voltage applied across itby Ohm’s law. The equation is
V=IR
whereV = voltage across the resistor;
I = currentthrough the resistor;
R = resistance value of the resistor,which isaconstant
foragiven temperature.
➔
72 Chapter 2 Engineering functions
Note that the voltage is a linear polynomial function with I as the independent
variable.
This equation is only valid for a finite range of currents. If too much voltage is
appliedtotheresistor,thenthecurrentflowingthroughtheresistorbecomessufficient
forthe resistortooverheat and breakdown.
Engineeringapplication2.5
Anon-idealvoltagesource
An ideal voltage source has zero internal resistance and its output voltage,V, is independent
of the load applied to it; that is,V remains constant, independent of the
current it supplies. It is called an ideal voltage source because it is difficult to create
such a source in practice; it is in effect an abstraction that is useful when developing
engineering models of real electronic systems. A non-ideal voltage source has
aninternal resistance. Due tothis internal resistance, the output voltage from such a
source is reduced when current is drawn from the source. The voltage reduction increasesasmorecurrentisdrawn.Figure2.18showsanon-idealvoltagesource.Itis
modelledasanidealvoltagesourceinserieswithaninternalresistorwithresistance
R s
. The output voltage of the non-ideal voltage source is v o
while v R
is the voltage
drop across the internal resistor andiis the load current. Using Kirchhoff’s voltage
law,
V=v R
+v o
and hence by Ohm’s law,
V =iR s
+v o
v o
=V−iR s
Note thatV and R s
are constants and so the output voltage is a linear polynomial
function with independent variable i. The equation gives the output voltage across
theloadasafunctionofthecurrentthroughtheload.Theoutputcharacteristicforthe
non-idealvoltagesourceisobtainedbyvaryingtheloadresistorR L
andisplottedin
Figure2.19.Noticethattheoutputvoltageofthenon-idealvoltagesourcedecreases
as the load current increases and is equal in value to the ideal voltage source only
when there isno load current.
V
+
–
R s
v R
v o
i
R L
v o
V
gradient = –R s
i
non-ideal voltage source
Figure2.18
Anon-ideal voltage sourceconnected to aload
resistor,R L .
Figure2.19
Output characteristic ofanon-ideal
voltage source.
2.4 Review of some common engineering functions and techniques 73
Thisiswhyitiscalledanon-idealvoltagesource.Engineerswouldprefertohave
a source that maintained a constant voltage no matter how much current was drawn
but itisnotpossible tobuild such a source.
Engineeringapplication2.6
Windpowerturbines
Windturbinesareanimportantsourceofelectricalpower.Themostcommontype,
andtheoneswhichareusuallyfoundinoffshoreinstallations,resembleadesktopfan
and are called horizontal axis turbines. The wind driving a turbine blade consists
ofmanymoleculesofair,eachhavingatinyamountofmass.Thismasspassingthe
bladeareaeachsecondcarrieskineticenergy,whichisthesourceofthewindpower.
The wind power,P, can be calculated using the formula
P = 1 2 Mv2
whereM is the total mass of air per second passing the blade in kg s −1 and v is the
velocity of the air inms −1 .
The mass per second can be calculated by considering the area swept out by the
blade,A, the density ofthe air, ρ, and the velocity:
M=ρAv
This equation can besubstituted inthe power equation
P = 1 2 (ρAv)v2 = 1 2 ρAv3 (2.8)
Theavailablewindpowerthereforeincreaseswiththecubeofthevelocity.Notethat
the power isacubic polynomial function of the independent variable, v.
At 20 ◦ C the air density is approximately 1.204 kg m −3 . Consider the case of an
offshoreturbinethathasasweptareaof6362m 2 andaratedwindspeedof15ms −1 .
The maximum theoretical power atthe rated speed istherefore
P = 1 2 ρAv3 = 1 2 ×1.204 ×6362 ×153 = 12.93 MW
Theactualratedpowerofthedeviceisapproximately3MWbecauseotherphysical
processesandlosseshavetobeaccountedfor,yetEquation(2.8)remainsoneofthe
mostfundamental inthe studyof wind power.
Manyexcellentcomputersoftwarepackagesexistforplottinggraphsandthese,aswell
as graphics calculators, may be used to solve polynomial equations. The real roots of
theequationP(x) = 0aregivenbythevaluesoftheinterceptsofthefunctiony =P(x)
and thexaxis, because on thexaxisyiszero.
Figure 2.20 shows a graph of y = P(x). The graph intersects the x axis at x = x 1
,
x = x 2
and x = x 3
, and so the equation P(x) = 0 has real roots x 1
, x 2
and x 3
, that is
P(x 1
) =P(x 2
) =P(x 3
) = 0.
74 Chapter 2 Engineering functions
P(x)
x 1 x 2 x 3 x
Figure2.20
A polynomial function whichcuts
thexaxisat pointsx 1 ,x 2 andx 3 .
EXERCISES2.4.1
1 State the degree ofthe followingpolynomial (c) 3w −5w 2 +12w 4
expressions:
(d) 7x −x 2
(a) z 3 +2z 2 −8+13z
(e) 3(2t 2 −9t +1)
(b) t 2 −5t 5 +2−8t 3 (f) 2z(2z +1)(2z −1)
Solutions
1 (a) 3 (b) 5 (c) 4 (d) 2 (e) 2 (f) 3
TechnicalComputingExercises2.4.1
UseatechnicalcomputinglanguagesuchasMATLAB ® to
dothe following exercises.
1 (a) Ploty =x 3 andy = 4 −2x in the interval
[-3,3].Beaware that in MATLAB ® ,to carry out
operationsonindividual elements ofavector,
special notation is used.Forexample to multiply
each element in vectorabythe corresponding
element in vectorb(having the same dimension
asvectora)you would typea.*b.Other
functionssuch asraisingto apower alsorequire a
dotprefixifthey are to be carried outoneach
individual element, rather than the whole matrix.
Note thexcoordinate ofthe point ofintersection.
(b) Drawy =x 3 +2x −4. Note the coordinate of
the point where the curve cutsthexaxis.
Compare your answer with that from (a).Explain
your findings.
2 Plotthe followingfunctions:
(a) y=3x 3 −x 2 +2x+1
−2x2
(b) y=x 4 + x3
3 − 5x2 +x−1 −3x2
2
(c) y=x 5 −x 2 +2 −2x2
Henceestimatethe real roots of
0=3x 3 −x 2 +2x+1 −2x2
0=x 4 + x3
3 − 5x2
2 +x−1
−3x2
0=x 5 −x 2 +2 −2x2
3 Usetherootsfunctionin MATLAB ® orequivalent
to calculateamore accurate value forthe real roots
estimated in question 2 andconfirm your answers are
correct.
4 (a) Drawy = 2x 2 andy =x 3 +6 usingthe same
axes.Use yourgraphs to findapproximate
solutionstox 3 −2x 2 +6 = 0.
(b) Addthe liney = −3x +5 to yourgraph.State
approximatesolutionsto
(i) x 3 +3x+1=0
(ii)2x 2 +3x−5=0
2.4.2 Rationalfunctions
2.4 Review of some common engineering functions and techniques 75
Arational function,R(x), has the form
R(x) = P(x)
Q(x)
where P and Q are polynomial functions; P is the numerator and Q is the
denominator.
The functions
R 1
(x) = x +6
x 2 +1
R 2
(t) = t3 −1
2t+3
R 3
(z) = 2z2 +z−1
z 2 +3z−2
areallrational.Whensketchingthegraphofarationalfunction,y = f (x),itisusualto
drawupatableofxandyvalues.Indeedthishasbeencommonpracticewhensketching
anygraphalthoughtheuseofgraphicscalculatorsisnowreplacingthiscustom.Itisstill
usefultoanswer questions such as:
‘Howdoesthefunctionbehaveasxbecomeslargepositively?’
‘Howdoes thefunctionbehaveasxbecomeslarge negatively?’
‘Whatis thevalueof thefunctionwhenx = 0?’
‘At what valuesofxis thedenominator zero?’
Figure 2.21 shows a graph of the function y = 1+2x = 1 + 2. As x increases, the
x x
value ofyapproaches 2.We write thisas
y→2 as x→∞
and say ‘ytends to2asxtends toinfinity’.Also fromFigure 2.21, we see that
y→±∞ as x→0
Asx → ∞,thegraphgetsnearerandnearertothestraightliney = 2.Wesaythaty = 2
is an asymptote of the graph. Similarly,x = 0, that is theyaxis, is an asymptote since
the graph approaches the linex = 0 asx → 0.
If the graph of any function gets closer and closer to a straight line then that line is
calledanasymptote.Figure2.22illustratessomerationalfunctionswiththeirasymptotes
indicatedbydashedlines.InFigure2.22(a)theasymptotesarethehorizontalliney = 3
y
2
x
Figure2.21
Thefunction:y = 1+2x = 1 x x +2.
76 Chapter 2 Engineering functions
y
y
y
3
1
x
–2
x
–1
x
(a)
(b)
Figure2.22
Some examples offunctionswith their asymptotes:
(a)y= 3x+1
x
= 3 + 1 −1 +4x+2
;(b)y=x
x x +2 ;(c)y=x2 =x+3− 1
x +1 x +1 .
(c)
and the y axis, that is x = 0. In Figure 2.22(b) the asymptotes are the horizontal line
y = 1andtheverticallinex = −2;inFigure2.22(c)theyarey =x +3andthevertical
linex = −1.Theasymptotey =x+3,beingneitherhorizontalnorvertical,iscalledan
oblique asymptote. Oblique asymptotes occur only when the degree of the numerator
exceeds the degree of the denominator by one.
Weseethattheverticalasymptotesoccuratvaluesofxwhichmakethedenominator
zero. These values are particularly important to engineers and are known as the poles
of the function. The function shown in Figure 2.22(a) has a pole atx = 0; the function
showninFigure2.22(b)hasapoleatx = −2;andthefunctionshowninFigure2.22(c)
has a pole atx = −1.
Ifthegraphofafunctionapproachesastraightline,thelineisknownasanasymptote.
Asymptotesmaybehorizontal, vertical oroblique.
Values of the independent variable where the denominator is zero are called poles
ofthe function.
Example2.10 Sketch the rationalfunctiony =
x
x 2 +x−2 .
Solution For large values ofx, thex 2 term in the denominator has a much greater value than the
x inthe numerator. Hence,
y→0 as x→∞
y→0 as x→−∞
Therefore thexaxis, thatisy = 0,isanasymptote. Writingyas
x
y =
(x −1)(x +2)
2.4 Review of some common engineering functions and techniques 77
weseethefunctionhaspolesatx = 1andx = −2;thatis,thereareverticalasymptotes
atx = 1 andx = −2. Substitution into the function of a number of values ofxallows a
table tobe drawn up:
x −3 −2.5 −2.1 −1.9 −1.5 −1 0 0.5 0.9 1.1 1.5 2 3
y −0.75 −1.43 −6.77 6.55 1.20 0.50 0 −0.40 −3.10 3.55 0.86 0.50 0.30
The graph of the function can then be sketched as shown inFigure 2.23.
y
–2
1 x
Figure2.23
x
Thefunction:y =
x 2 +x−2 .
Engineeringapplication2.7
Equivalentresistance
RecallfromEngineeringapplication1.2thattheformulafortheequivalentresistance
oftwo resistors inparallel isgiven by:
1
R E
= 1 R 1
+ 1 R 2
Consider a circuit consisting of two resistors in parallel as shown in Figure 2.24.
Onehasaknown resistanceof1andtheother hasavariableresistance,R.The
equivalent resistance,R E
, satisfies
Hence,
1
= 1 R E
R + 1 1 = 1 +R
R
R E
=
R
1 +R
Thus the equivalent resistance is a rational function of R, with domain R 0.
The graph of this function is shown in Figure 2.25. When R = 0 we note
that R E
= 0, corresponding to a short circuit. As the value of R increases, that
isR → ∞,theequivalentresistanceR E
approaches1sothatR E
= 1isanasymptote.
➔
78 Chapter 2 Engineering functions
R E
1
R
1 V
R E
R
Figure2.24
Two resistorsin parallel.
Figure2.25
Theequivalent resistance,R E ,
increases asRincreases.
EXERCISES2.4.2
1 State the poles ofthe followingrational functions:
(a) y(x) = x +3
x −2
(c) y(t) = t2 +t+1
2t+3
3
(e) H(s) =
s 2 +6s+5
2s+7
(f) G(s) =
s 2 +3s−18
(g) x(t) = 9
t 3 −t
2t−6
(h) p(t) =
t 2 +10t+25
(b) y(x) = 2x+1
x +7
(d) X(s) = s +1
s 2 −4
2 Describethe horizontalasymptote ofeachofthe
following functions:
(a) y(x) =6+ 1 x
(c) y(r) =3− 2 5r
(e) r(v) = 2v−5
3v
(f) a(t) = t2 +2t+1
t 2
(g) m(s) = 10−2s−3s2
2s 2
(b) h(t) = 2 t −1
(d) v(t) = 6 +t
t
3 Describethe vertical asymptotes ofeachofthe
following functions:
(a) y(x) = 3x+1
x −2
(c) h(s) =
9
(s +2)(s −1)
(b) y(t) = 6t−3
4t+4
(d) G(t) = 1
t 2 −1
(f) y(x) =
(g) w(f)=
2x
x 2 −1
f +2
f 2 +f−6
(h) P(t) = t2 +t+1
t 2 +6t+9
(i) T(x) =
(j) Q(r) =
x3
2x−1
6 +r
r 2 −r−12
(e) H(s) =
s
s 2 −1
4 Describethe obliqueasymptote ofeachofthe
following functions:
(a) y(x)=x+3+ 1
x −1
(b) y(x)=2x−1+ 3
x +2
(c) y(x) = x 2 − 3 4 + 1
2x+7
(d) y(x)=3x−1+ 5
2x+2
(e) y(x)=2x−1+ x +2
x 2 −1
(f) y(x)=3x−2+ 4
2x−1
(g) y(x)=4−2x+ 3
2x+3
2.4 Review of some common engineering functions and techniques 79
5 Showthat
I(x) = 2x
3 − 7 9 + 23
9(3x +2)
can be expressed in the equivalent form
2x 2 −x+1
3x+2
Sketch the rational functionI(x)and state any
asymptotes.
6 Showthat
p(x) =2x + 1 2 + 9
2(2x +3)
can be writtenin the equivalent form
4x 2 +7x+6
2x+3
Sketch the rational function p(x)and stateany
asymptotes.
7 Show thatthe function
y(x)=x+ 7 −x
x 2 +3
can be expressed in the equivalent form
x 3 +2x+7
x 2 +3
Sketch the rational functiony(x) andstate any
asymptotes.
Solutions
1 (a) 2 (b) −7 (c) − 3 2
(d) −2,2 (e) −5, −1 (f) −6,3
(g) −1,0,1
(h) −5
2 (a)y=6 (b)h=−1
(c)y=3 (d) v=1
(e)r= 2 3
(f)a=1
4 (a) y=x+3 (b) y=2x−1
(c)y= x 2 − 3 4
(e) y=2x−1
(g) y=4−2x
5 x=− 2 2x
,I =
3 3 − 7 9
(d) y=3x−1
(f) y=3x−2
6 x=− 3 2 ,p=2x+1 2
(g)m=− 3 2
3 (a)x=2 (b)t=−1
(c) s=−2,1 (d) t =−1,1
(e) s=−1,1 (f) x=−1,1
(g) f =−3,2 (h) t =−3
(i) x=0.5 (j) r=−3,4
7 y=x
TechnicalComputingExercises2.4.2
1 Useatechnicalcomputing languageto plotthe
followingrational functions.Stateany asymptotes.
(2x +1)
(a) f(x) = −4x4
(x−3)
s
(b) g(s) = −3s3
(s+1)
z
(c) h(z) =
(z 2 −3z3
+1)
(d) y(x) = (x+1) −3x3
x
2x
(e) r(x) =
−3x3
(x −1)(x −2)
2 Plotthe functionsgiven in Question4in
Exercises2.4.2 for −10 x10.
80 Chapter 2 Engineering functions
2.4.3 Exponentialfunctions
Anexponentisanothernameforapowerorindex.Expressionsinvolvingexponentsare
called exponential expressions, for example 3 4 ,a b , andm n . In the exponential expressiona
x ,aiscalledthebase;xistheexponent.Exponentialexpressionscanbesimplified
and manipulated using the laws of indices. These laws aresummarized here.
a m a n =a m+n
a m
a n =am−n a 0 =1 a −m = 1
a m
(a m ) n =a mn
Example2.11 Simplify
a 3x a 2x
(a)
a 4x (b) a 2t (1 −a t ) +a 3t (c)
(a y ) 2
2a y
(d)
a −6z
a −2z
(e)
(2a 3r ) 2 a 2r
3a −5r
(f)
a x+y a y
a 2x
(g)
3a (x/y) a x
a y
a
Solution 3x a 2x
(a) = a5x
a 4x =ax
a4x (b) a 2t (1 −a t ) +a 3t =a 2t −a 3t +a 3t =a 2t
(c)
(d)
(e)
(f)
(g)
(a y ) 2
= a2y
2a y 2a = ay
y 2
a −6z
a −2z =a−6z−(−2z) =a −4z
(2a 3r ) 2 a 2r
= 4a6r a 2r
= 4a8r
3a −5r 3a −5r 3a
a x+y a y
=a x+2y−2x =a 2y−x
a 2x
3a (x/y) a x
= 3a (x/y)+x−y
a y
3
−5r
=
4a13r
Exponentialfunctions
Anexponential function, f (x), has the form
f(x)=a x
whereaisapositive constant called the base.
Some typical exponential functions are tabulated in Table 2.2 and are shown in Figure
2.26.Note from the graphsthatthese areone-to-onefunctions.
An exponential function is not a polynomial function. The powers of a polynomial
functionareconstants;thepowerofanexponentialfunction,thatistheexponent,isthe
variablex.
Table2.2
Values ofa x fora = 0.5, 2 and 3.
2.4 Review of some common engineering functions and techniques 81
x 0.5 x 2 x 3 x
(0.5) x
y
3 x
2 x
−3 8 0.125 0.037
−2 4 0.25 0.111
−1 2 0.5 0.333
0 1 1 1
1 0.5 2 3
2 0.25 4 9
3 0.125 8 27
Figure2.26
Some typical exponential functions.
x
Themostwidelyusedexponentialfunction,commonlycalledtheexponentialfunction,
is
f(x)=e x
where e is an irrational constant (e = 2.718281828...) commonly called the
exponential constant.
Most scientific calculators have values of e x available. The function is tabulated
in Table 2.3. The graph is shown in Figure 2.27. This particular exponential function
so dominates engineering applications that whenever an engineer refers to the exponential
function it almost invariably means this one. We will see later why it is so
important.
Asxincreasespositively,e x increasesveryrapidly;thatis,asx → ∞,e x → ∞.This
situationisknownasexponentialgrowth.Asxincreasesnegatively,e x approacheszero;
that is, asx → −∞, e x → 0. Thusy = 0 is an asymptote. Note that the exponential
function isnever negative.
Figure2.28showsagraphofe −x .Asxincreasespositively,e −x decreasestozero;that
is,asx → ∞,e −x → 0.Thisisknownasexponentialdecay.Thefunctionistabulated
inTable 2.4.
Table2.3
Thevalues ofthe exponential
function f (x) = e x for
variousvalues ofx.
x
e x
−3 0.050
−2 0.135
−1 0.368
0 1
1 2.718
2 7.389
3 20.086
e x 1
Figure2.27
Graph ofy = e x showing exponential growth.
x
82 Chapter 2 Engineering functions
Table2.4
The values ofthe exponential
function f (x) = e −x for
various values ofx.
e –x 1
x
e −x
−3 20.086
−2 7.389
−1 2.718
0 1
1 0.368
2 0.135
3 0.050
Figure2.28
Graph ofy = e −x showing exponential decay.
x
Engineeringapplication2.8
Dischargeofacapacitor
The capacitor is another of the three fundamental electronic components. It is a
device that is used to store electrical charge. It consists of two parallel conducting
platesseparatedbyaninsulatingmaterial,knownasadielectric.Abuildupofanet
positive charge on one plate and a net negative charge on the other plate creates an
electricfieldacrossthedielectric,allowingelectricalenergytobestoredthathasthe
potential to do useful work. The symbol for a capacitor is two parallel lines that are
perpendicular tothe conductors inthe circuit.
Consider thecircuitofFigure2.29.Beforetheswitchisclosed,thecapacitorhas
avoltageV acrossit.Supposetheswitchisclosedattimet = 0.Acurrentthenflows
in the circuit and the voltage, v, across the capacitor decays with time. The voltage
across the capacitor isgiven by
{
V t < 0
v =
Ve −t/(RC) t 0
ThequantityRC isknown asthetimeconstantofthecircuitand isusuallydenoted
byτ.So
{
V t < 0
v =
Ve −t/τ t 0
v
V
t larger
C
Figure2.29
Circuit to discharge acapacitor.
R
Figure2.30
Thecapacitor takes longerto discharge fora
largercircuit time constant, τ.
If τ is small, then the capacitor voltage decays more quickly than if τ is large.
This isillustratedinFigure 2.30.
t
2.4 Review of some common engineering functions and techniques 83
Engineeringapplication2.9
Thediodeequation
Adiodeisanelectronicdevicethatallowscurrenttoflowwitheaseinonedirection
andwithdifficultyintheother.Inplumbingtermsitistheequivalentofanon-return
valve.Itisconstructedbysandwichingtwodifferenttypesofsemiconductormaterial
together,knownasp-typeandn-typesemiconductors.Theresultantconstructionis
referred to as a p-n junction. These different types of semiconductors are created
by a procedure known as doping. The technical details are complex but it is one
of the most fundamental processes underpinning the electronics revolution that has
transformed society inrecent decades.
Asemiconductor diode can bemodelled by the equation
I=I s
(
e qV
nkT −1
)
whereV =applied voltage (V);
I =diode current (A);
I s
=reverse saturationcurrent (A);
k =1.38 ×10 −23 J K −1 ;
q =1.60 ×10 −19 C;
T =temperature (K);
n =idealityfactor.
This equation relates the current through the diode to the voltage across it. The idealityfactor,n,typicallyrangesbetween1and2dependingonhowthediodeismanufactured
and the type of semiconductor material used. We will consider the case
n = 1,whichcorrespondstothatofanidealdiode.Atroomtemperatureq/(kT ) ≈ 40
and sothe equation can be written as
I =I s
(e 40V −1)
Figure 2.31 shows a graph ofI againstV. Notice that for negative values ofV, the
equation may be approximated by
I≈−I s
since e 40V ≈ 0. The diode is said to be reverse saturated in this case. In reality,
I s
is usually quite small for a practical device, although its size has been exaggerated
in Figure 2.31. This model does not cater for the breakdown of the diode.
According to the model it would be possible to apply a very large reverse voltage
to a diode and yet only a small saturation current would flow. This illustrates an
importantpointthatnomathematicalmodelcoverseveryfacetofthephysicaldevice
orsystemitismodelling.Adifferentmodelwouldbeneededtodealwithbreakdown
characteristics.
➔
84 Chapter 2 Engineering functions
I
–I s V
Figure2.31
Atypical diode characteristic.
EXERCISES2.4.3
1 Simplify
(a)
(c)
(e)
e 2x
3e 3x
e x (e x +e 2x )
e 2x
(e 2t ) 3 (e 3t ) 4
e 10t
(b) e2t−3
e 2
(d) e−3 e −7
e 6 e −2
2 Consider theRC circuitofFigure2.29.Given an
initialcapacitor voltageof10Vplotthe variation in
capacitor voltage with time, usingthe same axes, for
the followingpairs ofcomponentvalues:
(a) R=1,C=1µF
(b) R=10,C=1µF
(c) R=3.3,C=1µF
(d) R=56,C=0.1µF
Calculate the time constant, τ,in eachcase.
3 Sketch the following functions,usingthe same axes:
y=e 2x y=e x/2 y=e −2x
for −3x3
4 Sketch agraph ofthe functiony = 1 −e −x for
x0.
Solutions
1 (a) e−x (b) e 2t−5 (c) 1 +e x 2 (a) 10 −6 (b) 10 −5 (c) 3.3 ×10 −6
3
(d) e −14 (e) e 8t (d) 5.6 ×10 −6
TechnicalComputingExercises2.4.3
1 Ploty = e kx fork = −3, −2, −1, 0, 1, 2, 3, for
−3x3.
2 Ploty =ke x fork = −3, −2, −1, 0, 1, 2, 3, for
−3x3.
3 Ploty=5−x 2 andy=e x for−3x3.Forwhich
valuesofxise x <5−x 2 ?
4 Ploty =x 4 andy = e x for −1 x9.Forwhich
valuesofxis(a)e x <x 4 ,(b)e x >x 4 ?
2.4.4 Logarithmfunctions
2.4 Review of some common engineering functions and techniques 85
Logarithms
The equation 16 = 2 4 may be expressed in an alternative form using logarithms. In
logarithmic form wewrite
log 2
16 = 4
and say ‘log to the base 2 of 16 equals 4’. Hence logarithms are nothing other than
powers. The logarithmic form isillustratedby more examples:
125 = 5 3 so log 5
125 = 3
64=8 2 solog 8
64=2
16=4 2 solog 4
16=2
1000 = 10 3 so log 10
1000 = 3
Ingeneral,
ifc =a b , thenb =log a
c
In practice, most logarithms use base 10 or base e. Logarithms using base e are called
naturallogarithms.Log 10
xandlog e
xareusuallyabbreviatedtologxandlnx,respectively.
Most scientific calculators have both logs to base 10 and logs to base e as preprogrammed
functions, usually denoted as log and ln, respectively. Some calculations
in communications engineering use base 2. Your calculator will probably not calculate
base 2 logarithms directly. We shall see how toovercome thisshortly.
Focusing on base 10 wesee that
ify=10 x then x=logy
Equivalently,
ifx=logy then y=10 x
Usingbase e we see that
ify=e x then x=lny
Equivalently,
ifx=lny then y=e x
Example2.12 Solve the equations
(a) 16 = 10 x (b) 30 = e x (c) logx = 1.5 (d) lnx = 0.75
Solution (a)
16 = 10 x
log16=x
x = 1.204
(c) logx = 1.5
x = 10 1.5
= 31.623
(b)
30 = e x
ln30=x
x = 3.401
(d) lnx = 0.75
x = e 0.75
= 2.117
Example2.13 Solve the equations
(a) 50 = 9(10 2x ) (b) 3e −(2x+1) = 10
(c) log(x 2 −1) =2 (d) 3ln(4x +7) =12
86 Chapter 2 Engineering functions
Solution (a) 50 = 9(10 2x ) (b) 3e −(2x+1) = 10
10 2x = 50 e −(2x+1) = 10
9
2x = log 50 3
−(2x +1) = ln 10 9
x = 1 2 log 50 3
9 = 0.372 2x=−ln 10 3 −1
(c) log(x 2 −1) = 2
x 2 −1=10 2 =100
x 2 = 101
x = ±10.050
2x = −2.204
x = −1.102
(d) 3ln(4x +7) = 12
4x+7=e 4
4x=e 4 −7
x = e4 −7
= 11.900
4
Logarithmicexpressionscanbemanipulatedusingthelawsoflogarithms.Theselaws
are identical for any base, but it is essential when applying the laws that bases are not
mixed.
log a
A +log a
B = log a
(AB)
( ) A
log a
A −log a
B = log a
B
nlog a
A = log a
(A n )
log a
a=1
Wesometimesneedtochangefromonebasetoanother.Thiscanbeachievedusingthe
following rule.
Inparticular,
Example2.14 Simplify
log a
X = log b X
log b
a
log 2
X = log 10 X
log 10
2 = log 10 X
0.3010
(a) logx +logx 3
(b) 3logx +logx 2
( 1
(c) 5lnx+ln
x)
(d) log(xy) +logx −2logy
( ) 4
(e) ln(2x 3 ) −ln + 1 x 2 3 ln27
Solution (a) Usingthe laws of logarithms we find
logx +logx 3 = logx 4
2.4 Review of some common engineering functions and techniques 87
(b) 3logx +logx 2 = logx 3 +logx 2 = logx
( ( ( 5
)
1 1 x
(c) 5lnx+ln =lnx
x)
5 5
+ln = ln =lnx
x)
4
x
(d) logxy+logx−2logy=logxy+logx−logy 2
( ) ( ) xyx x
2
=log = log
y 2 y
( ) 4
(e) ln(2x 3 ) −ln + ln27 ( ) 2x
3
=ln +ln27 1/3
x 2 3 4/x 2
( 2x 3 x 2 )
=ln +ln3
( x
5
=ln
2
4
)
+ln3=ln
( ) 3x
5
2
Example2.15 Findlog 2
14.
Solution Usingthe formulaforchangeofbasewehave
log 2
14 = log 10 14
log 10
2 = 1.146
0.301 = 3.807
Engineeringapplication2.10
Signalratiosanddecibels
Theratiobetweentwosignallevelsisoftenofinteresttoengineers.Forexample,the
outputandinputsignalsofanelectronicsystemcanbecomparedtoseeifthesystem
has increasedthe level ofasignal. Acommon case is anamplifier, where the output
signal is usually much larger than the input signal. This signal ratio, known as the
power gain, isoftenexpressed in decibels(dB)given by
( )
Po
power gain (dB) = 10log
P i
whereP o
isthepoweroftheoutputsignalandP i
isthepoweroftheinputsignal.The
term gain is used because ifP o
> P i
, then the logarithm function is positive, correspondingtoanincreaseinpower.IfP
o
<P i
thenthegainisnegative,corresponding
toadecrease inpower. Inthis situationthe signal issaidtobeattenuated.
The advantage of using decibels as a measure of gain is that if several electronic
systems are connected together then it is possible to obtain the overall system gain
in decibels by adding together the individual system gains. We will show this for
threesystems connected together, butthe development iseasilygeneralized tomore
systems. Let the power input to the first system be P i1
, and the power output from
➔
88 Chapter 2 Engineering functions
the third system be P o3
. Suppose the three are connected so that the power output
from system 1,P o1
, is used as input to system 2, that isP i2
=P o1
. The power output
from system 2,P o2
, is then used as input to system 3, that isP i3
= P o2
. We wish to
find the overall power gain, 10log(P o3
/P i1
). Now
P o3
= P o3P o2
P o1
P i1
P i3
P i2
P i1
becauseP i3
=P o2
andP i2
=P o1
. Therefore,
( ) ( )
Po3 Po3 P
10log = 10log o2
P o1
P i1
P i3
P i2
P i1
thatis
10log
(
Po3
P i1
)
( ) ( ) ( )
Po3 Po2 Po1
= 10log +10log +10log
P i3
P i2
P i1
usingthe laws of logarithms.
It follows that the overall power gain is equal to the sum of the individual power
gains. Often engineers are more interested in voltage gain rather than power gain.
The power of a signal is proportional to the square of its voltage. We definevoltage
gain (dB) by
( ) V
2
voltage gain (dB) = 10log
o
V 2
i
( )
Vo
= 20log
V i
Engineeringapplication2.11
TheuseofdBminradiofrequencyengineering
Inthepreviousengineeringapplicationitwasshownhowthedecibelcanbeusedto
express a ratio of the power of two signal levelsP o
andP i
. It is possible to specify
a fixed value for the input signalP i
. This is termed a reference level. When this is
done the decibel becomes an absolute quantity. The notation is normally changed
slightly to indicate the assumed reference level. For example, dBm is used as an
absolutemeasureofpowerinthefieldofradiofrequency(RF)engineering.Amobile
telephone handset, a microwave oven and a radar transmitter on an airfield are all
devices that might have their output power quoted in dBm. The definition of power
gain measured indBmisasfollows:
( )
Po
power gain (dBm) = 10log
10 −3
Herethe reference level chosen is1mWor 10 −3 W.
If a device is quoted as having an output power of 15 dBm we can convert this
into a power value inwatts as follows:
( )
Po
15 (dBm) = 10log
10 −3
Dividing both sides by 10
( )
Po
1.5 = log
10 −3
2.4 Review of some common engineering functions and techniques 89
And so
10 1.5 =
(
Po
10 −3 )
Therefore the actual power is
P o
= 10 1.5 ×10 −3 = 10 −1.5 = 0.0316 ∼ = 32 mW
ThisisthetypicalamountofRFpowerthatmightbetransmittedbyalaptopcomputer
with WiFicapability.
Engineeringapplication2.12
Attenuationinastep-indexopticalfibre
Fibre optical cables are used to guide high-bandwidth light signals generated by
lasers. One design is the step-index optical fibre. This consists of a glass core of
silicondioxidewithahighrefractiveindex,obtainedbydopingtheglasswiththeelementgermanium.Asurroundingsheathoflowerrefractiveindexglassensuresthat
nearly all of the light remains within the core. Transmission occurs by total internal
reflection atthe interfacebetween the two typesofglass.
The losses that do occur in the optical fibre can be described by the following
equation:
I(z) =I 0
e −αz
where the intensity of the light I(z) is a function of the distance along the fibre, z,
fromthelightsource.TheintensityofthelightsourceattheinsertionpointisI 0
and
α is anattenuation factor.
Note that α varies with the wavelength of the light and so the rate of attenuation
dependsonthecolourofthelight.Itispossibletoobtainplotsforthevariationofthe
attenuationfactorwithwavelengthfromthemanufacturers.Theattenuationfactoris
often expressed in units of dB km −1 . A typical value for a cable operating at a light
wavelength of1550 nmis0.3 dBkm −1 .
An alternative measure for gain/attenuation is that of the neper (Np). Like the
decibel this is also a dimensionless ratio that has a logarithmic form. Whereas the
decibel is defined in terms of base 10 logarithms, the neper is defined in terms of
natural logarithms, that is base e logarithms. The gain measured in Np is defined by
the following expression:
voltage gain (Np) = ln V o
V i
=lnV o
−lnV i
whereV o
is the signal value in volts after gain/attenuation andV i
is the reference
signal value involts.
It is possible to derive a conversion factor between Np and dB by considering
the case whenV o
is a factor of 10 greater thanV i
. Using the neper measurement this
corresponds to
voltage gain (Np) = ln 10V i
V i
=ln10
➔
90 Chapter 2 Engineering functions
Using the dB measurement for voltage gain (see Engineering application 2.10)
this corresponds to voltage gain (dB) =20log V o
V i
which equals 20log10. So,
ln10 (Np) = 20log10 (dB). Thus,
1Np= 20log10 dB ≈ 8.685 89 dB
ln 10
Using the value quoted earlier of an attenuation factor of 0.3 dB km −1 , this corresponds
to
0.3 ÷8.685 89 Np km −1 = 0.034539 Np km −1
It is more usual to quote values of Np m −1 and so this value becomes 3.4539 ×
10 −5 Np m −1 .
Engineeringapplication2.13
Referencelevels
We saw in Engineering application 2.11 that the suffix ‘m’ is used in dBm to indicatetheprovisionofaspecificreferencelevel.Alternativesuffixesareusedtodenote
other reference levels and quantities, which do not necessarily have to be related to
electricalpower.Forexample,whenmeasuringsoundpressure,P,inairtheconventionalreferencelevelforsoundpressureis20
µPar.m.s.Thisischosentocorrespond
totheapproximatethresholdofhumanhearingfora1kHzsinusoidalsignal.Theunit
forsoundpressureisthereforequotedwithreferencetoaninputpressureof20 µPa.
ThisiscommonlywrittenasdBre20 µPar.m.s.orusingtheshorthanddBSPL(dB,
soundpressure level). Inotherwords wehave
( )
P
soundpressure level (dBSPL) = 20log
20 ×10 −6
Asaconsequenceofthechoiceofthehumanhearingthresholdasthereferencelevel,
anegativevalueofdBSPLcorrespondstoasoundthatistooquiettobeheardbythe
averageperson;0dBSPLisasoundthatcanjustbeheardandanythingabovethisis
fullyaudible.Anofficemighthaveanambient(background)levelof30dBSPLand
a person talking to you at the next desk might produce 60 dB SPL, both quantities
being measured atyour hearing position.
Logarithmfunctions
Thelogarithm functionsaredefined by
f(x)=log a
x x>0
whereaisapositive constant called the base.
In particular the logarithm functions f (x) = logx and f (x) = lnx are shown in Figure2.32andsomevaluesaregiveninTable2.5.Thedomainofbothofthesefunctionsis
(0,∞)andtheirrangesare (−∞,∞).Weobservefromthegraphsthatthesefunctions
2.4 Review of some common engineering functions and techniques 91
Table2.5
Somevalues forlogarithm
functionslogx and lnx.
x logx lnx
y
ln x
log x
0.1 −1 −2.303
0.5 −0.301 −0.693
1 0 0
2 0.301 0.693
5 0.699 1.609
10 1 2.303
50 1.699 3.912
Figure2.32
Graphsoflnx and logx.
1 x
are one-to-one. It is important to stress that the logarithm functions, log a
x, are only
defined for positive values ofx. The following properties should be noted:
}
}
logx→∞ logx→−∞
asx→∞ asx→0
lnx→∞ lnx→−∞
log1=ln1=0
log10=1 lne=1
Connectionbetweenexponentialandlogarithmfunctions
Theexponentialfunction, f (x) =a x ,isaone-to-onefunctionandsoaninversefunction,
f −1 (x), exists.Recall
So
Now
f −1 (f(x))=x
f −1 (a x )=x
log a
(a x ) =xlog a
a
using laws of logarithms
=x since log a
a = 1
Hence the inverse of f (x) = a x is f −1 (x) = log a
x. By similar analysis the inverse of
f(x)=log a
xisf −1 (x) =a x .
The inverse of the exponential function, f (x) = a x , is the logarithm function, that
is f −1 (x) = log a
x.
Theinverseofthelogarithmfunction, f (x) = log a
x,istheexponentialfunction,
thatis f −1 (x) =a x .
In particular:
If f(x) =e x ,then f −1 (x) =lnx.
If f(x) = lnx,then f −1 (x) =e x .
If f(x) = 10 x ,then f −1 (x) =logx.
If f(x) = logx,then f −1 (x) =10 x .
92 Chapter 2 Engineering functions
x
1 1
2 64
3 729
4 4096
5 15625
6 46656
7 117649
8 262144
9 531441
10 1000000
y
log y
6
5
4
3
2
1
0
1
2 3 4 5 6 7 8 9 10 x
Figure2.33
Thefunctiony =x 6 plotted on alog--linear graph.
Useoflog–logandlog–linearscales
Suppose wewish toplot
y(x)=x 6
1x10
This may appear a straightforward exercise but consider the variation in the x and y
values. Asxvaries from 1 to 10, thenyvaries from 1 to 1000000, as tabulated above.
Severalofthesepointswouldnotbediscernibleonagraphandsoinformationwouldbe
lost.Thiscanbeovercomebyusingalogscalewhichaccommodatesthelargevariation
iny. Thus log yisplotted againstx, rather thanyagainstx. Note thatinthisexample
logy=logx 6 =6logx
so asxvaries from 1 to 10, logy varies from 0 to 6. A plot in which one scale is logarithmic
and the other is linear is known as a log--linear graph. Figure 2.33 shows logy
plottedagainstx.Ineffect,useofthelogscalehascompressedalargevariationintoone
which ismuch smaller and easier toobserve.
Example2.16 Considery = 7 x for −3 x 3.Plot a log--linear graph of this function.
Solution We have
and so
y=7 x
logy = log(7 x ) =xlog7 = 0.8451x
PuttingY = logy we haveY = 0.8451x which is the equation of a straight line passing
through the origin with gradient log7. Hence when logy is plotted againstxastraight
linegraphisproduced.ThisisshowninFigure2.34.Notethatbytakinglogs,therange
on the verticalaxis has been greatly reduced.
Aplotinwhichbothscalesarelogarithmicisknownasalog--logplot.Herelogyis
plotted against logx.
2.4 Review of some common engineering functions and techniques 93
Y
2.5
x y Y=logy
−3 0.003 −2.54
−2 0.020 −1.69
−1 0.143 −0.85
0 1 0
1 7 0.85
2 49 1.69
3 343 2.54
–3 –2 –1 1 2 3 x
–2.5
Figure2.34
Alog--linear plotofy = 7 x producesa
straightline graph.
Example2.17 Considery =x 7 for 1 x10. Plot a log--log graph of thisfunction.
Solution We have
and so
y=x 7
logy = log(x 7 ) = 7logx
We plot logy against logx in Figure 2.35 for a log--log plot. PuttingY = logy and
X = logx we haveY = 7X which isastraightlinethrough the originwith gradient 7.
x y X=logx Y=logy
1 1 0 0
2 128 0.301 2.107
3 2187 0.477 3.340
4 16384 0.602 4.214
5 78125 0.699 4.893
6 279936 0.778 5.447
7 823543 0.845 5.916
8 2097152 0.903 6.322
9 4782969 0.954 6.680
10 10000000 1 7
Y
7
1 X
Figure2.35
A log--logplotofy =x 7
produces astraightline graph.
Examples 2.16 and 2.17 illustrate the following general points:
Alog--linear plot ofy =a x produces a straightlinewith a gradient of loga.
Alog--log plot ofy =x n produces a straightlinewith a gradient ofn.
94 Chapter 2 Engineering functions
1
9
8
7
6
5
4
3
Second cycle
2
Logarithmic scale
1
9
8
7
6
5
4
3
First cycle
2
Linear
1
Figure2.36
Two-cycle log--lineargraph paper.
Useoflog–linearandlog–loggraphpaper
The requirement to take logarithms is a tedious process which can be avoided by using
specialgraphpaperscalledlog--lineargraphpaperandlog--loggraphpaper.Anexample
of log--linear graph paper isshown inFigure 2.36.
Note that on one axis the scale is uniform; this is the linear scale. On the other, the
scaleisnotuniformandismarkedincyclesfrom1to9.Thisisthelogarithmicscale.On
this scale values ofyare plotted directly, without first taking logarithms. On the graph
papershowninFigure2.36therearetwocyclesbutpapersarealsoavailablewiththree
2.4 Review of some common engineering functions and techniques 95
or more cycles. To decide which sort of graph paper is appropriate it is necessary to
examine the variation in size of the variable to be plotted measured in powers of 10. If,
for example,yvaries from 1 to 10, then paper with one cycle is appropriate. Ifyvaries
from 1 to 10 2 , two-cycle paper is necessary. If y varies from 10 −1 to 10 4 , then paper
with4 − (−1) = 5cycleswouldbeappropriate.Toseehowlog--linearpaperisusedin
practice, consider the following example.
Example2.18 During anexperiment the following pairs of data values wererecorded:
A B C D
x 0 1 5 12
y 4.00 5.20 14.85 93.19
It is believed thatyandxare related by the equationy = ab x . By plotting a log--linear
graph verify the relationship isof thisform and determineaandb.
Solution If the relationship isgiven byy =ab x , then taking logarithms yields
logy=loga+xlogb
So, plotting logy againstxshould produce a straight line graph with gradient logband
vertical intercept loga. The need to find logy is eliminated by plotting theyvalues directly
on a logarithmic scale. Examining the table of data we see that y varies from
approximately 10 0 to 10 2 so that two-cycle paper is appropriate. Values ofybetween 1
and 10 are plotted on the first cycle, and those between 10 and 100 are plotted on the
second.ThepointsareplottedinFigure2.37.Noteinparticularthatinthisexamplethe
‘1’atthestartofthesecondcyclerepresentsthevalue10,the‘2’representsthevalue20
and so on. From the graph, the straight line relationship between logy andxis evident.
It is therefore reasonable to assume that the relationship betweenyandxis of the form
y =ab x .
Tofindthegradientofthegraphwecanchooseanytwopointsontheline,forexample
Cand B. The gradient isthen
log14.85 −log5.20
5 −1
= log(14.85/5.20)
4
Recall thatlogbisthe gradient ofthe lineand so
= 0.1139
logb = 0.1139, thatisb= 10 0.1139 = 1.2999
The vertical intercept isloga. From the graph the vertical intercept islog4 so that
loga=log4,
thatisa=4
We conclude that the relationship betweenyandxisgiven byy = 4(1.3) x .
96 Chapter 2 Engineering functions
log y
D
(12, 93.19)
1
9
8
7
6
5
Second cycle
4
3
2
C (5, 14.85)
B (1, 5.20)
1
9
8
7
6
5
A (0, 4)
First cycle
4
3
2
0 1 2 3 4 5 6 7 8 9 10 11 12 x
Figure2.37
The log--lineargraph isastraightline.
1
Engineeringapplication2.14
Bodeplotofalinearcircuit
It is possible to find out a great deal about an electronic circuit by measuring how
it responds to a sinusoidal input signal. We will examine the sinusoidal functions in
2.4 Review of some common engineering functions and techniques 97
Section 3.4. The usual procedure is to apply a range of fixed-amplitude sinusoidal
signals with different frequencies in order to obtain information about the circuit or
system.InSection23.9wewillseethatifthecircuitislinearthen,afterithassettled
down, the output signal is also a sinusoidal signal of the same frequency but with a
different amplitude and phase (see Section 3.7 fordetails of these terms).
ABode plotconsists of two components:
(1) The ratio of the amplitudes of the output signal and the input signal is plotted
against frequency.
(2) Thephaseshiftbetweentheinputandoutputsignalsisplottedagainstfrequency.
A log scale is used for the frequency in order to compress its length; for example,
a typical frequency range is 0.1 to 10 6 Hz which corresponds to a range of −1 to 6
on a log scale. A log scale is also used for the ratio of the signal powers as this is
calculatedindecibels.Phaseshiftisplottedonalinearscale.Sothesignalamplitude
ratio versus frequency is a log--log graph and the phase shift versus frequency is a
linear-- log graph.
Anoperationalamplifierisanexampleofalinearcircuit.Inatechnicalcomputing
language such as MATLAB ® itisusually easytoproduce a Bode plot.
An example function which describes the behaviour ofsuch a device is:
A(f)= 100000
1 + j f 8
which gives the input-output voltage function. Do not worry about the meaning of
the ‘j’ in the equation for now. This will be covered in Chapter 9 when we discuss
complex numbers.
We could produce a Bode plot by typing:
f=1:1:10000;
semilogx(f, 20*log10(abs(100000./(1+j*f/8))));
The function semilogx plots a graph with a logarithmic scale on thex-axis. Chapter
21 examines Bode plotsinmore detail.
EXERCISES2.4.4
1 Evaluate
(a) log 2 8 (b) log 2 15
(c) log 16 50 (d) log 16 123
(e) log 8 23 (f) log 8 47
2 Simplify eachofthe following to asingle logterm:
(a) log7 +logx
(b) logx+logy+logz
(c) lny−ln3
(d) 2logy+logx
(e) ln(xy) +ln(y 2 )
(f) ln(x +y) −lny
(g) log(2x 2 ) +log(4x)
(h) 3logy+2logx
1
(i) 2 logx 4 −2logx
(j) 3lnx+2lny+4lnz
(k) logz−2logx+3logy
(l) logt 3 −log(2t) +2logt
3 Simplify each ofthe following to asingle log term:
(a) 3lnt−lnt
(b) 6logt 2 +4logt
98 Chapter 2 Engineering functions
(c) ln(3y 6 )−2ln3+lny
(d) ln(6x +4) −ln(3x +2)
(e) log(9x) ( 2
−log
2 3x)
4 Sketch graphs ofthe followingfunctions,usingthe
same axes:
y=ln(2x),
y=lnx 0<x10
Measurethe vertical distancebetweenthe graphs for
x = 1,x = 2 andx = 8. Canyou explain your
findings usingthe laws oflogarithms?
5 Solve the following equations:
(a) e x = 70 (b) e x = 1 3
(c) e −x = 1 (d) 3e x = 50
(e) e 3x = 50 (f) e 2x+3 = 300
(g) e −x+1 = 0.75 (h) 2ee 2x = 50
3
(i)
e x +1 = 0.6 (j) 3
= 0.6
ex+1 (k) (e x ) 3 = 200 (l) √ e 2x =2
(m) √ e 2x e x
+4=6 (n)
e x +2 = 0.7
(o) e 2x = 7e x (p) 2e −x = 9
(q) (e x +3) 2 =25 (r) (3e −x −6) 3 =8
(s) e 2x −3e x +2=0 (t) 2e 2x −7e x +3=0
(u) e x (5−e x )=6 (v) e x −7+ 12
e x = 0
6 Solve the following equations:
(a) 10 x = 30 (b) 10 x = 0.25
(c) 4(10 x ) = 20 (d) 10 2x = 90
(e) 10 3x−2 = 20 (f) 3(10 x+3 ) = 36
(g) 10 −3x = 0.02 (h) 7(10 −2x ) = 1.4
(i) 10 x−2 = 20 (j) 10 3x+1 = 75
4
(k)
10 x = 6 (l) (10−x ) 2 = 40
(m) √ 10 4x 10 −x
= 3 (n)
2 +10 −x = 1 2
(o) √ 10 2x +6 = 5 (p) 10 6x = 30(10 3x )
(q) (10 −x +2) 2 = 6 (r) 6(10 −3x ) = 10
(s) 10 2x −7(10 x ) +10 = 0
(t) 10 4x −8(10 2x ) +16 = 0
(u) 10 x −5 +6(10 −x ) = 0
(v) 4(10 2x ) −8(10 x ) +3 = 0
7 Solve
(a) logx = 1.6 (b) log2x = 1.6
(c) log(2 +x) = 1.6 (d) 2log(x 2 ) = 2.4
(e) log(2x −3) = 0.7
8 Solve
(a) lnx=2.4
(b) ln3x=4
(c) 2ln(2x
(
−1)
)
= 5 (d) ln(2x 2 ) = 4.5
x +1
(e) ln = 0.9
3
9 Solve
(a) e 3x = 21 (b) 10 −2x = 6.7
1
(c)
e −x +2 = 0.3 (d) 2e(x/2) −1 = 0
(e) 3(10 (−4x+6) ) = 17
(f) (e x−1 ) 3 +e 3x = 500
(g) √ 10 2x +100 = 3(10 x )
10 Calculate the voltagegain in decibelsofthe following
amplifiers:
(a) input signal = 0.1V, outputsignal =1V;
(b) input signal = 1 mV, outputsignal = 10V;
(c) input signal = 5 mV, outputsignal = 8 V;
(d) input signal = 60mV, outputsignal =2V.
11 An audio amplifierconsistsoftwo stages:a
preamplifierandamainamplifier. Given the
following data,calculatethe voltagegain in decibels
ofthe individual stagesandthe overall gain in
decibelsofthe audio amplifier:
preamplifier:
main amplifier:
input signal = 10mV,
outputsignal = 200 mV
input signal = 400 mV,
outputsignal = 3 V
12 ABluetooth radio system operating in Class3has a
maximumoutputpower of0dBm at2.45 GHz. Find
the maximumoutputpower in watts (W).
13 Amicrowave oven has an outputpower of1kW.
Expressthisfigurein dBm.
14 Express0dBSPLasasound pressure in
micropascals (µPa).
15 Acar audio speaker has an outputsound pressure
level of55dBSPLwhenmeasuredat adistanceof
1 m. Calculate the sound pressure atthatpoint in
pascal r.m.s.
16 An active sonarsystemfitted to aboatproducesa
sourcelevel of220 dBre 1 µPa at1m. Calculate the
sound pressure in kPa.
2.4 Review of some common engineering functions and techniques 99
17 Byusinglog--linear paper findthe relationship
betweenxandygiven the following table ofvalues:
x 1.5 1.7 3.2 3.9 4.3 4.9
y 8.5 9.7 27.6 44.8 59.1 89.6
18 Byusinglog--logpaper findthe relationshipbetween
x andygiven the following table ofvalues:
x 2.0 2.5 3.0 3.5 4.0 4.5
y 13.0 19.0 25.9 33.6 42.2 51.6
Solutions
1 (a) 3 (b) 3.9069
(c) 1.4110 (d) 1.7356
(e) 1.5079 (f) 1.8515
2 (a) log7x (b) logxyz
( )
y
(c) ln (d) logxy 2
3
( )
(e) ln(xy 3 x +y
) (f) ln
y
(g) log8x 3 (h) logx 2 y 3
(i) log1 = 0 (j) lnx 3 y 2 z 4
( ) ( )
y 3 z t 4 (k) log
x 2 (l) log
2
3 (a) 2lntorlnt 2 (b) 16logt
( )
y 7 (c) ln (d) ln2
3
( )
9x 3/2
(e) log
2
5 (a) 4.2485 (b) −1.0986
(c) 0 (d) 2.8134
(e) 1.3040 (f) 1.3519
(g) 1.2877 (h) 1.1094
(i) 1.3863 (j) 0.6094
(k) 1.7661 (l) 0.6931
(m) 1.7329 (n) 1.5404
(o) 1.9459 (p) −1.5041
(q) 0.6931 (r) −0.9808
(s) 0, 0.6931
(t) −0.6931,1.0986
(u) 0.6931,1.0986 (v) 1.0986,1.3863
6 (a) 1.4771 (b) −0.6021
(c) 0.6990 (d) 0.9771
(e) 1.1003 (f) −1.9208
(g) 0.5663 (h) 0.3495
(i) 3.3010 (j) 0.2917
(k) −0.1761 (l) −0.8010
(m) 0.2386 (n) −0.3010
(o) 0.6395 (p) 0.4924
(q) 0.3473 (r) −0.0739
(s) 0.3010,0.6990 (t) 0.3010
(u) 0.3010,0.4771 (v) −0.3010,0.1761
7 (a) 39.81 (b) 19.91 (c) 37.81
(d) ±3.98 (e) 4.01
8 (a) 11.02 (b) 18.20 (c) 6.59
(d) ±6.71 (e) 6.38
9 (a) 1.0148 (b) −0.4130 (c) −0.2877
(d) −1.3863 (e) 1.3117 (f) 2.0553
(g) 0.5485
10 (a) 20dB (b) 80 dB (c) 64.1 dB
(d) 30.46dB
11 Preamplifier gain = 26.02 dB, mainamplifiergain
= 17.50 dB,total gain = 43.52 dB
12 10 −3 W or1mW
13 60 dBm
14 20 µPa
15 0.011Paor11 mPa
16 10 5 or100 kPa
17 y=3(2 x )
18 y = 4x 1.7
100 Chapter 2 Engineering functions
TechnicalComputingExercises2.4.4
1 Useatechnical computing language to draw
y =log(kx)for0.5 x 50fork =1,2,3and4.
( 1
2 Drawy=lnxandy=ln for
x)
0.5 x20. What doyou observe? Can you explain
your observation usingthe laws oflogarithms?
3 Drawy=lnxandy=1− x 3 for
0.5 x4.Fromyourgraphs statean approximate
solution to
lnx=1− x 3
2.4.5 Thehyperbolicfunctions
The hyperbolic functions are y(x) = coshx, y(x) = sinhx, y(x) = tanhx, y(x) =
sechx, y(x) = cosechx and y(x) = cothx. Cosh is a contracted form of ‘hyperbolic
cosine’, sinh of‘hyperbolic sine’ and so on. We definecoshx and sinhx by
y(x) =coshx = ex +e −x
2
y(x) =sinhx = ex −e −x
2
Note:
cosh(−x) = e−x +e x
= coshx
2
sinh(−x) = e−x −e x
=−sinhx
2
so, for example, cosh1.7 = cosh(−1.7) and sinh(−1.7) = −sinh1.7. Clearly, hyperbolicfunctionsarenothingotherthancombinations
oftheexponential functionse x and
e −x .However,theseparticularcombinationsoccursofrequentlyinengineeringthatitis
worth introducing the coshx and sinhx functions. The remaining hyperbolic functions
are defined interms of coshx and sinhx.
y(x) =tanhx = sinhx
coshx = ex −e −x
e x +e −x
y(x) =sechx = 1
coshx = 2
e x +e −x
y(x) = cosechx = 1
sinhx = 2
e x −e −x
y(x) =cothx = coshx
sinhx = 1
tanhx = ex +e −x
e x −e −x
Values of the hyperbolic functions for various x values can be found from a scientific
calculator. UsuallyaHyp button followed by a Sin, Cos or Tan button isused.
Example2.19 Evaluate
(a) cosh3
(c) tanh1.6
(e) coth1
(b) sinh(−2)
(d) sech(−2.5)
(f) cosech(−1)
2.4 Review of some common engineering functions and techniques 101
Solution (a) cosh3 = 10.07
(b) sinh(−2) = −3.627
(c) tanh(1.6) = 0.9217
(d) sech(−2.5) = 1/cosh(−2.5) = 0.1631
(e) coth1 = 1/tanh1 = 1.313
(f) cosech(−1) = 1/sinh(−1) = −0.8509
Graphs of the functions sinhx, coshx and tanhx can be obtained using a graphics
calculator.
Hyperbolicidentities
Severalidentitiesinvolvinghyperbolicfunctionsexist.Theycanbeverifiedalgebraically
usingthe definitions given, and are listedfor reference.
cosh 2 x −sinh 2 x = 1
1 −tanh 2 x = sech 2 x
coth 2 x −1 = cosech 2 x
sinh(x ±y) = sinhxcoshy ±coshxsinhy
cosh(x ±y) = coshxcoshy ±sinhxsinhy
sinh2x = 2sinhxcoshx
cosh2x = cosh 2 x +sinh 2 x
cosh 2 x =
sinh 2 x =
cosh2x +1
2
cosh2x −1
2
Note also that
e x = coshx +sinhx
e −x = coshx −sinhx
Hence any combination of exponential terms may be expressed as a combination of
coshx and sinhx, and viceversa.
Example2.20 Express
(a) 3e x −2e −x intermsofcoshx and sinhx,
(b) 2sinhx +coshx intermsofe x and e −x .
Solution (a) 3e x −2e −x = 3(coshx +sinhx) −2(coshx −sinhx) = coshx +5sinhx.
(b) 2sinhx+coshx=e x −e −x + ex +e −x
= 3ex −e −x
.
2 2
102 Chapter 2 Engineering functions
Inversehyperbolicfunctions
The inverse hyperbolic functions use the familiar notation. Both y = sinhx and y =
tanhx are one-to-one functions and no domain restriction is needed for an inverse to be
defined.However,on (−∞,∞),y = coshxisamany-to-onefunction.Ifthedomainis
restricted to [0,∞) the resulting function is one-to-one and an inverse function can be
defined.
The inverse of the function sinhx is denoted by sinh −1 x. Here the −1 must not be
interpreted as a power but rather the notation we use for the inverse function. Similarly
the inverses of coshx and tanhx aredenoted by cosh −1 x and tanh −1 x respectively.
Values of sinh −1 x, cosh −1 x and tanh −1 x can be obtained using a scientific
calculator.
Example2.21 Evaluate
(a) cosh −1 (3.7) (b) sinh −1 (−2) (c) tanh −1 (0.5)
Solution Using a calculator we get
(a) 1.9827 (b) −1.4436 (c) 0.5493
Engineeringapplication2.15
Capacitancebetweentwoparallelwires
Althoughitmaynotseemobvious,asmallcapacitanceexistsbetweentwowiresthat
run close to each other and at certain frequencies this can be a significant factor for
someelectrical systems.
Themutualcapacitancepermetre,C,betweentwolongparallelwiresinaireach
havingaradiusrmetresandwiththewirecentresseparatedbyd metresiscalculated
using
πε
C = 0
cosh −1 (d/2r)
Thisexpressionincludesaninversehyperbolicfunction.Intheequationd>2r,otherwise
the wires would be overlapping. The constant ε 0
is a fundamental physical
constant called the permittivity of free space. It has an approximate value of
8.85 ×10 −12 Fm −1 .
Recall the general expression forthe hyperbolic function coshx:
coshx = ex +e −x
2
To derive the inverse of this hyperbolic function, we need to restrict the domain to
[0, ∞), that isx 0.We lety = coshx and then solve forxintermsofy:
y = ex +e −x
2
2y=e x +e −x
0=e x −2y+e −x
2.4 Review of some common engineering functions and techniques 103
Inorder tosolve this equation wefirst multiplyboth sides by e x , togive
0 =e x e x −e x 2y+e x e −x
Using the laws of indices,
0=e 2x −2e x y+1
We can now write this as a quadratic equation in e x and solve it using the standard
formula.This gives
(e x ) 2 −2y(e x )+1=0
e x = −b ± √ b 2 −4ac
2a
= −(−2y) ± √ (−2y) 2 −4(1)(1)
2(1)
= 2y ± √ 4y 2 −4
=y ± √ y
2
2 −1
Asx 0,the function e x 1 so wereject the negative rootand itfollows that
e x =y+ √ y 2 −1
from which we get
(
x=ln y + √ )
y 2 −1
Recall thaty =coshxand sox = cosh −1 y. Therefore
(
cosh −1 y = ln y + √ )
y 2 −1
Hencetheequationforthepairofwirescanalsobewrittenintermsoflogarithmsas
πε
C = [ 0
ln (d/2r) + √ ]
(d/2r) 2 −1
EXERCISES2.4.5
A word of warning about the inverse of coshx is needed. The calculator returns a
value of 1.9827 for cosh −1 (3.7). Note, however, that cosh(−1.9827) = 3.7. The value
−1.9827 is not returned by the calculator; only values on [0,∞) will be returned. This
isbecause the domainofy = coshx isrestrictedtoensureithas aninverse function.
1 Evaluate the following:
(a) sinh3 (b) cosh1.6
(c) tanh0.95 (d) sech1
(e) cosech2 (f) coth1.5
(g) cosh(−3) (h) cosech(−1.6)
(i) sinh(−2) (j) coth(−2.7)
(k) tanh(−1.4) (l) sech(−0.5)
2 Evaluate
(a) sinh −1 3
(b) cosh −1 2
(c) tanh −1 (−0.25)
3 Express
(a) 6e x +5e −x in terms ofsinhx andcoshx,
(b) 4e 2x −3e −2x in terms ofsinh2x andcosh2x,
(c) 2e −3x −5e 3x in terms ofsinh3x andcosh3x.
4 Express
(a) 4sinhx +3coshx in terms ofe x ande −x ,
(b) 3sinh2x −cosh2x in termsofe 2x ande −2x ,
(c) 3cosh3x −0.5sinh3x in terms ofe 3x ande −3x .
5 Expressae x +be −x ,whereaandbare constants,in
terms ofcoshx andsinhx.
104 Chapter 2 Engineering functions
6 Expressacoshx +bsinhx,whereaandbare
constants,in terms ofe x and e −x .
7 Show that the pointx = coshu,y = sinhulies onthe
curve
x 2 −y 2 =1
8 Provethehyperbolicidentitieslistedintheboxearlier
in thissection.
Solutions
1 (a) 10.0179 (b) 2.5775 (c) 0.7398
(d) 0.6481 (e) 0.2757 (f) 1.1048
(g) 10.0677 (h) −0.4210 (i) −3.6269
(j) −1.0091 (k) −0.8854 (l) 0.8868
2 (a) 1.8184 (b) 1.3170 (c) −0.2554
3 (a) 11coshx +sinhx (b) cosh2x +7sinh2x
(c) −3cosh3x −7sinh3x
4 (a) 3.5e x −0.5e −x
(b) e 2x −2e −2x
(c) 1.25e 3x +1.75e −3x
5 (a+b)coshx+(a−b)sinhx
6
a +b
2 ex + a −b
2 e−x
TechnicalComputingExercises2.4.5
1 Useatechnicalcomputing language to draw
(a)y = sinhx (b)y = coshx
(c)y=tanhxfor−5x5.
2 Draw graphs ofy = sinhx,y = coshx andy = ex
2 for
0 x5. What happens to the threegraphs asx
increases? Canyou explain thisalgebraically?
3 Draw
(a) y=sinh −1 x
(b) y=cosh −1 x
(c) y=tanh −1 x
−5x5
1x5
−1<x<1
2.4.6 Themodulusfunction
Themodulusofapositivenumberissimplythenumberitself.Themodulusofanegative
number is a positive number of the same magnitude. For example, the modulus of 3 is
3; the modulus of −3 is also 3. We enclose the number in vertical lines to show we are
findingits modulus,thus
|3|=3 |−3|=3
Mathematically wedefinethe modulusfunctionasfollows:
The modulus
{
functionisdefinedby
x x0
f(x)=|x|=
−x x<0
Figure2.38illustratesagraphof f (x) = |x|.Themodulusofaquantityisnevernegative.
Consider two points on thexaxis,aandb, asshown inFigure 2.39. Then
|a −b| = |b −a| = distancefromatob
2.4 Review of some common engineering functions and techniques 105
f (x)
a
0
b
x
|a – b|
Figure2.38
The function: f (x) = |x|.
Figure2.39
Distance fromatob=|a −b|.
Example2.22 Find the distance from
(a)x=2tox=9 (b)x=−2tox=9 (c)x=−2tox=−9
Solution (a) Distance = |2 −9| = | −7| = 7
(b) Distance=|−2−9|=|−11|=11
(c) Distance = | −2− (−9)| = |7| = 7
From the definition of the modulus function itfollows:
If |x| =a,then eitherx =aorx = −a.
If|x|<a,then−a<x<a.
If |x| >a, then eitherx >aorx < −a.
For example, if |x| = 4 then eitherx = 4 orx = −4. If |x| < 4 then −4 < x < 4; that
is,xlies between −4 and 4. If |x| > 4, then eitherx > 4 orx < −4; that is,xis either
greater than 4 orless than −4.
Example2.23 Describe the interval on thexaxis defined by
(a) |x| < 2
(b) |x| 3
(c) |x−1|<3
(d) |x+2|>1
Solution (a) |x| < 2 is the same statement as −2 < x < 2; that is,xis numerically less than 2.
Figure2.40illustratesthisregion.Notethattheregionisanopeninterval.Sincethe
pointsx = −2 andx = 2 arenot included, they areshown on the graph as ◦.
(b) If |x| 3theneitherx 3orx −3.ThisisshowninFigure2.41.Therequired
region of thexaxis has two distinct parts. Since the pointsx = 3 andx = −3 are
included inthe interval of interest,they are shown on the graph as •.
(c) |x −1| < 3 isequivalent to −3 <x −1 < 3,thatis −2 <x<4.
(d) |x+2|>1isequivalenttox+2>1orx+2<−1,thatisx>−1orx<−3.
106 Chapter 2 Engineering functions
–2
0
|x| < 2
Figure2.40
The quantity |x| < 2 is equivalent to −2 <x<2.
2
–3
0 3
|x| > 3 |x| > 3
Figure2.41
Thequantity |x| 3isequivalenttox 3orx −3.
Themodulusfunctioncan beusedtodescribe regionsinthex--yplane.
Example2.24 Sketch the region defined by
(a) |x| < 2and |y| < 1
(b) |x 2 +y 2 | 9
Solution (a) The region is a rectangle as shown in Figure 2.42. The boundary is not part of the
regionasstrictinequalitieswereusedtodefineit.Theregion |x| 2, |y| 1isthe
sameasthatinFigure 2.42 with the boundaryincluded.
(b) |x 2 +y 2 | 9isequivalentto −9 x 2 +y 2 9.Note,however,thatx 2 +y 2 isnever
negative and so the region isgiven by0 x 2 +y 2 9.
Let P(x,y) be a general point as shown in Figure 2.43. Then from Pythagoras’s
theorem,the distance fromPtothe originis √ x 2 +y 2 .So,
Then,
(distance fromorigin) 2 =x 2 +y 2 9
(distance fromorigin) 3
This describes a disc, centre the origin, of radius 3 (see Figure 2.44).
y
1
y
P (x, y)
y
3
–2
–1
2
x
O
r
x
–3
–3
3 x
Figure2.42
Theregion: |x| < 2 and |y| < 1.
Figure2.43
PointP(x,y) is ageneral point.
Figure2.44
Theregion: |x 2 +y 2 | 9.
Engineeringapplication2.16
Full-waverectifier
A rectifier is an electronic circuit that is used to convert an alternating voltage to
a direct voltage, that is, one that does not change in polarity with time. Usually
the alternating voltage has the shape of a sinusoidal function. We will examine the
2.4 Review of some common engineering functions and techniques 107
sinusoidalfunctionsinSection3.4.Afull-waverectifierisonethatmakesuseofthe
negativepartoftheinputsinusoidalsignalaswellasthepositivepart,byreversingits
polarity.Itcontrastswithahalf-waverectifierthatmerelydiscardsthenegativepart
of the input sinusoidal signal. This means that the half-wave circuit is not driven by
the alternating voltage supply at all during half of the cycle. In both types of circuit
a capacitor is usually connected across the output to store energy when the signal
approaches a peak. The capacitor then discharges into the circuit when the rectifier
output voltage falls. By this means, a relatively constant voltage is produced at the
power supply output.
Afullyrectifiedsinewave isthemodulusofthesinewave.ThecircuitforafullwaverectifierisshowninFigure2.45togetherwiththeinputandoutputwaveforms.
The input signal is v in
and the output signal is v o
. Ignoring the voltage drops across
the diodes gives
v o
= |v in
|
v in
v o
v o
t
v in
t
Figure2.45
A full-wave rectifier.
EXERCISES2.4.6
1 Sketchthe interval defined by
(a) |x|>4 (b) |y−1|3
(c) |t+6|3
(d) |t 2 −2|<7
(e) |t| < −1
Also, state the intervals withoutusingthe modulus
signs.
2 Sketchthe regionsdefined by
(a) |x| > 2, |y| < 3
(b) |x+2|<4,|y+1|<3
(c) |x 2 +y 2 −1|4
(d) |(x−1) 2 + (y+2) 2 | 1
3 Express the following intervalsusingmodulus
notation:
(a) −2x2
(c)1<y<3
(b) −4<t<4
(d) −6t0
Solutions
1 (a)x>4orx<−4 (b) −2y4
(c)t −3ort −9 (d) −3<t<3
(e) novalue oft satisfiesthis
3 (a) |x| 2 (b) |t| <4
(c) |y−2|<1
(d) |t+3|3
108 Chapter 2 Engineering functions
TechnicalComputingExercises2.4.6
1 Useatechnical computing language suchas
MATLAB ® to produce a plotofthe output of:
(a) afull wave rectifier circuit
(b) ahalfwave rectifier circuit
f (t)
2.4.7 Therampfunction
Theramp functionisdefined by
slope = c
f(t)=
{
ct t 0 cconstant
0 t<0
Figure2.46
Theramp function.
t
Its graph isshown inFigure 2.46.
Engineeringapplication2.17
Telescopedrivesignal
Inordertotrackthemotionofthestarslargetelescopesareusuallydrivenbyanelectricmotor.Thespeedofthismotoriscontrolledinorderthattheangularpositionof
thetelescopefollowsaspecifiedtrajectorywithtime.Thewholeassembly,including
telescope, gears, motor, controller and sensors, forms a position control system or
servo-system. This servo-system must be fed a signal corresponding to the desired
trajectoryofthesystem.OftenthistrajectoryisarampfunctionasillustratedinFigure2.47.Thedrivemotorisstartedattimet
=t 0
andthedesiredangularpositionof
the telescopeis θ.
u
Desired telescope
position
Drive motor started
t 0
t
Figure2.47
Tracking signal foratelescope.
2.4.8 Theunitstepfunction,u(t)
The unit stepfunctionisdefinedby
{ 1 t0
u(t) =
0 t<0
2.4 Review of some common engineering functions and techniques 109
u (t)
1
u (t – d)
1
Figure2.48
Theunit stepfunction.
t
Figure2.49
Graph ofu(t −d).
d
t
Its graph is shown in Figure 2.48. Note thatu(t) has a discontinuity att = 0. The point
(0, 1) is part of the function defined ont 0. This is depicted by •. The point (0, 0) is
notpartof the function defined ont < 0.We use ◦toillustratethis.
The position of the discontinuity may be shifted.
{ 1 td
u(t−d)=
0 t<d
The graph ofu(t −d) isshown inFigure 2.49.
Engineeringapplication2.18
RLCcircuit
Theinductorisanotherofthethreefundamentalelectroniccomponents.Itisadevice
that is used to store electrical energy in the form of a concentrated magnetic field. It
consistsofatightlywoundcoil,oftenaroundaferromagneticmaterial,whichhelps
to strengthen the magnetic field. It is usually depicted on a circuit diagram as a set
ofloops.Figure2.50showsaresistor,inductorandacapacitorconnectedinparallel.
This is usually known as an RLC circuit. The circuit also contains a switch, which
here is depicted as an arrow connecting two terminals with the switch in the open
position. When the switch is open the voltage v across terminals A and B is zero. If
theswitchisclosedatt = 0thevoltageacrossAandBisV S
,whereV S
isthesupply
voltage. Thus v can bemodelled by the function
{ 0 t<0
v(t) =
t0
V S
This can bewritten,usingthe unit stepfunction, as
v(t) =V S
u(t)
+
A
V S
–
L R C
B
Figure2.50
RLCcircuit.
110 Chapter 2 Engineering functions
Example2.25 Sketch the following functions:
(a) f =u(t−3)
(c) f =u(t−1)−u(t−3)
(e) f =e t u(t)
(b) f =u(t−1)
(d) f =u(t−3)−u(t−1)
Solution (a) See Figure 2.51. (b) See Figure 2.52.
(c) See Figure 2.53. (d) See Figure 2.54.
(e) See Figure 2.55.
f
1
f
1
Figure2.51
Thefunction: f =u(t −3).
3 t
Figure2.52
The function: f =u(t −1).
1
t
f
1
f
f
Figure2.53
Thefunction:
f =u(t−1)−u(t−3).
1
3
t
–1
1 3
Figure2.54
Thefunction:
f =u(t−3)−u(t−1).
t
1
Figure2.55
Thefunction: f = e t u(t).
t
EXERCISES2.4.8
1 Sketch the followingfunctions:
(a) f(t) =u(t −1)
(b) f(t) =u(t +1)
(c) f(t)=u(t−2)−u(t−6)
(d) f(t) =3u(t −1)
(e) f(t)=u(t+1)−u(t−1)
(f) f(t)=u(t−1)−u(t+1)
(g) f(t) =u(t +1)−2u(t −1)
(h) f(t) =2u(t +1)−u(t −1)
(i) f(t) =3u(t −1)−2u(t −2)
2 Aramp function, f (t),is defined by
{ 2t t0
f(t)=
0 t<0
Sketch the following for −1 t 3:
(a) f(t)
(b) u(t)f(t)
(c) u(t −1)f(t)
(d) u(t −1)f(t)−u(t −2)f(t)
Solutions
2 SeeFigureS.5.
2.4 Review of some common engineering functions and techniques 111
f
6
4
2
u(t – 1)f
6
4
2
u(t – 1)f – u(t –2)f
6
4
2
(a) (b)
–1
0
1
2 3 t
(c)
–1
0
1
2 3 t
(d)
–1
0
1
2 3 t
FigureS.5
2.4.9 Thedeltafunctionorunitimpulsefunction, δ(t)
Consider the rectangle function,R(t), shown in Figure 2.56. The base of the rectangle
ish, the height is 1/h and so the area is 1. Fort >h/2 andt < −h/2, the function is 0.
Ashdecreases, the base diminishes and the height increases; the area remains constant
at1.
Ashapproaches 0, the base becomes infinitesimally small and the height infinitely
large. The area remains at unity. The rectangle function is then called a delta function
or unit impulse function. Ithas a valueof0everywhere except atthe origin.
δ(t) = rectanglefunction ashapproaches0
We write thisconcisely as
δ(t) =R(t) as h →0
The position of the delta function may be changed from the origin tot =d. Consider a
rectangle function,R(t −d), shown in Figure 2.57.R(t −d) is obtained by translating
R(t)anamountd totheright.Again,lettinghapproach0producesadeltafunction,this
time centredont =d.
δ(t−d)=R(t−d) as h→0
We have seen that the delta function can be regarded as bounding an area 1 between
itselfand the horizontalaxis.More generallythe area bounded by the function
f(t) =kδ(t)
isk.Wesaythatkδ(t)representsanimpulseofstrengthkattheorigin,andkδ(t −d)is
animpulseofstrengthkatt =d.Itisoftenusefultodepictsuchanimpulsebyanarrow
where the height of the arrow gives the strength of the impulse. A series of impulses is
oftentermedan impulse train.
R (t)
1–
h
R (t – d)
1–
h
– h–
2
h–
2
t
d – h– d d + h–
2 2
t
Figure2.56
The rectangle function,R(t).
Figure2.57
Thedelayed rectangle function,R(t −d).
112 Chapter 2 Engineering functions
Example2.26 Atrainof impulses isgiven by
f(t) = δ(t)+3δ(t −1)+2δ(t −2)
Depict the traingraphically.
Solution Figure 2.58 shows the representation. In Section 22.8 we shall call such a function a
series of weighted impulseswhere the weights are1,3and 2,respectively.
f (t)
3
2
1
0 1 2 t
Figure2.58
A trainofimpulsesgiven by
f(t) = δ(t)+3δ(t −1)+2δ(t −2).
Engineeringapplication2.19
Impulseresponseofasystem
An impulse signal is sometimes used to test an electronic system. It can be thought
ofasgiving the systemavery harsh joltforavery shortperiodoftime.
It is not possible to produce an impulse function electronically as no practical
signalcanhaveaninfiniteheight.However,anapproximationtoanimpulsefunction
isoftenused,consistingofapulsewithalargevoltage,V,andshortduration,T.The
strength of such an impulse isVT. When this pulse signal is injected into a system
the output obtained isknown astheimpulse response ofthe system.
The approximation is valid provided the width of the pulse is an order of magnitudelessthanthefastesttimeconstantinthesystem.IfthevalueofT
requiredissmall
inordertosatisfythisconstraint,thenthevalueofV mayneedtobelargetoachieve
thecorrectimpulsestrength,VT.Oftenthiscanruleoutitsuseformanysystemsas
the value ofV would thenbelargeenoughtodistort the systemcharacteristics.
EXERCISES2.4.9
1 Sketch the impulsetrain given by
(a) f(t) = δ(t −1)+2δ(t −2)
(b) f(t) =3δ(t)+4δ(t −2)+δ(t −3)
Review exercises 2 113
REVIEWEXERCISES2
1 Statethe ruleand sketch the graphofeach ofthe
followingfunctions:
(a) f(x)=7x−2
(b) f(t)=t 2 −2 0t5
(c) g(x)=3e x +4 0x2
(d) y(t) = (e 2t −1)/2 t 0
(e) f(x)=x 3 +2x+5 −2x2
2 Statethe domainand range ofthe functions in
Question1.
3 Determine the inverse ofeach ofthe following
functions:
(a) y(x) =2x (b) f(t) =8t −3
(c) h(x) = 2x +1 (d)m(r)=1−3r
3
(e) H(s) = 3 s +2
(g) f(t) =e 2t
(i) g(v) = ln(v +1)
4 Iff(t)=e t find
(a) f(2t)
(b) f(x)
(c) f(λ) (d) f(t − λ)
(f) f(v)=lnv
(h) g(v) =lnv+1
(j) y(t) = 3e t−2
5 Ifg(t) =ln(t 2 +1)find(a)g(λ),(b)g(t − λ).
6 Sketchthe following functions:
{ 0 t<0
(a) f(t) =
0.5t t0
⎧
⎨4 t<0
(b) f(t) = t 0t<3
⎩
2t 3t4
(c) f(t)=|e t |
−3t 3
7 Thefunction f (x) isperiodicwith a periodof2, and
f(x)=|x|,−1x1.Sketchffor−3x3.
8 Givena(t) =3t,b(t) =t +3andc(t) =t 2 −3write
expressions
(a) b(c(t))
(c) a(b(t))
(e) a(b(c(t)))
(b) c(b(t))
(d) a(c(t))
(f) c(b(a(t)))
9 Sketchthe following functions,stating any
asymptotes:
(a) y(x) = 3 +x
x
(b) y(x) =
2x
x 2 −1
(c) y(x) =3−e −x (d) y(x) = ex +1
e x
10 Simplify each expression asfar aspossible.
(a) e 2x e 3x
(c) e 4 e 3 e
(e)
( )
e x 2
e −x
(b) e x e 2x e −3x
e x
(d)
e −x
( )
2
(f) ln3x +ln
x
(g) 3lnt+2lnt 2 (h) e lnx
(i) ln(e x ) (j) e lnx2
(k) e 0.5lnx2 (l) ln(e 2x )
(m) e 2lnx (n) ln(e 3 ) +ln(e 2x )
11 Solve the following:
(a) e 4x = 200 (b) e 3x−6 = 150
(c) 9e −x = 54 (d) e (x2) = 60
1
(e) = 0.1
6+e−x 12 Solve the following:
(a) 0.5lnt = 1.2
(b) ln(3t +2) = 1.4
(c) 3ln(t−1)=6
(d) log 10 (t 2 −1) = 1.5
(e) log 10 (lnt) = 0.5
(f) ln(log 10 t) = 0.5
13 Express eachofthe following in terms ofsinhx and
coshx:
(a) 7e x +3e −x (b) 6e x −5e −x
(c) 3ex −2e −x 1
(d)
2 e x +e −x
e x
(e)
1+e x
14 Express eachofthe following in terms ofe x ande −x :
(a) 2sinhx +5coshx
(b) tanhx +sechx
(c) 2coshx − 3 4 sinhx
1
(d)
sinhx −2coshx
(e) (sinhx) 2
114 Chapter 2 Engineering functions
15 Describe the interval onthexaxisdefined by
(a) |x| < 1.5 (b) |x| > 2
(c) |x+3|<7 (d) |2x|6
(e) |2x−1|5
16 Sketch
(a) f=|−2t| −3t3
(b) f=−|2t| −3t3
(c) 2δ(t)−δ(t −1)+3δ(t +1)
Solutions
1 (a) Multiply input by 7, then subtract2
(b) Square the input, thensubtract 2
(c) Calculate e x ,wherexisthe input, multiply by3
and thenadd 4
(d) Multiply by2, calculate exponential, subtract1
and thendivide by 2
(e) Cube input, addto twice the input and add 5
2 (a) Domain (−∞, ∞)range (−∞, ∞)
(b) Domain [0,5]range [−2,23]
(c) Domain [0,2]range [7,3e 2 +4]
(d) Domain [0,∞) range [0,∞)
(e) Domain [−2,2]range [−7,17]
3 (a) y −1 (x) = x 2
(b) f −1 (t) = t +3
8
(c) h −1 (x) = 3 2 (x−1)
(d) m −1 (r) = 1 −r
3
(e) H −1 (s) = 3
s −2
(f) f −1 (v) =e v
(g) f −1 (t) = 1 2 lnt
(h) g −1 (v) = e v−1
(i) g −1 (v) =e v (
−1
)
(j) y −1 t
(t) = ln +2
3
4 (a) e 2t (b) e x (c) e λ (d) e t−λ
5 (a) ln(λ 2 +1) (b) ln((t − λ) 2 +1)
8 (a) t 2
(b) (t+3) 2 −3ort 2 +6t+6
(c) 3(t +3)
(d) 3(t 2 −3)
(e) 3t 2
(f) (3t+3) 2 −3or9t 2 +18t+6
10 (a) e 5x (b) 1 (c) e 8
(d) e 2x (e) e 4x (f) ln6
(g) lnt 7 (h) x (i) x
(j) x 2 (k) x (l) 2x
(m) x 2 (n) 3 +2x
11 (a) 1.3246 (b) 3.6702
(c) −1.7918 (d) ±2.0234
(e) −1.3863
12 (a) 11.0232 (b) 0.6851
(c) 8.3891 (d) ±5.7116
(e) 23.6243 (f) 44.5370
13 (a) 10coshx +4sinhx
(b) coshx +11sinhx
coshx +5sinhx
(c)
2
1
(d)
2coshx
coshx +sinhx
(e)
1+coshx+sinhx
7
14 (a)
2 ex + 3 2 e−x
e x −e −x +2
(b)
e x +e −x
5e x +11e −x
(c)
8
−2
(d)
e x +3e −x
( )
e x −e −x 2
(e)
2
15 (a) −1.5 <x<1.5
(b) x>2andx<−2
(c) −10<x<4
(d) x3andx−3
(e) −2x3
3 Thetrigonometric
functions
Contents 3.1 Introduction 115
3.2 Degreesandradians 116
3.3 Thetrigonometricratios 116
3.4 Thesine,cosineandtangentfunctions 120
3.5 Thesincxfunction 123
3.6 Trigonometricidentities 125
3.7 Modellingwavesusingsint andcost 131
3.8 Trigonometricequations 144
Reviewexercises3 150
3.1 INTRODUCTION
ManycommonengineeringfunctionswerestudiedinSection2.4.However,thetrigonometricfunctionsaresoimportantthattheydeserveseparatetreatment.Oftenalternating
currents and voltages can be described using trigonometric functions, and they occur
frequently inthe solution of various kinds of equations.
Section3.3introducesthethreetrigonometricratios:sinθ,cosθ andtanθ.Theseare
definedasratiosofthesidesofaright-angledtriangle.Thedefinitionsarethenextended
so that angles of any magnitude may be considered. Section 3.5 introduces an important
function, sincx. This is really a combination of the familiar functionsxand sinx,
but because this combination occurs frequently in some engineering applications it deservesspecialmention.TrigonometricidentitiesareintroducedinSection3.6,themost
common ones being tabulated. These identities are useful in simplifying trigonometric
expressions. Graphs of the trigonometric functions are illustrated. The application
of these functions to the modelling of waveforms is an important section. The chapter
closes with the solution of trigonometric equations.
116 Chapter 3 The trigonometric functions
3.2 DEGREESANDRADIANS
Angles can be measured in units of either degrees or radians. The symbol for degree
is ◦ . Usually no symbol is used to denote radians and this is the convention adopted in
this book.
A complete revolution is defined as 360 ◦ or 2π radians. It is easy to use this fact to
convert between the two measures. We have
Note that
360 ◦ = 2π radians
1 ◦ = 2π
360 = π
180 radians
1 radian = 180
π
π
radians = 90◦
2
3π
2
π radians = 180 ◦
radians = 270◦
degrees ≈ 57.3◦
Yourcalculatorshouldbeabletoworkwithanglesmeasuredinbothradiansanddegrees.
Usually the Mode button allows you toselect the appropriate measure.
3.3 THETRIGONOMETRICRATIOS
Considertheangle θ intheright-angledtriangleABC,asshowninFigure3.1.Wedefine
the trigonometricratiossine, cosineandtangent asfollows:
sinθ=
cosθ=
tanθ=
sideopposite toangle
hypotenuse
sideadjacent toangle
hypotenuse
= BC
AC
= AB
AC
sideopposite toangle
sideadjacent toangle = BC
AB
C
a
A
u
B
Figure3.1
A right-angled triangle, ABC.
Note that
tanθ = BC
AB = BC
AC × AC
AB = sinθ
cos θ
3.3 The trigonometric ratios 117
Note that when θ reduces to0 ◦ the length of the sideBC reduces tozero and so
sin0 ◦ =0, tan0 ◦ =0
Also when θ reduces to0 ◦ the lengths of AB and ACbecome equal and so
cos0 ◦ =1
Similarlywhen θ approaches90 ◦ ,thelengthsofBCandACbecomeequalandthelength
of ABshrinks tozero.Hence
sin90 ◦ = 1, cos90 ◦ = 0
Note also that the length of AB shrinks to zero as θ approaches 90 ◦ , and so tanθ approaches
infinity. We write thisas
tanθ→∞
asθ→90 ◦
The trigonometric ratios of 30 ◦ , 45 ◦ and 60 ◦ occur frequently in calculations. They can
be calculated exactly by considering the right-angled triangles shown inFigure 3.2.
sin45 ◦ = 1 √
2
, cos45 ◦ = 1 √
2
, tan45 ◦ =1
sin30 ◦ = 1 2 , cos30◦ =
sin60 ◦ =
√
3
2 , tan30◦ = 1 √
3
√
3
2 , cos60◦ = 1 2 , tan60◦ = √ 3
Mostscientificcalculatorshavepre-programmedvaluesofsinθ,cosθ andtanθ.Anglescanbemeasuredindegreesorradians.Wewilluseradiansunlessstatedotherwise.
If welet ̸ ACB = α (see Figure 3.1),then
and
sinα= AB
AC =cosθ
cosα= BC
AC =sinθ
45°
2
1
45°
1
30°
2
3
60°
1
Figure3.2
Thetrigonometric ratiosfor30 ◦ ,
45 ◦ and60 ◦ canbe foundexactly
fromthese triangles.
118 Chapter 3 The trigonometric functions
But
α+θ= π 2
Hence,
( ) π
sinθ = cos
2 − θ
( ) π
cos θ = sin
2 − θ
Since θ is an angle in a right-angled triangle it cannot exceed π/2. In order to define
the sine, cosine and tangent ratios for angles larger than π/2 we introduce an extended
definition which isapplicable toangles of any size.
Consider an arm,OP, fixed atO, which can rotate (see Figure 3.3). The angle, θ, in
radians,betweenthearmandthepositivexaxisismeasuredanticlockwise.Thearmcan
beprojectedontoboththexandyaxes.TheseprojectionsareOAandOB,respectively.
Whetherthearmprojectsontothepositiveornegativexandyaxesdependsuponwhich
quadrant the armissituatedin. The length of the armOPisalways positive. Then,
sinθ= projectionofOPontoyaxis
OP
cosθ=
tanθ=
projection ofOPontoxaxis
OP
= OB
OP
= OA
OP
projection ofOP ontoyaxis
projection ofOPontoxaxis = OB
OA
In the first quadrant, that is 0 θ < π/2, both thexandyprojections are positive,
so sinθ, cos θ and tanθ are positive. In the second quadrant, that is π/2 θ < π, the
x projection,OA, is negative and theyprojection,OB, positive. Hence sinθ is positive,
and cosθ and tanθ are negative. Both thexandyprojections are negative for the third
quadrantandsosinθ andcosθ arenegativewhiletanθ ispositive.Finally,inthefourth
quadrant, thexprojection is positive and theyprojection is negative. Hence, sinθ and
tan θ are negative, and cosθ is positive (see Figure 3.4). The sign of the trigonometric
ratios can be summarized by Figure 3.5.
For angles greater than 2π, the armOP simply rotates more than one revolution before
coming to rest. Each complete revolution bringsOP back to its original position.
Second quadrant
P
y
B
u
First quadrant
A
O
x
Third quadrant
Fourth quadrant
Figure3.3
An arm,OP,fixed atO, which can rotate.
3.3 The trigonometric ratios 119
y
y
B
P
P
B
u
u
O A x
A
O
x
First quadrant
y
Second quadrant
y
A
O
u
x
O
u
A
x
P
B
B
P
Third quadrant
Fourth quadrant
Figure3.4
Evaluating the trigonometric ratios in each ofthe fourquadrants.
So,for example,
sin(8.76) = sin(8.76 −2π) = sin(2.477) = 0.617
cos(14.5) = cos(14.5 −4π) = cos(1.934) = −0.355
Negative angles are interpreted as a clockwise movement of the arm. Figure 3.6 illustrates
an angle of −2. Note that
sin(−2) = sin(2π −2) = sin(4.283) = −0.909
sinceananticlockwise movement ofOPof4.283radians would resultinthe armbeing
inthe same position as a clockwise movement of 2 radians.
y
sin u > 0
cos u < 0
tan u < 0
sin u > 0
cos u > 0
tan u > 0
y
sin u < 0
cos u < 0
tan u > 0
sin u < 0
cos u > 0
tan u < 0
x
O
u = –2
x
Figure3.5
Sign ofthe trigonometric ratios in
each ofthe four quadrants.
P
Figure3.6
Illustrationofthe angle θ = −2.
120 Chapter 3 The trigonometric functions
Thecosecant,secantandcotangentratiosaredefinedasthereciprocalsofthesine,
cosine and tangent ratios.
cosec θ = 1
sinθ
secθ = 1
cos θ
cotθ = 1
tanθ
Example3.1 An angle θ issuchthatsinθ > 0 and cosθ < 0.Inwhichquadrantdoes θ lie?
Solution FromFigure3.5weseethatsinθ > 0when θ isinthefirstandsecondquadrants.Also,
cosθ < 0when θ isinthesecondandthirdquadrants.Forbothsinθ > 0andcosθ < 0
thus requires θ to be in the second quadrant. Hence sin θ > 0 and cosθ < 0 when θ is
inthe secondquadrant.
EXERCISES3.3
1 Verify usingascientificcalculator that
(a)sin30 ◦ = sin390 ◦ = sin750 ◦
(b)cos100 ◦ = cos460 ◦ = cos820 ◦
(c)tan40 ◦ = tan220 ◦ = tan400 ◦
(d)sin70 ◦ = sin(−290 ◦ ) = sin(−650 ◦ )
(e)cos200 ◦ = cos(−160 ◦ ) = cos(−520 ◦ )
(f) tan150 ◦ = tan(−30 ◦ ) = tan(−210 ◦ )
2 Verify the following usingascientificcalculator. All
angles are in radians.
(a) sin0.7 = sin(0.7 +2π) = sin(0.7 +4π)
(b) cos1.4 = cos(1.4 +8π) = cos(1.4 −6π)
(c) tan1 = tan(1 + π) = tan(1 +2π) = tan(1 +3π)
(d) sin2.3 = sin(2.3 −2π) = sin(2.3 −4π)
(e) cos2 = cos(2 −2π) = cos(2 −4π)
(f) tan4 = tan(4 − π) = tan(4 −2π) = tan(4 −3π)
3 An angle θ is suchthat cos θ > 0 andtan θ < 0.In
whichquadrant does θ lie?
4 An angle α is suchthattan α > 0 andsin α < 0.In
whichquadrant does α lie?
Solutions
3 4thquadrant 4 3rd quadrant
3.4 THESINE,COSINEANDTANGENTFUNCTIONS
The sine, cosine and tangent functions follow directly from the trigonometric ratios.
These are defined to be f (x) = sinx, f (x) = cosx and f (x) = tanx. Graphs can be
constructedfromatableofvaluesfoundusingascientificcalculator.Theyareshownin
Figures3.7 to3.9. Notethatthese functions aremany-to-one.
3.4 The sine, cosine and tangent functions 121
f (x)
f (x)
1
1
p
– –2
p
–p
p
–
2
2p
3p
x
–p
p
2p
x
–1
–1
Figure3.7
Graphof f(x) = sinx.
Figure3.8
Graphof f(x) = cosx.
f (x)
0 p 2p x
Figure3.9
Graphof f(x) = tanx.
By shifting the cosine function to the right by an amount π/2 the sine function is
obtained. Similarly, shifting the sine function to the left by π/2 results in the cosine
function. This interchangeability between the sine and cosine functions is reflected in
their being commonly referred to as sinusoidal functions. Notice also from the graphs
two importantpropertiesofsinx and cosx:
sinx = −sin(−x)
cosx =cos(−x)
For example,
sin π (
3 = −sin − π )
3
and cos π (
3 = cos − π )
3
Note that f(x) = sinx is 0 whenx = 0,±π,±2π,±3π,..., ±nπ,.... Also f(x) =
cosxis0whenx=± π 2 ,±3π 2 ,±5π 2 ,..., ±(2n+1)π 2 ,....
3.4.1 Inversetrigonometricfunctions
Allsixtrigonometricfunctionshaveaninversebutwewillonlyexaminethoseofsinx,
cosxandtanx.Theinversefunctionsofsinx,cosxandtanxaredenotedsin −1 x,cos −1 x
and tan −1 x. This notation can and does cause confusion. The ‘−1’ in sin −1 x is sometimesmistakenlyinterpretedasapower.Wewrite
(sinx) −1 todenote1/sinx.Valuesof
sin −1 x, cos −1 x and tan −1 x can be found using a scientific calculator. Ify = sinx, then
x = sin −1 y, as in
sinx = 0.7654 x = sin −1 (0.7654) = 0.8717
122 Chapter 3 The trigonometric functions
sin x
1
–1 1 x
sin –1 x
– p –2
p –2 x
–1
Figure3.10
Thefunctionsinx isone-to-one [ ] ifthe
domainis restrictedto − π 2 , π .
2
–1
p –2
– p –2
Figure3.11
Theinverse sine
function, sin −1 x.
1
x
Figure3.12
A single input produces many
outputvalues.Thisisnotafunction.
Notethaty = sinxisamany-to-onefunction.Ifthedomainisrestrictedto[−π/2, π/2]
then the resulting function isone-to-one. This isshown inFigure 3.10.
Recall from Section 2.3 that a one-to-one function has a corresponding inverse. So
if the domain of y = sinx is restricted to [−π/2,π/2], then an inverse function exists.
A graph ofy = sin −1 x is shown in Figure 3.11. Without the domain restriction, a
one-to-many graph would result as shown in Figure 3.12. To denote the inverse sine
function clearly, we write
y=sin −1 x − π 2 y π 2
Example3.2 Useascientific calculatortoevaluate
(a) sin −1 (0.3169)
(b) sin −1 (−0.8061)
Solution (a) sin −1 (0.3169) = 0.3225
(b) sin −1 (−0.8061) = −0.9375
A word of warning about inverse trigonometric functions is needed. The calculator returns
a value of 0.3225 for sin −1 (0.3169). Note, however, that sin(0.3225 ± 2nπ) =
0.3169, n = 0,1,2,3,..., so there are an infinite number of values of x such that
sinx = 0.3169. Only one of these values is returned by the calculator. This is because
the domain ofy = sinx is restricted to ensure it has an inverse function. To ensure the
inverse functionsy = cos −1 x andy = tan −1 x can be obtained, restrictions are placed
on the domains of y = cosx and y = tanx. By convention, y = cosx has its domain
restrictedto[0,π] whereas withy = tanx the restrictionis (−π/2,π/2).
3.5 The sincxfunction 123
EXERCISES3.4
1 Evaluate the following:
(a) sin −1 (0.75) (b) cos −1 (0.625) (c) tan −1 3
(d) sin −1 (−0.9) (e) cos −1 (−0.75) (f) tan −1 (−3)
3 Sketchy = sinx andy = cosx forxin the interval
[−2π,2π].Markon the graphs the pointswhere
x=1,x=1.5,x=−3andx=2.3.
2 Showthat
sin −1 x +cos −1 x = π 2
Solutions
1 (a) 0.8481 (b) 0.8957 (c) 1.2490 (d) −1.1198 (e) 2.4189 (f) −1.2490
TechnicalComputingExercises3.4
1 Using atechnicalcomputing languagedrawy = sinx
andy = cosx for0 x4π onthe same axes. Use
your graphs to findapproximate solutionsto the
equationsinx = cosx.
2 Draw the graphs ofy = cos −1 x andy = tan −1 x.
3.5 THESINCxFUNCTION
Thesinefunctionisusedtodefineanotherimportantfunctionusedinengineering.The
cardinalsinefunction,sincx,occursfrequentlyinengineeringmathematicsinapplicationsrangingfromcommunications,powerelectronics,digitalsignalprocessing(d.s.p.)
and optical engineering. The standard sincxfunction without normalization is plotted
inFigure 3.13 and isdefined by
⎧
⎨ sinx
x≠0
sincx = x
⎩
1 x=0
It is necessary to separately define the value of the function at x = 0 because the
definitionofthefunction, sin x ,wouldcauseadivisionbyzeroatthispoint.Fortunately
x
it can be shown that asxapproaches 0, then sinx approaches 1, and so specifying the
x
valuesincx=1atx = 0isadequatetoresolvetheproblem.Notethatthesincfunction
without normalization is 0 whenx = ...,−3π, −2π, −π, π,2π, 3π,..., so that apart
fromx = 0 where sincx = 1 by definition, it has the same zero crossing points as that
ofsinx.
Some engineers use a different definition of the sincxfunction:
⎧
⎨ sinπx
x≠0
sincx = πx
⎩
1 x=0
This iscommonly termed thenormalized sinc function.
124 Chapter 3 The trigonometric functions
Sinc x
1
0.75
0.5
0.25
–4p –3p –2p –p p 2p 3p 4p
x
–0.25
Figure3.13
The standard sincfunction withoutnormalization.
Engineeringapplication3.1
Zerocrossingpointsofthenormalizedsinc(x)function
The sinc function often appears in communications engineering and signal processing.Ingeneral,asignalcanberegardedasbeingmadeofalargenumberofindividual
frequencycomponents.Inthecaseofasquarepulse,thegraphshowingtheamplitude
of these components against their frequencies takes the formof a sincfunction.
Find the zero crossing points of the normalized sincfunction,
⎧
⎨ sin(πx)
x≠0
sinc(x) = πx
⎩
1 x=0
Solution
It is evident from Figure 3.13 that the sinc function which has not been normalized
hasthesamezerocrossingpointsassinx,withtheexceptionofthecrossingatx = 0,
thatisatx=±π,±2π,±3π,...,±nπ,....
Thenormalizedsincfunctionis0whensin(πx)is0.Now,frompage121wesee
that
and so
sinx=0whenx=...−3π,−2π,−π,0,π,2π,3π,...
sinπx=0whenπx=...−3π,−2π,−π,0,π,2π,3π,...
and hence
sin(πx) =0whenx = −3,−2,−1,0,1,2,3...
Noting thatsinc(0) isdefined tohave a value of 1,then wesee that
sinc(x)=0whenx=...−3,−2,−1,1,2,3,...
3.6 Trigonometric identities 125
or
sinc(x)=0whenx=±nforn=1,2,3,...
The plot ofthe normalized sincfunction isshown inFigure 3.14.
Sinc x
1
0.75
0.5
0.25
–15 –10 –5 5 10
x
15
–0.25
Figure3.14
The sincfunctionwith normalization.
3.6 TRIGONOMETRICIDENTITIES
We have seen that tanθ = sinθ for all values of θ. We call this an identity. In an
cos θ
identity, the l.h.s. and the r.h.s. are always equal, unlike in an equation where the l.h.s.
and the r.h.s. are equal only for particular values of the variable concerned. Table 3.1
listssome common trigonometric identities.
Example3.3 Simplify
cotA
cosA
Solution We know that
tanA= sinA
cosA
and so
cotA= 1
tanA = cosA
sinA
Hence
cotA
cosA =cotA× 1
cosA
= cosA
sinA × 1
cosA
= 1
sinA
This may alsobewritten as cosecA.
126 Chapter 3 The trigonometric functions
Table3.1
Common trigonometricidentities.
tanA= sinA
cosA
sin(A ±B) =sinAcosB ±sinBcosA
cos(A ±B) =cosAcosB ∓sinAsinB
tan(A ±B) = tanA±tanB
1∓tanAtanB
2sinAcosB =sin(A +B) +sin(A −B)
2cosAcosB =cos(A +B) +cos(A −B)
2sinAsinB =cos(A −B) −cos(A +B)
sin 2 A+cos 2 A=1
1 +cot 2 A = cosec 2 A
tan 2 A+1=sec 2 A
cos2A=1−2sin 2 A=2cos 2 A−1=cos 2 A−sin 2 A
sin2A=2sinAcosA
sin 2 A = 1−cos2A
2
cos 2 A = 1+cos2A
2
Note: sin 2 Ais the notationused for (sinA) 2 .Similarly cos 2 A means (cosA) 2 .
Example3.4 Show that
tanA+cotA
maybewritten as
2
sin2A
Solution We have
and so
tanA= sinA
cosA ,
cotA=cosA sinA
tanA+cotA= sinA
cosA + cosA
sinA
= sin2 A +cos 2 A
sinAcosA
1
= usingthe identity sin 2 A +cos 2 A = 1
sinAcosA
2
=
2sinAcosA
= 2 using the identity sin2A = 2sinAcosA
sin2A
3.6 Trigonometric identities 127
Example3.5 Usethe( identities ) inTable 3.1( tosimplify ) the following expressions:
π 3π
(a) sin
2 + θ (b) cos
2 − θ
(c) tan(2π − θ) (d) sin(π − θ)
( π
)
Solution (a) The expression sin
2 + θ isof the form sin(A +B) and so we use the identity
sin(A +B) =sinAcosB +sinBcosA
PuttingA = π andB = θ weobtain
2
( π
)
sin
2 + θ = sin π 2 cos θ +sin θ cos π 2
We note thatsin π 2 = 1,cos π 2 =0andso
( π
)
sin
2 + θ =1cosθ +sinθ(0)
=cosθ
( ) 3π
(b) The expression cos
2 − θ has the form cos(A −B); hence we use the identity
cos(A −B) =cosAcosB +sinAsinB
PuttingA = 3π ,B = θ weobtain
2
( ) 3π
cos
2 − θ = cos 3π 2 cos θ +sin 3π 2 sinθ
Now cos 3π 2 = 0,sin 3π = −1 and so
2
( ) 3π
cos
2 − θ =0cosθ+(−1)sinθ
(c) We use the identity
= −sin θ
tan(A −B) = tanA−tanB
1+tanAtanB
SubstitutingA = 2π,B = θ we obtain
tan(2π − θ) =
tan2π −tan θ
1+tan2πtanθ
Since tan2π = 0 this simplifies to
tan(2π − θ) = −tanθ
1
(d) We use the identity
=−tanθ
sin(A −B) =sinAcosB −sinBcosA
128 Chapter 3 The trigonometric functions
SubstitutingA = π,B = θ, this thenbecomes
sin(π−θ)=sinπcosθ −sinθcosπ
Now sinπ = 0,cos π = −1 and so we obtain
sin(π−θ)=0cosθ −sinθ(−1)=sinθ
Example3.6 Simplify
(a) cos(π + θ)
(b) tan(π − θ)
(c) sin 3 B +sinBcos 2 B
(d) tanA(1 +cos2A)
Solution (a) Usingthe identityfor cos(A +B) withA = π,B = θ weobtain
cos(π+θ)=cosπ cosθ −sinπ sinθ
= (−1)cosθ − (0)sinθ
= −cos θ
(b) Usingthe identityfor tan(A −B) withA = π,B = θ weobtain
tan(π−θ)= tanπ−tanθ
1+tanπtanθ
= 0−tanθ
1+(0)tanθ
=−tanθ
(c) sin 3 B +sinBcos 2 B =sinB(sin 2 B +cos 2 B)
=sinB since sin 2 B +cos 2 B = 1
(d) Firstlywe notethat tanA = sinA . Also wehave from Table 3.1
cosA
cos 2 A = 1+cos2A
2
from which
Hence
1+cos2A=2cos 2 A
tanA(1 +cos2A) = sinA
cosA 2cos2 A
=2sinAcosA
= sin2A
3.6 Trigonometric identities 129
Example3.7 Show that
( ) ( ) A +B A −B
sinA+sinB=2sin cos
2 2
Solution Consider the identities
sin(C +D) = sinCcosD +sinDcosC
sin(C −D) = sinCcosD −sinDcosC
Byadding these identities weobtain
sin(C +D) +sin(C −D) =2sinCcosD
We now make the substitutionsC +D =A,C −D =Bfromwhich
C = A +B , D = A −B
2 2
Hence
( ) ( ) A +B A −B
sinA+sinB=2sin cos
2 2
TheresultofExample3.7isoneofmanysimilarresults.ThesearelistedinTable3.2.
Table3.2
Further trigonometricidentities
( A +B
sinA+sinB =2sin
2
( A −B
sinA−sinB =2sin
2
( A +B
cosA+cosB=2cos
2
( A +B
cosA−cosB=−2sin
2
)
cos
)
cos
)
cos
( ) A −B
2
( ) A +B
2
( ) A −B
2
( ) A −B
)
sin
2
Example3.8 Simplify
sin70 ◦ −sin30 ◦
cos50 ◦
Solution We note that the numerator, sin70 ◦ − sin30 ◦ , has the form sinA − sinB. Using the
identityfor sinA −sinBwithA = 70 ◦ andB = 30 ◦ wesee
( ) ( )
70
sin70 ◦ −sin30 ◦ ◦ −30 ◦ 70 ◦ +30 ◦
= 2sin cos
2 2
= 2sin20 ◦ cos50 ◦
130 Chapter 3 The trigonometric functions
Hence
sin70 ◦ −sin30 ◦
= 2sin20◦ cos50 ◦
cos50 ◦ cos50 ◦
= 2sin20 ◦
EXERCISES3.6
1 Usethe identities forsin(A ±B),cos(A ±B)and
tan(A ±B)to ( simplifythe ) following: ( )
(a) sin
θ − π 2
(b) cos
θ − π 2
(c) tan(θ + π) (d) sin(θ − π)
(e) cos(θ − π)
(g) sin(θ + π)
(
)
(i) sin 2θ + 3π 2
( )
π
(k) cos
2 + θ
(f) tan(θ −3π)
( )
(h) cos θ + 3π 2
( )
(j) cos
θ − 3π 2
2 Writedown the trigonometricidentityfortan(A
( )
+ θ).
BylettingA → π 2 showthat tan π
2 + θ can be
simplified to −cot θ.
3 (a) Bydividing the identity
sin 2 A +cos 2 A = 1 bycos 2 A showthat
tan 2 A +1 = sec 2 A.
(b) Bydividing the identity
sin 2 A +cos 2 A = 1 bysin 2 A showthat
1 +cot 2 A = cosec 2 A.
4 Simplify the followingexpressions:
(a) cosAtanA (b) sin θ cot θ
(c) tanB cosecB
( )
(d) cot2xsec2x
π
(e) tanθ tan
2 + θ (f) sin2t
cost
[Hint: seeQuestion 2.]
(g) sin 2 A +2cos 2 A
(h) 2cos 2 B −1
(i) (1 +cot 2 X)tan 2 X (j) (sin 2 A +cos 2 A) 2
(k) 1 2 sin2AtanA
(m)
sin2A
cos2A
(o) (tan 2 θ +1)cot 2 θ
5 Simplify
(a) sin110 ◦ −sin70 ◦
(b) cos20 ◦ −cos80 ◦
(c) sin40 ◦ +sin20 ◦
(d)
6 Show that
cos50 ◦ +cos40 ◦
√
2
sin60 ◦ +sin30 ◦
sin50 ◦ −sin40 ◦
isequivalent to
cos15 ◦
sin5 ◦
(l) (sec2 t −1)cos 2 t
(n)
sinA
sin2A
(p) cos2A +2sin 2 A
Solutions
1 (a) −cos θ (b) sin θ (c) tan θ
(d) −sin θ (e) −cos θ (f) tan θ
(g) −sin θ (h) sin θ (i) −cos2θ
(j) −sin θ (k) −sin θ
4 (a) sinA (b) cos θ (c) secB
(d) cosec 2x (e) −1 (f) 2sint
(g) 1 +cos 2 A (h) cos2B (i) sec 2 X
(j) 1 (k) sin 2 A (l) sin 2 t
(m) tan2A
(p) 1
(n)
1
2 secA (o) cosec 2 θ
5 (a) 0 (b) sin50 ◦ (c) cos10 ◦ (d) cos5 ◦
3.7 Modelling waves using sint and cost 131
3.7 MODELLINGWAVESUSINGSINt ANDCOSt
Examiningthegraphsofsinxandcosxrevealsthattheyhaveasimilarshapetowaves.
In fact, sine and cosine functions are often used to model waves and we will see in
Chapter 23 that almost any wave can be broken down into a combination of sine and
cosinefunctions.Themainwavesfoundinengineeringareonesthatvarywithtimeand
sot isoften the independent variable.
The amplitude of a wave is the maximum displacement of the wave from its mean
position.So,forexample,sint andcost haveanamplitudeof1,theamplitudeof2sint
is2,and the amplitude ofAsint isA(see Figure 3.15).
The amplitude ofAsint isA. The amplitude ofAcost isA.
AmoregeneralwaveisdefinedbyAcos ωt orAsin ωt.Thesymbol ω representsthe
angularfrequencyofthewave.Itismeasuredinradianspersecond.Forexample,sin3t
hasanangularfrequencyof3rads −1 .Ast increasesby1secondtheangle,3t,increases
by 3 radians. Note thatsint has anangular frequency of1rad s −1 .
The angular frequency ofy =Asin ωt andy =Acos ωt is ω radians per second.
The sine and cosine functions repeat themselves at regular intervals and so are periodic
functions. Looking at Figure 3.7 we see that one complete cycle of sint is completedevery
2πseconds.Thetimetaken tocomplete one fullcycle iscalled theperiod
andisdenotedbyT.Hencetheperiodofy = sint is2πseconds.Similarlytheperiodof
y = cost is2πseconds.Mathematicallythismeansthataddingorsubtractingmultiples
of 2πtot does notchange the sineor cosine of thatangle.
sint=sin(t±2nπ)
cost=cos(t±2nπ)
n=0,1,2,3,...
n=0,1,2,3,...
Inparticular we note that
sint = sin(t +2π)
cost = cos(t +2π)
We now considery =Asin ωt andy =Acos ωt. Whent = 0 seconds, ωt = 0 radians.
Whent = 2π ω seconds, ωt = ω2π = 2π radians. We can see that ast increases from 0
ω
f (t)
A
2
A sin t
2 sin t
–2
2p
t
–A
Figure3.15
The amplitude of f (t) =Asint isA.
132 Chapter 3 The trigonometric functions
to 2π seconds, the angle ωt increases from 0 to 2π radians. We know that as the angle
ω
ωt increases by 2π radians then Asin ωt completes a full cycle. Hence a full cycle is
completed in 2π ω seconds, thatisthe period ofy =Asin ωt is 2π ω seconds.
Ify =Asin ωt ory =Acos ωt, thenthe periodT is 2π ω .
Inparticular wenote thatthe period ofy =Asint andy =Acost is2π.
Closelyrelatedtotheperiodisthefrequencyofawave.Thefrequencyisthenumber
ofcyclescompletedin1second.Frequency ismeasuredinunitscalledhertz(Hz).One
hertz isone cycle per second. We have seen thaty =Asin ωt takes
2π
seconds tocomplete one cycle
ω
and soitwill take
1 second tocomplete ω 2π cycles
We use f as the symbol forfrequency and so
frequency, f = ω 2π
( ) 3
For example, sin3t has a frequency of Hz.
2π
Note that by rearrangement wemay write
ω=2πf
and sothe wavey =Asin ωt may alsobewritten asy =Asin2πft.
Fromthe definitions of period and frequency wecan see that
1
period =
frequency
that is
T = 1 f
We see that the period is the reciprocal of the frequency. Identical results apply for the
wavey =Acosωt.
A final generalization is to introduce a phase angle or phase, φ. This allows the
wave to be shifted along the time axis. It also means that either a sine function or a
cosine function can be usedtorepresent the same wave. So the general formsare
Acos(ωt + φ), Asin(ωt + φ)
Figure 3.16 depictsAsin(ωt + φ). Note from Figure 3.16 that the actual movement of
the wave along the time axis is φ/ω. Itis easytoshow thismathematically:
(
Asin(ωt +φ) =Asinω t + φ )
ω
The quantity φ iscalled thetimedisplacement.
ω
3.7 Modelling waves using sint and cost 133
f (t)
A
T = 1– 2p –v f =
f –
v
t
Figure3.16
The generalized waveAsin(ωt + φ).
The waves met in engineering are often termed signals or waveforms. There are
no rigid rules concerning the use of the words wave, signal and waveform, and often
engineers use them interchangeably. We will followthis convention.
Example3.9 Statethe amplitude, angular frequency and periodofeachofthe following waves:
(a) 2sin3t
1
(b)
(2t
2 cos + π )
6
Solution (a) Amplitude,A = 2,angular frequency, ω = 3,period,T = 2π ω = 2π 3 .
(b) Amplitude,A = 0.5, angular frequency, ω = 2,period,T = 2π ω = 2π 2 = π.
Example3.10 Statethe amplitude, period, phase angle and time displacement of
(a) 2sin(4t +1)
2cos(t −0.7)
(b)
( 3 ) 2t+1
(c) 4cos
3
( )
3 4t
(d)
4 sin 3
Solution (a) Amplitude = 2, period = 2π 4 = π , phase angle = 1 relative to 2sin4t, time
2
displacement = 0.25.
(b) Amplitude = 2 3 ,period = 2π,phaseangle = −0.7relativeto 2 3 cost,timedisplacement
= −0.7.
(c) Amplitude = 4, period = 2π
2/3 = 3π, phase angle = 1 ( ) 2t
3 relative to 4cos , time
3
displacement = 0.5.
(d) Amplitude = 3 2π
,period =
4 4/3 = 3π 2 ,phaseangle = 0relativeto 3 ( ) 4t
4 sin ,time
3
displacement = 0.
134 Chapter 3 The trigonometric functions
Engineeringapplication3.2
Alternatingcurrentwaveformsandtheelectricitysupply
Alternatingcurrentwaveformsareoftenfoundinengineering.Theelectricitysupply
to homes and businesses most often takes the form of an alternating current. This is
becauseitisfareasiertosupplyalternatingcurrentelectricitythandirectcurrentelectricitywhendistributingacrosslongdistances.Energylossesalongthesupplycables
can be reduced by transforming the electricity to high voltages prior to distribution
butelectricity transformersonly work with alternating currents.
Sineandcosinefunctionsareoftenusedtomodelalternatingcurrent(a.c.)waveforms.The
equations foran a.c. waveform are
I =I m
sin(ωt +φ) or I =I m
cos(ωt +φ)
whereI m
=maximumcurrent, ω =angularfrequencyand φ =phaseangle.Inpracticethefunctionscanbeshiftedalongthetimeaxisbygiving
φ anon-zerovalueand
so both the sine and the cosine function can be used to model any a.c. waveform;
which one isused isusually a matter of convenience.
The angular frequency, ω, can be written as ω = 2πf, where f is the frequency
of the waveform in Hertz (Hz). The frequency of the electricity supply in Europe
and across large parts of the world is 50 Hz, while in the Americas and in areas of
Asiaitis60Hz.Alternatingcurrentsuppliesarealsofoundonships,submarinesand
aircraftbut these often use 400 Hz as the operating frequency.
3.7.1 Combiningwaves
There are many situations in which engineers need to combine two or more waves
together to form a single wave. It is possible to make use of trigonometric identities
to calculate the resulting waveform when several waves are combined. Consider the
following example.
Engineeringapplication3.3
Combiningtwosinusoidalvoltagesignals
Two voltage signals, v 1
(t) and v 2
(t), have the following mathematical expressions:
v 1
(t) =3sint
v 2
(t) =2cost
(a) State the amplitude and angular frequency ofthe two signals.
(b) Obtain an expression forthe signal, v 3
(t), given by
v 3
(t) = v 1
(t)+2v 2
(t)
(c) Reduce the expression obtained in part (b) to a single sinusoid and hence state
its amplitude and phase.
3.7 Modelling waves using sint and cost 135
Solution
(a) v 1
(t) has an amplitude of 3 volts and an angular frequency ω = 1 rad s −1 . v 2
(t)
has an amplitude of 2 volts and an angular frequency ω = 1 rad s −1 . Note that
both of these signals have the same angular frequency.
(b) v 3
(t)=v 1
(t)+2v 2
(t)
=3sint+2(2cost)
=3sint+4cost
(c) We wish to write v 3
(t) in the formRsin(t + φ). The choice of sine is arbitrary.
We could have chosen cosine instead.Ris the amplitude of the single sinusoid
and φ isits phase angle.
Using the trigonometric identity sin(t + φ) = sintcos φ +sinφcost found
inTable 3.1 wecan write
Rsin(t +φ) =R(sintcosφ +sinφcost)
= (Rcosφ)sint +(Rsinφ)cost
Comparing thisexpression with thatfor v 3
(t) wenotethat, inorder tomake the
expressions identical,
Rcosφ =3 (3.1)
Rsinφ =4 (3.2)
Weneedtosolve(3.1)and(3.2)toobtainRand φ.Squaringeachequationgives
R 2 cos 2 φ=9
R 2 sin 2 φ = 16
Adding these equations together weobtain
R 2 cos 2 φ+R 2 sin 2 φ=9+16
R 2 (cos 2 φ +sin 2 φ) = 25
Usingthe identitycos 2 φ +sin 2 φ = 1 thissimplifies to
R 2 =25
R = 5
Next wedetermine φ. Dividing (3.2) by (3.1) wefind
Rsinφ
Rcosφ = 4 3
tanφ = 4 3
)
φ = tan −1 ( 4
3
From (3.1) and (3.2) we can see that both sinφ and cos φ are positive, and so
φ must lie in the first quadrant. Calculating tan −1( 4
3)
using a calculator gives
φ = 0.927 radians. So we can express v 3
(t) as
v 3
(t) =3sint +4cost =5sin(t +0.927)
Finally v 3
(t) has amplitude 5 volts and phase 0.927 radians.
136 Chapter 3 The trigonometric functions
Engineering application 3.3 illustrates an important property when combining together
two sinusoidal waves of the same angular frequency.
If two waves of equal angular frequency, ω, areadded the resultisawave of the
same angular frequency, ω.
Infactthisresultholdstruewhencombininganynumberofwavesofthesameangular
frequency.
Engineeringapplication3.4
Combiningtwosinusoidalcurrentsignals
Two current signals,i 1
(t) andi 2
(t), have the following mathematical expressions:
i 1
(t) =10sin4t
i 2
(t) =5cos4t
(a) State the amplitude and angular frequency ofthe two signals.
(b) Obtain an expression forthe signal,i 3
(t), given byi 3
(t) = 0.3i 1
(t) −0.4i 2
(t).
(c) Reduce the expression obtained in part (b) to a single sinusoid in the form
Rcos(4t + φ) and hence stateits amplitude and phase.
Solution
(a) i 1
(t)hasanamplitudeof10ampsandanangularfrequency ω = 4 rad s −1 .i 2
(t)
has an amplitude of 5 amps and an angular frequency ω = 4 rad s −1 . Note that
both signals have the same angular frequency.
(b) i 3
(t) =0.3i 1
(t) −0.4i 2
(t)
(c) Let
=0.3×10sin4t−0.4×5cos4t
=3sin4t−2cos4t
3sin4t −2cos4t =Rcos(4t +φ)
Then using the trigonometric identity given inTable 3.1
cos(A +B) =cosAcosB −sinAsinB
withA=4tandB=φwefind
3sin4t −2cos4t =Rcos(4t +φ)
Hence
=R(cos4tcosφ −sin4tsinφ)
= (Rcosφ)cos4t −(Rsinφ)sin4t
3=−Rsinφ (3.3)
−2=Rcosφ (3.4)
Bysquaring each equation and adding we obtain
9+4=R 2 (sin 2 φ+cos 2 φ)=R 2
so thatR = √ 13.
3.7 Modelling waves using sint and cost 137
From (3.3) and (3.4), both sinφ and cosφ are negative and so φ lies in the
third quadrant. Division of (3.3)by (3.4) gives
3
−2 = −Rsin φ
Rcosφ =−tanφ
tanφ = 1.5
Usingacalculatorandnotingthat φ liesinthethirdquadrantwefind φ = 4.124.
Finally
3sin4t−2cos4t = √ 13cos(4t +4.124)
Soi 3
(t) = √ 13cos(4t +4.124).Thereforei 3
(t)hasanamplitudeof √ 13amps
and a phase of4.124 radians.
Example3.11 Express0.5cos3t +sin3t asasingle cosinewave.
Solution Let
0.5cos3t +sin3t =Rcos(3t + φ)
Hence
=R(cos3tcosφ −sin3tsinφ)
= (Rcosφ)cos3t −(Rsinφ)sin3t
0.5=Rcosφ (3.5)
1=−Rsinφ (3.6)
Bysquaring and adding weobtain
1.25 =R 2
R = √ 1.25 = 1.1180
Division of (3.6)by (3.5) yields
2=−tanφ
(4 d.p.)
From (3.5), cosφ is positive; from (3.6), sin φ is negative; and so φ lies in the fourth
quadrant. Hence, using a calculator, φ = 5.1760. So
0.5cos3t +sin3t = 1.1180cos(3t +5.1760)
Example3.12 Ifacos ωt +bsinωt is expressed in the formRcos(ωt − θ) show thatR = √ a 2 +b 2
and tanθ = b a .
Solution Let
acosωt+bsinωt =Rcos(ωt−θ)
Then, using the trigonometric identity forcos(A −B), we can write
acosωt+bsinωt =Rcos(ωt−θ)
=R(cosωtcosθ +sinωtsinθ)
= (Rcosθ)cosωt+(Rsinθ)sinωt
138 Chapter 3 The trigonometric functions
Equating coefficients ofcosωt and then sinωt gives
a=Rcosθ (3.7)
b=Rsinθ (3.8)
Squaring these equations and adding gives
that is
a 2 +b 2 =R 2
R = √ a 2 +b 2
Division of (3.8)by (3.7) gives
b
a =tanθ
as required.
Note that this example demonstrates that adding two waves of angular frequency ω
forms another wave having the same angular frequency but with a modified amplitude
and phase.
3.7.2 Wavelength,wavenumberandhorizontalshift
The sine and cosine waves described earlier in this section hadt as their independent
variable because the waves commonly met in engineering vary with time. There are
occasionswheretheindependentvariableisdistance,xsay,andinthiscasesomeofthe
terminology changes.Consider the wave
y =Asin(kx+φ)
Asbefore,Aistheamplitudeofthewave.Thequantitykiscalledthewavenumber.It
playsthesameroleasdidtheangularfrequency, ω,whent wastheindependentvariable.
The length of one cycle of the wave, that is the wavelength, commonly denoted λ, is
related tokby the formula λ = 2π . The phase angle is φ and its introduction has the
k
effect of shiftingthe graph horizontally.
Example3.13 Figure 3.17 shows a graph ofy = sin2x.
(a) State the wave numberforthiswave.
(b) Findthe wavelength ofthe wave.
(c) State the phase angle.
Solution (a) Comparingy = sin2x withy = sinkx wesee thatthe wave number,k, is2.
(b) The wavelength, λ = 2π = π. Note by observing the graph that this result is
k
consistent inthat the distance required for one cycle of the wave is π units.
(c) Comparingy = sin2x with sin(kx + φ)we see thatthe phase angle, φ, is 0.
3.7 Modelling waves using sint and cost 139
y
1
–2p
–p
0
p
2p
x
–1
Figure3.17
A graphofthe wavey = sin2x.
(
Example3.14 Figure 3.18 shows a graph ofsin 2x + π )
.
3
(a) State the phase angle.
(b) BycomparingFigures3.17and3.18weseethattheintroductionofthephaseangle
has caused a horizontal shift of the graph (tothe left).Calculate thisshift.
y
1
–2p
–p 0
p 2p x
–1
p –6
Figure3.18
Agraph(
ofthe wave )
y=sin 2x + π .
3
(
Solution (a) By comparing sin 2x + π )
with sin(kx + φ) wesee thatthe phase angle is π 3
3 .
(
(b) By writingy = sin 2x + π ) (
as sin2 x + π )
we note that this isy = sin2x
3 6
shifted tothe left by a horizontal distance π 6 units.
The results of this example can be generalized. The wave y = Asin(kx + φ) can be
writteny = Asink(x + φ/k) so that a phase angle of φ introduces a horizontal shift of
length φ/k. (Compare thiswith the expression fortime displacement inSection3.7.)
Noting that λ = 2π k ,thenk=2π and we may write Asin(kx + φ) equivalently
( )
λ 2πx
as Asin
λ + φ . Again, using k = 2π λ , the horizontal shift, φ , may similarly be
k
written as
horizontal shift = φ k =
φ
2π/λ = φλ
2π
140 Chapter 3 The trigonometric functions
from which
phase angle = φ =
2π ×horizontal shift
λ
This result is important in the engineering application that follows because, more generally,
when any two waves arrive at a receiver it enables the difference in their phases,
φ, tobe calculated from knowledge of the horizontal shift between them.
Thepresenceof φ iny =Asin(kx + φ)causesahorizontal(left)shiftof φ k = φλ
2π .
Note that adding any multiple of 2π onto the phase angle φ will result in the same
graphbecauseoftheperiodicityofthesinefunction.Consequently,aphaseanglecould
(
be quoted as φ + 2nπ. For example, the wave sin 2x + π )
is the same wave as
3
sin
(2x + π )
3 +2π , sin
(2x + π )
3 +4π and so on. Normally, we would quote a value
of the phase thatwas lessthan 2πby subtracting multiples of 2πasnecessary.
Engineeringapplication3.5
Two-raypropagationmodel
Itisveryusefultobeabletomodelhowanelectromagneticwaveemittedbyatransmitter
propagates through space, in order to predict what signal is collected by the
receiver. This can be quite a complicated modelling exercise. One of the simplest
models is the two-ray propagation model. This model assumes that the signal received
consists of two main components. There is the signal that is sent direct from
the transmitter to the receiver and there is the signal that is received after being reflectedoff
the ground.
Figure 3.19 shows a transmitter with height above the groundh t
together with a
receiver with height above the groundh r
. The distance between the transmitter and
the receiver alongthe ground isd.
T
S
h t
Transmitter
O
Ground
d d
d r
d
A
R
Receiver
P
h r
Q
Figure3.19
Atransmitterandreceiver at
different heightsabove the
ground.
Note thatthe quantitiesh t
,h r
andd areall known.
3.7 Modelling waves using sint and cost 141
Waves can be considered to propagate between the transmitter and the receiver
in two ways. There is the direct route between transmitter and receiver. The direct
distance between transmitter and receiver is d d
. We obtain an expression for d d
in
terms of the known quantitiesh t
,h r
andd by considering the triangle RST. In this
triangle, RS =dand ST = TO − SO =h t
−h r
. Hence by Pythagoras’s theorem in
RST we have
and so
TR 2 = RS 2 +ST 2
d 2 d =d2 +(h t
−h r
) 2
d d
=
√
d 2 +(h t
−h r
) 2
Note thatd d
isexpressed intermsof the known quantitiesh t
,h r
andd.
ThereisalsoaroutewherebyawaveisreflectedoffthegroundatpointAbefore
arrivingatthereceiver.ThepointAonthegroundissuchthat ̸ TAOequals ̸ RAP.
The distance travelled in this case isd r
= TA +AR. We wish to find an expression
ford r
intermsoftheknownquantitiesh t
,h r
andd.Inordertosimplifythecalculation
of this distance we construct an isosceles triangle, TAQ, in which TA = QA and
̸ TAO = ̸ QAO. Note thatinthistriangle,TO = QO =h t
.
Then the distance travelled by thisreflected wave,d r
, is
distance travelled =d r
= TA +AR
=QA+AR
= QR
Consider now QSR. QR is the hypotenuse of this triangle. So by Pythagoras’s
theorem wehave
d 2 r
= QR 2 = SR 2 +SQ 2
WehaveSR=dandSQ=QO+SO=h t
+h r
.So
d 2 r
=d 2 +(h t
+h r
) 2
from which
√
d r
= d 2 +(h t
+h r
) 2
Now if the wavelength of the transmitted wave is λ then we can calculate the phase
differencebetweenthedirectwaveandthereflectedwave, φ,bynotingthedifference
in the distance travelled,d r
−d d
. Using the result for phase difference from Section
3.7.2 wehave
φ = 2π λ ×horizontal shift = 2π λ (d r −d d )
φ = 2π (√
)
d
λ
2 +(h t
+h r
) 2 −
√d 2 +(h t
−h r
) 2
⎛ √
φ = 2π ( )
√ ⎞
ht +h 2 ( )
⎝d 1 + r ht −h 2
−d 1 + r ⎠
λ d d
➔
142 Chapter 3 The trigonometric functions
Now the binomial expansion for √ (1 +x)is (see Section 6.4)
√
(1+x)=(1+x) 1/2 =1+ x 2 − x2
8 + x3
16 −···≈1+x 2 , if|x|<1
Usingthisexpansionintheexpressionfor φ andnotingthatthemoduliofboth(h t
+
h r
)/d and (h t
−h r
)/d arelessthan 1,we have
φ ≈ 2πd (1 + (h t +h r )2
−1− (h t −h )
r )2
λ 2d 2 2d 2
Expanding the bracketed termsgives
φ ≈ 2πd
(
h
2
t
+2h t
h r
+h 2 r −h2 t
+2h t
h r
−hr)
2
λ 2d 2
So
φ ≈ 4h t h r π
λd
This is a simplified approximation for the phase difference between the direct wave
andthereflectedwave.Notethatitdependsontheheightofthetransmitter,theheight
of the receiver and the distance between the transmitter and the receiver.
Thiscalculationisimportantbecauseundersomeconditionsthephasedifference
between the two paths means that the directed and reflected waves destructively
interfere.Inseverecasesthiscausesthesignaltodecreaseatthereceiverenoughso
that the communications link is lost. The effect is often termed multipath-induced
fading.
EXERCISES3.7
1 Statethe amplitude,angularfrequency, frequency,
phase angleandtime displacement ofthe following
waves:
(a) 3sin2t (b) 1 sin4t
2
(c) sin(t +1)
(d) 4cos3t (e) 2sin(t −3) (f) 5cos(0.4t)
(g) sin(100πt) (h) 6cos(5t +2) (i) 2 3 sin(0.5t)
(j) 4cos(πt −20)
2 Statethe period of
(a) 2sin7t (b) 7sin(2t +3)
(c) tan t 2
(e) cosec(2t −1)
(d) sec3t
( )
2t
(f) cot
3 +2
3 Avoltage sourceproduces atime-varying voltage,
v(t),given by
v(t) =15sin(20πt +4)
(a) Statethe amplitudeof v(t).
t 0
(b) Statethe angularfrequency of v(t).
(c) Statethe periodof v(t).
(d) Statethe phaseof v(t).
(e) Statethe time displacement of v(t).
(f) Statethe minimumvalue of v(t).
4 Asinusoidalfunctionhas an amplitude of 2 3 anda
periodof2. Stateapossibleform ofthe function.
5 Statethe phase angleandtime displacementof
(a) 2sin(t +3) relative to 2sint
(b) sin(2t −3) relative to sin2t
( )
t
(c) cos
2 +0.2 relative to cos t 2
(d) cos(2 −t)relative to cost
( )
3t+4
(e) sin relative to sin 3t
5 5
(f) sin(4 −3t) relative to sin3t
3.7 Modelling waves using sint and cost 143
(g) sin(2πt + π) relative to sin2πt
(h) 3cos(5πt −3) relative to 3cos5πt
( )
πt
(i) sin
3 +2 relative to sin πt
3
(j) cos(3π −t)relative to cost
6 Writeeachofthe followingin the form
Asin(3t+θ),θ 0:
(a) 2sin3t +3cos3t
(b) cos3t −2sin3t
(c) sin3t −4cos3t
(d) −cos3t −4sin3t
7 Writeeachofthe followingin the form
Acos(t−θ),θ 0:
(a) 2sint−3cost
(b) 9sint+6cost
(c) 4cost−sint
(d) 3sint
8 Writeeachofthe followingexpressions in the form
(i)Asin(ωt + θ),(ii)Asin(ωt − θ),
(iii)Acos(ωt + θ),(iv)Acos(ωt − θ)where θ 0:
(a) 5sint+4cost
(c) 4sin2t −6cos2t
(b) −2sin3t+2cos3t
(d) −sin5t −3cos5t
Solutions
1 (a) 3,2, 1 π ,0,0 (b) 1
2 ,4, 2 π ,0,0
(c) 1,1, 1 3
,1,1 (d) 4,3,
2π 2π ,0,0
(e) 2,1, 1
1
, −3, −3 (f) 5,0.4,
2π 5π ,0,0
(g) 1,100π,50,0,0 (h) 6,5, 5
2π ,2,0.4
(i)
2 (a)
(d)
2
3 ,0.5, 1
4π ,0,0
2π
7
2π
3
(b) π
(e) π
(j) 4,π,1 , −20, −20
2 π
(c) 2π
(f)
3π
2
3 (a) 15 (b) 20π (c) 0.1
1
(d) 4 (e) (f) −15
5π
2
4
3 sin(πt +k)or 2 cos(πt +k)
3
5 (a) 3, 3 (b) −3, − 3 2
4
(d) −2, −2 (e)
5 , 4 3
(g) π, 1 (h) −3, − 3
2 5π
(j) −3π,−3π
√
6 (a) 13sin(3t +0.9828)
(b) √ 5sin(3t +2.6779)
√
(c) 17sin(3t +4.9574)
(d) √ 17sin(3t +3.3866)
(c) 0.2, 0.4
(f) −0.858,−0.286
(i) 2, 6
π
√
7 (a) 13cos(t −2.5536)
√
(b) 117cos(t −0.9828)
√
(c) 17cos(t −6.0382)
( )
(d) 3cos t − π 2
8 (a) (i) √ 41sin(t +0.675)
(ii) √ 41sin(t −5.608)
(iii) √ 41cos(t +5.387)
(iv) √ 41cos(t −0.896)
( )
√
(b) (i) 8sin 3t + 3π 4
(ii) √ ( )
8sin 3t − 5π 4
( )
(iii) √ 8cos
(
3t + π 4
)
(iv) √ 8cos 3t − 7π 4
√
(c) (i) 52sin(2t +5.300)
(ii) √ 52sin(2t −0.983)
(iii) √ 52cos(2t +3.730)
(iv) √ 52cos(2t −2.554)
√
(d) (i) 10sin(5t +4.391)
(ii) √ 10sin(5t −1.893)
(iii) √ 10cos(5t +2.820)
(iv) √ 10cos(5t −3.463)
144 Chapter 3 The trigonometric functions
TechnicalComputingExercises3.7
1 Ploty=sin2tfor0t 2π.
2 Ploty=cos3tfor0t 3π.
( )
t
3 Ploty = sin for0t4π.
2
( )
2t
4 Ploty = cos for0t6π.
3
5 Ploty = sint +3cost for0 t 3π.Byreading
from yourgraph,statethe maximumvalue of
sint+3cost.
6 (a) Ploty=2sin3t−cos3tfor0t 2π.
Useyourgraph to findthe amplitudeof
2sin3t −cos3t.
(b) On the same axes ploty = sin3t.Estimate the
time displacement of2sin3t −cos3t.
3.8 TRIGONOMETRICEQUATIONS
Weexaminetrigonometricequationswhichcanbewritteninoneoftheformssinz =k,
cosz = k or tanz = k, wherezis the independent variable andkis a constant. These
equationsallhaveaninfinitenumberofsolutions.Thisisaconsequenceofthetrigonometric
functions being periodic. For example, sinz = 1 has solutions z = ..., −7π
2 ,
−3π
2 , π 2 , 5π 2 ,....Thesesolutionscouldbeexpressedasz = π 2 ±2nπ,n=0,1,2,....
Sometimesitisuseful,indeednecessary,tostateallthesolutions.Atothertimesweare
interestedonlyinsolutionsinsomespecifiedinterval,forexamplesolvingsinz = 1for
0 z2π. The following examples illustratethe method of solution.
Example3.15 Solve
(a) sint = 0.6105 for0 t 2π (b) sint = −0.6105 for0 t 2π.
Solution Figure 3.20 shows a graph ofy = sint, with horizontal lines drawn aty = 0.6105 and
y = −0.6105.
(a) From Figure 3.20 we see that there are two solutions in the interval 0 t 2π.
These aregiven bypoints Aand B. We have
sint = 0.6105
and so, usingascientific calculator, wehave
t = sin −1 (0.6105) = 0.6567
This is the solution at point A. From the symmetry of the graph, the second solution
is
t = π −0.6567 = 2.4849
This isthe solution atpointB. Therequired solutions aret = 0.6567,2.4849.
(b) Again from Figure 3.20 we see that the equation has two solutions in the interval
0 t 2π.ThesearegivenbythepointsCandD.Onesolutionliesintheinterval
π to 3π 2 ; the other solution liesinthe interval 3π 2
sint = −0.6105
to2π. We have
3.8 Trigonometric equations 145
sin t
1
0.6105
0 p –2 3p –2
Figure3.20
A B p C D 2p
t
–0.6105
–1
Aand Bare solution pointsfor
sint = 0.6105.Cand Dare
solution pointsforsint = −0.6105.
and so, using a calculator, wesee
t = sin −1 (−0.6105) = −0.6567
Although this value oft is a solution of sint = −0.6105 it is outside the range of
values of interest.Recall that
sint = sin(t +2π)
that is,adding 2πtoanangle does notchange the sineof the angle. Hence
t = −0.6567 +2π = 5.6265
is a required solution. This is the solution given by point D. From the symmetry of
Figure 3.20 the other solution is
t = π +0.6567 = 3.7983
This isthe solution atpoint C. The required solutions aret = 3.7983, 5.6265.
Example3.16 Solve
(a) cost = 0.3685 for0 t 2π
(b) cost = −0.3685 for0 t 2π
Solution Figure 3.21 shows a graph of y = cost between t = 0 and t = 2π together with
horizontallines aty = 0.3685andy = −0.3685.
(a) FromFigure3.21weseethatthereisasolutionofcost = 0.3685between0and π 2
and a solution between 3π 2
and 2π. These aregiven by points Aand D.Now
cost = 0.3685
and so, using a scientific calculator, wesee
t = cos −1 (0.3685) = 1.1934
146 Chapter 3 The trigonometric functions
cos t
1
0.3685
A
B
p
D
C 2p t
–0.3685
0 p –2 3p –2
Figure3.21
–1
A andDare solution pointsfor
cost = 0.3685.Band Care
solutionpointsforcost = −0.3685.
This is the solution between 0 and π , that is at point A. Using the symmetry of
2
Figure 3.21 the other solution atpoint Dis
t = 2π −1.1934 = 5.0898
The required solutions aret = 1.1934 and 5.0898.
(b) The graph in Figure 3.21 shows there are two solutions of cost = −0.3685. These
solutions areatpoints Band C. Given
cost = −0.3685
then usingascientific calculator we have
t = cos −1 (−0.3685) = 1.9482
This isthe solution given by pointB. By symmetrythe other solution atpointCis
t = 2π −1.9482 = 4.3350
The required solutions aret = 1.9482 and 4.3350.
Example3.17 Solve
(a) tant = 1.3100for0 t 2π
(b) tant = −1.3100 for0 t 2π
Solution Figure3.22showsagraphofy = tant fort = 0tot = 2πtogetherwithhorizontallines
y = 1.3100 andy = −1.3100.
(a) Thereisasolutionoftant = 1.3100between0and π andasolutionbetween πand
2
3π
. These aregiven by points Aand C.
2
tant = 1.3100
t = tan −1 (1.3100) = 0.9188
3.8 Trigonometric equations 147
tan t
2
1.3100
1
0
p
A –2 3p
B p C –2 D 2p t
–1
–1.3100
–2
Figure3.22
Aand Care solution pointsfortant = 1.3100. B andDare
solution pointsfortant = −1.3100.
This isthe solution between 0 and π given by point A.UsingFigure 3.22 wecan
2
see thatthe second solution isgiven by
t = π +0.9188 = 4.0604
This isgiven by point C.
(b) Figure 3.22 shows that there are two solutions of tant = −1.3100, one between π 2
and π,theotherbetween 3π 2 and2π.PointsBandDrepresentthesesolutions.Using
a scientific calculator we have
t = tan −1 (−1.3100) = −0.9188
This solution is outside the range of interest. Noting that the period of tant is π we
see that
t = −0.9188 + π = 2.2228
isasolution between π and π. This isgiven by pointB. The second solution is
2
t = −0.9188 +2π = 5.3644
This is the solution given by point D. The required solutions aret = 2.2228 and
5.3644.
148 Chapter 3 The trigonometric functions
Example3.18 Solve
(a) sin2t = 0.6105 for0t 2π
(b) cos(3t +2) = −0.3685for0 t 2π
Solution (a) Letz = 2t.Ast variesfrom0to2πthenzvariesfrom0to4π.Thustheproblemis
equivalent tosolving
sinz=0.6105
0z4π
From Example 3.15 the solutions between 0 and 2π are 0.6567 and 2.4849. Since
sinz has period 2π, then the solutions in the next cycle, that is between 2π and 4π,
arez = 0.6567 +2π = 6.9399 andz = 2.4849 +2π = 8.7681. Hence
z = 2t = 0.6567,2.4849,6.9399,8.7681
and so, tofour decimal places,
t = 0.3284,1.2425,3.4700,4.3840
(b) Letz = 3t +2. Ast varies from 0 to 2π thenzvaries from 2 to 6π +2. Hence the
problem isequivalent tosolving
cosz=−0.3685
2z6π+2
Solutions between 0 and 2π are given in Example 3.16 as z = 1.9482, 4.3350.
Notingthatcoszhasperiod2π,thensolutionsbetween2πand4πarez = 1.9482+
2π = 8.2314 andz = 4.3350 + 2π = 10.6182, solutions between 4π and 6π are
z = 1.9482+4π = 14.5146andz = 4.3350+4π = 16.9014andsolutionsbetween
6π and 8π arez = 1.9482 +6π = 20.7978 andz = 4.3350 +6π = 23.1846. The
solutions betweenz = 0 andz = 8πarethus
z = 1.9482,4.3350,8.2314,10.6182,14.5146,16.9014,20.7978,23.1846
Noting that 6π + 2 = 20.8496 we require values of z between 2 and 20.8496,
thatis
Finally
z = 4.3350,8.2314,10.6182,14.5146,16.9014,20.7978
t = z −2
3
= 0.7783,2.0771,2.8727,4.1715,4.9671,6.2659
Example3.19 Avoltage, v(t), isgiven by
v(t)=3sin(t+1)
t 0
Findthe first time thatthe voltage has a value of1.5 volts.
Solution We needtosolve 3sin(t +1) = 1.5,thatis
sin(t+1)=0.5
t0
Letz =t +1.Sincet 0 thenz 1.Theproblemisthus equivalent to
sinz=0.5
z1
Usingascientific calculator wehave
z = sin −1 (0.5) = 0.5236
3.8 Trigonometric equations 149
This solution is outside the range of interest. By reference to Figure 3.7 the next solution
is
z = π −0.5236 = 2.6180
This isthe first value ofzgreater than 1 such thatsinz = 0.5. Finally
t=z−1=1.6180
The voltage first has a value of 1.5 volts whent = 1.618 seconds.
EXERCISES3.8
1 Solvethe followingequations for0 t 2π:
(a) sint = 0.8426 (b) sint = 0.2146
(c) sint = 0.5681 (d) sint = −0.4316
(e) sint = −0.9042 (f) sint = −0.2491
2 Solvethe followingequations for0 t 2π:
(a) cost = 0.4243 (b) cost = 0.8040
(c) cost = 0.3500 (d) cost = −0.5618
(e) cost = −0.7423 (f) cost = −0.3658
3 Solvethe followingequations for0 t 2π:
(a) tant = 0.8493 (b) tant = 1.5326
(c) tant = 1.2500 (d) tant = −0.8437
(e) tant = −2.0612 (f) tant = −1.5731
4 Solvethe followingequations for0 t 2π:
(a)sin2t = 0.6347
(b)sin3t = −0.2516
( )
t
(c)sin = 0.4250
2
(d)sin(2t +1) = −0.6230
(e)sin(2t −3) = 0.1684
( )
t +2
(f) sin = −0.4681
3
5 Solve the following equations for0 t 2π:
(a) cos2t = 0.4234
( )
t
(b) cos = −0.5618
3
( )
2t
(c) cos = 0.6214
3
(d) cos(2t +0.5) = −0.8300
(e) cos(t −2) = 0.7431
(f) cos(πt −1) = −0.5325
6 Solve the following equations for0 t 2π:
(a) tan2t = 1.5234
( )
t
(b) tan = −0.8439
3
(c) tan(3t −2) = 1.0641
(d) tan(1.5t −1) = −1.7300
( )
2t+1
(e) tan = 1.0000
3
(f) tan(5t −6) = −1.2323
7 A time-varying voltage, v(t),has the form
v(t) =20sin(50πt +20)
t 0
Calculate the firsttime that the voltage has avalue of
(a)2volts (b)10volts (c)15volts
Solutions
1 (a) 1.0021,2.1395 (b) 0.2163,2.9253
(c) 0.6042,2.5374 (d) 3.5879,5.8369
(e) 4.2711,5.1537 (f) 3.3933,6.0314
2 (a) 1.1326,5.1506 (b) 0.6368,5.6464
(c) 1.2132,5.0700 (d) 2.1674,4.1158
(e) 2.4073,3.8759 (f) 1.9453,4.3379
150 Chapter 3 The trigonometric functions
3 (a) 0.7041, 3.8457 (b) 0.9927, 4.1343
(c) 0.8961, 4.0376 (d) 2.4408, 5.5824
(e) 2.0225, 5.1641 (f) 2.1370, 5.2786
4 (a) 0.3438,1.2270, 3.4854, 4.3686
(b) 1.1320,2.0096, 3.2264, 4.1040, 5.3208, 6.1984
(c) 0.8779,5.4053
(d) 1.4071,2.3053, 4.5487, 5.4469
(e) 1.5846,2.9862, 4.7262, 6.1278
(f) nosolutions
5 (a) 0.5668,2.5748, 3.7084, 5.7164
(b) nosolutions
(c) 1.3504
(d) 1.0250,1.6166, 4.1665, 4.7582
(e) 1.2669, 2.7331
(f) 0.9971, 1.6396, 2.9971, 3.6396, 4.9971, 5.6396
6 (a) 0.4950, 2.0658, 3.6366, 5.2073
(b) no solutions
(c) 0.9388, 1.9860, 3.0332, 4.0804, 5.1276, 6.1748
(d) 2.0633, 4.1577, 6.2521
(e) 0.6781, 5.3905
(f) 0.3939, 1.0222, 1.6505, 2.2788, 2.9071, 3.5355,
4.1638, 4.7921, 5.4204, 6.0487
7 (a) 1.2038 ×10 −2
(b) 9.3427 ×10 −3
(c) 7.2771 ×10 −3
TechnicalComputingExercises3.8
1 Ploty = sint for0 t 2π andy = 0.3500 using
the same axes. Useyour graphs to find approximate
solutionsto
sint =0.3500
0t 2π
2 Ploty = cost for0 t 2π andy = −0.5500using
the same axes. Useyour graphs to find approximate
solutionsto
cost+0.5500=0
0t2π
3 Ploty=sin(2t+1)andy=2sintfor0t 2π.
Useyourgraphs to stateapproximate solutionsto
sin(2t+1)=2sint
0t 2π
4 Ploty=2sin3tandy=3cos2tfor0t 2π.
Hence state approximate solutionsof
2sin3t =3cos2t
0t 2π
REVIEWEXERCISES3
1 Expressthe followingangles in radians:
(a) 45 ◦ (b) 72 ◦ (c) 100 ◦ (d) 300 ◦
(e) 440 ◦
2 Thefollowing angles are in radians. Expressthemin
degrees.
π 3π
(a) (b) 3π (c) (d)2 (e) 3.62
3 4
3 Statethe quadrant in whichthe angle α liesgiven
(a) sinα>0andtanα>0
(b) cosα>0andsinα<0
(c) tanα>0andcosα<0
(d) sinα<0andcosα<0
(e) tanα<0andcosα<0
4 Simplify the followingexpressions:
(a) sint cosect
sinx
(b)
tanx
cotA
(c)
cosA
secA
(d)
cosecA
(e) cotxtanx
5 Simplify the following expressions:
(a) cos 2 A +1+sin 2 A
2sinAcosA
(b)
cos 2 A −sin 2 A
√
(c) sec 2 x−1
(d) sintcost +
(e)
1
cosec 2 A −1
1
sect cosect
Review exercises 3 151
6 Simplify the following expressions:
(a) (sinx +cosx) 2 −1
(b) tanAsin2A +1+cos2A
sin4θ +sin2θ
(c)
cos2θ −cos4θ
(d) 4sinAcosAcos2A
sint
(e) ( )
t
cos
2
7 Statethe amplitude,angularfrequency, period,
frequency, phase andtime displacement ofthe
followingwaves:
(a) 2sin3t
(b) 4cos6t
(c) 0.7sin(2t +3)
(d) 0.1cos πt
(e) sin(50πt +20)
(f) 6cos(100πt −30)
( )
1
(g)
2 sin t
2
(h) 0.25cos(2πt +1)
8 Expressthe following in the form
Asin(ωt +φ),φ 0:
(a)6sin5t +5cos5t
(b)0.1sint −0.2cost
(c)7sin3t +6cos3t
( ) ( )
t t
(d)9cos −4sin
2 2
(e)3sin2t +15cos2t
9 Expressthe following in the form
Asin(ωt −φ),φ 0:
(a) 3sin4t +7cos4t
(b) 3cos2t −5sin2t
(c) 4sin6t −7cos6t
1
(d)
2 cost+2 3 sint
(e) 0.75sin(0.5t) −1.25cos(0.5t)
10 Expressthe following in the formAcos(ωt + φ),
φ0:
(a) 10sin3t +16cos3t
(b) −6sin2t −3cos2t
(c) sint−2cost
(d) 0.6cos4t +1.3sin4t
(e) cos7t −5sin7t
11 Express eachofthe following in the form
Acos(ωt −φ),φ 0:
(a) 2.3sin3t +6.4cos3t
(b) −sin2t −2cos2t
(c) 2cos9t +9sin9t
(d) 4sin4t
(
+5cos4t
) ( )
t t
(e) −6cos −2sin
2 2
12 Express eachofthe following in the form
Asin(ωt +φ),φ 0:
(a) sin(t +1) +cos(t +1)
(b) 2sin(2t +3) −3cos(2t +1)
(c) cos(3t −1) −2sin(3t +4)
(d) sin(t +1) +sin(t +3)
(e) cos(2t −1) +3cos(2t +3)
13 Reduceeachofthe followingexpressions to a single
wave andin each casestate the amplitudeandphase
angleofthe resultantwave:
(a) 2cosωt
(
+3sin
)
ωt
(b) cos
(c) 2sin
(d) 0.5sin
ωt + π 4
(
ωt + π 2
(
+sinωt
)
ωt − π 4
+4cos
)
(
+1.5sin
ωt + π 4
(
(e) 3sinωt +4sin(ωt +π)−2cos
)
ωt + π 4
(
)
ωt − π 2
14 Solve the following equations,stating allsolutions
between 0 and2π:
(a) sint = 0.5216
(b) sint = −0.3724
(c) cost = 0.9231
(d) cost = −1
(e) tant = 0.1437
(f) tant = −1
15 Solve the following equations,stating allsolutions
between 0 and2π:
(a) sin2t = 0.5421
(b) cos2t
(
=
)
−0.4687
t
(c) tan = −1.6235
2
(d) 2sin4t = 1.5
(e) 5cos2t =2
(f) 4tan2t =5
)
152 Chapter 3 The trigonometric functions
16 Avoltage, v(t),varieswith time,t,according to
v(t) = 240sin(100πt +30) t 0
Find the firsttime that v(t)has avalue of
(a) 240 (b) 0
(c) −240 (d) 100
17 Simplify asfar aspossible
(a) cos100 ◦ +cos80 ◦
(b) cos100 ◦ −cos80 ◦
sin50 ◦ +sin40 ◦
(c)
cos5 ◦
(d) sin80◦ −sin60 ◦
2sin10 ◦
Solutions
1 (a)
π
4
(b)
2π
5
(d) 5.2360 (e) 7.6794
(c) 1.7453
2 (a) 60 ◦ (b) 540 ◦ (c) 135 ◦
(d) 114.6 ◦ (e) 207.4 ◦
3 (a) 1st (b) 4th (c) 3rd
(d) 3rd (e) 2nd
4 (a) 1 (b) cosx (c) cosecA
(d) tanA (e) 1
5 (a) 2 (b) tan2A (c) tanx
(d) sin2t (e) tan 2 A
6 (a) sin2x (b) 2 (c) cot θ
( )
t
(d) sin4A (e) 2sin
2
7 (a) 2,3, 2π 3 , 3
2π ,0,0
(b) 4,6, π 3 , 3 π ,0,0
(c) 0.7,2, π, 1 π ,3, 3 2
(d) 0.1, π,2, 1 2 ,0,0
(e) 1,50π,0.04,25,20, 2
5π
(f) 6,100π,0.02,50, −30, − 3
10π
1
(g)
2 , 1 2 ,4π, 1
4π ,0,0
(h) 0.25,2π,1,1,1, 1
2π
√
8 (a) 61sin(5t +0.6947)
√
(b) 0.05sin(t +5.1760)
√
(c) 85sin(3t +0.7086)
(d)
(e)
9 (a)
( )
√ t 97sin
2 +1.9890
√
234sin(2t +1.3734)
√
58sin(4t −5.1173)
(b) √ 34sin(2t −3.6820)
√
(c) 65sin(6t −1.0517)
5
(d) sin(t −5.6397)
6
(e) 1.4577sin(0.5t −1.0304)
√
10 (a) 356cos(3t +5.7246)
(b) √ 45cos(2t +2.0344)
√
(c) 5cos(t +3.6052)
(d) √ 2.05cos(4t +5.1448)
√
(e) 26cos(7t +1.3734)
11 (a) 6.8007cos(3t −0.3450)
12 (a)
(b) √ 5cos(2t −3.6052)
√
(c) 85cos(9t −1.3521)
(d) √ 41cos(4t −0.6747)
( )
√ t
(e) 40cos
2 −3.4633
√
2sin(t +1.7854)
(b) 1.4451sin(2t +5.0987)
(c) 2.9725sin(3t +0.7628)
(d) 1.0806sin(t +2)
(e) 2.4654sin(2t +4.8828)
√ √
13 (a) 13sin(ωt +0.5880), amplitude = 13,phase
angle = 0.5880
(b) 0.765sin(ωt +1.1781),amplitude = 0.765,
phaseangle = 1.1781
(c) 5.596sin(ωt +2.10),amplitude = 5.596, phase
angle = 2.10
Review exercises 3 153
(d) 1.581sin(ωt +0.4636), amplitude = 1.581,
phaseangle = 0.4636
(e) −3sin ωt,amplitude = 3, phase angle = π
14 (a) 0.5487,2.5929 (b) 3.5232,5.9016
(c) 0.3947,5.8885
(e) 0.1427,3.2843
(d) 3.1416, i.e. π
3π
(f)
4 , 7π 4
15 (a) 0.2865,1.2843,3.4281,4.4259
(b) 1.0293,2.1123,4.1709,5.2539
(c) 4.2457
(d) 0.2120,0.5734,1.7828,2.1442,3.3536,3.7150,
4.9244,5.2858
(e) 0.5796,2.5620,3.7212,5.7035
(f) 0.4480,2.0188,3.5896,5.1604
16 (a) 9.5070 ×10 −3 (b) 4.5070 ×10 −3
(c) 1.9507 ×10 −2
17 (a) 0 (b) −2sin10 ◦
(c) √ 2
(d) cos70 ◦
(d) 5.8751 ×10 −3
4 Coordinatesystems
Contents 4.1 Introduction 154
4.2 Cartesiancoordinatesystem--twodimensions 154
4.3 Cartesiancoordinatesystem--threedimensions 157
4.4 Polarcoordinates 159
4.5 Somesimplepolarcurves 163
4.6 Cylindricalpolarcoordinates 166
4.7 Sphericalpolarcoordinates 170
Reviewexercises4 173
4.1 INTRODUCTION
The coordinates of a point describe its position. The most common coordinate system
is the x--y system: the first number, x, gives the distance along the x axis, the second
number, y, gives the distance along the y axis. However, this is not the only way to
describethepositionofapoint.Thischapteroutlinesseveralwaysinwhichtheposition
of a point can be described.
4.2 CARTESIANCOORDINATESYSTEM--TWODIMENSIONS
The Cartesian coordinate system is named after the French mathematician Descartes.
The system comprises two axes -- the x axis and the y axis -- which intersect at right
angles at the point O. The point O is called the origin. Figure 4.1 shows the Cartesian
coordinate system. By convention thexaxis is drawn horizontally. The positivexaxis
liestotherightoftheorigin,thenegativexaxisliestotheleftoftheorigin,thepositive
y axis liesabove the origin and the negativeyaxis lies below the origin.
Note that this coordinate system can be used only for locating points in a plane, that
isithas two dimensions.
Consider any point, P, in the plane. The horizontal distance of P from the y axis
is called the x coordinate. The vertical distance of P from the x axis is called the y
coordinate. Either coordinate can bepositive ornegative.
4.2 Cartesian coordinate system -- two dimensions 155
y
3
2
1
x coordinate
P
y coordinate
–4 –3 –2 –1 O 1 2 3 4
–1
x
–2
–3
Figure4.1
Cartesian coordinate system in
two dimensions.
When stating the coordinates of a point, by convention we always state thexcoordinate
first. Thus (3, 1) means that thexcoordinate is 3 and theycoordinate is 1. We
also write, for example, A(3, 1) to mean that the point whose coordinates are (3, 1) is
labelled A.InFigure 4.1 Phas coordinates (2, 3).
Example4.1 Plot the points whoseCartesiancoordinates are
(a) (4,2) (b) (−1,−3) (c) (−2,1)
Solution AsplottedinFigure 4.2
(a) Rhas coordinates (4,2).
(b) S has coordinates (−1,−3).
(c) Thas coordinates (−2,1).
Example4.2 Statethe coordinates ofthe points Aand B asshown inFigure 4.3.
Solution Ahas coordinates (−3,−1); Bhas coordinates (3,−2).
y
2
R
y
2
T
1
1
–4 –3 –2 –1 O 1 2 3 4
–1
–2
S
–3
x
–3 –2 –1 O 1 2 3
A
–1
–2
B
–3
x
Figure4.2
Figure4.3
156 Chapter 4 Coordinate systems
Engineeringapplication4.1
Electrodecoordinates
Often in science and engineering it is useful to place metallic conductors, known as
electrodes, inside a glass container from which air has been evacuated. For example,electricalvalves,
whicharealsoknown asvacuum tubes,areconstructed inthis
manner.Audioamplifiersthatmakeuseofvalvetechnologyhavemadeacomeback
inrecent years. Many musicians prefer the sound they generate.
Figure 4.4 shows two electrodes inside an evacuated glass envelope. State the
coordinates of the four points A,B, Cand D.
y
12
11
10
9
8
7
6
5
4
3
2
1
A
0 1 2 3 4 5 6 7 8 9 10 11 12
D
glass envelope
C
B
x
Figure4.4
Electrode coordinates.
Solution
A has coordinates (4,9).
B has coordinates (9,6).
C has coordinates (8,10).
D has coordinates (8,6).
Tosimulatetheelectricfieldbetweentheseelectrodes,andtoproduceaproperdesign
formanufacture,acomputermodelmaybeused.Suchamodelwouldinvolvetheuse
of mathematical techniques to solve the fields in the space between the electrodes.
However, the user would have to input the shape of the device. Coordinates would
be used to specify exactly where each part of the device is located. The coordinates
could easily be changed in the computer model to test the effects of changing the
sizeandpositionoftheelectrodes.Computermodelsspeedupthedesignprocessby
avoiding the need forearly-stage prototypes.
4.3 Cartesian coordinate system -- three dimensions 157
EXERCISES4.2
1 Plotthe following points:A(2,−2),B(−2,1),
C(−1,0),D(0, −2).
2 Statethe coordinatesofthe pointsU, V andWas
shown in Figure4.5.
3 A point Plies onthexaxis. State theycoordinate
ofP.
4 A point Qlies on theyaxis. Statethexcoordinate
ofQ.
y
y
2
2
1
V
B
1
C
–2 –1 O 1 2 3 4
–1
W
U –2
x
–2 –1 O
–1
–2
D
1
2
A
x
Figure4.5
FigureS.6
Solutions
1 FigureS.6 shows the pointsA, B,CandD.
2 U(−2, −2),V(4,1),W(3, −1)
3 0
4 0
4.3 CARTESIANCOORDINATESYSTEM--THREEDIMENSIONS
Manyengineeringproblemsrequiretheuseofthreedimensions.Figure4.6illustratesa
three-dimensional coordinate system. It comprises three axes,x,yandz. The axes are
all atright anglestoone another and intersectatthe origin, O.
The position of any point in three-dimensional space is given by specifying itsx,y
andzcoordinates.Byconventionthexcoordinateisstatedfirst,thentheycoordinateand
finally thezcoordinate. For example, P(2, 3, 4) has anxcoordinate of 2, aycoordinate
of 3 and azcoordinate of 4. It is illustrated in Figure 4.6. From the origin, P is located
by travelling 2 units in thexdirection, followed by 3 units in theydirection, followed
by 4 unitsinthezdirection.
Note that, aswith a two-dimensionalsystem,coordinates can benegative.
Example4.3 Plot the following points:A(1,−1,2), B(0,1,2), C(0,0,1).
Solution Figure 4.7 illustratesthe points A,Band C.
Wenowconsidertheequationofaplane.Anypointonthex--yplanehasazcoordinate
of 0. Hence the equation of thex--y plane isz = 0. Similarlyz = 1 represents a plane
parallel to the x--y plane but 1 unit above it. Point C in Figure 4.7 lies in the plane
z = 1.Allpoints inthis plane have azcoordinateof1.
158 Chapter 4 Coordinate systems
x
2
1
z
4
3
2
1
O
3
1
4
P (2, 3, 4)
2
2
3 y
Figure4.6
Cartesian coordinate systemin three
dimensions.
A
x
–1
1
z
2
1
C
1
B
2 y
Figure4.7
ThepointsA, Band Care
plotted in threedimensions.
AandBinFigure4.7lieintheplanez = 2.Allpointsinthisplanehaveazcoordinate
of 2.
EXERCISES4.3
1 Plotthe pointsA(2,0, −1),B(1, −1,1)and
C(−1,1,2).
2 Statethe equationofthe planepassingthrough
(4,7, −1),(3,0,−1) and(1,2, −1).
3 Statethe equationofthe planepassingthrough
(3,1,7),(−1,1,0)and(6,1, −3).
Solutions
1 FigureS.7 illustratesthe pointsA, B andC.
3 y=1
2 z=−1
B (1, –1, 1)
x
2
FigureS.7
–1
1
z
2
1
A (2, 0, –1)
–1
–1
1
C (–1, 1, 2)
2 y
y
O
u
r
x
P (x, y)
y
A
Figure4.8
Phas polar coordinatesrand θ.
x
4.4 Polar coordinates 159
4.4 POLARCOORDINATES
We have seen how the x and y coordinates of a point describe its location in the x--y
plane.Thereisanalternativewaytodescribethelocationofapoint.Figure4.8illustrates
a point Pinthex--yplane.
Phas Cartesian coordinates (x,y). Hence
OA=x,
AP=y
Consider the arm OP. The length of OP is the distance of P from the origin. We denote
thisbyr, that is
length of OP =r
Clearly,risnever negative, thatisr 0.
We note that the angle between the positivexaxis and OP is θ. The value of θ lies
between 0 and 2πradians or 0 ◦ to360 ◦ ifdegrees areused.
The values ofrand θ are known as the polar coordinates of P. Conventionally, the
valueofrisstatedfirst,thenthevalueof θ.Wecommonlywritethesepolarcoordinates
asr̸ θ.
Thevaluesofr and θ specifythepositionofapoint.Conventionally,positivevalues
of θ aremeasured anticlockwise from the positivexaxis.
Thepolarcoordinatesofapoint,P,arer̸ θ.ThevalueofristhedistanceofPfrom
the origin; the value of θ isthe angle between the positivexaxis and the armOP.
r0 0θ<2π(0 ◦ θ<360 ◦ )
Engineeringapplication4.2
Pickandplacerobot
Robotsarenowwidelyusedinfactoriesinordertoreducelabourcosts.Theyvaryin
complexity depending on their function. Almost all printed circuit boards found in
electronicdevicessuchascomputersandmobilesareassembledbyrobots.Figure4.9
shows a simple robot that can pick up a surface-mount electronic component in one
position and placeit inanotherposition.
It consists of a rotating arm the length of which can be extended and contracted.
On the end of the arm is a hand which can be closed to pick up a component and
opened toreleaseit.
r
zero datum
u
Figure4.9
A pickand place robot.
➔
160 Chapter 4 Coordinate systems
Itisnecessaryforacomputertocarryoutcalculationsinordertofindwhereacomponentislocatedandwheretoplaceacomponent.Decideuponasuitablecoordinate
systemtouse when carrying outthese calculations.
Solution
If we examine the geometry of the robot then we see that a polar coordinate system
wouldbethemostsuitable.Thecentreofthecoordinatesystemshouldbeontheaxis
ofrotation.Thelengthofthearmisthengivenbyr andtheorientationofthearmis
given by θ relative toan agreed zero datum mark.
Example4.4 Plot the points P, Qand R whose polar coordinates are
(a) 2,70 ◦
(b) 4,160 ◦
(c) 3,300 ◦
Solution Figure 4.10 shows the three points plotted.
y
y
P
Q
y
2
O
70°
x
4
O
160°
x
300°
O
x
3
(a) (b) (c)
R
Figure4.10
A point can be located bythe values ofits polar coordinates.
Consider the arm from the origin to the point. The value of r gives the length of this
arm. The value of θ gives the angle between the positivexaxis and the arm, measuring
anticlockwise from the positivexaxis.
Bystudying △OPA,shown inFigure 4.11, wecan see that
cosθ = x r
sinθ = y r
and sox =rcos θ (4.1)
and soy =rsin θ (4.2)
Hence if we know the values of r and θ, that is the polar coordinates of a point, we
can use Equations (4.1) and (4.2) to calculatexandy, the Cartesian coordinates of the
point.
4.4 Polar coordinates 161
y
O
u
r
x
Figure4.11
Thepolar coordinates arer, θ;the
Cartesian coordinates arex,y.
P
y
A
x
–3.4641
4
y
210°
–2
Figure4.12
The Cartesian coordinates canbe
calculated from the polar coordinates.
x
Example4.5 A point has polar coordinatesr = 4, θ = 210 ◦ . Calculate the Cartesian coordinates of
the point.Plot the point.
Solution TheCartesian coordinatesare given by
x =rcos θ = 4cos210 ◦ = −3.4641
y=rsinθ =4sin210 ◦ =−2
Figure 4.12 illustratesthe point.
We now look atthe problemofcalculatingthe polarcoordinates given the Cartesian
coordinates.Equations(4.1)and(4.2)canbearrangedsothatrand θ canbefoundfrom
the values ofxandy. Consider a typical pointPasshown inFigure 4.11.
The Cartesian coordinates are (x,y). Suppose that the values ofxandyare known.
The polar coordinates are r,θ; these values are unknown. By applying Pythagoras’s
theoremto △OPA wesee that
and so
r 2 =x 2 +y 2
r = √ x 2 +y 2
Note that since r is the distance from O to P it is always positive and so the positive
square rootistaken.
We now express θ in terms of the Cartesian coordinatesxandy. From Figure 4.11
wesee that
tanθ = y x
and hence
)
θ = tan −1 ( y
x
Insummary wehave
r = √ x 2 +y 2
θ = tan −1 ( y
x
)
( ) y
However, we need to exercise a little extra care before calculating tan −1 and
x
reading the resultfrom a calculator. As an illustrationnote that
tan40 ◦ = 0.8391 and tan220 ◦ = 0.8391
162 Chapter 4 Coordinate systems
and so tan −1 (0.8391) could be 40 ◦ or 220 ◦ . Similarly tan105 ◦ = −3.7321 and
tan285 ◦ = −3.7321 and so tan −1 (−3.7321)
( )
could be 105 ◦ or 285 ◦ . The value given
y
on your calculator when calculating tan −1 may not be the actual value of θ we
x
require. In order to clarify the situation it is always useful to sketch the Cartesian coordinates
and the angle θ before embarking on the calculation.
Example4.6 The Cartesian coordinates of P are (4, 7); those of Q are (−5,6). Calculate the polar
coordinates ofPand Q.
Solution Figure 4.13 illustratesthe situation forP.
Then
r = √ 4 2 +7 2 = √ 65 = 8.0623
Note from Figure 4.13 that P is in the first quadrant, that is θ lies between 0 ◦ and 90 ◦ .
Now
( ) ( ) y 7
tan −1 = tan −1
x 4
( ) 7
From a calculator, tan −1 = 60.26 ◦ . Since we know that θ lies between 0 ◦ and 90 ◦
4
then clearly 60.26 ◦ isthe required value.
The polar coordinates of Parer = 8.0623, θ = 60.26 ◦ .
Figure 4.14 illustratesthe situation forQ.
We have
r = √ (−5) 2 +6 2 = √ 61 = 7.8102
From Figure 4.14 wesee that θ lies between 90 ◦ and 180 ◦ . Now
( ) ( ) y 6
tan −1 = tan −1 = tan −1 (−1.2)
x −5
Acalculatorreturnsthevalueof −50.19 ◦ whichisclearlynottherequiredvalue.Recall
that tanθ isperiodic with period 180 ◦ . Hence the required angle is
180 ◦ + (−50.19 ◦ ) = 129.81 ◦
The polar coordinates of Qarer = 7.8102,θ = 129.81 ◦ .
y
P
y
r
7
Q 6
O
u
4
x
–5
O
u
x
Figure4.13
Pis in the firstquadrant.
Figure4.14
Qlies in the second quadrant.
4.5 Some simple polar curves 163
EXERCISES4.4
1 Given the polar coordinates, calculate the Cartesian
coordinatesofeach point.
(a) r=7,θ=36 ◦
(b) r = 10, θ = 101 ◦
(c) r = 15.7, θ = 3.7 radians
(d) r=1,θ= π 2 radians
2 Given the Cartesian coordinates, calculatethe polar
coordinatesofeachpoint.
(a) (7,11) (b) (−6,−12) (c) (0,15)
(d) (−4,6) (e) (4,0) (f) (−4,0)
Solutions
1 (a) 5.6631,4.1145 (b) −1.9081,9.8163
(c) 15, 90 ◦
(c) −13.3152, −8.3184 (d) 0, 1
(d) 7.2111,123.69 ◦
(e) 4,0 ◦
2 (a) r = 13.0384, θ = 57.53 ◦
(f) 4,180 ◦
4.5 SOMESIMPLEPOLARCURVES
UsingCartesiancoordinatestheequationy =mxdescribestheequationofalinepassing
through the origin. The equation of a line through the origin can also be stated using
polar coordinates. In addition, it is easy to state the equation of a circle using polar coordinates.
Equationofaline
Consider all points whose polar coordinates are of the formr̸ 45 ◦ . Note that the angle
θ is fixed at 45 ◦ but thatr, the distance from the origin, can vary. Asr increases, a line
at45 ◦ tothe positivexaxis istraced out.Figure 4.15 illustratesthis.
Thus, θ = 45 ◦ is the equation of a line starting at the origin, at 45 ◦ to the positive
x axis.
Ingeneral, θ = θ c
,where θ c
isafixedvalue,istheequationofalineinclinedat θ c
to
the positivexaxis, startingatthe origin.
Equationofacircle,centreontheorigin
Consider all points with polar coordinates 3̸ θ. Herer, the distance from the origin, is
fixed at 3 and θ can vary. As θ varies from 0 ◦ to 360 ◦ a circle, radius 3, centre on the
origin, issweptout.Figure 4.16 illustratesthis.
Ingeneralr =r c
wherer c
isafixedvalue,0 ◦ θ 360 ◦ describesacircleofradius
r c
, centred on the origin.
Example4.7 Draw the curve traced outbyr = 3,0 ◦ θ 180 ◦ .
Solution Hereris fixed at 3 and θ varies from 0 ◦ to 180 ◦ . As θ varies a semicircle is traced out.
Figure 4.17 illustratesthis.
AtP, θ =0 ◦ ,whileatQ, θ =180 ◦ .
164 Chapter 4 Coordinate systems
y
u = 45°
y
3
3
x
O
45°
Figure4.15
When θ isfixed andrvaries, a straight
line from the origin is traced out.
x
Figure4.16
Whenrisfixedand θ varies,acircleis
sweptout.
Q
y
3
O
Figure4.17
As θ varies from0 ◦ to 180 ◦ a
semicircle is traced out.
P
x
y
S
R
O
P
Q
Figure4.18
Surface forExample4.8.
x
Example4.8 Describethe surfacedefined by 1 r2,0 ◦ θ 90 ◦ .
Solution Herervariesfrom1to2and θ variesfrom0 ◦ to90 ◦ .Figure4.18illustratesthesurface
so formed.
AtP,r=1,θ=0 ◦ ;atQ,r=2,θ=0 ◦ ;atR,r=1,θ=90 ◦ ;atS,r=2,θ=90 ◦ .
We have seen some simple polar curves in Figures 4.16 and 4.17. In general a polar
curve isgiven by the equationr = f (θ),where the radiusrvaries with the angle θ.
Engineeringapplication4.3
Two-dimensionalantennaradiationpattern
Polarcurves areoftenusedtodepictradiation patternsfromantennas.Itisoftenthe
casethattheelectricfieldstrengthatafixeddistancefromanantennasuchasshown
inFigure 4.19 dependsupon the angle θ.
A typical expression for field strength at a particular angle θ could be
∣ cos( π cosθ)
2 ∣
∣.
sinθ
4.5 Some simple polar curves 165
u
Antenna
Figure4.19
Electric field strength atafixed distance from the
antenna depends upon the angle θ.
Byconsidering
cos(
r =
π cosθ)
2 ∣ sinθ ∣
we can use a polar curve to depict the field strength at any angle. Graph-plotting
softwareisavailableforproducingpolarplotssuchastheoneshowninFigure4.20,
which shows a typical angle, θ, and itsassociated radiusr.
The radius of the plot for a particular value of θ represents the transmitted field
strengthandsotheantennainFigure4.20hasnoradiationinthehorizontaldirection
and maximum radiation inthe vertical direction.
y
1
0.5
r
u
–0.5 0.5
x
–0.5
–1
Figure4.20
Thetransmitted field strength forthe antenna,
r,varies with the angularposition, θ.
EXERCISES4.5
1 Describethe curve defined byr = 2,0 ◦ θ 90 ◦ . 2 Describethe surfacedefined by0 r2,
30 ◦ θ 45 ◦ .
166 Chapter 4 Coordinate systems
Solutions
1 This isaquarter circle ofradius2asshown in
FigureS.8.
2 FigureS.9 illustratesthe surface. OPis setat30 ◦ to
thexaxis;OQis at45 ◦ to thexaxis. OP = OQ = 2.
y
y
Q
2
P
O
x
O
x
FigureS.8
FigureS.9
4.6 CYLINDRICALPOLARCOORDINATES
Consider the problem of studying the flow of water around a cylinder. A problem like
this would be studied by engineers when investigating the forces exerted by the sea on
the cylindrical supports of oil-rigs. It is often mathematically convenient to choose a
coordinate system that fits the shape of the object being described. It makes sense here
toselect a cylindricalcoordinate system.
Cylindrical polar coordinates comprise polar coordinates with the addition of a
vertical, or z, axis. Figure 4.21 illustrates a typical point, P, and its cylindrical polar
coordinates.
The point Q is in the x--y plane and lies directly below P. Q is the projection of P
onto thex--y plane.
Consider a point P in three-dimensional space, with Cartesian coordinates (x,y,z).
We can also describe the position of P using cylindrical polar coordinates. To do this,
the x and y coordinates are expressed as their equivalent polar coordinates, while the
z coordinate remains unaltered. Hence the cylindrical polar coordinates of a point have
theform (r,θ,z).
Recall thatris the length of the arm OQ (see Figure 4.21); that is, it is the distance
of a pointinthex--yplane from the origin, and sor 0.The angle θ is measured from
thepositivexaxistothearmOQandso θ hasvaluesbetween0 ◦ and360 ◦ or2πradians.
Finally, z is positive for points above the x--y plane and negative for points below the
x--yplane.Insummary
r0, 0θ<2π, −∞<z<∞
z
P
O
u
z
y
x
r
Q
Figure4.21
Thecylindrical polar coordinates ofPare (r,θ,z).
4.6 Cylindrical polar coordinates 167
We can relate the Cartesian coordinates, (x,y,z), to the cylindrical polar coordinates,
(r,θ,z). The following key point does this.
x=rcosθ r0
y=rsinθ 0θ <2π
z =z
Engineeringapplication4.4
Fluidflowalongapipe
Cylindrical polar coordinates provide a convenient framework for analysing liquid
flow down a pipe. The radial symmetry of a pipe makes it the natural choice. The
distance along the pipe is defined usingz. In order to utilize the radial symmetry of
thepipeitisnecessarytoalignthezaxiswiththecentreaxisofthepipe.Figure4.22
illustratesthearrangement.Distancefromthecentreofthepipeisdefinedbyr.The
angle θ isusedinconjunctionwithzandrtofixthepositionwithinthepipe.Atypical
problemthatmaybeanalysedisthevariationinfluidvelocitywithdistancefromthe
centreofthepipe.Forsmoothflow,liquidtendstotravelfasteratthecentreofapipe
than itdoes near the edge.
r
u
z
Figure4.22
Fluid flowalong apipe.
Pipes with metal walls are often used to guide electromagnetic waves, rather
than fluids, in high-powered microwave communications systems. They are termed
waveguides.Mathematicallyanalysingthewaveguide’spropagationmodesismade
much simplerby using cylindrical polar coordinates.
Example4.9 The Cartesian coordinates of P are (4,7,−6). State the cylindrical polar coordinates
ofP.
Solution We have
x=4, y=7, z=−6
Usingx = 4 andy = 7 the values ofrand θ are found to ber = 8.0623,θ = 60.26 ◦
(see Example 4.6). The z coordinate remains unchanged. Hence the cylindrical polar
coordinates are (8.0623,60.26 ◦ ,−6).
168 Chapter 4 Coordinate systems
Example4.10 Describe the figure defined by
(a) 1r2,θ=60 ◦ ,−1z1
(b)r=1,0 ◦ θ90 ◦ ,0z2
z
A
B
y
P
Q
x
P
C
Q
y
O
60°
x
D
Figure4.23
AtP,r=1;atQ,r=2.
Figure4.24
On the line AB,z = 1; onthe line CD,z = −1.
Solution (a) Consider therand θ coordinates first. Thercoordinate varies from 1 to 2 while θ
is fixed at 60 ◦ . This represents the line PQ as shown in Figure 4.23. At P the value
ofris 1; at Q the value ofris 2. The length of PQ is 1 and it is inclined at 60 ◦ to
thexaxis.
Now, we note thatzvaries from −1 to 1. We imagine the line PQ moving in the
z direction fromz = −1 toz = 1. This movement sweeps out a plane. Figure 4.24
illustratesthis.
(b) The r coordinate is fixed at r = 1. The θ coordinate varies from 0 ◦ to 90 ◦ . This
produces the quarter circle, AB, as shown in Figure 4.25. At A,r = 1, θ = 0 ◦ ; at
B,r=1,θ =90 ◦ .
z
D
y
C
B
B
A
y
O
A
x
x
Figure4.25
As θ varies from0 ◦ to 90 ◦ ,aquarter
circle is sweptout.
Figure4.26
Aszvaries from0to 2,the curveAB
sweepsoutthe curved surface.
4.6 Cylindrical polar coordinates 169
Examining the z coordinate, we see thatzvaries from 0 to 2. As z varies from
0 to 2, we imagine the curve AB sweeping out the curved surface as shown in Figure4.26.AtC,r
= 1, θ = 0 ◦ ,z = 2;atD,r = 1, θ = 90 ◦ ,z = 2.Thissurfaceis
partof a cylinder.
Iftherangeofvaluesofacoordinateisnotgivenitisunderstoodthatthatvariable
variesacrossallitspossiblevalues.Forexample,acurvemaybedescribedbyr = 1,
z = −2.Herethereisnomentionofthevaluesthat θ canhave.Itisassumedthat θ
can have its fullrange of values, thatis0 ◦ to360 ◦ .
Engineeringapplication4.5
Helicalantennas
The helix is a shape commonly found inengineering. For example, the springs used
inacar’ssuspensionoftenhave ahelicalshape.Helicalantennaswereinvented by
John Kraus in the 1940s and since then have been used extensively in a variety of
applications including space exploration, satellite communications and mobile telephony.
Developing a mathematical definition of a helix is essential to analysing its
electromagnetic properties.
Wecansetupacylindricalpolarcoordinatesystemwiththezaxisalignedwiththe
axis of the helix as shown in Figure 4.27. If we were to look at the helix along the
directionofthezaxis,allwewouldseewouldbeacircle.Wesaythattheprojectionof
the helix onto thex--y plane isacircle.
z
y
x
Figure4.27
Helix along thezaxis.
Suppose a particular helix can be defined parametrically by the Cartesian equations
x(t) =3cos2t, y(t) =3sin2t, z(t) =t
wheret is varied over a particular range in order to generate the finite length helix
required. By specifying a particular value oft these equations enable us to calculate
particular values ofx,yandzcorresponding toapoint on the helix.
➔
170 Chapter 4 Coordinate systems
We now develop the equation of the helix in cylindrical polar coordinates. By
comparing
x=3cos2t,y=3sin2t with x=rcosθ,y=rsinθ
(see Equations (4.1) and (4.2)), we haver = 3 and θ = 2t. Note thatr = 3 is the
equation of a circle, radius 3,centre the origin.
So, the projection of the helix onto thex--y plane is a circle of radius 3. Because
z(t) =t, the value ofzincreases as the parametert increases and the helix is traced
out.
Wecannowstateanalternativedefinitionofthehelixintermsofcylindricalpolar
coordinates:
r(t) =3, θ(t) =2t, z(t) =t
AsintheCartesiancase,specifyingavalueoftheparametert enablesustocalculate
particular values ofr, θ andzcorresponding to a point on the helix. This provides a
more elegant definition of the helix than that available using Cartesian coordinates.
Many problems require the use of these alternative coordinate systems in order to
simplifyanalysis.
EXERCISES4.6
1 Expressthe followingCartesian coordinates as
cylindrical polar coordinates.
(a) (−2,−1,4) (b) (0,3,−1) (c) (−4,5,0)
2 Expressthe followingcylindrical polar coordinates as
Cartesian coordinates.
(a) (3,70 ◦ ,7) (b) (1,200 ◦ ,6) (c) (5,180 ◦ ,0)
3 Describe the surface defined by
(a) z=0
(b) z=−1
(c) r=2,z=1
(d) θ =90 ◦ ,z=3
(e) r=2,0z4
Solutions
1 (a) ( √ 5,206.57 ◦ ,4)
(b) (3,90 ◦ ,−1)
(c) ( √ 41,128.66 ◦ ,0)
2 (a) (1.0261,2.8191,7)
(b) (−0.9397,−0.3420,6)
(c) (−5,0,0)
3 (a) thex--y plane
(b) aplaneparallelto thex--yplaneand1unit
below it
(c) acircle,radius2,parallelto thex--yplaneand
with centreat(0,0,1)
(d) aline 3 units above the positiveyaxisand
parallelto it
(e) thecurvedsurfaceofacylinder,radius2,height4
4.7 SPHERICALPOLARCOORDINATES
When problems involve spheres, for example modelling the flow of oil around a ball
bearing,itmaybeusefultousesphericalpolarcoordinates.Thepositionofapointis
given bythreecoordinates, (R,θ,φ). These areillustratedinFigure 4.28.
4.7 Spherical polar coordinates 171
z
z
P
f
R
O
x
u
y
x
y
Q
Figure4.28
Spherical polar coordinates are (R,θ, φ).
Consider a typical point, P. We look ateach of the three coordinates inturn.
The value ofRis the distance of the point from the origin; that is,Ris the length of
OP. Note thatR 0.
Let Q be the projection of P onto the x--y plane. Then θ is the angle between the
positive x axis and OQ. Thus, θ has the same definition as for polar and cylindrical
polar coordinates. Note that θ can have any value from0 ◦ to360 ◦ .
Consider the line OP. Then φ is the angle between the positive z axis and OP. The
angle φ can have values between 0 ◦ and 180 ◦ . When P is above thex--y plane, then φ
lies between 0 ◦ and 90 ◦ ; when P lies below the x--y plane, then φ is between 90 ◦ and
180 ◦ . When φ = 0 ◦ , then P is on the positivezaxis; when φ = 90 ◦ , P lies in thex--y
plane; when φ = 180 ◦ , Plieson the negativezaxis.
WecandetermineequationswhichrelatetheCartesiancoordinates, (x,y,z),andthe
spherical polar coordinates, (R,θ,φ).
Note that some books describe spherical polar coordinates as (R,φ,θ), that is the
definitions of θ and φ areinterchanged. Be aware of thiswhen reading other texts.
Consider △OPQ.Note that ̸ OQP isaright angle and so
OQ=OPsinφ=Rsinφ
OQ liesinthex--y plane and so
x=OQ cosθ =Rsinφcosθ
y=OQ sinθ =Rsinφsinθ
We also notethat
z=OPcosφ=Rcosφ
Insummarywe have
and
x=Rsinφcosθ
y=Rsinφsinθ
z=Rcosφ
R0, 0φ π(180 ◦ ) 0θ <2π(360 ◦ )
172 Chapter 4 Coordinate systems
Example4.11 Show that
R = √ x 2 +y 2 +z 2
Solution From Figure 4.28
x 2 +y 2 =OQ 2
From △OPQ
OP 2 = OQ 2 +PQ 2
ButOP=RandPQ=z,so
R 2 =x 2 +y 2 +z 2
and so
R = √ x 2 +y 2 +z 2
Example4.12 Describethe surfaceR = 4.
Solution We have R = 4 and θ and φ can vary across their full range of values. Such points
generate a sphereofradius 4,centredon the origin.
Engineeringapplication4.6
Three-dimensionalradiationpatternofahalf-wavedipole
Oneofthesimplesttypesofpracticalantennaisthehalf-wavedipole.Thisconsists
of two conductor elements stretched out along a straight line having a combined
lengthofapproximatelyhalfthewavelengthatthefrequencyofthea.c.signalthatis
tobetransmitted.Thesignalisappliedtotheantennaatthecentreofthearrangement
byafeedcable.Theelectricfieldstrengthproducedbytheantennaatafixeddistance
is usually expressed using a spherical coordinate system. The coordinates for the
antennaandtheoriginoftheradiationitselfareassumedtobelocatedattheantenna
feedpointandtheelectricfieldstrengthisrepresentedbytheradius,R.Plotsproduced
likethisareingeneraltermedradiationpatternsandareausefulwayofvisualizing
the amountofradiated field inagiven direction, (θ,φ), foraparticularantenna.
Thehalf-wave dipole patternisdescribedby the equation
( π
)
cos
R =K
2 cos φ sin φ
∣ ∣
whereRrepresentstheelectricfieldstrengthandK isaconstantforagivendistance
fromtheantennacentrepoint.Theequationforthissimpleantennadoesnotinvolve
θ,whichindicatesthatRdoesnotdependonit,hencethereisradialsymmetrytothe
pattern. This function isplotted inFigure 4.29.
Review exercises 4 173
z
z
Feeding lines
attached to
an a.c source
f
Antenna
elements
3D radiation
pattern
Figure4.29
The half-wave dipole antenna and itsradiation pattern in sphericalpolar coordinates.
EXERCISES4.7
1 Apoint has spherical polar coordinates (3,40 ◦ ,70 ◦ ).
Determine the Cartesian coordinates.
2 Apoint has Cartesian coordinates (1,2,3).Determine
the spherical polar coordinates.
3 Describe the surfaceR = 1,0 ◦ θ 360 ◦ ,
0 ◦ φ90 ◦ .
Solutions
1 (2.1595,1.8121, 1.0261)
3 A hemisphere ofradius1. Theflat sideis onthex--y
2 R = 3.7417, θ = 63.43 ◦ , φ = 36.70 ◦ plane.
TechnicalComputingExercises4.7
Useatechnical computing language such as
MATLAB ® to produce a plotsimilarto Engineering
application 4.3. You may findithelpful to firstgenerate
asetofnumbers0 t < 2π andthen use a builtin
functionsuch aspolarorpolarplotto generate a
graphofthe equation.
REVIEWEXERCISES4
1 Phas Cartesian coordinates (6, −3, −2).Calculate
the distance ofPfrom the origin.
2 Phas Cartesian coordinates (−4, −3).Calculate the
polar coordinates ofP.
3 Phas polar coordinates (5,240 ◦ ).Calculate the
Cartesiancoordinates ofP.
4 Describe the surface defined by
1r4,0 ◦ θ90 ◦ .
5 Calculate the cylindrical polar coordinates ofapoint
with Cartesian coordinates (−1,4,1).
6 Describe the surfacex = 0 in three dimensions.
174 Chapter 4 Coordinate systems
7 Apoint has Cartesian coordinates (−1,−1,2).
Calculate the sphericalpolar coordinatesofthe point.
8 Describe the surfaceR = 2,0 ◦ θ 180 ◦ ,
0 ◦ φ 180 ◦ .
9 Describe the three-dimensional surface defined
byx=y.
10 Thespheredefined byR = 2 intersects the plane
defined byz = 1. Describethe curve ofintersection.
Solutions
1 7
2 5, 216.87 ◦
3 −2.5, −4.3301
4 FigureS.10 illustratesthe surface.
y
5 (4.1231,104.04 ◦ ,1)
6 y--z plane
7 R = √ 6, θ = 225 ◦ , φ = 35.26 ◦
8 Ahemisphereofradius2.Theflat surfaceison the
x--z plane.
9 Thesurfaceis aplanegenerated bymoving the line
y =xup anddown thezaxis.
10 Acircle ofradius √ 3,centre (0,0,1),in the plane
z=1.
O 1 4
FigureS.10
x
5 Discretemathematics
Contents 5.1 Introduction 175
5.2 Settheory 175
5.3 Logic 183
5.4 Booleanalgebra 185
Reviewexercises5 197
5.1 INTRODUCTION
The term discrete is used to describe a growing number of modern branches of mathematics
involving topics such as set theory, logic, Boolean algebra, difference equations
and z transforms. These topics are particularly relevant to the needs of electrical and
electronic engineers. Set theory provides us with a language for precisely specifying a
great deal of mathematical work. In recent years this language has become particularly
important as more and more emphasis has been placed upon verification of software.
Boolean algebra finds its main use in the design of digital electronic circuits. Given
thataverylargeproportionofelectroniccircuitsaredigitalratherthananalogue,thisis
animportantareaofstudy.Digitalelectroniccircuitsconfinethemselvestotwoeffective
voltagelevelsratherthantherangeofvoltagelevelsusedbyanalogueelectroniccircuits.
Thesemakethemeasiertodesignandmanufactureastolerancesarenotsocritical.Digital
circuits are becoming more complex each year and one of the few ways of dealing
withthiscomplexityistousemathematics.Oneofthelikelytrendsforthefutureisthat
moreandmorecircuitdesignswillbeprovedtobecorrectusingmathematicsbeforethey
areimplemented.Differenceequationsandztransformsareofincreasingimportance
in fields such as digital control and digital signal processing. We shall deal with these
inChapter 22.
5.2 SETTHEORY
Asetisany collection of objects,things or states.
176 Chapter 5 Discrete mathematics
Theobjectsmaybenumbers,letters,daysoftheweek,or,infact,anythingunderdiscussion.Onewayofdescribingasetistolistthewholecollectionofmembersorelements
and enclose them inbraces { }.Consider the following examples.
A = {1,0}
B = {off, on}
C = {high, low}
D = {0,1,2,3,4,5,6,7,8,9}
the set ofbinary digits, one and zero
the set of possible statesof a two-statesystem
the set of effective voltage levels inadigital electronic
circuit
the set ofdigits used inthe decimal system
Notice that we usually use a capital letter to represent a set. To state that a particular
object belongs to a particular set we use the symbol ∈ which means ‘is a member of’.
So, forexample, wecan write
off∈B
3∈D
Likewise, /∈means ‘is notamember of’so that
low /∈B
5/∈A
are sensiblestatements.
Listingmembersofasetisfinewhentherearerelativelyfewbutisuselessifweare
dealingwithverylargesets.Clearly,wecouldnotpossiblywritedownallthemembers
of the set of whole numbers because there are an infinite number. To assist us special
symbols have been introduced tostand for some commonly used sets. These are
N the set of non-negative whole numbers, 0,1,2,3,...
N + the set of positive whole numbers, 1,2,3,...
Z the set of whole numbers, positive, negative and zero,
...−3,−2,−1,0,1,2,3...
R the set of all real numbers
R + the set of positive real numbers
R − the set of negative real numbers
Q the set of rational numbers
Note thatareal number isany number inthe interval (−∞,∞).
Another way of defining a set is to give a rule by which all members can be found.
Consider the following notation:
A={x:x∈Randx<2}
Thisreads‘Aisthesetofvalues ofxsuchthatxisamember ofthesetofrealnumbers
andxis less than 2’. ThusAcorresponds to the interval (−∞,2). Using this notation
other setscan be defined.
Note that
R + ={x:x∈Randx>0}
R − ={x:x∈Randx<0}
5.2 Set theory 177
Example5.1 Useset notation todescribe the intervals on thexaxis given by
(a) [0,2] (b) [0,2) (c) [−9,9]
Solution (a) {x :x∈Rand 0 x 2}
(b) {x:x∈Rand0 x < 2}
(c) {x:x∈Rand−9 x 9}
SometimesweshallbecontenttouseanEnglishdescriptionofasetofobjects,suchas
5.2.1 Equalsets
M is thesetof capacitors madebymachine M
N is thesetof capacitors madeby machineN
Q is thesetof faulty capacitors
Twosetsaresaidtobeequaliftheycontainexactlythesamemembers.Forexample,the
sets {9,5,2}and {5,9,2}areidentical.Theorderinwhichwewritedownthemembers
isimmaterial.The sets {2,2,5,9} and {2,5,9} are equal since repetition of elements is
ignored.
5.2.2 Venndiagrams
Venndiagramsprovideagraphicalwayofpicturingsetswhichoftenaidsunderstanding.
The sets are drawn as regions, usually circles, from which various properties can be
observed.
Example5.2 Suppose we are interested in discussing the set of positive whole numbers between 1
and10.LetA = {2,3,4,5}andB = {1,3,5,7,9}.TheVenndiagramrepresentingthese
sets is shown in Figure 5.1. The set containing all the numbers of interest is called the
universalset, E. Eisrepresentedbytherectangularregion.SetsAandBarerepresented
bytheinteriorsofthecirclesanditisevidentthat2,3,4and5aremembersofAwhile1,
3, 5, 7 and 9 are members ofB. It is also clear that 6 /∈A,6/∈B,8/∈A,8/∈ B. The
elements 3 and 5 are common toboth sets.
E
8
A
2
3
1
5
4 9
6
7
B
10
Figure5.1
Venn diagram forExample 5.2.
Theset containing all the members ofinterest iscalled the universal set E.
Itisusefultoaskwhethertwoormoresetshaveelementsincommon.Thisleadstothe
following definition.
178 Chapter 5 Discrete mathematics
5.2.3 Intersection
Given setsAandB, a new set which contains the elements common to bothAandBis
called theintersectionofAandB, written as
A∩B={x:x∈Aandx∈B}
In Example 5.2, we see thatA ∩B = {3,5}, that is 3 ∈A ∩B and 5 ∈A ∩B. If the set
A ∩BhasnoelementswesaythesetsAandBaredisjointandwriteA ∩B = ◦/,where
◦/denotes theempty set.
Aset with no elements iscalled an empty set and isdenoted by ◦/.
IfA∩B=◦/, thenAandBare disjoint sets.
5.2.4 Union
Given two setsAandB, the set which contains all the elements ofAand those ofBis
called theunionofAandB, written as
A∪B={x:x∈Aorx∈Borboth}
InExample5.2,A∪B = {1,2,3,4,5,7,9}.Wenotethatalthoughtheelements3and5
are common toboth setsthey are listedonly once.
5.2.5 Subsets
If all the members of a setAare also members of a setBwe sayAis a subset ofBand
writeA ⊂B. We have already metanumber ofsubsets.Convince yourself that
N⊂ZandZ⊂R
Example5.3 IfM representsthesetofallcapacitorsmanufacturedbymachineM,andM f
represents
the faulty capacitors made by machine M, then clearlyM f
⊂M.
5.2.6 Complement
If we are given a well-defined universal set E and a setAwithA ⊂ E, then the set of
membersof EthatarenotinAiscalledthecomplementofAandiswrittenasA.Clearly
A ∪A = E. There areno members inthe setA∩A, thatisA ∩A = ◦/.
Example5.4 A company has a number of machines which manufacture thyristors. We consider only
two machines, M and N. A small proportion made by each is faulty. Denoting the sets
offaultythyristorsmadebyMandNbyM f
andN f
,respectively,depictthissituationon
a Venn diagram.Describethe setsM f
∪N f
andM∪N.
Solution Let E be the universal set of all thyristors manufactured by the company. The Venn
diagram is shown in Figure 5.2. Note in particular that M ∩N = ◦/. There can be no
thyristorsintheintersectionsinceifathyristorismadebymachineMitcannotbemade
5.2 Set theory 179
E
M
N
M f
Nf
Figure5.2
Venn diagram forExample 5.4.
by machine N and vice versa. Thus M and N are disjoint sets. Also note M f
⊂ M,
N f
⊂N.ThesetM f
∪N f
isthesetoffaultythyristorsmanufacturedbyeithermachineM
or N.The setM∪N isthe set of thyristors made by machines other than M or N.
Wehaveseenhowtheoperations ∩, ∪canbeusedtodefinenewsets.Itisnotdifficult
to show that a number of laws hold, most of which are obvious from the inspection of
anappropriateVenn diagram.
5.2.7 Lawsofsetalgebra
For any setsA,B,C and a universal set E, we have the laws in Table 5.1. From these it
ispossible toprove the laws given inTable 5.2.
Table5.1
Thelaws ofset algebra.
}
A∪B=B∪A
Commutative laws
A∩B=B∩A
}
A∪(B∪C)=(A∪B)∪C
Associativelaws
A∩(B∩C)=(A∩B)∩C
}
A∩(B∪C)=(A∩B)∪(A∩C)
Distributivelaws
A∪(B∩C)=(A∪B)∩(A∪C)
}
A ∪◦/ =A
Identity laws
A∩E=A
⎫
A ∪A = E ⎪⎬
A ∩A = ◦/ Complement laws
A =A
⎪⎭
Table5.2
Laws derivable from Table 5.1.
}
A∪(A∩B)=A
Absorption laws
A∩(A∪B)=A
}
(A∩B)∪(A∩B)=A
Minimization laws
(A∪B)∩(A∪B)=A
}
A∪B=A ∩B
De Morgan’slaws
A∩B=A ∪B
5.2.8 Setsandfunctions
Ifwearegiventwosets,AandB,ausefulexerciseistoexaminerelationships,givenby
rules,betweentheelementsofAandtheelementsofB.Forexample,ifA = {0,1,4,9}
180 Chapter 5 Discrete mathematics
A
0
r : A — B
‘take plus or minus the square root of’
1
9
4
Figure5.3
Arelation between setsAandB.
2
0
1
3 –3
B
–1
–2
s : D — E
s : m — 3m + 1
D 1
0
1
2
3
4
5
4
7
10
13
16
19
22
Figure5.4
The relationsmaps elements ofDtoE.
E
andB = {−3,−2,−1,0,1,2,3}theneachelementofBisplusorminusthesquareroot
of some element ofA. We can depict thisas inFigure 5.3.
The rule, which, when given an element ofA, produces an element ofB, is called a
relation. If the ruleof the relation isgiven the symbolr wewrite
r:A→B
and say ‘the relation r maps elements of the set A to elements of the set B’. For the
example above, we can writer : 1 → ±1,r : 4 → ±2, and generallyr : x → ± √ x.
Thesetfromwhichwechooseourinputiscalledthedomain;thesettowhichwemap
is called the co-domain; the subset of the co-domain actually used is called the range.
As weshall see thisneed notbe the whole of the co-domain.
A relationrmaps elements of a setD, called the domain, to one or more elements
ofasetC, called the co-domain. We write
r:D→C
Example5.5 IfD = {0,1,2,3,4,5} andE = {1,4,7,10,13,16,19,22} and the relation with symbolsis
defined bys :D→E,s :m→3m +1, identify the domain and co-domain of
s. Draw a mapping diagramtoillustrate the relation.What isthe range ofs?
Solution Thedomainofsisthesetofvaluesfromwhichwechooseourinput,thatisD = {0,1,2,
3,4,5}.Theco-domainofsisthesettowhichwemap,thatisE = {1,4,7,10,13,16,
19,22}.Therules :m→3m+1enablesustodrawthemappingdiagram.Forexample,
s : 3 → 10 and so on. The diagram is shown in Figure 5.4. The range ofsis the subset
ofE actuallyused,inthiscase {1,4,7,10,13,16}. Wenotethatnotalltheelementsof
the co-domainareactuallyused.
The notation introduced is very similar to that for functions described in Section 2.3.
This is no accident. In fact, a function is a very special form of a relation. Let us recall
the definition ofafunction:
‘Afunctionis arulewhich whengivenaninputproducesasingleoutput’.
If we study the two relations r and s, we note that when relation r received input, it
couldproducetwooutputs.Onthemappingdiagramthisshowsupastwoarrowsleaving
5.2 Set theory 181
someelementsinA.Whenrelationsreceivedaninput,itproducedasingleoutput.This
shows up as a single arrow leaving each element in D. Hence the relation r is not a
function, whereas the relationsis. This leads to the following more rigorous definition
of a function.
A function f is a relation which maps each element of a set D, called the
domain, toasingleelement of a setC, called the co-domain. We write
f:D→C
Example5.6 IfM = {off, on},N = {0,1} and wedefine a relationr by
r:M→N
r:off→0
r:on→1
then the relationris a function since each element inM is mapped to a single element
inN.
Example5.7 IfP = {0,1} andQ = {high} and wedefine a relationr by
r:P→Q
r:1→high
thenrisnotafunction since each element inPisnot mapped toan element inQ.
All of the functions described in Chapter 2 have domains which are subsets of the real
numbers R.Theinputtoeachfunctionistheparticularvalueoftheindependentvariable
chosen from the domain and the output is the value of the dependent variable. When
dealing with continuous domains the graphs we have already considered replace the
mapping diagrams.
Example5.8 Find the domain,D, of the rational function f :D→Rgiven by
f:x→
3x
x −2
Solution Since no domain is given, we choose it to be the largest set possible. This is the set
of all real numbers except the value x = 2 at which point f is not defined. We have
D={x:x∈R,x≠2}.
EXERCISES5.2
1 Useset notationto describethe intervals onthexaxis
given by
(a) (−3,2) (b) [0,2] (c) [−2, −1)
(d) (3,6] (e) |x| < 1
2 Sketch the followingsets. [Hint:seeSection2.2 on
open andclosedintervals.]
(a) {x:x∈Rand2<x4}
(b) {x:x∈Rand−1x0}
182 Chapter 5 Discrete mathematics
(c) {x:x∈Rand0x<2}
(d) {x:x∈Rand1<x<3}
3 Using the definitions given in Section 5.2, state
whether each ofthe followingis trueorfalse:
(a) 7∈Z
(b) R − ∩Q=◦/
(c) 0.7 /∈Q (d) N + ∪Q=R +
(e) R − ∩N=◦/
(g) 5∈Q
(f) R∩Z=Z
(h) N⊂Q
4 IfA = {1,3,5,7,9,11} andB = {3,4,5,6}find
(a)A∩B
(b)A∪B
5 GivenA = {1,2,3,4,5,6},B = {2,4,6,8,10} and
C = {3,6,9} state the elements ofeach ofthe
following:
(a) A∩B
(c) A∩C
(e) A∩(B∪C)
(b)B∩C
(d)A∩B∩C
(f) B∪(A∩C)
6 Write outallthe members ofthe followingsets:
(a) A={x:x∈Nandx<10}
(b) B={x:x∈Rand0x10andxisdivisible
by3}
7 ThesetsA,BandC are given byA = {1,3,5,7,9},
B = {0,2,4,6} andC = {1,5,9} andthe universal
set,E={0,1,2,...,9}.
(a) Representthe sets onaVenn diagram.
(b) StateA ∪B.
(c) StateB ∩C.
(d) State E ∩C.
(e) StateA.
(f) StateB ∩C.
(g) StateB∪C.
8 UseVenn diagrams to illustrate the followingfor
general setsC andD:
(a)C∩D (b)C∪D (c)C∩D
(d) C∪D
(e)C∩D.
9 BydrawingVenn diagrams verifyDeMorgan’s laws
A∩B=A ∪ B and A∪B=A ∩B
10 ForsetsA = {0,1,2} andB = {3,4},draw amapping
diagramto illustrate the following relations.
Determine whichrelationsare functions.Forthose
thatare notfunctions,give reasonsforyourdecision.
(a) r:A→B,r:0→3,r:1→4,r:2→4
(b) s:A→B,s:0→3,s:0→4,s:1→3,
s:2→3
(c) t:A→B,t:0→3,t:1→4
11 IfA = {1,3,5,7} andB = {1,2,3,4},draw a
mapping diagramto illustrate the relationr :A→B,
whereristherelation‘isbiggerthan’.Israfunction?
Solutions
1 (a) {x:x∈Rand−3<x<2}
(b) {x:x∈Rand0x2}
(c) {x:x∈Rand−2x<−1}
(d) {x:x∈Rand3<x6}
(e) {x:x∈Rand−1<x<1}
2 SeeFigureS.11.
(a)
0 1 2 3 4 5 x (b) –1 0 1 x
(c)
0 1 2 3 x (d) 0 1 2 3 4 x
FigureS.11
3 (a)T (b)F (c)F (d)F
(e) T (f) T (g) T (h) T
4 (a) {3,5} (b) {1,3,4,5,6,7,9,11}
5 (a) {2,4,6} (b) {6} (c) {3,6} (d) {6}
(e) {2,3,4,6} (f) {2,3,4,6,8,10}
6 (a) {0,1,2,3,4,5,6,7,8,9}
(b) {0,3,6,9}
7 (a) SeeFigure S.12.
A
3
7
8
FigureS.12
1
5
9
B
C
0
2 4
6
(b) {0,1,2,3,4,5,6,7,9} (c) ◦/
(d) {1,5,9} (e) {0,2,4,6,8}
(f) {3,7,8} (g) {3,7,8}
5.3 Logic 183
C
D
C
D
(a)
(b)
C
D
C
D
(c)
(d)
(e)
C
D
FigureS.13
8 SeeFigure S.13.
10 (a)Function
(b)Not afunction since 0 is mapped to two elements
(c)Not afunction since 2 is notmapped to anything
11 r isnot afunction because 1 is notmapped to
anything.
5.3 LOGIC
In Section 5.4 we will examine Boolean algebra. This concerns itself with the manipulation
of logic statements and so is suitable for analysing digital logic circuits. In this
section we introduce the basic concepts of logic by means of logic gates as these form
the usual startingpoint for engineers studying this topic.
5.3.1 TheORgate
The OR gate is an electronic device which receives two inputs each in the form of a
binary digit, that is 0 or 1, and produces a binary digit as output, depending upon the
values of the two inputs.Itisrepresented by the symbolshown inFigure 5.5.
AandBarethetwoinputs,andF isthesingleoutput.Ashigh(1)orlow(0)voltages
are applied to A and B various possible outputs are achieved, these being defined by
means of a truth table as shown in Table 5.3. So, for example, if a low (0) voltage is
applied to A and a high (1) voltage is applied to B, the output is a high (1) voltage at
F. We note that a ‘1’ appears in the right-hand column of the truth table whenever A
orBtakes the value 1,hence the name OR gate. We use the symbol + torepresent OR.
BecauseitconnectsthevariablesAandB,ORisknownasalogicalconnective.Weshall
meet other logical connectives shortly. This connective is also known as adisjunction,
so thatA +Bissaidtobethe disjunction ofAandB.
Table5.3
The truth table for an OR gate
with inputsAandB.
A
B
Figure5.5
Symbol foran ORgate.
F = A + B
A B F=A+B
1 1 1
1 0 1
0 1 1
0 0 0
184 Chapter 5 Discrete mathematics
Table5.4
ThetruthtableforanANDgate
with inputsAandB.
A
B
Figure5.6
Symbol foran AND gate.
F = A . B
A B F=A·B
1 1 1
1 0 0
0 1 0
0 0 0
Table5.5
The truth table for
a NOTgate.
A
F = A
A
F=A
Figure5.7
Symbol foran inverter.
1 0
0 1
5.3.2 TheANDgate
It is possible to construct another electronic device called an AND gate which works
in a similar way except that the output only takes the value 1 when both inputs are 1.
ThesymbolforthisgateisshowninFigure5.6andthecompletetruthtableisshownin
Table 5.4. The logical connective AND is given the symbol · and is known as a conjunctionso
thatA·Bissaidtobethe conjunctionofAandB.
5.3.3 TheinverterorNOTgate
The inverter is a device with one input and one output and has the symbol shown in
Figure 5.7. It has a truth table defined by Table 5.5. If the input isA, then the output is
representedby the symbolA, known as the complement ofA.
5.3.4 TheNORgate
This gate is logically equivalent to a NOT gate in series with an OR gate as shown in
Figure 5.8. It is represented by the symbol shown in Figure 5.9 and has its truth table
defined inTable 5.6.
A
B
A + B
Figure5.8
A NOTgatein series with an OR gate.
A
B
Figure5.9
Symbol foraNOR gate.
F = A + B
F = A + B
Table5.6
ThetruthtableforaNORgate.
A B F =A +B
1 1 0
1 0 0
0 1 0
0 0 1
5.4 Boolean algebra 185
A
B
A . B
Figure5.10
ANOT gate in serieswith an AND gate.
A
B
Figure5.11
Symbol foraNAND gate.
F = A . B
F = A . B
Table5.7
Thetruthtable foraNAND gate.
A B F =A ·B
1 1 0
1 0 1
0 1 1
0 0 1
5.3.5 TheNANDgate
This gate is logically equivalent to a NOT gate in series with an AND gate as shown in
Figure5.10.ItisrepresentedbythesymbolshowninFigure5.11andhasthetruthtable
definedby Table 5.7.
Althoughwehaveonlyexaminedgateswithtwoinputsitispossibleforagatetohave
morethantwo.Forexample,theBooleanexpressionforafour-inputNANDgatewould
beF=A·B·C ·D while that of a four-input OR gate would beF = A +B+C +D,
whereA,B,C andDare the inputs, andF is the output. Logic gates form the building
blocks formore complicated digital electronic circuits.
5.4 BOOLEANALGEBRA
SupposeAandBarebinarydigits,thatis1or0.These,togetherwiththelogicalconnectives
+and · andalsothecomplementNOT,formwhatisknownasaBooleanalgebra.
The quantities A and B are known as Boolean variables. Expressions such as A +B,
A·BandAareknownasBooleanexpressions.MorecomplexBooleanexpressionscan
bebuiltupusingmoreBooleanvariablestogetherwithcombinationsof +, · andNOT;
forexample, wecan draw up a truth table forexpressions such as (A·B) + (C ·D).
Weshallrestrictourattentiontothelogicgatesdescribedinthelastsectionalthough
the techniques of Boolean algebra are more widely applicable. A Boolean variable can
only take the values 0 or 1. For our purposes a Boolean algebra is a set of Boolean
variables with the two operations · and +, together with the operation of taking the
complement, for which certainlaws hold.
5.4.1 LawsofBooleanalgebra
ForanyBooleanvariablesA,B,C,wehavethelawsinTable5.8.Fromtheseitispossible
toprovethelawsgiveninTable5.9.Youwillnoticethattheselawsareanalogoustothose
ofsetalgebraifweinterpret +as ∪, · as ∩,1astheuniversalset E,and0astheempty
set ◦/. In ordinary algebra, multiplication takes precedence over addition. In Boolean
algebra · takes precedence over +. So, for example, we can write the first absorption
lawwithout brackets, thatis
A+A·B=A
Similarly, the first minimization law becomes
A·B+A·B =A
We shall follow thisruleof precedence fromnow on.
186 Chapter 5 Discrete mathematics
Table5.8
Laws ofBoolean algebra.
}
A+B=B+A
Commutative laws
A·B=B·A
}
A+(B+C)=(A+B)+C
Associativelaws
A·(B·C)=(A·B)·C
}
A·(B+C)=(A·B)+(A·C)
Distributivelaws
A+(B·C)=(A+B)·(A+C)
}
A+0=A
Identity laws
A·1=A
⎫
A +A = 1⎪⎬
A ·A = 0 Complement laws
A =A
⎪⎭
Table5.9
Laws derived from the laws ofTable 5.8.
}
A+(A·B)=A
Absorption laws
A·(A+B)=A
}
(A·B)+(A·B)=A
Minimization laws
(A+B)·(A+B)=A
}
A+B=A ·B
DeMorgan’s laws
A·B=A +B
A+1=1
A·0=0
Example5.9 Findthe truth tableforthe BooleanexpressionA +B·C.
Solution WeconstructthetablebynotingthatA,BandC areBooleanvariables;thatis,theycan
takethevalues0or1.Thefirststageintheprocessistoformallpossiblecombinationsof
A,BandC,asshowninTable5.10.ThenwecompletethetablebyformingC,thenB·C
and finallyA +B·C, using the truth tables defined earlier. So, for example, whenever
C=1,C = 0. The complete process is shown in Table 5.11. Work through the table to
ensure you understand how itwas constructed.
Table5.10
The possible
combinations forthree
variables,A,BandC.
A B C
Table5.11
Thetruthtable forA +B·C.
A B C C B·C A+B·C
1 1 1
1 1 0
1 0 1
1 0 0
0 1 1
0 1 0
0 0 1
0 0 0
1 1 1 0 0 1
1 1 0 1 1 1
1 0 1 0 0 1
1 0 0 1 0 1
0 1 1 0 0 0
0 1 0 1 1 1
0 0 1 0 0 0
0 0 0 1 0 0
5.4 Boolean algebra 187
Table5.12
Truthtablefor (A +B)· (A +C).
A B C C A+B A+C (A+B)·(A+C)
1 1 1 0 1 1 1
1 1 0 1 1 1 1
1 0 1 0 1 1 1
1 0 0 1 1 1 1
0 1 1 0 1 0 0
0 1 0 1 1 1 1
0 0 1 0 0 0 0
0 0 0 1 0 1 0
5.4.2 Logicalequivalence
We know from the distributive laws ofBooleanalgebrathat
A+(B·C)=(A+B)·(A+C)
Let us construct the truth table for the r.h.s. of this expression (Table 5.12). If we now
observethefinalcolumnofTable5.12weseeitisthesameasthatofTable5.11.Wesay
thatA+(B·C)islogicallyequivalentto (A+B)· (A+C).Figures5.12and5.13show
thetwowaysinwhichtheselogicallyequivalentcircuitscouldbeconstructedusingOR
gates, AND gates and inverters. Clearly different electronic circuits can be constructed
toperform the same logical task. We shall shortlyexplore a wayof simplifyingcircuits
toreduce the number ofcomponents required.
A
B
C
C
B . C
A + (B . C)
Figure5.12
Circuit to implementA + (B ·C).
A
A + B
B
A
C
C
A + C
(A + B) . (A + C)
Figure5.13
Circuit to implement (A +B)· (A +C).
Engineeringapplication5.1
Booleanexpressionandtruthtableforanelectroniccircuit
Find the Boolean expression and truth table for the electronic circuit shown in Figure
5.14.
➔
188 Chapter 5 Discrete mathematics
Solution
BylabellingintermediatepointsinthecircuitweseethatX =A·B +C.Inorderto
obtain the truth table we form all possible combinations ofA,BandC, followed by
A·B,CandfinallyX =A·B+C.ThecompletecalculationisshowninTable5.13.
Table5.13
The truthtableforFigure5.14.
A B C A·B C X=A·B+C
A
B
C
Figure5.14
Circuit forEngineeringapplication 5.1.
X
1 1 1 1 0 1
1 1 0 1 1 1
1 0 1 0 0 0
1 0 0 0 1 1
0 1 1 0 0 0
0 1 0 0 1 1
0 0 1 0 0 0
0 0 0 0 1 1
Whatwewouldnowliketobeabletodoiscarryoutthereverseprocess:thatis,startwith
a truth table and find an appropriate Boolean expression so that the required electronic
device can beconstructed.
Example5.10 Given inputsA,BandC, find a Boolean expression forX as given by the truth table in
Table 5.14.
Solution To find an equivalent Boolean expression the procedure is as follows. Look down the
rows of the truth table and select those with an r.h.s. equal to 1. In this example, there
arefivesuchrows:1,2,4,6and8.Eachoftheserowsgivesrisetoatermintherequired
Booleanexpression.Eachtermisconstructedsothatithasavalue1fortheinputvalues
of that row. For example, for the input values of row 1, that is 1, 1, 1, we findA·B·C
has the value 1, whereas for the input values of row 2, that is 1, 1, 0, we findA·B·C
has the value 1. Carrying out this process for the other rows we find that the required
expression is
Table5.14
Thetruthtable fora
systemwithinputsA,B
andC andan outputX.
A B C X
1 1 1 1
1 1 0 1
1 0 1 0
1 0 0 1
0 1 1 0
0 1 0 1
0 0 1 0
0 0 0 1
X =A·B·C+A·B·C+A·B ·C +A·B·C +A ·B·C (5.1)
thatis,adisjunctionofterms,eachtermcorrespondingtooneoftheselectedrows.This
important expression is known as a disjunctive normal form (d.n.f.). We note that the
truthtableisthesameasthatofEngineeringapplication5.1whichhadBooleanexpression
(A·B)+C. The d.n.f. wehave justcalculated, whilecorrect,isnot the simplest.
Moregenerally,tofindtherequiredd.n.f.fromatruthtablewewritedownanexpression
ofthe form
( )+( )+···+( )
whereeachtermhasthevalue1fortheinputvaluesofthatrow.Wecouldnowconstruct
an electronic circuit corresponding to Equation (5.1) using a number of AND and OR
gatestogetherwithinverters,anditwoulddotherequiredjobinthesensethatthedesired
truth table would beachieved.
5.4 Boolean algebra 189
However, we know from Engineering application 5.1 that the much simpler expression
(A·B) +C has the same truth table and if a circuit were to be built corresponding
tothisitwouldrequirefewercomponents.Clearlywhatweneedisatechniqueforfinding
the simplest expression which does the desired job since the d.n.f. is not in general
the simplest. It is not obvious what is meant by ‘simplest expression’. In what follows
we shall be concerned with finding the simplest d.n.f. It is nevertheless possible that a
logically equivalent statement exists which would give a simpler circuit. Simplification
canbeachievedusingthelawsofBooleanalgebraasweshallseeinExample5.11and
Engineering application 5.3.
Engineeringapplication5.2
TheexclusiveORgate
WehavealreadylookedattheORgateinSection5.3.Thefullnameforthistypeof
ORgateistheinclusiveORgate.Itissocalledbecauseitgivesanoutputof1when
either or both inputs are 1. The exclusive OR gate only gives an output of 1 when
eitherbutnotbothinputsare1.ThetruthtableforthisgateisgiveninTable5.15and
its symbolisshown inFigure 5.15. Usingthe truthtable, the d.n.f. forthe gateis
F=A·B +A ·B
The exclusive OR often arises in the design of digital logic circuits. In fact, it is
possible tobuy integrated circuits thatcontain exclusive ORgates as basic units.
Table5.15
The truthtable foran
exclusive OR gate.
A B F
1 1 0
1 0 1
0 1 1
0 0 0
A
B
F = A . B + A . B
Figure5.15
Symbol foran exclusive OR gate.
Example5.11 UsethelawsofBooleanalgebragiveninTable5.8tosimplifythefollowingexpressions.
(a) A·B+A·B
(b) A+A ·B
(c) A+A ·B·C
(d) A·B·C+A·B·C+A·B·C+A·B ·C +A ·B·C
Solution (a) Using the distributive lawwe can write
A·B+A·B=A·(B+B)
Using the complement law,B +B = 1 and hence
A·B+A·B=A·1
=A using the identity law
190 Chapter 5 Discrete mathematics
HenceA·B+A·BsimplifiestoA.Notethatthisisthefirstminimizationlawgiven
inTable 5.9.
(b) A+A ·B=(A+A)·(A+B) by the distributive law
=1·(A+B)
by the complement law
=A +B using the identity law
(c) Note thatA +A·B·C can be written asA + (A·B)·C using the associative laws.
Then
A + (A·B)·C=(A+A·B)· (A +C) by the distributive law
=(A+B)· (A +C) usingpart(b)
=A +B ·C by the distributive law
(d) A·B·C+A·B·C+A·B·C+A·B ·C +A ·B·C
can be rearranged usingthe commutative lawtogive
A·B·C+A·B·C+A·B·C+A·B ·C +A ·B·C
This equals
A·B·(C+C)+A·B·(C+C) +A ·B·C by the distributive law
=A·B·1+A·B·1+A·B·C usingthe complement law
=A·B+A·B +A·B·C usingthe identitylaw
=A·(B+B) +A ·B·C usingthe distributive law
=A +A ·B·C usingthe complement and
identitylaws
Usingthe resultof part(c)this can be furthersimplified toA+B·C.
Engineeringapplication5.3
Designofabinaryfull-addercircuit
The binary adder circuit is a common type of digital logic circuit. For example, the
accumulator of a microprocessor is essentially a binary adder. The term full-adder
is used to describe a circuit which can add together two binary digits and also add
the carry-out digit from a previous stage. The outputs from the full-adder consist of
thesumvalueandthecarry-outvalue.Byconnectingtogetheraseriesoffull-adders
itispossibletoaddtogethertwobinarywords.(Abinarywordisagroupofbinary
digits, such as 0111 1010.) For example, adding together two 4-bit binary words
would requirefourfull-addercircuits.This isshown inFigure 5.16.
The inputs A0--A3 and B0--B3 hold the two binary words that are to be added.
TheoutputsS0--S3holdtheresultofcarryingouttheaddition.ThelinesC0--C3hold
the carry-out values from each of the stages. Sometimes there will be a carry-in to
stage 0 as a result of a previous calculation. Let us consider the design of stage 2 in
more detail. The design of the other stages will be identical. First of all a truth table
forthe circuitisderived. This isshown inTable 5.16.
5.4 Boolean algebra 191
S0
Stage
0
A0 B0
C0
S1
Stage
1
A1 B1
C1
S2
Stage
2
A2 B2
C2
S3
Stage
3
A3 B3
Figure5.16
Four full-addersconnected to allow two 4-bitbinarywordsto
be added.
C3
Table5.16
Truthtable forafull-adder.
C1 A2 B2 S2 C2
0 0 0 0 0
0 0 1 1 0
0 1 0 1 0
0 1 1 0 1
1 0 0 1 0
1 0 1 0 1
1 1 0 0 1
1 1 1 1 1
Notice that there are three inputs to the circuit,C1, A2 and B2. There are also
twooutputsfromthecircuit,S2andC2.Writingexpressionsfortheoutputsind.n.f.
yields
S2 =C1 ·A2·B2+C1·A2·B2+C1·A2 ·B2+C1·A2·B2
C2 =C1·A2·B2+C1·A2·B2+C1·A2·B2+C1·A2·B2
Itisimportanttoreducetheseexpressionstoassimpleaformaspossibleinorderto
minimize the number of electronic gates needed to implement the expressions. So,
startingwithS2,
Let
S2 =C1 ·A2·B2+C1·A2·B2+C1·A2 ·B2+C1·A2·B2
=C1·(A2·B2+A2 ·B2) +C1·(A2·B2 +A2·B2)
by the distributive law (5.2)
X=A2·B2 +A2·B2 (5.3)
Notice this isanequation for an exclusive OR gate with inputsA2 andB2.
UsingEquation (5.3) wehave
X =A2·B2 +A2·B2
= (A2·B2)·(A2·B2) by DeMorgan’s law
= (A2 +B2)·(A2 +B2) by DeMorgan’s law
= (A2+B2)·(A2+B2) by the complement law
=A2·A2+A2 ·B2+B2·A2+B2·B2
by the distributive law
= 0 +A2 ·B2 +B2·A2 +0 by the complement law
=A2 ·B2 +B2·A2 by the identity law
=A2·B2+A2 ·B2
Itisnow possible towrite Equation (5.2) as
by the commutative law
S2=C1·X +C1·X
➔
192 Chapter 5 Discrete mathematics
This is the expression for an exclusive OR gate with inputsC1 andX. It is therefore
possible to obtainS2 with two exclusive OR gates which are usually available
asabasicbuildingblockonanintegratedcircuit.ThecircuitforS2isshowninFigure
5.17. Turning toC2, wehave
C2 =C1·A2·B2+C1·A2·B2+C1·A2·B2+C1·A2·B2
=A2·B2·(C1+C1)+C1·(A2·B2+A2·B2)
=A2·B2 +C1·X
by the complement law
by the distributive law
sinceC1 +C1 = 1, whereX is given by Equation (5.3). The output,X, has already
been generated to produceS2 but can be used again provided the exclusive OR gate
can stand feeding two inputs. Assuming this is so then the final circuit for the fulladder
isshown inFigure 5.18.
A2
B2
X
C1
S2
Figure5.17
Circuit to implementS2 =C1·X +C1·X,
whereX =A2 ·B2 +A2 ·B2.
A2
B2
C1
X
S2
A2 . B2
C1 . X
C2
Figure5.18
Circuit to implement the stage 2 full-adder.
Engineeringapplication5.4
Realizationoflogicgates
Logic gates are usually constructed from one or more transistors. The most commontechnologyusedisCMOS(complementarymetal-oxidesemiconductor)logic,
which was invented by Frank Wanlass and was perfected whilst he was working
at Fairchild Semiconductor in the 1960s. CMOS logic makes use of two different
typesoftransistorknownasPMOS(p-typemetal-oxidesemiconductor)andNMOS
(n-type metal-oxide semiconductor) transistors. Both types can be readily manufactured
in vast numbers on a single silicon wafer along with their associated wiring.
Thisprincipleistheconstructionalbasisofmodernmicroprocessorswhichmaycontainbillionsofindividualtransistors.Thetermvery-large-scaleintegration(VLSI)
was coined to refer to the practice of assembling such a large number of transistor
devices onasinglesilicon wafer.
Figure5.19showstheconstructionofasingleNANDgatemadefromitsindividual
transistors.
5.4 Boolean algebra 193
V DD
PMOS
Q1
D
D
PMOS
Q2
A
G
S
D
S
G
Y
G
S
D
NMOS
Q3
B
G
S
NMOS
Q4
0 V
Figure5.19
Internal construction ofasingle
CMOS NAND gate.
ThediagramshowstwoPMOStransistors,labelledQ1andQ2,connectedinparallel.
These are connected in series with two NMOS transistors, Q3 and Q4. There
arefoursourceterminalsalllabelledS,togetherwithfourdrainterminalslabelledD.
ThevoltageV DD
isthepositivevoltagesupplywhichisconnectedtothedrainonthe
fieldeffecttransistors.ThelabellingV DD
isaconventionoftenadoptedinthistypeof
circuit.Logiclevelsinacircuitlikethisarerepresentedbytakingalowvoltage,close
to 0V, to be a logic 0 and a high voltage, close toV DD
, to be a logic 1. The PMOS
transistors Q1 and Q2 each carry current between their source and drain terminals
only when a low voltage (logic 0) is connected at their gate terminal (labelled G).
The NMOS transistors Q3 and Q4 are a complementary type where current flows,
whichonlyoccurswhenahighvoltage,correspondingtologic1,ispresentedattheir
gates.ThuswhenbothAandBareatlogic0,Q1andQ2areswitchedonandQ3and
Q4 are switched off, hence the output isV DD
, which represents logic 1. This output
is still the same if either A or B, but not both, are at logic 1 because although Q3 or
Q4 will be turned on they are connected in series and individually have no effect. If
both A and B are at logic 1, then Q1 and Q2 are switched off, and Q3 and Q4 are
switched on, hence the output will be approximately 0V, which represents logic 0.
This behaviour isconsistentwith the truthtable given inTable 5.7.
All modern VLSI chips are designed using high-level design languages such as
VHDL(aspecializedcomputerlanguageforhardware)andthetransistordesignand
layoutisfullyautomated.Itisnowrarelynecessaryforthemicroprocessordesigner
toconsider individual transistors or even individual gates.
EXERCISES5.4
1 WriteBoolean expressionsforthe output,F,from the
electronic devices shown in Figure 5.20.
2 WriteBoolean expressionsforthe output from the
devices shown in Figure5.21.
3 Design electronic devices which produce the
following outputs:
(a) A +B (b) A·(B·C)
(d) A +B
(e)A·B+A ·B+B·C
(c) (C+D)·(A+B)
194 Chapter 5 Discrete mathematics
(a)
A
B
C
F
(b)
A
B
F
(c)
A
B
F
C
(d)
A
F
B
Figure5.20
(a)
A
F
B
(b)
A
B
F
C
Figure5.21
4 Draw upthe truthtablesforthe expressions given in
Question 3.
5 Usetruthtables to verifythat the following pairsof
expressions are logically equivalent:
(a) p+p·q·r+p·qandp+q
(b) (A +B)·(A+C)andA+B ·C
(c) p·q·r+p·q·r+p·q·randp·(r+q)
6 Simplify the followingBoolean expressions usingthe
laws ofBoolean algebra:
(a) A·A·A
(b) A·A·A
(c) A·A ·A
(d) (A+A)·(A+A)
(e) A +0
(f) (A+1)·(A+1)
(g) A +1
7 Simplify the following Boolean expressions usingthe
lawsofBoolean algebra:
(a) (A+A)·(B+B)
(b) A·(A+B+A·B)
(c) (A+A)·(A+C)
5.4 Boolean algebra 195
(d) A ·B·(A+B)
(e) A·(A +B)·B
(f) A·B·C+A·B·C
(g) A·B·C+A +B+C
8 Constructatruthtable showingA·BandA +B in
order to verifythe logicalequivalence expressed in
DeMorgan’s lawA·B=A +B.Carryout asimilar
exercise to verifyA+B=A·B.
9 LetB = 1 andthenB = 0 in the absorption laws,and
use the identitylaws to obtain (a)A +A =Aand
(b)A ·A =A.Verify your resultsusingtruthtables.
10 Derive Booleanexpressions andtruthtablesforthe
circuits shown in Figure5.22.
11 Simplify the following Booleanexpressions using
Booleanalgebra:
(a) A·B+A·B+B·C+A·B·C
(b) A·(C+A)+C·B+D+C+B·C+C·A
(c) A·B·C·D+A·B·C+A·B·C·D +
A ·B·C·D+A·B·C ·D
12 Thetruthvalues ofthe Booleanexpression,X,are
given in the followingtables. WriteX in disjunctive
A
B
C
D
E
A
B
(a)
X
normal form.Usethe laws ofBooleanalgebra to
simplifyyourexpressions.
(a)
(b)
A B X A B X
(c)
(d)
0 0 1
0 1 0
1 0 1
1 1 1
A B C X
0 0 0 1
0 0 1 1
0 1 0 0
0 1 1 0
1 0 0 0
1 0 1 0
1 1 0 1
1 1 1 1
A B C X
0 0 0 0
0 0 1 0
0 1 0 0
0 1 1 1
1 0 0 0
1 0 1 1
1 1 0 1
1 1 1 0
0 0 1
0 1 1
1 0 0
1 1 0
13 ExpressA·B +A·B usingonly conjunction (AND
gate) and negation (NOTgate).
C
D
X
E
F
Figure5.22
(b)
Solutions
1 (a) (A+B)·C (b) (A·B)·B
(c) (A·B)+B+C (d) A +B
2 (a) A ·B +A ·B which isthe same asA·B+A·B
(b) (A·B+A·B)·(B+C)+(A·B +A·B)·(B+C)
3 SeeFigureS.14.
196 Chapter 5 Discrete mathematics
A
B
A
B
C
(a)
C
D
B
A
A
B
(c)
A
B
C
(e)
(b)
FigureS.14
(d)
4 (a)
A B A +B
1 1 0
1 0 0
0 1 1
0 0 0
(d)
A B A +B
1 1 1
1 0 0
0 1 0
0 0 0
(b)
A B C A·(B·C)
1 1 1 0
1 1 0 1
1 0 1 1
1 0 0 1
0 1 1 1
0 1 0 1
0 0 1 1
0 0 0 1
(e)
A B C A·B+A ·B+B·C
1 1 1 1
1 1 0 1
1 0 1 0
1 0 0 0
0 1 1 0
0 1 0 1
0 0 1 1
0 0 0 1
(c)
A B C D (C+D)·(A+B)
1 1 1 1 1
1 1 1 0 1
1 1 0 1 1
1 1 0 0 0
1 0 1 1 1
1 0 1 0 1
1 0 0 1 1
1 0 0 0 0
0 1 1 1 0
0 1 1 0 0
0 1 0 1 0
0 1 0 0 0
0 0 1 1 1
0 0 1 0 1
0 0 0 1 1
0 0 0 0 0
6 (a)A (b)0 (c)0 (d)A
(e) A (f)1 (g)0
7 (a) A·B (b) A (c) A
(d) 0 (e) 0 (f) A·B
(g) A·B·C
10 (a) X =(A·B)·(C+D+E)
(b) X=A·B +A·B+C·D+E+F
11 (a) A+B·C
(b) A+B+C+D
(c) (A·B)+(A·B ·D) which can be further
simplifiedtoA · (B +D)
Review exercises 5 197
12 (a) A·B+A·B+A·B
(b) A·B +A·B
(c) A ·B·C +A ·B·C+A·B·C+A·B·C
(d) A·B·C+A·B·C+A·B·C
13 (A ·B)·(A·B)
REVIEWEXERCISES5
1 Classifythe followingastrue orfalse:
(a) R + ⊂ R
(c) 0.667∈Q
(e) −6∈Q
(g) N∩Q=N
(b) 0.667 ∈ R −
(d) N∪Z=R
(f)9/∈ R
(h) R − ∩Q=◦/
2 Useset notationto describethe followingintervals on
thexaxis.
(a) [−6,9] (b) (−1,1) (c) |x| < 1.7
(d) (0,2] (e) |x| > 1 (f) |x| 2
3 ThesetsA,BandC are given by
A = {1,2,6,7,10,11,12,13},B = {3,4,7,8,11},
C = {4,5,6,7,9,13} andthe universal set
E = {x :x∈N + ,1 x13}.Listthe elements of
the following sets:
(a) A∪B (b)B∪C (c)A∩B
(d) A∩(B∪C) (e)A∩B∩C
(f) A∪(B∩C) (g)C∪(B∩A)
4 Representthe setsA,BandC describedin Question3
onaVenn diagram.
5 List allthe elementsofthe followingsets:
(a) S={n:n∈Z,5n 2 50}
(b) S={m:m∈N,5m 2 50}
(c) S={m:m∈N,m 2 +2m−15=0}
6 IfA={n:n∈Z,−10n20},
B={m:m∈N,m>15}listthemembersofA∩B
andwrite down an expression forA ∪B.
7 WriteBooleanexpressions forthe output,F,from the
electronic devices shown in Figure5.23.
8 Simplify the followingBoolean expressions usingthe
laws ofBooleanalgebra:
(a) A·B·1
(c) 1+0
(e) (A+C)·(C+A)
(f) A·B+A +B (g) A·B
(h) A +B
(b) A+A·B+B
(d) D·(C+B)+C·D
(i) A·(B+C)·C
(j) (C+A+D)·(C·D+A)
9 Draw uptruthtables to verifythatA·B +B and
A ·B +A are logically equivalent.
A
B
F
(a)
C
A
B
F
(b)
(c)
A
B
C
D
Figure5.23
F
198 Chapter 5 Discrete mathematics
10 ExpressA +B+C +Dusingthe Boolean
connectives AND ( · )and negation.
11 Simplify the followingexpressions usingthe laws of
Boolean algebra:
(a) (A +A)·(B+B)
(b) A·B·A
(c) (A+B)·(A+B) (d) A ·A
(e) A·B·C·D·1·0
(g) B·C·1
(f) A·B·C·B
12 Reducethe following expressions usingBoolean
algebra:
(a) A+C+A·B·(B+C)
(b) (A+B)·B·C+(A+C)·(B+A·C)
(c) (A +B)·(A +C)
13 Thetruthtables ofthe Boolean expression,X,are
given in the followingtables. Writethe disjunctive
normal formofX in eachcase.
(a)
A B X
0 0 0
0 1 1
1 0 0
1 1 0
(b)
(c)
(d)
A B X
0 0 0
0 1 0
1 0 1
1 1 0
A B C X
0 0 0 1
0 0 1 0
0 1 0 0
0 1 1 1
1 0 0 0
1 0 1 1
1 1 0 0
1 1 1 1
A B C X
0 0 0 0
0 0 1 1
0 1 0 0
0 1 1 1
1 0 0 1
1 0 1 1
1 1 0 0
1 1 1 0
Solutions
1 (a)T (b)F (c)T (d)F
(e) T (f) F (g) T (h) F
2 (a) {x:x∈R,−6x9}
(b) {x:x∈R,−1<x<1}
(c) {x:x∈R,−1.7<x<1.7}
(d) {x:x∈R,0<x2}
(e) {x:x∈R,x>1orx<−1}
(f) {x:x∈R,x2orx−2}
3 (a) {1,2,3,4,6,7,8,10,11,12,13}
(b) {3,4,5,6,7,8,9,11,13}
(c) {7,11}
(d) {6,7,11,13}
(e) {7}
(f) {1,2,4,6,7,10,11,12,13}
(g) {4,5,6,7,9,11,13}
4 SeeFigureS.15.
A
1
2
11 3
10
12 6 7 13 4
FigureS.15
9
5
5 (a) S = {−7, −6, −5, −4, −3,3,4,5,6,7}
(b) S = {3,4,5,6,7}
(c) S = {3}
6 A ∩B = {16,17,18,19,20}
A∪B={m:m∈Z,m−10}
C
8
B
Review exercises 5 199
7 (a) (A·B)·C + (A·B)·C
(b) (A+B)·B
(c) (A·B)+(C+D)
8 (a) A·B (b) A+B (c) 1
(d) D·(C+B) (e) C (f) A ·B
(g) A+B (h) A·B (i) A ·C
(j) A
10 A·B·C ·D
11 (a) 1 (b) 0 (c) A+B (d) A
(e) 0 (f) 0 (g) B·C
12 (a) A +B ·C
(b) A·B+A·C+B·C
(c) A·(B+C)
13 (a) A ·B
(b) A·B
(c) A ·B·C +A·B·C+A·B·C+A·B·C
(d) A ·B·C+A·B·C+A·B ·C+A·B·C
6 Sequencesandseries
Contents 6.1 Introduction 200
6.2 Sequences 201
6.3 Series 209
6.4 Thebinomialtheorem 214
6.5 Powerseries 218
6.6 Sequencesarisingfromtheiterativesolutionofnon-linearequations 219
Reviewexercises6 222
6.1 INTRODUCTION
Muchofthematerialinthischapterisofafundamentalnatureandisapplicabletomany
differentareasofengineering.Forexample,ifcontinuoussignalsorwaveforms,suchas
those described in Chapter 2, are sampled at periodic intervals we obtain a sequence of
measuredvalues.Sequencesalsoarisewhenweattempttoobtainapproximatesolutions
of equations which model physical phenomena. Such approximations are necessary if
a solution is to be obtained using a digital computer. For many problems of practical
interest to engineers a computer solution is the only possibility. The z transform is an
exampleofaninfiniteserieswhichisparticularlyimportantinthefieldofdigitalsignal
processing. Signal processing is concerned with modifying signals in order to improve
them in some way. For example, the signals received from space satellites have to undergo
extensive processing in order to counteract the effects of noise, and to filter out
unwanted frequencies, before they can provide, say, acceptable visual images. Digital
signal processing is signal processing carried out using a computer. So, skill in manipulating
sequences and series is crucial. Later chapters will develop these concepts and
show examples of their use insolving real engineering problems.
6.2 Sequences 201
6.2 SEQUENCES
Asequenceisasetofnumbersorterms,notnecessarilydistinct,writtendownina
definite order.
For example,
1,3,5,7,9 and 1, 1 2 , 1 4 , 1 8 , 1
16
are both sequences. Sometimes we use the notation ‘...’ to indicate that the sequence
continues. For example, the sequence 1,2,3,...,20 is the sequence of integers from 1
to 20 inclusive. These sequences have a finite number of terms but we shall frequently
dealwithonesinvolvinganinfinitenumber ofterms.Toindicatethatasequencemight
go on for ever we can use the ... notation. Thus
and
2,4,6,8,...
1,−1,1,−1,...
can be assumed tocontinue indefinitely.
Ingeneral situations we shall writeasequence as
x[1],x[2],x[3],...
or more compactly,
x[k]
k=1,2,3,...
An alternative notation is
x 1
,x 2
,x 3
,...
Theformernotationisusuallyusedinsignalprocessingwherethetermsinthesequence
representthevalues ofthesignal.Thelatternotationarisesinthenumerical solutionof
equations. Hence both forms will be required. Often x[1] will be the first term of the
sequence although thisisnotalways the case.The sequence
...,x[−3],x[−2],x[−1],x[0],x[1],x[2],x[3],...
isusually writtenas
x[k]
k=...,−3,−2,−1,0,1,2,3,...
Acompletesequence,asopposedtoaspecifictermofasequence,isoftenwrittenusing
braces,for example
{x[k]} =x[1],x[2],...
although it is common to writex[k] for both the complete sequence and a general term
inthesequencewhenthereisnoconfusion,andthisistheconventionweshalladoptin
thisbook.
Asequencecanalsoberegardedasafunctionwhosedomainisasubsetofthesetof
integers.For example, the function defined by
x:N→R
x:k→ 3k
2
202 Chapter 6 Sequences and series
isthe sequence
x[0] = 0 x[1] = 3 2
x[2] = 3 x[3] = 9 2 ...
The values in the range of the function are the terms of the sequence. The independent
variableisk.FunctionsofthissortdifferfromthosedescribedinChapter2becausethe
independent variable is not selected from a continuous interval but rather isdiscrete. It
is,nevertheless,possibletorepresentx[k]graphicallyasillustratedinExamples6.1--6.3,
butinsteadofapiecewisecontinuouscurve,wenowhaveacollectionofisolatedpoints.
Example6.1 Graphthe sequences given by
{ 0 k <0
(a) x[k] = k=...,−3,−2,−1,0,1,2,...,thatisk∈Z
1 k0
{ 1 keven
(b) x[k] = k ∈ Z
−1 kodd
Solution (a) Fromthedefinitionofthissequence,thetermx[k]iszeroifk<0and1ifk 0.The
graph is obtained by plotting the terms of the sequence againstk(see Figure 6.1).
This sequenceisknown asthe unit stepsequence. We shall denotethisbyu[k].
(b) Thesequencex[k] isshown inFigure 6.2.
Example6.2 Graphthe sequencedefinedby
x[k] =
{ 1 k = 0
0 k≠0
k=...,−3,−2,−1,0,1,2,3,...
Solution From the definition, ifk = 0 thenx[k] = 1. Ifkis not equal to zero the corresponding
term in the sequence equals zero. Figure 6.3 shows the graph of this sequence which is
commonly called theKronecker delta sequence.
Example6.3 Thesequencex[k]isobtainedbymeasuringorsamplingthecontinuousfunction f (t) =
sint,t ∈ R, att = −2π, −3π/2, −π, −π/2, 0, π/2, π, 3π/2 and 2π. Write down the
termsofthissequenceand show themon a graph.
x[k]
1
x[k]
–4 –3 –2 –1 1
1
2 3 4 5
k
–4 –3 –2 –1 1
2 3 4 5
k
–1
Figure6.1
Theunitstepsequence.
Figure6.2 { 1 k even
Thesequencex[k] =
−1 k odd.
6.2 Sequences 203
sin t
1
x[k]
1
– 3p
–2p –2 –p – p –2 p
0 –2 p
3p –2
2p
t
–4 –3 –2 –1 1
Figure6.3
TheKronecker delta sequence.
2 3 4 5
k
–1
Figure6.4
Thefunction f (t) = sint with sampled points
shown.
x[k]
1
–4 –3 –2 –1 1 2 3 4 k Figure6.5
Sequence formed from sampling
–1
f(t) =sint.
Solution The function f (t) = sint, for −2π t 2π, is shown in Figure 6.4. We sample the
continuousfunctionattherequiredpoints.Thesamplevaluesareshownas •.Fromthe
graph we see that
x[k]=0,1,0,−1,0,1,0,−1,0
The graph ofx[k] isshown inFigure 6.5.
k=−4,−3,...,3,4
Sometimes it is possible to describe a sequence by a rule giving the kth term. For example,
the sequence for whichx[k] = 2 k ,k = 0,1,2,..., is given by 1,2,4,8.... On
occasions,arulegivesx[k]intermsofearliermembersofthesequence.Forexample,the
previoussequencecouldhavebeendefinedbyx[k] = 2x[k−1],x[0] = 1.Thesequence
isthensaidtobedefinedrecursivelyandthedefiningformulaiscalledarecurrencerelationordifferenceequation.Differenceequationsareparticularlyimportantindigital
signalprocessingand are dealtwith inChapter22.
Example6.4 Write down the termsx[k] fork = 0,...,7 ofthe sequencedefined recursively as
x[k] =x[k −2] +x[k −1]
wherex[0] = 1 andx[1] = 1.
Solution Thevalues ofx[0]andx[1]aregiven. Usingthe given recurrencerelationwefind
x[2] =x[0] +x[1] = 2
x[3] =x[1] +x[2] = 3
Continuing inthisfashion wefind the first eight termsofthe sequenceare
1,1,2,3,5,8,13,21
This sequenceis known asthe Fibonaccisequence.
204 Chapter 6 Sequences and series
6.2.1 Arithmeticprogressions
An arithmetic progression is a sequence where each term is found by adding a fixed
quantity, called thecommon difference, tothe previous term.
Example6.5 Writedownthefirstfivetermsofthearithmeticprogressionwherethefirsttermis1and
the common difference is3.
Solution The second term is found by adding the common difference, 3, to the first term, 1, and
so the second termis4.Continuing inthis way wecan construct the sequence
1,4,7,10,13,...
A more general arithmetic progression has first term a and common difference d,
that is
a,a+d,a+2d,a+3d,...
Itis easytosee thatthekth termis
a+(k−1)d
Allarithmeticprogressions can be writtenrecursively asx[k] =x[k −1] +d.
Arithmetic progression:a,a +d,a +2d,...
a = first term,d = common difference,kth term =a + (k −1)d
Example6.6 Findthe10thand20thtermsofthearithmeticprogressionwithafirstterm5andcommon
difference −4.
Solution Here a = 5 and d = −4. The kth term is 5 − 4(k − 1). Therefore the 10th term is
5 −4(9) = −31andthe20thtermis5−4(19) = −71.
6.2.2 Geometricprogressions
A geometric progression is a sequence where each term is found by multiplying the
previous termby a fixed quantity called thecommon ratio.
Example6.7 Write down the geometric progression whose first term is 1 and whose common ratio
is 1 2 .
Solution Thesecondtermisfoundbymultiplyingthefirstbythecommonratio, 1 2 ,thatis 1 1
2 ×1 =
. Continuing inthisway we obtain the sequence
2
1, 1 2 , 1 4 , 1 8 ,...
6.2 Sequences 205
Ageneralgeometricprogressionhasfirsttermaandcommonratiorandcantherefore
be writtenas
a,ar,ar 2 ,ar 3 ,...
anditiseasytoseethatthekthtermisar k−1 .Allgeometricprogressionscanbewritten
recursively asx[k] =rx[k −1].
Geometric progression:a,ar,ar 2 ,...
a = first term,r = common ratio,kth term =ar k−1
6.2.3 Moregeneralsequences
We have already metanumber of infinite sequences. For example,
(1) x[k]=2,4,6,8,...
(2) x[k] = 1, 1 2 , 1 4 ,...
In case (1) the terms of the sequence go on increasing without bound. We say the sequenceisunbounded.Ontheotherhand,incase(2)itisclearthatsuccessivetermsget
smallerandsmallerandask → ∞,x[k] → 0.Thenotionofgettingcloserandcloserto
afixedvalueisveryimportantinmathematicsandgivesrisetotheconceptofalimit.In
case(2)wesay‘thelimitofx[k]asktendstoinfinityis0’andwewritethisconciselyas
lim x[k] = 0
k→∞
We say that the sequence converges to 0, and because its terms do not increase without
bound wesay itisbounded.
More formally, we say that a sequencex[k] converges to a limitl if, by proceeding
far enough along the sequence, all subsequent terms can be made to lie as close tol as
wewish.Whenever a sequence isnotconvergent itissaidtobedivergent.
Itispossibletohavesequences whicharebounded butnevertheless donotconverge
toalimit. The sequence
x[k] = −1,1,−1,1,−1,1,...
clearly fails to have a limit ask → ∞ although it is bounded, that is its values all lie
within a given range. This particular sequence issaidtooscillate.
Itispossibletoevaluatethelimitofasequence,whensuchalimitexists,fromknowledgeofitsgeneralterm.Tobeabletodothiswecanmakeuseofcertainrules,theproofs
of which arebeyond the scope of thisbook, but which wenow state:
Ifx[k]andy[k]aretwosequencessuchthatlim k→∞
x[k] =l 1
,andlim k→∞
y[k] =l 2
,
wherel 1
andl 2
arefinite,then:
(1) Thesequencegiven byx[k] ±y[k] has limitl 1
±l 2
.
(2) Thesequencegiven bycx[k], wherecisaconstant, has limitcl 1
.
(3) Thesequencex[k]y[k] has limitl 1
l 2
.
(4) Thesequence x[k]
y[k] has limit l 1
providedl
l 2 ≠ 0.
2
206 Chapter 6 Sequences and series
Furthermore, wecan always assume that
lim
k→∞
1
= 0 forany constantm > 0
km Example6.8 Find,ifpossible, the limitofeachofthe following sequences,x[k].
(a) x[k] = 1 k
(b) x[k]=5
(c) x[k] = 3 + 1 k
(d) x[k] = 1
k +1
(e) x[k]=k 2
k=1,2,3,4,...
k=1,2,3,4,...
k=1,2,3,4,...
k=1,2,3,4,...
k=1,2,3,4,...
Solution (a) The sequencex[k] isgiven by
1, 1 2 , 1 3 , 1 4 ,...
Successivetermsgetsmallerandsmaller,andask → ∞,x[k] → 0.Byproceeding
far enough along thesequence wecan get asclose tothelimit0as wewish.Hence
1
lim x[k] = lim
k→∞ k→∞ k = 0
(b) The sequencex[k] isgiven by 5,5,5,5,.... This sequence has limit5.
(c) Thesequence3,3,3,3,...haslimit3.Thesequence1, 1 2 , 1 3 ,...haslimit0.Therefore,usingrule(1)
wehave
lim
k→∞ 3 + 1 k =3+0=3
The terms of the sequence x[k] = 3 + 1 k are given by 4,31 2 ,31 3 ,...,andby
proceeding far enough along we can make all subsequent terms lie as close to the
limit3as wewish.
(d) The sequencex[k] = 1
k +1 ,k=1,2,3,4,...,isgivenby
1
2 , 1 3 , 1 4 ,...
and has limit0.
(e) The sequence x[k] = k 2 ,k = 1,2,3,4,..., is given by 1,4,9,16,..., and
increases withoutbound. This sequence has no limit-- itisdivergent.
Example6.9 Given a sequencewith generaltermx[k] = k −1
k +1 , find lim k→∞ x[k].
Solution It is meaningless simply to writek = ∞ to obtain lim k→∞
x[k] = ∞ −1 , since such
∞ +1
a quantity is undefined. What we should do is try to rewritex[k] in a form in which we
6.2 Sequences 207
can sensibly letk → ∞. Dividing both numerator and denominator byk, we write
k −1 1− (1/k)
=
k +1 1+ (1/k)
Then, ask → ∞, 1/k → 0 so that
( ) 1− (1/k)
lim
k→∞
1+ (1/k)
= lim k→∞
(1 − (1/k))
lim k→∞
(1 + (1/k))
byrule (4)
= 1 1
= 1
Example6.10 Given a sequencewith generalterm
x[k] = 3k2 −5k+6
k 2 +2k+1
find lim k→∞
x[k].
Solution Dividing the numerator and denominator byk 2 introduces terms which tend to zero as
k → ∞, that is
3k 2 −5k+6
k 2 +2k+1 = 3 − (5/k) + (6/k2 )
1 + (2/k) + (1/k 2 )
Then ask → ∞, wefind
lim
k→∞ x[k] = 3 1 = 3
k
Example6.11 2
Examine the behaviour of
3k+1 ask→∞.
Solution k 2
3k+1 = k
3+ (1/k)
As k → ∞, 1/k → 0 so that the denominator approaches 3. On the other hand, as
k → ∞the numerator tends toinfinity so thatthissequence diverges toinfinity.
EXERCISES6.2
1 Graphthe sequences given by
(a)x[k]=k,k=0,1,2,3,...
{ 3 k = 2
(b)x[k] = k=0,1,2,3,...
0 otherwise
(c)x[k]=e −k ,k=0,1,2,3,...
2 Thesequencex[k] is obtainedbysampling
f (t) = cos(t +2),t ∈ R.Thesampling begins at
t = 0 andthereafter att = 1,2,3, . . ..Writedown
the firstsixterms ofthe sequence.
3 A sequence,x[k], is defined by
x[k] = k2
2 +k,k=0,1,2,3,...
State the firstfiveterms ofthe sequence.
4 Writedownthefirstfiveterms,andplotgraphs,ofthe
sequences given recursively by
(a) x[k] =
x[k −1]
, x[0]=1
2
208 Chapter 6 Sequences and series
(b) x[k] = 3x[k −1] −2x[k −2],
x[0] = 2, x[1] = 1
5 Arecurrence relation is defined by
x[n +1] =x[n] +10, x[0] = 1,
n=0,1,2,3,...
Findx[1],x[2],x[3]andx[4].
6 Asequence isdefined by meansofthe recurrence
relation
x[n +1] =x[n] +n 2 , x[0] = 1,
n=0,1,2,3,...
Write down the firstfive terms.
7 Considerthe difference equation
x[n +2] −x[n +1] = 3x[n],
n=0,1,2,3,...
Ifx[0] = 1 andx[1] = 2, findthe terms
x[2],x[3],...,x[6].
8 Write down the 10th and 19th termsofthe arithmetic
progressions
(a) 8,11,14, . . .
(b) 8,5,2,...
9 An arithmetic progressionis given by
b, 2b
3 , b 3 ,0,...
(a) State the sixth term.
(b) State thekth term.
(c) Ifthe 20th term has avalue of15, findb.
10 Writedown the 5thand10th terms ofthe geometric
progression 8,4,2, . . ..
11 Find the 10th and20th terms ofthe geometric
progression with firstterm 3 andcommon ratio 2.
12 Ageometric progression is given by
a,ar,ar 2 ,ar 3 ,...
If |(k +1)thterm | > |kthterm |and (k +1)thterm ×
kth term < 0, whichofthe following,ifany, mustbe
true?
(a)r>1 (b)a>1
(c) r < −1 (d) ais negative
(e) −1<r<1
13 Ageometricprogression has firstterma = 1. The
ninth termexceeds the fifthtermby 240. Find
possiblevalues forthe eighthterm.
14 Ifx[k] = 3k+2 find lim
k k→∞ x[k].
3k+2
15 Findlim k→∞
k 2 +7 .
16 Findthe limitsasktends to infinity,ifthey exist, of
the following sequences:
(a) x[k] =k 3
(b) x[k] = 2k+3
4k+2
(c) x[k] = k2 +k
k 2 +k+1
( ) 4
6k+7
17 Findlim k→∞ .
3k−2
18 Findlim k→∞ x[k], ifitexists, when
(a) x[k] = (−1) k
(b) x[k] = 2 − k
10
( ) k
1
(c) x[k] =
3
(d) x[k] = 3k3 −2k 2 +4
5k 3 +2k 2 +4
( ) 2k
1
(e) x[k] =
5
Solutions
2 cos2,cos3,cos4,cos5,cos6,cos7
3 0, 3 15
,4,
2 2 ,12
4 (a) 1, 1 2 , 1 4 , 1 8 , 1 16
5 11,21, 31,41
6 1,1,2,6,15
7 5, 11, 26,59, 137
(b)2,1, −1, −5, −13
8 (a) 10th term = 35, 19th term = 62
(b) 10th term = −19, 19th term = −46
9 (a)− 2b
3
10 1 2 , 1 64
11 1536,1572864
(b)
12 Only(c)must be true
b(4−k)
3
(c) − 45
16
6.3 Series 209
13 ±128
14 3
15 0
16 (a) Limitdoes not exist (b) 1 2
(c) 1
17 16
18 (a) Limit doesnot exist
(b) Limit doesnotexist
(c) 0 (d) 3 5
(e) 0
6.3 SERIES
Whenever the terms of a sequence are added together we obtain what is known as a
series.Forexample,ifweaddthetermsofthesequence1, 1 2 , 1 4 , 1 8 ,weobtaintheseries
S, where
S=1+ 1 2 + 1 4 + 1 8
This series ends after the fourth term and is said to be a finite series. Other series we
shall meet continue indefinitely and aresaidtobeinfinite series.
Given an arbitrarysequencex[k], we use thesigmanotation
n∑
S n
= x[k]
k=1
to mean the sum x[1] +x[2] + ··· +x[n], the first and last values of k being shown
below and above the Greek letter , which is pronounced ‘sigma’. If the first term of
the sequence isx[0] rather thanx[1] wewould write ∑ n
k=0 x[k].
6.3.1 Sumofafinitearithmeticseries
An arithmetic series isthe sum ofan arithmetic progression. Consider the sum
S=1+2+3+4+5
Clearly this sums to 15. When there are many more terms it is necessary to find a more
efficient way of adding them up. The equation forScan be writtenintwo ways:
and
S=1+2+3+4+5
S=5+4+3+2+1
If weadd these two equations together we get
2S=6+6+6+6+6
There arefive termsso that
thatis
2S=5×6=30
S=15
Now a general arithmeticseries withkterms can be writtenas
S k
=a+(a+d)+(a+2d)+···+(a+(k−1)d)
butrewritingthis back tofront, wehave
S k
=(a+(k−1)d)+(a+(k−2)d)+···+(a+d)+a
210 Chapter 6 Sequences and series
Addingtogetherthefirsttermineachseriesproduces2a+ (k−1)d.Addingthesecond
terms together produces 2a + (k − 1)d. Indeed adding together the ith terms yields
2a + (k −1)d.Hence,
2S k
=(2a+(k−1)d)+(2a+(k−1)d)+···+(2a+(k−1)d)
} {{ }
k times
that is
so that
2S k
=k(2a + (k −1)d)
S k
= k 2 (2a+(k−1)d)
This formula tells us the sum to k terms of the arithmetic series with first term a and
common differenced.
Sumofanarithmeticseries:S k
= k 2 (2a+(k−1)d)
Example6.12 Findthesumofthearithmeticseriescontaining30terms,withfirstterm1andcommon
difference 4.
Solution We wish tofindS k
:
S k
=1+5+9+···
} {{ }
30 terms
UsingS k
= k 2 (2a + (k −1)d)wefindS 30 = 30 (2 +29 ×4) =1770.
2
Example6.13 Find the sum of the arithmetic series with first term 1, common difference 3 and with
last term100.
Solution Wealreadyknowthatthekthtermofanarithmeticprogressionisgivenbya+(k−1)d.
Inthis casethe lasttermis100. We can use thisfacttofind the numberofterms.Thus,
that is
100=1+3(k−1)
3(k−1)=99
k−1=33
k=34
So thereare34 termsinthisseries.Thereforethe sum,S 34
, isgiven by
S 34
= 34 {2(1) + (33)(3)}
2
= 17(101)
= 1717
6.3.2 Sumofafinitegeometricseries
6.3 Series 211
A geometric series is the sum of the terms of a geometric progression. If we sum the
geometric progression 1, 1 2 , 1 4 , 1 8 , 1
16 wefind
S=1+ 1 2 + 1 4 + 1 8 + 1
16
(6.1)
Iftherehadbeenalargenumberoftermsitwouldhavebeenimpracticaltoaddthemall
directly. However, let us multiplyEquation (6.1) by the common ratio, 1 2 :
1
2 S = 1 2 + 1 4 + 1 8 + 1
16 + 1
32
so that, subtracting Equation (6.2)from Equation (6.1), we find
(6.2)
S − 1 2 S=1− 1
32
since mostterms cancel out. Therefore 1 2 S = 31
32 andsoS=31 16 = 115 16 .
We can apply this approach more generally: when we have a geometric progression
with first termaand common ratior, the sum toktermsis
S k
=a+ar+ar 2 +ar 3 +···+ar k−1
Multiplying byr gives
rS k
=ar+ar 2 +ar 3 +···+ar k−1 +ar k
Subtraction givesS k
−rS k
=a −ar k , so that
S k
= a(1−rk )
1 −r
providedr ≠ 1
This formula gives the sum to k terms of the geometric series with first term a and
common ratior.
Sumofageometricseries:S k
= a(1−rk )
1 −r
r≠1
6.3.3 Sumofaninfiniteseries
When dealing with infinite series the situation is more complicated. Nevertheless, it is
frequently the case that the answer to many problems can be expressed as an infinite
series.Incertaincircumstances,thesumofaseriestendstoafiniteanswerasmoreand
more terms are included and we say the series has converged. To illustrate this idea,
consider the graphical interpretation of the series 1 + 1 2 + 1 4 + 1 8 +···,asgivenin
Figure 6.6.
Start at0and move a length 1:total distance moved = 1
Move further, a length 1 2 : total distance moved = 11 2
Move further, a length 1 4 : total distance moved = 13 4
212 Chapter 6 Sequences and series
1
–
3
–
7
0 1 1 1 1–
2
2 4 8
Figure6.6
Graphical interpretation ofthe series
1 + 1 2 + 1 4 + 1 8 +···.
At each stage the extra distance moved is half the distance remaining up to the point
x = 2. It is obvious that the total distance we move cannot exceed 2 although we
can get as close to 2 as we like by adding on more and more terms. We say that the
series 1 + 1 2 + 1 + ··· converges to 2. The sequence of total distances moved, given
4
previously,
1,1 1 2 ,13 4 ,17 8 ,...
iscalled thesequence of partialsumsof the series.
For any given infinite series ∑ ∞
k=1x[k], wecan formthe sequence ofpartialsums,
1∑
S 1
= x[k] =x[1]
S 2
=
S 3
=
.
k=1
2∑
x[k] =x[1] +x[2]
k=1
3∑
x[k] =x[1] +x[2] +x[3]
k=1
If the sequence {S n
} converges to a limitS, we say that the infinite series has a sumS
or that it has converged toS. Clearly not all infinite series will converge. For example,
consider the series
1+2+3+4+5+···
The sequence of partial sums is 1,3,6,10,15,.... This sequence diverges to infinity
and sothe series 1 +2+3+4+5+··· is divergent.
It is possible to establish tests or convergence criteria to help us to decide whether
or not a given series converges or diverges, but for these you must refer to a more
advanced text.
6.3.4 Sumofaninfinitegeometricseries
In the case of an infinite geometric series, it is possible to derive a simple formula for
its sum when convergence takes place. We have already seen that the sum tok terms is
given by
S k
= a(1−rk )
1 −r
r≠1
What we must do is allowkto become large so that more and more terms are included
in the sum. Provided that −1 < r < 1, thenr k →0ask→∞.ThenS k
→
a
1 −r .
When thishappens we write
S ∞
=
a
1 −r
6.3 Series 213
whereS ∞
is known as the ‘sum to infinity’. Ifr > 1 orr < −1,r k fails to approach a
finitelimitask → ∞and the geometric series diverges.
Sum of aninfinite geometric series:S ∞
=
a
1 −r
−1<r<1
Example6.14 Findthe sum toktermsofthe following series and deducetheirsumstoinfinity:
(a) 1+ 1 3 + 1 9 + 1
27 +··· (b) 12+6+3+11 2 +···
Solution (a) This isageometric series with first term1and common ratio1/3.Therefore,
S k
= a(1−rk )
= 1(1 − (1/3)k )
= 3 ( ( 1 k )
1 −
1 −r 2/3 2 3)
Ask → ∞, (1/3) k → 0sothatS ∞
= 3/2.
(b) This isageometric series with first term12 and common ratio 1 2 . Therefore,
S k
= 24(1 − (1/2) k )
Ask → ∞, (1/2) k → 0sothatS ∞
= 24.Thiscould,ofcourse,havebeenobtained
directly from the formulafor the sum toinfinity.
EXERCISES6.3
1 An arithmeticseries has afirstterm of4andits 30th
term is1000.Find the sum to 30terms.
2 Find the sum to 20 termsofthe arithmeticserieswith
firstterma,andcommon differenced,given by
(a)a=4,d=3 (b)a=4,d=−3
3 Ifthe sumto 10terms ofan arithmeticseries is100
anditscommondifference,d,is −3,finditsfirstterm.
4 Thesum to 20terms ofan arithmeticseries is
identicalto the sum to 22 terms. Ifthe common
difference is −2, find the firstterm.
5 Find the sum to five terms ofthe geometricseries
with firstterm 1 andcommon ratio 1/3. Find the sum
to infinity.
6 Find the sum ofthe first20terms ofthe geometric
serieswith firstterm 3 andcommon ratio 1.5.
7 Find the sum ofthe arithmeticseries with firstterm 2,
common difference 2, andlast term50.
8 The sumto infinityofageometricseries isfour times
the firstterm.Find the common ratio.
9 The sumto infinityofageometricseries istwicethe
sum ofthe firsttwo terms. Findpossiblevalues ofthe
common ratio.
10 Express the alternatingharmonic series
1 − 1 2 + 1 3 − 1 + · · ·in sigma notation.
4
11 Writedown the firstsixterms ofthe series ∑ ∞
k=0 z −k .
12 Explain why ∑ ∞
k=1 x[k] is the same as ∑ ∞
n=1 x[n].
Further, explain why both canbe written as
∑ ∞k=0
x[k +1].
Solutions
1 15060
2 (a) 650 (b) −490
3 23.5
4 41
214 Chapter 6 Sequences and series
5 1.494,S ∞ = 3 2
6 19946
7 650
8
3
4
9 ± √ 1 2
∑
10 ∞ (−1) n+1
1 n
11 1+ 1 z + 1 z 2 + 1 z 3 + 1 z 4 + 1 z 5
TechnicalComputingExercises6.3
Most technicalcomputing languageshave built-in
functionsforgenerating geometricseries.MATLAB ®
has the functioncumprod whichcalculates the
cumulative productofthe numberspassed to its
input. Ittakes the first term andthensuccessively
multiplies the succeedingarguments in turnbythe
result.
Forexample, in MATLAB ® wecould type:
S=cumprod([1 0.5 0.5 0.5 0.5])
to produce the geometricseriesandstore itin a row
vectorS.
1.000000 0.500000 0.250000
0.125000 0.062500
1 Calculate the sum ofallofthe elementsin the finite
seriesgiven above.
2 Increase the number ofelementsin the series to 10
andnotethe difference in youranswer.
3 Compare the answers to the previous two exercises to
the exact equationgiven atthe endofSection6.3.2
witha = 1 andr = 0.5.Whatwould you expect to
happeniftherewere100 elementsin the series?
6.4 THEBINOMIALTHEOREM
It is straightforward to show that the expression (a + b) 2 can be written as a 2 +
2ab + b 2 . It is slightly more complicated to expand the expression (a + b) 3 to a 3 +
3a 2 b +3ab 2 +b 3 . However, it is often necessary to expand quantities such as (a +b) 6
or (a +b) 10 , say, and then the algebra becomes extremely lengthy. A simple technique
forexpandingexpressionsoftheform (a+b) n ,wherenisapositiveinteger,isgivenby
Pascal’s triangle.
Pascal’striangleisthetriangleofnumbersshowninFigure6.7,whereitisobserved
that every entry is obtained by adding the two entries on either side in the preceding
row, always starting and finishing a row with a ‘1’. You will note that the third row
down,1 2 1,givesthecoefficientsintheexpansionof (a +b) 2 = 1a 2 +2ab +1b 2 ,
whilethefourthrow,1 3 3 1,givesthecoefficientsintheexpansionof (a +b) 3 =
1a 3 +3a 2 b +3ab 2 +1b 3 .Furthermore,thetermsintheseexpansionsarecomposedof
decreasingpowersofaandincreasingpowersofb.Whenwecometoexpandthequantity
(a +b) 4 therowbeginning‘1 4’inthetrianglewillprovideuswiththenecessary
coefficientsintheexpansionandwemustsimplytakecaretoputinplacetheappropriate
powers ofaandb. Thus (a +b) 4 = 1a 4 +4a 3 b +6a 2 b 2 +4ab 3 +1b 4 .
1
1 1
1 2 1
1 3 3 1
1 4 + 6 4 1
1 5 10 10 5 1
1 6 15 20 15 6 1
Figure6.7
Pascal’striangle.
6.4 The binomial theorem 215
Example6.15 UsePascal’s triangle toexpand (a +b) 6 .
Solution We look to the row commencing ‘1 6’, that is 1 6 15 20 15 6 1, because
a+bisraised tothe power 6.This row provides the necessary coefficients. Thus,
(a +b) 6 =a 6 +6a 5 b +15a 4 b 2 +20a 3 b 3 +15a 2 b 4 +6ab 5 +b 6
Example6.16 Expand (1 +x) 7 usingPascal’s triangle.
Solution Forming the rowcommencing ‘1 7’we select the coefficients
1 7 21 35 35 21 7 1
Inthisexample,a = 1 andb=xso that
(1 +x) 7 =1+7x +21x 2 +35x 3 +35x 4 +21x 5 +7x 6 +x 7
When it is necessary to expand the quantity (a + b) n for large n, it is clearly inappropriate
to use Pascal’s triangle since an extremely large triangle would have to be
constructed. However, it is frequently the case that in such situations only the first few
termsinthe expansion are required. This iswhere thebinomial theoremisuseful.
The binomial theorem states thatwhennisapositive integer
(a+b) n =a n +na n−1 b +
+···+b n
n(n −1)
a n−2 b 2 +
2!
n(n −1)(n −2)
a n−3 b 3
3!
Itisalsofrequently quoted for the case whena = 1 andb=x, so that
(1+x) n =1+nx+
n(n −1)
x 2 +
2!
n(n −1)(n −2)
x 3 +···+x n (6.3)
3!
Example6.17 Expand (1 +x) 10 uptothe terminx 3 .
Solution WecouldusePascal’striangletoanswerthisquestionandlooktotherowcommencing
‘1 10’buttofindthisrowconsiderablecalculationswouldbeneeded.Weshallusethe
binomialtheoreminthe formofEquation (6.3). Takingn = 10, wefind
(1+x) 10 =1+10x+ 10(9) x 2 + (10)(9)(8) x 3 +···
2! 3!
=1+10x+45x 2 +120x 3 +···
so that, up toand includingx 3 , the expansion is
1+10x+45x 2 +120x 3
216 Chapter 6 Sequences and series
Wehaveassumedintheforegoingdiscussionthatnisapositiveintegerinwhichcase
the expansion given by Equation (6.3) will eventually terminate. In Example 6.17 this
wouldoccurwhenwereachedtheterminx 10 .Itcanbeshown,however, thatwhennis
not a positive integer the function (1 +x) n and the expansion given by
(1+x) n =1+nx+
n(n −1)
x 2 +
2!
n(n −1)(n −2)
x 3 +··· (6.4)
3!
havethesamevalueprovided −1 <x<1.However,whennisnotapositiveintegerthe
seriesdoesnotterminateandwemustdealwithaninfiniteseries.Thisseriesconverges
when −1 < x < 1 and the expansion is then said to be valid. Whenxlies outside this
intervaltheinfiniteseriesdivergesandsobearsnorelationtothevalueof (1 +x) n .The
expansion isthen saidtobe invalid.
Thebinomialtheorem:
(1+x) n =1+nx+
n(n −1)
x 2 +
2!
n(n −1)(n −2)
x 3 +···
3!
−1<x<1
Example6.18 Usethebinomialtheoremtoexpand
Solution
the terminx 3 .
1
1 +x inascendingpowersofxuptoandincluding
1
1 +x canbewrittenas (1 +x)−1 .UsingthebinomialtheoremgivenbyEquation(6.4)
withn = −1, wefind
(1 +x) −1 = 1 + (−1)x + (−1)(−2) x 2 + (−1)(−2)(−3) x 3 +···
2! 3!
=1−x+x 2 −x 3 +···
provided −1 <x<1.Consequently,ifinfutureapplicationswecomeacrosstheseries
1 −x+x 2 −x 3 + ..., we shall be able to write it in the form (1 +x) −1 . This closed
formavoidstheuseofaninfiniteseriesandsoitiseasiertohandle.Weshallmakeuse
of this technique inChapter 22 when wemeet theztransform.
Example6.19 Obtain a quadraticapproximation to (1 −2x) 1/2 usingthe binomialtheorem.
Solution Using Equation (6.4)withxreplaced by −2x andn = 1 2 wehave
( 1
(1 −2x) 1/2 = 1 + (−2x) +
2)
(1/2)(−1/2) (−2x) 2 + ···
2!
=1−x− 1 2 x2 +···
provided that −1 < −2x < 1, that is − 1 2 < x < 1 . A quadratic approximation is
2
therefore 1 −x− 1 2 x2 .
6.4 The binomial theorem 217
EXERCISES6.4
1 Usethe binomial theorem to expand
(a) (1 +x) 3 (b) (1 +x) 4 (c)
(d)
(
1 − x 2) 5
(e)
(
2 + x 2) 5
(f)
2 UsePascal’striangleto expand (a +b) 8 .
(
(
1 + x 3
3 − x 4
3 UsePascal’striangleto expand (2x +3y) 4 .
4 Expand (a −2b) 5 .
5 Usethe binomial theoremto findthe expansion of
(3 −2x) 6 upto andincluding the term inx 3 .
6 Obtainthe firstfour terms in the expansion of
(
1 + 1 2 x ) 10
.
7 Obtainthe firstfive termsin the expansion of
(1 +2x) 1/2 .Statethe rangeofvalues ofxforwhich
the expansion is valid.Chooseavalue ofxwithin the
rangeofvalidity andcomputevalues ofyour
expansionforcomparison with the truefunction
values. (
8 Expand 1 + 1 ) −4
2 x in ascending powersofxupto
the term inx 4 ,stating the range ofvalues ofxfor
whichthe expansion is valid.
(
9 Expand 1 + 1 −1/2
in descending powers upto the
x)
fourthterm.
) 4
) 4
10 (a) Expand (1 +x 2 ) 4 .
(b) Expand (1 +1/x 2 ) 4 .
11 A function, f (x),isgiven by
( ) 1/2
f(x)=
1 + 1 x
(a) Obtain the firstfour termsin the expansion of
f (x)in descending powersofx.State the range
ofvalues ofxforwhich the expansion is valid.
(b) By writing f (x)in the form
f(x) =x −1/2 (1 +x) 1/2
obtain the firstfourterms in the expansion of
f (x)in ascending powersofx.State the range of
values ofxforwhich the expansion is valid.
12 The function,g(x),is defined by
g(x) =
1
(1+x 2 ) 4
(a) Obtain the firstfour termsin the expansion of
g(x) in ascending powersofx. Statethe range of
values ofxforwhich the expansion is valid.
(b) By rewritingg(x) in an appropriate form,obtain
the firstfour termsin the expansion ofg(x) in
descending powers ofx.State the range ofvalues
ofxforwhich the expansion isvalid.
Solutions
1 (a) 1+3x+3x 2 +x 3
3 16x 4 +96x 3 y +216x 2 y 2 +216xy 3 +81y 4
(b) 1+4x+6x 2 +4x 3 +x 4
4 a 5 −10a 4 b +40a 3 b 2 −80a 2 b 3 +80ab 4 −32b 5
(c) 1+ 4x
3 + 2x2
3 + 4x3
27 + x4
5 729 −2916x +4860x 2 −4320x 3
81
(d) 1− 5x
2 + 5x2
2 − 5x3
4 + 5x4
16 − x5 6 1+5x+ 45x2 +15x 3
32
4
(e) 32 +40x +20x 2 +5x 3 + 5x4
8 + x5 7 1+x− x2
32
2 + x3
2 − 5x4
8 valid for −1 2 <x< 1 2
(f) 81 −27x + 27x2 − 3x3
8 16 + x4 8 1−2x+ 5x2
256
2 − 5x3
2 + 35x4 valid for −2 <x<2
16
2 a 8 +8a 7 b +28a 6 b 2 +56a 5 b 3 +70a 4 b 4 +56a 3 b 5 + 9 1 − 1
28a 2 b 6 +8ab 7 +b 8 2x + 3
8x2 − 5
16x 3
218 Chapter 6 Sequences and series
(
)
10 (a) 1+4x 2 +6x 4 +4x 6 +x 8
(b) 1+ 4 x 2 + 6 x 4 + 4 x 6 + 1 (b) x −1/2 1 + x 2 − x2
8 + x3 valid for |x| < 1
16
x 8
12 (a) 1 −4x 2 +10x 4 −20x 6 validfor |x| < 1
11 (a) 1+ 1 2x − 1
8x 2 + 1
(
·)
16x 3 valid for |x| > 1 (b) x −8 1 − 4 x 2 + 10
x 4 − 20
x 6 · · valid for |x| > 1
6.5 POWERSERIES
Aparticularlyimportantclassofseriesareknownaspowerseriesandtheseareinfinite
series involving integer powers ofthe variablex. For example,
and
1+x+x 2 +x 3 +···
1+x+ x2
2 + x3
6 +···
arebothpowerseries.Notethatapowerseriescanberegardedasaninfinitepolynomial.
Many common functions can be expressed intermsof a power series,forexample
sinx=x− x3
3! + x5
− ··· x inradians
5!
which converges for any value ofx. For example,
sin(0.5) = 0.5 − (0.5)3
6
+ (0.5)5
120 −···
Taking justthe first threeterms,we find
sin(0.5) ≈ 0.5 −0.0208333 +0.0002604 = 0.4794271
as compared with the truevalue, sin0.5 = 0.4794255.
More generally, a power series is only meaningful if the series converges for the
particular value of x chosen. We define an important quantity known as the radius of
convergence,R, as the largest value for which anxchosen in the interval −R <x<R
causes the series toconverge.
Theopeninterval (−R,R)isknownastheintervalofconvergence.Testsforconvergenceofapowerseriesarethesubjectofmoreadvancedtexts.Furtherconsiderationwill
be given to power series in Chapter 18, but for future reference we give some common
expansions now:
sinx=x− x3
3! + x5
− ··· x inradians
5!
cosx=1− x2
2! + x4
− ··· x inradians
4!
e x =1+x+ x2
2! + x3
3! + x4
4! +···
all of which converge for any value ofx.
6.6 Sequences arising from the iterative solution of non-linear equations 219
Each of these series converges rapidly when x is small, and so can be used to obtain
usefulapproximations. Inparticular, wenote that
Ifxis smalland measured inradians
sinx≈x and cosx≈1− x2
2!
These formulae areknown as thesmall-angle approximations.
EXERCISES6.5
1 Thepower seriesexpansion ofe x is given by
e x =1+x+ x2
2! + x3
3! +···
andis valid foranyx.Take four terms ofthe series
whenx=0, 0.1,0.5and1, tocompare the sumtofour
termswith the value ofe x obtained fromyour
calculator. Comment upon the result.
2 Using the power series expansion forcosx:
(a) Writedown the power series expansion for
cos2x.
(b) Writedown the power series expansion for
cos(x/2).
Byconsidering the power series expansionfor
cos(−x) showthatcosx = cos(−x).
3 Byconsidering the power series expansionofe x find
∑ ∞k=0
1/k!.
4 Obtainacubicapproximation to e x sinx.
5 (a) Statethe power series expansion fore −x .
(b) Byusingyoursolution to (a)andthe expansion
fore x ,deduce the power seriesexpansions of
coshx andsinhx.
Solutions
1
x e x Sum to 4 terms
0 1 1
0.1 1.1052 1.1052
0.5 1.6487 1.6458
1 2.7183 2.6667
Values are in close agreementwhenxissmall.
2 (a) 1−2x 2 + 2x4
3
3 e
−··· (b) 1−x2
8 + x4
384 −···
4 x+x 2 + x3
3
5 (a) 1−x+ x2
2! − x3
3! +···
(b) coshx=1+ x2
2! + x4
4! +···,
sinhx=x+ x3
3! + x5
5! +···
6.6 SEQUENCESARISINGFROMTHEITERATIVESOLUTION
OFNON-LINEAREQUATIONS
Itisoften necessarytosolve equationsofthe form f (x) = 0.For example,
f(x)=x 3 −3x 2 +7=0, f(x)=lnx− 1 x = 0
To solve means to find values of x which satisfy the given equation. These values are
knownasroots.Forexample,therootsofx 2 −3x +2 = 0arex = 1andx = 2because
220 Chapter 6 Sequences and series
whenthesevaluesaresubstitutedintotheequationbothsidesareequal.Equationswhere
the unknown quantity, x, occurs only to the first power are called linear equations.
Otherwise an equation is non-linear. A simple way of finding the roots of an equation
f (x) = 0 istosketch a graph ofy = f (x)as shown inFigure 6.8.
The roots are those values of x where the graph cuts or touches the x axis.
Generally, thereisnoanalytical wayofsolvingtheequation f (x) = 0andsoitisoften
necessary to resort to approximate or numerical techniques of solution. An iterative
technique is one which produces a sequence of approximate solutions which may converge
to a root. Iterative techniques can fail in that the sequence produced can diverge.
Whetherornotthishappensdependsupontheequationtobesolvedandtheavailability
ofagoodestimateoftheroot.Suchanestimatecouldbeobtainedbysketchingagraph.
The technique we shall describe here is known as simple iteration. It requires that the
equation berewrittenintheformx =g(x). Anestimateoftherootismade,sayx 0
,and
this value is substituted into the r.h.s. ofx =g(x). This yields another estimate,x 1
. The
process isthen repeated. Formally weexpress this as
x n+1
=g(x n
)
This is a recurrence relation which produces a sequence of estimates x 0
,x 1
,x 2
,....
Under certain circumstances the sequence will converge to a root of the equation. It
is particularly simple to program this technique on a computer. A check would be built
into the program totestwhether ornot successive estimates areconverging.
Example6.20 Solve the equation f (x) = e −x −x = 0 by simple iteration.
Solution Theequationmustfirstbearrangedintotheformx =g(x),andsowewritee −x −x = 0
as
x=e −x
In this example we see that g(x) = e −x . The recurrence relation which will produce
estimates of the rootis
x n+1
= e −x n
Table6.1
Iterativesolutionof
e −x −x=0.
n
x n
f(x)
Figure6.8
A rootof f (x) = 0 occurswhere the
graph touches orcrossesthexaxis.
x
0 0
1 1
2 0.368
3 0.692
4 0.501
5 0.606
6 0.546
.
.
. 0.567
6.6 Sequences arising from the iterative solution of non-linear equations 221
Suppose weestimatex 0
= 0.Then
Then
x 1
=e −x 0 =e −0 =1
x 2
= e −x 1 = e −1 = 0.368
Theprocessiscontinued.ThecalculationisshowninTable6.1fromwhichweseethat
the sequence eventually converges to 0.567 (3 d.p.). We conclude thatx = 0.567 is a
rootof e −x −x = 0.
Notethatiftheequationtobesolvedinvolvestrigonometricfunctions,angleswillusuallybe
measured inradians and not degrees.
Itispossibletodeviseatesttocheckwhetheranygivenrearrangementwillconverge.
Fordetailsofthisyoushouldrefertoatextbookonnumericalanalysis.Thereareother
more sophisticated iterative methods for the solution of non-linear equations. One such
method, the Newton--Raphson method, isdiscussedinChapter 12.
EXERCISES6.6
1 Showthat the quadratic equationx 2 −5x −7 = 0 can
be written in the formx = √ 7+5x.Withx 0 =6
locate arootofthisequation.
2 Forthe quadraticequationofQuestion1showthat an
alternative rearrangement is
x = x2 −7
.Withx
5 0 = 0.6 findthe second
solution ofthisequation.
3 Forthe quadraticequationofQuestion1showthat
anotherrearrangement isx = 7 +5. Tryto
x
solvethe equation usingvarious initialestimates.
Investigatefurtheralternative arrangements ofthe
original equation.
4 Showthat onerecurrence relationforthe solution of
the equation
e x +10x−3=0
is
x n+1 = 3−ex n
10
Withx 0 = 0 locate aroot ofthe given equation.
5 (a) Show thatthe cubicequationx 3 +3x −5 = 0
can be writtenas
(i)x= 5−x3 ,
3
(ii) x = 5
x 2 +3 .
(b) By sketching agraph forvalues ofxbetween 0
and 3 obtain aroughestimate ofthe rootofthe
equation given in part (a).
(c) Determine which, ifeither, ofthe arrangements
in part (a)converges more rapidly to the root.
Solutions
1 6.14
2 −1.14
5 (b) 1.15
(c) Arrangement (ii)converges more rapidly
4 0.18
222 Chapter 6 Sequences and series
TechnicalComputingExercises6.6
Write acomputer program to implement the simple
iteration method. Bycomparing successive estimates
the program should check whether convergence is
taking place.
REVIEWEXERCISES6
1 Write down andgraph the firstfive terms ofthe
sequencesx[k] defined by
(a) x[k] = (−1) k , k =0,1,2,3,...
(b) x[k] = (−1)k
2k+1 , k=0,1,2,3,...
2 Find expressions forthekth termsofthe sequences
whose firstfiveterms are
(a) 1, 9, 17, 25,33
(b) −1, 1, −1, 1, −1
3 Forthe Fibonacci sequence
showthat
1,1,2,3,5,8,...
lim
k→∞
x[k +1]
x[k]
= 1 2
(
1 + √ )
5
[Hint:writex[k +1] =x[k] +x[k −1],form
x[k +1]/x[k] andtake limits.]
4 Usethe binomial theoremto expand (1 +x+x 2 ) 5 as
far asthe term inx 3 .
1
5 Usethe binomial theoremto expand
(3 +x) 3
in ascending powers ofxasfar asthe term inx 3 .
6 Thepower series expansion forln(1 +x) isgiven by
ln(1+x)=x− x2
2 + x3
3 − x4
4 +···
and isvalid for −1 <x 1. Take anumber ofvalues
ofxin thisinterval and obtain an approximate value
ofln(1 +x)by meansofthisseries.Compare your
answerswiththevaluesobtainedfromyourcalculator.
7 Bymultiplyingboth numerator anddenominatorof
√ √
k+1− k
by √ k+1+ √ k find
2
√ √
k+1− k
lim
k→∞ 2
8 Find lim
k→∞
(
) 3
3k−1
.
2k+7
2k 5 −3k 2
9 Find lim
k→∞ 7k 7 +2k .
10 Writedown the firsteight termsofthe series ∑ n
k=1 k.
Bynotingthat thisis an arithmeticseries showthat
n∑
k =
k=1
n(n +1)
2
11 Writedown the firstsixterms ofthe sequence defined
bythe recurrence relation
x[n +3] =x[n +2] −2x[n]
x[0] = 0 x[1] = 2 x[2] = 3
12 Findthe limit, ifitexists, ask → ∞ ofthe geometric
progression
when
a,ar,ar 2 ,...,ar k−1 ,...
(a) −1<r<1
(b) r>1
(c) r<−1
(d) r=1
(e) r=−1
13 An arithmeticseries has afirst termof4andthe 10th
termis 0.
(a) FindS 20 .
(b) IfS n = 0,findn.
14 Ageometricseries has
S 3 = 37
8
S 6 = 3367
512
Findthe firsttermand the common ratio.
Review exercises 6 223
15 (a) Write down the seriesgiven by ∑ 5
r=1 r 2 .
(b) The sumofthe squaresofthe firstnwhole
numbers can be foundfrom the formula
n∑
r 2 n(n +1)(2n +1)
=
6
r=1
Use thisformula to find
(i) ∑ 5
r=1 r 2 , (ii) ∑ 10
r=1 r 2 , (iii) ∑ 10
r=6 r 2 .
16 (a) Writedown the series given by ∑ 6
r=1 r 3 .
(b) The sumofthe cubesofthe firstnwhole
numbers canbe foundfrom the formula
(
n∑
r 3 =
n(n +1)
2
r=1
Use thisformula to find
) 2
(i) ∑ 6
r=1 r 3 , (ii) ∑ 12
r=1 r 3 , (iii) ∑ 12
r=7 r 3 .
17 Thethirdterm ofan arithmeticprogression is 18.The
fifthterm is28. Find the sum of20terms.
18 (a) Find an expression forthe general term in the
sequence
2,5,10,17,...
(b) Define the sequence in terms ofarecurrence
relation.
19 (a) Show thatthe equationx 3 +2x −14 = 0 canbe
rearrangedintothe formx = 3√ 14 −2x.With
x 0 = 2 use simple iteration to findarootofthe
equation.
(b) Rearrange the equation 0.8sinx −0.5x = 0 into
the formx =g(x).Withx 0 = 2 use simple
iteration to findarootofthe equation.
(c) Rearrange the equationx 3 = 2e −x intothe form
x =g(x).Withx 0 = 0usesimpleiterationtofind
a rootofthe equation.
20 Writeoutexplicitlythe firstfour terms ofthe series
∞∑
m=0
(−1) m x 2m
2 2m (m!) 2 .
21 Writeoutexplicitlythe firstfour terms ofthe series
∞∑
m=0
(−1) m x 2m+1
2 2m+1 m!(m +1)! .
Solutions
1 (a) 1,−1,1, −1,1 (b) 1, − 1 3 , 1 5 , −1 7 , 1 9
2 (a) 8k−7 k 1 (b) (−1) k k 1
4 1+5x+15x 2 +30x 3
(
+···)
1
5 1 −x+ 2x2
27 3 − 10x3
27
7 0
8
9 0
27
8
11 0, 2, 3, 3, −1, −7
12 (a) 0 (b) nolimit (c) nolimit
(d) a (e) nolimit
13 (a) − 40
9
(b) 19
14 2, 3 4
15 (a) 1+4+9+16+25
(b) (i) 55 (ii) 385 (iii) 330
16 (a) 1+8+27+64+125+216
(b) (i) 441 (ii) 6084 (iii) 5643
17 1110
18 (a) x[k]=k 2 +1,k=1,2,3,...
(b)x[k+1]=x[k]+2k+1
19 (a) 2.13
(b) x = 1.6sinx,1.6
(c)x= 3 √
2e −x ,0.93
20 1− 1 4 x2 + 1 64 x4 − 1
2304 x6 +···
21 1 2 x − 1 16 x3 + 1
384 x5 − 1
18432 x7 +···
7 Vectors
Contents 7.1 Introduction 224
7.2 Vectorsandscalars:basicconcepts 224
7.3 Cartesiancomponents 232
7.4 Scalarfieldsandvectorfields 240
7.5 Thescalarproduct 241
7.6 Thevectorproduct 246
7.7 Vectorsofndimensions 253
Reviewexercises7 255
7.1 INTRODUCTION
Certainphysicalquantitiesarefullydescribedbyasinglenumber:forexample,themass
of a stone, the speed of a car. Such quantities are called scalars. On the other hand,
some quantities are not fully described until a direction is specified in addition to the
number. For example, a velocity of 30 metres per second due east is different from a
velocityof30metrespersecondduenorth.Thesequantitiesarecalledvectorsanditis
importanttodistinguishthemfromscalars.There aremany engineering applications in
which vector and scalar quantities play important roles. For example, speed, potential,
work and energy are scalar quantities, while velocity, electric and magnetic forces, the
position of a robot and the state-space representation of a system can all be described
by vectors. A variety of mathematical techniques have been developed to enable useful
calculations tobe carriedout usingvectors and inthischapter these will be discussed.
7.2 VECTORSANDSCALARS:BASICCONCEPTS
Scalarsarethesimplestquantitieswithwhichtodeal;thespecificationofasinglenumberisallthatisrequired.Vectorsalsohaveadirectionanditisusefultoconsideragraphical
representation. Thus the line segment AB of length 4 in Figure 7.1 can represent
7.2 Vectors and scalars: basic concepts 225
A
a
B
A
C
AB
B
CD
D
Figure7.1
Avector, AB,oflength → 4.
Figure7.2
Two equal vectors.
a vector in the direction shown by the arrow on AB. This vector is denoted by AB.
→
→
Note that AB ≠ BA. → The vector AB → →
is directed from A to B, but BA is directed from
BtoA.
An alternative notation is frequently used: we denote AB → by a. This bold notation
is commonly used in textbooks but the notation a is preferable for handwritten work.
Wenowneedtorefertothediagramtoappreciate theintended direction.Thelengthof
the line segment represents the magnitude, or modulus, of the vector and we use the
notation | → AB|, |a| or simply a to denote this. Note that whereas a is a vector, |a| is a
scalar.
7.2.1 Negativevectors
The vector −a is a vector in the opposite direction to, but with the same magnitude as,
a. Geometrically itwill be → BA. Thus −aisthe same as → BA.
7.2.2 Equalvectors
Twovectorsareequaliftheyhavethesamemagnitudeanddirection.InFigure7.2vectors
→ CDand → ABareequaleventhoughtheirlocationsdiffer.Thisisausefulpropertyof
vectors:avectorcanbetranslated,maintainingitslengthanddirectionwithoutchanging
thevectoritself.Thereareexceptionstothisproperty.Forexample,weshallsoonmeet
position vectors which are used to locate specific fixed points in space. They clearly
cannot be moved around freely. Nevertheless most of the vectors we shall meet can be
translated freely, and assuch are often calledfree vectors.
7.2.3 Vectoraddition
Itisfrequentlyusefultoaddtwoormorevectorstogetherandtheadditionofvectorsis
definedbythetrianglelaw.ReferringtoFigure7.3,ifwewishtoaddABto → CD,
→ CDis
translateduntilCandBcoincide.Asmentionedearlier,thistranslationdoesnotchange
→
thevectorCD.Thenthesum,
→ AB+ CD,isdefinedbythevectorrepresentedbythethird
→
sideof the completed triangle, thatis → AD(Figure 7.4).Thus,
→
AB + → CD = → AD
→
Similarly,ifABisdenotedbya,
→ CDisdenotedbyb, ADisdenotedbyc,thenwehave
c=a+b
Wenotethattoaddaandbatriangleisformedusingaandbastwoofthesidesinsuch
a way that the head of one vector touches the tail of the other as shown in Figure 7.4.
The suma +bisthen represented by the third side.
→
→
226 Chapter 7 Vectors
A
Figure7.3
a
B
Two vectors, → ABand → CD.
C
b
Itispossible toprove the following rules:
D
A
Figure7.4
AB
a
B
c
AD
C
b
CD
Addition ofthe two vectors ofFigure 7.3using
the triangle law.
a +b = b +a vector addition iscommutative
a + (b +c) = (a +b) +c vector addition isassociative
D
ToseewhyitisappropriatetoaddvectorsusingthetrianglelawconsiderExamples7.1
and 7.2.
Engineeringapplication7.1
Routingofanautomatedvehicle
Automatedvehiclesareacommonfeatureofthemodernwarehouse.Sometimesthey
are guided by tracks set into the factory floor. Consider the case of an automated
vehiclecarryingelectricalcomponentsfromstoresatAtoworkersatCasillustrated
inFigure 7.5.
ThevehiclemayarriveatCeitherdirectly,orviapointB.ThemovementfromA
toBcanberepresentedbyavector → ABknownasadisplacementvector,whosemagnitude
is the distance between points A and B. Similarly, movement from B to C is
representedby → BC,andmovementdirectlyfromAtoCisrepresentedby → AC.Clearly,
sinceA,BandCarefixedpoints,thesedisplacementvectorsarefixedtoo.Sincethe
headofthevector → ABtouchesthetailofthevector → BCwearereadytousethetriangle
lawof vector addition tofind the combined effect of the two displacements:
→
AB + → BC = → AC
Thismeansthatthecombinedeffectofdisplacements → ABand → BCisthedisplacement
→
AC. We say that → AC istheresultant of → ABand → BC.
C
AC
BC
A
AB
B
Figure7.5
Routing ofan automated vehicle from storesat
Ato workersat C.
7.2 Vectors and scalars: basic concepts 227
In considering motion from point A to point B, the vector → AB is called a displacement
vector.
Engineeringapplication7.2
Resultantoftwoforcesactingonabody
Sometimesitisusefultobeabletocalculatetheresultantforcewhenseveraldifferent
forces act on a body. Consider the simplest case when there aretwo forces.
AforceF 1
of2Nactsverticallydownwards,andaforceF 2
of3Nactshorizontally
to the right, upon the body shown in Figure 7.6. Translating F 1
until its tail touches
the head of F 2
we can apply the triangle law to find the combined effect of the two
forces.This isasingle forceRknown as the resultant ofF 2
andF 1
. We write
R=F 2
+F 1
andsaythatRisthevectorsumofF 2
andF 1
.Theresultantforce,R,actsatanangle
of θ to the vertical, where tanθ = 3/2, and has a magnitude given by Pythagoras’s
theorem as √ 2 2 +3 2 = √ 13 N.
F 2
u
F 1
R
u
Figure7.6
Resultant force,R, producedbyavertical force,F 1 ,anda
horizontalforce,F 2 .
Engineeringapplication7.3
Resolvingaforceintotwoperpendiculardirections
Inthepreviousexamplewesawthattwoforcesactinguponabodycanbereplacedby
asingleforcewhichhasthesameeffect.Equivalentlywecanconsiderasingleforce
astwoforcesactingatrightanglestoeachother.ConsidertheforceFinFigure7.7.
It can be replaced by two forces, one of magnitude |F|cosθ and one of magnitude
|F|sin θ asshown.WesaythattheforceFhasbeenresolvedintotwoperpendicular
components.
For example, Figure 7.8 shows a force of 5 N acting at an angle of 30 ◦ to thex
axis. It can be resolved into two components. The first component is directed along
the x axis and has magnitude 5cos30 ◦ N. The second component is perpendicular
to the first and has magnitude 5sin30 ◦ N. These two components combine to have
the sameeffect asthe original5Nforce.
➔
228 Chapter 7 Vectors
F sin u
u
F
F cos u
Figure7.7
Aforce F can be resolved into two
perpendicular components.
5 sin 30°
30°
5
5 cos 30°
Figure7.8
The5Nforce can be resolved intotwo
perpendicular components.
x
Example7.1 Vectors p,q,r andsform the sidesofthe squareshown inFigure 7.9.Express
(a) pintermsof r
(b) sintermsof q
→
(c) diagonalBDintermsof qandr
Solution (a) The vector p has the same length as r but has the opposite direction. Therefore
p=−r.
(b) Vectorshasthesamelengthasqbuthastheoppositedirection.Therefores = −q.
(c) The head of BC → →
coincides with the tail of CD. Therefore, by the triangle law of
addition, the third sideoftriangle BCDrepresents the sum of → BCand → CD, thatis
→
BD = → BC + → CD
=q+r
7.2.4 Vectorsubtraction
Subtraction of one vector from another is performed by adding the corresponding negative
vector; thatis, ifweseeka −b, weforma + (−b).
Example7.2 ConsidertherectangleillustratedinFigure7.10withaandbasshown.Expressinterms
ofA,B, CorDthe vectors a −bandb −a.
B
p
q
C
r
a
B
C
A
s
Figure7.9
Thevectors p,q,r andsform the sides
ofasquare.
D
A
b
Figure7.10
Therectangle ABCD.
D
7.2 Vectors and scalars: basic concepts 229
B
B
AB
DB
BA
BD
A
DA
D
A
AD
D
Figure7.11
ThevectorsABand → DA.
→
Figure7.12
ThevectorsBAand → AD.
→
Solution We have
hence
b = → AD
−b = → DA
Then
a−b=a+(−b)
= → AB + → DA
= → DA + → AB by commutativity
→
DAand → ABareshown inFigure 7.11. Their sum isgiven by the triangle law, thatis
Similarly,
hence
a−b= → DA + → AB = → DB
a = → AB
−a = → BA
Then
b−a=b+(−a)
= → AD + → BA
= → BA + → AD
→
BAand → ADareshowninFigure7.12.Againtheirsumisgivenbythetrianglelaw,thatis
b−a= → BA + → AD = → BD
Example7.3 Referring to Figure 7.13, if r 1
and r 2
are as shown, find the vector b = → QP represented
by the third sideof the triangle OPQ.
Solution FromFigure 7.13 we note fromthe triangle law that
→
QP = → QO + → OP = → OP + → QO by commutativity.
→
ButQO = −r 2
andQP → = b, and so
b=r 1
−r 2
230 Chapter 7 Vectors
r 1
P
P
Q
P
PQ
Q
O
r 2
b
Q
p
O
q
r
R
O
p
q
Figure7.13
Thetwo vectors r 1 and r 2 .
Figure7.14
Thetetrahedron OPQR.
Figure7.15
Thetriangle OPQ.
Vectors do not necessarily lie in a two-dimensional plane. Three-dimensional vectors
are commonly used as isillustratedinthe following example.
Example7.4 OPQR is the tetrahedron shown in Figure 7.14. Let
→
ExpressPQ,
→ QRandRP → intermsofp, qandr.
Solution Consider the triangle OPQ shown in Figure 7.15. We note that
→
OP=p, → OQ=qand → OR=r.
→
OQ represents the third
sideofthetriangleformedwhenpand → PQareplacedheadtotail.Usingthetrianglelaw
we find
→
OP + → PQ = → OQ
Therefore,
→
PQ = → OQ − → OP
=q−p
→
Similarly, QR=r−qandRP=p−r.
→
7.2.5 Multiplicationofavectorbyascalar
Ifkis any positive scalar and a is a vector thenka is a vector in the same direction as a
butktimesaslong.Ifkisanynegative scalar,kaisavectorintheoppositedirectionto
a, andktimesas long.Bywayof example, studythe vectors inFigure 7.16. Clearly 2a
is twice as long as a but has the same direction. The vector 1 b is half as long as b but
2
has the same direction asb. Itispossible toprove the following rules.
For any scalarskandl, and any vectorsaand b:
k(a+b)=ka+kb
(k+l)a=ka+la
k(la) = (kl)a
Thevectorkaissaid tobeascalarmultiple ofa.
7.2 Vectors and scalars: basic concepts 231
A
a 1 –2 b
2a
Figure7.16
Scalar multiplication ofavector.
b
C
b
M
Figure7.17
Thetriangle ABC,with midpointM
ofside AB.
a
B
→
Example7.5 In a triangle ABC, M is the midpoint of AB. Let AB be denoted by a, and BC → by b.
ExpressAC,
→ →
CA andCMinterms →
ofaandb.
Solution The situationissketched inFigure 7.17. Usingthe triangle ruleforaddition wefind
→
AB + → BC = → AC
Therefore,
→
AC=a+b
Itfollows that → CA=− → AC = −(a +b).
Again by the triangle ruleapplied totriangle CMB we find
→
CM = CB → + BM
→
→
NowBM →
= 1 BA=− 1 aand so
2 2
(
→
CM=−b+ − 1 )
2 a
=−
(b+ 1 )
2 a
7.2.6 Unitvectors
Vectors which have length 1 are called unit vectors. If a has length 3, for example,
then a unit vector in the direction of a is clearly 1 a. More generally we denote the unit
3
vectorinthedirectionabyâ.Recallthatthelengthormodulusofais |a|andsowecan
write
â = a
|a|
Note that |a|and hence 1
|a| arescalars.
232 Chapter 7 Vectors
7.2.7 Orthogonalvectors
Iftheanglebetweentwovectorsaandbis90 ◦ ,thatisaandbareperpendicular,thena
and baresaidtobeorthogonal.
EXERCISES7.2
1 ForthearbitrarypointsA,B,C,DandE,findasingle
vector which isequivalent to
→
(a) DC + CB
→ →
(b) CE + DC
→
2 Figure7.18 shows acube. Letp = → AB,q= → AD and
r = → AE.Expressthe vectors representing → BD, → ACand
→
AGin terms of p,qand r.
F
G
3 Inatriangle ABC,Mis the midpoint ofBC,andNis
→
the midpoint ofAC.Show thatNM = 1 →
2AB.
4 Considerarectangle with verticesat E,F, GandH.
SupposeEF=pand → FG → = q.Expresseachofthe
vectorsEH,
→ → →
GH, FH andGE → in terms of p andq.
5 Ifaisan arbitraryvector, represent on adiagramthe
vectorsa, 1 2 a, −a ,3a, −3aand â.
4
B
p
E
r
A q
Figure7.18
C
D
H
6 Aparticleis positionedat the origin.Two forcesact
onthe particle.Thefirstforce has magnitude 7 N
andactsin the negativexdirection.The secondforce
has magnitude 12 Nandactsin theydirection.
Calculate the magnitude anddirectionofthe resultant
force.
7 Aforce of15Nacts atanangleof65 ◦ to thexaxis.
Resolvethisforce intotwo forces, onedirected along
thexaxisandthe other directed along the
yaxis.
Solutions
1 (a)
→
DB
(b)
→
DE
a
2 q−p,q+p,q+r+p
4 q,−p,q−p,−p−q
5 SeeFigureS.16.
6 13.9 Natan angleof59.7 ◦ to the negativexaxis
7 6.34 N, 13.59N
1– a
2
– –– a
4
FigureS.16
3a
–3a
7.3 CARTESIANCOMPONENTS
Consider thex--y plane in Figure 7.19. The general point P has coordinates (x,y). We
can join the origin to P by a vectorOP, → which is called the position vector of P, which
weoftendenotebyr.Themodulusofris |r| =r,andisthelengthofOP.Itispossible
→
y
j
O
i
r
Figure7.19
Thex--y plane with
point P.
P(x, y)
M
x
7.3 Cartesian components 233
to express r in terms of the numbers ofxandy. If we denote a unit vector along thex
axis by i, and a unit vector along theyaxis by j (we usually omit the ˆ here), then it is
clearfromthedefinitionofscalarmultiplicationthatOM =xi,andMP → =yj.Itfollows
fromthe triangle law of addition that
r = → OP = →
OM + → MP=xi+yj
Clearly the vectors i and j are orthogonal. The numbersxandyare the i and j componentsof
r. Furthermore, using Pythagoras’s theorem we can deduce that
r = √ x 2 +y 2
Alternative notations which aresometimes usefulare
(
r = OP → x
=
y)
→
and
r = → OP = (x,y)
( x
When written intheseforms iscalled a columnvector and (x,y) iscalled arow
y)
vector.Toavoidconfusionwiththecoordinates (x,y)weshallnotuserowvectorshere
but they will be needed in Chapter 26. We will also use the column vector notation for
moregeneralvectors, thus,
( a
ai+bj=
b)
Wesaidearlierthatavectorcanbetranslated,maintainingitslengthanddirectionwithoutchanging
the vector itself.While this is true generally, position vectors form animportant
exception. Position vectors are constrained to their specific position and must
always remaintiedtothe origin.
Example7.6 IfAisthepointwithcoordinates(5,4)andBisthepointwithcoordinates (−3,2)find
the position vectors ofAand B, and the vector → AB. Further, find | → AB|.
( 5
Solution The position vector of A is 5i +4j = , which we shall denote by a. The position
( ) 4)
−3
vector of B is −3i + 2j = , which we shall denote by b. Application of the
2
trianglelaw totriangleOAB (Figure 7.20)gives
thatis
→
OA + → AB = → OB
a + → AB=b
234 Chapter 7 Vectors
y
5
B(–3, 2)
4
3
2
b
1
–6 –5 –4 –3 –2 –1 0
a
1 2 3 4
A(5, 4)
5 6 x
Figure7.20
Points Aand Bin thex--yplane.
Therefore,
→
AB=b−a
= (−3i +2j) − (5i +4j)
=−8i−2j
Alternatively, intermsof column vectors
b−a=
=
( ) ( −3 5
−
2 4)
( ) −8
−2
Wenotethatsubtraction(andlikewiseaddition)ofcolumnvectorsiscarriedoutcomponentbycomponent.Tofind
| → AB|wemustobtainthelengthofthevector → AB.Referring
toFigure7.20,wenotethatthisquantityisthelengthofthehypotenuseofaright-angled
triangle with perpendicular sides 8 and 2.Thatis, | → AB| = √ 8 2 +2 2 = √ 68 = 8.25.
More generallywehave the following result:
Given vectors a = → OA=a 1
i+a 2
jandb= → OB =b 1
i +b 2
j (Figure7.21), then
and
→
AB=b−a=(b 1
−a 1
)i+(b 2
−a 2
)j
| AB|=|b−a|=|(b → 1
−a 1
)i+(b 2
−a 2
)j|
√
= (b 1
−a 1
) 2 +(b 2
−a 2
) 2
Example7.7 Ifa =
( ( ) 7 −2
andb= ,
3)
5
(a) find a +b, a −b, b +aandb −a, commentingupon the results
(b) find 2a −3b
(c) find |a −b|.
7.3 Cartesian components 235
Solution (a)
a 2
O
b
a
a 1
A
b – a
b 1
Figure7.21
Thequantity √
|b−a|= (b 1 −a 1 ) 2 + (b 2 −a 2 ) 2 .
a+b=
a−b=
b+a=
b−a=
( ( ) ( 7 −2 5
+ =
3)
5 8)
x
( ( 7 −2
− =
3)
5)
( ( −2 7
+ =
5)
3)
( ) 9
−2
( 5
8)
( ( ( −2 7 −9
− =
5)
3)
2)
A
x
z
O
{
P}
z
r
B
}
y
Figure7.22
Thequantity |r| = √ x 2 +y 2 +z 2 .
x
y
y
b 2
B
We note thataddition is commutative whereas subtraction isnot.
( ( ( ( ) ( )
7 −2 14 −6 20
(b) 2a−3b=2 −3 = − =
3)
5)
6)
15 −9
(c) |a−b|=|9i−2j|= √ 9 2 + (−2) 2 = √ 85
The previous development readily generalizes to the three-dimensional case. Taking
Cartesian axes x, y and z, any point in three-dimensional space can be represented by
givingx,yandzcoordinates(Figure7.22).Denotingunitvectorsalongtheseaxesbyi,
jandk, respectively, wecan write the vectorfrom OtoP(x,y,z) as
→
OP=r=xi+yj+zk=
⎛ ⎞
x
⎝y⎠
z
Thevectors i,jand kareorthogonal.
Example7.8 Ifr =xi +yj +zk show thatthe modulus ofrisr = √ x 2 +y 2 +z 2 .
Solution Recalling Figure 7.22 we first calculate the length of OB. Now OAB is a right-angled
triangle with perpendicular sides OA = x and AB = y. Therefore by Pythagoras’s
236 Chapter 7 Vectors
theoremOBhaslength √ x 2 +y 2 .Then,applyingPythagoras’stheoremtoright-angled
triangle OBP which has perpendicular sides OBand BP =z, we find
|r|=OP=
as required.
√
OB 2 +BP 2
= √ (x 2 +y 2 )+z 2
= √ x 2 +y 2 +z 2
Ifr=xi+yj+zk then |r|= √ x 2 +y 2 +z 2
Inthreedimensions we have the following general result:
Given vectors a = → OA=a 1
i+a 2
j+a 3
kandb= → OB =b 1
i+b 2
j+b 3
k,then
and
→
AB=b−a=(b 1
−a 1
)i+(b 2
−a 2
)j+(b 3
−a 3
)k
| AB|=|b−a|=|(b → 1
−a 1
)i+(b 2
−a 2
)j+(b 3
−a 3
)k|
√
= (b 1
−a 1
) 2 +(b 2
−a 2
) 2 +(b 3
−a 3
) 2
Example7.9 Ifa = 3i −2j +k andb = −2i +j−5k, find
(a) |a| (b) â (c) |b| (d) ˆb (e)b−a (f)|b−a|
Solution (a) |a| = √ 3 2 + (−2) 2 +1 2 = √ 14
(b) â = a
|a| = 1 √
14
(3i−2j+k)= 3 √
14
i − 2 √
14
j + 1 √
14
k
(c) |b| = √ (−2) 2 +1 2 + (−5) 2 = √ 30
(d) ˆb = b
|b| = √ 1 (−2i+j−5k)= √ −2 i + √ 1 j − √ 5 k
30 30 30 30
(e) b−a=−5i+3j−6k
(f) |b−a|= √ (−5) 2 +3 2 + (−6) 2 = √ 70
7.3.1 Thezerovector
A vector, all the components of which are zero, is called a zero vector and is denoted
by 0 to distinguish it from the scalar 0. Clearly the zero vector has a length of 0; it is
unusualinthatithas arbitrarydirection.
7.3 Cartesian components 237
Engineeringapplication7.4
Robotpositions
Positionvectorsprovideausefulmeansofdeterminingthepositionofarobot.There
aremanydifferenttypesofrobotbutacommontypeusesaseriesofrigidlinksconnected
together by flexible joints. Usually the mechanism is anchored at one point.
Atypical example isillustratedinFigure 7.23.
b
c
d
Y
a
p
X
Figure7.23
Atypical robot configuration with vectors representing
the robot links.
TheanchorpointisXandthetipoftherobotissituatedatpointY.Thefinallink
is sometimes called the hand of the robot. The hand often has rotating and gripping
facilitiesanditssizerelativetotherestoftherobotisusuallyquitesmall.Eachofthe
robotlinkscanberepresentedbyavector(seeFigure7.23).Thevectordcorresponds
tothehand.Acommonrequirementinroboticsistobeabletocalculatetheposition
of the tip of the hand to ensure it does not collide with other objects. This can be
achieved by defining a set of Cartesian coordinates with origin at the anchor point
of the robot, X. Each of the link vectors can then be represented in terms of these
coordinates. For example, inthe case of the robot inFigure 7.23:
a=a 1
i+a 2
j+a 3
k
b=b 1
i+b 2
j+b 3
k
c=c 1
i+c 2
j+c 3
k
d=d 1
i+d 2
j+d 3
k
Thepositionofthetipofthehandcanbecalculatedbyaddingtogetherthesevectors.
So,
p=a+b+c+d
=(a 1
+b 1
+c 1
+d 1
)i+(a 2
+b 2
+c 2
+d 2
)j+(a 3
+b 3
+c 3
+d 3
)k
7.3.2 Linearcombinations,dependenceandindependence
Supposewehavetwovectorsaandb.Ifweformarbitraryscalarmultiplesofthese,that
isk 1
a andk 2
b, and add these together, we obtain a new vector c where c = k 1
a+k 2
b.
The vector c is said to be a linear combination of a and b. Note that scalar multiplication
and addition of vectors are the only operations allowed when forming a linear
combination. Vector c is said to depend linearly on a and b. Of course we could also
write
a = 1 k 1
c − k 2
k 1
b providedk 1 ≠ 0
238 Chapter 7 Vectors
so thatadepends linearly oncandb. Providedk 2 ≠ 0,then
b = 1 k 2
c − k 1
k 2
a
so that b depends linearly on c and a. The set of vectors {a,b,c} is said to be linearly
dependent and any one of the set can be written as a linear combination of the other
two.Ingeneral, we have the following definition:
Asetofnvectors {a 1
,a 2
,...,a n
} islinearlydependentif the expression
k 1
a 1
+k 2
a 2
+···+k n
a n
=0
canbesatisfiedbyfindingscalarconstantsk 1
,k 2
,...,k n
,notallofwhicharezero.
If the only way we can make the combination zero is by choosing all thek i
s to be
zero,thenthe given setofvectors issaidtobe linearly independent.
Example7.10 Show thatthe vectors iandjarelinearly independent.
Solution Weformtheexpressionk 1
i +k 2
j = 0andtrytochoosek 1
andk 2
sothattheequationis
satisfied. Using columnvectors wehave
( ) ( ) ( 1 0 0
k 1
+k
0 2
=
1 0)
that is (k10 ) ( ) ( ) ( 0 k1 0
+ = =
k 2
k 2
0)
The only way we can satisfy the equation is by choosingk 1
= 0 andk 2
= 0 and hence
we conclude that the vectors i and j are linearly independent. Geometrically, we note
that sincethey are perpendicular, no scalar multiple ofican give j and viceversa.
Example7.11 Thevectors
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1 5 13
⎝2⎠
⎝1⎠
⎝−1⎠
3 9 21
are linearlydependentbecause, forexample
⎛ ⎞ ⎛ ⎞ ⎛ ⎞ ⎛ ⎞
5 1 13 0
3⎝1⎠ −2⎝2⎠ −1⎝−1⎠ = ⎝0⎠
9 3 21 0
EXERCISES7.3
1 PandQliein thex--y plane. Find → PQ,wherePis the
point with coordinates (5,1) andQisthe point with
coordinates (−1,4).Find | → PQ|.
2 AandBlie in thex--yplane. If Ais the point (3,4)
and Bis the point (1,−5) write down the vectors
→
OA, → OB and → AB.Find aunitvector in the direction
of → AB.
7.3 Cartesian components 239
3 Ifa=4i−j+3kandb=−2i+2j−k,findunit
vectors in the directions ofa,b andb −a.
4 Ifa = 5i −2j,b = 3i −7jandc = −3i +4j,express,
in termsof iand j,
a+b a+c c−b 3c−4b
Drawdiagrams to illustrate your results.Repeat the
calculations usingcolumn vectornotation.
5 Writedown aunit vectorwhich is parallel to the line
y=7x−3.
6 Find → PQwhere Pisthe point in three-dimensional
space with coordinates (4,1,3) andQisthe point
with coordinates (1,2,4).Find the distancebetween
PandQ. Further,findthe position vectorofthe point
dividing PQ in the ratio 1:3.
7 If P,QandRhave coordinates (3,2,1), (2,1,2) and
(1,3,3),respectively, use vectors to determine which
pair ofpointsare closestto each other.
8 Considerthe robot ofExample7.4. Thelinkvectors
have the following values:
a =12i+18j+k
b=6i−3j+8k
c =3i+2j−4k
d = 0.5i −0.2j +0.6k
Calculate the length ofeach ofthe links and the
position vector ofthe tip ofthe robot.
9 Show that the vectorsa = i +j andb = −i +jare
linearly independent.
10 Show that the vectorsi,j and kare linearly
independent.
Solutions
1
→
PQ=−6i+3j | → PQ| = 6.71
2 3i+4j,i−5j,−2i−9j
−1
unitvector: √ (2i +9j)
85
3
1
√ (4i −j +3k), 1 (−2i +2j −k),
26 3
1
√
61
(−6i +3j −4k)
4 8i −9j,2i +2j, −6i +11j,−21i +40j
5
6
1
√
50
(i +7j)
→
PQ = −3i +j+k,distancefrom Pto Q = 3.32,
1
(13i +5j +13k)
4
7 PandQ
8 21.66,10.44,5.39,0.81,21.5i +16.8j +5.6k
TechnicalComputingExercises7.3
To plotadisplacement vectorbetween points(1,2)and
(3,4)in MATLAB ® wecouldtype:
X=[1 3];
Y=[2 4];
plot(X,Y)
whichwould resultin aplotcontaining asingle line
between the two points.Notice the order in whichthe
coordinatesare passed to the plotfunction. Thefirst
entry in Xandthe firstentry in Yare forthe first
location andsoon. Ifwewantedto plotasecond
displacement vector from (3,4)to (5,6)wewould
type:
X=[1 3 5];
Y=[2 4 6];
plot(X,Y)
Now consider the problemofroutingan automated
vehicle between the following positionson afactory
floor:
Doorway: A =(0,0)
Resistorbin: B =(1,1)
Capacitorbin: C = (1,-0.5)
PCBrack: D =(2,0)
Assemblyline: E =(2.5, 0)
Forsimplicityyou mayassume that the robotwill
notcollidewith any objects ifittravels directly
betweenany ofthose two points.
1 Plotthe displacement vectors forthe robotifit
enters through the doorway andvisits the resistor
bin,the PCBrackandthenthe assemblyline. How
would thisactionbe writtenin vectornotation?
240 Chapter 7 Vectors
2 Calculate the distance travelled bythe robot.
3 Now consider the problem ofvisitingallofthe
positionsA, B,C,Dand Eonetime only, with the
robot travelling the least distance. The robot must
enter through the doorway and finishat the
assembly line. Bytrialand errorproduce a solution
to the problem, usingthe computer to plot each
trial. Expressyourfinal calculated solution in
vector notation.
7.4 SCALARFIELDSANDVECTORFIELDS
Imaginealargeroomfilledwithair.Atanypoint,P,wecanmeasurethetemperature, φ,
say. The temperature will depend upon whereabouts in the room we take the measurement.
Perhaps, close to a radiator the temperature will be higher than near to an open
window. Clearly the temperature φ isafunctionofthe position ofthe point.Ifwelabel
the point by its Cartesian coordinates (x,y,z), then φ will be a function of x,y and z,
that is
φ = φ(x,y,z)
Additionally, φ may be a function of time but for now we will leave this additional
complication aside. Since temperature is a scalar what we have done is define a scalar
ateach pointP(x,y,z) inaregion. This isanexample of ascalar field.
Alternatively,supposeweconsiderthemotionofalargebodyoffluid.Ateachpoint,
fluid will be moving with a certain speed in a certain direction; that is, each small fluid
elementhasaparticularvelocity,v,dependinguponwhereaboutsinthefluiditis.Since
velocity is a vector, what we have done is define a vector at each pointP(x,y,z). We
now have a vector function ofx,y andz, known as avector field. Let us write
v = (v x
,v y
,v z
)
so that v x
,v y
and v z
arethei,j and kcomponents respectively ofv, thatis
v=v x
i+v y
j+v z
k
We note that v x
, v y
and v z
will each bescalar functions ofx,y andz.
Engineeringapplication7.5
ElectricfieldstrengthEandelectricdisplacementD
Electrostaticsisthestudyoftheforceswhichstationarypositiveandnegativeelectric
charges exert upon one another. Consider Figure 7.24 which shows a single positive
charge placed at O. The presence of this charge gives rise to an electric force field
around it. Faraday introduced the idea of lines of force to help visualize the field.
At any point, P, there exists a vector which gives the direction and magnitude of
the electrostatic force at P. Because all the lines of force emerge radially from O,
the direction of the electrostatic force is radially outwards. It can be shown that the
magnitudeoftheforceisinverselyproportionaltothesquareofthedistancefromO.
7.5 The scalar product 241
O
E
Figure7.24
Acharge atOgives riseto an electric fieldE.
Ifasecondchargeisplacedinthisfielditexperiencesaforce.Animportantquantity
is the electric field strength, E. This is a vector field which describes the force
experienced by a unit charge.
A related quantity is the electric displacement, D, also called the electric flux
density, defined as D = εE, where ε is called the permittivity of the medium in
which the field islocated. Note thatDisascalar multiple ofE.
Engineeringapplication7.6
ElectrostaticpotentialV
An important electrostatic field is the electrostatic potentialV. This is an example
of a scalar field. The difference between the potential measured at any two points in
thefieldisequaltotheworkwhichneedstobedonetomoveaunitchargefromone
point to the other. Later, in Chapter 26, we will see that the scalar fieldV is closely
relatedtothe vectorfield E.
7.5 THESCALARPRODUCT
u
b
a
Figure7.25
Two vectorsaandb
separated by
angle θ.
Givenanytwovectorsaandb,therearetwowaysinwhichwecandefinetheirproduct.
Theseareknownasthescalarproductandthevectorproduct.Asthenamessuggest,the
resultoffindingascalarproductisascalarwhereastheresultoffindingavectorproduct
isavector. The scalar product of aandbiswritten as
a · b
Thisnotationgivesrisetothealternativenamedotproduct.Itisdefinedbytheformula
a·b=|a‖b|cosθ
where θ isthe angle between the two vectors asshown inFigure 7.25.
242 Chapter 7 Vectors
From the definition of the scalar product, it is possible to show that the following
rules hold:
a·b = b·a the scalar product iscommutative
k(a·b) = (ka·b) wherekisascalar
(a +b)·c = (a·c) + (b·c) thedistributiverule
Itisimportantatthisstagetorealizethatnotationisveryimportantinvectorwork.You
should not use a× to denote the scalar product because this is the symbol we shall use
for the vector product.
Example7.12 If a and b are parallel vectors, show that a·b = |a‖b|. If a and b are orthogonal show
that theirscalar product iszero.
Solution If a and b are parallel then the angle between them is zero. Therefore a·b =
|a‖b|cos0 ◦ = |a||b|. If a and b are orthogonal, then the angle between them is 90 ◦
and a·b = |a||b|cos90 ◦ = 0.
Similarly we can show that if a and b are two non-zero vectors for which a·b = 0,
thenaandbmustbeorthogonal.
If a andbareparallel vectors, a·b = |a‖b|.
If a andbareorthogonal vectors,a·b = 0.
Animmediateconsequenceofthepreviousresultisthefollowingusefulsetofformulae:
i·i=j·j=k·k=1
i·j=j·k=k·i=0
Example7.13 Ifa =a 1
i+a 2
j+a 3
kandb =b 1
i+b 2
j+b 3
kshowthata·b =a 1
b 1
+a 2
b 2
+a 3
b 3
.
Solution We have
a·b = (a 1
i+a 2
j+a 3
k)·(b 1
i+b 2
j+b 3
k)
=a 1
i·(b 1
i+b 2
j+b 3
k)+a 2
j·(b 1
i+b 2
j+b 3
k)
+a 3
k·(b 1
i+b 2
j+b 3
k)
=a 1
b 1
i·i+a 1
b 2
i·j+a 1
b 3
i·k+a 2
b 1
j·i+a 2
b 2
j·j+a 2
b 3
j·k
+a 3
b 1
k·i+a 3
b 2
k·j+a 3
b 3
k·k
=a 1
b 1
+a 2
b 2
+a 3
b 3
as required. Thus, given two vectors in component form their scalar product is the sum
of the products of corresponding components.
The resultdeveloped inExample 7.13 isimportant and should bememorized:
Ifa =a 1
i+a 2
j+a 3
kandb =b 1
i+b 2
j+b 3
k,
thena·b =a 1
b 1
+a 2
b 2
+a 3
b 3
.
7.5 The scalar product 243
Example7.14 Ifa = 5i −3j +2kand b = −2i +4j +kfind the scalar producta·b.
Solution Usingthe previous resultwe find
a·b=(5i−3j+2k)·(−2i+4j+k)
= (5)(−2) + (−3)(4) + (2)(1)
=−10−12+2
= −20
Example7.15 Ifa =a 1
i +a 2
j +a 3
k, find
(a) a·a (b) |a| 2
Solution (a) Using the previous resultwefind
a·a = (a 1
i+a 2
j+a 3
k)·(a 1
i+a 2
j+a 3
k)
=a 2 1 +a2 2 +a2 3
(b) From Example 7.8 we know that the modulus ofr =xi +yj +zk is √ x 2 +y 2 +z 2
and therefore
√
|a| = a 2 1 +a2 2 +a2 3
so that
|a| 2 =a 2 1 +a2 2 +a2 3
We notethe generalresultthat
a·a=|a| 2
Example7.16 Ifa = 3i +j−k andb = 2i +j+2k finda·band the angle betweenaandb.
Solution We have
a·b = (3)(2) + (1)(1) + (−1)(2)
=6+1−2
= 5
Furthermore, from the definition ofthe scalar producta·b = |a||b|cos θ. Now,
|a| = √ 9+1+1= √ 11 and |b| = √ 4+1+4=3
Therefore,
cosθ = a·b
|a‖b| = 5
3 √ 11
fromwhich wededuce that θ = 59.8 ◦ or 1.04 radians.
244 Chapter 7 Vectors
Engineeringapplication7.7
Workdonebyaforce
Ifaforceisappliedtoanobjectandtheobjectmoves,thenworkisdonebytheforce.
It is possible to use vectors to calculate the work done. Suppose a constant force F
is applied and as a consequence the object moves from A to B, represented by the
displacement vectors, asshown inFigure 7.26.
Wecanresolvetheforceintotwoperpendicularcomponents,oneparalleltosand
oneperpendiculartos.Theworkdonebyeachcomponentisequaltotheproductof
itsmagnitudeandthedistancemovedinitsdirection.Thecomponentperpendicular
to s will not do any work because there is no movement in this direction. For the
component alongs, that is |F|cos θ, we find
work done = |F|cos θ|s|
From the definition of the scalar product we see that the r.h.s. of this expression is
the scalar product ofFands.
F cos u
F
F sin u
u
A
AB = s
B
Figure7.26
ThecomponentofFinthedirectionofs
isFcosθ.
The work done by a constant force F which moves an object along the vector s is
equal tothe scalar productF·s.
Example7.17 A force F = 3i + 2j − k is applied to an object which moves through a displacement
s = 2i +2j +k. Find the work done by the force.
Solution The work done isequal to
F·s=(3i+2j−k)·(2i+2j+k)
=6+4−1
= 9 joules
Engineeringapplication7.8
Movementofachargedparticleinanelectricfield
Figure7.27showstwochargedplatessituatedinavacuum.PlateAhasanexcessof
positivecharge,whileplateBhasanexcessofnegativecharge.Suchanarrangement
gives risetoan electric field.An electricfield isan example of avector field.
–
–
–
–
–
–
–
–
–
–
7.5 The scalar product 245
A
+
+
+
+
+
+
+
+
+
+
X
E
ds
E
E
B
Figure7.27
Two charged platessituated in avacuum.
InFigure7.27theelectricfieldvectorEintheregionofspacebetweenthecharged
plateshasadirectionperpendiculartotheplatespointingfromAtoB.Themagnitude
of the electric field vector is constant in this region if end effects are ignored. If
a charged particle is required to move against an electric field, then work must be
donetoachievethis.Forexample,totransportapositivelychargedparticlefromthe
surfaceofplateBtothesurfaceofplateAwouldrequireworktobedone.Thiswould
lead toan increase inpotential of the charged particle.
If s represents the displacement andV the potential it is conventional to write δs
to represent a very small change in displacement, and δV to represent a very small
change inpotential.
Ifaunitpositivechargeismovedasmalldisplacement δsinanelectricfield(Figure
7.27) then the change inpotential δV isgiven by
δV =−E·δs (7.1)
This is an example of the use of a scalar product. Notice that although E and δs are
vector quantities the change inpotential, δV, isascalar.
ConsideragainthechargedplatesofFigure7.27.Ifaunitchargeismovedasmall
displacement along the plane X,then δs isperpendicular toE. So,
δV = −E·δs = −|E‖δs|cosθ
With θ = 90 ◦ ,wefind δV = 0.ThesurfaceXisknownasanequipotentialsurface
because movement of a charged particle along this surface does not give rise to a
change in its potential. Movement of a charge in a direction parallel to the electric
field gives risetothe maximum change inpotential, as forthis case θ = 0 ◦ .
EXERCISES7.5
1 Ifa=3i−7jandb=2i+4jfinda·b,b·a,a·a
andb·b.
2 Ifa=4i+2j−k,b=3i−3j+3kand
c=2i−j−k,find
(a) a·a (b) a·b
(c) a·c (d) b·c
3 Evaluate (−13i −5j) · (−3i +4j).
4 Find the angle between the vectorsp = 7i +3j +2k
andq=2i−j+k.
5 Find the angle between the vectors 7i +j and 4j −k.
6 Findtheanglebetweenthevectors4i −2jand3i −3j.
7 Ifa=7i+8jandb=5ifinda·ˆb.
⎛ ⎞ ⎛ ⎞
3 5
8 Ifr 1 = ⎝1⎠and r 2 = ⎝1⎠ findr 1 ·r 1 ,r 1 ·r 2
2 0
andr 2 ·r 2 .
9 Given thatp = 2qsimplify p ·q, (p +5q) ·q and
(q−p)·p.
246 Chapter 7 Vectors
10 Find the modulusofa = i −j−k,the modulusof
b = 2i +j+2kand the scalar producta ·b.Deduce
the angle betweenaand b.
11 Ifa=2i+2j−kandb=3i−6j+2k,find
|a|, |b|,a·b and the angle betweenaand b.
12 Useavector methodto showthat the diagonals ofthe
rhombusshown in Figure7.28 intersect at90 ◦ .
A
B
D
Figure7.28
TherhombusABCD.
C
13 Usethe scalar product to findatwo-dimensional
vector a =a 1 i +a 2 j perpendicular to the vector
4i −2j.
14 Ifa=3i−2j,b=7i+5jandc=9i−j,showthat
a·(b−c)=(a·b)−(a·c).
15 Findthe work done bythe forceF = 3i −j+k
in moving an object through adisplacement
s=3i+5j.
16 Aforce ofmagnitude 14Nactsin the direction
i +j+k upon an object. Itcauses the object to move
from point A(2,1,0)to point B(3,3,3).Find the
work done bythe force.
17 (a) Usethe scalar product to showthat the
component ofain the direction ofbisa · ˆb,
where ˆb isaunitvector in the directionofb.
(b) Findthe component of2i +3j in the directionof
i+5j.
Solutions
1 −22, −22,58,20
2 (a) 21 (b) 3 (c) 7 (d) 6
3 19
4 47.62 ◦
5 82.11 ◦
6 18.4 ◦
7 7
8 14,16, 26
9 2|q| 2 ,7|q| 2 , −2|q| 2
10 √ 3,3, −1,101.1 ◦
11 3,7, −8,112.4 ◦
13 c(i +2j),cconstant
15 4J
16 48.5 J
17 17/ √ 26
7.6 THEVECTORPRODUCT
The result of finding the vector product of a and b is a vector of length |a||b|sinθ,
where θ is the angle between a and b. The direction of this vector is such that it is
perpendicular to a and to b, and so it is perpendicular to the plane containing a and
b (Figure 7.29). There are, however, two possible directions for this vector, but it is
conventionaltochoosetheoneassociatedwiththeapplicationoftheright-handedscrew
rule. Imagine turning a right-handed screw in the sense from a towards b as shown. A
right-handedscrewisonewhich,whenturnedclockwise,entersthematerialintowhich
it is being screwed. The direction in which the screw advances is the direction of the
required vector product. The symbol we shall use to denote the vector product is ×.
Formally, wewrite
a×b=|a‖b|sinθê
7.6 The vector product 247
a 3 b
u
b
Plane containing
a and b
a
b
Figure7.29
a ×b is perpendicular to the plane containing
aand b.Theright-handed screw ruleallows
the direction ofa ×b to be found.
a
Figure7.30
Right-handedscrew ruleallowsthe
directionofb ×ato be found.
where ê is the unit vector required to define the appropriate direction, that is ê is a unit
vectorperpendicular toaandtobinasensedefined bytheright-handedscrewrule.To
evaluate b ×a we must imagine turning the screw from the direction of b towards that
of a. The screwwill advance as shown inFigure 7.30.
We notice immediately that a ×b ≠ b ×a since their directions are different. From
the definition of the vector product, itispossible toshow thatthe following rules hold:
a ×b = −(b ×a)
a × (b +c) = (a ×b) + (a ×c)
k(a ×b) = (ka) ×b =a× (kb)
the vector product is notcommutative
thedistributiverule
wherekisascalar
Example7.18 Ifaandbare parallel show thata ×b = 0.
Solution If a and b are parallel then the angle between a and b is zero. Therefore, a × b =
|a‖b|sin0ê= 0.Notethattheanswerisstillavector,andthatwedenotethezerovector
0i +0j +0k by0, todistinguish itfromthe scalar 0.Inparticular, wenote that
i×i=j×j=k×k=0
Ifaandbareparallel,thena ×b = 0.
Inparticular:
i×i=j×j=k×k=0
Example7.19 Show thati ×j = kand find expressions forj ×kandk ×i.
Solution We note that i and j are perpendicular so that |i ×j| = |i‖j|sin90 ◦ = 1. Furthermore,
theunitvectorperpendiculartoiandtojinthesensedefinedbytheright-handedscrew
rule is k. Therefore, i ×j = k as required. Similarly you should be able to show that
j×k=iandk×i=j.
i×j=k j×k=i k×i=j
j×i=−k k×j=−i i×k=−j
248 Chapter 7 Vectors
Example7.20 Simplify (a ×b) − (b ×a).
Solution Usethe resulta ×b = −(b ×a) toobtain
(a×b)−(b×a)=(a×b)−(−(a×b))
=(a×b)+(a×b)
=2(a×b)
Example7.21 (a) Ifa =a 1
i +a 2
j +a 3
kandb =b 1
i +b 2
j +b 3
k, show that
a×b = (a 2
b 3
−a 3
b 2
)i− (a 1
b 3
−a 3
b 1
)j+ (a 1
b 2
−a 2
b 1
)k
(b) Ifa=2i+j+3kandb=3i+2j+kfinda×b.
Solution (a) a×b=(a 1
i+a 2
j+a 3
k)× (b 1
i+b 2
j+b 3
k)
=a 1
i× (b 1
i+b 2
j+b 3
k)+a 2
j× (b 1
i+b 2
j+b 3
k)
+a 3
k× (b 1
i+b 2
j+b 3
k)
=a 1
b 1
(i×i)+a 1
b 2
(i×j)+a 1
b 3
(i×k)+a 2
b 1
(j×i)+a 2
b 2
(j×j)
+a 2
b 3
(j×k)+a 3
b 1
(k×i)+a 3
b 2
(k×j)+a 3
b 3
(k×k)
Using the results of Examples 7.18 and 7.19, we find that the expression for a ×b
simplifies to
a×b = (a 2
b 3
−a 3
b 2
)i− (a 1
b 3
−a 3
b 1
)j+ (a 1
b 2
−a 2
b 1
)k
(b) Usingthe resultof part(a) we have
a ×b = ((1)(1) − (3)(2))i − ((2)(1) − (3)(3))j + ((2)(2) − (1)(3))k
=−5i+7j+k
Verifyforyourself thatb ×a = 5i −7j −k.
7.6.1 Usingdeterminantstoevaluatevectorproducts
Evaluationofavectorproductusingthepreviousformulaisverycumbersome.Amore
convenient and easily remembered method is now described. The vectors a and b are
written inthe following pattern:
∣ i j k ∣∣∣∣∣
a 1
a 2
a 3
∣b 1
b 2
b 3
This quantity is called a determinant. A more thorough treatment of determinants is
given in Section 8.7. To find the i component of the vector product, imagine crossing
out the row and column containing i and performing the following calculation on what
isleft,thatis
a 2
b 3
−a 3
b 2
Theresultingnumberistheicomponentofthevectorproduct.Thejcomponentisfound
by crossing out the row and column containing j, performing a similar calculation, but
now changing the sign of the result.Thus thejcomponent equals
−(a 1
b 3
−a 3
b 1
)
7.6 The vector product 249
The k component is found by crossing out the row and column containing k and performingthe
calculation
We have
a 1
b 2
−a 2
b 1
Ifa =a 1
i+a 2
j+a 3
kandb =b 1
i+b 2
j+b 3
k,then
∣ i j k ∣∣∣∣∣
a×b=
a 1
a 2
a 3
∣b 1
b 2
b 3
= (a 2
b 3
−a 3
b 2
)i− (a 1
b 3
−a 3
b 1
)j+ (a 1
b 2
−a 2
b 1
)k
Example7.22 Find the vector product ofa = 2i +3j +7k andb = i +2j +k.
Solution The two given vectors arerepresented inthe following determinant:
i jk
237
∣121∣
Evaluating this determinant we obtain
a×b=(3−14)i−(2−7)j+(4−3)k=−11i+5j+k
Youwillfindthat,withpractice,thismethodofevaluatingavectorproductissimpleto
apply.
7.6.2 Applicationsofthevectorproduct
The following threeexamples illustrateapplications of the vector product.
Engineeringapplication7.9
MagneticfluxdensityBandmagneticfieldstrengthH
Itispossibletomodeltheeffectofmagnetismbymeansofavectorfield.Amagnetic
fieldwithmagneticfluxdensityBisavectorfieldwhichisdefinedinrelationtothe
forceitexertsonamovingchargedparticleplacedinthefield.ConsiderFigure7.31.
If a charge q moves with velocity v in a magnetic field B it experiences a force F
given by
F=qv×B
Notethatthisforceisdefinedusingavectorproduct.Theunitofmagneticfluxdensityistheweberpersquaremetre(Wbm
−2 ),ortesla(T).Thedirectionofthisforce
is at right angles to both v and B, in a sense defined by the right-handed screw rule.
Its magnitude, or modulus, is
F =qvBsinθ
➔
250 Chapter 7 Vectors
F
q
B
v
Figure7.31
Force,F,exerted on aparticle with charge,q,when moving with
velocity,v,in a magnetic field,B.
where θ is the angle between v and B, v is the modulus of v andBis the modulus
of B.
Note thatif θ = 90 ◦ , sin θ = 1,thenB = F qv .
Theseformulaeareusefulbecausetheycanbeusedtocalculatetheforcesexerted
on a conductor in an electric motor. They are also used to analyse electricity generators
in which the motion of a conductor in a magnetic field leads to movement of
charges within the conductor, thus generating electricity.
Arelatedquantityisthemagneticfieldstrength,orthemagneticfieldintensity,
H, defined from
B=µH
µ is called the permeability of a material and has units of webers per ampere per
metre (Wb A −1 m −1 ). The units of H are amperes per metre (A m −1 ). Confirm for
yourself thatthe units match on both sides of the equation.
Engineeringapplication7.10
Magneticfieldduetoamovingcharge
A chargeqmoving with velocity v gives rise to a magnetic field with magnetic flux
densityBinitsvicinity.Asaresultofthis,anothermovingchargeplacedinthisfield
will experienceamagnetic force.Themagnetic fluxdensityis given by
B = qµ 0
4πr 2 (v׈r)
where r is a vector from the charge to the point at which B is measured, and µ 0
is a
constant called the permeability of free space.
Thisequationcanbeusedtofindthemagneticfieldduetoacurrentinawire.Supposeasmallportionofwirehaslength
δsandcontainsacurrentI.Bywriting δsasa
vectoroflength δsinthedirectionofthewire,itcanbeshownthatthecorresponding
contribution tothe magnetic fluxdensityis given by
δB = µ 0 I (δs × ˆr)
4πr2 ThisistheBiot--Savartlaw.Techniquesofintegrationarerequiredinordertocompletethecalculation,
butusingthisitispossibletoshow,forexample,thatthemagnetic
flux density a distance r from a long straight wire has magnitude B = µ 0 I
2πr .
7.6 The vector product 251
Engineeringapplication7.11
TheHalleffectinasemiconductor
A frequent requirement in the semiconductor industry is to be able to measure the
density of holes in a p-type semiconductor and the density of electrons in an n-type
semiconductor. This can be achieved by using the Hall effect. We will consider the
case of a p-type semiconductor but the derivation for an n-type semiconductor is
similar.
area, A
z
y
x
B
L
I
hole
E H
+
V
–
B
p-type
semiconductor
+ –
Meter
V H
Figure7.32
Hall effect in ap-type semiconductor.
Consider the piece of semiconductor shown in Figure 7.32. A d.c. voltage,V, is
applied to the ends of the semiconductor. This gives rise to a flow of current composed
mainly of holes as they are the majority carriers for a p-type semiconductor.
This current can be represented by a vector pointing in thexdirection and denoted
by I. A magnetic field, B, is applied to the semiconductor in the y direction. The
moving holes experience a force, F B
, per unit volume, caused by the magnetic field
given by
F B
= 1 A I ×B
where A is the cross-sectional area of the semiconductor. This causes the holes to
drift in thezdirection and so causes an excess of positive charge to appear on one
side of the semiconductor. This gives rise to a voltage known as the Hall voltage,
V H
. As this excess charge builds up it creates an electric field, E H
, in the negativez
direction,whichinturnexertsanopposingforceontheholes.Thisforceisgivenby
F E
= qp 0
E H
whereq = elementary charge = 1.60 ×10 −19 C, and p 0
= density of
holes (holes per cubic metre). Equilibrium is reached when the two forces are equal
inmagnitude, that is |F B
| = |F E
|. Now,
|F E
| =qp 0
|E H
| |F B
| = |I×B|
A
= IB A
➔
252 Chapter 7 Vectors
In equilibrium the magnitude of the electric field, |E H
|, is constant and so we can
write |E H
| = V H
, whereListhe widthof the semiconductor. Hence,
L
IB
A =qp H
0V
L
so that
p 0
= BIL
V H
qA
So, by measuring the value of the Hall voltage, it is possible to calculate the density
of the holes, p 0
, inthe semiconductor.
EXERCISES7.6
1 Evaluate
i jk
(a)
312
∣ 214 ∣
i jk
(c)
010
∣ 104 ∣
(b)
∣
(d)
∣
∣
jk
∣
ij k
−1 2 −3
−4 0 1
i
3 52
−3 −1 4
2 Ifa=i−2j+3kandb=2i−j−k,find
(a) a×b
(b) b×a
3 Ifa=i−2jandb=5i+5kfinda×b.
4 Ifa=i+j−k,b=i−jandc=2i+kfind
(a) (a×b)×c
(b) a×(b×c)
5 Ifp=6i+7j−2kandq=3i−j+4kfind|p|,|q|
and |p ×q|.Deduce the sineofthe anglebetween p
and q.
6 Forarbitraryvectors p andqsimplify
(a) (p+q)×p
(b) (p+q)×(p−q)
7 Ifc=i+jandd=2i+k,findaunitvector
perpendicular to bothcandd.Further, findthe sineof
the anglebetweencandd.
8 A, B,Care pointswith coordinates (1,2,3), (3,2,1)
and (−1,1,0),respectively. Find aunitvector
perpendicular to the planecontaining A, B andC.
9 Ifa=7i−2j−5kandb=5i+j+3k,findavector
perpendicular toaand b.
10 Ifa=7i−j+k,b=3i−j−2kand
c = 9i +j−3k, showthat
a×(b+c)=(a×b)+(a×c)
11 (a) Thearea,A,ofaparallelogramwith baseb
andperpendicular heighthisgiven byA =bh.
Showthat ifthe two non-parallelsidesofthe
parallelogramare representedby the vectorsa
andb,thenthe areais alsogiven by
A=|a×b|.
(b) Findthe area ofthe parallelogramwith sides
representedby2i +3j +k and3i +j−k.
12 Thevolume,V,ofaparallelepipedwithsidesa,band
cisgivenbyV = |a · (b ×c)|.Findthevolumeofthe
parallelepiped with sides 3i +2j +k,2i +j+k and
i+2j+4k.
13 Suppose a forceFactsthrough the pointPwith
position vector r.Themoment aboutthe origin, M,
ofthe force isameasureofthe turningeffect ofthe
force andis given by M = r ×F.Aforce of4Nacts
in the directioni +j+k,andthrough the point with
coordinates (7,1,3).Findthe momentofthe force
about the origin.
14 Inthe theory ofelectromagnetic waves an important
quantityassociatedwith the flow ofelectromagnetic
energy isthe Poyntingvector S.This isdefined as
S = E ×Hwhere Eis the electric field strength and
Hthe magnetic field strength. Suppose thatin a plane
electromagnetic wave
( ) 2πz
E=E 0 cos
λ − ωt j
7.7 Vectors ofndimensions 253
and
H = − 2πE ( )
0 2πz
cos
λωµ 0 λ − ωt i
where λ, ω,µ 0 andE 0 are constants. Findthe
Poyntingvector andconfirm thatthe directionof
energy flow isthezdirection.
Solutions
1 (a) 2i−8j+k (b) 2i+13j+8k
(c) 4i −k (d) 22i −18j +12k
8
1
√
27
(i−5j+k)
2 (a) 5i+7j+3k (b) −5i−7j−3k
3 −10i −5j +10k
4 (a) −i−3j+2k (b) i−j
5 √ 89, √ 26,48.01,0.9980
6 (a) q×p (b) 2q×p
7
1
√
6
(i −j−2k),0.775
9 −i−46j+17k
11 (b) √ 90 = 9.49
12 5
13
4
√
3
(−2i −4j +6k)
( )
14 2πE2 0
cos 2 2πz
λωµ 0 λ − ωt k,
which is avector in thezdirection.
TechnicalComputingExercises7.6
Vectorandscalarproductsarereadilycalculatedinmost
technicalcomputing languages.Forexample in
MATLAB ® wecouldcalculatethe scalar product of
a=i−2j+3kandb=2i−j−kbytyping:
a=[1 -2 3];
b=[2 -1 -1];
dot(a,b)
producingthe answer:
ans= 1
andthe vector product bythentyping:
cross(a,b)
producingthe answer:
ans =
5 7 3
1 ConfirmusingMATLAB ® ,orasimilarlanguage,that
i×j=k.
2 (a) Selecttwo different vectorsaand b and
calculate |a ×b| 2 + (a.b) 2 .
(b) Calculate |a| 2 |b| 2
(c) Whatare yourobservations about the results
from (a)and(b)? Verifyyourconclusions by
changing the vectorsaandbandrepeating
the calculations.
7.7 VECTORSOFnDIMENSIONS
The examples we have discussed have all concerned two- and three-dimensional vectors.
Our understanding has been helped by the fact that two-dimensional vectors can
be drawn in the plane of the paper and three-dimensional vectors can be visualized in
thethree-dimensionalspaceinwhichwelive.However,therearesomesituationswhen
254 Chapter 7 Vectors
the generalization to higher dimensions is appropriate, but no convenient geometrical
interpretation is available. Nevertheless, many of the concepts we have discussed are
still applicable. For example, wecan introduce the four-dimensional vectors
⎛ ⎞
⎛ ⎞
3 1
a = ⎜1
⎟
⎝2
⎠ and b= ⎜0
⎟
⎝3
⎠
4 1
It is natural to define the magnitude, or norm, of a as √ 3 2 +1 2 +2 2 +4 2 = √ 30 and
the scalar product of a and b as a·b = (3)(1) + (1)(0) + (2)(3) + (4)(1) = 13. An
n-dimensional vector will havencomponents. Operations such as addition, subtraction
and scalar multiplication aredefined inanobvious way.
It is also possible to define a set of variables as a vector. This turns out to be a useful
way of modelling a physical system. The system is described by means of a vector
which consists of an ordered set of variables sufficient to describe the state of the system.
Such a vector is called a state vector. This concept is explored in more detail in
Chapter 20.
Engineeringapplication7.12
Meshcurrentvector
When analysing a complex circuit it can be convenient to assign a current to each
smallindependentloopwithinthecircuit.Eachofthesecurrentsisknownasamesh
current. It is possible to collect these individual currents together to form a vector
quantity. Consider the following example.
A circuit as shown in Figure 7.33 has a set of mesh currents {I 1
,I 2
,I 3
,I 4
} from
whichwecan formacurrentvector
⎛ ⎞
I 1
I = ⎜I 2 ⎟
⎝I 3
⎠
I 4
Nogeometricalinterpretationispossiblebutneverthelessthisvectorprovidesauseful
mathematical way of handling the mesh currents. We shall see how vectors such
asthesecan bemanipulatedinSection8.12.
I 1 I 2 I 3 I 4
Figure7.33
Acircuitwith meshcurrents
shown.
Review exercises 7 255
EXERCISES7.7
1 If
⎛ ⎞
1
1
a =
⎜0
⎟
⎝1
⎠
1
⎛ ⎞
3
2
and b=
⎜1
⎟
⎝0
⎠
1
findthe norm ofa, the norm ofband a·b.Further,
findthe norm ofa −b.
2 Two non-zero vectors are mutually orthogonal iftheir
scalar product iszero. Determine which ofthe
following are mutually orthogonal.
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1 2 3
a = ⎜ 2
⎟
⎝ 4 ⎠ b = ⎜1
⎟
⎝0
⎠ c = ⎜0
⎟
⎝1
⎠
−1 0 0
⎛ ⎞ ⎛ ⎞
0 3
d = ⎜0
⎟
⎝7
⎠ e = ⎜ 0
⎟
⎝ −2 ⎠
2 −5
Solutions
1 2, √ 15,6, √ 7 2 aande,bandd
REVIEWEXERCISES7
1 Finda·banda×bwhen
(a) a=7i−j+k,b=3i+2j+5k
(b) a=6i−6j−6k,b=i−j−k.
2 Foratriangle ABC, express assimplyaspossiblethe
vector → AB + → BC + → CA.
3 Ifa=7i−j+2kandb=8i+j+k,find|a|,|b|and
a ·b. Deducethe cosineofthe anglebetweenaand b.
4 Ifa=6i−j+2kandb=3i−j+3k,find
|a|,|b|, |a ×b|.Deducethe sineofthe anglebetween
aandb.
5 Ifa=7i+9j−3kandb=2i−4j,findâ, ˆb, ̂ a×b.
6 Bycombining the scalar andvector productsother
types ofproductscan be defined. The triplescalar
productforthreevectors isdefined as (a ×b)·c
whichisascalar.Ifa=3i−j+2k,b=2i−2j−k,
c=3i+j,finda×band(a×b)·c.Showthat
(a×b)·c=a·(b×c).
7 Thetriplevectorproduct isdefined by (a ×b) ×c.
Find the triple vectorproduct ofthe vectors given in
Question6. Alsofind a ·c,b·c andverifythat
(a·c)b−(b·c)a=(a×b)×c
Further,finda × (b ×c) andconfirm that
a×(b×c)≠(a×b)×c.
8 Show thatthe vectorsp = 3i −2j +k,
q=2i+j−4kandr=i−3j+5kformthethree
sides ofaright-angledtriangle.
9 Find aunitvector parallelto the liney = 7x −3.Find
a unitvector paralleltoy = 2x +7.Use the scalar
product to findthe anglebetweenthese two lines.
10 An electric chargeqwhichmoves with avelocity v
produces amagnetic fieldBgiven by
B = µq v× ˆr
4π |r| 2 where µ = constant
FindBifr=3i+j−2kandv=i−2j+3k.
11 InatriangleABC, denote → ABbyc, → AC byband → CB
by a.Usethe scalar product to provethe cosine rule:
a 2 =b 2 +c 2 −2bccosA.
12 Evaluate
ij k
(a)
−4 0 −3
∣ 71 4 ∣
(b)
i jk
825
∣100∣
13 Find the area ofthe parallelogram with sides
representedby3i +5j −k and i +3j −k.
14 Find the anglebetween the vectors 7i +2j and i −3j.
15 Find aunitvector in the directionofthe linejoining
the points (2,3,4) and (7,17,1).
256 Chapter 7 Vectors
16 Show that the vectorsi −j and −3i −3j are
perpendicular.
17 Find the norm ofeach ofthe vectors
⎛ ⎞ ⎛ ⎞
7 2
⎜ 2
⎟
⎝ −1⎠ and ⎜ 1
⎟
⎝ 0 ⎠
2 −4
18 (a)Usethe scalar product to findthe value ofthe
scalar µ sothat i +j+µk isperpendicular to the
vector i +j+k.
(b)Usethevectorproductandtheresultsfrompart(a)
to findamutually perpendicular setofunitvectors
ˆv 1 , ˆv 2 and ˆv 3 , where ˆv 1 is inclined equally to the
vectorsi,jand k.
19 ThepointsA, Band C have coordinates (2,−1, −2),
(4,−1,−3)and (1,3,−1).
(a) Writedown the vectors → ABand → AC.
(b) Using the vectorproduct findaunitvector
which isperpendicular to the plane containing
A, B and C.
(c) IfDisthe point with coordinates (3,0,1),use
the scalar product to find the perpendicular
distance fromDto the plane ABC.
20 Thecondition forvectorsa,band cto be coplanar
(i.e.they liein the same plane) isa · (b ×c) = 0.
(a) Show that the vectorsa = 4i +5j +6k,
b=6i−3j−3kandc=−i+2j+2kare
notcoplanar.
(b) Givend = −i +2j + λk,findthe value of λ so
thata, b anddare coplanar.
21 Points Aand Bhave position vectorsaand b
respectively. Show that the position vector ofan
arbitrarypoint on the line AB isgiven by
r = a + λ (b −a) forsome scalar λ.Thisis the
vector equationofthe line.
22 Usevector methods to show that the threemedians of
any triangle intersect atacommon point (called the
centroid).
23 Usethe vectorproduct to findthe area ofatriangle
with vertices atthe pointswith coordinates (1,2,3),
(4,−3,2),and (8,1,1).
Solutions
1 (a) 24, −7i −32j +17k (b) 18, 0
2 0
3 √ 54, √ 66,57,0.9548
4 √ 41, √ 19, √ 154,0.4446
5
1
√
139
a,
1
√
20
b,
6 5i+7j−4k,22
1
√
2296
(−12i −6j −46k)
7 (a×b)×c=4i−12j−16k
a·c=8,b·c=4
a×(b×c)=−2i−22j−8k
1 1
9 √ (i +7j), √ (i +2j),18.4 ◦
50 5
10
µq
56 √ (i +11j +7k)
14π
12 (a) 3i−5j−4k
(b) 5j −2k
13 √ 24 = 4.90
14 87.5 ◦
15
1
√
230
(5i +14j −3k)
17 √ 58, √ 21
18 (a) µ=−2
(b) ˆv 1 = 1 √
3
(1,1,1), ˆv 2 = 1 √
6
(1,1,−2),
ˆv 3 = 1 √
2
(−1,1,0)
19 (a) (2,0,−1), (−1,4,1) (b)
1
(4,−1,8) (c) 3
9
20 (a) a · (b ×c) = 9 ≠ 0 andhence the vectors are not
coplanar
(b) λ = 31/14
23 1 2√
1106
8 Matrixalgebra
Contents 8.1 Introduction 257
8.2 Basicdefinitions 258
8.3 Addition,subtractionandmultiplication 259
8.4 Usingmatricesinthetranslationandrotationofvectors 267
8.5 Somespecialmatrices 271
8.6 Theinverseofa2×2matrix 274
8.7 Determinants 278
8.8 Theinverseofa3×3matrix 281
8.9 Applicationtothesolutionofsimultaneousequations 283
8.10 Gaussianelimination 286
8.11 Eigenvaluesandeigenvectors 294
8.12 Analysisofelectricalnetworks 307
8.13 Iterativetechniquesforthesolutionofsimultaneousequations 312
8.14 Computersolutionsofmatrixproblems 319
Reviewexercises8 321
8.1 INTRODUCTION
Matrices provide a means of storing large quantities of information in such a way that
each piece can be easily identified and manipulated. They permit the solution of large
systemsoflinearequationstobecarriedoutinalogicalandformalwaysothatcomputer
implementation follows naturally. Applications of matrices extend over many areas of
engineering including electrical network analysis and robotics.
258 Chapter 8 Matrix algebra
An example of an extremely large electrical network is the national grid in Britain.
Theequationsgoverningthisnetworkareexpressedinmatrixformforanalysisbycomputerbecausesolutionsarerequiredatregularintervalsthroughoutthedayandnightin
order to make decisions such as whether or not a power station should be connected to,
or removed from, the grid.
Toobtainthetrajectoryofarobotitisnecessarytoperformmatrixcalculationstofind
thespeedatwhichvariousmotorswithintherobotshouldoperate.Thisisacomplicated
problemasitisnecessarytoensurethatarobotreachesitsrequireddestinationanddoes
not collide with another object during its movement.
8.2 BASICDEFINITIONS
Amatrix isarectangular pattern or arrayof numbers.
For example,
⎛
A = ⎝
⎞
123
456⎠ B =
−124
( ) 1 2 −2
340.5
C = ( 1 −1 1 )
areallmatrices.Notethatweusuallyuseacapitallettertodenoteamatrix,andenclose
the array of numbers in brackets. To describe the size of a matrix we quote its number
of rows and columns in that order so, for example, an r ×s matrix has r rows and s
columns. We say the matrix hasorderr×s.
Anr×smatrix hasrrows andscolumns.
Example8.1 DescribethesizesofthematricesA,BandCatthestartofthissection,andgiveexamples
of matrices of order 3 ×1,3 ×2 and 4 ×2.
Solution Ahas order 3 ×3,Bhas order 2 ×3 andC has order 1 ×3.
⎛ ⎞
⎛ ⎞
−1
1 2
⎝−2⎠ isa3 ×1 matrix ⎝3 4⎠ isa3 ×2 matrix
3
5 6
and ⎛ ⎞
−1 −1
⎜ ⎜⎝ −1 2
⎟
2 −0.5⎠ 1 0
isa4 ×2 matrix
More generally, ifthe matrixAhasmrows andncolumns wecan write
⎛
⎞
a 11
a 12
... a 1n
a 21
a 22
... a 2n
A = ⎜
⎝
.
.
. .. .
⎟
⎠
a m1
a m2
... a mn
8.3 Addition, subtraction and multiplication 259
wherea ij
representsthenumberorelementintheithrowand jthcolumn.Amatrixwith
a singlecolumn can alsobe regarded asacolumn vector.
Theoperationsofaddition,subtractionandmultiplicationaredefineduponmatrices
and these areexplained inSection 8.3.
8.3 ADDITION,SUBTRACTIONANDMULTIPLICATION
8.3.1 Matrixadditionandsubtraction
Twomatricescanbeadded(orsubtracted)iftheyhavethesameshapeandsize,thatisthe
sameorder.Theirsum(ordifference)isfoundbyadding(orsubtracting)corresponding
elements as the following example shows.
Example8.2 If
A =
( ) 1 5 −2
31 1
and B=
( ) 120
−1 1 4
findA+BandA−B.
Solution
( ) ( ) ( )
1 5 −2 120 2 7 −2
A +B= + =
31 1 −1 1 4 22 5
( ) ( ) ( )
1 5 −2 120 0 3 −2
A−B= − =
31 1 −1 1 4 4 0 −3
Ontheotherhand,thematrices
they have different orders.
( ) ( 1 2 3
and cannotbeaddedorsubtractedbecause
0 1 1)
Example8.3 IfC =
( ) a b
andD=
c d
( ) e f
showthatC+D=D+C.
g h
Solution C+D =
D+C =
( ) ( ) ( )
a b e f a +e b+f
+ =
c d g h c+g d+h
( ) ( ) ( )
e f a b e +a f+b
+ =
g h c d g+c h+d
Nowa +e is exactly the same ase +a because addition of numbers is commutative.
The same observation can be made ofb + f,c +gandd+h. HenceC +D =D +C.
The addition of these matrices is therefore commutative. This may seem an obvious
statementbut weshall shortlymeet matrix multiplication which isnotcommutative, so
ingeneral commutativity should not be simplyassumed.
260 Chapter 8 Matrix algebra
The resultobtained inExample 8.3 istruemore generally:
Matrix addition iscommutative, thatis
A+B=B+A
Itis alsoeasytoshow that
Matrix addition isassociative, that is
A+(B+C)=(A+B)+C
8.3.2 Scalarmultiplication
GivenanymatrixA,wecanmultiplyitbyanumber,thatisascalar,toformanewmatrix
of the same order asA. This multiplication is performed by multiplying every element
ofAby the number.
Example8.4 If
⎛ ⎞
1 3
A = ⎝−2 1⎠
0 1
find 2A, −3Aand 1 2 A.
⎛ ⎞ ⎛ ⎞
1 3 2 6
Solution 2A=2⎝−2 1⎠ = ⎝−4 2⎠
0 1 0 2
⎛ ⎞
1 3
⎛ ⎞
−3 −9
−3A = −3⎝−2 1⎠ = ⎝ 6 −3⎠
0 1 0 −3
and
⎛
1
⎛ ⎞
1
2 A = 1 1 3
2
⎝−2 1⎠ =
−1
2 ⎜
0 1 ⎝
0
3
⎞
2
1
2⎟
⎠
1
2
8.3 Addition, subtraction and multiplication 261
Ingeneral wehave
⎛
⎞ ⎛
⎞
a 11
a 12
... a 1n
ka 11
ka 12
... ka 1n
a 21
a 22
... a 2n
IfA= ⎜
⎝
.
.
. ..
. ..
⎟
⎠ thenkA = ka 21
ka 22
... ka 2n
⎜
⎝
.
.
. ..
. ..
⎟
⎠
a m1
a m2
... a mn
ka m1
ka m2
... ka mn
8.3.3 Matrixmultiplication
Matrix multiplication is defined in a special way which at first seems strange but is in
factveryuseful.IfAisa p ×qmatrixandBisanr×smatrixwecanformtheproduct
AB only ifq =r; that is, only if the number of columns inAis the same as the number
of rows inB. The product isthen a p ×smatrixC, thatis
C=AB where Aisp×q
Bisq×s
Cisp×s
Example8.5 GivenA = ( 4 2 ) andB=
Solution Ahas size1 ×2
Bhassize2×3
( ) 3 7 6
can the productAB beformed?
52 −1
Because the number of columns inAis the same as the number of rows in B, we can
formtheproductAB.Theresultingmatrixwillhavesize1 ×3becausethereisonerow
inAand three columns inB.
SupposewewishtofindABwhenA = ( 4 2 ) andB=
( 3
7)
.Ahassize1 ×2andBhas
size2 ×1andsowecanformtheproductAB. Theresultwillbea1 ×1matrix,thatis
a singlenumber. We perform the calculation as follows:
AB = ( 4 2 )( )
3
=4×3+2×7=12+14=26
7
Note that we have multiplied elements in the row ofAwith corresponding elements in
the column ofB, and added the results together.
⎛ ⎞
Example8.6 FindCD whenC = ( 192 ) 2
andD= ⎝6⎠.
8
⎛ ⎞
Solution CD = ( 192 ) 2
⎝6⎠ =1×2+9×6+2×8=2+54+16=72
8
Let us now extend this idea to general matrices A and B. Suppose we wish to findC
whereC =AB.Theelementc 11
isfoundbypairingeachelementinrow1ofAwiththe
262 Chapter 8 Matrix algebra
Example8.7 IfA =
correspondingelementincolumn1ofB.Thepairsaremultipliedtogetherandthenthe
resultsareaddedtogivec 11
.Similarly,tofindtheelementc 12
,eachelementinrow1of
Aispairedwiththecorrespondingelementincolumn2ofB.Again,thepairedelements
are multiplied together and the results are added to form c 12
. Other elements ofC are
foundinasimilarway.Ingeneraltheelementc ij
isfoundbypairingelementsintheith
rowofAwiththoseinthe jthcolumnofB.Thesearemultipliedtogetherandtheresults
are added togivec ij
. Consider the following example.
( ) 1 2
andB=
4 3
Solution We can form the product
( ) ( ) 1 2 5
C=AB=
4 3 −3
( ) 5
find, ifpossible, the matrixC whereC =AB.
−3
↑ ↑
2×2 2×1
because the number of columns inA, that is 2, is the same as the number of rows inB.
The size of the product is found by inspecting the number of rows in the first matrix,
which is 2, and the number of columns in the second, which is 1. These numbers give
the number of rows and columns respectively inC. ThereforeC will be a 2 ×1 matrix.
To find the elementc 11
we pair the elements in the first row ofAwith those in the
first column ofB, multiplyand then add these together. Thus
c 11
=1×5+2×−3=5−6=−1
Similarly,tofindtheelementc 21
wepairtheelementsinthesecondrowofAwiththose
inthe first column ofB, multiply and then add these together. Thus
c 21
=4×5+3×−3=20−9=11
The complete calculation iswritten as
( )( ) ( )
1 2 5 1 ×5+2×−3
AB = =
4 3 −3 4×5+3×−3
( ) 5 −6
=
20−9
( ) −1
=
11
IfAis a p ×q matrix andBis aq ×s matrix, then the productC = AB will be a
p ×s matrix. To findc ij
we take theith row ofAand pair its elements with the jth
column ofB. The paired elements aremultiplied together and added toformc ij
.
Example8.8 IfB =
( ) 123
andC=
456
⎛ ⎞
1
⎝2⎠ findBC.
4
8.3 Addition, subtraction and multiplication 263
Solution B has order 2 ×3 andC has order 3 ×1 so clearly the productBC exists and will have
order 2 ×1.BC isformed as follows:
( ) ⎛ ⎞
( ) ( )
123
BC = ⎝1
2⎠ 1 ×1+2×2+3×4 17
=
=
456 4×1+5×2+6×4 38
4
Note that the order of the product, 2 ×1, can be determined at the start by considering
the orders ofBandC.
Example8.9 FindABwhere
⎛ ⎞
1 2
⎛ ⎞
−1
A = ⎝ 3 4⎠ and B= ⎝ 2⎠
−1 0
−1
Solution A andBhave orders 3 × 2 and 3 × 1, respectively, and consequently the product,AB,
cannot beformed.
Example8.10 Given
⎛ ⎞
1 11
A = ⎝2 10⎠ and
⎛ ⎞
0 31
B= ⎝4−10⎠
3−12 2 21
find, ifpossible,ABandBA, and comment upon the result.
Solution A andBboth have order 3 × 3 and the productsAB andBA can both be formed. Both
will have order 3 ×3.
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1 11 0 31 6 42
AB = ⎝2 10⎠
⎝4−10⎠ = ⎝4 52⎠
3−12 2 21 0145
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
0 31 1 11 922
BA = ⎝4−10⎠
⎝2 10⎠ = ⎝234⎠
2 21 3−12 934
ClearlyAB andBA are not the same. Matrix multiplication is not usually commutative
and wemustpay particular attention tothisdetailwhen weareworking withmatrices.
IngeneralAB ≠BA and so matrix multiplication isnot commutative.
In the product AB we say that B has been premultiplied by A, or alternatively A has
beenpostmultiplied byB.
264 Chapter 8 Matrix algebra
Example8.11 Given
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1−12 −1 29 415
A = ⎝ 3 02⎠ B = ⎝ 1 00⎠ C = ⎝231⎠
−1 35 3−21 015
findBC,A(BC),AB and (AB)C, commenting upon the result.
⎛
−1
⎞ ⎛ ⎞
2 9 415
⎛ ⎞
0 14 42
Solution BC = ⎝ 1 00⎠
⎝231⎠ = ⎝4 1 5⎠
3 −2 1 015 8 −2 18
⎛ ⎞ ⎛ ⎞
1 −1 2 0 14 42
⎛
12 9
⎞
73
A(BC) = ⎝ 3 02⎠
⎝4 1 5⎠ = ⎝16 38 162⎠
−1 3 5 8 −218 52 −21 63
⎛ ⎞ ⎛
1 −1 2 −1
⎞
29
⎛
4 −2
⎞
11
AB = ⎝ 3 02⎠
⎝ 1 00⎠ = ⎝ 3 2 29⎠
−1 3 5 3 −21 19 −12 −4
⎛
4 −2
⎞ ⎛ ⎞
11 415
⎛
12 9
⎞
73
(AB)C = ⎝ 3 2 29⎠
⎝231⎠ = ⎝16 38 162⎠
19 −12 −4 015 52 −21 63
We note thatA(BC) = (AB)C so thatinthiscase matrix multiplication isassociative.
The resultobtained inExample 8.11 isalsotrueingeneral:
Matrix multiplication isassociative:
(AB)C =A(BC)
EXERCISES8.3
1 Evaluate
( ) ( )
−1
(a) 1 4
4
(b) ( 3 7 ) ( )
3
−4
( ( ) 5 2 9
(c)
1 3)
1
( ) ( ) 1 4 5 2
(d)
−1 3 −3 4
( ) ( ) 5 1 6 −1
(e)
29 6 −29 5
⎛ ⎞
( )
1
(f) 124 ⎝3⎠
2
⎛ ⎞
(g) ( 5 −1 3 ) 3 3
⎝2 6⎠
4 1
(h) ( 1 9 ) ( )
111
222
( )( ) 1 3 2 1
(i)
−1 0 5 0
( )( ) 3 0 0 3
(j)
0 1 9 0
( ) ( ( )
1 1 2 −7 1
2 IfA= ,B = ,C = ,
3 4 1)
0 4
D = ( 321 ) ( ) 23 4
andE= find,ifpossible,
12 −1
(a)A+D,C−AandD−E
(b) AB,BA,CA,AC,DA,DB,BD,EB,BE andAE
(c) 7C, −3D andkE,wherekisascalar.
8.3 Addition, subtraction and multiplication 265
3 Plotthe pointsA,B,C with position vectors given by
( ( ( 1 2 2
v 1 = v
0)
2 = v
0)
3 =
3)
respectively. Treating these vectors asmatrices of
order 2 ×1 findthe productsMv 1 ,Mv 2 ,Mv 3 when
( ) 1 0
(a) M=
0 −1
( ) 0 1
(b)M=
1 0
( ) 0 −1
(c) M=
1 0
Ineach casedraw adiagram to illustrate the effect
uponthe vectors ofmultiplication bythe matrix.
4 FindABandBA where
⎛ ⎞
13 2
A = ⎝−10 4⎠
51 −1
⎛ ⎞
521
B = ⎝034⎠
135
5 Given thatA 2 meansthe product ofamatrixAwith
( )
itself,findA 2 4 2
whenA = .FindA
1 3
3 .
( ) ( ) 1 3 2 1
6 IfA= ,B = findAB,BA,A +B
−2 4 −4 5
and (A +B) 2 .Show that
(A+B) 2 =A 2 +AB+BA+B 2
Whyis (A +B) 2 notequaltoA 2 +2AB +B 2 ?
7 Find, ifpossible,
⎛ ⎞⎛
⎞
10 00 1
(a) ⎜00 −10
⎟⎜1
⎟
⎝01 00⎠⎝2⎠
00 01 1
⎛ ⎞⎛
⎞
0010 2
(b) ⎜ 0100
⎟⎜5
⎟
⎝−1000⎠⎝5⎠
0001 1
( ) ⎛ 2 1 36
8 Find ⎝1
3 −5⎠.
2 −57
6 7
⎛ ⎞
1
9 Given the vectorv = ⎝2⎠calculate the vectors
3
obtained whenvispremultiplied by the following
matrices:
⎛ ⎞ ⎛ ⎞
62 9
(a) ⎝ 13 2⎠
−12 −3
⎛ ⎞
131
(c) ⎝926⎠
280
⎛ ⎞
683
(e) ⎜964
⎟
⎝539⎠
252
⎞
(b)
(d)
−103
⎝ 719⎠
134
( ) 312
654
Solutions
1 (a) 15 (b) −19
( ) ( )
47
−7 18
(c) (d)
12
−14 10
( ) 1 0
(e) (f) 15
0 1
(g) (25 12) (h) (19 19 19)
(i)
( ) 17 1
−2 −1
(j)
( ) 0 9
9 0
( ) −8 0
2 (a) A +Ddoes notexist, ,
−3 0
D −E does not exist
( ) 3
(b) ,BAdoes notexist,
10
(c)
3 (a)
(c)
( ) −4 −3
,
12 16
( ) −7 5
,
−21 19
DAdoes notexist,DBdoesnot exist,
( ) 642
,
321
EBdoes notexist,BE doesnot exist,
( ) 3 53
10178
( ) ( )
−49 7 2k 3k 4k
, (−9−6−3),
0 28 k2k −k
( ( ( ) ( ( ( 1 2 2 0 0 3
, , (b) , ,
0)
0)
−3 1)
2)
2)
( ( ( 0 0 −3
, ,
1)
2)
2)
266 Chapter 8 Matrix algebra
4
5
6
⎛ ⎞ ⎛ ⎞
71723 81617
⎝−1 10 19⎠,
⎝17 4 8⎠
2410 4 23 8 9
( ) ( ) 18 14 86 78
,
7 11 39 47
( ) ( )
−10 16 0 10
, ,
−20 18 −14 8
( ) ( ) 3 4 −15 48
,
−6 9 −72 57
⎛ ⎞
1
7 (a) ⎜−2
⎟
⎝ 1⎠
1
( ) 46 29
8
29 78
⎛ ⎞
37
9 (a) ⎝ 13⎠
−6
(d)
( ) 11
28
(b)
⎛
⎜
⎝
⎞
5
5
⎟
⎠
−2
1
⎛ ⎞
8
(b) ⎝36⎠
19
⎛ ⎞
31
(e) ⎜33
⎟
⎝38⎠
18
(c)
⎛ ⎞
10
⎝31⎠
18
TechnicalComputingExercises8.3
Technical computing languages such asMATLAB ® are
usuallydesigned to perform operationson scalars,
vectorandmatrices.However,ashasbeendemonstrated
in thischapter, different mathematical rules apply when
workingwith vectors and matrices.Consider the
followingquantities:
a = −2 (a scalar quantity)
b = 7 (a scalar quantity)
( ) 1 3
c = , (a 2 by2matrix)
2 4
( ) 5 7
d = , (a 2 by 2 matrix)
6 8
⎛ ⎞
1 4
e = ⎝2 5⎠, (a 3 by2matrix)
3 6
( ) 135
f = , (a 2 by3matrix)
246
Load these asvariables in atechnical computing
language and thenattempt to perform the following
calculations andanswer the questions.
1. Find the product ofthe scalarsaandb.Isthisa
valid calculation?
2. Find the product ofthe scalar aand the matrixc.Is
thiscalculation valid? Similarlyfindthe product of
the matrixcandscalara.Do these calculations
produce the same result?
3. Find the productsofthe matricescandd,that iscd
anddc. Are these calculations valid and dothey
produce the same result?
4. Find the productsofthe matriceseand f,that isef
and fe.Are these calculations valid and dothey
produce the same result?
5. Find the productsofthe matricesd ande,that isde
anded. Are these calculations valid and dothey
produce the same result?
6. Findc 2 .Isthiscalculation valid?
7. Finde 2 .Isthiscalculation valid?
Solutions
1 Yes, the calculation is valid. Itis ascalar multiplied
byascalar.InMATLAB ® :
a*b
ans = -14
2 Yes, the calculations are valid and produce the same
result.
a*c
ans =
-2 -6
-4 -8
8.4 Using matrices in the translation and rotation of vectors 267
c*a
ans =
-2 -6
-4 -8
Multiplication ofascalar byamatrixis
commutative.Itdoes notmatter in which order the
calculation is performed. Each element in the matrix
ismultiplied bythe scalar value.
3 Thecalculations are valid but they do notproduce the
same result becausematrixmultiplication isnot
commutative.
c*d
ans =
23 31
34 46
>> d*c
ans =
19 43
22 50
Thisresult emphasizes the need forcare when
manipulating matrixequations.
4 Bothcalculations are valid but the resultsare not the
same.
e*f
ans =
f*e
ans =
9 19 29
12 26 40
15 33 51
22 49
28 64
5 de isnot valid asthere are 2 columns ind and 3 rows
ine. Itshould generate an error in atechnical
computing language. Itisworth observing the error
message andunderstanding why thishappensfor
future reference.
ed isvalid,
e*d
ans =
29 39
40 54
51 69
6 Itis valid. We takethisto mean the matrix
multiplication ofcbyitself:
c*c
ans =
7 15
10 22
Notice that thisisdifferent to taking the squareof
each element withinc. Ifwe wanted to dothat it
would be necessaryto use the ‘dot’ notation:
c.^2
ans =
1 9
4 16
Thecommandc.ˆ2means‘takeeach elementofcand
raiseitto the power of2.There is asetofoperators
within MATLAB ® which allow multiplication of
matrices onan element byelement basis.Care has to
be taken as(forexample)c ∗dandc. ∗dare both
valid operationswhich generate a3by 3 result but the
elements ofthe matrixare different. Inother words,it
could give an incorrect result but MATLAB ® would
not giveany error messages.
7 Itis notvalid. Itis only possibleto multiply amatrix
by itselfifthere are the same number ofcolumns as
rows.
8.4 USINGMATRICESINTHETRANSLATIONANDROTATION
OFVECTORS
There are many applications in engineering that require vectors to be manipulated into
newconfigurations.Onemechanismforachievingthisisbytheuseofmatrices.Wewill
introduce this topic through the example of robotics. However, it is important to note
thatthe ideas aremore generally applicable inengineering.
268 Chapter 8 Matrix algebra
Engineeringapplication8.1
Robotcoordinateframes
InChapter7wesawthatvectorsprovideausefultoolfortheanalysisoftheposition
ofrobots.Byassigningavectortoeachofthelinksthepositionvectorcorresponding
tothetipoftherobot canbecalculated (Figure8.1).Inpracticetheinverse problem
ismorelikely:calculatethelinkvectorstoachieveaparticularpositionvector.Usually
a desired position for the tip of the robot is known and link positions to achieve
this are required. The problem is made more complicated because the position of a
link depends upon the movements of all the joints between it and the anchor point.
Thesolutionofthisproblemcanbequitecomplicated,especiallywhentherobothas
several links. One way forward is to define the position of a link by its own local
set of coordinates. This is usually termed a coordinate frame because it provides a
frameofreferenceforthelink.Matrixoperationscanthenbeusedtorelatethecoordinateframes,thusallowingalinkpositiontobedefinedwithrespecttoaconvenient
coordinateframe.Acommonrequirementistobeabletorelatethelinkpositionsto
a world coordinate frame. If a robot is being used in conjunction with other machinesthentheworldcoordinateframemayhaveanoriginsomedistanceawayfrom
the robot. The advantage of defining link coordinate frames is that the position of a
linkiseasilydefinedwithinitsowncoordinateframeandthemovementofcoordinate
frames relative toeach other can be expressed by means ofmatrix equations.
b
c
d
a
p
p = a + b + c + d
Figure8.1
Arobotwithlinksrepresentedbyvectors
a, b, cand d.
8.4.1 Translationandrotationofvectors
An introduction to the mathematics involved in analysing the movement of robots can
be obtained by examining the way in which vectors can be translated and rotated using
matrix operations.
Consider the pointPwith position vector given by
⎛ ⎞
x
r = ⎝y⎠
z
In order to translate and rotate this vector it is useful to introduce an augmented vector
Vgiven by
⎛ ⎞
x
V = ⎜y
⎟
⎝z⎠ (8.1)
1
8.4 Using matrices in the translation and rotation of vectors 269
Itisthen possible todefine several matrices:
⎛
⎞
1 0 0 0
Rot(x,θ) = ⎜0 cosθ −sinθ 0
⎟
⎝0 sinθ cosθ 0⎠ (8.2)
0 0 0 1
⎛
⎞
cosθ 0 sinθ 0
Rot(y,θ) = ⎜ 0 1 0 0
⎟
⎝−sinθ 0 cosθ 0⎠ (8.3)
0 0 0 1
⎛
⎞
cosθ −sinθ 0 0
Rot(z,θ) = ⎜sinθ cosθ 0 0
⎟
⎝ 0 0 1 0⎠ (8.4)
0 0 0 1
⎛ ⎞
100a
Trans(a,b,c) = ⎜010b
⎟
⎝001c⎠ (8.5)
0001
Matrices (8.2) to (8.4) allow vectors to be rotated by an angle θ around axesx,yandz,
respectively. For example, the product Rot(x,θ)V has the effect of rotating r through
anangle θ about thexaxis. Matrix (8.5)allows a vector tobetranslatedaunits inthex
direction,bunits intheydirection andcunits inthezdirection.
It is possible to combine these matrices to calculate the effect of several operations
on a vector. In doing so, it is important to maintain the correct order of operations as
matrix multiplication isnon-commutative.
For example, the position of a vector that has first been translated and then rotated
about thexaxis can be defined by
V new
= Rot(x,θ)Trans(a,b,c)V old
Afewexamples will help toclarifythese ideas.
Example8.12 Rotate the vector
⎛ ⎞
1
r = ⎝1⎠
2
through 90 ◦ about thexaxis.
⎛ ⎞
⎛ ⎞ 1
1
Solution r old
= ⎝1⎠ V old
= ⎜1
⎟
⎝2⎠
2
1
Rot(x,90 ◦ ) =
⎛
⎞ ⎛ ⎞
1 0 0 0 10 00
⎜0 cos90 ◦ −sin90 ◦ 0
⎟
⎝0 sin90 ◦ cos90 ◦ 0⎠ = ⎜00−10
⎟
⎝01 00⎠
0 0 0 1 00 01
270 Chapter 8 Matrix algebra
V new
=
So,
⎛ ⎞
1
r new
= ⎝−2⎠
1
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
10 00 1 1
⎜00−10
⎟ ⎜1
⎟
⎝01 00⎠
⎝2⎠ = ⎜−2
⎟
⎝ 1⎠
00 01 1 1
Example8.13 Translate the vector
⎛ ⎞
1
r = ⎝3⎠
2
by
⎛ ⎞
1
⎝2⎠
3
and then rotate by 90 ◦ about theyaxis.
⎛ ⎞
⎛ ⎞ 1
1
Solution r old
= ⎝3⎠ V old
= ⎜3
⎟
⎝2⎠
2
1
To translate r old
, weform
⎛ ⎞
1001
Trans(1,2,3) = ⎜0102
⎟
⎝0013⎠
0001
Then
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1001 1 2
Trans(1,2,3)V old
= ⎜0102
⎟ ⎜3
⎟
⎝0013⎠
⎝2⎠ = ⎜5
⎟
⎝5⎠
0001 1 1
To rotate by 90 ◦ about theyaxis we require
⎛
⎞ ⎛ ⎞
cos90 ◦ 0 sin90 ◦ 0 0010
Rot(y,90 ◦ ) = ⎜ 0 1 0 0
⎟
⎝−sin90 ◦ 0 cos90 ◦ 0⎠ = ⎜ 0100
⎟
⎝−1000⎠
0 0 0 1 0001
⎛ ⎞
2
The vector ⎜5
⎟
⎝5⎠ ispremultiplied by thismatrix togive
1
8.5 Some special matrices 271
V new
= Rot(y,90 ◦ )Trans(1,2,3)V old
⎛ ⎞⎛
⎞ ⎛ ⎞
0010 2 5
= ⎜ 0100
⎟⎜5
⎟
⎝−1000⎠⎝5⎠ ⎜ 5
⎟
⎝−2⎠
0001 1 1
Hence
⎛ ⎞
5
r new
= ⎝ 5⎠
−2
8.5 SOMESPECIALMATRICES
8.5.1 Squarematrices
Amatrixwhichhasthesamenumberofrowsascolumnsiscalledasquarematrix.Thus
⎛ ⎞
123
( )
⎝−101⎠ −130
isasquare matrix, while isnot
241
321
8.5.2 Diagonalmatrices
Some square matrices have elements which are zero everywhere except on the leading
diagonal (top-lefttobottom-right). Such matrices aresaidtobediagonal. Thus
⎛ ⎞
⎛ ⎞
10 0 ( )
3000
⎝02 0⎠
1 0
⎜0200
⎟
0 b ⎝0010⎠
00−1
0000
areall diagonal matrices, whereas
⎛ ⎞
124
⎝010⎠
301
isnot.
8.5.3 Identitymatrices
Diagonal matrices which have only ones on their leading diagonals, for example
⎛ ⎞
( ) 100
1 0
and ⎝010⎠
0 1
001
arecalledidentity matrices and aredenoted by the letterI.
( ) ( )
1 0 2 44
Example8.14 FindIAwhereI = andA= and comment upon the result.
0 1 3−10
( )( ) ( )
1 0 2 44 2 44
Solution IA =
=
0 1 3−10 3−10
272 Chapter 8 Matrix algebra
TheeffectofpremultiplyingAbyIhasbeentoleaveAunaltered.Theproductisidentical
tothe originalmatrixA, and this iswhyI iscalled an identitymatrix.
In general, if A is an arbitrary matrix and I is an identity matrix of the appropriate
size,then
IA=A
IfAisasquare matrix thenIA =AI =A.
8.5.4 Thetransposeofamatrix
IfAisanarbitrarym×nmatrix,arelatedmatrixisthetransposeofA,writtenA T ,found
by interchanging the rows and columns ofA. Thus the first row ofAbecomes the first
column ofA T and so on.A T isann ×mmatrix.
( ) 1 −1
Example8.15 IfA = findA
2 4
T .
( )
Solution 1 2
A T =
−1 4
( ) 426
Example8.16 IfA = findA
187
T and evaluateAA T .
⎛ ⎞
4 1
( ) ⎛ ⎞
1
Solution A T = ⎝2 8⎠ 426
AA T = ⎝4
2 8⎠ =
187
6 7
6 7
( ) 56 62
62 114
8.5.5 Symmetricmatrices
IfasquarematrixAanditstransposeA T areidentical,thenAissaidtobeasymmetric
matrix.
⎛ ⎞
5−4 2
Example8.17 IfA = ⎝−4 6 9⎠findA T .
2 913
⎛ ⎞
5−4 2
Solution A T = ⎝−4 6 9⎠
2 913
whichisclearlyequaltoA.HenceAisasymmetricmatrix.Notethatasymmetricmatrix
issymmetrical about its leading diagonal.
8.5.6 Skewsymmetricmatrices
If a square matrixAissuchthatA T = −A thenAissaidtobe skewsymmetric.
Example8.18 IfA =
Solution We haveA T =
( ) 0 5
, findA
−5 0
T and deduce thatAisskew symmetric.
8.5 Some special matrices 273
( ) 0 −5
which isclearly equal to −A. HenceAisskew symmetric.
5 0
EXERCISES8.5
1 IfA=
(a)findA T ,
(b)findB T ,
(c)findAB,
( ) 3 1
andB=
2 6
(d)find (AB) T ,
( ) −1 4
3 8
(e)deduce that (AB) T =B T A T .
⎛ ⎞
x
2 Treating the columnvectorx = ⎝y⎠asa3×1
z
matrix, findIxwhereI is the 3 ×3 identitymatrix.
( ) a b
3 IfA= showthatAA
c d
T is asymmetric
matrix.
⎛ ⎞ ⎛ ⎞
213 1 −70
4 IfA= ⎝ 421⎠andB=
⎝0 25⎠
−132 3 45
findA T ,B T ,ABand (AB) T .
Deduce that (AB) T =B T A T .
5 Determine the type ofmatrixobtained when two
diagonal matrices are multiplied together.
Solutions
( ) 3 2
1 (a)
1 6
( ) 0 20
(c)
16 56
⎛ ⎞
x
2 ⎝y⎠
z
4
( ) −1 3
(b)
4 8
( ) 0 16
(d)
20 56
⎛ ⎞ ⎛ ⎞
24 −1 103
⎝12 3⎠,
⎝−724⎠,
31 2 055
6 IfA=
( ) a b
is skew symmetric, showthat
c d
a =d = 0,that is the diagonal elementsare zero.
( ) 1 13
7 IfA=
15 7
(a) findA T ,
(b) find (A T ) T ,
(c) deduce that (A T ) T is equal toA.
( ) 9 4
8 IfA=
3 2
(a) findA +A T andshowthat thisisasymmetric
matrix,
(b) findA −A T andshowthat thisisaskew
symmetric matrix.
9 The sumofthe elements onthe leading diagonal
ofasquarematrixis known asitstrace. Find the
trace of
( ) ( )
7 2 0 9
(a) (b)
−1 5 −1 0
⎛ ⎞ ⎛ ⎞
(c)
100
⎝010⎠
001
(d)
7 2 1
⎝8 2 3⎠
9 −1 −4
⎛ ⎞ ⎛ ⎞
11 020 11 7 5
⎝ 7 −20 15⎠,
⎝ 0 −20 21⎠
5 2125 20 15 25
5 Diagonal matrix
( ) 1 15
7 (a)
13 7
( ) 18 7
8 (a)
7 4
( ) 1 13
(b)
15 7
( ) 0 1
(b)
−1 0
9 (a)12 (b)0 (c)3 (d)5
274 Chapter 8 Matrix algebra
8.6 THEINVERSEOFA2×2MATRIX
Whenwearedealingwithordinarynumbersitisoftennecessarytocarryouttheoperationofdivision.Thus,forexample,ifweknowthat3x
= 4,thenclearlyx = 4/3.Ifwe
are given matricesAandC and know that
AB=C
how do we findB? Itmight be tempting towrite
B = C A
Unfortunately, this would be entirely wrong since division of matrices is not defined.
However, given expressions like AB = C it is often necessary to be able to find the
appropriateexpressionforB.Thisiswhereweneedtointroducetheconceptofaninverse
matrix.
IfAisasquare matrix and we can find another matrixBwith the property that
AB=BA=I
thenBissaid tobe theinverseofAand iswrittenA −1 , thatis
AA −1 =A −1 A =I
IfBis the inverse ofA, thenAis also the inverse ofB. Note thatA −1 does not mean a
reciprocal; there is no such thing as matrix division.A −1 is the notation we use for the
inverse ofA.
Multiplyingamatrix byits inverse yieldsthe identitymatrixI, thatis
AA −1 =A −1 A =I
Example8.19 IfA =
SinceAisasquarematrix,A −1 isalsosquareandofthesameorder,sothattheproducts
AA −1 andA −1 A can be formed. The term ‘inverse’ cannot be applied to a matrix which
isnotsquare.
( ) 2 1
show thatthe matrix
3 2
( ) 2 −1
isthe inverse ofA.
−3 2
Solution Forming the products
( )( ) ( )
2 1 2 −1 1 0
=
3 2 −3 2 0 1
( )( ) ( )
2 −1 2 1 1 0
=
−3 2 3 2 0 1
wesee that
( ) 2 −1
is the inverse ofA.
−3 2
8.6.1 Findingtheinverseofamatrix
For 2 ×2 matrices a simple formula exists tofind the inverse of
( ) a b
A =
c d
This formulastates
IfA=
( ) a b
thenA
c d
−1 =
1
ad−bc
8.6 The inverse of a 2 × 2 matrix 275
( ) d −b
.
−c a
Example8.20 IfA =
( ) 3 5
findA
1 2
−1 .
Solution Clearlyad −bc = 6 −5 = 1,so that
A −1 = 1 ( ) ( ) 2 −5 2 −5
=
1 −1 3 −1 3
The solution should always be checked by formingAA −1 .
Example8.21 IfA =
( ) 1 5
findA
2 4
−1 .
Solution Herewehavead −bc = 4 −10 = −6.Therefore
⎛
A −1 = 1 ( ) − 2 5
⎞
4 −5 ⎜
=
3 6⎟
−6 −2 1
⎝
1
⎠
3 −1 6
Example8.22 IfA =
( ) 2 4
findA
1 2
−1 .
1
Solution Thistime,ad −bc = 4 −4 = 0,sowhenwecometoform
ad−bc wefind1/0which
isnotdefined. We cannot form the inverse ofAinthiscase; itdoes notexist.
Clearly not all square matrices have inverses. The quantity ad − bc is obviously the
important determining factor since only if ad −bc ≠ 0 can we find A −1 . This quantity
is therefore given a special name: the determinant ofA, denoted by |A|, or detA.
Given any 2 × 2 matrix A, its determinant, |A|, is the scalar ad − bc. This is easily
remembered as
[product of ց diagonal] −[product of ւ diagonal]
276 Chapter 8 Matrix algebra
( )
a b
If A is the matrix , we write its determinant as
c d
∣ a b
c d∣ . Note that the straight
lines || indicate that we are discussing the determinant, which is a scalar, rather than
the matrix itself. If the matrixAis such that |A| = 0, then it has no inverse and is said
tobe singular. If |A| ≠ 0 thenA −1 exists andAissaidtobe non-singular.
Asingular matrixAhas |A| = 0.
Anon-singular matrixAhas |A| ≠ 0.
Example8.23 IfA =
( ) 1 2
andB=
5 0
( ) −1 2
find |A|, |B|and |AB|.
−3 1
Solution |A| =
∣ 1 2
5 0∣ = (1)(0) − (2)(5) = −10
|B| =
∣ −1 2
−3 1∣ = (−1)(1) − (2)(−3) =5
( )( ) ( )
1 2 −1 2 −7 4
AB =
=
5 0 −3 1 −5 10
|AB| = (−7)(10) − (4)(−5) = −50
We note that |A||B| = |AB|.
The resultobtained inExample 8.23 istruemore generally:
IfAandBaresquare matrices of the same order, |A||B| = |AB|.
8.6.2 Orthogonalmatrices
A non-singular square matrixAsuch thatA T = A −1 is said to be orthogonal. Consequently,
ifAisorthogonalAA T =A T A =I.
Example8.24 Find the inverse ofA =
( ) 0 −1
. Deduce thatAisan orthogonal matrix.
1 0
Solution From the formula forthe inverse of a 2 ×2 matrix wefind
A −1 = 1 ( ) ( ) 0 1 0 1
=
1 −1 0 −1 0
This isclearly equal tothe transpose ofA. HenceAisan orthogonal matrix.
To findthe inversesoflargermatricesweshallneedtostudydeterminantsfurther.This
isdone inSection8.7.
8.6 The inverse of a 2 × 2 matrix 277
EXERCISES8.6
1 IfA=
( ) 5 6
findA
−4 8
−1 .
2 Find the inverse, ifitexists,ofeach ofthe following
matrices:
( ) ( ) ( )
1 0 −1 0 2 3
(a) (b) (c)
0 1 0 −1 4 1
( ) ( ) ( )
−1 0 6 2 −6 2
(d) (e) (f)
−1 7 9 3 9 3
⎛ ⎞
1 1
(g) ⎜2
2
⎝
0 1 ⎟
⎠
2
( ) ( ) 3 0 7 8
3 IfA= andB=
−1 4 4 3
find |AB|, |BA|.
( ) a b
4 IfA= ,B =
c d
findAB, |A|, |B|, |AB|.
( ) e f
g h
Verifythat |AB| = |A||B|.
( ) 1 2
5 IfA= findA
3 4
−1 .
Find values ofthe constantsaandb
such thatA +aA −1 =bI.
( ) ( )
1 1 2 1
6 IfA= andB=
0 3 −1 3
findAB, (AB) −1 ,B −1 ,A −1 andB −1 A −1 .
Deduce that (AB) −1 =B −1 A −1 .
7 Given that the matrix
⎛
⎞
cos ωt −sin ωt 0
M = ⎝sin ωt cos ωt 0⎠
0 0 1
is orthogonal, findM −1 .
( ) a b
8 (a) IfA= andkis ascalar constant,
c d
showthat the inverse ofthe matrixkA
is 1 k A−1 .
( ) 1 1
(b) Findthe inverse of andhence write
1 0
⎛ ⎞
1 1
down the inverse of ⎜3
3
⎟
⎝1
3 0 ⎠ .
Solutions
1
64
( ) 8 −6
4 5
1
⎛
( ) ( ) 1 0 −1 0
− 1 ⎞
3
2 (a) (b) (c) ⎜ 10 10
0 1 0 −1 ⎝ 2
− 1 ⎟
⎠
5 5
⎛
( ) −1 0
(d)
− 1 − 1 ⎞
1
1 (e) No inverse (f) ⎜ 12 18
⎟
⎝ 1 1 ⎠
7 7
( )
4 6
2 −2
(g)
0 2
3 −132, −132
4
( ) ae +bg af+bh
,
ce+dg cf+dh
ad−bc,eh−fg,
(ad −bc)(eh − fg)
( ) −2 1
5 3
2 −1 a=−2,b=5
2
( ) 1 4
6 AB= , (AB)
−3 9
−1 = 1 ( ) 9 −4
,
21 3 1
B −1 = 1 ( ) 3 −1
,A
7 1 2
−1 = 1 ( ) 3 −1
3 0 1
⎛
⎞
cos ωt sinωt 0
7 ⎝−sin ωt cos ωt 0⎠
0 0 1
( ) ( ) 0 1 0 3
8 (b) ,
1 −1 3 −3
278 Chapter 8 Matrix algebra
8.7 DETERMINANTS
⎛ ⎞
a 11
a 12
a 13
IfA= ⎝a 21
a 22
a 23
⎠, the value of itsdeterminant, |A|, isgiven by
a 31
a 32
a 33
∣ ∣ ∣ ∣ ∣ ∣ ∣∣∣ a
|A| =a 22
a ∣∣∣ ∣∣∣ 23
a
11
−a 21
a ∣∣∣ ∣∣∣ 23
a
a 32
a 12
+a 21
a ∣∣∣ 22
33
a 31
a 13
33
a 31
a 32
If we choose an element ofA, a ij
say, and cross out its row and column and form the
determinant of the four remaining elements, this determinant is known as theminor of
the elementa ij
.
Amoment’s study will therefore reveal thatthe determinant ofAisgiven by
|A| = (a 11
×its minor) − (a 12
×its minor) + (a 13
×its minor)
This method of evaluating a determinant isknown asexpansion along the first row.
Example8.25 Find the determinant of the matrix
⎛ ⎞
121
A = ⎝−134⎠
512
Solution The determinant ofA, written as
121
−134
∣ 512∣
isfound by expanding along itsfirst row:
|A|=1
∣ 3 4
∣ ∣ ∣∣∣ 1 2∣ −2 −1 4
∣∣∣ 5 2∣ +1 −1 3
5 1∣
= 1(2) −2(−22) +1(−16)
= 2+44−16
= 30
Example8.26 Find the minors ofthe elements 1 and 4 inthe matrix
⎛ ⎞
723
B = ⎝103⎠
042
Solution To find the minor of 1 delete its row and column to form the determinant
∣ 2 3
4 2∣ . The
required minor istherefore 4 −12 = −8.
Similarly, the minor of 4 is
∣ 7 3
1 3∣ =21−3=18.
8.7 Determinants 279
In addition to finding the minor of each element in a matrix, it is often useful to find a
related quantity -- the cofactor of each element. The cofactor is found by imposing on
the minor a positive or negative sign depending upon its position, that is a place sign,
according tothe following rule:
+ − +
− + −
+ − +
Example8.27 If
⎛ ⎞
3 27
A = ⎝9 10⎠
3−12
find the cofactors of 9 and 7.
Solution The minor of 9 is
2 7
∣−1 2∣ = 4 − (−7) = 11, but since its place sign is negative, the
required cofactor is −11.
The minor of 7 is
∣ 9 1
3 −1∣ = −9 − 3 = −12. Its place sign is positive, so that the
required cofactor issimply −12.
8.7.1 Usingdeterminantstofindvectorproducts
Determinants can also be used to evaluate the vector product of two vectors. If a =
a 1
i +a 2
j +a 3
k and b = b 1
i +b 2
j +b 3
k, we showed in Section 7.6 that a × b is the
vector defined by
a×b = (a 2
b 3
−a 3
b 2
)i+ (a 3
b 1
−a 1
b 3
)j+ (a 1
b 2
−a 2
b 1
)k
If weconsider the expansion of the determinant given by
∣ i jk ∣∣∣∣∣
a 1
a 2
a 3
∣b 1
b 2
b 3
we find the same result. This definition is therefore a convenient mechanism for evaluating
a vector product.
Ifa =a 1
i+a 2
j+a 3
kandb =b 1
i+b 2
j+b 3
k,then
∣ i jk ∣∣∣∣∣
a×b=
a 1
a 2
a 3
∣b 1
b 2
b 3
280 Chapter 8 Matrix algebra
Example8.28 Ifa=3i+j−2kandb=4i+5kfinda×b.
Solution We have
ij k
a×b =
31−2
∣40 5∣
=5i−23j−4k
8.7.2 Cramer’srule
A useful application of determinants is to the solution of simultaneous equations. Consider
the case of three simultaneous equations inthreeunknowns:
a 11
x+a 12
y+a 13
z = b 1
a 21
x+a 22
y+a 23
z = b 2
a 31
x+a 32
y+a 33
z = b 3
Cramer’s rulestates thatx,yandzaregiven by the following ratios of determinants.
Cramer’srule:
∣ ∣ ∣ b 1
a 12
a ∣∣∣∣∣
13
a 11
b 1
a ∣∣∣∣∣
13
a 11
a 12
b ∣∣∣∣∣ 1
b 2
a 22
a 23
a 21
b 2
a 23
a 21
a 22
b 2
∣b 3
a 32
a ∣ 33
a
x =
∣ 31
b 3
a ∣ 33
a
y =
∣ 31
a 32
b
z =
3
∣ a 11
a 12
a ∣∣∣∣∣ 13
a 11
a 12
a ∣∣∣∣∣ 13
a 11
a 12
a ∣∣∣∣∣ 13
a 21
a 22
a 23
a 21
a 22
a 23
a 21
a 22
a 23
∣a 31
a 32
a ∣ 33
a 31
a 32
a ∣ 33
a 31
a 32
a 33
Note that in all cases the determinant in the denominator is identical and its elements
arethecoefficientsonthel.h.s.ofthesimultaneousequations.Whenthisdeterminantis
zero, Cramer’s method will clearly fail.
Example8.29 Solve
3x+2y−z =4
2x−y+2z =10
x−3y−4z =5
Solution We find
4 2−1
10−1 2
∣ 5−3−4∣
x =
3 2 −1
2−1 2
∣1−3−4∣
= 165
55 = 3
Verifyfor yourself thaty = −2 andz = 1.
8.8 The inverse of a 3 × 3 matrix 281
EXERCISES8.7
∣ ∣ 1 Find
∣ 4 6
∣∣∣∣∣ 13 4
∣∣∣∣∣ 2 8∣ , 672
21 0
35 −1∣ and 143
−114∣ .
2 Find
cos ωt sin ωt
∣−sin ωt cos ωt∣ .
∣ 500
∣∣∣∣∣ 900
3 Evaluate
632
∣457∣ and 070
008∣ .
⎛ ⎞
2 −17
4 IfA= ⎝0 84⎠,find |A|and |A T |.
3 64
Comment uponyour result.
5 UseCramer’s ruleto solve
(a) 2x−3y+z=0
5x+4y+z=10
2x−2y−z=−1
(b) 3x+y=−1
2x−y+z=−1
5x+5y−7z=−16
(c) 4x+y+z=13
2x−y=4
x+y−z=−3
(d) 3x+2y=1
x+y−z=1
2x+3z=−1
6 Given
⎛ ⎞
37 6
A = ⎝−21 0⎠
42 −5
(a) find |A|
(b) findthe cofactors ofthe elements ofrow 2,that is
−2,1, 0
(c) calculate
−2 × (cofactorof −2)
+1 × (cofactorof1)
+0 × (cofactorof0).
What doyou deduce?
7 Ifa=7i+11j−2kandb=6i−3j+kfinda×b.
8 Find a ×bwhen
(a) a=3i−j,b=i+j+k
(b) a=2i+j+k,b=7k
(c) a=−7j−k,b=−3i+j
Solutions
1 20,33,39
2 1
3 55,504
4 −164, −164 Note |A| = |A T |
5 (a) x=y=z=1
(b) x=−1,y=2,z=3
(c) x=2,y=0,z=5
(d) x=1,y=−1,z=−1
6 (a) −133 (b) 47, −39,22 (c) −133
7 5i−19j−87k
8 (a) −i−3j+4k
(b) 7i −14j
(c) i+3j−21k
8.8 THEINVERSEOFA3×3MATRIX
Given a 3×3 matrix,A, its inverse isfound as follows:
(1) Find the transpose ofA, by interchanging the rows and columns ofA.
(2) ReplaceeachelementofA T byitscofactor;byitsminortogetherwithitsassociated
place sign. The resulting matrix isknown as theadjoint ofA, denoted adj(A).
282 Chapter 8 Matrix algebra
(3) Finally, the inverse ofAisgiven by
A −1 = adj(A)
|A|
Example8.30 Findthe inverse of
⎛ ⎞
1−20
A = ⎝ 3 15⎠
−1 23
⎛ ⎞
13−1
Solution A T = ⎝−21 2⎠
05 3
ReplacingeachelementofA T byits cofactor, wefind
⎛ ⎞
−7 6 −10
adj(A) = ⎝−14 3 −5⎠
70 7
ThedeterminantofAisgiven by
|A|=1
∣ 1 5
∣ ∣ ∣∣∣ 2 3∣ − (−2) 3 5
∣∣∣ −1 3∣ +0 3 1
−1 2∣
Therefore,
= (1)(−7) + (2)(14)
= 21
A −1 = adj(A)
|A|
⎛ ⎞
= 1 −7 6 −10
⎝−14 3 −5⎠
21
70 7
Note thatthis solution should be checked by formingAA −1 togiveI.
It is clear that should |A| = 0 then no inverse will exist since then the quantity 1/|A| is
undefined.Recallthatsuchamatrix issaidtobesingular.
Forany squarematrixA, the following statements are equivalent:
|A|=0
Aissingular
Ahas no inverse
8.9 Application to the solution of simultaneous equations 283
EXERCISES8.8
1 Find adj(A), |A| and, ifitexists,A −1 ,if
⎛ ⎞
2 −3 1
(a) A= ⎝5 4 1⎠
2 −2 −1
⎛ ⎞
3 1 0
(b) A= ⎝2 −1 1⎠
5 5 −7
⎛ ⎞
2 −1 4
(c) A= ⎝5 −2 9⎠
3 2 −1
⎛ ⎞
10 −5 −4
2 IfP= ⎝−5 10 −3⎠,find adj(P)and |P|.
−4 −3 8
DeduceP −1 .
Solutions
1 (a) |A| = −43
⎛ ⎞
−2 −5 −7
adj(A) = ⎝ 7 −4 3 ⎠ ,
−18 −2 23
⎛ ⎞
A −1 = 1 25 7
⎝ −74 −3 ⎠
43
18 2 −23
(b) |A| = 25
⎛ ⎞
2 7 1
adj(A) = ⎝19 −21 −3⎠,
15 −10 −5
⎛ ⎞
A −1 = 1 2 7 1
⎝19 −21 −3⎠
25
15 −10 −5
(c) |A| = 0
⎛
⎞
−16 7 −1
adj(A) = ⎝ 32 −14 2 ⎠
16 −7 1
A −1 does notexist.
2 |P| = 230
⎛ ⎞
71 52 55
adj(P) = ⎝52 64 50⎠
55 50 75
⎛ ⎞
P −1 = 1 71 52 55
⎝52 64 50⎠
230
55 50 75
8.9 APPLICATIONTOTHESOLUTIONOFSIMULTANEOUS
EQUATIONS
Thematrixtechniqueswehavedevelopedallowthesolutionofsimultaneousequations
tobefound inasystematic way.
Example8.31 Useamatrix methodtosolve the simultaneousequations
2x+4y=14
x−3y=−8
(8.6)
Solution We first notethatthe systemofequationscan bewritteninmatrix form asfollows:
( ( ) 2 4 x
=
1 −3)
y
( ) 14
−8
(8.7)
284 Chapter 8 Matrix algebra
To understand this expression it is necessary that matrix multiplication has been fully
mastered, for, by multiplying outthe l.h.s.,wefind
( ( ) ( )
2 4 x 2x +4y
=
1 −3)
y 1x−3y
and the form (8.7)follows immediately.
We can writeEquation (8.7) as
AX=B (8.8)
( )
( ( )
2 4 x 14
whereAisthe matrix ,X isthe matrix andBis the matrix .
1 −3 y)
−8
( x
In order to findX = it is now necessary to makeX the subject of the equation
y)
AX =B. We can premultiply Equation (8.8) byA −1 , the inverse ofA, provided such an
inverse exists,togive
A −1 AX =A −1 B
Then, noting thatA −1 A =I, we find
that is
IX =A −1 B
X =A −1 B
usingthepropertiesoftheidentitymatrix.WehavenowmadeX thesubjectoftheequationasrequiredandweseethattofindX
wemustpremultiplyther.h.s.ofEquation(8.8)
by the inverse ofA.
Inthiscase
A −1 = 1 ( ) −3 −4
−10 −1 2
( ) 3/10 2/5
=
1/10 −1/5
and
A −1 B =
=
( ) ( ) 3/10 2/5 14
1/10 −1/5 −8
( 1
3)
( x
that is,X = =
y)
( 1
3)
, so thatx = 1 andy = 3 isthe required solution.
IfAX =BthenX =A −1 BprovidedA −1 exists.
Thistechniquecanbeappliedtothreeequationsinthreeunknownsinananalogousway.
8.9 Application to the solution of simultaneous equations 285
Example8.32 Express the following equations inthe formAX =Band hence solve them:
3x+2y−z =4
2x−y+2z =10
x−3y−4z =5
Solution Usingthe rules of matrix multiplication, wefind
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
3 2−1 x 4
⎝2−1 2⎠
⎝y⎠ = ⎝10⎠
1−3−4 z 5
whichisintheformAX =B.ThematrixAiscalledthecoefficientmatrixandissimply
the coefficients ofx,yandzinthe equations. As before,
AX=B
A −1 AX = A −1 B
IX =X=A −1 B
We musttherefore find the inverse ofAinorder tosolve the equations.
To invertAweuse the adjoint. If
⎛ ⎞
3 2−1
A = ⎝2−1 2⎠
1−3−4
then
⎛ ⎞
3 2 1
A T = ⎝ 2−1−3⎠
−1 2−4
and you should verify thatadj(A) isgiven by
⎛ ⎞
10 11 3
adj(A) = ⎝ 10 −11 −8⎠
−5 11 −7
The determinant ofAis found by expanding along the first row:
|A|=3
∣ −1 2
∣ ∣ ∣∣∣ −3 −4∣ −2 2 2
∣∣∣ 1 −4∣ −1 2 −1
1 −3∣
Therefore,
= (3)(10) − (2)(−10) − (1)(−5)
=30+20+5
= 55
A −1 = adj(A)
|A|
⎛ ⎞
= 1 10 11 3
⎝ 10 −11 −8⎠
55
−5 11 −7
286 Chapter 8 Matrix algebra
Finally, the solutionX isgiven by
⎛ ⎞ ⎛ ⎞⎛
⎞
x
X = ⎝y⎠ =A −1 B = 1 10 11 3 4
⎝10 −11 −8⎠⎝10⎠
z
55
−5 11 −7 5
⎛ ⎞
3
= ⎝−2⎠
1
that is,the solution isx = 3,y = −2 andz = 1.
EXERCISES8.9
1 Byexpressing the following equations in matrixform
andfinding an inverse matrix, solve
(a) 4x−2y=14
2x+ y=5
(b) 2x−2y=0
x+3y=−8
(c) 8x+3y=59
−2x+ y=−13
2 Solve the following equationsAX =Bbyfinding
A −1 ,ifitexists.
( ( ) ( ) 6 3 x 12
(a) =
5 2)
y 9
(b)
(c)
(d)
(e)
(f)
( ) ( ) 4 4 x 20
=
1 3)(
y 11
( ) ( 2 −1 x −4
=
3 2)(
y 1)
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
4 13 x 20
⎝2 −14⎠
⎝y⎠ = ⎝20⎠
0 15 z 20
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
4 13 x 15
⎝2 −14⎠
⎝y⎠ = ⎝12⎠
0 15 z 17
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
4 13 x 0
⎝2 −14⎠
⎝y⎠ = ⎝0⎠
0 15 z 0
Solutions
1 (a)x=3,y=−1
(b) x=−2,y=−2
(c)x=7,y=1
2 (a) x=1,y=2 (b) x=2,y=3
(c) x=−1,y=2 (d) x=2,y=0,z=4
(e) x=1,y=2,z=3 (f) x=y=z=0
8.10 GAUSSIANELIMINATION
An alternative technique for the solution of simultaneous equations is that of Gaussian
eliminationwhich weintroduce bymeansofthe following trivialexample.
Example8.33 UseGaussian eliminationtosolve
2x+3y=1
x+y=3
Solution Firstconsider the equations with a steppattern imposed as follows:
2x+3y=1 (1)
x +y =3 (2)
8.10 Gaussian elimination 287
Ouraimwillbetoperformvariousoperationsontheseequationstoremoveoreliminate
all the values underneath the step. You will probably remember from your early work
on simultaneous equations that in order to eliminate a variable from an equation, that
equationcanbemultipliedbyanysuitablenumberandthenaddedtoorsubtractedfrom
another equation. In this example we can eliminate thexterm from below the step by
multiplying the second equation by 2 and subtracting the first equation. Since the first
equationisentirelyabovethestepweshallleaveitasitstands.Thiswholeprocesswill
be writtenas follows:
R 1
R 2
→2R 2
−R 1
2x+3y=1
0x −y =5
(8.9)
where the symbolR 1
means that Equation (1) is unaltered, andR 2
→ 2R 2
−R 1
means
that Equation (2) has been replaced by 2 × Equation (2) -- Equation (1). All this may
seem to be overcomplicating a simple problem but a moment’s study of Equation (8.9)
will reveal why this ‘stepped’ form is useful. Because the value under the step is zero
wecan read offyfrom the lastline, thatis −y = 5,so that
y=−5
Knowingywe can then move up tothe first equation and substituteforytofindx.
2x +3(−5) =1
x = 8
This laststage isknown asback substitution.
Beforeweconsideranother example,let usnotesome important points:
(1) It is necessary to write down the operations used as indicated previously. This aids
checkingand provides a record ofthe working used.
(2) Theoperationsallowedtoeliminate unwanted variables are:
(a) any equation can bemultipliedby any non-zeroconstant;
(b) any equation can beadded toorsubtractedfromany other equation;
(c) equationscan beinterchanged.
It is often convenient to use matrices to carry out this method, in which case the operationsallowedarereferredtoasrowoperations.Theadvantageofusingmatricesisthat
it is unnecessary to write downx,y(and laterz) each time. To do this, wefirst form the
augmentedmatrix:
( ) 231
113
( )
( 2 3
1
so called because the coefficient matrix is augmented by the r.h.s. matrix .
1 1
3)
Itistobeunderstoodthatthisnotationmeans2x+3y = 1,andsoon,sothatwenolonger
write down x and y. Each row of the augmented matrix corresponds to one equation.
288 Chapter 8 Matrix algebra
The aim, as before, istocarryout rowoperations on the stepped form
( )
231
1 1 3
inordertoobtainvaluesofzerounderthestep.Clearlytoachievetherequiredform,the
row operations we performed earlier arerequired, thatis
( )
R 1 2 31
R 2
→2R 2
−R 1 0 −1 5
The last line means 0x −1y = 5, that isy = −5, and finally back substitution yieldsx,
as before.
This technique has other advantages in that it allows us to observe other forms of
behaviour.Weshallseethatsomeequationshaveauniquesolution,somehavenosolutions,whileothers
have an infinite number.
Example8.34 UseGaussian eliminationtosolve
2x+3y=4
4x+6y=7
Solution Inaugmentedmatrix form wehave
( )
234
4 6 7
We proceed toeliminate entries under the step:
( )
R 1 23 4
R 2
→R 2
−2R 1 0 0 −1
Study of the last line seems to imply that 0x + 0y = −1, which is clearly nonsense.
When this happens the equations have no solutions and we say that the simultaneous
equations areinconsistent.
Example8.35 UseGaussian eliminationtosolve
x+y=0
2x+2y=0
Solution Inaugmentedmatrix form wehave
( )
110
2 2 0
Eliminating entries under the stepwefind
( )
R 1 110
R 2
→R 2
−2R 1 0 0 0
This last line implies that 0x + 0y = 0. This is not an inconsistency, but we are now
observing a third type of behaviour. Whenever this happens we need to introduce what
are called free variables. The first row starts off with a non-zero x. There is now no
row which starts off with a non-zero y. We therefore say y is free and choose it to be
8.10 Gaussian elimination 289
anything we please, thatis
y = λ
λisour free choice
Then back substitution occurs as before:
x+λ=0
x=−λ
The solution is therefore x = −λ, y = λ, where λ is any number. There are thus an
infinite number of solutions, forexample
or
x=−1 y=1
x = 1 2
y = − 1 2
and so on.
Observation of the coefficient matrices in the last two examples shows that they have a
determinantofzero.Wheneverthishappensweshallfindtheequationseitherareinconsistentorhaveaninfinitenumberofsolutions.Weshallnextconsiderthegeneralization
ofthismethodtothreeequationsinthreeunknowns.
Example8.36 Solve byGaussian elimination
x−4y−2z =21
2x+y+2z =3
3x+2y−z = −2
Solution We first formthe augmentedmatrix and add the steppedpatternasindicated:
⎛
⎞
1−4−2 21
⎝ 2 1 2 3 ⎠
3 2 −1 −2
The aim is to eliminate all numbers underneath the steps by carrying out appropriate
rowoperations.Thisshouldbecarriedoutbyeliminatingunwantednumbersinthefirst
column first. We find
R 1
R 2
→R 2
−2R 1
R 3
→R 3
−3R 1
⎛
⎝
1−4−2 21
0 9 6−39
0 14 5 −65
We have combined the elimination of unwanted numbers in the first column into one
stage.We now remove unwanted numbers inthe second column:
⎛
⎞
R 1 1−4 −2 21
R 2
⎜ 0 9 6 −39 ⎟
R 3
→R 3
− 14 ⎝
9 R 2 0 0 − 13 ⎠
3 −13 3
and the elimination is complete. AlthoughR 3
→R 3
+ 14 4 R 1
would eliminate the 14, it
would reintroduce a non-zero term into the first column. It is therefore essential to use
the second row and not the first to eliminate this element. We can now read offzsince
⎞
⎠
290 Chapter 8 Matrix algebra
thelastequationstates0x +0y − 13 3 z = −13 ,thatisz = 1.Backsubstitutionofz = 1
3
in the second equation gives y = −5 and, finally, substitution of z and y into the first
equation givesx = 3.
Example8.37 Solve the following equationsby Gaussian elimination:
x−y+z =3
x+5y−5z =2
2x+y−z =1
Solution Forming the augmented matrix,wefind
⎛ ⎞
1−1 13
⎝ 1 5−52⎠
2 1 −1 1
Then, as before, we aim to eliminate all non-zero entries under the step. Starting with
those inthe first column, wefind
⎛
R 1
R 2
→R 2
−R 1
⎝ 1 −1 1 3
⎞
0 6−6−1⎠
R 3
→R 3
−2R 1 0 3 −3 −5
Then,
R 1
R 2
R 3
→2R 3
−R 2
⎛
⎝ 1 −1 1 3
0 6−6−1
0 0 0 −9
Thislastlineimpliesthat0x+0y+0z = −9,whichisclearlyinconsistent.Weconclude
that thereareno solutions.
⎞
⎠
You will see from Examples 8.36 and 8.37 that not only have all entries under the step
beenreducedtozero,butalsoeachsuccessiverowcontainsmoreleadingzerosthanthe
previousone.Wesaythesystemhasbeenreducedtoechelonform.Moregenerallythe
systemhasbeenreducedtoechelonformiffori < jthenumberofleadingzerosinrow
j islarger thanthe number inrowi. Consider Example 8.38.
Example8.38 Solve the following equationsby Gaussian elimination:
2x−y+z =2
−2x+y+z =4
6x−3y−2z = −9
Solution Forming the augmented matrix,wehave
⎛
⎝ 2 −1 1 2
⎞
−2 1 1 4 ⎠
6 −3 −2 −9
Eliminating the unwanted values inthe first column, wefind
⎛
R 1
R 2
→R 2
+R 1
⎝ 2 −1 1 2
⎞
0 0 2 6 ⎠
R 3
→R 3
−3R 1 0 0 −5 −15
8.10 Gaussian elimination 291
Entriesunderthesteparenowzero.Toreducethematrixtoechelonformwemustensure
each successive row has more leading zeros than the row before.We continue:
⎛
R 1
R 2
⎝ 2−112
⎞
0 026⎠
R 3
→ 2R 3
+5R 2 0 0 0 0
which isnow inechelon form.
In this form there is a row which starts off with a non-zeroxvalue, that is the first
row, there is a row which starts off with a non-zerozvalue, but no row which starts off
withanon-zeroyvalue.Therefore,wechooseytobethefreevariable,y = λsay.From
thesecondrowwehavez = 3andfromthefirst2x −y+z = 2,sothat2x = λ−1,that
isx = (λ −1)/2.
Engineeringapplication8.2
TheVandermondeMatrix
Many data storage devices, for example the Blu-ray disc , and data transmission
standards, for example WiMAX ® , have built-in techniques to reduce the effects of
errors which occur during normal use. These errors originate from noise and interference
and they can result in the loss of data. One such technique is the use of an
error-correctingcodetoprovideameasureofprotectionagainstdataerrors.Errorcorrecting
codes are used to encode the data when it is stored or transmitted. The
data is then decoded when it is read or received. In the case of a transmission line,
theerror-correctingcodemakesitpossibletodetecterrorsinthereceivedsignaland
tomake corrections,so thatthe errors arenotsubsequently retransmitted.
Oneimportantclassoferror-correctingcodesarebasedonReed-Solomoncodes,
details ofwhicharebeyond the scopeofthistext.However animportantmathematical
concept usedinReed-Solomoncodes is thatofaVandermondematrix.
The Vandermonde matrix can be illustrated by considering the problem of representing
a signal by a polynomial. For example, to approximate a signal f (t) by a
second-degree polynomialwewrite
f(t)≈a 0
+a 1
t+a 2
t 2
wherea 0
,a 1
anda 2
are coefficients of the polynomial which must be found. These
are found by forcing the original signal f (t) and its polynomial approximation to
agree at three different values oft, sayt 0
,t 1
andt 2
. This gives rise to the following
systemofequations:
⎛
1t 0
t 2 ⎞ ⎛ ⎞ ⎛ ⎞
0 a
⎜
⎝1t 1
t1
2 ⎟ 0
f(t 0
)
⎠ ⎝a 1
⎠ = ⎝f(t 1
) ⎠
1t 2
t2
2 a 2
f(t 2
)
➔
292 Chapter 8 Matrix algebra
ThecoefficientmatrixisknownasaVandermondematrix.Supposewewishtofind
asecond-degreepolynomialapproximationtothesignal f (t) = cos πt
2 forvaluesof
t between −1 and 1. We can do this by making the approximating polynomial and
the original signal equal at three points, sayt = −1,t = 0 andt = 1. The equations
tobe solved arethen
⎛ ⎞⎛
⎞ ⎛ ⎞ ⎛ ⎞
1−11 a 0
f(−1) 0
⎝1 00⎠⎝a 1
⎠ = ⎝ f(0) ⎠ = ⎝1⎠
1 11 a 2
f(1) 0
It is straightforward to solve this system of equations by Gaussian elimination and
obtaina 0
=1,a 1
=0 anda 2
= −1. Therefore the second-degree polynomial which
approximates f (t) = cos πt
2 is1−t2 .
8.10.1 Findingtheinversematrixusingrowoperations
AsimilartechniquecanbeusedtofindtheinverseofasquarematrixAwherethisexists.
Suppose wearegiven the matrixAand wish tofind its inverseB. Thenweknow
AB=I
that is,
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
a 11
a 12
a 13
b 11
b 12
b 13
100
⎝a 21
a 22
a 23
⎠ ⎝b 21
b 22
b 23
⎠ = ⎝010⎠
a 31
a 32
a 33
b 31
b 32
b 33
001
We form the augmentedmatrix
⎛
⎞
a 11
a 12
a 13
1 0 0
⎝a 21
a 22
a 23
0 1 0⎠
a 31
a 32
a 33
0 0 1
Nowcarryoutrowoperationsonthismatrixinsuchawaythatthel.h.s.isreducedtoa
3 ×3identitymatrix.Thematrixwhichthenremainsonther.h.s.istherequiredinverse.
Example8.39 Findthe inverse of
⎛ ⎞
−18 −2
A = ⎝−6 49 −10⎠
−4 34 −5
by rowreduction tothe identity.
Solution We form the augmentedmatrix
⎛
⎞
−1 8 −2 100
⎝−649−10 010⎠
−434 −5 001
We now carry out row operations on the whole matrix to reduce the l.h.s. to an identity
matrix. This means we must eliminate all the elements off the diagonal. Work through
the following calculationyourself:
⎛
R 1
R 2
→R 2
−6R 1
⎝
R 3
→R 3
−4R 1
⎞
−18−2 100
01 2−610⎠
02 3−401
8.10 Gaussian elimination 293
Thishasremovedalltheoff-diagonalentriesincolumn1.Toremovethoseincolumn2:
⎛
⎞
R 1
→R 1
−8R 2
−10−18 49−80
R 2
⎝ 01 2−6 10⎠
R 3
→R 3
−2R 2
00 −1 8−21
To remove those incolumn 3:
⎛
⎞
R 1
→R 1
−18R 3
−1 0 0 −95 28 −18
R 2
→R 2
+2R 3
⎝ 01 0 10−3 2⎠
R 3
00−1 8−2 1
We mustnow adjust the ‘−1’ entries toobtain the identitymatrix:
⎛
⎞
R 1
→ −R 1
100 95−28 18
R 2
⎝ 010 10 −3 2⎠
R 3
→ −R 3
001−8 2−1
Finally, the required inverse isthe matrix remaining on the r.h.s.:
⎛ ⎞
95 −28 18
⎝ 10 −3 2⎠
−8 2 −1
You should check thisresultby evaluatingAA −1 .
EXERCISES8.10
1 Solvethe followingequations byGaussian
elimination:
(a) 2x−3y=32
3x+7y=−21
(b) 2x+y−3z=−5
x−y+2z=12
7x−2y+3z=37
(c) x+y−z=1
3x−y+5z=3
7x+2y+3z=7
(d) 2x+y−z=−9
3x−2y+4z=5
−2x−y+7z=33
(e) 4x+7y+8z=2
5x+8y+13z=0
3x+5y+7z=1
2 Use Gaussian elimination to solve
x+y+z=7
x−y+2z=9
2x+y−z=1
3 Find the inversesofthe following matrices usingthe
technique ofExample 8.39:
( ) 4 1
(a)
3 2
⎛ ⎞
421
(b) ⎝ 034⎠
−113
⎛ ⎞
103
(c) ⎝ 215⎠
−721
294 Chapter 8 Matrix algebra
Solutions
1 (a) x=7,y=−6
(b) x=3,y=−5,z=2
(c) x=1−µ,y=2µ,z=µ
(d) x=−3,y=1,z=4
(e) Inconsistent
2 x=2,y=1,z=4
3 (a)
(b)
(c)
1
5
( ) 2 −1
−3 4
⎛ ⎞
5 −5 5
⎝−4 13 −16⎠
3 −6 12
⎛ ⎞
−9 6 −3
⎝−37 22 1⎠
11 −2 1
1
15
1
24
8.11 EIGENVALUESANDEIGENVECTORS
We are now in a position to examine the meaning and calculation of eigenvalues and
theircorrespondingeigenvectors.
8.11.1 Solutiontosystemsoflinearhomogeneousequations
Recall that an equation is linear when the variables occur only to the first power. For
example,
2x+3y=1 (1)
isalinear equation but
2x 2 +3y=1 (2)
isanon-linearequation due tothe term2x 2 .
Equation (1) is called inhomogeneous. When the r.h.s. of a linear equation is 0 then
the equation ishomogeneous.Forexample,
2x+3y=0 and 7x−3y=0
are both homogeneous. This section looks at the solution of systems of linear homogeneous
equations.
Consider the simultaneouslinear homogeneousequations
ax+by =0
cx+dy =0
where a, b, c and d are constants. Clearly x = 0, y = 0 is a solution. It is called the
trivial solution. Non-trivial solutions are solutions other thanx = 0, y = 0. We now
study the system to find conditions on a, b, c and d under which non-trivial solutions
exist.
Fordefinitenessweconsidertwo caseswith values ofa,b,candd given.
Case1
3x−5y =0
6x−7y =0
8.11 Eigenvalues and eigenvectors 295
Solving Case 1,forexample by Gaussian elimination, leads tox = 0,y = 0 as the only
possible solution.Thus, the only solution isthe trivialsolution.
Case2
x+y=0
2x+2y =0
This systemwas solved inExample 8.35 usingGaussian elimination toyield
x=−λ,
y=λ
where λ is any number and y is a free variable. Thus there are an infinite number of
solutions.Notethatinthissystemthesecondequation,2x+2y = 0,isamultipleofthe
first equation,x+y = 0.The second equation istwice the first equation.
We now return tothe system
ax+by =0
cx+dy =0
Asseen,dependinguponthevaluesofa,b,candd thesystemhaseitheronlythetrivial
solution or an infinite number of non-trivial solutions. For there to be non-trivial solutionsthesecondequationmustbeamultipleofthefirst.Whenthisisthecase,thencis
a multiple ofaandd is the same multiple ofb, thatis
c = αa, d = αb forsomevalueof α
Inthiscase, consider the quantityad −bc:
ad −bc = a(αb) −b(αa)
= αab− αab
= 0
Hence the condition for non-trivial solutions to exist is thatad −bc = 0. Writing the
systeminmatrix formgives
( ) ( a b x 0
=
c d)(
y 0)
or
where
AX=0
A =
( ) ( ( a b x 0
, X = , 0 =
c d y)
0)
We note that ad −bc is the determinant of A, so non-trivial solutions exist when the
determinant ofAis zero;thatis, whenAisasingular matrix.
Insummary:
296 Chapter 8 Matrix algebra
Consider the system
AX=0
If |A| = 0,the systemhas non-trivial solutions.
If |A| ≠ 0,the systemhas only the trivialsolution.
Example8.40 Decide which of the following systems of equations has non-trivial solutions:
(a) 3x+7y=0
2x−y=0
(b) 2x+y=0
6x+3y=0
Solution (a) We writethe systemas
( ) ( 3 7 x 0
=
2 −1)(
y 0)
Let
Then
A =
( ) 3 7
2 −1
|A| = 3(−1) −2(7) = −17
Since the determinant ofAisnon-zero, the systemhas only the trivialsolution.
(b) We writethe systemas
( ( ) ( 2 1 x 0
=
6 3)
y 0)
Let
A =
( ) 2 1
6 3
fromwhich |A| = 2(3) −1(6) = 0.SincethedeterminantofAis0,thesystemhas
non-trivial solutions.
Example8.41 Determine which of the following systems ofequations has non-trivial solutions:
(a) 2x+y−3z=0
x−3y+2z=0
5x−8y+3z=0
(b) 2x+y−3z=0
x−3y+2z=0
5x−7y+3z=0
8.11 Eigenvalues and eigenvectors 297
Solution (a) We have
AX=0
where
⎛ ⎞ ⎛ ⎞
2 1−3 x
A = ⎝1−3 2⎠ X = ⎝y⎠
5−8 3 z
Evaluation of |A|shows that |A| = 0 and so the systemhas non-trivial solutions.
(b) Here
⎛ ⎞
2 1−3
A = ⎝1−3 2⎠
5−7 3
from which |A| = −7.Since |A| ≠ 0 the systemhas only the trivialsolution.
EXERCISES8.11.1
1 Explain what ismeant by the trivial solution ofa
systemoflinear equations and what is meantby a
non-trivial solution.
2 Determine which ofthe following systemshas
non-trivial solutions:
(a) x−2y=0
3x−6y=0
(b) 3x+y=0
9x+2y=0
(c) 4x−3y=0
−4x+3y=0
(d) 6x−2y=0
2x − 2 3 y = 0
(e) y=2x
x=3y
3 Determine whichofthe following systemshave
non-trivial solutions:
(a) x+2y− z=0
3x+ y+2z=0
x+y =0
(b) 2x−3y−2z=0
3x+ y−3z=0
x−7y− z=0
(c) x+2y+3z=0
4x−3y− z=0
6x+ y+3z=0
(d) x+ 3z=0
x−y =0
y+2z=0
Solutions
2 (a),(c)and(d)have non-trivialsolutions. 3 (a)and(b)have non-trivialsolutions.
8.11.2 Eigenvalues
We will explain the meaning of the term eigenvalue by means of an example. Consider
the system
2x+y =λx
3x+4y=λy
298 Chapter 8 Matrix algebra
where λ is some unknown constant. Clearly these equations have the trivial solution
x = 0,y = 0.The equations may be written inmatrix formas
( ) ( 2 1 x x
= λ
3 4)(
y y)
or, usingthe usual notation,
AX=λX
We now seek values of λ so that the system has non-trivial solutions. Although it is
tempting to write (A − λ)X = 0 this would be incorrect sinceA−λ is not defined:A
is a matrix and λ is constant. Hence to progress we need to write the r.h.s. ( in a slightly
x
differentway.Tohelpusdothisweusethe2 ×2identitymatrix,I.Now λ maybe
y)
expressed as
( ) 1 0 x
λ
0 1)(
y
( x
since multiplying by the identity matrix leaves it unaltered. So λX may be written
y)
as λIX. Hence wehave
AX = λIX
which can bewritten as
AX−λIX =0
(A−λI)X =0
Notethattheexpression (A − λI)isdefinedsincebothAand λI aresquarematrices
of the same size.
WehaveseeninSection8.11.1thatforAX = 0tohavenon-trivialsolutionsrequires
|A| = 0.Hence for
(A−λI)X=0
to have non-trivial solutions requires
|A−λI|=0
Now
( ) ( ) 2 1 1 0
A−λI = − λ
3 4 0 1
( ) ( ) 2 1 λ 0
= −
3 4 0 λ
( ) 2 − λ 1
=
3 4−λ
So the condition |A − λI| = 0 gives
∣ 2 − λ 1
3 4−λ∣ = 0
Itfollows that
so that
(2−λ)(4−λ)−3 =0
λ 2 −6λ+5 =0
(λ−1)(λ−5) =0
λ=1 or 5
8.11 Eigenvalues and eigenvectors 299
Thesearethevaluesof λwhichcausethesystemAX = λX tohavenon-trivialsolutions.
They arecalledeigenvalues.
The equation
|A−λI|=0
which when written out explicitly is the quadratic equation in λ, is called the characteristicequation.
Example8.42 Find values of λfor which
x+4y = λx
2x+3y = λy
has non-trivial solutions.
Solution We write the systemas
where
AX=λX
A =
( ) ( 1 4 x
, X =
2 3 y)
To have non-trivialsolutions we require
|A−λI|=0
Now
( ) ( ) 1 4 1 0
A−λI = − λ
2 3 0 1
( ) ( ) 1 4 λ 0
= −
2 3 0 λ
( ) 1 − λ 4
=
2 3−λ
Hence
|A−λI| = (1−λ)(3−λ)−8
= λ 2 −4λ−5
To have non-trivialsolutions we require
λ 2 −4λ−5 =0
(λ+1)(λ−5) =0
300 Chapter 8 Matrix algebra
which yields
λ=−1 or 5
The given system has non-trivial solutions when λ = −1 and λ = 5. These are the
eigenvalues.
If A is a 2 × 2 matrix, the characteristic equation will be a polynomial of degree 2,
that is a quadratic equation in λ, leading to two eigenvalues. IfAis a 3 ×3 matrix, the
characteristicequationwillbeapolynomialofdegree3,thatisacubic,leadingtothree
eigenvalues. In general ann×n matrix gives rise to a characteristic equation of degree
n and hence toneigenvalues.
The characteristic equation of a square matrixAisgiven by
|A−λI|=0
Solutions of this equation are the eigenvalues of A. These are the values of λ for
whichAX = λX has non-trivial solutions.
Example8.43 Determine the characteristic equation and eigenvalues, λ, inthe system
( ( ) ( 3 1 x x
= λ
−1 5)
y y)
Solution InthisexampletheequationshavebeenwritteninmatrixformwithA =
characteristic equation is given by
|A−λI| =0
( ) ( )∣ 3 1 1 0 ∣∣∣
∣ − λ = 0
−1 5 0 1
∣ 3 − λ 1
−1 5−λ∣ = 0
(3−λ)(5−λ)+1 =0
λ 2 −8λ+16 =0
( ) 3 1
.The
−1 5
The characteristic equation is λ 2 − 8λ + 16 = 0. Solving the characteristic equation
gives
λ 2 −8λ+16 =0
(λ−4)(λ−4) =0
λ = 4 (twice)
There isone repeated eigenvalue, λ = 4.
8.11 Eigenvalues and eigenvectors 301
Example8.44 Find the eigenvalues λinthe system
( ( ) ( 4 1 x x
= λ
3 2)
y y)
Solution We form the characteristic equation, |A − λI| = 0.Now
( ) 4 − λ 1
A−λI=
3 2−λ
Then
|A−λI|=(4−λ)(2−λ)−3=λ 2 −6λ+5
Solving the characteristic equation, λ 2 −6λ +5 = 0,gives
λ=1 or 5
There aretwo eigenvalues, λ = 1, λ = 5.
The process of finding the characteristic equation and eigenvalues of a matrix has
been illustrated using 2 × 2 matrices. This same process can be applied to a square
matrix of any size.
Example8.45 Find (a)the characteristic equation (b)the eigenvalues ofAwhere
⎛ ⎞
1 2 0
A = ⎝−1−1 1⎠
3 2−2
Solution (a) We need tocalculate |A − λI|. Now
⎛
⎞
1−λ 2 0
A−λI= ⎝ −1 −1−λ 1 ⎠
3 2 −2−λ
and ∣∣∣∣∣∣ 1 −λ 2 0
∣ ∣ ∣∣∣
−1 −1−λ 1
3 2 −2−λ∣ = (1−λ) −1 − λ 1
∣∣∣ 2 −2−λ∣ −2 −1 1
3 −2−λ∣
= (1 − λ)[(−1 − λ)(−2 − λ) −2]
−2[−1(−2 − λ) −3]
Upon simplification thisreduces to −λ 3 −2λ 2 + λ +2.Hence
yields
|A−λI|=0
−λ 3 −2λ 2 +λ+2=0
which maybe written as
λ 3 +2λ 2 −λ−2=0
The characteristic equation is λ 3 +2λ 2 − λ −2 = 0.
302 Chapter 8 Matrix algebra
(b) The characteristic equation issolved toyield the eigenvalues:
λ 3 +2λ 2 −λ−2=0
Factorizing yields
(λ+2)(λ+1)(λ−1)=0
from which λ = −2,−1,1.
The eigenvalues are λ = −2,−1,1.
EXERCISES8.11.2
1 Calculate (i) the characteristic equation (ii) the
eigenvalues ofthe systemAX = λX,
whereAisgiven by
( ) ( )
5 6 −3 4
(a) (b)
(c)
2 1
( 7 −2
1 4
)
(d)
−4 5
( ) 1 3
4 −1
2 Calculate (i) the characteristic equation (ii) the
eigenvalues ofthe following 3 ×3 matrices:
⎛ ⎞
1 −12
(a) ⎝−3 −23⎠
2 −11
(b)
(c)
(d)
(e)
⎛ ⎞
10 −1
⎝31 4⎠
02 2
⎛ ⎞
21 2
⎝−11 −1⎠
83 0
⎛ ⎞
−2 62
⎝ 0 34⎠
3 −35
⎛ ⎞
3 −21
⎝ 2 −43⎠
16 −41
Solutions
1 (a) (i) λ 2 −6λ−7=0
(ii) λ = −1,7
(b) (i) λ 2 −2λ+1=0
(ii) λ = 1 (twice)
(c) (i) λ 2 −11λ+30=0
(ii) λ=5,6
(d) (i) λ 2 −13=0
(ii) λ = − √ 13, √ 13
2 (a) (i) −λ 3 +7λ+6=0
(ii) λ = −2, −1,3
(b) (i) λ 3 −4λ 2 −3λ+12=0
(ii) λ = − √ 3, √ 3,4
(c) (i) λ 3 −3λ 2 −10λ+24=0
(ii) λ = −3,2,4
(d) (i) λ 3 −6λ 2 +5λ=0
(ii) λ=0,1,5
(e) (i) λ 3 −13λ+12=0
(ii) λ = −4,1,3
TechnicalComputingExercises8.11.2
Thecalculationofeigenvalues andeigenvectors is
usuallyperformedby abuilt-infunction. Forexample,
in MATLAB ® the functioneigcan used to calculate
the eigenvalues.
To produce asolution to question 1.(a)in the previous
exercise wewould type:
A=[56;21];
eig(A)
8.11 Eigenvalues and eigenvectors 303
which produces:
ans =
7
-1
which isavectorcontaining the eigenvalues.
UseMATLAB ® orasimilar language to confirm the
restofthe eigenvalues given in Exercises8.11.2.
8.11.3 Eigenvectors
We have studied the system
AX=λX
anddeterminedthevaluesof λforwhichnon-trivialsolutionsexist.Thesevaluesof λare
calledeigenvaluesofthesystem,or,moresimply,eigenvaluesofA.Foreacheigenvalue
thereisanon-trivialsolution of the system.This solution iscalled aneigenvector.
Example8.46 Find the eigenvectors of
where
AX=λX
A =
( ) ( 4 1 x
andX=
3 2 y)
Solution We seek solutions ofAX = λX which may be written as
(A−λI)X=0
The eigenvalues werefound inExample 8.44 tobe λ = 1,5.
Firstlywe consider λ = 1.The systemequation becomes
(A−λI)X =0
(A−I)X =0
[( ) ( ( ) ( 4 1 1 0 x 0
− =
3 2 0 1)]
y 0)
( ) ( 3 1 x 0
=
3 1)(
y 0)
Writtenas individual equations wehave
3x+y =0
3x+y =0
Clearlythereisonlyoneequationwhichisrepeated.Aslongasy = −3xtheequationis
satisfied.Thusthereareaninfinitenumberofsolutionssuchasx = 1,y = −3;x = −5,
y = 15;and so on. Generally we write
x=t,y=−3t
for any numbert. Thus the eigenvector corresponding to λ = 1 is
( ( ) ( ) x t 1
X = = =t
y)
−3t −3
304 Chapter 8 Matrix algebra
Notethattheeigenvectorhasbeendeterminedtowithinanarbitraryscalar,t.Thusthere
isan infinity of solutions corresponding to λ = 1.
We now consider λ = 5 and seek solutions of the systemequation:
(A−λI)X =0
[( ) ( ( ) ( 4 1 1 0 x 0
−5 =
3 2 0 1)]
y 0)
( ( ) ( −1 1 x 0
=
3 −3)
y 0)
Written as individual equations we have
−x+y =0
3x−3y =0
Wenotethatthesecondequationissimplyamultipleofthefirstsothatinessencethere
isonlyoneequation.Solving −x+y = 0givesy =xforanyx.Sowewritex ( =t,y =t.
1
Hencetheeigenvectorcorrespondingto λ = 5isX =t .Againtheeigenvectorhas
1)
been determined towithin an arbitraryscaling constant.
Sometimesthearbitraryscalingconstantsarenotwrittendown;itisunderstoodthat
they arethere. Insuch a case wesay the eigenvectors of the systemare
( )
( 1 1
X = and X=
−3 1)
Example8.47 Determine the eigenvectors of
( ( ) ( 3 1 x x
= λ
−1 5)
y y)
Solution InExample8.43wefoundthatthereisonlyoneeigenvalue, λ = 4.Weseekthesolution
of (A−λI)X =0.Withλ =4wehave
( ) ( ) ( )
3 1 1 0 −1 1
A−λI= −4 =
−1 5 0 1 −1 1
Hence (A − λI)X = 0is the same as
( ( ) ( −1 1 x 0
=
−1 1)
y 0)
Thus there isonly one equation, namely
−x+y=0
which has an infinity of solutions:x =t,y =t. Hence thereisone eigenvector:
( 1
X =t
1)
8.11 Eigenvalues and eigenvectors 305
The concept ofeigenvectors iseasilyextended tomatrices ofhigher order.
Example8.48 Determine the eigenvectors of
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1 2 0 x x
⎝−1−1 1⎠
⎝y⎠ = λ ⎝y⎠
3 2−2 z z
The eigenvalues werefound inExample 8.45.
Solution FromExample8.45theeigenvaluesare λ = −2,−1,1.Weconsidereacheigenvaluein
turn.
λ=−2
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1 2 0 x x
⎝−1−1 1⎠
⎝y⎠ = −2⎝y⎠
3 2−2 z z
⎡⎛
⎞ ⎛ ⎞⎤⎛
⎞ ⎛ ⎞
1 2 0 100 x 0
⎣⎝−1−1 1⎠ +2⎝010⎠⎦
⎝y⎠ = ⎝0⎠
3 2−2 001 z 0
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
320 x 0
⎝−111⎠
⎝y⎠ = ⎝0⎠
320 z 0
We note that the first and lastrows areidentical. So wehave
3x +2y =0
−x+y +z=0
Solving these equations gives
x=t, y=− 3 2 t, z=5 2 t
Hence the corresponding eigenvector is
⎛ ⎞
1
− 3 X =t ⎜ 2⎟
⎝ ⎠
5
2
λ=−1
We have
⎡⎛
⎞
1 2 0
⎛ ⎞⎤⎛
⎞
100 x
⎣⎝−1−1 1⎠ + ⎝010⎠⎦
⎝y⎠ =
3 2−2 001 z
⎛ ⎞ ⎛ ⎞
22 0 x
⎝−10 1⎠
⎝y⎠ =
32−1 z
⎛ ⎞
0
⎝0⎠
0
⎛ ⎞
0
⎝0⎠
0
306 Chapter 8 Matrix algebra
Thus we have
2x+2y =0
−x +z=0
3x+2y −z=0
Wenotethatthethirdequationcanbederivedfromthefirsttwoequations,bysubtracting
thesecondequationfromthefirst.Ifyoucannotspotthistheequationsshouldbesolved
by Gaussian elimination. Ineffect we have only two equations:
2x+2y =0
−x +z =0
Solving these givesx =t,y = −t,z =t. The eigenvector is
⎛ ⎞
1
X =t ⎝−1⎠
1
λ = 1
We have
⎡⎛
⎞
1 2 0
⎛ ⎞⎤
⎛ ⎞
100 x
⎛ ⎞
0
⎣⎝−1−1 1⎠ − ⎝010⎠⎦
⎝y⎠ = ⎝0⎠
3 2−2 001 z 0
Thus we have
2y =0
−x−2y+ z=0
3x +2y−3z=0
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
0 2 0 x 0
⎝−1−2 1⎠
⎝y⎠ = ⎝0⎠
3 2−3 z 0
From the first equation,y = 0;puttingy = 0 into the other equations yields
−x+z =0
3x−3z =0
Here the second equation can be derived from the first by multiplying the first by −3.
Solving, wehavex =t,z =t. So the eigenvector is
⎛ ⎞
1
X =t ⎝0⎠
1
EXERCISES8.11.3
1 Calculate the eigenvectors ofthe matrices given in
Question 1 ofExercises8.11.2.
2 Calculate the eigenvectors ofthe matrices given in
Question2ofExercises8.11.2.
8.12 Analysis of electrical networks 307
Solutions
( ) )
1
1
1 (a)t ,t(
1
−1
3
( )
1
1
(c) t ,t(
1
1)
2
( ) (
1
(d) t
(
− 1 + √ )
,t
13 /3
( 1
(b) t
1)
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
1 1 1
2 (a)t⎝5⎠,t
⎝12⎠,t
⎝0⎠
1 5 1
⎛ ⎞ ⎛ ⎞
1 1
(b)t ⎝−5.0981⎠,t
⎝ 0.0981⎠,t
2.7321 −0.7321
)
(
1 √13−1 )
/3
⎛ ⎞
1
⎝−3⎠
−3
⎛ ⎞
1
− 1 ⎛ ⎞ ⎛ ⎞
1 1
(c)t ⎜ 3⎟
⎝
− 7 ⎠ ,t ⎝−2⎠,t
⎝−0.8⎠
1 1.4
3
⎛ ⎞
1 ⎛ ⎞ ⎛ ⎞
4
1 1
(d)t ⎜ 9⎟
⎝
− 1 ⎠ ,t ⎝ 0.6⎠,t
⎝ 1⎠
−0.3 0.5
3
⎛ ⎞
1 ⎛ ⎞ ⎛ ⎞
19
1 1
(e)t ⎜ 6 ⎟
⎝
− 2 ⎠ ,t ⎝4⎠,t
⎝2⎠
6 4
3
8.12 ANALYSISOFELECTRICALNETWORKS
Matrix algebra is very useful for the analysis of certain types of electrical network. For
suchnetworksitispossibletoproduceamathematicalmodelconsistingofsimultaneous
equations which can be solved using Gaussian elimination. We will consider the case
whenthenetworkconsistsofresistorsandvoltagesources.Thetechniqueissimilarfor
othertypesofnetwork.
In order to develop this approach, it is necessary to develop a systematic method
for writing the circuit equations. The method adopted depends on what the unknown
variables are. A common problem is that the voltage sources and the resistor values are
knownanditisdesiredtoknowthecurrentvaluesineachpartofthenetwork.Thiscan
beformulatedasamatrix equation. Given
where
V=RI ′
V = voltage vectorforthe network
I ′ = currentvectorforthe network
R = matrix ofresistor values
theproblemistocalculateI ′ whenV andRareknown.I ′ isusedtoavoidconfusionwith
the identitymatrix.
Anysizeofelectricalnetworkcanbeanalysedusingthisapproach.Wewilllimitthe
discussion to the case whereI ′ has three components, for simplicity. The extension to
larger networks is straightforward. Consider the electrical network of Figure 8.2. Mesh
currentshavebeendrawnforeachoftheloopsinthecircuit.Ameshisdefinedasaloop
that cannot contain a smaller closed current path. For convenience, each mesh current
is drawn in a clockwise direction even though it may turn out to be in the opposite
directionwhenthecalculationshavebeenperformed.Thenetcurrentineachbranchof
thecircuitcanbeobtainedbycombiningthemeshcurrents.Thesearetermedthebranch
308 Chapter 8 Matrix algebra
R 1 R 2
+
E 1
I 1
–
R 4
R 5
R 3
R 6 R 7
I 2 I 3
–
E 2 E 3
+ R 8
+
–
Figure8.2
An electrical network with meshcurrents
shown.
currents.Theconceptofameshcurrentmayappearslightlyabstractbutitdoesprovide
aconvenientmechanismforanalysingelectricalnetworks.Wewillexamineanapproach
that avoids the use of mesh currents later inthis section.
ThenextstageistomakeuseofKirchhoff’svoltagelawforeachofthemeshesinthe
network.Thisstatesthatthealgebraicsumofthevoltagesaroundanyclosedloopinan
electrical network iszero. Therefore the sum of the voltage risesmustequal the sum of
voltage drops. When applying Kirchhoff’s voltage law it is important to use the correct
sign for a voltage source depending on whether or notitis‘aiding’ a mesh current.
For mesh 1
E 1
= I 1
R 1
+I 1
R 2
+ (I 1
−I 3
)R 4
+ (I 1
−I 2
)R 3
E 1
= I 1
(R 1
+R 2
+R 4
+R 3
)+I 2
(−R 3
)+I 3
(−R 4
)
For mesh 2
−E 2
−E 3
= I 2
R 5
+ (I 2
−I 1
)R 3
+ (I 2
−I 3
)R 6
+I 2
R 8
−E 2
−E 3
= I 1
(−R 3
)+I 2
(R 5
+R 3
+R 6
+R 8
)+I 3
(−R 6
)
For mesh 3
E 3
= (I 3
−I 2
)R 6
+ (I 3
−I 1
)R 4
+I 3
R 7
E 3
= I 1
(−R 4
)+I 2
(−R 6
)+I 3
(R 6
+R 4
+R 7
)
These equations can be written inmatrix form as
⎛ ⎞ ⎛
⎞ ⎛ ⎞
E 1 R 1 +R 2 +R 4 +R 3 −R 3 −R 4 I 1
⎝−E 2 −E 3
⎠ = ⎝ −R 3 R 5 +R 3 +R 6 +R 8 −R 6
⎠ ⎝I 2
⎠
E 3
−R 4 −R 6 R 6 +R 4 +R 7 I 3
Engineeringapplication8.3
Calculationofmeshandbranchcurrentsinanetwork
Consider the electrical network of Figure 8.3. It has the same structure as that of
Figure8.2butwithactualvaluesforthevoltagesourcesandresistors.Branchcurrents
aswellasmeshcurrentshavebeenshown.Calculatethemeshcurrentsandhencethe
branch currents forthe network.
8.12 Analysis of electrical networks 309
2 V I 4 V
a
3 V
+
– I b
3 V
I 1
1 V
I c
5 V I 2 2 V I 3
3 V
I d – I e +
2 V 4 V
+ 4 V –
I f
Figure8.3
Theelectrical network ofFigure 8.2
with values forthe source voltagesand
resistorsadded.
Solution
We have already obtained the equations for this network. Substituting actual values
for the resistorsand voltage sources gives
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
3 10 −3 −1 I 1
⎝−2−4⎠ = ⎝−3 14 −2⎠
⎝I 2
⎠
4 −1−2 6 I 3
This is now in the formV =RI ′ . We shall solve these equations by Gaussian elimination.
Forming the augmented matrix, we have
⎛
⎞
10−3−1 3
⎝−3 14 −2 −6⎠
−1−2 6 4
Then
and
Hence,
R 1
R 2
→ 10R 2
+3R 1
R 3
→ 10R 3
+R 1
R 1
R 2
R 3
→ 131R 3
+23R 2
Similarly,
I 3
= 4460
7200 = 0.619 A
I 2
=
Finally,
I 1
=
−51 +23(0.619)
131
3 +0.619 +3(−0.281)
10
⎛
⎞
10 −3 −1 3
⎝0 131 −23 −51⎠
0 −23 59 43
⎛
⎞
10 −3 −1 3
⎝0 131 −23 −51⎠
0 0 7200 4460
= −0.281 A
= 0.278 A
➔
310 Chapter 8 Matrix algebra
The branch currents arethen
I a
=I 1
=278mA
I b
= I 2
−I 1
= −281 −278 = −559 mA
I c
=I 3
−I 1
=619−278=341mA
I d
= −I 2
= 281 mA
I e
= I 2
−I 3
= −281 −619 = −900 mA
I f
=I 3
=619mA
An alternative approach to analysing an electrical network is to use the node voltage
method which is often simply called nodal analysis. This technique is fundamental to
manycomputerprogramswhichareusedtosimulateelectricalcircuits,suchasSPICE.
We introduce thistechnique by means of an example.
Engineeringapplication8.4
Analysinganelectricalnetworkusingthenodevoltagemethod
The node voltage method utilises the notion that ‘islands’ of equal potential lie between
electrical components and sources.The procedure isas follows:
(1) Pick a reference node. In order to simplify the equations this is usually chosen
tobethenodewhichiscommontothelargestnumberofvoltagesourcesand/or
the largestnumber of branches.
(2) Assignanodevoltagevariabletoalloftheothernodes.Iftwonodesareseparated
solelybyavoltagesourcethenonlyoneofthenodesneedbeassignedavoltage
variable. The node voltages areall measured with respect tothe reference node.
(3) At each node, write Kirchhoff’s current law in terms of the node voltages. Note
that once the node voltages have been calculated it is easy to obtain the branch
currents.
We will again examine the network of Figure 8.2, but this time use the node voltage
method.ThenetworkisshowninFigure8.4withnodevoltagesassignedandbranch
currentslabelled.Thereferencenodeisindicatedbyusingtheearthsymbol.Writing
Kirchhoff’s current lawfor each node, weobtain:
node a
node b
I a
=I a
V b
+E 1
−V a
R 1
= V a −V d
R 2
V b
R 2
+E 1
R 2
−V a
R 2
=V a
R 1
−V d
R 1
V a
(R 1
+R 2
)−V b
R 2
−V d
R 1
=E 1
R 2
I a
+I b
+I d
=0
V b
+E 1
−V a
+ V b −V c
R 1
R 3
+ V b +E 2 −V e
R 5
= 0
8.12 Analysis of electrical networks 311
R 1 V R a 2
a I
+
a
E 1
–
I
R 3 b
V R c 4 I
V c
b
b c d
R 5 R 6 R 7
V d
I d –
I e +
I f
E 2 E 3
+ R 8 –
e
V e
Figure8.4
The network ofFigure8.2with node
voltages labelled.
Rearrangement yields
that is,
V b
R 3
R 5
+ E 1
R 3
R 5
−V a
R 3
R 5
+V b
R 1
R 5
−V c
R 1
R 5
+V b
R 1
R 3
+ E 2
R 1
R 3
−V e
R 1
R 3
=0
V a
R 3
R 5
− V b
(R 1
R 3
+R 1
R 5
+R 3
R 5
)+V c
R 1
R 5
+V e
R 1
R 3
node c
= E 1
R 3
R 5
+E 2
R 1
R 3
I b
=I c
+I e
so that
V b
−V c
R 3
= V c −V d
R 4
+ V c −E 3
R 6
that is,
node d
node e
V b
R 4
R 6
−V c
R 4
R 6
=V c
R 3
R 6
−V d
R 3
R 6
+V c
R 3
R 4
−E 3
R 3
R 4
V b
R 4
R 6
−V c
(R 4
R 6
+R 3
R 6
+R 3
R 4
)+V d
R 3
R 6
= −E 3
R 3
R 4
I a
+I c
=I f
V a
−V d
+ V c −V d
R 2
R 4
= V d
R 7
V a
R 4
R 7
−V d
R 4
R 7
+V c
R 2
R 7
−V d
R 2
R 7
=V d
R 2
R 4
V a
R 4
R 7
+V c
R 2
R 7
−V d
(R 4
R 7
+R 2
R 7
+R 2
R 4
) =0
I d
=I d
V b
+E 2
−V e
R 5
= V e
R 8
V b
R 8
−V e
(R 5
+R 8
) = −E 2
R 8
➔
312 Chapter 8 Matrix algebra
These equations can be writteninthe matrix formAV =B, whereAisthe matrix
⎛
⎞
R 1 +R 2 −R 2 0 −R 1 0
R 3 R 5 −R 1 R 3 −R 1 R 5 −R 3 R 5 R 1 R 5 0 R 1 R 3
⎜ 0 R
⎝ 4 R 6 −R 4 R 6 −R 3 R 6 −R 3 R 4 R 3 R 6 0 ⎟
R 4 R 7 0 R 2 R 7 −R 4 R 7 −R 2 R 7 −R 2 R 4 0 ⎠
0 R 8 0 0 −R 5 −R 8
and
⎛ ⎞
⎛
⎞
V a
E 1
R 2
V b
E 1
R 3
R 5
+E 2
R 1
R 3
V =
⎜V c ⎟ and B=
⎜ −E 3
R 3
R 4 ⎟
⎝V d
⎠
⎝ 0 ⎠
V e
−E 2
R 8
TheequationswouldgenerallybesolvedbyGaussianeliminationtoobtainthenode
voltages and hence the branch currents.
Using the component values from Engineering application 8.3, these equations
become
⎛
⎞ ⎛ ⎞ ⎛ ⎞
6 −4 0 −2 0 V a
12
15−31 10 0 6
V b
⎜ 0 2 −11 6 0
⎟ ⎜V c ⎟
⎝ 3 0 12 −19 0⎠
⎝V d
⎠ = 57
⎜−12
⎟
⎝ 0⎠
0 4 0 0 −9 V e
−8
Use of a computer package avoids the tedious arithmetic associated with Gaussian
elimination and yields
V a
= 2.969 V b
= 0.5250 V c
= 2.200 V d
= 1.858 V e
= 1.122
Itisthen straightforward tocalculate the branch currents:
I a
= V a −V d
=278mA I
R b
= V b −V c
= −558 mA
2
R 3
I c
= V c −V d
R 4
=342mA I d
= V e
R 8
= 281 mA
I e
= V c −E 3
R 6
= −900 mA I f
= V d
R 7
= 619 mA
Compare these answers withthose of Engineering application 8.3.
It is possible to analyse electrical networks containing more complex elements
suchascapacitors,inductors,activedevices,etc.,usingthesameapproach.Theequationsaremorecomplicatedbutthetechniqueisthesame.Oftenitisnecessarytouse
iterative techniques in view of the size and complexity of the problem. These are
examined inthe following section.
8.13 ITERATIVETECHNIQUESFORTHESOLUTION
OFSIMULTANEOUSEQUATIONS
The techniques met so far for the solution of simultaneous equations are known as direct
methods, which generally lead to the solution after a finite number of stages in
the calculation process have been carried out. An alternative collection of techniques is
8.13 Iterative techniques for the solution of simultaneous equations 313
available and these are known as iterative methods. They generate a sequence of approximate
solutions which may converge to the required solution, and are particularly
advantageous when large systems of equations are to be solved by computer. We shall
studytwo such techniques here: Jacobi’s method and Gauss--Seidel iteration.
Example8.49 Solve the equations
2x+y =4
x−3y = −5
usingJacobi’s iterative method.
Solution We first rewrite the equations as
2x =−y+4
−3y =−x−5
and then as
x = − 1 2 y +2
(8.10)
y = 1 3 x + 5 3
Jacobi’smethodinvolves‘guessing’asolutionandsubstitutingtheguessinther.h.s.of
the equations in(8.10).Suppose we guessx = 0,y = 0.Substitution then gives
x = 2
y = 5 3
We now use these values as estimates of the solution and resubstitute into the r.h.s. of
Equation (8.10).This time wefind
x = − 1 2
( 5
3)
+2 = 1.1667 (tofour decimal places)
y = 1 3 (2) + 5 3 = 2.3333
(tofour decimal places)
The whole process is repeated in the hope that each successive application oriteration
will give an answer close to the required solution, that is successive iterates will converge.
In order to keep track of the calculations, we label the initial guessx (0) ,y (0) , the
resultof the first iterationx (1) ,y (1) and so on. Generally, we find
x (n+1) = − 1 2 y(n) +2
y (n+1) = 1 3 x(n) + 5 3
314 Chapter 8 Matrix algebra
Table8.1
Iterates produced byJacobi’smethod.
Iterationno. (n) x (n) y (n)
0 0 0
1 2.0000 1.6667
2 1.1667 2.3333
3 0.8333 2.0556
4 0.9722 1.9444
5 1.0278 1.9907
6 1.0047 2.0093
7 0.9954 2.0016
8 0.9992 1.9985
9 1.0008 1.9997
10 1.0002 2.0003
TheresultsofsuccessivelyapplyingtheseformulaeareshowninTable8.1.Thesequence
of values ofx (n) seems toconverge to1while thatofy (n) seems toconverge to2.
Clearly this sort of approach is simple to program and iterative techniques such as
Jacobi’s method are best implemented on a computer. When writing the program a test
should be incorporated so that after each iteration a check for convergence is made by
comparing successive iterates. In many cases, even when convergence does occur, it is
slow and so other techniques are used which converge more rapidly. The Gauss--Seidel
method is attractive for this reason. It uses the most recent approximation to x when
calculatingyleadingtoimprovedratesofconvergenceasthefollowingexampleshows.
Example8.50 Usethe Gauss--Seidel method tosolve the equations ofExample 8.49.
Solution As before wewrite the equations inthe form
x = − 1 2 y +2
y = 1 3 x + 5 3
Withx (0) = 0,y (0) = 0 as our initial guess,wefind
x (1) = − 1 2 (0)+2=2
To findy (1) weuse the mostrecent approximation toxavailable, thatisx (1) :
y (1) = 1 3 (2) + 5 3 = 2.3333
Generally, wefind
x (n+1) = − 1 2 y(n) +2
y (n+1) = 1 3 x(n+1) + 5 3
8.13 Iterative techniques for the solution of simultaneous equations 315
Table8.2
Iterates produced bythe Gauss--Seidel
method.
Iterationno. (n) x (n) y (n)
0 0 0
1 2.0000 2.3333
2 0.8334 1.9445
3 1.0278 2.0093
4 0.9954 1.9985
5 1.0008 2.0003
6 0.9999 2.0000
andtheresultsofsuccessivelyapplyingtheseformulaeareshowninTable8.2.Asbefore,
we see that the sequence x (n) seems to converge to 1 and y (n) seems to converge to 2,
although more rapidly than before.
Both ofthese techniques generalize tolarger systems of equations.
Example8.51 Perform three iterations of Jacobi’s method and three iterations of the Gauss--Seidel
method tofind anapproximate solution of
−8x+y+z=1
x−5y+ z =16
x+ y−4z =7
with aninitial guess ofx =y=z=0.
Solution Werewritethesystemtomakex,yandzthesubjectofthefirst,secondandthirdequation,
respectively:
x = 1 8 y + 1 8 z − 1 8
y = 1 5 x + 1 5 z − 16
5
(8.11)
z = 1 4 x + 1 4 y − 7 4
To apply Jacobi’s method we substitute the initial guessx (0) = y (0) = z (0) = 0 into
the r.h.s. of Equation (8.11) to obtainx (1) ,y (1) andz (1) , and then repeat the process. In
general,
x (n+1) = 1 8 y(n) + 1 8 z(n) − 1 8
y (n+1) = 1 5 x(n) + 1 5 z(n) − 16 5
z (n+1) = 1 4 x(n) + 1 4 y(n) − 7 4
316 Chapter 8 Matrix algebra
We find
x (1) = − 1 8 = −0.1250
y (1) = − 16 5 = −3.2000
z (1) = − 7 4 = −1.7500
Then,
x (2) = 1 8 (−3.2000) + 1 8 (−1.7500) − 1 8 = −0.7438
y (2) = 1 5 (−0.1250) + 1 5 (−1.7500) − 16 5 = −3.5750
z (2) = 1 4 (−0.1250) + 1 4 (−3.2000) − 7 4 = −2.5813
Finally,
x (3) = 1 8 (−3.5750) + 1 8 (−2.5813) − 1 8 = −0.8945
y (3) = 1 5 (−0.7438) + 1 5 (−2.5813) − 16 5 = −3.8650
z (3) = 1 4 (−0.7438) + 1 4 (−3.5750) − 7 4 = −2.8297
To apply the Gauss--Seidel iteration to Equation (8.11), the most recent approximation
isusedateach stage leading to
x (n+1) = 1 8 y(n) + 1 8 z(n) − 1 8
y (n+1) = 1 5 x(n+1) + 1 5 z(n) − 16 5
z (n+1) = 1 4 x(n+1) + 1 4 y(n+1) − 7 4
Startingfromx (0) =y (0) =z (0) = 0,wefind
x (1) = − 1 8 = −0.1250
y (1) = 1 5 (−0.1250) + 1 5 (0) − 16 5 = −3.2250
z (1) = 1 4 (−0.1250) + 1 4 (−3.2250) − 7 4 = −2.5875
8.13 Iterative techniques for the solution of simultaneous equations 317
Table8.3
Comparisonofthe Jacobi andGauss--Seidel methods.
Iteration
no. (n)
Jacobi’smethod Gauss--Seidel
x (n) y (n) z (n) x (n) y (n) z (n)
0 0.0000 0.0000 0.0000 0.0000 0.0000 0.0000
1 −0.1250 −3.2000 −1.7500 −0.1250 −3.2250 −2.5875
2 −0.7438 −3.5750 −2.5813 −0.8516 −3.8878 −2.9348
3 −0.8945 −3.8650 −2.8297 −0.9778 −3.9825 −2.9901
4 −0.9618 −3.9448 −2.9399 −0.9966 −3.9973 −2.9985
5 −0.9856 −3.9803 −2.9767 −0.9995 −3.9996 −2.9998
6 −0.9946 −3.9924 −2.9915 −0.9999 −3.9999 −3.0000
7 −0.9980 −3.9972 −2.9968
8 −0.9992 −3.9990 −2.9988
9 −0.9997 −3.9996 −2.9996
Then,
x (2) = 1 8 (−3.2250) + 1 8 (−2.5875) − 1 8 = −0.8516
y (2) = 1 5 (−0.8516) + 1 5 (−2.5875) − 16 5 = −3.8878
Finally,
z (2) = 1 4 (−0.8516) + 1 4 (−3.8878) − 7 4 = −2.9348
x (3) = 1 8 (−3.8878) + 1 8 (−2.9348) − 1 8 = −0.9778
y (3) = 1 5 (−0.9778) + 1 5 (−2.9348) − 16 5 = −3.9825
z (3) = 1 4 (−0.9778) + 1 4 (−3.9825) − 7 4 = −2.9901
For completeness, further iterations areshown inTable 8.3.
AsexpectedtheGauss--SeidelmethodconvergesmorerapidlythanJacobi’s.Thisis
generally the case because ituses the mostrecently calculated values ateach stage.
Unfortunately, as with all iterative methods, convergence is not guaranteed. However,
it can be shown that if the matrix of coefficients is diagonally dominant, that is each
diagonal element is larger in modulus than the sum of the moduli of the other elements
inits row, thenthe Gauss--Seidel methodwill converge.
Engineeringapplication8.5
Findingapproximatesolutionsforthenodevoltagesofan
electricalnetwork
When trying to analyse complicated electrical networks it is frequently necessary
to resort to computer-based methods in order to find the voltages and currents. For ➔
318 Chapter 8 Matrix algebra
example, large networks for the distribution of electricity from power stations to
points of consumption can be very complicated to analyse due to the large number
of loops involved. This example illustrates the method by making use of the circuit
given in Engineering application 8.3. However, it should be borne in mind that this
approach isof mostvalue when analysing much larger networks.
Use Jacobi’s method and the Gauss--Seidel method to obtain approximate solutions
for the node voltages of the electrical network examined in Engineering
application 8.3.
Solution
The node voltage equations are
6V a
−4V b
−2V d
= 12 3V a
+12V c
−19V d
= 0
15V a
−31V b
+10V c
+6V e
= 57
2V b
−11V c
+6V d
= −12
These can be rearranged togive
4V b
−9V e
= −8
V a
= 2V b +V d +6
3
V b
= 15V a +10V c +6V e −57
31
V c
= 2V b +6V d +12
11
V d
= 3V a +12V c
19
V e
= 4V b +8
9
The results of applying Jacobi’s method with aninitial guess of
V (0)
a
=V (0)
b
=V (0)
c
=V (0)
d
=V (0)
e
= 0
are shown in Table 8.4. Convergence was achieved to within 0.001 after 44 iterations.TheresultsofapplyingtheGauss--SeidelmethodareshowninTable8.5.Convergence
was achieved to within 0.001 after 21 iterations. Clearly the Gauss--Seidel
method converges more rapidly than Jacobi’s method.
Table8.4
Node voltages derived from Jacobi’smethod.
Iterationno.(n)
V (n)
a
V (n)
b
V (n)
c
V (n)
d
V (n)
e
0 0.0000 0.0000 0.0000 0.0000 0.0000
1 2.0000 −1.8387 1.0909 0.0000 0.8889
.
20 2.8710 0.4802 2.1379 1.8265 1.0738
.
44 2.9679 0.5243 2.1990 1.8578 1.1215
8.14 Computer solutions of matrix problems 319
Table8.5
Node voltages derived from the Gauss--Seidel method.
Iterationno.(n)
V (n)
a
V (n)
b
V (n)
c
V (n)
d
V (n)
e
0 0.0000 0.0000 0.0000 0.0000 0.0000
1 2.0000 −0.8710 0.9326 0.9048 0.5018
.
20 2.9665 0.5226 2.1985 1.8569 1.1212
21 2.9674 0.5233 2.1989 1.8573 1.1215
EXERCISES8.13
1 Performthree iterationsofthe methods ofJacobiand
Gauss--Seidel to obtain approximate solutionsofthe
following.Ineach case, use an initialguess of
x (0) =y (0) =z (0) =0
(a) 4x+y+z=−1
x+6y+2z=0
x+2y+4z=1
(b) 5x+y−z=4
x−4y+z=−4
2x+2y−4z=−6
(c) 4x+y+z=17
x+3y−z=9
2x−y+5z=1
Solutions
1 (a) Jacobi
x 1 = −0.2500,y 1 = 0,z 1 = 0.2500
x 2 = −0.3125,y 2 = −0.0417,z 2 = 0.3125
x 3 = −0.3177,y 3 = −0.0521,z 3 = 0.3490
Gauss--Seidel
x 1 = −0.2500,y 1 = 0.0417,z 1 = 0.2917
x 2 = −0.3333,y 2 = −0.0417,z 2 = 0.3542
x 3 = −0.3281,y 3 = −0.0634,z 3 = 0.3637
(b) Jacobi
x 1 = 0.8000,y 1 = 1,z 1 = 1.5000
x 2 = 0.9000,y 2 = 1.5750,z 2 = 2.4000
x 3 = 0.9650,y 3 = 1.8250,z 3 = 2.7375
Gauss--Seidel
x 1 = 0.8000,y 1 = 1.2000,z 1 = 2.5000
x 2 = 1.0600,y 2 = 1.8900,z 2 = 2.9750
x 3 = 1.0170,y 3 = 1.9980,z 3 = 3.0075
(c) Jacobi
x 1 = 4.2500,y 1 = 3,z 1 = 0.2000
x 2 = 3.4500,y 2 = 1.6500,z 2 = −0.9000
x 3 = 4.0625,y 3 = 1.5500,z 3 = −0.8500
Gauss--Seidel
x 1 = 4.2500,y 1 = 1.5833,z 1 = −1.1833
x 2 = 4.1500,y 2 = 1.2222,z 2 = −1.2156
x 3 = 4.2484,y 3 = 1.1787,z 3 = −1.2636
8.14 COMPUTERSOLUTIONSOFMATRIXPROBLEMS
Theuseoftechnicalcomputinglanguageshasvastlyimprovedtheabilityofengineersto
calculate the solutions to complex engineering problems. As has been demonstrated by
theexamplesgiveninpreviouschapters,highlevelmathematicalfunctionsareprovided
such as the ability to calculate the roots of an equation and the eigenvalues of a matrix.
In electrical engineering problems it is frequently necessary to solve matrix equations.
Consider again Example 8.32. The equations areof the form
AX=B
320 Chapter 8 Matrix algebra
where
⎛ ⎞ ⎛ ⎞ ⎛ ⎞
3 2−1 4 x
A = ⎝2−1 2⎠ B = ⎝10⎠ X = ⎝y⎠
1−3−4 5 z
Inordertosolvetheseequationsinatechnicalcomputinglanguagethematricesneedto
be loaded into memory. InMATLAB ® wewould type
A = [3,2,−1;2,−1,2;1,−3,−4]
B = [4;10;5]
We note that commas are used to separate entries on the same row and semicolons are
used toseparate different rows.
ThevalueofX canbeobtainedbythemethodofmatrixinversion.Thecodeforthisis
X =inv(A) ∗B
Alternatively,X can be obtained by Gaussian elimination. The code for thisis
X=A\B
Gaussian elimination is the usual method of solving systems of linear equations in
MATLAB ® because it is computationally efficient. If we run this three-line program
using MATLAB ® then weobtain
A = [3,2,−1;2,−1,2;1,−3,−4]
B = [4;10;5]
X=A\B
X = 3.0000
−2.0000
1.0000
and sox = 3.0000, y = −2.0000, z = 1.0000.
We see that MATLAB ® is a powerful tool for carrying out matrix calculations. Part
ofitspowerderivesfromtheextremelyhigh-levelnatureofitscommands.Acommand
suchasinv(A)wouldrequiretypically50linesifwritteninanormalhigh-levellanguage
such as Cor FORTRAN.
Engineeringapplication8.6
SolutionofanelectricalnetworkusingMATLAB®
Consider again the electrical network of Engineering application 8.3. Write the
MATLAB ® code tosolve thisnetwork.
Solution
TheequationsareoftheformV =RI ′ whereV isthevoltagevectorofthenetwork,
RisthematrixofresistorvaluesandI ′ isthecurrentvectorofthenetwork.Weknow
V andRand we wish to obtainI ′ . Using the values of Engineering application 8.3
wecan write
➔
Review exercises 8 321
V = [3;−6;4]
R = [10,−3,−1;−3,14,−2;−1,−2,6]
IPRIME = inv(R) ∗V
Running thisprogram gives
IPRIME = 0.2778
−0.2806
0.6194
REVIEWEXERCISES8
1 Evaluate the following products:
(a) ( −4 1 )( )
2
(b) ( −4 1 )( )
3
7
6
(c) ( −4 1 )( )
( )
2 3 2 −3 −1
(d)
7 6 4 5)(
1
⎛ ⎞
(e) ( 2 −1 0 ) 131 ( )
⎝240⎠
2 1
(f)5
3 6
107
( )
1 732
(g)
2 101
2 Simplify
∣ cosh θ sinh θ
sinh θ cosh θ∣ .
⎛ ⎞
0 23
3 Given thatA = ⎝2 00⎠findA −1 andA 2 .Show
1 −10
thatA 2 +6A −1 −7I = 0, whereI denotes the 3 ×3
identitymatrix.
⎛ ⎞
2 −14
4 IfA= ⎝1 00⎠find
1 −20
(a) |A|
(b) adj(A)
(c) A −1
5 Find the inverse ofthe matrix
⎛ ⎞
1 −20
⎝ 3 15⎠
−1 23
Hencesolvethe equations
x−2y=3
3x+y+5z=12
−x+2y+3z=3
6 Use Gaussian elimination to solve
x+2y−3z+2w=2
2x+5y−8z+6w=5
3x+4y−5z+2w=4
7 Use Jacobi’smethodto obtain a solution ofAX =B
to three decimalplaces where
⎛ ⎞ ⎛ ⎞
10 1 0 1
A = ⎝ 1 10 1 ⎠ and B= ⎝2⎠
0 1 10 1
8 Use amatrixmethodto solve
2x+y−z=3
x−y+2z=1
3x+4y+3z=2
9 Consider the Vandermonde matrix
⎛
1aa 2 ⎞
V = ⎝1bb 2 ⎠
1cc 2
(a) Find detV andshowthat itcanbe written as
(a −c)(a −b)(c −b).
(b) Show thatifa,bandcare all different, thenthe
Vandermonde matrixis non-singular.
10 (a) Thesignal f (t) = sin πt isto be approximated
2
byathird-degree polynomial forvalues oft
between −1 and 2.Byforcingthe original signal
and itsapproximating polynomial to agree at
t=−1,t=0,t=1andt=2,findthis
approximation. [Hint: seeEngineering
application 8.2.]
(b) Useagraphics calculator orgraph plotting
package to compare the graphs of f (t)andits
approximating polynomial.
322 Chapter 8 Matrix algebra
( ) 1 −2
11 SupposeA = .
1 4
( −2
(a) FindAv when v = .
2)
( −2
(b) Find 3vwhen v = .Deduce thatAv = 3v.
2)
( −µ
(c) FindAv when v = forany constant µ.
µ)
Deduce thatAv = 3v.
12 Usethe Gauss--Seidel method to find an approximate
solution of
(a) 5x +3y = −34
2x−7y=93
(b) 3x+y+z=6
2x+5y−z=5
x−3y+8z=14
13 Determine which ofthe followingsystemshave
non-trivial solutions.
(a) 2x−y=0
3x−1.5y=0
(b) 6x+5y=0
5x+6y=0
(c) −x−4y=0
2x+8y=0
(d) 7x−3y=0
1.4x −0.6y = 0
(e) −4x+5y=0
3x−4y=0
14 Determine which ofthe followingsystemshave
non-trivial solutions.
(a) 3x−2y+2z=0
x−y+z=0
2x+2y−z=0
(b) x+3y−z=0
4x−y+2z=0
6x+5y=0
(c) x+2y−z=0
x−3z=0
5x+6y−9z=0
15 ThematrixAis defined by
( ) 3 2
A =
−3 −4
(a) Determine the characteristic equation ofA.
(b) Determine the eigenvalues ofA.
(c) Determine the eigenvectors ofA.
(d) Formanew matrixM whose columns are the two
eigenvectors ofA.M iscalled amodal matrix.
(e) Show thatM −1 AM is adiagonal matrix,D,with
the eigenvalues ofAonits leading diagonal.Dis
called thespectral matrixcorresponding to the
modal matrixM.
16 (a) Show that the matrix
( ) 5 2
A =
−2 1
has only one eigenvalue and determine it.
(b) Calculate the eigenvector ofA.
17 ThematrixH isgiven by
⎛ ⎞
4 −11
H = ⎝−2 40⎠
−4 31
(a) Findthe eigenvalues ofH.
(b) Determine the eigenvectors ofH.
(c) Formanew matrixM whose columns are the
threeeigenvectors ofH.M iscalled a modal
matrix.
(d) ShowthatM −1 HM isadiagonal matrix,D, with
the eigenvalues ofH onits leading diagonal.Dis
called thespectral matrixcorresponding to the
modal matrixM.
Solutions
1 (a) −1 (b) −6
( −5
(c) (−1 −6) (d)
1)
(e) (0 2 2)
⎛ ⎞
7 3
(g) ⎜2
2 1
⎝1
2 0 1 ⎟
⎠
2
(f)
( ) 10 5
15 30
2 1
⎛ ⎞
3 A −1 = 1 0 3 0
⎝0 3 −6⎠
6
2 −2 4
⎛ ⎞
7 −30
A 2 = ⎝ 0 46⎠
−2 23
Review exercises 8 323
4 (a) −8
⎛
(b) ⎝
⎞
0 −80
0 −44⎠
−2 31
⎛ ⎞
0 8 0
1
(c) ⎝0 4 −4⎠
8
2 −3 −1
5
⎛
1
⎝
21
−7 6 −10
−14 3 −5
70 7
x=1,y=−1,z=2
⎞
⎠
6 x=2µ−λ,y=1+2λ−2µ,z=λ,w=µ
7 x = 0.082,y = 0.184,z = 0.082
8 x = 1.462,y = −0.308,z = −0.385
12 (a) x=1,y=−13
(b)x=1,y=1,z=2
13 (a),(c)and(d)have non-trivialsolutions.
14 (b)and(c)have non-trivialsolutions.
15 (a) λ 2 +λ−6=0
(b) ( λ=−3,2 ) ( ) 1 1
(c) ,
−3 −0.5
16 (a) λ=3
( ) 1
(b)
−1
17 (a) λ=2,3,4
⎛
(b) ⎝
⎞ ⎛ ⎞ ⎛ ⎞
1 1 0
1⎠,
⎝2⎠,
⎝1⎠
−1 1 1
10 (a) f(t) = 4 3 t − 1 3 t3
( ( −6 −6
11 (a) (b)
6)
6)
(c)
( ) −3µ
3µ
9 Complexnumbers
Contents 9.1 Introduction 324
9.2 Complexnumbers 325
9.3 Operationswithcomplexnumbers 328
9.4 Graphicalrepresentationofcomplexnumbers 332
9.5 Polarformofacomplexnumber 333
9.6 Vectorsandcomplexnumbers 336
9.7 Theexponentialformofacomplexnumber 337
9.8 Phasors 340
9.9 DeMoivre’stheorem 344
9.10 Lociandregionsofthecomplexplane 351
Reviewexercises9 354
9.1 INTRODUCTION
Complex numbers often seem strange when first encountered but it is worth persevering
with them because they provide a powerful mathematical tool for solving several
engineeringproblems.Oneofthemainapplicationsistotheanalysisofalternatingcurrent
(a.c.) circuits. Engineers are very interested in these because the mains supply is
itself a.c., and electricity generation and transportation are dominated by a.c. voltages
and currents.
A great deal of signal analysis and processing uses mathematical models based on
complex numbers because they allow the manipulation of sinusoidal quantities to be
undertakenmoreeasily.Furthermore,thedesignoffilterstobeusedincommunications
equipment relies heavily on their use.
One area of particular relevance is control engineering -- so much so that control
engineers often prefer to think of a control system in terms of a ‘complex plane’ representation
rather than a ‘time domain’ representation. We will develop these concepts in
this and subsequent chapters.
9.2 Complex numbers 325
9.2 COMPLEXNUMBERS
We have already examined quadratic equations such as
x 2 −x−6=0 (9.1)
andhavemettechniquesforfindingtherootsofsuchequations.Theformulaforobtainingthe
roots of a quadratic equationax 2 +bx +c = 0 is
x = −b ± √ b 2 −4ac
2a
Applying thisformulatoEquation (9.1), wefind
(9.2)
x = +1 ± √ (−1) 2 −4(1)(−6)
2
= 1 ± √ 25
2
= 1 ±5
2
so thatx = 3 andx = −2 are the two roots. However, if we try to apply the formula to
the equation
we find
2x 2 +2x+5=0
x = −2 ± √ −36
4
A problem now arises in that we need to find the square root of a negative number. We
knowfromexperiencethatsquaringbothpositiveandnegativenumbersyieldsapositive
result; thus,
6 2 = 36 and (−6) 2 = 36
so that there is no real number whose square is −36. In the general case, if
ax 2 +bx +c = 0, we see by examining the square root in Equation (9.2) that this problemwillalwaysarisewheneverb
2 −4ac < 0.Nevertheless,itturnsouttobeveryuseful
to invent a technique for dealing with such situations, leading to the theory of complex
numbers.
To make progress weintroduce a number, denoted j,with the property that
j 2 =−1
We have already seen that using the real number system we cannot obtain a negative
numberbysquaringareal numbersothenumberjisnotreal-- wesay itisimaginary.
Thisimaginarynumberhasaveryusefulroletoplayinengineeringmathematics.Using
it we can now formally write down an expression for the square root of any negative
number. Thus,
√
−36 =
√
36 × (−1)
= √ 36×j 2
= 6j
326 Chapter 9 Complex numbers
Returning tothe solution of the quadratic equation 2x 2 +2x +5 = 0,wefind
x = −2 ± √ −36
4
= −2±6j
4
= −1±3j
2
We have found two roots, namelyx = − 1 2 + 3 2 jandx=−1 2 − 3 j.These numbers are
2
calledcomplexnumbersandweseethattheyaremadeupoftwoparts--arealpartand
animaginarypart.Forthefirstcomplexnumbertherealpartis − 1 2 andtheimaginary
part is 3 2 . For the second complex number the real part is −1 and the imaginary part is
2
− 3 2 .Inamoregeneralcaseweusuallyusetheletterztodenoteacomplexnumberwith
realpartaandimaginarypartb,soz =a +bj.Wewritea = Re(z)andb= Im(z),and
denote the set of all complex numbers by C. Note thata,b∈ Rwhereasz ∈ C.
z =a +bj,zisamemberofthe setofcomplex numbers, that isz ∈ C
a =Re(z) b =Im(z)
Complex numbers which have a zero imaginary part are purely real and hence all real
numbers arealsocomplex numbers, thatis R ⊂ C.
Example9.1 Solve the quadraticequation 2s 2 −3s +7 = 0.
Solution Using the formulaforsolvingaquadraticequation wefind
s = −(−3) ± √ (−3) 2 −4(2)(7)
2(2)
= 3 ± √ −47
4
= 3 ± √ 47j
4
= 0.75 ±1.71j
Usingthe factthat j 2 = −1 wecan develop other quantities.
Example9.2 Simplify the expression j 3 .
Solution We have
j 3 =j 2 ×j
=(−1)×j
= −j
9.2 Complex numbers 327
9.2.1 Thecomplexconjugate
Ifz =a +bj,wedefineitscomplexconjugatetobethenumberz =a −bj;thatis,we
change the sign of the imaginary part.
Example9.3 Write down the complex conjugates of
(a) −7+j (b) 6−5j (c) 6 (d) j
Solution Tofindthecomplexconjugatesofthegivennumberswechangethesignoftheimaginary
parts. Apurelyreal number has animaginarypart0.We find
(a) −7 −j (b) 6 +5j (c) 6,thereisno imaginarypart toalter (d) −j
We recall that the solution of the quadratic equation 2x 2 +2x +5 = 0 yielded the two
complexnumbers − 1 2 + 3 2 jand−1 2 − 3 2 j,andnotethattheseformacomplexconjugate
pair. This illustratesamore general result:
When the polynomial equationP(x) = 0 has real coefficients, any complex roots
will always occur incomplex conjugatepairs.
Consider the following example.
Example9.4 Show that the equationx 3 −7x 2 +19x −13 = 0 has a root atx = 1 and find the other
roots.
Solution IfweletP(x) =x 3 −7x 2 +19x −13,thenP(1) =1−7 +19 −13 =0sothatx =1
is a root. This means thatx −1 must be a factor ofP(x) and so we can expressP(x) in
the form
P(x)=x 3 −7x 2 +19x−13=(x−1)(αx 2 +βx+γ)
=αx 3 +(β−α)x 2 +(γ−β)x−γ
where α, β and γ arecoefficientstobedetermined.Comparingthecoefficientsofx 3 we
find α = 1. Comparing the constant coefficients we find γ = 13. Finally, comparing
coefficients ofxwefind β = −6,and hence
P(x) =x 3 −7x 2 +19x −13 = (x −1)(x 2 −6x +13)
Theothertwo rootsofP(x) = 0 arefoundby solvingthe quadraticequation
x 2 −6x+13=0,thatis
x = 6 ± √ 36−52
2
= 6 ± √ −16
2
=3±2j
and again we note that the complex roots occur as a complex conjugate pair. This illustratesthegeneralresultgiveninSection1.4thatannth-degreepolynomialhasnroots.
328 Chapter 9 Complex numbers
EXERCISES9.2
1 Solve the following equations:
(e) cos ωt +jsin ωt (f) cos ωt −jsin ωt
complex numbers.
1
(c)z=0 (d)z= 1 +j
(a) −11 −8j (b) 5 +3j (c)
2 j (d) −17 2
(f)z=j (e)z=j 2
(a)x 2 +1=0 (b)x 2 +4=0
(g) −0.333j +1
(c)3x 2 +7=0 (d)x 2 +x+1=0 4 Recallfrom Chapter2thatthe polesofarational
(e) x2
2 −x+2=0
functionR(x) =P(x)/Q(x)are those values ofxfor
whichQ(x) = 0.Findany poles of
(f)−x 2 −3x−4=0
x 3x 3
(g)2x 2 (a) (b)
+3x+3=0
x −3 x 2 (c)
+1 x 2 +x+1
(h)x 2 +3x+4=0
2 Solve the cubicequation
5 Solvethe equations 2 +2s +5 = 0.
6 Expressasacomplex number
3x 3 −11x 2 +16x−12=0
(a) j 4 (b) j 5 (c) j 6
given thatoneofthe rootsisx = 2. 7 StateRe(z)andIm(z)where
3 Writedown the complex conjugates ofthe following (a)z=7+11j (b)z=−6+j
Solutions
1 (a) ±j (b) ±2j
(c) ± √ 7/3j (d) −1/2 ± ( √ 3/2)j
(e) 1± √ 3j (f) −3/2 ± ( √ 7/2)j
(g) −3/4 ± ( √ 15/4)j (h) −3/2 ± ( √ 7/2)j
2 2,5/6±( √ 47/6)j
3 (a) −11 +8j (b) 5 −3j (c) − 1 2 j
(d) −17 (e) cos ωt −jsin ωt
(f) cos ωt +jsin ωt (g) 0.333j +1
4 (a)x=3 (b)x=±j
(c)x = −1/2 ± ( √ 3/2)j
5 s=−1±2j
6 (a) 1 (b) j (c) −1
7 (a) 7,11 (b) −6,1 (c) 0, 0
(d)
1
2 , 1 2
(e) 0, 1 (f) −1,0
9.3 OPERATIONSWITHCOMPLEXNUMBERS
Twocomplexnumbersareequalifandonlyiftheirrealpartsareequalandtheirimaginaryparts
areequal.
Example9.5 Findxandyso thatx+6jand 3 −yjrepresentthe samecomplex number.
Solution If both quantitiesrepresentthe same complex number wehave
x+6j=3−yj
Sincethe real parts mustbeequal wecan equatethem,thatis
x = 3
Similarly, wefind, by equating imaginary parts
6=−y
so thaty = −6.
9.3 Operations with complex numbers 329
Theoperationsofaddition,subtraction,multiplicationanddivisioncanallbeperformed
on complex numbers.
9.3.1 Additionandsubtraction
Toaddtwocomplexnumberswesimplyaddtherealpartsandaddtheimaginaryparts;
tosubtractacomplexnumberfromanotherwesubtractthecorrespondingrealpartsand
subtractthe corresponding imaginary parts as shown inExample 9.6.
Example9.6 Ifz 1
= 3 −4jandz 2
= 4 +2jfindz 1
+z 2
andz 1
−z 2
.
Solution
z 1
+z 2
=(3−4j)+(4+2j)
=(3+4)+(−4+2)j
=7−2j
z 1
−z 2
=(3−4j)−(4+2j)
=(3−4)+(−4−2)j
=−1−6j
9.3.2 Multiplication
Wecanmultiplyacomplexnumberbyarealnumber.Boththerealandimaginaryparts
of the complex number aremultiplied by the real number. Thus 3(4 −6j) = 12 −18j.
To multiply two complex numbers we use the factthat j 2 = −1.
Example9.7 Ifz 1
= 2 −2jandz 2
= 3 +4j, findz 1
z 2
.
Solution
z 1
z 2
= (2 −2j)(3 +4j)
Removing brackets wefind
z 1
z 2
=6−6j+8j−8j 2
=6−6j+8j+8 usingj 2 =−1
=14+2j
Example9.8 Ifz = 3 −2jfindzz.
Solution Ifz = 3 −2jthen itsconjugate isz = 3 +2j. Therefore,
zz = (3 −2j)(3 +2j)
=9−6j+6j−4j 2
=9−4j 2
= 13
We see thatthe answer isareal number.
330 Chapter 9 Complex numbers
Whenever we multiplyacomplex number by its conjugate the answer is a real number.
Thusifz=a+bj
zz = (a +bj)(a −bj)
=a 2 +baj−abj−b 2 j 2
=a 2 +b 2
Ifz=a+bjthenzz=a 2 +b 2
9.3.3 Division
To divide two complex numbers it is necessary to make use of the complex conjugate.
We multiply both the numerator and denominator by the conjugate of the denominator
and then simplifythe result.
Example9.9 Ifz 1
= 2 +9jandz 2
= 5 −2jfind z 1
z 2
.
Solution We seek 2+9j . The complex conjugate of the denominator is 5 + 2j, so we multiply
5−2j
bothnumeratoranddenominatorbythisquantity.Theeffectofthisistoleavethevalue
of z 1
unaltered since wehave only multiplied by 1.Therefore,
z 2
z 1
= 2+9j (2 +9j) (5 +2j)
=
z 2
5−2j (5 −2j) (5 +2j)
=
10 +45j +4j +18j2
25+4
=
−8 +49j
29
= − 8
29 + 49
29 j
The multiplication of two conjugates in the denominator allows a usefulsimplification.
Weseethattheeffectofmultiplyingbytheconjugateofthedenominatoristomakethe
denominator of the solution purely real.
Ifz 1
=x 1
+jy 1
andz 2
=x 2
+jy 2
thenthequotient z 1
z 2
isfoundbymultiplyingboth
numerator and denominator by the conjugate of the denominator, thatis
z 1
z 2
= x 1 +jy 1
x 2
+jy 2
= x 1 +jy 1
x 2
+jy 2
× x 2 −jy 2
x 2
−jy 2
= x 1 x 2 +y 1 y 2 +j(x 2 y 1 −x 1 y 2 )
x 2 2 +y2 2
9.3 Operations with complex numbers 331
EXERCISES9.3
1 Ifz 1 =3+2jandz 2 =4−8jfind
(a) z 1 +z 2 (b) z 1 −z 2 (c) z 2 −z 1
2 Expressthe following in the forma +bj:
1
(a) (b) −2
1 +j j
1
(c)
j + 1 j
(d)
2 −j 1 +j
3
(e)
3+2j + 1
5 −j
3 Expressthe following in the forma +bj:
2
(a) (b) −2+3j
1 −j j
(c)3j(4 −2j)
(e) 5+3j
2+2j
(b) (7 −2j)(5 +6j)
4 Find aquadraticequationwhose rootsare 1 −3j and
1+3j.
5 If (x +jy) 2 =3+4j,findxandy,wherex,y ∈ R.
6 Find the real andimaginaryparts of
(a)
2
4 +j − 3
2 −j
(b) j 4 −j 5
(c) 1 j +j (d) 1
j 3 −3j
Solutions
1 (a)7−6j (b) −1+10j (c) 1−10j 4 x 2 −2x+10=0
1
2 (a)
2 − 1 2 j (b) 2j (c) 2 5 − 4 5 j 5 x=2,y=1;x=−2,y=−1
1
(d)
2 + 1 23 −11j
j (e) 6 (a) − 62
2 26
85 , −61 85
(b) 1, −1
3 (a) 1+j (b) 3+2j (c) 6+12j
(d) 47 +32j (e) 2 − 1 (c) 0, 0
2 j (d) 0, 1 4
TechnicalComputingExercises9.3
1 Many technicalcomputing languagesallow the user
toinputandmanipulatecomplex numbers.Investigate
howthe complex numbera +bjisinput to the
software to whichyou have access.
2 Usetechnicalcomputing software to simplifythe
followingcomplex numbers:
(a) (1 +j) 5 3j
(b)
(1 −2j) 7
4
(c)
(3 +j) 3 − 7
(2 −3j) 2
3 Solvethe polynomial equations
(a) x 3 +7x 2 +9x+63=0
(b) x 4 +15x 2 −16=0
YoumayrecallfromChapter1thatin MATLAB ® the
rootsfunctioncan be used to solvepolynomial
equationswith real roots.This functioncan also
calculatecomplex roots.
4 Use apackageto findthe real andimaginaryparts of
15+j
7−5j .
5 Acommonrequirementincontroltheoryistofindthe
poles ofarationalfunction. Useapackageto findthe
poles ofthe function
0.5
G(s) =
s 3 +2s 2 +0.2s +0.0556
To solvethisproblem automatically mayrequire the
use ofspecialist toolboxesin MATLAB ® orGNU
Octave which are not partofthe core program.For
example, the Signal Processingtoolboxin
MATLAB ® has the functiontf2zp which convertsa
transfer function expressed asaratio ofpolynomials
to pole/zero form.
332 Chapter 9 Complex numbers
7
6
z 2 = –3+5j
Imaginary axis
5
4
3
2
z 1 = 7+2j
1
Imaginary axis
y
b
Figure9.1
Argand diagram.
a + bj
(a, b)
a x
Real axis
–4 –3 –2 –1 1 2 3 4 5 6 7
–1
Real axis
z 3 = –1–2j
–2
–3
–4
–5
z 4 = –4j
Figure9.2
Argand diagram forExample9.10.
9.4 GRAPHICALREPRESENTATIONOFCOMPLEXNUMBERS
Given a complex numberz = a +bj we can obtain a useful graphical interpretation of
it by plotting the real part on the horizontal axis and the imaginary part on the vertical
axis and obtain a unique point in thex--y plane (Figure 9.1). We call thexaxis the real
axis and theyaxis the imaginary axis, and the whole picture an Argand diagram. In
this context, thex--y plane isoften referredtoasthe complex plane.
Example9.10 Plotthecomplexnumbersz 1
= 7 +2j,z 2
= −3 +5j,z 3
= −1 −2jandz 4
= −4jonan
Argand diagram.
Solution TheArgand diagramisshown inFigure 9.2.
EXERCISES9.4
1 Plotthe followingcomplex numbersonan Argand
diagram:
(a) z 1 =−3−3j
(b) z 2 =7+2j
(c) z 3 =3
(d) z 4 = −0.5j
(e) z 5 = −2
2 (a) Plotthe complex numberz = 1 +jon anArgand
diagram.
(b) Simplify the complex number j(1 +j)andplot
the result onyour Arganddiagram.Observe that
theeffectofmultiplyingthecomplexnumberbyj
isto rotatethe complex numberthrough an angle
of π/2 radiansanticlockwise aboutthe origin.
9.5 Polar form of a complex number 333
Solutions
1 SeeFigure S.17.
2 (a) SeeFigure S.18. (b) j(1 +j) = −1 +j
Imaginary
Imaginary
–2
7+2j
–1+j 1
1+j
–0.5j
3
Real
–3–3j
FigureS.17
–1
FigureS.18
1
Real
9.5 POLARFORMOFACOMPLEXNUMBER
ItisoftenusefultoexchangeCartesiancoordinates (a,b)forpolarcoordinatesrand
θ asdepictedinFigure 9.3.
FromFigure 9.3 wenotethat
cosθ = a r
sinθ = b r
and so,
a=rcosθ
b=rsinθ
Furthermore,
tanθ = b a
UsingPythagoras’stheoremweobtainr = √ a 2 +b 2 .Byfindingrand θ wecanexpress
the complex numberz =a +bj in polar formas
z=rcosθ+jrsinθ =r(cosθ+jsinθ)
whichweoftenabbreviatetoz =r̸ θ.Clearly,risthe‘distance’ofthepoint (a,b)from
the origin and is called the modulus of the complex numberz. The modulus is always
a non-negative number and is denoted |z|. The angle is conventionally measured from
the positivexaxis. Angles measured in an anticlockwise sense are regarded as positive
whilethosemeasuredinaclockwisesenseareregardedasnegative.Theangle θ iscalled
the argument ofz, denoted arg(z). Since adding or subtracting multiples of 2π from θ
will result in the ‘arm’ in Figure 9.3 being in the same position, the argument can have
many values. Usuallywe shall choose θ tosatisfy −π < θ π.
Cartesian form:z =a +bj
Polarform:z =r(cosθ +jsinθ) =r̸
|z|=r= √ a 2 +b 2
θ
a=rcosθ b=rsinθ tanθ = b a
334 Chapter 9 Complex numbers
y
y
b
r
u
a
(a, b)
x
7– 4
p
p –4 –
1
2
–1 (1, –1)
x
Figure9.3
Polar andCartesian formsofacomplex
number.
Figure9.4
Argand diagram depictingz = 1 −jin
Example 9.11.
Note that
r̸
(−θ) =r(cos(−θ) +jsin(−θ))
=r(cosθ −jsinθ)
=z
Ifz=a+bjthenz=a−bjandz =r̸
(−θ).
Example9.11 Depict the complex numberz = 1 − j on an Argand diagram and convert it into polar
form.
Solution Therealpartofzis1andtheimaginarypartis −1.Wethereforeplotapointinthex--y
plane withx = 1 andy = −1 asshown inFigure 9.4.
FromFigure9.4weseethatr = √ 1 2 + (−1) 2 = √ 2and θ = −45 ◦ or −π/4radians.
Thereforez = 1 −j = √ 2̸ (−π/4).
To express a complex number in polar form it is essential to draw an Argand diagram
and notsimplyquoteformulae, asthe following example will show.
Example9.12 Expressz = −1 −j inpolar form.
Solution If we use the formula |z| =r= √ a 2 +b 2 ,wefindthatr = √ 2. Using tan θ =b/a, we
find that tanθ = −1/ −1 = 1 so that you may be tempted to take θ = π/4. Figure 9.5
√shows the Argand diagram and it is clear that θ = −3π/4. Therefore,z = −1 − j =
2̸ −3π/4,and we see the importance ofdrawing an Argand diagram.
y
–1
2
5p —4
3p —4 –
–1
x
Figure9.5
Argand diagramdepictingz = −1 −jin Example 9.12.
9.5.1 Multiplicationanddivisioninpolarform
9.5 Polar form of a complex number 335
ThepolarformmayseemmorecomplicatedthantheCartesianformbutitisoftenmore
useful. For example, suppose wewant tomultiplythe complex numbers
We find
z 1
=r 1
(cosθ 1
+jsinθ 1
) and z 2
=r 2
(cosθ 2
+jsinθ 2
)
z 1
z 2
=r 1
(cosθ 1
+jsinθ 1
)r 2
(cosθ 2
+jsinθ 2
)
=r 1
r 2
{(cosθ 1
cosθ 2
−sinθ 1
sinθ 2
) +j(sinθ 1
cosθ 2
+sinθ 2
cosθ 1
)}
which can be written as
r 1
r 2
{cos(θ 1
+ θ 2
) +jsin(θ 1
+ θ 2
)}
using the trigonometric identities of Section 3.6. This is a new complex number which,
ifwecomparewiththegeneralformr(cosθ +jsin θ),weseehasamodulusofr 1
r 2
and
an argument of θ 1
+ θ 2
. To summarize: to multiply two complex numbers we multiply
their moduliand add their arguments, thatis
z 1
z 2
=r 1
r 2̸ (θ 1
+θ 2
)
Example9.13 Ifz 1
= 3̸ π/3 andz 2
= 4̸ π/6 findz 1
z 2
.
Solution Multiplyingthemoduliwefindr 1
r 2
= 12,andaddingtheargumentswefind θ 1
+ θ 2
=
π/2.Thereforez 1
z 2
= 12̸ π/2.
Asimilardevelopmentshowsthattodividetwocomplexnumberswedividetheirmoduli
and subtracttheirarguments,that is
z 1
z 2
= r 1
r 2
̸ (θ 1
−θ 2
)
Example9.14 Ifz 1
= 3̸ π/3 andz 2
= 4̸ π/6 findz 1
/z 2
.
Solution Dividing the respective moduli, we find r 1
/r 2
= 3/4 and subtracting the arguments,
π/3 − π/6 = π/6.Hencez 1
/z 2
= 0.75̸ π/6.
EXERCISES9.5
1 Markonan Arganddiagrampointsrepresenting
z 1 =3−2j,z 2 =−j,z 3 =j 2 ,z 4 =−2−4jand
z 5 = 3. Find the modulus andargumentofeach
complex number.
2 Expressthe following complex numbersin polar
form:
(a) 3 −j (b) 2 (c) −j (d) −5 +12j
3 Find the modulusandargumentof(a)z 1 = − √ 3 +j
and (b)z 2 = 4 +4j. Hence expressz 1 z 2 andz 1 /z 2 in
polar form.
4 Express √ 2̸ π/4,2̸ π/6and2̸ −π/6 in Cartesian
forma +bj.
5 Prove the result z 1
z 2
= r 1
r 2
̸ θ 1 −θ 2 .
(
336 Chapter 9 Complex numbers
6 Expressz = 1 ,where ω andC are real constants,
jωC
in the forma +bj.Plotzonan Argand diagram.
7 Ifz 1 = 4(cos40 ◦ +jsin40 ◦ )andz 2 = 3(cos70 ◦ +
jsin70 ◦ ),expressz 1 z 2 andz 1 /z 2 in polar form.
8 Simplify
( √ 2̸ (5π/4)) 2 (2̸ (−π/3)) 2
2̸ (−π/6)
Solutions
1 |z 1 | = √ 13,arg(z 1 ) = −0.5880
|z 2 | = 1,arg(z 2 ) = −π/2
|z 3 | = 1,arg(z 3 ) = π
|z 4 | = √ 20,arg(z 4 ) = −2.0344
|z 5 | = 3,arg(z 5 ) = 0
2 (a) √ 10̸ −0.3218 (b) 2̸ 0
(c)1̸ −π/2 (d) 13̸ 1.9656
3 (a) 2,5π/6
(b) 4 √ 2,π/4
z 1 z 2 =8 √ 2̸ 13π/12
√
2
z 1 /z 2 =
4 ̸ 7π/12
4 1+j, √ 3+j, √ 3 −j
6 − j
ωC
7 z 1 z 2 = 12(cos110 ◦ +jsin110 ◦ )
8 4
z 1 /z 2 = 4 3 (cos30◦ −jsin30 ◦ )
9.6 VECTORSANDCOMPLEXNUMBERS
It is often convenient to represent complex numbers by vectors in the x--y plane. Figure
9.6(a) shows the complex numberz = a + jb. Figure 9.6(b) shows the equivalent
vector. Figure 9.7 shows the complex numbersz 1
= 2 +j andz 2
= 1 +3j.
If we now evaluate z 3
= z 1
+z 2
we find z 3
= 3 + 4j which is also shown. If we
form a parallelogram, two sides of which are the representations ofz 1
andz 2
, we find
thatz 3
isthediagonaloftheparallelogram.Ifweregardz 1
andz 2
asvectorsintheplane
we see that there is a direct analogy between the triangle law of vector addition (see
Section7.2.3)and the addition ofcomplex numbers.
y
4
y
b
z = a + jb
y
b
(
a
b
3
2
z 2
z 3
a x a x
1
z 1
(a)
Figure9.6
Thecomplex numberz =a +jbandits equivalent vector.
(b)
0 1 2 3 4
Figure9.7
Vector addition in the complex plane.
x
9.7 The exponential form of a complex number 337
y
B
z 2
OC
z 1
BA
A
C
y
5
4
3
2
1
B
z 2
z 1 + z 2
z 1 – z 2
z 1
A
C
O
x
O 1
2 3 4 5 6 7 8 9 10
x
Figure9.8
Vector addition and subtraction.
Figure9.9
Diagram forExample9.15.
More generally, if z 1
and z 2
are any complex numbers represented on an Argand
diagram by the vectors OA ⃗ and OB ⃗ (Figure 9.8) then upon completing the parallelogram
OBCA, the sum z 1
+ z 2
is represented by the vector OC. ⃗ We can also obtain
a representation of the difference of two complex numbers in the following way. If
wewrite
z 3
=z 1
−z 2
=z 1
+(−z 2
)
andnotethatifz 2
isrepresentedbythevectorOB,then ⃗ −z 2
isrepresentedbythevector
−OB ⃗ = BO ⃗ = CA. ⃗ The complex number z 1
is represented by OA ⃗ = BC, ⃗ so that
z 1
+ (−z 2
) = BC ⃗ + CA ⃗ = BA. ⃗ Thus the difference z 1
− z 2
is represented by the
diagonal BA. To summarize, the sum and difference of z 1
and z 2
are represented by
the two diagonals of the parallelogram OBCA.
Example9.15 Represent z 1
= 6 + j and z 2
= 3 + 4j, and their sum and difference, on an Argand
diagram.
Solution WedrawvectorsOAand ⃗ OBrepresentingz ⃗ 1
andz 2
,respectively (Figure9.9).Thenwe
completetheparallelogramOACBasshown.Thesumz 1
+z 2
isthenrepresentedby ( ) OC ⃗
andthedifferencez 1
−z 2
byBA.Itiseasytoseethatthevector ⃗ BA ⃗ 3
(
= ,thatisit
−3
representsthecomplexnumber3 −3j,whileOC ⃗ 9
= ,thatisitrepresents9 +5j,so
5)
thatz 1
+z 2
=9+5jandz 1
−z 2
=3−3j.
9.7 THEEXPONENTIALFORMOFACOMPLEXNUMBER
You will recall from Chapter 6 that many functions possess a power series expansion,
thatisthefunctioncanbeexpressedasthesumofasequenceoftermsinvolvinginteger
powers ofx. For example,
e x =1+x+ x2
2! + x3
3! +···
338 Chapter 9 Complex numbers
andthisrepresentationisvalidforanyrealvalueofx.Theexpressiononther.h.s.is,of
course, an infinite sum but its terms get smaller and smaller, and as more are included,
the sum weobtain approaches e x . Other examples ofpower series include
and
sinx=x− x3
3! + x5
−··· (9.3)
5!
cosx=1− x2
2! + x4
−··· (9.4)
4!
whicharealsovalidforanyrealvalueofx.Itisusefultoextendtherangeofapplicability
of these power series by allowing x to be a complex number. That is, we define the
function e z tobe
e z =1+z+ z2
2! + z3
3! +···
andtheorybeyondthescopeofthisbookcanbeusedtoshowthatthisrepresentationis
valid for all complex numbersz.
We have already seen thatwecan express a complex number inpolar form:
z=r(cosθ+jsinθ)
Using Equations (9.3) and (9.4) wecan write
z =r
{(1 −···)
− θ2
2! + θ4
+j
(θ −···)}
− θ3
4! 3! + θ5
5!
=r
(1 ···)
+jθ− θ2
2! −jθ3 3! + θ4
4! +jθ5 5!
Furthermore, wenote thate jθ can be written as
so that
e jθ =1+jθ+ j2 θ 2
2!
+ j3 θ 3
3!
+···
=1+jθ− θ2
2! −jθ3 3! +···
z=r(cosθ+jsinθ)=re jθ
This is yet another form of the same complex number which we call the exponential
form. We see that
e jθ =cosθ+jsinθ (9.5)
Itis straightforward toshowthat
e −jθ =cosθ−jsinθ
9.7 The exponential form of a complex number 339
Therefore,ifz =r(cos θ −jsin θ)wecanequivalentlywritez =re −jθ .Thetwoexpressions
for e jθ and e −jθ are known as Euler’s relations. From these it is easy to obtain the
following usefulresults:
cosθ = ejθ +e −jθ
2
sinθ = ejθ −e −jθ
2j
Example9.16 WesawinSection3.7thatawaveformcanbewrittenintheform f (t) =Acos(ωt +φ).
Consider the complex numbere j(ωt+φ) . We can use Euler’s relations towrite
e j(ωt+φ) = cos(ωt + φ) +jsin(ωt + φ)
and hence,
f(t) =ARe(e j(ωt+φ) )
EXERCISES9.7
1 Find the modulusandargumentof
(a) 3e jπ/4 (b) 2e −jπ/6 (c) 7e jπ/3
2 Find the real andimaginaryparts of
(a) 5e jπ/3
(c) 11e jπ
(b) e j2π/3
(d) 2e −jπ
3 Expressz = 6(cos30 ◦ +jsin30 ◦ ) in exponential
form.Plotzonan Arganddiagramandfind itsreal
andimaginaryparts.
4 If σ, ω,T ∈ R,findthe real andimaginaryparts of
e (σ+jω)T .
5 Expressz = e 1+jπ/2 in the forma +bj.
6 Express −1 −jin the formre jθ .
7 Express
(a) 7 +5j and
1
(b)
2 − 1 jin exponential form.
3
8 Expressz 1 = 1 −jand
z 2 = 1 +j √
3 −j
in the formre jθ .
Solutions
1 (a) 3, π/4 (b) 2,−π/6 (c) 7,π/3
2 (a) 2.5, 4.3301 (b) −0.5, 0.8660 (c) −11,0
(d) −2, 0
3 6e (π/6)j ,Re(z) = 5.1962,Im(z) = 3
4 Real part: e σT cos ωT, imaginarypart: e σT sin ωT
5 ej
6 √ 2e (−3π/4)j
7 (a) √ 74e 0.62j (b) 0.60e −0.59j
8 √ 2e (−π/4)j ,
1
√
2
e (5π/12)j
340 Chapter 9 Complex numbers
9.8 PHASORS
Electrical engineers are often interested in analysing circuits in which there is an a.c.
power supply. Almost invariably the supply waveform is sinusoidal and the resulting
currentsandvoltageswithinthecircuitarealsosinusoidal.Forexample,atypicalvoltage
isof the form
v(t) =Vcos(ωt +φ) =Vcos(2πft +φ) (9.6)
whereV is the maximum or peak value, ω is the angular frequency, f is the frequency,
and φ is the phase relative to some reference waveform. This is known as the time
domain representation. Each of the voltages and currents in the circuit has the same
frequency as the supply but differs inmagnitude and phase.
In order to analyse such circuits it is necessary to add, subtract, multiply and divide
these waveforms. If the time domain representation is used then the mathematics
becomes extremely tedious. An alternative approach is to introduce a waveform representation
known as a phasor. A phasor is an entity consisting of two distinct parts: a
magnitude and an angle. It is possible to represent a phasor by a complex number in
polarform.Thefixedmagnitudeofthiscomplex number corresponds tothemagnitude
of the phasor and hence the amplitude of the waveform. The argument of this complex
number, φ, corresponds to the angle of the phasor and hence the phase angle of the
waveform. Figure 9.10 shows a phasor for the sinusoidal waveform of Equation (9.6).
The time dependency of the waveform is catered for by rotating the phasor anticlockwise
at an angular frequency, ω. The projection of the phasor onto the real axis
gives the instantaneous value of the waveform. However, the main interest of an engineer
is in the phase relationships between the various sinusoids. Therefore the phasors
are‘frozen’atacertainpointintime.Thismaybechosensothatt = 0oritmaybechosensothataconvenientphasor,knownasthereferencephasor,alignswiththepositive
realaxis.Thisapproachisvalidbecausethephaseandmagnituderelationshipsbetween
the various phasors remain the same at all points in time once a circuit has recovered
from any initial transients caused by switching.
Sometextbooksrefertophasorsasvectors.Thiscanleadtoconfusionasitispossible
todividephasorswhereasdivisionofvectorsbyothervectorsisnotdefined.Inpractice
this is not a problem as phasors, although thought of as vectors, are manipulated as
complex numbers, whichcan bedivided. We willavoidthese conceptual difficulties by
introducing a different notation. We will denote a phasor byṼ, which corresponds to
V ̸ φ incomplexnumbernotation(seeFigure9.10).Thus,forexample,acurrenti(t) =
Icos(ωt+φ)wouldbewrittenĨinphasornotationandI̸ φincomplexnumbernotation.
Many engineers use the root mean square (r.m.s.) value of a sinusoid as the magnitude
of a phasor. The justification for this is that it represents the value of a d.c. signal
that would dissipate the same amount of power in a resistor as the sinusoid. For
y
v
V
~
V
f
x
Figure9.10
Illustration ofthe phasorṼ =V̸ φ where ω =angular
frequency with which the phasor rotates.
9.8 Phasors 341
example, in the case of a current signal, Irms 2 R is the average power dissipated by the
sinusoidIcos(ωt +φ)inaresistorR.Forthecaseofasinusoidalsignalther.m.s.value
ofthesignalis1/ √ 2timesthepeakvalueofthesignal(seeSection15.3,Example15.4,
foraproof of this).We will not adopt thisapproach butitisacommon one.
We start by examining the phasor representation of individual circuit elements. In
order todo thiswe need a phasor form of Ohm’s law. This is
Ṽ
=ĨZ (9.7)
where Ṽ is the voltage phasor, Ĩ is the current phasor and Z is the impedance of an
element or group of elements and may be a complex quantity. Note that phasors and
complex numbers are mixed together in the same equation. This is a common practice
because phasors areusually manipulated as complex numbers.
9.8.1 Resistor
Experimentally it can be shown that if an a.c. voltage is applied to a resistor then the
currentisinphasewiththevoltage.Theratioofthemagnitudeofthetwowaveformsis
equal to the resistance,R. So, givenĨ = I̸ 0,Z=R̸ 0, using Equation (9.7) we have
Ṽ=IR̸ 0.This isillustratedinFigure 9.11.
9.8.2 Inductor
Foraninductorweknowfromexperimentthatthevoltageleadsthecurrentbyaphaseof
π/2, and so the phase angle of the impedance is π/2. We also know that the magnitude
oftheimpedanceisgivenby ωL.So,givenĨ =I̸ 0,Z=ωL̸ π/2,usingEquation(9.7)
we haveṼ =IωL̸ π/2. An alternative way of representingZ for an inductor is to use
the Cartesian form, that is
(
Z=ωLe jπ/2 = ωL cos π )
2 +jsinπ 2
= jωL
Thisisusefulwhenphasorsneedtobeaddedandsubtracted.Thephasordiagramforan
inductor isillustratedinFigure 9.12.
9.8.3 Capacitor
For a capacitor it is known that the voltage lags the current by a phase of π/2 and the
magnitude of the impedance is given by 1
ωC . So givenĨ = I̸ 0,Z= 1
ωC ̸ −π/2, we
y
y
p –2
~
I
V ~
x
~
I x
V ~
Figure9.11
Phasordiagram foraresistor.
Figure9.12
Phasordiagram foran inductor.
342 Chapter 9 Complex numbers
have, using Equation (9.7),Ṽ = I
ωC ̸ −π/2. Alternatively,
Z = e−jπ/2
ωC = 1 (
cos π )
ωC 2 −jsinπ 2
= − j
ωC
Engineers often prefer torewrite this lastexpression as
1
jωC
The phasor diagram forthe capacitor isillustratedinFigure 9.13.
Wehaveshownhowphasorscanbemultipliedbyacomplexnumber;divisionisvery
similar.Addition ofphasors will nowbeillustrated;subtraction issimilar.Consider the
circuit shown in Figure 9.14 in which a resistor, capacitor and inductor are connected
in series and fed by an a.c. source. As this is a series circuit, the current through each
element,Ĩ,isthesamebyKirchhoff’scurrentlaw.ByKirchhoff’svoltagelawthevoltage
rise produced by the supply,Ṽ S
, must equal the sum of the voltage drops across the
elements. Therefore,
Ṽ S
=ṼR +ṼC +ṼL
Note that this is an addition of phasors so that the voltage drops across the elements
do not necessarily have the same phase. The phasor diagram for the circuit is shown in
Figure 9.15, inthiscase with |ṼL | > |ṼC |;Ĩ isthe reference phasor.
Notethatthephasoradditionoftheelementvoltagesgivestheoverallsupplyvoltage
for a particular supply current. If the magnitude of these element voltage phasors is
known then it is possible to calculate the supply voltage graphically. In practice it is
easier to convert the polar form of the phasors into Cartesian form and use algebra to
analyse the circuit.
Now,
Ṽ R
=ĨR̸
Ṽ L
=ĨωL̸
Ṽ C
=
0 =ĨR
π/2 =ĨjωL
Ĩ Ĩ
ωC ̸ −π/2 =
jωC
y
~ I x
–
p –2
V ~
v
~
I
R
~
V R
~
~
~ V S
C V C
L
~
V L
y
~
V L
~
V C
~
V S
~ ~
I V x R
Figure9.13
Phasordiagram foracapacitor.
Figure9.14
RLC circuit.
Figure9.15
Phasordiagram forthe circuit in
Figure 9.14.
9.8 Phasors 343
Therefore,
Ṽ S
=ṼR +ṼC +ṼL
Ĩ
=ĨR +ĨjωL +
jωC
(
=Ĩ R +jωL+ 1 )
jωC
Therefore the impedance of the circuit is Z = R + jωL + 1 . We can calculate the
jωC
frequency forwhich the impedance of the circuithas minimum magnitude:
Now
Z=R+jωL+ 1
jωC
=R+jωL− j
(
=R+j
|Z| =
√
R 2 +
ωC
ωL − 1
ωC
)
(
ωL − 1 ) 2
ωC
and so, as ω varies |Z| isaminimum when
ωL − 1
ωC = 0
ω 2 = 1
√
LC
1
ω =
LC
This minimum value is |Z| = R. Examining Figure 9.15 it is clear that the minimum
impedance occurs whenṼL andṼC have the same magnitude, in which caseṼS has no
imaginary component. The frequency at which this occurs is known as the resonant
frequencyof the circuit.
Engineeringapplication9.1
ThePoyntingvector
An electromagnetic wave freely travelling in space has electric and magnetic field
componentswhichoscillateatrightanglestoeachotherandtothedirectionofpropagation.
This type of wave is known as a transverse wave. Figure 9.16 illustrates
anelectromagneticwavetravellinginfreespace.ThePoyntingvectorisusedtodescribetheenergyfluxassociatedwithanelectromagneticwave.IthasunitsofWm
−2
andisapowerperunitarea,thatisapowerdensity.Ifthetotalpowerassociatedwith
anelectromagneticwavefrontisrequiredthenthePoyntingvectorcanbeintegrated
over anareaofinterest.
ThePoynting vector, S, forelectricfield,E, and magnetic field, H,is definedas
S=E×H
➔
344 Chapter 9 Complex numbers
Electric field
Magnetic field
x
y
z
Figure9.16
An electromagnetic wave travelling in freespace.
where E and H are vector quantities. Note that S is also a vector quantity as this
expression isavector product.
It is possible to specify magnitude and phase of the field quantities using complex
numbers; that is, in terms of phasors. Consider the case when E and H are the
following:
E=(2+j0.5)i+0j+0k
H = 0i + (3.5 +j0.25)j +0k
HereCartesiancoordinateshavebeenusedtodefinethetwofieldquantitiesandiisa
unitvectorinthexdirection,jisaunitvectorintheydirectionandkisaunitvector
inthezdirection.Examiningthesetwotermswenotethattheelectricfieldisaligned
inthexdirection and the magnetic field isaligned intheydirection.
Calculating the Poynting vector we obtain
i j k
S=E×H=
2+j0.5 0 0
∣0 3.5 +j0.25 0 ∣
= 0i −0j + (2 +j0.5)(3.5 +j0.25)k = (6.875 +j2.25)k
Weseethattheonlynon-zerocomponentisalignedinthezdirection.Thisisconsistentwithourunderstandingofatransversewaveinthatthedirectionofpropagation
is at right angles to the two field components. This implies that the energy flow of
the electromagnetic wave isinthezdirection.
9.9 DEMOIVRE’STHEOREM
AveryimportantresultincomplexnumbertheoryisDeMoivre’stheoremwhichstates
that ifn ∈ N,
(cosθ +jsinθ) n = cosnθ +jsinnθ (9.8)
Example9.17 VerifyDe Moivre’s theorem whenn = 1 andn = 2.
Solution Whenn = 1,the theorem states:
(cosθ +jsinθ) 1 =cos1θ +jsin1θ
which isclearly true,and the theorem holds. Whenn = 2,wefind
(cosθ +jsinθ) 2 = (cosθ +jsinθ)(cosθ +jsinθ)
=cos 2 θ+jsinθcosθ+jcosθsinθ+j 2 sin 2 θ
=cos 2 θ −sin 2 θ +j(2sinθcosθ)
9.9 De Moivre’s theorem 345
Recallingthetrigonometricidentitiescos2θ = cos 2 θ − sin 2 θ andsin2θ = 2sin θ cos θ,
wecan write the previous expression as
cos2θ +jsin2θ
Therefore,
(cosθ +jsinθ) 2 =cos2θ +jsin2θ
and DeMoivre’s theorem has been verified whenn = 2.
The theorem also holds whennis a rational number, that isn = p/q where pandqare
integers.Thus wehave
(cosθ +jsinθ) p/q =cos p q θ+jsinp q θ
Inthisform itcan be used toobtain roots of complex numbers. For example,
√
3
cosθ +jsinθ =(cosθ +jsinθ) 1/3 =cos 1 3 θ+jsin1 3 θ
Insuchacasetheexpressionobtainedisonlyoneofthepossibleroots.Additionalroots
can be found as illustratedinExample 9.18.
De Moivre’s theorem is particularly important for the solution of certain types of
equation.
Example9.18 Findallcomplex numberszwhich satisfy
z 3 =1 (9.9)
Solution The solution of this equation is equivalent to finding the solutions ofz = 1 1/3 ; that is,
findingthecuberootsof1.Sinceweareallowingztobecomplex,thatisz ∈ C,wecan
write
z=r(cosθ+jsinθ)
Then,usingDeMoivre’s theorem,
z 3 =r 3 (cosθ +jsinθ) 3
=r 3 (cos3θ +jsin3θ)
Wenextconverttheexpressiononther.h.s.ofEquation(9.9)intopolarform.Figure9.17
shows the number1 = 1 +0jon anArgand diagram.
From the Argand diagram we see that its modulus is 1 and its argument is 0, or possibly
±2π, ±4π,..., thatis2nπ wheren ∈ Z.Consequently, wecan write
1 =1(cos2nπ +jsin2nπ) n ∈ Z
346 Chapter 9 Complex numbers
z 2
1
y
y
1 x
2p —3
4p —3
1 z 1
1
z 3
x
Figure9.17
The complex numberz = 1 +0j.
Figure9.18
Solutionsofz 3 = 1.
Using the polar form, Equation (9.9) becomes
r 3 (cos3θ +jsin3θ) = 1(cos2nπ +jsin2nπ)
Comparing both sides of thisequation we see that
and
r 3 =1 thatis r=1sincer∈R
3θ = 2nπ thatis θ = 2nπ/3 n ∈ Z
Apparently θ can take infinitely many values, but, as we shall see, the corresponding
complex numbers aresimplyrepetitions. Whenn = 0,we find θ = 0,so that
z=z 1
=1(cos0+jsin0)=1
isthe first solution.Whenn = 1 wefind θ = 2π/3,so that
(
z=z 2
=1 cos 2π )
3 +jsin2π = − 1 √
3
3 2 +j 2
isthe second solution.Whenn = 2 we find θ = 4π/3,so that
(
z=z 3
=1 cos 4π )
3 +jsin4π = − 1 √
3
3 2 −j 2
is the third solution. If we continue searching for solutions using larger values ofnwe
find that we only repeat solutions already obtained. It is often useful to plot solutions
on an Argand diagram and this is easily done directly from the polar form, as shown in
Figure 9.18. We note thatthe solutions are equally spaced atangles of 2π/3.
Example9.19 Findthe complex numberszwhichsatisfyz 2 = 4j.
Solution Sincez ∈ Cwewritez =r(cos θ +jsin θ). Therefore,
z 2 =r 2 (cosθ +jsinθ) 2
=r 2 (cos2θ +jsin2θ)
9.9 De Moivre’s theorem 347
y
2
5p —4
p –4
x
2
Figure9.19
Solution ofz 2 = 4j.
by De Moivre’s theorem. Furthermore, 4j has modulus 4 and argument π/2 + 2nπ,
n ∈ Z, thatis
4j = 4{cos(π/2 +2nπ) +jsin(π/2 +2nπ)}
n ∈ Z
Therefore,
r 2 (cos2θ +jsin2θ) = 4{cos(π/2 +2nπ) +jsin(π/2 +2nπ)}
Comparing both sides of this equation, we see that
and
r 2 =4 andso r=2
2θ=π/2+2nπ andso θ=π/4+nπ
Whenn = 0wefind θ = π/4,andwhenn = 1wefind θ = 5π/4.Usinglargervaluesof
n simply repeats solutions already obtained. These solutions are shown in Figure 9.19.
We note that in this example the solutions are equally spaced at intervals of 2π/2 = π.
InCartesian form,
z 1
= 2 √
2
+j 2 √
2
= √ 2(1+j) and z 2
=− 2 √
2
−j 2 √
2
= − √ 2(1 +j)
Ingeneral, thenrootsofz n =a +jbare equallyspacedatangles2π/n.
Once the technique for solving equations like those in Examples 9.18 and 9.19 has
beenmastered,engineersfinditsimplertoworkwiththeabbreviatedformr̸ θ.Example
9.19 reworked inthisfashion becomes
Letz=r̸ θ, then z 2 =r 2̸ 2θ
Furthermore, 4j = 4̸ π/2 +2nπ, and hence ifz 2 = 4j, we have
( ) π
2 +2nπ
r 2̸
fromwhich
2θ=4̸
r 2 =4 and 2θ= π 2 +2nπ
as before.Rework Example 9.18 foryourself usingthisapproach.
348 Chapter 9 Complex numbers
Engineeringapplication9.2
Characteristicimpedance
In a later application we shall explain the concept of the characteristic impedance
ofanelectricaltransmissionlineinmoredetail.Fornow,wecanassumethatitisan
important electricalparameter represented by the equation
√
R+jωL
Z 0
=
G+jωC
whereR,L,GandCaretransmissionlineparametersand ωistheangularfrequency,
all of which are real numbers. If R and G are zero, as they would be for an ideal
transmissionline, thenZ 0
iswholly real because:
√ √
jωL L
Z 0
=
jωC = C
Alternatively, ifGorRis included then the equation involves taking the square root
of a complex number.
Given values ofR,G,L,C and ω we can write
R+jωL
G+jωC
in Cartesian form (see Example 9.9) and hence in polar form (see Section 9.5).
Application of De Moivre’s Theorem will then allow us to calculate the required
square roots tofindZ 0
.
Another application of De Moivre’s theorem is the derivation of trigonometric
identities.
Example9.20 UseDeMoivre’s theoremtoshow that
cos3θ =4cos 3 θ−3cosθ
and
sin3θ =3sinθ−4sin 3 θ
Solution We know that
(cosθ +jsinθ) 3 =cos3θ +jsin3θ
Expanding the l.h.s. wefind
cos 3 θ+3jcos 2 θsinθ−3cosθsin 2 θ−jsin 3 θ =cos3θ+jsin3θ
Equating the real parts gives
cos 3 θ −3cos θ sin 2 θ = cos3θ (9.10)
and equating the imaginarypartsgives
3cos 2 θ sinθ −sin 3 θ = sin3θ (9.11)
9.9 De Moivre’s theorem 349
Now, writing sin 2 θ = 1 −cos 2 θ in Equation (9.10) and cos 2 θ = 1 −sin 2 θ in Equation
(9.11),wefind
cos3θ =cos 3 θ −3cosθ(1−cos 2 θ)
=cos 3 θ+3cos 3 θ−3cosθ
=4cos 3 θ−3cosθ
sin3θ = 3(1 −sin 2 θ)sinθ −sin 3 θ
as required.
=3sinθ−4sin 3 θ
This technique allows trigonometric functions of multiples of angles to be expressed
in terms of powers. Sometimes we want to carry out the reverse process and express a
power interms ofmultiple angles. Consider Example 9.21.
Example9.21 Ifz = cos θ +jsin θ show that
z + 1 z =2cosθ
z−1 z =2jsinθ
and find similarexpressions forz n + 1 z n andzn − 1 z n.
Solution Consider the complex number
z=cosθ+jsinθ
UsingDe Moivre’s theorem,
1
z =z−1 = (cosθ +jsinθ) −1 = cos(−θ) +jsin(−θ)
Butcos(−θ) = cosθ andsin(−θ) = −sinθ,sothatifz = cosθ +jsinθ
1
z =cosθ−jsinθ
Consequently,
z + 1 z =2cosθ and z−1 z =2jsinθ
Moreover,
so that
z n = cosnθ +jsinnθ and z −n = cosnθ −jsinnθ
z n + 1 z n =2cosnθ and zn − 1 z n = 2jsinnθ
z n + 1 z n =2cosnθ and zn − 1 z n = 2jsinnθ
350 Chapter 9 Complex numbers
Example9.22 Show thatcos 2 θ = 1 (cos2θ +1).
2
Solution The formulae obtained in Example 9.21 allow us to obtain expressions for powers of
cosθ andsinθ.Since2cosθ =z+ 1 , squaring both sides wehave
z
(
2 2 cos 2 θ = z + 1 ) 2
=z 2 +2+ 1 z z 2
=
(z 2 + 1 )
+2
z 2
Butz 2 + 1 z =2cos2θ,so 2
2 2 cos 2 θ =2cos2θ+2
and therefore,
cos 2 θ = 1 2 cos2θ + 1 2
as required.
= 1 (cos2θ +1)
2
EXERCISES9.9
1 Express (cos θ +jsinθ) 9 and (cos θ +jsinθ) 1/2 in
the formcosnθ +jsinnθ.
2 UseDeMoivre’stheoremto simplify
(a) (cos3θ +jsin3θ)(cos4θ +jsin4θ)
cos8θ +jsin8θ
(b)
cos2θ −jsin2θ
3 Solve the equations
(a) z 3 +1=0
(b) z 4 =1+j
4 Find 3 √ 2 +2j and displayyoursolutionsonan
Argand diagram.
5 Prove the following trigonometricidentities:
(a) cos4θ =8cos 4 θ−8cos 2 θ+1
(b) 32sin 6 θ = 10 −15cos2θ +6cos4θ −cos6θ
6 Solvethe equationz 4 +25 = 0.
7 Findthe fifthrootsofjanddepictyoursolutionson
an Arganddiagram.
8 Show thatcos 3 θ = 1 (cos3θ +3cosθ).
4
9 Show thatsin 4 θ = 1 (cos4θ −4cos2θ +3).
8
10 Expresscos5θ in termsofpowers ofcos θ.
11 Expresssin5θ in terms ofpowersofsin θ.
12 Given e jθ = cos θ +jsin θ,proveDeMoivre’s
theoremin the form
(e jθ ) n =cosnθ+jsinnθ
Solutions
1 cos9θ +jsin 9θ,cos(θ/2) +jsin(θ/2)
2 (a) cos 7θ +jsin 7θ
(b) cos 10θ +j sin 10θ
3 (a)1̸ π/3 +2nπ/3 n = 0,1,2
(b) 2 1/8̸ π/16 +nπ/2 n = 0,1,2,3
4 8 1/6̸ π/12 +2nπ/3 n = 0,1,2
9.10 Loci and regions of the complex plane 351
6 √ 5̸ π/4+nπ/2 n=0,1,2,3
7 1̸ π/10 +2nπ/5 n = 0,1,2,3,4
10 16cos 5 θ−20cos 3 θ+5cosθ
11 16sin 5 θ−20sin 3 θ+5sinθ
9.10 LOCIANDREGIONSOFTHECOMPLEXPLANE
Regionsofthecomplexplanecanoftenbeconvenientlydescribedbymeansofcomplex
numbers. For example, the points that lie on a circle of radius 2 centred at the origin
(Figure 9.20) represent complex numbers all of which have a modulus of 2. The argumentsareanyvalueof
θ, −π < θ π.Wecandescribeallthepointsonthiscircleby
the simple expression
|z|=2
thatis,allcomplexnumberswithmodulus2.Wesaythatthelocus(orpath)ofthepoint
z is a circle, radius 2, centred at the origin. The interior of the circle is described by
|z| < 2 whileitsexterior isdescribed by |z| > 2.
Similarlyallpointslyinginthefirstquadrant(shadedinFigure9.21)havearguments
between 0 and π/2.This quadrantistherefore describedby the expression:
0 < arg(z) < π/2
y
y
2
z
y
2
x
x
p –4
arg z = p – 4
x
Figure9.20
A circle, radius2, centred atthe
origin.
Figure9.21
First quadrant ofthex--y plane.
Figure9.22
Locusofpointssatisfying
arg(z) = π/4.
Example9.23 Sketch the locus ofthe pointsatisfyingarg(z) = π/4.
Solution The set of points with arg(z) = π/4 comprises complex numbers whose argument is
π/4.Allthesecomplex numbers lie onthe lineshown inFigure 9.22.
Example9.24 Sketch the locus ofthe pointsatisfying |z −2| = 3.
Solution First mark the fixed point 2 on the Argand diagram labelling it ‘A’ (Figure 9.23). Consider
the complex numberzrepresented by the point P. From the vector triangle law of
addition
⃗OA + ⃗ AP = ⃗ OP
⃗AP = ⃗ OP − ⃗ OA
352 Chapter 9 Complex numbers
y
y
z
P
z such that |z – 2| = 3
1 2 3 4 5
x
A
0 1 2
x
Figure9.23
Pointszand2 +0j.
Figure9.24
Locusofpointssatisfying |z −2| = 3.
Recall from Section 9.6 that the graphical representation of the sum and difference of
vectors in the plane, and the sum and difference of complex numbers, are equivalent.
Sincevector ⃗ OPrepresentsthecomplexnumberz,andvector ⃗ OArepresentsthecomplex
number2, ⃗ AP = ⃗ OP− ⃗ OAwillrepresentz−2.Therefore |z−2|representsthedistance
betweenAandP.Wearegiventhat |z−2| = 3,whichthereforemeansthatPcanbeany
point such that its distance from A is 3. This means that P can be any point on a circle
ofradius3centredatA(2,0).ThelocusisshowninFigure9.24. |z −2| < 3represents
the interior of the circle while |z −2| > 3 represents the exterior. Alternatively we can
obtain the same result algebraically: given |z −2| = 3 and also thatz =x +jy, we can
write
|z−2|=|(x−2)+jy|=3
that is
√
(x−2)2 +y 2 =3
or
(x−2) 2 +y 2 =9
Generally,theequation (x−a) 2 +(y−b) 2 =r 2 representsacircleofradiusrcentredat
(a,b), soweseethat (x −2) 2 +y 2 = 9representsacircleofradius3centredat (2,0),
as before.
Example9.25 Usethe algebraicapproachtofind the locusofthe pointzwhichsatisfies
|z−1|= 1 2 |z−j|
Solution Ifz =x +jy, then we have
|(x−1)+jy|= 1 |x +j(y −1)|
2
Therefore,
(x−1) 2 +y 2 = 1 4 {x2 + (y−1) 2 }
9.10 Loci and regions of the complex plane 353
and so
thatis
4(x−1) 2 +4y 2 =x 2 +(y−1) 2
3x 2 −8x+3y 2 +2y+3=0
Bycompleting the square thismay be writteninthe form
(
3 x −
3) 4 2 (
+3 y + 1 ) 2
= 8 3 3
thatis
(
x − 4 2 (
+ y +
3) 1 ) 2
= 8 3 9
which represents a circle of radius √ ( 4
8/3 centred at
3 3)
,−1 .
EXERCISES9.10
1 Sketchthe locidefined by
(a) arg(z) = 0
(b) arg(z) = π/2
(c) arg(z −4) = π/4
(d) |2z| = |z −1|
2 Sketchthe regionsdefined by
(a) Re(z) 0
(b) Im(z) < 3
(c) |z| > 3
(d) 0 arg(z) π/2
(e) |z+2|3
(f) |z+j|>3
(g) |z−1|<|z−2|
3 Ifs = σ +jω sketchthe regionsdefined by
(a) σ0
(b)σ0
(c) −2ω2
Solutions
1 SeeFigure S.19.
arg(z) = 0
arg(z) = p/2
arg(z – 4) = p/4
2_
3
– 1_ 1_ 3 3
2|z| = |z – 1|
4
(a)
(b)
(c)
(d)
FigureS.19
354 Chapter 9 Complex numbers
2 See FigureS.20.
y
y
y
3
Im(z) , 3
3
|z| . 3
y
0 arg(z) p/2
Re(z) 0
x
x
x
x
(a)
(b)
(c)
(d)
3
y
y
2
|z+j| . 3
y
|z –1| , |z –2|
–2
x
|z+2| 3
–1
3
x
1 2
x
(e)
(f)
(g)
FigureS.20
3 See FigureS.21.
v v v
2
s > 0
–2 < v < 2
s < 0
s s s
–2
(a) (b) (c)
FigureS.21
REVIEWEXERCISES9
1 Show that
1
cosθ−jsinθ =cosθ+jsinθ.
2 Expressin Cartesian form
5+4j
(a)
5−4j
1
(c)
2+3j + 1
2−3j
1
(b)
2+3j
1
(d)
x−jy
3 Find the modulusandargumentof −j, −3,1 +j,
cosθ+jsinθ.
4 Markonan Arganddiagramvectors corresponding to
the followingcomplex numbers: −3 +2j, −3 −2j,
cosπ+jsinπ.
5 Expressin the forma +bj:
(a) (cosθ +jsinθ) 6
(c)
cosθ+jsinθ
cosφ+jsinφ
6 Ifz ∈ C,showthat
(a) z+z =2Re(z)
(c) zz = |z| 2
(b)
1
(cosθ +jsinθ) 3
(b) z −z =2jIm(z)
7 Show thate jωt +e −jωt = 2cosωt andfindan
expression fore jωt −e −jωt .
8 Express1 +e 2jωt in the forma +bj.
Review exercises 9 355
9 Sketch the region in the complex plane described by
|z+2j|<1.
10 Expresse (1/2−6j) in the forma +bj.
11 Solvethe equationz 4 +1 = j √ 3.
12 Expresss 2 +6s +13intheform (s −a)(s −b)
wherea,b∈ C.
13 Express2s 2 +8s +11intheform2(s −a)(s −b)
wherea,b∈ C.
Solutions
2 (a)
(d)
9
41 + 40
41 j (b) 2
13 − 3 13 j (c) 4
13
y
x 2 +y 2j
x
x 2 +y 2 +
3 | −j| = 1,arg(−j) = −π/2; |−3| = 3,arg(−3) = π;
|1+j|= √ 2,arg(1+j) = π/4,|cosθ +jsinθ| =1,
arg(cosθ +jsinθ)=θ
5 (a) cos 6θ +jsin6θ (b) cos 3θ −jsin 3θ
(c) cos(θ −φ)+jsin(θ −φ)
7 e jωt −e −jωt = 2jsin ωt
8 1+cos2ωt+jsin2ωt
10 1.5831 +0.4607j
11 2 1/4̸ π/6+nπ/2 n=0,1,2,3
12 [s − (−3 +2j)][s − (−3 −2j)]
[ ( √ )][ (
6
13 2 s − −2 +
2 j s− −2 −
√
6
2 j )]
10 Differentiation
Contents 10.1 Introduction 356
10.2 Graphicalapproachtodifferentiation 357
10.3 Limitsandcontinuity 358
10.4 Rateofchangeataspecificpoint 362
10.5 Rateofchangeatageneralpoint 364
10.6 Existenceofderivatives 370
10.7 Commonderivatives 372
10.8 Differentiationasalinearoperator 375
Reviewexercises10 385
10.1 INTRODUCTION
Differentiation is a mathematical technique for analysing the way in which functions
change. In particular, it determines how rapidly a function is changing at any specific
point.Asthefunctioninquestionmayrepresentthemagneticfieldofamotor,thevoltage
acrossacapacitor,thetemperatureofachemicalmix,etc.,itisoftenimportanttoknow
howquicklythesequantitieschange.Forexample,ifthevoltageonanelectricalsupply
network is falling rapidly because of a short circuit, then it is necessary to detect this
in order to switch out that part of the network where the fault has occurred. However,
the system should not take action for normal voltage fluctuations and so it is important
to distinguish different types and rates of change. Another example would be detecting
a sudden rise in the pressure of a fermentation vessel and taking appropriate action to
stabilize the pressure.
Differentiationwillbeintroducedinthischapter.Weshallderiveaformulawhichcan
be used to find the rate of change of a function. To avoid always having to resort to the
formulaengineersoftenuseatableofderivatives;suchatableisgiveninSection10.7.
The chapter closes with a discussion of an important property of differentiation -- that
of linearity.
10.2 Graphical approach to differentiation 357
y (t)
1 2 3 4 5 6 7 8 9 10 11 12 t
Figure10.1
Thefunctiony(t)hasdifferentratesof
change over different regionsoft.
10.2 GRAPHICALAPPROACHTODIFFERENTIATION
Differentiation is concerned with the rate at which a function is changing, rather than
the actual change itself. We can explore the rate of change of a function by examining
Figure 10.1. There are several regions to this curve corresponding to different intervals
oft. In the interval [0, 5] the function does not change at all. The rate of change of y
is zero. Fromt = 5 tot = 7 the function increases slightly. Thus the rate of change of
y ast increases is small. Sinceyis increasing, the rate of change ofyis positive. From
t = 7 tot = 8 there is a rapid rise in the value of the function. The rate of change ofy
islargeandpositive.Fromt = 8tot = 9thevalueofydecreasesveryrapidly.Therate
ofchangeofyislargeandnegative.Finallyfromt = 9tot = 12thefunctiondecreases
slightly. Thus the rate ofchangeofyissmalland negative.
The aim of differential calculus is to specify the rate of change of a function precisely.Itisnotsufficienttosay‘therateofchangeofafunctionislarge’.Werequirean
exactvalueorexpressionfortherateofchange.Beforebeingabletodothisweneedto
introduce two conceptsconcerningthe rate ofchangeofafunction.
10.2.1 Averagerateofchangeofafunctionacrossaninterval
ConsiderFigure10.2.Whent =t 1
,thefunctionhasavaluey(t 1
).ThisisdenotedbyA
on the curve. Whent =t 2
, the function has a value ofy(t 2
). This point is denoted by B
on the curve. The function changes by an amounty(t 2
) −y(t 1
) over the interval [t 1
,t 2
].
Thusthe average rate ofchangeofthe function over the intervalis
changeiny
change int = y(t 2 ) −y(t 1 )
t 2
−t 1
The straight line joining A and B is known as a chord. Graphically,y(t 2
) −y(t 1
) is the
verticaldistanceandt 2
−t 1
thehorizontaldistancebetweenAandB,sothatthegradient
of the chord AB isgiven by
BC
AC = y(t 2 ) −y(t 1 )
t 2
−t 1
Thegradientorslopeofalineisameasureofitssteepnessandlinesmayhavepositive,
negative orzero gradients as shown inFigure 10.3.
Thus the gradient of the chord AB corresponds to the average rate of change of the
function between Aand B. To summarize:
TheaveragerateofchangeofafunctionbetweentwopointsAandBisthegradient
ofthe chord AB.
358 Chapter 10 Differentiation
y (t)
y (t 2 )
B
y
Positive
gradient
y (t 1 )
A
t 1
(t 2 – t 1 )
Figure10.2
Average rate ofchange across an interval.
u
t 2
C
y (t 2 ) – y (t 1 )
t
Zero
gradient
Negative
gradient
Figure10.3
Linescan have different gradients.
t
y (t)
B 1
Chord AB 1
Chord AB 2
A
B 2
Tangent at A
10.2.2 Rateofchangeofafunctionatapoint
t
Figure10.4
Point B ismoved nearer to Ato improve
accuracy.
Consider again Figure 10.2. Suppose we require the rate of change of the function at
pointA.WecanusethegradientofthechordABasanapproximationtothisvalue.IfB
isclosetoAthentheapproximationisbetterthanifBisnotsoclosetoA.Thereforeby
moving B nearer to A it is possible to improve the accuracy of this approximation (see
Figure 10.4).
Suppose the chord AB is extended as a straight line on both sides of AB, and B is
moved closer and closer to A until both points eventually coincide. The straight line
becomesatangenttothecurveatA.Thisisthestraightlinethatjusttouchesthecurve
atA.However,therateofchangeofthistangent,thatisitsgradient,stillcorrespondsto
the rate of change of the function, but now it is the rate of change of the function at the
point A.To summarize:
The rate of change of a function at a point A on the curve is the gradient of the
tangenttothe curve atpointA.
We have still to address the question of how the gradients of chords and tangents are
found.This requires a knowledge oflimits whichisthe topic ofthe next section.
10.3 LIMITSANDCONTINUITY
The concept of a limit is crucial to the development of differentiation. We write
t →c to denote thatt approaches, or tends to, the value ofc. Note carefully that this is
distinct from statingt = c. Ast tends tocwe consider the value to which the function
approachesand call thisvalue thelimit ofthe functionast →c.
10.3 Limits and continuity 359
f (t)
5
y = 1 – x
y
4
3
y = x +1
2
–3 2 t
Figure10.5
Thecurvef(t) =t 2 +2t −3.
–1
Figure10.6
Asx → 0,y → 1,even though
y(0) =3.
1
0
3 x
Example10.1 Ift → 2,whatvalue does
approach?
f(t)=t 2 +2t−3
Solution Figure 10.5 shows a graph of f (t). Clearly, whethert = 2 is approached from the l.h.s.
orther.h.s.thefunctiontendsto5.Thatis,ift → 2,then f (t) → 5.Wenotethatthisis
thevalueof f (2).Informallywearesayingthatast getsnearerandnearertothevalue2,
so f (t)gets nearerand nearer to5.This isusuallywritten as
lim(t 2 +2t−3)=5
t→2
where ‘lim’ is an abbreviation of limit. In this example, the limit of f (t) ast → 2 is
simply f (2), butthisisnottrueforall functions.
Example10.2 Figure 10.6 illustratesy(x) defined by
⎧
⎨1−x x<0
y(x) = 3 x=0
⎩
x+1 x>0
Evaluate:
(a) lim x→3
y (b) lim x→−1
y (c) lim x→0
y
Solution We notethat thisfunctionispiecewise continuous.Ithas a discontinuityatx = 0.
(a) Weseekthelimitofyasxapproaches3.Asxapproaches3,wewillbeonthatpart
of the function defined byx > 0, that isy(x) =x +1. Asx → 3, theny → 3 +1,
that isy → 4.So
lim y = 4
x→3
(b) Whenxapproaches −1, we will be on that part of the function defined byx < 0,
thatisy(x) = 1 −x.Soasx → −1,theny → 1 − (−1),thatisy → 2.Hence
lim y = 2
x→−1
(c) Asxapproaches0whatvaluedoesyapproach?Notethatwearenotevaluatingy(0)
which actually has a value of 3. We simply ask the question ‘What value isynear
360 Chapter 10 Differentiation
whenxisnear,butdistinctfrom,0?’FromFigure10.6weseeyisnearto1,thatis
lim y = 1
x→0
Example10.3 The functiony(x) isdefined by
⎧
⎨0 x0
y(x) = x 0<x2
⎩
x−2 x>2
(a) Sketch the function.
(b) State the limitofyasxapproaches (i)3,(ii)2,(iii)0.
Solution (a) ThefunctionisshowninFigure10.7.Notethatthefigurehasthreeparts;eachpart
corresponds toapartinthe algebraic definition.
(b) (i) Asx → 3,the relevant partof the function isy(x) =x −2.Hence
lim y = 1
x→3
(ii) Supposex < 2 and gradually increases, approaching the value 2. Then, from
the graph, we see that y approaches 2. Now, suppose x > 2 and gradually
decreases,tendingto2.Inthiscaseyapproaches0.Hence,wecannotfindthe
limitofyasxtends to2.The lim x→2
ydoes notexist.
(iii) Asxtends to 0,ytends to 0. This is true whetherxapproaches 0 from below,
thatisfrom the left,orfrom above, thatisfrom the right.So,
lim y = 0
x→0
y
y = x
y = x – 2
y = 0
2
x
Figure10.7
Thefunctionyhas different limitsasx → 2
from the left and the right.
Itisappropriateatthisstagetointroducetheconceptofleft-handandright-handlimits.
ReferringtoExample10.3,weseethatasxapproaches2fromtheleft,thatisfrombelow,
then y approaches 2. We say that the left-hand limit of y as x tends to 2 is 2. This is
written as
lim
x→2 −y = 2
Similarly, the right-hand limit ofyis obtained by lettingxtend to 2 from above. In this
case,yapproaches0.This iswritten as
lim
x→2 +y = 0
Consider a point at which the left-hand and right-hand limits are equal. At such a point
wesay ‘thelimitexists atthatpoint’.
10.3 Limits and continuity 361
Thelimitofafunction, atapointx =a, existsonly iftheleft-handand right-hand
limits areequal there.
10.3.1 Continuousanddiscontinuousfunctions
Afunction f iscontinuousatthe point wherex =a, if
lim f =f(a)
x→a
that is, the limit value matches the function value at a point of continuity. A function
which is not continuous is discontinuous. In Example 10.3, the function is continuous
atx = 0 because
lim y=0=f(0)
x→0
butdiscontinuousatx = 2becauselim x→2
ydoesnotexist.InExample10.2,thefunction
isdiscontinuousatx = 0becauselim x→0
y = 1buty(0) = 3.Theconceptofcontinuity
corresponds to our natural understanding of a break in the graph of the function, as
discussed inChapter 2.
Afunction f iscontinuous atapointx =aifand only if
lim x→a
f = f(a)
thatis, the limitof f exists atx =aand isequal to f (a).
EXERCISES10.3
1 Thefunction, f (t),is defined by
⎧
⎨1 0t2
f(t)= 2 2<t3
⎩
3 t>3
Sketchagraphof f (t)andstatethefollowinglimitsif
they exist:
(a) lim t→1.5 f
(b) lim t→2 + f
(c) lim t→3 f
(d) lim t→0 + f
(e) lim t→3 − f
2 The functiong(t)is defined by
⎧
0 t<0
⎪⎨
t
g(t) =
2 0t3
2t+3 3<t 4
⎪⎩
12 t>4
(a) Sketchg.
(b) State any pointsofdiscontinuity.
(c) Find, ifthey exist,
(i) lim t→3 g
(ii) lim t→4 g
(iii) lim t→4 −g
Solutions
1 (a) 1 (b) 2 (c) notdefined
(d) 1 (e) 2
2 (b)t=4
(c) (i) 9 (ii) not defined (iii) 11
362 Chapter 10 Differentiation
10.4 RATEOFCHANGEATASPECIFICPOINT
WesawinSection10.2thattherateofchangeofafunctionatapointisthegradientof
thetangenttothecurveatthatpoint.Also,wecanthinkofatangentatAasthelimitof
an extended chord AB as B → A. We now put these two ideas together to find the rate
of change of a function atapoint.
Example10.4 Given y = f (x) = 3x 2 + 2, obtain estimates of the rate of change of y at x = 3 by
considering the intervals
(a) [3,4] (b) [3,3.1] (c) [3,3.01]
Solution (a) Consider Figure 10.8.
y(3) = 3(3) 2 +2 =29
y(4) = 3(4) 2 +2 =50
Let Abe the point (3, 29) on the curve. Let Bbe the point (4, 50).Then
average rate of change over the interval [3,4] =
=
change iny
change inx
y(4) −y(3)
4 −3
= 50−29
4 −3 = 21
ThisisthegradientofthechordABandisanestimateofthegradientofthetangent
atA.Thatis, the rate of change atAisapproximately 21.
(b) y(3.1) = 30.83 and so,
average rate of change over the interval [3,3.1] =
This isamore accurate estimateof the rate of change atA.
(c) y(3.01) = 29.1803 and so,
30.83 −29
3.1−3
29.1803 −29
average rate of change over the interval [3,3.01] =
3.01 −3
= 18.03
= 18.3
y
50
B
29
A
3 4
x
Figure10.8
Thefunction:y = 3x 2 +2.
10.4 Rate of change at a specific point 363
ThisisanevenbetterestimateoftherateofchangeatA.HenceatA,ifxincreases
by 1 unit then y increases by approximately 18 units. This corresponds to a steep
upward slope atA.
Example10.4illustratestheapproachofestimatingtherateofchangeatapointbyusing
the‘shrinkinginterval’method.Bytakingsmallerandsmallerintervals,betterandbetter
estimates of the rate of change of the function atx = 3 can be obtained. However, we
eventuallywanttheintervalto‘shrink’tothepointx = 3.Weintroduceasmallchange
orincrementofxdenotedby δxandconsidertheinterval[3,3+δx].Byletting δxtend
tozero,the interval [3,3 + δx] effectively shrinks tothe pointx = 3.
Example10.5 Find the rate of change ofy = 3x 2 +2 atx = 3 by considering the interval [3,3 + δx]
and letting δx tend to0.
Solution Whenx = 3,y(3) = 29. Whenx = 3 + δx then
So,
y(3 + δx) = 3(3 + δx) 2 +2
=3(9+6δx+(δx) 2 )+2
= 3(δx) 2 +18δx +29
average rate of change ofyacross [3,3 + δx] =
change iny
change inx
= (3(δx)2 +18δx +29) −29
δx
= 3(δx)2 +18δx
δx
=
δx(3δx +18)
δx
=3δx+18
We now let δx tend to0,so thatthe intervalshrinks toapoint:
rate of change ofywhenxis3 = lim
δx→0
(3δx +18) =18
We have found the rate of change ofyat a particular value ofx, rather than across an
interval.We usually write
( ) y(3 + δx) −y(3)
rate of change ofywhenxis3 = lim
δx→0 δx
( 3(δx) 2 )
+18δx
= lim
= lim (3δx +18)
δx→0 δx
δx→0
= 18
364 Chapter 10 Differentiation
EXERCISES10.4
1 Find the rate ofchange ofy = 3x 2 +2 at
(a) x = 4 by considering the interval[4,4 + δx]
(b) x = −2byconsideringtheinterval[−2,−2 + δx]
(c) x = 1 byconsidering the interval[1 − δx,1 + δx]
2 Find the rate ofchange ofy = 1/x atx = 2.
3 To determine the rate ofchange ofy =x 2 −x
atx = 1 the interval [1,1 + δx] could be used.
Equally the intervals [1 − δx,1]or
[1 − δx,1 + δx] could be used.Show that the
same answer resultsregardless ofwhich interval is
used.
4 Findtherateofchangeofy(x) = 2 −x 2 at
(a) x = 3,byconsidering the interval[3,3 + δx]
(b) x = −5,byconsidering the interval
[−5,−5 + δx]
(c) x = 1,byconsidering the interval
[1 − δx,1 + δx].
5 Findthe rate ofchange ofy(x) = x
x +3 atx=3by
considering the interval [3,3 + δx].
Solutions
1 (a) 24 (b) −12 (c) 6
4 (a) −6 (b) 10 (c) −2
2 −0.25
3 1
5
1
12
10.5 RATEOFCHANGEATAGENERALPOINT
Example 10.5 shows that the rate of change of a function at a particular point can be
found. We will now develop a general terminology for the method. Suppose we have a
function ofx,y(x). We wish to find the rate of change ofyat a general value ofx. We
beginbyfindingtheaveragerateofchangeofy(x)acrossanintervalandthenallowthe
intervaltoshrinktoasinglepoint.Considertheinterval[x,x + δx].Atthebeginningof
theintervalyhasavalueofy(x).Attheendoftheintervalyhasavalueofy(x + δx)so
that the changeinyisy(x + δx) −y(x), whichwedenoteby δy(see Figure 10.9).So,
average rate ofchangeofy = changeiny
change inx
=
y(x + δx) −y(x)
δx
= δy
δx
Now let δx tend to0,so thatthe interval shrinks toapoint.Then
( ) ( )
y(x + δx) −y(x) δy
rate of change ofy = lim
= lim
δx→0 δx
δx→0 δx
To see how weproceed toevaluate this limitconsider Example 10.6.
10.5 Rate of change at a general point 365
y
dy = y(x + dx) – y(x)
y(x + dx)
y(x)
x
x + dx
dx
x
Figure10.9
Therate ofchange ofyat apoint is foundby letting δx → 0.
Example10.6 Findtherateofchangeofy(x) = 2x 2 +3x.Calculatetherateofchangeofywhenx = 2
and whenx = −3.
Solution Giveny(x) = 2x 2 +3x
then
Hence
So,
y(x + δx) = 2(x + δx) 2 +3(x + δx)
= 2x 2 +4xδx +2(δx) 2 +3x +3δx
y(x + δx) −y(x) =2(δx) 2 +4xδx +3δx
( ) y(x + δx) −y(x)
rate ofchangeofy = lim
δx→0 δx
= lim
δx→0
( 2(δx) 2 +4xδx +3δx
δx
)
= lim
δx→0
(2δx+4x+3)=4x+3
Whenx = 2,therateofchangeofyis4(2) +3 = 11.Whenx = −3,therateofchange
ofyis4(−3) +3 = −9.Apositiverateofchangeshowsthatthefunctionisincreasing
at that particular point. A negative rate of change shows that the function is decreasing
atthatparticular point.
( ) δy
Therateofchangeofyiscalledthederivativeofy.Wedenote lim by dy
δx→0 δx dx .This
ispronounced ‘deeyby deex’.
rate ofchangeofy(x) = dy ( ) y(x + δx) −y(x)
dx = lim
δx→0 δx
Note thatthe notation dy δy
means lim
dx δx→0 δx .
366 Chapter 10 Differentiation
then
Usingthe previous example, if
y(x) =2x 2 +3x
dy
dx =4x+3
dy
isoften abbreviated toy′
dx
y ′ is pronounced ‘y dash’ or ‘y prime’. To stress thatyis the dependent variable andx
the independent variable we often talk of ‘the rate of change ofywith respect tox’, or,
more compactly, ‘the rate of change of y w.r.t. x’. The process of finding y ′ from y is
calleddifferentiation.Thisshrinkingintervalmethodoffindingthederivativeiscalled
differentiation from first principles. We know that the derivative dy is the gradient
dx
of the tangent to the function at a point. It is also the rate of change of the function. In
manyexamples,theindependentvariableist andweneedtofindtherateofchangeofy
withrespecttot;thatis,find dy
dt .Thisisalsooftenwrittenasy′ althoughẏ,pronounced
‘ydot’,isalsocommon.Thereadershouldbeawareofbothnotations.Finally,y ′ isused
to denote the derivative ofywhatever the independent variable may be. So dy
dz , dy
dr and
dy
dw could all be represented byy′ .
Example10.7 Findthe gradientofthe tangent toy =x 2 atA(1,1),B(−1,1) andC(2,4).
Solution We havey =x 2 and so
y(x + δx) = (x + δx) 2
and hence
Then
=x 2 +2xδx + (δx) 2
y(x + δx) −y(x) =x 2 +2xδx + (δx) 2 −x 2
dy
dx
= 2xδx + (δx) 2
= gradient of a tangent tocurve
( ) y(x + δx) −y(x)
= lim
δx→0 δx
)
= lim
δx→0
( (x + δx) 2 −x 2
δx
= lim
δx→0
(2x + δx) =2x
( ) 2xδx + (δx)
2
= lim
δx→0 δx
10.5 Rate of change at a general point 367
At (1, 1), dy
dy
= 2 = gradient of tangent at A. At (−1,1),
dx dx
tangent atB. At (2, 4), dy = 4 = gradient of tangent atC.
dx
= −2 = gradient of
Suppose we wish to evaluate the derivative, dy
dx , at a specific value ofx, sayx 0
. This is
denoted by
dy
dx (x =x 0 )
dy
or more compactly by
dx (x 0 ) ory′ (x 0
)
An alternative notation is
dy
∣
dx
∣
x=x0
or
dy
∣
dx
∣
x0
So,for Example 10.7 wecould have written
dy
dx (1)=2
dy
dx (x=−1)=−2
dy
dx∣ =4 y ′ (2)=4
x=2
Example10.8 Refer to Figure 10.10. By considering the gradient of the tangent at the points A, B, C,
Dand Estatewhether dy ispositive, negative orzero atthese points.
dx
Solution AtAandCthetangentisparalleltothexaxisandso dy
dx iszero.AtBandEthetangent
has a positive gradient and so dy is positive. At D the tangent has a negative gradient
dx
and thus dy
dx isnegative.
y
E
B
A
D
C
x
Figure10.10
Graph forExample 10.8.
AswesawinChapter3,functionsareusedtorepresentphysicallyimportantquantities
suchasvoltageandcurrent.Whenthecurrentthroughcertaindeviceschanges,thiscan
give rise to voltages, the magnitudes of which are proportional to the rate of change of
the current. Consequently differentiation is needed to model these effects as illustrated
inEngineeringapplications10.1 and 10.2
368 Chapter 10 Differentiation
Engineeringapplication10.1
Voltageacrossaninductor
The voltage, v, across an inductor with inductance, L, is related to the current, i,
through the inductor by
v =L di
dt
Figure10.11showstherelationshipbetweenmagneticfluxlinespassingthroughthe
coils of aninductor and the current flowing through the inductor.
i
Figure10.11
Schematic diagram ofan
inductorshowing the
relationshipbetween magnetic
fluxlinesand current.
This relationship is a quantification of Faraday’s law which states that the voltage
induced in a coil is proportional to the rate of change of magnetic flux through it. If
the current in a coil is changing then this corresponds to a change in the magnetic
flux through the coil. Note that if di is large then v is large. This is why care has
dt
to be taken when abruptly switching off the current to an inductor because it causes
high voltages tobe generated.
Engineeringapplication10.2
Currentthroughacapacitor
The current,i, through a capacitor with capacitance,C, is related to the voltage, v,
acrossthe capacitorby
i =C dv
dt
Displacement current i D
Conduction current i
i
Figure10.12
Schematic diagram ofacapacitor
showingthe conduction current and
the displacement current.
It may appear confusing to talk of a current flow through a capacitor as no actual
charge flows through the capacitor apart from that caused by any leakage current.
Insteadthereisabuild-upofchargeontheplatesofthecapacitor.Thisinturngives
risetoavoltageacrossthecapacitor.Ifthecurrentflowislargethentherateofchange
ofthisvoltagewillbelarge.Thecurrentflowthroughthecapacitorwiresistermeda
10.5 Rate of change at a general point 369
conductioncurrentwhilethatbetweenthecapacitorplatesiscalledadisplacement
current (see Figure 10.12).
Thedisplacementcurrentcanbethoughtofasavirtualcurrentthatflowsbetween
the plates due to the build-up of positive charge on one plate and negative charge
on the other plate. It doesn’t correspond to a real current flowing between the plates
as they are separated by an insulating material.
We can relate the smallchanges δx and δy tothe derivative dy
dx .
If δx isvery smallyet still finitewe can statethat
dy
dx ≈ δy
(10.1)
δx
Thisresultallowsanimportantapproximationtobemade.FromEquation(10.1)wesee
thatifasmallchange, δx,ismadetotheindependentvariable,thecorrespondingchange
inthe dependent variable isgiven by the following formula:
δy ≈ dy
dx δx
Example10.9 Ify =x 2 estimate the changeinycausedbychangingxfrom3to3.1.
Solution Ify =x 2 then dy = 2x. The approximate change inthe dependent variable isgiven by
dx
δy ≈ dy δx =2xδx
dx
Takingx = 3 and δx = 0.1 we have
δy ≈ (2)(3)(0.1) = 0.6
We conclude that at the point wherex = 3 a change inxto 3.1 causes an approximate
change of 0.6 inthe value ofy.
EXERCISES10.5
1 Calculatethe gradientofthe functionsat the specified
points.
(a) y = 2x 2 at(1,2)
(b) y=2x−x 2 at(0,0)
(c) y=1+x+x 2 at(2,7)
(d) y=2x 2 +1at(2,9)
2 Afunction,y,is such that dy is constant.
dx
What can you say about the function,y?
3 Forwhichgraphs in Figure10.13isthe derivative
always (a) positive or(b)negative?
4 Find the derivative ofy(x)whereyis
(a) x 2 (b) −x 2 +2x
5 Differentiatey = 2x 2 +9; thatis,find dy
dx .
What is the rate ofchange ofywhenx = 3, −2, 1,0?
6 Find the rate ofchange ofy = 4t −t 2 .Whatis the
value of dy whent = 2?
dt
7 Ify=x 3 −3x 2 +xthen
dy
dx =3x2 −6x+1
Estimate the change inyasxchanges from
(a) 2 to 2.05
(b) 0 to 0.025
(c) −1 to −1.05
370 Chapter 10 Differentiation
y
y
y
y
x
x
x
(i) (ii) (iii) (iv)
Figure10.13
x
Solutions
1 (a)4 (b)2 (c)5 (d)8
2 yislinearinx,thatisy =ax +b
3 (i) always negative (ii) always positive
(iii) always positive (iv) always negative
4 (a) 2x (b) −2x +2
5 4x,12,−8,4,0
6 4−2t,0
7 (a) 0.05 (b) 0.025 (c) −0.5
10.6 EXISTENCEOFDERIVATIVES
So far we have seen that the derivative, dy , of a function,y(x), may be viewed either
dx
algebraically orgeometrically.
( )
dy y(x + δx) −y(x)
dx = lim
δx→0 δx
dy
= rate of change ofy
dx
= gradient of the graph ofy
We now discuss briefly the existence of dy . For some functions the derivative does not
dx
exist at certain points and we need to be able to recognize such points. Consider the
graphs shown in Figure 10.14. Figure 10.14(a) shows a function with a discontinuity at
y
y
(a)
a
x
Figure10.14
(a) The graphhas a discontinuity atx =a. (b)The graphhas a cuspat
x=a.
(b)
a
x
10.6 Existence of derivatives 371
x = a. The function shown in Figure 10.14(b) is continuous but has a cusp or corner
atx = a. In both cases it is impossible to draw a tangent atx = a, and so dy does not
dx
existatx =a.Itisimpossibletodrawatangent toacurve atapointwhere thecurve is
not smooth. Note from Figure 10.14(b) that continuity is not sufficient to guarantee the
existence of a derivative.
Example10.10 Sketch the following functions. State the values oft for which the derivative does not
exist.
(a) y=|t| (b) y=tant (c) y=1/t
Solution (a) Thegraphofy = |t|isshowninFigure10.15(a).Acornerexistsatt = 0andsothe
derivative doesnotexist here.
(b) A graph ofy = tant is shown in Figure 10.15(b). There is a discontinuity in tant
when t = ... −3π/2,−π/2,π/2,3π/2,.... No derivative exists at these
points.
(c) Figure 10.15(c) shows a graph of y = 1/t. The function has one discontinuity at
t = 0,and so the derivative doesnotexist here.
y
y = tan t
y
y
y = |t|
–
p
–
p
–
2
2
t
y = 1_ t
t
(a)
t
(b)
Figure10.15
(a) There is acorner att = 0; (b)tant has discontinuities; (c)y = 1/t has a discontinuity att = 0.
(c)
EXERCISES10.6
1 Sketchthe functionsanddetermineany pointswhere
aderivative does notexist.
(a) y= 1
t −1
(b) y=|sint|
(c) y=e t
(d) y = |1/t|
{ 1 t0
(e) The unitstepfunctionu(t) =
0 t<0
{ ct t0
(f) The ramp function f (t) =
0 t<0
372 Chapter 10 Differentiation
Solutions
1 See FigureS.22.
(a) No derivative exists for t = 1
y
(b) No derivative exists for t = nπ
|sin t|
(c) Derivative exists for all values of t
e t
1
t
–2π–π
0 π 2π3π
t
0
t
(d) No derivative exists for t = 0
1
–
t
FigureS.22
0 t
(e) No derivative exists for t = 0
u(t)
1
t
(f) No derivative exists for t = 0, although
the function is continuous here
f(t)
t
10.7 COMMONDERIVATIVES
Itistimeconsumingtofindthederivative ofy(x) usingthe‘shrinkinginterval’method
(often referred to as differentiation from first principles). Consequently the derivatives
of commonly used functions are listed for reference in Table 10.1. It will be helpful
to memorize the most common derivatives. Note that a, b and n are constants. In
the trigonometric functions, the quantity ax +b, being an angle, must be measured in
radians.
A shorter table of the more common derivatives is given on the inside back cover of
this bookforeasyreference.
Example10.11 UseTable 10.1 tofindy ′ when
(a) y = e −7x
(b)y=x 5
(c) y = tan(3x −2)
(d) y = sin(ωx + φ)
(e)y= √ 1 x
(f)y= 1 x 5
(g) y = cosh −1 5x
10.7 Common derivatives 373
Table10.1
Derivatives ofcommonly usedfunctions.
Function,y(x) Derivative,y ′
constant 0
x n
e x
e −x
e ax
lnx
sinx
cosx
sin(ax +b)
cos(ax +b)
tan(ax +b)
cosec(ax +b)
sec(ax +b)
cot(ax +b)
sin −1 (ax +b)
nx n−1
e x
−e −x
ae ax
1
x
cosx
−sinx
acos(ax +b)
−asin(ax +b)
asec 2 (ax +b)
−acosec(ax +b)cot(ax +b)
asec(ax +b)tan(ax +b)
−acosec 2 (ax +b)
a
√
1−(ax+b) 2
Function,y(x) Derivative,y ′
cos −1 (ax +b)
tan −1 (ax +b)
sinh(ax +b)
cosh(ax +b)
tanh(ax +b)
cosech(ax +b)
sech(ax +b)
coth(ax +b)
sinh −1 (ax +b)
cosh −1 (ax +b)
tanh −1 (ax +b)
−a
√
1−(ax+b) 2
a
1+(ax+b) 2
acosh(ax +b)
asinh(ax +b)
asech 2 (ax +b)
−acosech(ax +b)×
coth(ax +b)
−asech(ax +b)×
tanh(ax +b)
−acosech 2 (ax +b)
a
√
(ax+b) 2 +1
a
√
(ax+b) 2 −1
a
1−(ax+b) 2
Solution (a) FromTable 10.1,wefind thatif
y = e ax then y ′ =ae ax
Inthis case,a = −7 and so if
y = e −7x then y ′ = −7e −7x
(b) FromTable 10.1,wefind thatif
y =x n then y ′ =nx n−1
Inthis case,n = 5 and so if
y =x 5 then y ′ = 5x 4
(c) Ify = tan(ax+b)theny ′ =asec 2 (ax+b).Inthiscase,a = 3andb = −2.Henceif
y =tan(3x −2) then y ′ =3sec 2 (3x −2)
(d) Ify = sin(ax +b)theny ′ =acos(ax +b).Herea = ωandb = φ,andsoif
(e) Note that
y = sin(ωx + φ) then y ′ = ωcos(ωx + φ)
1
√ x
= x −1/2 . From Table 10.1 we find that ify = x n theny ′ = nx n−1 . In
this case,n = −1/2 and so if
y = 1 √ x
then y ′ = − 1 2 x−3/2
374 Chapter 10 Differentiation
(f) Note that 1 x 5 =x−5 . UsingTable 10.1, we find thatify =x −5 theny ′ = −5x −6 .
(g) FromTable 10.1, ify = cosh −1 (ax +b) then
y ′ =
a
√
(ax+b)2 −1
Inthiscase,a = 5 andb= 0.Hence, if
y = cosh −1 5x then y ′ =
5
√
25x2 −1
Example10.12 Differentiatey(t) = e t .
Solution We note that the independent variable ist. However, Table 10.1 can still be used. From
Table 10.1, wefind
dy
dt =et =y
We note that the derivative of e t is again e t . This is the only function which reproduces
itself upon differentiation.
EXERCISES10.7
1 UseTable 10.1 to findy ′ given:
(a) y=t 2 (b) y=t 9
(c) y=t −3 (d) y=t
(e) y= 1 (f) y= 1
t t 2
(g) y = e 3t (h) y = e −3t
(i) y= 1
e 5t (j) y =t 1/2
(k) y =sin(2t +3) (l) y =cos(4 −t)
( )
t
(m) y = tan
2 +1 (n) y = cosec(3t +7)
(o) y =cot(1−t)
(q) y =sin −1 (t +π)
(p) y =sec(2t −π)
(r) y = π
(s) y = tan −1 (−2t −1) (t) y = cos −1 (4t −3)
(u) y = tanh(6t)
( )
(v) y = cosh(2t +5)
t +3
(w) y = sinh
2
(x) y = sech(−t)
( )
2t
(y) y = coth
3 − 1 2
(z) y = cosh −1 (t +3)
2 Find dy
dx when
(a) y= 1 √ x
(b) y = e 2x/3
(c) y = e −x/2 ( )
2x−1
(e) y = cosec
3
(f) y = tan −1 (πx +3)
(g) y = tanh(2x +1)
(h) y = sinh −1 (−3x)
(i) y = cot(ωx + π)
1
(j) y=
sin(5x +3)
(l) y= 1
cos3x(
)
x −1
(n) y = cosech
2
(o) y = tanh −1 (
2x+3
7
)
(d) y = lnx
ω constant
(k) y = cos3x
(m) y = tan(2x + π)
10.8 Differentiation as a linear operator 375
Solutions
1 (a) 2t (b) 9t 8
(c) −3t −4 (d) 1
(e) −t −2 (f) −2t −3
(g) 3e 3t (h) −3e −3t
(i) −5e −5t (j) 0.5t −1/2
(k) 2cos(2t +3) (l) sin(4 −t)
(m) 0.5sec 2 (t/2 +1)
(n) −3cosec(3t +7)cot(3t +7)
(o) cosec 2 (1 −t)
(p) 2sec(2t − π)tan(2t − π)
1
(q) √ (r) 0
1−(t+π) 2
2
4
(s) −
1 + (−2t −1) 2 (t) −√ 1−(4t−3) 2
(u) 6sech 2 6t
( )
(v) 2sinh(2t +5)
t +3
(w) 0.5cosh
2
(x) sech(−t)tanh(−t)
(y) − 2 3 cosech2 (
2t
3 − 1 2
)
(z)
1
√
(t+3) 2 −1
2 (a) −0.5x −3/2 2
(b)
3 e2x/3
(c) −0.5e −x/2 (d) 1/x
(e) − 2 3 cosec (
2x−1
3
)
cot
(
2x−1
3
π
(f)
1+(πx+3) 2 (g) 2sech 2 (2x +1)
3
(h) −√ (i) −ωcosec 2 (ωx + π)
9x 2 +1
(j) −5cosec(5x +3)cot(5x +3)
(k) −3sin3x
(l) 3sec3xtan3x (m) 2sec 2 (2x + π)
( ) ( )
x −1 x −1
(n) −0.5cosech coth
2 2
(o)
2
[ ( ) 2 ]
2x+3
7 1 −
7
)
10.8 DIFFERENTIATIONASALINEAROPERATOR
Inmathematicallanguagedifferentiationisalinearoperator.Thismeansthatifwewish
to differentiate the sum of two functions we can differentiate each function separately
and thensimplyadd the two derivatives, thatis
derivative of (f +g) = derivative of f +derivative ofg
This isexpressed mathematically as
d
dx (f+g)=df dx + dg
dx
We can regard d as the operation of differentiation being applied to the expression
dx
whichfollows it.Thepropertiesofalinear operator alsomake thehandling ofconstant
factorseasy.Todifferentiatekf,wherekisaconstant,wetakektimesthederivativeof
f,thatis
derivative of (kf) =k ×derivative of f
Mathematically, wewould state:
d
dx (kf)=kdf dx
376 Chapter 10 Differentiation
Table 10.1 together with these two linearity properties allow us to differentiate some
quite complicated functions.
Example10.13 Differentiate
(a) 3x 2 (b) 9x (c) 7 (d) 3x 2 +9x +7
Solution (a) Lety = 3x 2 , then
dy
dx = d dx (3x2 )
= 3 d dx (x2 ) using linearity
= 3(2x) fromthe table
= 6x
(b) Lety = 9x, then
dy
dx = d dx (9x)
= 9 d (x) using linearity
dx
= 9
(c) Lety = 7,theny ′ = 0.
(d) Lety=3x 2 +9x+7
dy
dx = d dx (3x2 +9x +7)
dy
dx = 3 d dx (x2 )+9 d dx (x) + d (7) usinglinearity
dx
=6x+9
Engineeringapplication10.3
Fluidflowintoatank
If fluid is being poured into a tank at a rate of q m 3 s −1 , then this will result in an
increase in volume,V, of fluid in the tank. The arrangement is illustrated in Figure
10.16. Therate ofincrease involume, dV m 3 s −1 , isgiven by
dt
dV
=q
dt
This relationship follows from the principle of conservation of mass. If q is large,
then dV is large, which corresponds to the fluid volume in the tank increasing at a
dt
10.8 Differentiation as a linear operator 377
fast rate. Consequently, the height of the fluid,h, also increases at a fast rate. If the
cross-sectional area ofthe tank,A, isconstant, thenV =Ah. Therefore,
dV
= d (Ah) =Adh
dt dt dt
because differentiation isalinear operator. So,
A dh
dt
=q
q
h
V
Figure10.16
A closedtankcontaining avolume offluidV.
Example10.14 UseTable 10.1 and the linearityproperties ofdifferentiation tofindy ′ where
(a) y=3e 2x
(b) y = 1/x
(c) y=3sin4x
(d) y = sin2x −cos5x
(e) y=3lnx
(f) y=ln2x
(g) y=3x 2 +7x−5
Solution (a) Ify = 3e 2x , then
dy
dx = d dx (3e2x ) = 3 d dx (e2x ) usinglinearity
= 3(2e 2x ) usingTable 10.1
=6e 2x
(b) Ify =x −1 ,then
y ′ = −1x −2 from Table 10.1
= − 1 x 2
(c) Ify = 3sin4x, then
dy
dx = d dx (3sin4x) =3 d dx (sin4x) usinglinearity
= 3(4cos4x) usingTable 10.1
= 12cos4x
378 Chapter 10 Differentiation
(d) The linearityproperties allowus todifferentiate each termindividually. If
y = sin2x −cos5x
then
dy
dx = d dx (sin2x) − d dx (cos5x)
=2cos2x+5sin5x
(e) Ify = 3lnx,then
dy
dx = d dx (3lnx)=3d dx (lnx)
= 3 x
using linearity
using Table 10.1
(f) Ify = ln2x, then
y = ln2 +lnx
usinglaws of logarithms
and so
dy
dx = 0 + 1 x = 1 x
(g) Each termisdifferentiated:
y ′ =6x+7−0=6x+7
since ln2 isconstant
Engineeringapplication10.4
Dynamicresistanceofasemiconductordiode
RecallfromEngineeringapplication2.9thatasemiconductordiodecanbemodelled
by the equation
I =I S
(e qV
nkT −1)
where V is the applied voltage, I is the diode current, I S
is the reverse saturation
current, k is the Boltzmann constant, q is the charge on the electron, and T is the
temperature of the diode junction in Kelvin. The parameter n is the ideality factor
whichisusuallyconsideredtobeaconstantforagiventypeofsemiconductorjunction,
dependent primarily on the design and manufacture ofthe diode.
Examiningthediodeelectricalcharacteristics,showninFigure2.31,wenotethat
there are two main electrical regimes. The first occurs when the applied voltage,V,
isnegativeandthecurrent,I,islimitedtoanegativesaturationcurrent,I s
,apartfrom
a small region around the origin. The second occurs when the applied voltage,V, is
positive and the current,I, takes on the shape of an exponential curve, apart from a
smallregion around the origin.
Wewillconcentrateonthesecondcaseandfurthermorewewillassumethate qV
nkT
is very much greater than 1. This is written concisely as e qV
nkT >> 1. This is a good
approximationapartfromthesmallregionclosetotheorigin.Theoriginalexpression
forIcan then be writtenas
I≈I S
e qV
nkT
where the symbol ≈means ‘is approximately equal to’.
10.8 Differentiation as a linear operator 379
For a fixed temperature the only variables in this equation are I andV. We can
differentiate thisequation togive
dI
dV ≈I q
S
nkT e qV
nkT
Back substituting the approximate expressionI ≈I S
e qV
nkT into this equation yields
dI
dV ≈I q
S
nkT e qV q
nkT ≈
nkT I
Inverting this equation gives
dV
dI ≈ nkT
qI
Itisconstructiveatthisstagetocomparethisresultwiththestaticequivalent,V =IR,
from which
V
I =R
The quantity dV
dI can be thought of as a dynamic value of the diode resistance,r d ,
which varies depending on the value of the current in the diode. It is valid for a
positive voltageV that is sufficiently large to ensure the diode is in the exponential
portion of the electrical characteristic.
Engineeringapplication10.5
Operatingpointforanidealsemiconductordiode
In Engineering application 2.18 we examined the mathematical model of a semiconductordiodeandthenconsideredthespecialisedcaseofanidealdiode.Atroom
temperaturetheequationdescribingthebehaviourofitselectricalcharacteristicswas
I =I s
(e 40V −1)
where I is the diode current,V is the applied voltage and I s
is the constant reverse
saturation current. Sometimes when a diode is used in a circuit it may be biased
to operate in a certain region of its I--V characteristic. This means that its use is
restricted to a certain voltage range. The point around which it operates is known as
its operating point. This isillustratedinFigure 10.17.
Deviationsfromthisoperatingpointmaybesmallincertaincases.Ifso,theyare
known assmall-signalvariations andarecausedbysmalla.c.voltagesbeingsuperimposedonthemaind.c.biasvoltage.Incalculatinghowthediodewillreacttosuch
small-signalvoltages,theslopeofthediodecharacteristicaroundtheoperatingpoint
is more relevant than the overall ratio of current to voltage. Provided the deviations
from the operating point are not large, the tangent to theI--V curve at the operating
point provides an adequate model for how the diode will behave. The slope of the
curve can beobtained by differentiating the diodeequation.If
then
I =I s
(e 40V −1) =I s
e 40V −I s
dI
dV = 40I s e40V
➔
380 Chapter 10 Differentiation
I
Bias current
Operating
point
dV
dI
Bias
voltage
Tangent
approximation
V
Figure10.17
Diode characteristic showing
operating point and tangent
approximation.
sinceI s
isconstant.
It is usual to write small changes in current and voltage as δI and δV. Therefore,
since
δI
δV ≈ dI
dV
δI ≈ 40I s
e 40V δV
Thisexpressionallowsthechangeindiodecurrent, δI,tobeestimatedgivenachange
indiodevoltage, δV,providedtheoperatingpointisknownandthechangesaresmall.
Example10.15 Findthe derivative ofy = e −t +t 2 , when
(a)t=1
(b)t=0
Solution
y=e −t +t 2
y ′ = −e −t +2t
(a) Whent = 1,y ′ = −e −1 +2 = 1.632, thatisy ′ (1) = 1.632.
(b) Whent = 0,y ′ = −1,thatisy ′ (0) = −1.
Engineeringapplication10.6
Obtainingalinearmodelforasimplefluidsystem
Consider the fluid system illustrated in Figure 10.18. The pump is driven by a d.c.
motor. The pump/motor can be modelled by a linear relationship in which the fluid
flowrate,q i
, isproportionaltothe control voltage, v in
, thatis
q i
=k p
v in
10.8 Differentiation as a linear operator 381
v in
Tank Valve
Pump/motor
q i
h
p
q o
Figure10.18
Afluidsystem comprising pump, tank
andvalve.
where k p
= pump/motor constant, v in
= control voltage (V), q i
= flow rate into
the tank (m 3 s −1 ). The valve has a non-linear characteristic given by the quadratic
polynomial
p = 20000q 2 o
where p = pressure at the base of the tank (Nm −2 ), q o
= flow out of the tank
(m 3 s −1 ).Thefluidbeingusediswaterwhichhasadensity ρ = 998kg m −3 .Assume
g = 9.81 ms −2 and thatk p
= 0.03 m 3 s −1 V −1 . Carry outthe following:
(a) Calculate theflowrateoutofthetank,q o
,andthecontrol voltage, v in
,when the
systemisinequilibrium and the height ofthe water inthe tank,h, is0.25 m.
(b) Obtainalinearmodelforthesystem,validforsmallchangesaboutawaterheight
of0.25m.Usethismodel tocalculatethenewwaterheightandflowrateoutof
the tank when the control voltage isincreased by 0.4 V.
Solution
(a) The pressure atthe bottomof the tank isgiven by
p = ρgh = 998 ×9.81 ×0.25 = 2448
The flowrate through the valve isgiven by
q 2 o = p
20000
√ √
p 2448
q o
=
20000 = 20000 = 0.350 m3 s −1
Nowifthesystemisinequilibrium,thentheheightofthewaterinthetankmust
have stabilized toaconstant value. Therefore,
and so
q i
=q o
= 0.350
v in
= q i
k p
= 0.350
0.03 = 11.7 V
(b) Before answering this part it is worth examining what is meant by a linear
model. Figure 10.19 shows the valve characteristic together with a linear approximationaroundanoutputflowrateofq
o
= 0.350m 3 s −1 .Thiscorresponds
toawater height of 0.25 m.
➔
382 Chapter 10 Differentiation
p (N m –2 )
2448
0.35
q o (m 3 s –1 )
Figure10.19
Relationshipbetween pressureacross
valve (p)and flowthrough valve (q o ).
Alinearmodelforthevalveisoneinwhichtherelationshipbetweenpand
q o
isapproximated by the straightlinewhich forms a tangent tothe curve atthe
operatingpoint.Theoperatingpointisthepointaroundwhichthemodelisvalid.
Itisclearthatifthestraightlineapproximationisusedforpointsthatarealarge
distance from the operating point, then the linear model will not be very accurate.
However, for small changes around the operating point the approximation
isreasonablyaccurate.Clearlyadifferentoperatingpointwillrequireadifferent
linearapproximation.Inordertoobtainthegradientofthislineitisnecessaryto
differentiate the function relating valve pressure tovalve flow. So,
p = 20000q 2 o
dp
dq o
= 40000q 0
At the operating pointq o
= 0.350. Therefore,
∣
dp ∣∣∣qo
= 40000 ×0.350 = 14000
dq o =0.350
This value is the gradient of the tangent to the curve at the operating point.
Smallchangesaroundanoperatingpointareusuallyindicatedbythenotation δ.
Therefore,
δp
≈ dp ∣ ∣∣∣qo
= 14000
δq o
dq o =0.350
δp = 14000δq o
(10.2)
Note that equality has been assumed for the purposes of the linear model. It is
easy to relate a change in pump flow to a change in control voltage because the
relationship islinear and so a linear approximation isnotrequired. So,
q i
=k p
v in
Differentiating thisequation w.r.t. v in
yields
dq i
dv in
=k p
Inthis case
δq i
δv in
= dq i
dv in
=k p
10.8 Differentiation as a linear operator 383
and has a constant value independent of the operating point:
δq i
=k p
δv in
(10.3)
Therelationshipbetweenpressureatthebottomofthetankandthewaterheight
isalsolinear.
Since p = ρgh we have dp dp
= ρg. Because is a constant wecan write
dh dh
δp
δh = dp
dh = ρg
δp= ρgδh (10.4)
Thechangeincontrolvoltage, δv in
,isgivenas0.4.Wealsoknowthatk p
= 0.03.
Therefore, using Equation (10.3),we have
δq i
= 0.03 ×0.4 = 0.012
Now if time is allowed for the system to reach equilibrium with this increased
inputflow,then δq o
= δq i
.Inotherwords,theoutputflowincreasesbythesame
amount as the input flow and the water height once again stabilizes to a fixed
value. Therefore,
δq o
= δq i
=0.012
UsingEquation (10.2),wefind
δp =14000 δq o
=14000 ×0.012 =168
UsingEquation (10.4),weget
δh = δp
ρg = 168
998 ×9.81 = 0.0172
Therefore the new water height is 0.25 +0.0172 = 0.267 m to three significant
figures. The new water flowrate is0.35 +0.012 = 0.362 m 3 s −1 .
Torecap,alltheelementsofthefluidsystemwerelinearapartfromthevalve.
Byobtainingalinearmodelforthevalve,validforvaluesclosetotheoperating
point, it was possible to calculate the effect of changing the control voltage to
themotor.Itisimportanttostressthatthelinearmodelforthevalveisonlygood
forsmallchangesaroundtheoperatingpoint.Inthiscasetheincreaseincontrol
voltage was approximately 3%. The model would not have been very good for
predictingtheeffectofa50%increaseincontrolvoltage.Linearmodelsofnonlinear
systems are particularly useful when several components are non-linear,
astheyaremucheasiertoanalyse.Weexaminetheseconceptsinmoredetailin
Chapter 18.
EXERCISES10.8
1 Differentiate the following functions:
(a) y = 4x 3 −5x 2
(b) y = 3sin(5t) +2e 4t
(c) y = sin(4t) +3cos(2t) −t
(d) y = tan(3z)
(e) y =2e 3t +17−4sin(2t)
(f) y= 1
t 3 + cos5t
2
384 Chapter 10 Differentiation
(g)y= 2w3
3 + e4w
2
(h)y= √ ( √x )
x+ln .[Hint: √ x =x 1/2 ,and use the
laws oflogarithms.]
2 Evaluate the derivatives ofthe functionsat the given
value:
(a) y=2t+9+e t/2 t=1
(b) y= t2 −4t+6
t = 2
3
(c) y=sint+cost t=1
( t
(d) y=3e 2t −2sin t = 0
2)
(e) y = 5tan(2x) + 1
e 2x x=0.5
(f) y =3lnt +sin(3t) t =0.25
3 Find dx
dt , if
(a) x = e ωt (b) x = e −ωt
where ω isaconstant.
4 Find the derivative of
(a) y = 3sin −1 (2t) −5cos −1 (3t)
(b) y= 1 2 tan−1 (t +2) +4cos −1 (2t −1)
( )
t −3
(c) y = 2sinh(3t −1) −4cosh
2
cosech(4t) +3sech(6t)
(d) y=
2
( ) ( )
(e) y = 2sinh −1 t +1
−3cosh −1 1 −t
2 2
(f) y = 3tanh −1 (2t +3) −2tanh −1 (3t +2)
5 Afunction,y(t),is given by
y(t) = t3 3 − 5t2
2 +4t+1
(a) Find dy
dt .
(b) Forwhich values oft isthe derivative zero?
6 Findthe equationofthe tangent to the curve
y(x)=x 3 +7x 2 −9
atthe point (2,27).
7 Findvalues oft in the interval [0,π] forwhichthe
tangent tox(t) = sin2t has zerogradient.
8 Findthe rate ofchangeof
when
(a) t=0
(b) t=3
z(t) =2e t/2 −t 2
Solutions
1 (a) 12x 2 −10x (b) 15cos5t +8e 4t
(c) 4cos4t −6sin2t −1
(d) 3sec 2 3z
(e) 6e 3t −8cos2t (f) − 3
t 4 −2.5sin5t
(g) 2w 2 +2e 4w (h) 0.5x −1/2 + 1 2x
2 (a) 2.8244 (b) 0 (c) −0.3012
(d) 5 (e) 33.5194 (f) 14.195
3 (a) ωe ωt (b) −ωe −ωt
4 (a)
(b)
6
√ + 15
√
1−4t 2 1−9t 2
1
2[1+ (t +2) 2 ] − 4
√ t(1−t)
(c) 6cosh(3t −1) −2sinh t −3
2
(d) −2cosech4tcoth4t −9sech6ttanh6t
(e)
(f)
1
√ (1
2 (t+1) ) 2
+1
+
6
1−(2t+3) 2 − 6
1−(3t+2) 2
5 (a) t 2 −5t+4 (b) 1,4
6 y=40x−53
7 π/4,3π/4
8 (a) 1 (b) −1.5183
3
2√ (1
2 (1−t) ) 2
−1
Review exercises 10 385
REVIEWEXERCISES10
1 Findtherateofchangeof f(x) = 5 +3x 2 atx = 2by
considering each ofthe intervals [2 − δx,2],
[2,2 + δx],[2 − δx,2 + δx].Showthatthesame
resultis obtained in each case.
2 Useatable ofderivatives and the linearity rules to
differentiate the following:
(a) y=4x 2 +6x−11
(b)y=−x 2 +2x−10
(c) y =x 1/3 −x 1/4
(d) y=5cos4x−3cos2x
(e) y = sin −1 (4x +3)
(f) y= √ 2x 4 − 5
3x 2
(g) y =t 3/2 +cost
(h)y=t 2 −14t+8
(i) y=5lnt+sin4t
(j) y= 1 2 x − 1 3 x2
(k)y= 2t2
3 +e2t
3 Find the equationofthe tangent toy =x 2 +7x −4 at
the point on the graph wherex = 2.
4 Find the rate ofchange of f (t) = 2cost +3sint at
t=1.
5 At any timet,the voltage, v, acrossan inductorof
inductanceLis relatedto the current,i,through the
inductorby v =L di
dt .
(a) Findan expression forthe voltage when
i = 5cosωt where ω isthe constantangular
frequency.
(b) Findan expression forthe voltage when the
current takesthe formofasinewave with
amplitude 10 andperiod 0.01 seconds.
6 Use the shrinking interval methodto findthe rate of
change of f (t) = sint att = 0 by consideringthe
interval [0, δt]. [Hint:use the trigonometricidentities
in Section3.6 andthe small-angle approximation in
Section6.5.]Usetheshrinkingintervalmethodtofind
the rate ofchange of f (t) = sint at ageneral point.
7 Giveny(t) = 3 +sin2t,findthe averagerate of
change ofyast variesfrom 0 to 2.
8 Explain the essentialdifference between δy dy
and
δx dx .
9 Findy ′ forthe followingfunctions:
(a) y =2e −t +6cos(t/2)
(b) y = (−t +2) 2
10 Using derivatives, estimatethe changeinyasx
changes from 1.5 to 1.55 wherey = 2e 2x +x 3 .
Solutions
1 12
2 (a) 8x+6
(b) −2x +2
1
(c) 3 x−2/3 − 1 4 x−3/4
(d) −20sin4x +6sin2x
4
(e) √
1−(4x+3) 2
(f) 2 √ 2x + 10
3x 3
3
(g) 2 t1/2 −sint
(h) 2t −14
(i)
5
t +4cos4t
(j)
(k)
1
2 − 2 3 x
4
3 t+2e2t
3 y=11x−8
4 −0.062
5 (a) −5ωLsin ωt (b) 2000πLcos200πt
7 −0.3784
( )
9 (a) −2e −t t
−3sin
2
10 4.355
(b) 2t−4
11 Techniquesof
differentiation
Contents 11.1 Introduction 386
11.2 Rulesofdifferentiation 386
11.3 Parametric,implicitandlogarithmicdifferentiation 393
11.4 Higherderivatives 400
Reviewexercises11 404
11.1 INTRODUCTION
InthischapterwedevelopthetechniquesofdifferentiationintroducedinChapter10so
thatratesofchangeofmorecomplicatedfunctionscanbefound.Weintroducerulesfor
differentiatingproductsandquotientsoffunctions.Thechainruleisusedfordifferentiating
functions of functions. We explain what is meant by defining functions implicitly
andparametricallyandshowhowthesecanbedifferentiated.Thetechniqueoflogarithmic
differentiation allows complicated products of functions to be simplified and then
differentiated.Finallyderivativesoffunctionscanthemselvesbedifferentiated.Thisinvolvestheuseofhigherderivativeswhichareexplainedtowardstheendofthechapter.
11.2 RULESOFDIFFERENTIATION
Therearethreeruleswhichenableustodifferentiatemorecomplicatedfunctions.They
are (a) the product rule, (b) the quotient rule, (c) the chain rule. Traditionally they are
written withxas the independent variable but apply in an analogous way for other independent
variables.
11.2.1 Theproductrule
Asthenamesuggests,thisruleallowsustodifferentiateaproductoffunctions,suchas
xsinx,t 2 cos2t and e z lnz.
11.2 Rules of differentiation 387
The product rulestates: if
then
y(x) =u(x)v(x)
dy
dx = du
dx v +udv dx =u′ v+uv ′
To apply this rule one of the functions in the product must be chosen to beu, and the
other, v. Beforewe can apply the ruleweneed tocalculateu ′ and v ′ .
Example11.1 Findy ′ given
(a) y=xsinx
(b)y=t 2 e t
Solution (a) y = xsinx = uv. Chooseu = x and v = sinx. Thenu ′ = 1, v ′ = cosx. Applying
the productruletoyyields
y ′ =sinx+xcosx
(b)y=t 2 e t =uv.Chooseu =t 2 andv =e t .Thenu ′ =2tandv ′ = e t . Applying the
product ruletoyyields
y ′ =2te t +t 2 e t
Engineeringapplication11.1
Dampedsinusoidalsignal
Acommonfunctionfoundinengineeringisthedampedsinusoidalsignal.Thisconsists
of a negative exponential function multiplied by a sinusoid. A typical example
is
f (t) = e −0.1t cost
The graph of this function is shown in Figure 11.1. This function approximates the
wayacarbodyreactswhenthecardrivesoveralargebumpintheroad.Fortunately,
thecarshockabsorbersensurethattheoscillationsreduceinamplitudequitequickly.
When sketching such a function it can be useful to think of the exponential term,
and its mirror image around the time axis, providing an envelope that contains the
signal. When values of the sinusoid are 1 then the signal touches the positive part
of the envelope and when values of the sinusoid are −1 then the signal touches the
negative partofthe envelope.
Therateofchangeofthissignalwithrespecttotimecanbefoundbydifferentiating
f (t)usingtheproductrule.Todosowenotethat f (t)isaproductofu(t) = e −0.1t
and v(t) = cost.Recastingtheformulafordifferentiatingaproductintermsoftwe
have
f(t) =u(t)v(t)
➔
388 Chapter 11 Techniques of differentiation
df
= du
dt dt v +udv dt
Now wehave
du
= −0.1e −0.1t
dt
dv
=−sint
dt
So
df
= du
dt dt v +udv dt
df
= −0.1e −0.1t cost +e −0.1t (−sint)
dt
Rearranging gives
df
dt
= −e −0.1t (0.1cost +sint)
1
f(t)
0.8
0.6
0.4
0.2
–0.2
–0.4
–0.6
–0.8
–1
p 2p 3p 4p 5p 6p 7p 8p 9p 10p 11p 12p 13p 14p 15p
t
Figure11.1
Adamped sinusoidalsignal.
11.2.2 Thequotientrule
This rule allows us to differentiate a quotient of functions, such as
and e−3z
z 2 −1 .
x
sinx , t2 −t+3
t +2
11.2 Rules of differentiation 389
The quotient rulestates: when
y(x) = u(x)
v(x)
then
( ) ( ) du dv
v −u
y ′ dx dx
=
= vu′ −uv ′
v 2 v 2
Example11.2 Findy ′ given
(a)y= sinx
x
t2
(b)y=
2t+1
e2t
(c)y=
t 2 +1
Solution (a)y= sinx = u x v .Sou=sinx,v=xandu′ =cosx,v ′ = 1. Using the quotient rule
the derivative ofyisfound:
(b)y=
y ′ = xcosx−sinx
x 2
t2
2t+1 = u v .Sou=t2 ,v=2t+1andu ′ =2t,v ′ = 2.Hence,
y ′ = (2t +1)2t − (t2 )(2) 2t(t +1)
=
(2t +1) 2 (2t +1) 2
(c)y=
e2t
t 2 +1 .Sou=e2t ,v=t 2 +1andu ′ = 2e 2t , v ′ = 2t. Application of the
quotient ruleyields
y ′ = (t2 +1)2e 2t −e 2t 2t
= 2e2t (t 2 −t+1)
(t 2 +1) 2 (t 2 +1) 2
11.2.3 Thechainrule
Thisrulehelpsustodifferentiatecomplicatedfunctions,whereasubstitutioncanbeused
to simplify the function. Supposey = y(z) andz = z(x). Thenymay be considered as
afunctionofx.Forexample,ify =z 3 −zandz = sin3x,theny = (sin3x) 3 −sin(3x).
Suppose weseekthe derivative, dy . Note that the derivative w.r.t.xissought.
dx
Thechain rule states:
dy
dx = dy
dz × dz
dx
390 Chapter 11 Techniques of differentiation
Example11.3 Giveny =z 6 wherez =x 2 +1 find dy
dx .
Solution If y = z 6 and z = x 2 + 1, then y = (x 2 + 1) 6 . We recognize this as the composition
y(z(x)) (see Section 2.3.6). Now y = z 6 and so dy
dz = 6z5 . Alsoz = x 2 +1 and so
dz
= 2x. Usingthe chain rule,
dx
dy
dx = dy
dz × dz
dx =6z5 2x =12x(x 2 +1) 5
Example11.4 Ify = ln(3x 2 +5x +7), find dy
dx .
Solution Weuseasubstitutiontosimplifythegivenfunction:letz = 3x 2 +5x+7sothaty = lnz.
Since
dy
y=lnz then
dz = 1 z
Also
z=3x 2 +5x+7
Using the chain rulewefind
dy
dx = dy
dz × dz
dx
= 1 z ×(6x+5)
=
6x+5
3x 2 +5x+7
andso
dz
dx =6x+5
Note that in the previous answer the numerator is the derivative of the denominator.
This result is true more generally and can be applied when differentiating the natural
logarithm ofany function:
Wheny = ln f(x)then dy
dx = f ′ (x)
f(x)
Example11.5 Findy ′ when
(a) y =ln(x 5 +8) (b) y =ln(1−t) (c) y =8ln(2−3t)
(d) y =ln(1 +x)
(e) y =ln(1 +cosx)
Solution Ineachcase weapplythe previousrule.
(a) Ify = ln(x 5 +8),theny ′ =
5x4
x 5 +8 .
(b) Ify = ln(1 −t),theny ′ = −1
1 −t = − 1
1 −t = 1
t −1 .
(c) Ify =8ln(2−3t),theny ′ =8 −3
2−3t = − 24
2−3t = 24
3t−2 .
(d) Ify =ln(1 +x)theny ′ = 1
1 +x .
(e) Ify = ln(1 +cosx)theny ′ = −sinx
1+cosx = − sinx
1+cosx .
11.2 Rules of differentiation 391
Example11.6 Differentiate
(a) y = 3e sinx
(b) y = (3t 2 +2t −9) 10
(c)y= √ 1 +t 2
Solution Inthese examples we mustformulate the functionzourselves.
(a) Letz(x) = sinx. Theny(z) = 3e z so dy
dz = 3ez ;z(x) = sinx and so dz
dx = cosx.
The chain ruleisused tofind dy
dx .
dy
dx = dy
dz × dz
dx = 3ez cosx = 3e sinx cosx
(b) Letz(t) = 3t 2 +2t −9.Theny(z) =z 10 , dy
dz = 10z9 , dz = 6t +2.Usingthechain
dt
rule dy
dt isfound:
dy
dt = dy
dz × dz
dt = 10z9 (6t +2) = 20(3t +1)(3t 2 +2t−9) 9
(c) Letz(t) =1+t 2 .Theny= √ z =z 1/2 , dy
dz = 1 2 z−1/2 and dz
dt = 2t.Usingthechain
rule,we obtain
dy
dt = dy
dz × dz
dt = 1 2 z−1/2 2t = √ t t
= √ z 1 +t
2
EXERCISES11.2
1 Usethe product ruleto differentiate the following
functions:
(a) y=sinxcosx
(b) y=lnttant
(c) y = (t 3 +1)e 2t
(d) y= √ xe x
(e) y=e t sintcost
(f) y = 3sinh2tcosh3t
(g) y=(1+sint)tant
(h) y = 4sinh(t +1)cosh(1 −t)
392 Chapter 11 Techniques of differentiation
2 Usethe quotient ruleto findthe derivatives ofthe
following:
cosx tant
(a) (b)
sinx lnt
(c)
(e)
(g)
e 2t
t 3 +1
(d)
x 2 +x+1
1+e x (f)
1+e t
1+e 2t
3x 2 +2x−9
x 3 +1
sinh2t
cosh3t
3 Usethe chainruleto differentiate the following:
(a) (t 3 +1) 100 (b) sin 3 (3t +2)
(c) ln(x 2 +1) (d) (2t +1) 1/2
(e) 3 √ cos(2x −1)
(g) (at +b) n ,aandbconstants
(f)
1
t +1
4 Differentiate eachofthe followingfunctions:
(a) y=5sinx
(c) y = 5e sinx
(e) y = (t 3 +4t) 15
(g)y=
sinx
4cosx+1
(b) y=5e x sinx
(d)y= 5sinx
e −x
(f) y = 7e −3t2
5 Forwhichvalues oft is the derivative ofy(t) = e −t t 2
zero?
6 Findtherateofchangeofyatthespecifiedvaluesoft.
(a) y = sin 1 t = 1
t
(b) y=(t 2 −1) 17 t=1
(c) y =sinh(t 2 ) t =2
(d) y= 1+t+t2
1 −t
(e) y=
et
tsint
t = 1
t = 2
7 Findthe equationofthe tangent to
8 Differentiate
y(x) = e 3x (1 −x) atthepoint (0,1)
(a) y=lnx
(b) y=ln2x
(c) y = lnkx,k constant
(d) y =ln(1+t)
(e) y =ln(3+4t)
(f) y =ln(5 +7sinx)
Solutions
1 (a) cos 2 x −sin 2 1
x (b)
t tant+lntsec2 t
e x (−x 2 +x)+2x+1
(e)
( )
(e x +1) 2
√x
(c) e 2t (2t 3 +3t 2 +2) (d) e x 1 +
2 √ 2cosh2tcosh3t −3sinh2tsinh3t
x
(f)
(cosh3t) 2
(e) e t (2cos 2 t +sintcost −1)
−e 3t −2e 2t +e t
(f) 3[2cosh2tcosh3t +3sinh2tsinh3t]
(g)
(e 2t +1) 2
(g) (1+sint)sec 2 t +sint
3 (a) 300t 2 (t 3 +1) 99
(h) 4[cosh(t +1)cosh(1 −t)
−sinh(t +1)sinh(1 −t)]
(b) 9sin 2 (3t +2)cos(3t +2)
2 (a) −cosec 2 x
2x
(c)
x 2 +1
lntsec 2 t − (tant)/t
(b)
(lnt) 2
(d) (2t +1) −1/2
e 2t (2t 3 −3t 2 −3sin(2x −1)
+2)
(e) √
(c)
cos(2x −1)
(t 3 +1) 2
−3x 4 −4x 3 +27x 2 (f) −(t +1) −2
+6x +2
(d)
(x 3 +1) 2 (g) an(at +b) n−1
11.3 Parametric, implicit and logarithmic differentiation 393
4 (a) 5cosx
5 0,2
(e) 15(t 3 +4t) 14 (3t 2 +4)
1 1 1
8 (a) (b) (c)
(f) −42te −3t2
x x x
4+cosx
1 4 7cosx
(g)
(d) (e) (f)
(4cosx +1) 2 1 +t 3+4t 5+7sinx
(b) 5e x (cosx +sinx)
6 (a) −0.5403 (b) 0 (c) 109.2
(c) 5e sinx cosx
(d) 2 (e) −2.0742
(d) 5e x (cosx +sinx)
7 y=2x+1
11.3 PARAMETRIC,IMPLICITANDLOGARITHMIC
DIFFERENTIATION
11.3.1 Parametricdifferentiation
In some circumstances bothyandxdepend upon a third variable,t. This third variable
is often called a parameter. By eliminatingt, y can be found as a function of x. For
example, ify = (1 +t) 2 andx = 2t then, eliminatingt, we can writey = (1 +x/2) 2 .
Hence,ymay be considered as a function ofx, and so the derivative dy can be found.
dx
However, sometimes the elimination oft is difficult or even impossible. Consider the
exampley=sint+t,x=t 2 +e t .Inthiscase,itisimpossibletoobtainyintermsofx.
The derivative dy can stillbe found usingthe chain rule.
dx
dy
dx = dy
dt × dt
dx = dy / dx
dt dt
Finding dy by thismethod isknown asparametric differentiation.
dx
Example11.7 Giveny = (1 +t) 2 ,x = 2t find dy
dx .
Solution Byeliminatingt, we see
(
y = 1 + x ) 2
=1+x+ x2
2 4
and so
dy
dx = 1 + x 2
Parametricdifferentiationisanalternativemethodoffinding dy
dx whichdoesnotrequire
the elimination oft.
dy
dt =2(1+t)
dx
dt = 2
394 Chapter 11 Techniques of differentiation
Using the chain rule,we obtain
dy
dx = dy / dx
dt dt = 2(1+t) =1+t=1+ x 2 2
Example11.8 Giveny = e t +t,x =t 2 +1,find dy using parametric differentiation.
dx
Solution
dy
dt =et +1
dx
dt = 2t
Hence,
dy
dx = dy / dx
dt dt = et +1
2t
In this example, the derivative is expressed in terms oft. This will always be the case
whent has not been eliminated betweenxandy.
Example11.9 Ifx = sint +cost andy =t 2 −t+1find dy (t =0).
dx
Solution
Hence,
dy
dt =2t−1
dy
dx = 2t−1
cost−sint
Whent = 0, dy
dx = −1
1 = −1.
dx
dt =cost−sint
11.3.2 Implicitdifferentiation
Suppose wearetold that
y 3 +x 3 =5sinx+10cosy
Althoughydependsuponx,itisimpossibletowritetheequationintheformy = f (x).
We say y is expressed implicitly in terms of x. The form y = f (x) is an explicit
expression for y in terms of x. However, given an implicit expression for y it is still
possible to find dy dy
. Usually will be expressed in terms of bothxandy. Essentially,
dx dx
the chain ruleisused when differentiating implicit expressions.
Whencalculating dy
dx weneedtodifferentiateafunctionofy,asopposedtoafunction
ofx. For example, we may need tofind d dx (y4 ).
Example11.10 Find
d
(a)
dx (y4 )
(b)
d
dx (y−3 )
11.3 Parametric, implicit and logarithmic differentiation 395
Solution (a) We make a substitution and letz = y 4 so that the problem becomes that of finding
dz
. Now, usingthe chain rule,
dx
dz
dx = dz
dy × dy
dx
Ifz =y 4 then dz
dy = 4y3 and so
dz
dx = 4y3dy dx
We conclude that
d
dx (y4 ) = 4y 3dy
dx
(b) Wemakeasubstitutionandletz =y −3 sothattheproblembecomesthatoffinding
dz
dxİfz =y−3 then dz
dy = −3y−4 and so
d
dx (y−3 ) = dz
dx = dz
dy × dy
dx = −3y−4dy dx
Example11.11 Find d dt (lny).
Solution Weletz = lnysothattheproblembecomesthatoffinding dz
dt .Ifz=lnythendz dy = 1 y
and so usingthe chain rule
dz
dt = dz
dy × dy
dt = 1 dy
y dt
weconclude that
d
dt (lny) = 1 dy
y dt
Examples11.10and 11.11illustrate the generalformula:
d
dx (f(y))=df dy × dy
dx .
This issimplythe chain ruleexpressed inaslightly different form.
396 Chapter 11 Techniques of differentiation
Example11.12 Find d dx (y).
Solution We have
d dy
(y) =
dx dy × dy
dx = dy
dx
Example11.13 Given
find dy
dx .
x 3 +y=1+y 3
Solution Consider differentiation of the l.h.s.w.r.t.x.
d
dx (x3 +y)= d dx (x3 )+ dy
dx =3x2 + dy
dx
Now consider differentiation of the r.h.s.w.r.t.x.
d
dx (1+y3 )= d dx (1) + d dx (y3 )= d dx (y3 )
d
We note from the formulafollowing Example 11.11 that
dx (y3 ) = 3y 2dy . So finally,
dx
3x 2 + dy
dx = 3y2dy dx
from which
dy
dx = 3x2
3y 2 −1
Note that dy isexpressed intermsofxandy.
dx
Example11.14 Find dy
dx given
(a) lny=y−x 2
(b) x 2 y 3 −e y = e 2x
Solution (a) Differentiating the given equation w.r.t.xyields
1dy
y dx = dy
dx −2x
from which
dy
dx = 2xy
y −1
11.3 Parametric, implicit and logarithmic differentiation 397
(b) Consider d dx (x2 y 3 ). Using the product rule wefind
d
dx (x2 y 3 ) = d dx (x2 )y 3 +x 2 d dx (y3 ) = 2xy 3 +x 2 3y 2dy
dx
Consider d dx (ey ).Letz = e y so dz
dy = ey . Hence,
d
dx (ey )= dz
dx = dz
dy
dy
dx = eydy dx
So, upon differentiating, the equation becomes
So,
2xy 3 +x 2 3y 2dy
dx −eydy dx = 2e2x
dy
dx (3x2 y 2 −e y ) = 2e 2x −2xy 3
from which
dy
dx = 2e2x −2xy 3
3x 2 y 2 −e y
11.3.3 Logarithmicdifferentiation
Thetechniqueof logarithmicdifferentiationisusefulwhenweneedtodifferentiatea
cumbersomeproduct.The methodinvolves takingthe naturallogarithm ofthe function
tobedifferentiated.This isillustratedinthe following examples.
Example11.15 Given thaty =t 2 (1−t) 8 find dy
dt .
Solution Theproductrulecouldbeusedbutwewilldemonstrateanalternativetechnique.Taking
the natural logarithm of both sides ofthe given equation yields
lny = ln(t 2 (1−t) 8 )
Usingthe laws of logarithms we can write thisas
lny=lnt 2 +ln(1−t) 8
=2lnt+8ln(1−t)
Both sides of thisequation are now differentiated w.r.t.t togive
d
dt (lny) = d dt (2lnt)+ d (8ln(1 −t))
dt
The evaluation of d (lny) has already been found inExample 11.11, and so
dt
1dy
y dt = 2 t − 8
1 −t
398 Chapter 11 Techniques of differentiation
Hence
(
dy 2
dt =y t − 8 )
1 −t
Finally, replacingybyt 2 (1 −t) 8 wehave
(
dy 2
dt =t2 (1−t) 8 t − 8 )
1 −t
Example11.16 Given
find dy
dx .
y =x 3 (1+x) 9 e 6x
Solution Theproductrulecouldbeused.However,wewilluselogarithmicdifferentiation.Taking
the natural logarithm of the equation and applying the laws of logarithms produces
lny = ln(x 3 (1 +x) 9 e 6x ) =lnx 3 +ln(1 +x) 9 +lne 6x
lny=3lnx+9ln(1+x)+6x
This equation isnow differentiated:
and so
1dy
y dx = 3 x + 9
1 +x +6
(
dy 3
dx =y x + 9 )
1 +x +6
= 3x 2 (1 +x) 9 e 6x +9x 3 (1 +x) 8 e 6x +6x 3 (1 +x) 9 e 6x
Example11.17 Ify = √ 1 +t 2 sin 2 t findy ′ .
Solution Taking logarithms we find
lny=ln( √ 1 +t 2 sin 2 t)
= ln √ 1 +t 2 +ln(sin 2 t)
= 1 2 ln(1 +t2 )+2ln(sint)
Differentiation yields
1
y y′ = 1 2t
21 +t +2cost 2 sint
( ) t
y ′ =y
1 +t +2cott 2
= √ 1 +t 2 sin 2 t
( t
1 +t 2 +2cott )
11.3 Parametric, implicit and logarithmic differentiation 399
EXERCISES11.3
1 Find each ofthe following:
(d) x=cos2t y=3t
d d d
(a)
dx (x5 ) (b)
dx (y5 ) (c)
dt (y5 )
(e) x= 3 y=e 2t
t
(f) x=e
d d
(d)
dx (y2 ) (e)
dx (5y) (f) d
t −e −t y=e t +e −t
dx (y) 5 Use logarithmicdifferentiationto find the derivatives
d
ofthe followingfunctions:
(g)
dx (y2 +y 3 )
(a) y =x 4 e x (b) y = 1 x e−x
2 Find eachofthe following:
(c) z =t 3 (1+t) 9 (d) y =e x sinx
d d
(a)
dx (ey ) (b)
dx (siny) (e) y =x 7 sin 4 x
d
(c)
dx (cos2y) (d) d
6 Use logarithmicdifferentiationto find the derivatives
dx (e−3y )
ofthe followingfunctions:
d
(e)
dx (√ d
y) (f)
dx (x+y) (a) z =t 4 (1−t) 6 (2+t) 4
d
(g) (cos(x +y))
dx (h) d
dx (lny)
(b)y= (1 +x2 ) 3 e 7x
(2 +x) 6
(c) x = (1+t) 3 (2+t) 4 (3+t) 5
d
(i)
dx (ln7y)
(d)y= (sin4 t)(2 −t 2 ) 4
3 Find dy
dx given
(1+e t ) 6
(e) y =x 3 e x sinx
(a) 2y 2 −3x 3 =x+y
(b) √ y + √ x=x 2 +y 3
7 Ifx=t+t 2 +t 3 andy=sin2t,find dy whent = 1.
dx
√
(c) 2x+3y=1+e x
8 Givenx=1+t 6 andy=1−t 2 ,find:
(d) y= ex√ 1 +x
(a) the rate ofchange ofxw.r.t.t whent = 2
x 2
(b) the rate ofchange ofyw.r.t.t whent = 1
(e) 2xy 4 =x 3 +3xy 2
(c) the rate ofchange ofyw.r.t.xwhent = 1
(f) sin(x+y)=1+y
(d) the rate ofchange ofxw.r.t.ywhent = 2
(g) ln(x 2 +y 2 ) = 2x −3y
(h) ye 2y =x 2 e x/2
9 Find the equations ofthe tangentsto
4 Find dy
dx ,given
y 2 =x 2 +6y
whenx = 4.
(a) x=t 2 y=1+t 3
(b) x=sint y=e t
10 Given the implicitfunction3x 2 +y 3 =yfindan
(c) x=(1+t) 3 y=1+t 3 expression for dy
dx .
Solutions
1 (a) 5x 4 (b) 5y 4dy
dx
(c) 5y 4dy
dt
(d) 2y dy
dx
(e) 5 dy
dx
(g) (2y +3y 2 ) dy
dx
(f)
dy
dx
400 Chapter 11 Techniques of differentiation
2 (a) e ydy
dx
(c) −2sin2y dy
dx
1
(e)
2 y−1/2dy dx
(
(g) −sin(x +y)
(h)
3 (a)
(b)
1 dy
y dx
1+9x 2
4y−1
(4x √ x−1) √ y
(1−6y 2√ y) √ x
(b) cosy dy
dx
(d) −3e −3ydy
dx
(f) 1+ dy
dx
)
1 + dy
dx
1 dy
(i)
y dx
2
(c)
3 (ex√ 2x+3y−1)
(
1
(d) y
2(1 +x) +1− 2 )
x
(e)
(f)
(g)
3x 2 +3y 2 −2y 4
2xy(4y 2 −3)
cos(x +y)
1 −cos(x +y)
2(x 2 +y 2 −x)
3x 2 +3y 2 +2y
5 (a) (4x 3 +x 4 )e x
( )
(b) −e −x 1
x 2 + 1 x
( )
(c) t 3 (1+t) 9 3
t + 9
1 +t
(d) e x sinx(1 +cotx)
( )
(e) x 7 sin 4 7
x
x +4cotx
(
)
4
6 (a)z
t − 6
1 −t + 4
2 +t
(
)
6x
(b) y
1+x 2 +7− 6
2 +x
(
)
3
(c) x
1 +t + 4
2 +t + 5
3 +t
(
)
(d) y 4cott −
8t
2−t 2 − 6et
1+e t
( )
(e) x 3 e x 3
sinx
x +1+cotx
7 −0.1387
8 (a) 192 (b) −2 (c) − 1 3
(h)
e (x/2−2y) x(x +4)
2(1 +2y)
4 (a) 3t/2 (b)
(d)
−3
2sin2t
e t
cost
(c)
(e) −2e 2t t 2 /3 (f)
( ) 2
t
1 +t
e t −e −t
e t +e −t
(d)
dx
dy = −3t4 ;whent =2, dx
dy = −48
9 y = −4x+6 ,y = 4x+24
5 5
10 y ′ = 6x
1−3y 2
11.4 HIGHERDERIVATIVES
The derivative, y ′ , of a function y(x) is more correctly called the first derivative of y
w.r.t.x.Sincey ′ itselfisafunctionofx,thenitisoftenpossibletodifferentiatethistoo.
Thederivative ofy ′ iscalled the secondderivative ofy:
secondderivative ofy = d dx
( ) dy
dx
which iswrittenas d2 y
dx 2 or more compactly asy′′ .
11.4 Higher derivatives 401
Example11.18 Ify(x) = 3x 2 +8x +9,findy ′ andy ′′ .
Solution y ′ =6x+8 y ′′ = d dx (6x+8)=6
Example11.19 Ify(t) = 2sin3t, findy ′ andy ′′ .
Solution y ′ = 6cos3t y ′′ = −18sin3t
Thefirst and secondderivatives w.r.t.time,t, arealsodenotedbyẏandÿ.
Example11.20 Findy ′′ given
1+xy=x 2 +y 2
Solution Theequation isdifferentiated implicitly toobtain dy
dx :
0 +y+xy ′ =2x+2yy ′
y ′ = 2x−y
x−2y
Thequotientruleisnowusedwithu = 2x −yand v =x −2y.Thederivativesofuand
v are
Then,
du
dx =u′ =2−y ′
dv
dx = v′ =1−2y ′
y ′′ = (x −2y)(2 −y′ ) − (2x −y)(1 −2y ′ )
(x −2y) 2
This issimplified to
y ′′ = 3xy′ −3y
(x −2y) 2
Replacingy ′ by 2x−y and simplifying yields
x−2y
y ′′ = 6(x2 −xy+y 2 )
(x −2y) 3
Note that it is possible to simplify this further by observing thatx 2 +y 2 = 1 +xy as
given. Therefore,
y ′′ =
6(1 +xy −xy)
(x −2y) 3 =
6
(x −2y) 3
Justasthefirstderivativemaybedifferentiatedtoobtainthesecondderivative,sothe
second derivative may be differentiated to find the third derivative and so on. A similar
402 Chapter 11 Techniques of differentiation
notation is used. The third derivative is written d3 y
dx 3 ory′′′ ory (3) . The fourth derivative
iswritten d4 y
dx 4 oryiv ory (4) . The fifth derivative iswritten d5 y
dx 5 oryv ory (5) .
Example11.21 Findthe first five derivatives ofz(t) = 2t 3 +sint.
Solution z ′ =6t 2 +cost z iv =sint
z ′′ =12t−sint z v =cost
z ′′′ =12 −cost
Example11.22 Calculate the values ofxforwhichy ′′ = 0,giveny =x 4 −x 3 .
Solution y=x 4 −x 3 y ′ =4x 3 −3x 2 y ′′ =12x 2 −6x
Puttingy ′′ = 0 gives
Hence
12x 2 −6x =0 andso 6x(2x −1) =0
x=0, 1 2
The first and second derivatives can be used to describe the nature of increasing and
decreasing functions. In Figure 11.2(a, b) the tangents to the curves have positive gradients,
that isy ′ > 0. As can be seen, asxincreases the value of the function increases.
Conversely, in Figure 11.2(c, d) the tangents have negative gradients (y ′ < 0) and as
x increases the value of the function decreases. The sign of the first derivative tells us
whetheryis increasing or decreasing. However, the curves in (a) and (b) both showy
increasingbut, clearly, thereisadifference inthe wayychanges.
ConsideragainFigure11.2(a).ThetangentsatA,BandCareshown.Asxincreases
thegradientofthetangentincreases,thatisy ′ increasesasxincreases.Sincey ′ increases
as x increases then the derivative of y ′ is positive, that is y ′′ > 0. (Compare with: y
increaseswhenitsderivativeispositive.)SoforthecurveshowninFigure11.2(a),y ′ > 0
andy ′′ > 0.
For that shown in Figure 11.2(b) the situation is different. The value ofy ′ decreases
asxincreases, as can be seen by considering the gradients of the tangents at A, B and
C, thatisthe derivative ofy ′ mustbenegative. For thiscurvey ′ > 0 andy ′′ < 0.
A function is concave down wheny ′ decreases and concave up wheny ′ increases.
HenceFigure11.2(a)illustratesaconcaveupfunction;Figure11.2(b)illustratesaconcavedownfunction.Thesignofthesecondderivativecanbeusedtodistinguishbetween
concave upand concave down functions.
Consider now the functions shown in Figure 11.2(c) and Figure 11.2(d). In both (c)
and(d),yisdecreasingandsoy ′ < 0.In(c)thegradientofthetangentbecomesincreasinglynegative;thatis,itisdecreasing.Hence,forthefunctionin(c)y
′′ < 0.Conversely,
for the function in (d) the gradient of the tangent is increasing asxincreases, although
it is always negative, that isy ′′ > 0. So for the function in (c)y ′ < 0 andy ′′ < 0; that
11.4 Higher derivatives 403
y
C
B
y
C
A
y
B
A
y
(a)
A
B
x
A
(b)
x
(c)
C
x
(d)
B
C
x
Figure11.2
(a)yis concave up (y ′ > 0,y ′′ > 0);(b)yisconcave down (y ′ > 0,y ′′ < 0);(c)yis concave down (y ′ < 0,y ′′ < 0);(d)y
is concave up (y ′ < 0,y ′′ > 0).
is, the function is concave down. For the function in (d)y ′ < 0 andy ′′ > 0; that is, the
function isconcave up. Insummary, we can state:
Wheny ′ > 0,yisincreasing.Wheny ′ < 0,yisdecreasing.
Wheny ′ isincreasingthe functionisconcave up. Inthiscasey ′′ > 0.
Wheny ′ isdecreasing the functionisconcave down. Inthiscasey ′′ < 0.
EXERCISES11.4
An easy way of determining the concavity of a curve is to note that as the curve is
traced from left to right, an anticlockwise motion reveals that the curve is concave up.
Aclockwisemotionmeansthatthe curve is concave down.
Aswillbeseeninthenextchapter,higherderivativesareusedtodeterminethelocationandnatureofimportantpointscalledmaximumpoints,minimumpointsandpoints
ofinflexion.
1 Calculate dy
dt and d2 y
dt 2 given
(a) y=t 2 +t
2 If
(b) y=2t 3 −t 2 +1
(c) y = sin2t
(d) y = sinkt
(e) y=2e 3t −t 2 +1
(f) y=
t
t +1
(g) y=4cos t 2
(h) y =e t t
(i) y = sinh4t
(j) y = sin 2 t
k constant
y=2x 3 +3x 2 −12x+1
findvalues ofxforwhichy ′′ = 0.
3 If dy
dt =3t2 +t,find
(a)
d 2 y
dt 2
(b)
d 3 y
dt 3
4 Find values oft atwhichy ′′ = 0,where
y = t3 3 − 7t2
2 +12t−1
5 Determine whether the following functionsare
concave up orconcave down.
(a) y=e t (b) y=t 2 (c) y=1+t−t 2
6 Determine the interval onwhichy =t 3 is
(a) concave up,
(b) concave down.
7 Evaluatey ′′ atthe specifiedvalue oft.
(a) y=2cost−t 2 t=1
(b) y= sint+cost
2
t=π/2
(c) y=(1+t)e t t=0
8 Find d2 y
dx 2 givenxy +x2 =y 2 .
9 Find dx
dt whenx3 + x t =t2 +x 2 t.
404 Chapter 11 Techniques of differentiation
Solutions
1 (a) 2t+1,2
(b) 6t 2 −2t,12t −2
(c) 2cos2t, −4sin2t
(d) kcoskt,−k 2 sinkt
(e) 6e 3t −2t,18e 3t −2
1
(f)
2
(t +1) 2,− (t +1) 3
(g) −2sin(t/2),−cos(t/2)
(h) e t (t +1),e t (t +2)
(i) 4cosh4t,16sinh4t
(j) 2sintcost,2cos2t
2 − 1 2
3 (a) 6t+1 (b) 6
4
7
2
5 (a) concave up
(b) concave up
(c) concave down
6 (a) concave upon (0,∞)
(b) concave down on (−∞,0)
7 (a) −3.08 (b) − 1 2
8
9
10y 2 −10x 2 −10yx
(2y −x) 3
x+2t 3 +x 2 t 2
3x 2 t 2 +t −2xt 3
(c) 3
REVIEWEXERCISES11
1 Differentiate eachofthe followingfunctions:
(a) y = sin(5 +x) 2
(b) y = e 2sinx
(c) y = (4x +7) 5
(d) y =x 4 sin3x
(e) y=
e4x
x 3 +11
(f) y=x 2 tanx
(g) y= cos3x
x 2
(h) y = e −x cos5x
(i) y = lncos4x
(j) y = sin2tcos2t
(k) y= 1
x 2 +1
2 Find dy in each ofthe following cases:
dx
(a) y= x3 sin2x
cosx
(b) y =x 3 e −x tanx
(c) y= xe5x
sinx
(d) x 2 +3xy+y 2 =5
(e) 9=3x 3 +2xy 2 −y
3 Ifx= 5+3t
1 −t
andy= 2 −t dy
find
1 −t dx and d2 y
dx 2.
4 Ifx=4(1+cosθ)andy=3(θ−sinθ)find dy
dx .
5 Ifx=3cos 2 θandy=2sin 2 θfind dy
dx and d2 y
dx 2.
6 Show thaty = e −4x sin8x satisfiesthe equation
y ′′ +8y ′ +80y = 0.
7 Differentiatey =x x .
8 Giveny =x 3 e 2x find
dy d 2 y
(a) (b)
dx dx 2
(c)
d 3 y
dx 3
9 Uselogarithmicdifferentiationto find dx
dt given
(a) x =te t sint
(b) x =t 2 e −t cos3t
(c) x =t 2 e 3t sin4tcos3t
10 Show thatify(t) =Asin ωt +Bcos ωt,where ω isa
constant, then
y ′′ +ω 2 y=0
Review exercises 11 405
Solutions
1 (a) 2(5 +x)cos(5 +x) 2
(b) 2cosxe 2sinx
(c) 20(4x +7) 4
(d) 3x 4 cos3x +4x 3 sin3x
(e)
2 (a)
e 4x (4x 3 −3x 2 +44)
(x 3 +11) 2
(f) x 2 sec 2 x +2xtanx
−3xsin3x −2cos3x
(g)
x 3
(h) −e −x (5sin5x +cos5x)
(i) −4tan4x
(j) 2cos4t
−2x
(k)
(x 2 +1) 2
cosx(2x 3 cos2x +3x 2 sin2x) +x 3 sin2xsinx
cos 2 x
whichsimplifiesto 2x 3 cosx +6x 2 sinx
(b) e −x (x 3 sec 2 x −x 3 tanx +3x 2 tanx)
(c)
e 5x (5xsinx +sinx −xcosx)
sin 2 x
3
(d) − 2x+3y
3x+2y
9x 2 +2y 2
(e)
1−4xy
1
8 ,0
4 − 3(1−cosθ)
4sinθ
5 − 2 3 ,0
7 x x (lnx+1)
8 (a) e 2x (2x 3 +3x 2 )
(b) 2xe 2x (2x 2 +6x +3)
(c) 2e 2x (4x 3 +18x 2 +18x +3)
9 (a) e t (tcost+(t+1)sint)
(b) −e −t ((t 2 −2t)cos3t +3t 2 sin3t)
(
)
(c) t 2 e 3t 2 4cos4t
sin4tcos3t +3+ − 3sin3t
t sin4t cos3t
12 Applicationsof
differentiation
Contents 12.1 Introduction 406
12.2 Maximumpointsandminimumpoints 406
12.3 Pointsofinflexion 415
12.4 TheNewton--Raphsonmethodforsolvingequations 418
12.5 Differentiationofvectors 423
Reviewexercises12 427
12.1 INTRODUCTION
In this chapter the techniques of differentiation are used to solve a variety of problems.
It is possible to use differentiation to find the maximum or minimum values of a function.
For example, it is possible to find the maximum power transferred from a voltage
source to a load resistor, as we shall show later in the chapter. Differentiation is also
usedintheNewton--Raphsonmethodofsolvingnon-linearequations.Suchanequation
needs to be solved to calculate the steady-state values of current and voltage in a series
diode--resistorcircuit.
Finallyweshowhowvectorscanbedifferentiated.Thisformsanintroductiontothe
important topic of vector calculus which is discussedinChapter 26.
12.2 MAXIMUMPOINTSANDMINIMUMPOINTS
Consider Figure 12.1. A and B are important points on the curve. At A the function
stops increasing and starts to decrease. At B it stops decreasing and starts to increase.
A is a local maximum, B is a local minimum. Note that A is not the highest point on
the curve, nor B the lowest point. However, for that part of the curve near to A, A is
the highest point. The word ‘local’ is used to stress that A is maximum in its locality.
Similarly, Bisthe lowestpoint initslocality.
12.2 Maximum points and minimum points 407
y
A
y
A
(a)
B
Figure12.1
Thefunctionyhas a local maximum at Aand alocal minimumat B.
x
(b)
B
x
In Figure 12.1(a) tangents drawn at A and B would be parallel to thexaxis and so
atthesepoints dy
dx iszero.However,inFigure12.1(b)therearecornersatAandB.Itis
impossibletodraw tangents atthese points and so dy does notexistatthese points.
dx
Hence, when searching for maximum and minimum points we need only examine
thosepoints atwhich dy dy
is zero,or does not exist.
dx dx
Points at which dy is zero are known as turning points or stationary values of the
dx
function.
At maximumand minimumpoints either:
dy
(i) does notexist, or
dx
dy
(ii)
dx = 0
To distinguish between maximum and minimum points we can study the sign of dy
dx on
eithersideofthepoint.AtmaximumpointssuchasA,yisincreasingimmediatelytothe
leftofthepoint,anddecreasingimmediatelytotheright.Thatis, dy
dx ispositiveimmediatelytotheleft,and
dy
dx isnegativeimmediatelytotheright.Atminimumpointssuchas
B,yisdecreasingimmediatelytotheleftofthepoint,andincreasingimmediatelytothe
right.Thatis, dy
dy
isnegativeimmediatelytotheleft,and
dx dx ispositiveimmediatelytothe
right.Thisso-calledfirst-derivativetestenablesustodistinguishmaximafromminima.
This testcan be used even when the derivative does notexistatthe pointinquestion.
Thefirst-derivative testtodistinguish maximafromminima:
To the left ofamaximumpoint, dy dy
ispositive; tothe right,
dx dx isnegative.
To the left of a minimum point, dy
dy
isnegative; tothe right,
dx dx ispositive.
408 Chapter 12 Applications of differentiation
Example12.1 Determinethepositionandnatureofallmaximumandminimumpointsofthefollowing
functions:
(a)y=x 2
(b)y=−t 2 +t+1
(c)y= x3
3 + x2
2 −2x+1
(d) y = |t|
Solution (a) Ify =x 2 , then by differentiation dy
dx = 2x.
Recall thatatmaximum and minimum points either
(i)
(ii)
dy
does notexist, or
dx
dy
= 0.We mustcheck both of these conditions.
dx
The function 2x exists for all values of x, and so we move to examine any points
where dy = 0.So, wehave
dx
dy
dx =2x=0
Theequation2x = 0hasonesolution,x = 0.Weconcludethataturningpointexists
atx = 0. Furthermore, from the given functiony =x 2 , we see that whenx = 0 the
valueofyisalso0,soaturningpointexistsatthepointwithcoordinates (0,0).To
determine whether this point is a maximum or minimum we use the first-derivative
test and examine the sign of dy on either side ofx = 0. To the left ofx = 0,xis
dx
clearly negative and so 2x is also negative. To the right ofx = 0,xis positive and
so2xisalsopositive.Henceyhasaminimumatx = 0.Agraphofy =x 2 showing
thisminimum isgiven inFigure 12.2.
(b) Ify=−t 2 +t + 1, theny ′ = −2t + 1 and this function exists for all values oft.
Solvingy ′ = 0 we have
−2t+1=0 andsot= 1 2
Weconcludethatthereisaturningpointatt = 1 2 .Theycoordinatehereis − ( 1
2) 2
+
1
2 +1=11 4 . We now inspect the sign ofy′ to the left and to the right oft = 1 2 . A
little to the left, say att = 0, we see thaty ′ = −2(0) +1 = 1 which is positive. A
littletotheright,sayatt = 1,weseethaty ( ′ = −2(1) +1 = −1whichisnegative.
1
Hence thereisamaximum atthe point
2 4)
,11 .
Agraph ofthe function isshown inFigure 12.3.
12.2 Maximum points and minimum points 409
y
y = x 2
y
y = –t 2 + t + 1
1–
2
t
x
Figure12.2
Thefunctionyhas a minimumatx = 0.
Figure12.3
Thefunctionyhas amaximum att = 1 2 .
(c) Ify= x3
3 + x2
2 −2x +1,theny′ =x 2 +x−2andthisfunctionexistsforallvalues
ofx. Solvingy ′ = 0 wefind
x 2 +x−2=0
(x−1)(x+2)=0
x=1,−2
Therearethereforetwoturningpoints,oneatx = 1andoneatx = −2.Weconsider
each inturn.
Atx = 1,weexaminethesignofy ′ totheleftandtotherightofx = 1.Alittleway
to the left, say atx = 0, we see thaty ′ = −2 which is negative. A little to the right,
sayatx = 2,weseethaty ′ = 2 2 +2−2 = 4whichispositive.Sothepointwhere
x = 1 isaminimum.
Atx = −2, we examine the sign ofy ′ to the left and to the right ofx = −2. A little
way to the left, say atx = −3, we see thaty ′ = (−3) 2 + (−3) − 2 = 4 which is
positive.Alittletotheright,sayatx = −1,weseethaty ′ = (−1) 2 +(−1)−2 = −2
which isnegative. So the point wherex = −2 isamaximum.
Agraph of the function is shown inFigure 12.4.
(d) Recall that the modulus functiony = |t| isdefined as follows:
{ −t t0
y=|t|=
t t>0
A graph of this function was given in Figure 10.13(a) and this should be looked at
before continuing. Note that dy = −1 fort negative, and dy = +1 fort positive.
dt
dt
Thederivativeisnotdefinedatt = 0becauseofthecornerthere.Therearenopoints
when dy = 0. Because the derivative is not defined att = 0 this point requires
dt
further scrutiny. To the left oft = 0, dy dy
< 0; to the right,
dt dt >0andsot=0isa
minimum point.
y
y = —
x 3
+ —
x 2
– 2x + 1
3 2
–2
1
x
Figure12.4
Thefunctionyhas a maximum atx = −2
and aminimum atx = 1.
410 Chapter 12 Applications of differentiation
y
dy
—
dx > 0
dy
—
dx = 0
Figure12.5
The derivative dy decreases on
dx
passingthrough amaximum point.
dy
—
dx < 0
x
y
dy
—
dx < 0
dy
—
dx = 0
dy
—
dx > 0
Figure12.6
Thederivative dy increases onpassing
dx
through aminimum point.
x
Rather than examine the sign of y ′ on both sides of the point, a second-derivative
test may be used. On passing through a maximum pointy ′ changes from positive to 0
to negative, as shown in Figure 12.5. Hence,y ′ is decreasing. Ify ′′ is negative then this
indicates y ′ is decreasing and the point is therefore a maximum point. Conversely, on
passingthroughaminimumpoint,y ′ increases,goingfromnegativeto0topositive(see
Figure 12.6). Ify ′′ ispositive theny ′ isincreasing and this indicates a minimum point.
So,havinglocatedthepointswherey ′ = 0,welookatthesecondderivative,y ′′ .Thus
y ′′ > 0 implies a minimum point;y ′′ < 0 implies a maximum point. Ify ′′ = 0, then we
must return to the earlier, more basic test of examiningy ′ on both sides of the point. In
summary:
Thesecond-derivative test todistinguish maximafrom minima:
Ify ′ = 0 andy ′′ < 0 atapoint,thenthisindicates thatthe pointisamaximum
turningpoint.
Ify ′ = 0 andy ′′ > 0 atapoint,thenthisindicates thatthe pointisaminimum
turningpoint.
Ify ′ = 0 andy ′′ = 0 atapoint,the second-derivative testfailsand you must
use the first-derivative test.
Example12.2 Usethesecond-derivativetesttofindallmaximumandminimumpointsofthefunctions
inExample 12.1.
Solution (a) Given y = x 2 then y ′ = 2x and y ′′ = 2. We locate the position of maximum and
minimum points by solvingy ′ = 0 and so such a point exists atx = 0. Evaluating
y ′′ atthispointweseethaty ′′ (0) = 2whichispositive.Usingthesecond-derivative
testweconcludethatthe pointisaminimum.
(b) Giveny = −t 2 +t + 1 theny ′ = −2t + 1 andy ′′ = −2. Solvingy ′ = 0 we find
t = 1 ( ) 1
2 . Evaluatingy′′ at this point we findy ′′ = −2 which is negative. Using
2
the second-derivative testwe conclude thatt = 1 isamaximum point.
2
(c) Given y = x3
3 + x2
2 −2x+1,theny′ =x 2 +x−2andy ′′ =2x+1.y ′ =0
at x = 1 and x = −2. At x = 1, y ′′ = 3 which is positive and so the point
12.2 Maximum points and minimum points 411
is a minimum. At x = −2, y ′′ = −3 which is negative and so the point is a
maximum. ⎧
⎪⎨ −1 t<0
(d) y ′ = 1 t>0
⎪⎩
undefined att = 0
Since y ′ (0) is undefined, we use the first-derivative test. This was employed in
Example 12.1.
Engineeringapplication12.1
Risetimeforasecond-orderelectricalsystem
Consider the electrical system illustrated in Figure 12.7. The input voltage, v i
, is
appliedtoterminalsa--b.Theoutputfromthesystemisavoltage, v o
,measuredacross
the terminals c--d. The easiest way to determine the time response of this system to
a particular input istouse the technique of Laplace transforms (seeChapter 21).
When a step input is applied to the system, the general form of the response dependsonwhetheraquantitycalledthedampingratio,
ζ,issuchthat ζ > 1,ζ = 1or
ζ < 1.Thequantity ζ itselfdependsuponthevaluesofL,CandR.Thisisillustrated
inFigure12.8.Ifthedampingratio, ζ < 1,then v o
overshootsitsfinalvalueandthe
systemissaidtobe underdamped. For thiscase itcan beshown that
(
v o
=U −Ue −αt cos(βt)+ αsin(βt) )
for t > 0 (12.1)
β
whereU isthe height ofastepinputapplied att = 0,and
α = R 2L
(12.2)
ω r
= 1 √
LC
resonant frequency (12.3)
β =
√
ωr 2 − α2 naturalfrequency (12.4)
Engineers are often interested in knowing how quickly a system will respond to a
particular input. For many systems this is an important design criterion. One way of
characterizingthespeedofresponseofthesystemisthetimetakenfortheoutputto
reach a certain level in response to a step input. This is known as the rise time and
is often defined as the time taken for the output to rise from 10% to 90% of its final
value.However,bylookingattheunderdampedresponseillustratedinFigure12.8it
is clear that the time,t m
, required for the output to reach its maximum value would
also provide an indicator of system response time. As the derivative of a function is
zero atamaximum point itispossible tocalculate thistime.
Differentiating Equation (12.1) and usingthe product rule,
dv o
dt
= d dt
( (U −Ue −αt cos(βt)+ αsin(βt)
β
))
( ) e −αt αsin(βt)
t > 0
=0−U d dt (e−αt cos(βt)) −U d dt
β
➔
412 Chapter 12 Applications of differentiation
= −U(−αe −αt cos(βt) −e −αt βsin(βt))
( −αe −αt )
αsin(βt)
−U
+ e−αt αβcos(βt)
β
β
(
)
= −Ue −αt −αcos(βt) − βsin(βt) − α2 sin(βt)
+ αcos(βt)
β
( )
=Ue −αt β + α2
sin(βt)
β
At a turning point dv o
= 0.Hence
dt
( β
Ue −αt 2 + α 2 )
sin(βt) =0
β
y o (t)
z < 1
a
R
L
c
U
z = 1
y i
b
Figure12.7
Asecond-order electrical system.
C
d
y o
t m
z > 1
Figure12.8
Responseofasecond-order system to a step
input.
This occurs when sin(βt) = 0, which corresponds tot = kπ/β, k = 0,1,2....
It is now straightforward to calculate t m
, once β has been calculated, using Equations
(12.2), (12.3) and (12.4) for particular values ofR,L andC. You may like to
show that the turning point corresponding to k = 1 is a maximum by calculating
d 2 v o
dt 2 and carrying outthe second-derivative test.
Itispossibletocheckwhetherornotasystemisunderdampedusingthefollowing
formulae:
ζ = R R c
damping ratio (12.5)
t
√
L
R c
=2
C
criticalresistance (12.6)
LetuslookataspecificcasewithtypicalvaluesL = 40mH,C = 1 µF,R = 200.
UsingEquations (12.5) and (12.6),wefind
√ √
L
R c
=2
C = 2 4 ×10 −2
= 400
1 ×10−6 ζ = R R c
= 200
400 = 0.5
12.2 Maximum points and minimum points 413
and therefore the systemisunderdamped because ζ < 1.
Also,
ω r
= 1 √
LC
= 5000
α = R 2L = 200
= 2500
2×4×10−2 √
β = ωr 2 −α2 = √ 5000 2 −2500 2 = 4330
Finally,
t m
=
π
4330 = 7.26 ×10−4 = 726 µs
We conclude forthis case that the risetimeis726 µs.
Engineeringapplication12.2
Maximumpowertransfer
ConsiderthecircuitofFigure12.9inwhichanon-idealvoltagesourceisconnected
toavariableloadresistorwithresistanceR L
.ThesourcevoltageisV anditsinternal
resistanceisR S
.CalculatethevalueofR L
whichresultsinthemaximumpowerbeing
transferred from the voltage source to the load resistor. This is an essential piece of
information for engineers involved in the design of power systems. Often an importantdesignconsiderationistotransferthemaximumamountfromthepowersource
tothe pointwhere the power isbeingconsumed.
i
R S
Non-ideal
voltage
source
+
V
–
R L
Figure12.9
Maximum power transferoccurs whenR L =R S .
Solution
Letibethecurrentflowinginthecircuit.UsingKirchhoff’svoltagelawandOhm’s
law gives
V =i(R S
+R L
)
LetPbethe power developed inthe loadresistor.Then,
P=i 2 R L
=
V2 R L
(R S
+R L
) 2 ➔
414 Chapter 12 Applications of differentiation
ClearlyPdependsonthevalueofR L
.Differentiatingw.r.t.R L
andusingthequotient
rule,weobtain
dP
=V 21(R S +R L )2 −R L
2(R S
+R L
)
dR L
(R S
+R L
) 4
=V 2 (R S +R L )−2R L
(R S
+R L
) 3
=V 2 R S
−R L
(R S
+R L
) 3
Equating dP
dR L
tozero toobtain the turning pointgives
thatis,
V 2 R S
−R L
(R S
+R L
) 3 = 0
R L
=R S
So a turning point occurs when the load resistance equals the source resistance. We
need tocheck ifthisisamaximum turning point,so
dP
dR L
=V 2 R S
−R L
(R S
+R L
) 3
d 2 P
dR 2 L
=V 2 −1(R S +R L )3 − (R S
−R L
)3(R S
+R L
) 2
(R S
+R L
) 6
=V 2 −(R S +R L )−3(R S −R L )
(R S
+R L
) 4
=V 2 2R L −4R S
(R S
+R L
) 4
= 2V 2 (R L −2R S )
(R S
+R L
) 4
WhenR L
= R S
, this expression is negative and so the turning point is a maximum.
Therefore, maximum power transfer occurs when the load resistance equals the
source resistance.
EXERCISES12.2
1 Locatethe position ofany turningpointsofthe
following functionsanddeterminewhether they are
maxima orminima.
(a) y=x 2 −x+6 (b) y=2x 2 +3x+1
(c) y=x−1 (d) y=1+x−2x 2
(e) y=x 3 −12x (f) y=7+3x
2 Locateandidentify allturningpointsof
(a) y= x3
3 −3x2 +8x+1
(b) y =te −t
(c) y=x 4 −2x 2
12.3 Points of inflexion 415
Solutions
1 (a)
( 1
2 ,53 4)
,minimum
(b) (−0.75,−0.125),minimum
(c) none
(d) (0.25,1.125),maximum
(e) (2,−16) minimum, (−2,16) maximum
(f) none
2 (a)
(
2, 23 ) (
maximum, 4, 19 )
minimum
3 3
(b) (1,0.368),maximum
(c) (0,0) maximum, (1,−1) minimum, (−1,−1)
minimum
TechnicalComputingExercises12.2
Computerlanguagessuch asMATLAB ® are matrix
orientatedanddonotalways providethe ability to
differentiate functions.Others such asWolfram
Mathematica andMaplesoftMaple have thiscapability by
default. Ifyou are attempting the followingexercisesin
MATLAB ® you mayrequire the Symbolic Math Toolbox
whichis an add-on forthe main program.
(a) Useatechnicalcomputinglanguagetofindy ′ and
y ′′ wheny = e −0.2t cost.
(b) Solvey ′ = 0 andhence locate any turningpoints
in the interval [0,6]anddeterminetheirtype.
(c) Plotagraphofyandcheck the position of
the turningpointswith the resultsobtainedin
part (b).
12.3 POINTSOFINFLEXION
Recall from Section 11.4 that when the gradient of a curve, that is y ′ , is increasing,
the second derivative y ′′ is positive and the curve is said to be concave up. When the
gradient is decreasing the second derivative y ′′ is negative and the curve is said to be
concave down. A point at which the concavity of a curve changes from concave up to
concave down orviceversa iscalled a point ofinflexion.
Apointofinflexionisapointonacurvewheretheconcavitychangesfromconcave
up to concave down or vice versa. It follows that y ′′ = 0 at such a point or, in
exceptionalcases,y ′′ doesnotexist.
Figure 12.10(a) shows a graph for which a point of inflexion occurs at the point
markedA.Notethatatthispointthegradientofthegraphiszero.Figure12.10(b)shows
a graph with points of inflexion occurring at A and B. Note that at these points the
gradientofthe graph isnotzero.
Tolocateapointofinflexionwemustlookforapointwherey ′′ = 0ordoesnotexist.
We mustthenexamine the concavity ofthe curve oneither sideofsuchapoint.
416 Chapter 12 Applications of differentiation
y
y
A
A
B
(a)
x
Figure12.10
(a) There isapoint ofinflexion atA; (b)there are pointsofinflexion at A
and B.
(b)
y
x
y = x 4 x
y
y = x 3
y'' > 0
y'' = 0
x
y' < 0, y'' > 0 y' > 0, y'' > 0
y'' < 0
Figure12.11
The second derivative,y ′′ ,changes
sign atx = 0.
Figure12.12
Thederivative,y ′ ,changessignatx = 0,but
y ′′ remainspositive.
Example12.3 Locateany points ofinflexion ofthe curvey =x 3 .
Solution Given y = x 3 , then y ′ = 3x 2 and y ′′ = 6x. Points of inflexion can only occur where
y ′′ = 0 or does not exist. Clearlyy ′′ exists for allxand is zero whenx = 0. It is possible
that a point of inflexion occurs whenx = 0 but we must examine the concavity of
the curve on either side. To the left ofx = 0,xis negative and soy ′′ is negative. Hence
to the left, the curve is concave down. To the right of x = 0, x is positive and so y ′′
is positive. Hence to the right, the curve is concave up. Thus the concavity changes at
x = 0.Weconcludethatx = 0isapointofinflexion.AgraphisshowninFigure12.11.
Note thatatthis pointofinflexiony ′ = 0 too.
Acommonerroristostatethatify ′ =y ′′ = 0thenthereisapointofinflexion.This
isnotalways true;considerthe next example.
Example12.4 Locateall maximumpoints,minimum points and points ofinflexion ofy =x 4 .
Solution y ′ = 4x 3 y ′′ = 12x 2
12.3 Points of inflexion 417
y ′ = 0 atx = 0. Alsoy ′′ = 0 atx = 0 and so the second-derivative test is of no help
in determining the position of maximum and minimum points. We return to examiney ′
on both sides ofx = 0. To the left ofx = 0,y ′ < 0; to the righty ′ > 0 and sox = 0 is
a minimum point. Figure 12.12 illustrates this. Note that at the pointx = 0, the second
derivativey ′′ is zero. However,y ′′ is positive both to the left and to the right ofx = 0;
thusx = 0 isnot a point of inflexion.
Example12.5 Find any maximum points,minimum points and points of inflexion ofy =x 3 +2x 2 .
Solution Giveny =x 3 +2x 2 theny ′ = 3x 2 +4xandy ′′ = 6x +4.Letusfirstfindanymaximum
andminimumpoints.Thefirstderivativey ′ iszerowhen3x 2 +4x =x(3x+4) = 0,that
iswhenx=0orx=− 4 3 . Using the second-derivative test we findy′′ (0) = 4 which
(
corresponds to a minimum point. Similarly,y ′′ − 4 )
= −4 which corresponds to a
3
maximum point.
We seek points of inflexion by looking for points wherey ′′ = 0 and then examining
the concavity on either side.y ′′ = 0 whenx = − 2 3 .
Sincey ′′ is negative whenx < − 2 3 , theny′ is decreasing there, that is the function is
concavedown.Also,y ′′ ispositivewhenx > − 2 3 andsoy′ isthenincreasing,thatisthe
functionisconcave up. Hence thereisapointofinflexion whenx = − 2 .Thegraph of
3
y =x 3 +2x 2 isshown inFigure 12.13.
y
Figure12.13
– 4–
3
– 2–
3
x
There isamaximum atx = − 4 ,aminimum at
3
x = 0 and apoint ofinflexion atx = − 2 3 .
FromExamples12.4 and 12.5 wenotethat:
(1) The conditiony ′′ = 0 is not sufficient to ensure a point is a point of inflexion.
The concavity of the function on either side of the point wherey ′′ = 0 must be
considered.
(2) At a pointofinflexion itis notnecessarytohavey ′ = 0.
(3) At a pointofinflexiony ′′ = 0 ory ′′ doesnotexist.
418 Chapter 12 Applications of differentiation
EXERCISES12.3
1 Locate the maximumpoints,minimum pointsand (e) y=t 5
pointsofinflexion of
(f) y=t 6
(a) y=3t 2 +6t−1
(g) y=x 4 −2x 2
(b) y=4−t−t 2
(h) z=t+ 1
(c) y= x3
3 − x2
t
2 +10
(i) y=x 5 − 5x3
(d) y= x3
3 + x2
3
2 −20x+7 (j) y =t 1/3
Solutions
1 (a) (−1,−4) minimum
(b) (−0.5,4.25) maximum
(c) (0,10) maximum,
( 1
2 , 119
12
(
(d) 4, − 131
3
(
1, 59
6
)
point ofinflexion
)
minimum,
)
minimum, (−5,77.83) maximum,
(− 1 2 ,17.08 )
point ofinflexion
(e) (0,0) point ofinflexion
(f) (0,0) minimum
(g) (0,0) maximum, (1,−1) minimum,
( ) ( )
1
(−1,−1) minimum, √ , − 5 , −√ 1 , − 5
3 9 3 9
pointsofinflexion
(h) (1,2) minimum, (−1, −2)maximum
(
(i) 1, − 2 ) (
minimum, −1, 2 )
maximum,
3 3
( ) ( )
1
(0,0), √ , − 7
2 12 √ , −√ 1 7
,
2 2 12 √ are
2
alsopointsofinflexion
(j) (0,0) point ofinflexion
TechnicalComputingExercises12.3
(a) Useatechnicalcomputing language such as
MATLAB ® to produce agraph ofy = 3t 1/5 .
(b) Fromyour graphfind the position ofany maxima,
minimaorpointsofinflexion.
12.4 THENEWTON--RAPHSONMETHODFORSOLVING
EQUATIONS
We often needtosolve equationssuchas
f(x)=2x 4 −x 3 +x 2 −10=0
f(t) =2e −3t −t 2 = 0
f(t)=t−sint=0
The Newton--Raphson technique is a method of obtaining an approximate solution, or
root, ofsuchequations.Itinvolves the use ofdifferentiation.
12.4 The Newton--Raphson method for solving equations 419
f (x)
y = f (x)
B
^
x
C A
x 2 x 1 x
Figure12.14
Thetangent at Bintersects thexaxisat C.
Suppose we wish to find a root of f (x) = 0. Figure 12.14 illustrates the curvey =
f (x). Roots of the equation f (x) = 0 correspond to where the curve cuts the x axis.
One such root is illustrated and is labelledx = ˆx. Suppose we know thatx = x 1
is an
approximate solution. Let A be the point on thexaxis wherex = x 1
and let B be the
pointonthecurvewherex =x 1
.ThetangentatBisdrawnandcutsthexaxisatCwhere
x =x 2
.Clearlyx =x 2
isabetterapproximationto ˆxthanx 1
.Wenowfindx 2
intermsof
the known value,x 1
.
Hence,
AB = distance of Babove thexaxis = f (x 1
)
CA=x 1
−x 2
gradient of lineCB = AB
CA = f(x 1 )
x 1
−x 2
ButCB isatangent tothe curve atx =x 1
and so has gradient f ′ (x 1
). Hence,
f ′ (x 1
) = f(x 1 )
x 1
−x 2
x 1
−x 2
= f(x 1 )
f ′ (x 1
)
and therefore,
x 2
=x 1
− f(x 1 )
f ′ (x 1
)
(12.7)
Equation (12.7) is known as the Newton--Raphson formula. Knowing an approximate
root of f (x) = 0, that is x 1
, the Newton--Raphson formula enables us to calculate an
improved approximate root,x 2
.
Example12.6 Given thatx 1
= 7.5 is an approximate root of e x −6x 3 = 0, use the Newton--Raphson
techniquetofind animproved value.
Solution x 1
=7.5
f(x)=e x −6x 3 f(x 1
)=−723
f ′ (x)=e x −18x 2 f ′ (x 1
) =796
420 Chapter 12 Applications of differentiation
Using the Newton--Raphson technique the value ofx 2
isfound:
x 2
=x 1
− f(x 1 )
f ′ (x 1
) =7.5−(−723) = 8.41
796
An improved estimate of the root of e x − 6x 3 = 0 isx = 8.41. To two decimal places
the trueanswer isx = 8.05.
The Newton--Raphson technique can be used repeatedly as illustrated in
Example12.7.Thisgeneratesasequenceofapproximatesolutionswhichmayconverge
tothe required root. Eachapplication ofthe methodisknown asan iteration.
Example12.7 A root of 3sinx = x is near to x = 2.5. Use two iterations of the Newton--Raphson
technique tofind a more accurateapproximation.
Solution Theequation mustfirst bewritten inthe form f (x) = 0,thatis
Then
Then
f(x)=3sinx−x=0
x 1
= 2.5
f(x)=3sinx−x f(x 1
)=−0.705
f ′ (x)=3cosx−1 f ′ (x 1
) = −3.403
x 2
=2.5− (−0.705)
(−3.403) = 2.293
The process isrepeated withx 1
= 2.293 as the initial approximation:
Then
x 1
= 2.293 f (x 1
) = −0.042 f ′ (x 1
) = −2.983
x 2
= 2.293 − (−0.042)
(−2.983) = 2.279
Using two iterations of the Newton--Raphson technique, we obtainx = 2.28 as an improved
estimateof the root.
Example12.8 An approximaterootof
Solution We have
x 3 −2x 2 −5=0
isx = 3. By using the Newton--Raphson technique repeatedly, determine the value of
the rootcorrecttotwo decimalplaces.
Hence
x 1
=3
f(x)=x 3 −2x 2 −5 f(x 1
)=4
f ′ (x)=3x 2 −4x f ′ (x 1
)=15
x 2
=3− 4
15 = 2.733
12.4 The Newton--Raphson method for solving equations 421
Animprovedestimateofthevalueoftherootis2.73(2d.p.).Themethodisusedagain,
takingx 1
= 2.73 asthe initial approximation:
x 1
= 2.73 f(x 1
) = 0.441 f ′ (x 1
) = 11.439
x 2
= 2.73 − 0.441
11.439 = 2.691
An improved estimateisx = 2.69 (2d.p.). The method isused again:
So
x 1
= 2.69 f (x 1
) = −0.007 f ′ (x 1
) = 10.948
x 2
= 2.69 − (−0.007)
10.948 = 2.691
Thereisnochangeinthevalueoftheapproximaterootandsototwodecimalplacesthe
rootof f(x) =0isx =2.69.
The calculation can be performed inMATLAB ® usingtherootsfunction:
roots([1 -2 0 -5])
which will produce the real root of 2.69, agreeing with our numerical calculation using
the Newton–Raphson method. The two complex roots of the equation will also be returned.Technicalcomputinglanguagesmakeuseofavarietyofnumericalmethodsand
to some extent the user has to take it on trust that they are correctly implemented and
testedso thatthey always produce the correctresult.
Theprevious examples illustrate the generalNewton--Raphson formula.
Ifx = x n
is an approximate root of f (x) = 0, then an improved estimate,x n+1
, is
given by
x n+1
=x n
− f(x n )
f ′ (x n
)
TheNewton--Raphsonformulaiseasytoprograminaloopstructure.Exitfromtheloop
is usually conditional upon |x n+1
−x n
| being smaller than some prescribed very small
value. This condition shows that successive approximate roots are very close to each
other.
Engineeringapplication12.3
Seriesdiode--resistorcircuit
Consider the circuit of Figure 12.15(a). A diode is in series with a resistor with resistanceR.ThevoltageacrossthediodeisdenotedbyV
andthecurrentthroughthe
diode is denoted byI. TheI--V relationship for the diode is non-linear and is given
by
I =I s
(e 40V −1)
where I s
is the reverse saturation current of the diode. Given that the supply voltage,V
s
,is2V,I s
is10 −14 AandRis22k,calculatethesteady-statevaluesofIandV.
➔
422 Chapter 12 Applications of differentiation
I
I
+
–
R
V
V s —R
V s
V s V
Point at which both
the diode and the
resistor equations
are satisfied
(a)
(b)
Figure12.15
Asimple non-linear circuit: (a)seriesdiode--resistorcircuit;(b)resistorload line
superimposed on diode characteristic.
Solution
There are several ways to solve this problem. A difficulty exists because the diode
I--V relationship is non-linear. One possibility is to draw a load line for the resistor
superimposedonthediodeI--V characteristic,asshowninFigure12.15(b).Theload
lineisanequationfortheresistorcharacteristicwrittenintermsofthevoltageacross
the diode,V, and the current through the diode,I. Itisgiven by
V s
−V=IR
I = − 1 R V +V s
R
This is a straight line with slope − 1 R and vertical intercept V s
.WhenV =0,I=
V
R
s
. This corresponds to all of the supply voltage being dropped across the resistor.
R
WhenV = V s
,I = 0. This corresponds to all of the supply voltage being dropped
across the diode. Therefore, these two limits correspond to the points within which
the circuit must operate. The solution to the circuit can be obtained by determining
the intercept of the diode characteristic and the load line. This is possible because
both the resistor characteristic and diode characteristic are formulated in terms ofV
andI,andsoanysolutionmusthavethesamevaluesofI andV forbothcomponents.
If an accurate graph is used, it is possible to obtain a reasonably good solution. An
alternative approach is to use the Newton--Raphson technique. Combining the two
component equations gives
−V +V s
=RI s
(e 40V −1)
NowR =2.2×10 4 ,I s
=10 −14 ,V s
=2andso
−V +2 = 2.2 ×10 4 ×10 −14 (e 40V −1)
Now, define f (V) by
f(V)=2.2×10 −10 (e 40V −1)+V −2
We wishtosolve f (V) = 0.We have
f ′ (V ) = 2.2 ×10 −10 ×40e 40V +1 = 8.8 ×10 −9 e 40V +1
Choose aninitial guess ofV 1
= 0.5:
V 2
=V 1
− f(V 1 )
f ′ (V 1
) =0.5−2.2 ×10−10 (e 20 −1) +0.5 −2
= 0.7644
8.8 ×10 −9 e 20 +1
12.5 Differentiation of vectors 423
Withanequationofthiscomplexity,itisbettertouseacomputeroraprogrammable
calculator. Doing so gives
V 5
= 0.6895,...,V 10
= 0.5770,...,V 14
= 0.5650
which is accurate tofourdecimal places.
Itisusefultocheckthesolutionbyindependentlycalculatingthecurrentthrough
the diode using the two different expressions. So,
I = 10 −14 (e 40×0.5650 −1) = 6.53 ×10 −5 A
I = − 0.5650
2.2 ×10 + 2
4 2.2 ×10 = 6.53 4 ×10−5 A
and therefore the solution iscorrect.
EXERCISES12.4
1 Usethe Newton--Raphson technique to findthe value
ofarootofthe following equations correctto two
decimalplaces. An approximateroot,x 1 ,isgiven in
eachcase.
(a)2cosx=x 2 x 1 =0.8
(b)3x 3 −4x 2 +2x−9=0 x 1 =2
(c)e x/2 −5x=0 x 1 =6
(d)lnx= 1 x
x 1 = 1.6
(e)sinx + 2x
π =1 x 1 =0.6
2 Explain circumstances in whichthe Newton--Raphson
technique mayfail to converge to arootof f (x) = 0.
Solutions
1 (a) 1.02 (b) 1.85 (c) 7.15 (d) 1.76 (e) 0.64
12.5 DIFFERENTIATIONOFVECTORS
Consider Figure 12.16. If r represents the position vector of an object and that object
moves along a curve C, then the position vector will be dependent upon the time,t. We
writer = r(t)toshowthedependenceupontime.Supposethattheobjectisatthepoint
P with position vector r at timet and at the point Q with position vector r(t + δt) at
the later timet + δt as shown in Figure 12.17. Then PQ ⃗ represents the displacement
vectoroftheobjectduringtheintervaloftime δt.Thelengthofthedisplacementvector
represents the distance travelled while its direction gives the direction of motion. The
average velocity during the time fromt tot + δt is the displacement vector divided by
the time interval δt, thatis
PQ
average velocity = ⃗ r(t +δt)−r(t)
=
δt δt
The instantaneous velocity,v, isgiven by
r(t +δt)−r(t)
v=lim
= dr
δt→0 δt dt
424 Chapter 12 Applications of differentiation
y
r
P
C
r(t)
P
PQ
Q
Figure12.16
Position vector ofapoint Pona
curve C.
x
r(t + dt)
Figure12.17
VectorPQrepresents ⃗ the displacement of
the object during the time interval δt.
Now,sincethexandycoordinatesoftheobjectdependuponthetime,wecanwritethe
position vectorras
r(t) =x(t)i +y(t)j
Therefore,
so that
r(t +δt) =x(t +δt)i+y(t +δt)j
x(t +δt)i+y(t +δt)j−x(t)i−y(t)j
v(t) = lim
δt→0 δt
{ }
x(t +δt)−x(t) y(t +δt)−y(t)
= lim i + j
δt→0 δt
δt
= dx
dt i + dy
dt j
often abbreviated to v =ṙ =ẋi+ẏj. Recall the dot notation for derivatives w.r.t.
time which is commonly used when differentiating vectors. So the velocity vector is
the derivative of the position vector with respect to time. This result generalizes in an
obvious way tothree dimensions. If
then
r(t) =x(t)i +y(t)j +z(t)k
ṙ(t) =ẋ(t)i+ẏ(t)j+ż(t)k
The magnitude of the velocity vector gives the speed of the object. We can define the
acceleration inasimilarway:
a = dv
dt = d2 r
= ¨r =ẍi+ÿj+¨zk
dt2 Inmoregeneralsituations,wewillnotbedealingwithpositionvectorsbutotherphysical
quantities such as time-dependent electricor magnetic fields.
Example12.9 Ifa = 3t 2 i +cos2tj, find
(a) da
(b)
da
dt ∣dt
∣
(c) d2 a
dt 2
Solution (a) Ifa = 3t 2 i +cos2tj, then differentiation with respect tot yields
(b)
(c)
da
dt =6ti−2sin2tj
da
∣dt
∣ = √ (6t) 2 + (−2sin2t) 2 =
d 2 a
dt 2 =6i−4cos2tj
√
36t 2 +4sin 2 2t
12.5 Differentiation of vectors 425
It is possible to differentiate more complicated expressions involving vectors provided
certain rules are adhered to. If a and b are vectors andcis a scalar, all functions
oftimet, then
d
(ca) =cda
dt dt + dc
dt a
d
dt (a+b)=da dt + db
dt
d
(a·b)=a·db
dt dt + da
dt ·b
d
dt (a×b)=a×db dt + da
dt ×b
Example12.10 Ifa = 3ti −t 2 jandb=2t 2 i +3j,verify
d
(a) (a·b)=a·db
dt dt + da
dt ·b (b) d
dt (a×b)=a×db dt + da
dt ×b
Solution (a) a·b = (3ti −t 2 j)· (2t 2 i +3j)=6t 3 −3t 2
Also
So,
d
dt (a·b)=18t2 −6t
da
dt =3i−2tj
a· db
dt
+b· da
dt
db
dt
=4ti
=(3ti−t 2 j)·(4ti) + (2t 2 i+3j)·(3i−2tj)
= 12t 2 +6t 2 −6t=18t 2 −6t
We have verified d (a·b)=a·db
dt dt + da
dt ·b.
i j k
(b) a×b=
3t −t 2 0
∣2t 2 3 0∣
=(9t+2t 4 )k
d
dt (a×b)=(9+8t3 )k
426 Chapter 12 Applications of differentiation
Also,
a × db
dt
i j k
=
3t −t 2 0
∣4t 0 0∣
= 4t 3 k
∣ ∣∣∣∣∣ i j k
da
dt ×b = 3 −2t 0
2t 2 3 0∣
=(9+4t 3 )k
and so
a × db
dt + da
dt ×b=4t3 k +(9+4t 3 )k=(9+8t 3 )k = d dt (a×b)
as required.
EXERCISES12.5
1 Ifr =3ti+2t 2 j+t 3 k,find
dr d 2 r
(a) (b)
dt dt 2
2 GivenB =te −t i+costjfind
dB d 2 B
(a) (b)
dt dt 2
3 Ifr =4t 2 i+2tj−7kevaluaterand dr
dt
4 Ifa=t 3 i−7tk,andb=(2+t)i+t 2 j−2k,
(a) finda ·b
(b) find da
dt
whent = 1.
(c) find db
dt
(d) showthat d dt (a·b)=a·db
dt + da
dt ·b.
5 Givenr =sinti+costj,find
(a)r (b)r (c) |r|
Show thatthe position vectorandvelocityvector are
perpendicular.
6 Show r = 3e −t i + (2 +t)jsatisfies
¨r +ṙ =j
7 Givena=t 2 i−(4−t)j,b=i+tjshow
( ) ( )
d
(a)
dt (a×b)= a × db da
+
dt dt ×b
(b)
d
(a·b)=a·db da
+b ·
dt dt dt
Solutions
1 (a) 3i+4tj+3t 2 k
(b) 4j+6tk
2 (a) (−te −t +e −t )i −sintj
(b) e −t (t −2)i −costj
3 4i+2j−7k,8i+2j
4 (a) t(t 3 +2t 2 +14) (b) 3t 2 i−7k
(c) i+2tj
5 (a) costi−sintj (b) −sinti−costj (c) 1
Review exercises 12 427
REVIEWEXERCISES12
1 Determine the position ofall maximum points,
minimumpointsand pointsofinflexion of
(a)y=2t 3 −21t 2 +60t+9
(b)y =t(t 2 −1)
2 InSection 9.8 weshowed that the impedance ofan
LCRcircuitcan be written as
( )
Z=R+j ωL − 1
ωC
(a) Find |Z|.
(b) Foragiven circuit,R,L andC are constants,and
ω can be varied. Find d|Z|
dω .
(c) Forwhat value of ω will |Z| have amaximum or
minimum value? Does thisvalue give a
maximum orminimum value of |Z|?
3 Usetwo iterationsofthe Newton--Raphson technique
to findan improved estimateofthe rootof
t 3 =e t
givent = 1.8 is an approximate root.
4 Given
a=(t 2 +1)i−j+tk
b=2tj−k
find
da
(a)
dt
(c)
d
dt (a·b)
(b)
db
dt
(d) d
dt (a×b)
5 Use two iterationsofthe Newton--Raphson method
to find animproved estimateofthe rootof
sint=1− t ,0 t π,givent=0.7isan
2
approximate root.
6 Determine the position ofallmaximumpoints,
minimum pointsandpointsofinflexion of
(a) y = e −x2
(b) y =t 3 e −t
(c) y=x 3 −3x 2 +3x−1
(d) y=e x +e −x
(e) y=|t|−t 2
Solutions
( 7
1 (a) (2,61) maximum, (5,34) minimum,
2 , 95 )
2
point ofinflexion
( ) ( )
1
(b) √ ,− 2
3 3 √ minimum, −√ 1 2
,
3 3 3 √ 3
maximum, (0,0) point ofinflexion
√
2 (a) R 2 +ω 2 L 2 − 2L
C + 1
ω 2 C 2
(b)
ωL 2 −1/ω 3 C 2
√
R 2 + ω 2 L 2 −2L/C +1/(ω 2 C 2 )
(c) ω = 1 √
LC
producesaminimumvalue ofZ
3 1.859,1.857
4 (a) 2ti+k
(b) 2j
(c) −3
(d) −4ti+2tj+ (6t 2 +2)k
5 0.705, 0.705
6 (a) (0,1)is amaximum
√
2
Points ofinflexion whenx = ±
2
(b) (0,0)point ofinflexion, (3,1.34) maximum.
Further pointsofinflexion whent = 4.73,1.27
(c) (1,0)point ofinflexion
(d) (0,2)minimum
(e) (0.5,0.25) maximum, (−0.5,0.25) maximum,
minimum at (0,0)(y ′ doesnot existhere)
13 Integration
Contents 13.1 Introduction 428
13.2 Elementaryintegration 429
13.3 Definiteandindefiniteintegrals 442
Reviewexercises13 453
13.1 INTRODUCTION
When a function, f (x), is known we can differentiate it to obtain the derivative, df
dx .
Thereverseprocessistoobtainthefunction f (x)fromknowledgeofitsderivative.This
process iscalledintegration. Thus,integration isthe reverse of differentiation.
A problem related to integration is to find the area between a curve and thexaxis.
At first sight it may not be clear that the calculation of area is connected to integration.
This chapter aims to explain the connection. An area can have various interpretations.
For example, the area under a graph of power used by a motor plotted against time
representsthetotalenergyusedbythemotorinaparticulartimeperiod.Theareaunder
a graph of current flow into a capacitor against time represents the total charge stored
by the capacitor.
Circuits to carry out integration are used extensively in electronics. For example, a
circuittodisplaythetotaldistancetravelledbyacarmayhaveaspeedsignalasinputand
may integrate this signal to give the distance travelled as output. Integrator circuits are
widely used in analogue computers. These computers can be used to model a physical
system and observe its response to a range of inputs. The advantage of this approach is
thatthesystemparameterscanbevariedinordertoseewhateffecttheyhaveonsystem
performance.Thisavoidstheneedtobuildtheactualsystemandallowsdesignideasto
be explored relatively quickly and cheaply by an engineer.
13.2 Elementary integration 429
13.2 ELEMENTARYINTEGRATION
Consider the following problem: given dy = 2x, findy(x). Differentiation of the functiony(x)
= x 2 +c, wherecis a constant, yields dy = 2x for anyc. Thereforey(x) =
dx
dx
x 2 +cisasolutiontotheproblem.Asccanbeanyconstant,thereareaninfinitenumber
of different solutions. The constantcis known as aconstant of integration. Inthis example,thefunctionyhasbeenfoundfromaknowledgeofitsderivative.
Wesay2xhas
beenintegrated,yieldingx 2 +c.Toindicatetheprocessofintegrationthesymbols ∫ and
dx are used. The ∫ sign denotes that integration is to be performed and the dx indicates
thatxisthe independent variable. Returning tothe previous problem, we write
dy
dx = 2x
∫
y = 2xdx=x 2 +c
↑ ↑ ↑
symbols for
integration
constant of integration
Ingeneral, if
dy
dx =f(x)
then
∫
y = f(x)dx
Consider a simple example.
Example13.1 Given dy
dx =cosx−x,findy.
Solution We need to find a function which, when differentiated, yields cosx −x. Differentiating
sinx yields cosx, while differentiating −x 2 /2 yields −x. Hence,
∫
y =
(cosx−x)dx=sinx− x2
2 +c
wherecis the constant of integration. Usually brackets are not used and the integral is
written simplyas ∫ cosx−xdx.
Thefunctiontobeintegratedisknownastheintegrand.InExample13.1theintegrand
iscosx−x.
430 Chapter 13 Integration
Example13.2 Find d dx
( x
n+1
n +1 +c )
and hence deduce that ∫ x n dx= xn+1
n +1 +c.
Solution From Table 10.1 wefind
( )
d x
n+1
dx n +1 +c = d ( ) x
n+1
+ d dx n +1 dx (c) usingthe linearity
of differentiation
= 1 d
n +1dx (xn+1 ) + d dx (c)
= 1
n +1 {(n +1)xn } +0
=x n
again usingthe
linearityof differentiation
usingTable10.1
Consequently, reversing the process wefind
∫
x n dx= xn+1
n +1 +c
as required. Note that this result is invalid if n = −1 and so this result could not be
applied tothe integral ∫ (1/x)dx.
Table 13.1 lists several common functions and their integrals. Although the variable
x is used throughout Table 13.1, we can use this table to integrate functions of other
variables, forexamplet andz.
Example13.3 UseTable 13.1 tointegratethe following functions:
(a) x 4
(b) coskx, wherekisaconstant
(c) sin(3x +2)
(d) 5.9
(e) tan(6t −4)
(f) e −3z
1
(g)
x 2
(h) cos100nπt, wherenisaconstant
Solution (a) From Table 13.1, we find ∫ x n dx= xn+1
n +1 +c,n ≠ −1. To find ∫ x 4 dxletn=4;
weobtain
∫
x 4 dx= x5
5 +c
(b) FromTable 13.1, wefind ∫ cos(ax)dx = sin(ax) +c. Inthiscasea =kand so
a
∫
coskxdx = sinkx +c
k
13.2 Elementary integration 431
Table13.1
Theintegrals ofsome common functions.
f(x)
∫
f(x)dx
f(x)
∫
f(x)dx
k, constant kx +c
x n x n+1
+c n≠−1
n +1
x −1 = 1 x
ln|x|+c
e x e x +c
e −x −e −x +c
e ax
sinx
sinax
sin(ax +b)
cosx
cosax
e ax
a +c
−cosx+c
−cosax
+c
a
−cos(ax +b)
+c
a
sinx+c
sinax
a
+c
cos(ax +b)
tanx
tanax
tan(ax +b)
cosec(ax +b)
sec(ax +b)
sin(ax +b)
+c
a
ln|secx|+c
ln|secax|
+c
a
ln|sec(ax +b)|
+c
a
1
{ln|cosec(ax +b)
a
−cot(ax +b)|} +c
1
{ln|sec(ax +b)
a
+tan(ax +b)|} +c
cot(ax +b)
1
{ln|sin(ax +b)|} +c
a
1
√
a2 −x 2 sin −1 x a +c
1
a 2 +x 2 1
a tan−1 x a +c
Notethata,b,nandcareconstants.Whenintegratingtrigonometricfunctions,
anglesmustbeinradians.
(c) FromTable13.1,wefind ∫ −cos(ax +b)
sin(ax+b)dx =
a
andb= 2,and so
∫
−cos(3x +2)
sin(3x +2)dx = +c
3
+c.Inthiscasea = 3
(d) From Table 13.1, we find thatifkisaconstant then ∫ k dx =kx +c.Hence,
∫
5.9 dx = 5.9x +c
(e) In this example, the independent variable ist but nevertheless from Table 13.1 we
can deduce
∫
ln|sec(at +b)|
tan(at +b)dt = +c
a
Hence witha = 6 andb=−4,we obtain
∫
ln|sec(6t −4)|
tan(6t −4)dt = +c
6
432 Chapter 13 Integration
(f) TheindependentvariableiszbutfromTable13.1wecandeduce ∫ e az dz = eaz
a +c.
Hence, takinga = −3 we obtain
∫
e −3z dz = e−3z
−3 +c=−e−3z 3 +c
(g) Since 1 x 2 =x−2 ,wefind
∫ 1
x 2 dx = ∫
x −2 dx = x−1
−1 +c=−1 x +c
(h) Whenintegratingcos100nπt withrespecttot,notethat100nπisaconstant.Hence,
usingpart (b)wefind
∫
cos100nπt dt = sin100nπt
100nπ +c
13.2.1 Integrationasalinearoperator
Integration, like differentiation, is a linear operator. If f and g are two functions of x,
then
∫ ∫ ∫
f+gdx= fdx+ gdx
This states that the integral of a sum of functions is the sum of the integrals of the individual
functions.IfAisaconstant and f a functionofx, then
∫
∫
Afdx=A
f dx
Thus,constantfactors can betakenthroughthe integral sign.
IfAandBareconstants,and f andgarefunctions ofx, then
∫
∫
Af+Bgdx=A
∫
fdx+B
gdx
These three properties are all consequences of the fact that integration is a linear operator.
Note that the first two are special cases of the third. The properties are used in
Example 13.4.
Example13.4 Use Table 13.1 and the properties of a linear operator to integrate the following
expressions:
(a) x 2 +9 (d) (t +2) 2 (g) 3sin4t
(b) 3t 4 − √ t
(c)
1
x
(e)
(f) 4e 2z
1
+z (h) 4cos(9x +2)
z
(i) 3e 2z
Solution (a)
sinx+cosx
(j)
2
(k) 2t −e t
∫
∫
x 2 +9dx=
∫
x 2 dx+
( ) z −1
(l) tan
2
(m) e t +e −t
9 dx usinglinearity
13.2 Elementary integration 433
(n) 3sec(4x −1)
(o) 2cot9x
(p) 7cosec(π/3)
(b)
+9x +c usingTable 13.1
3
Note thatonly a single constant of integration isrequired.
∫
3t 4 − √ ∫ ∫
tdt=3 t 4 dt − t 1/2 dt using linearity
= x3
( ) t
5
=3 − t3/2
+c using Table 13.1
5 3/2
= 3t5
5 − 2t3/2
+c
3
(c)
∫ 1
dx = ln |x| +c using Table 13.1.
x
Sometimesitisconvenienttousethelawsoflogarithmstorewriteanswersinvolving
logarithms. For example, we can write ln|x| +c as ln|x| +ln|A| wherec = ln |A|.
This enables us towritethe integralas
∫ 1
dx = ln|Ax|
x
(d)
∫
∫
(t+2) 2 dt=
t 2 +4t+4dt= t3 3 +2t2 +4t+c
(e)
(f)
(g)
(h)
(i)
(j)
(k)
(l)
∫ 1
z +zdz=ln|z|+z2 2 +c
∫
4e 2z dz = 4e2z
2 +c=2e2z +c
∫
3sin(4t)dt = − 3cos4t +c
4
∫
4sin(9x +2)
4cos(9x +2)dx = +c
9
∫
3e 2z dz = 3e2z
2 +c
∫ sinx+cosx
dx = −cosx+sinx +c
2 2
∫
2t−e t dt=t 2 −e t +c
∫ ( )
( )∣ z −1
tan dz=2ln
z −1 ∣∣∣
2 ∣ sec +c
2
434 Chapter 13 Integration
∫
(m)
∫
(n)
∫
(o)
∫
(p)
e t +e −t dt=e t −e −t +c
3sec(4x −1)dx = 3 ln|sec(4x −1) +tan(4x −1)| +c
4
2cot9xdx= 2 9 ln|sin9x|+c
7cosec(π/3) dx = {7cosec(π/3)}x +c
as cosec(π/3) isaconstant
Engineeringapplication13.1
Distancetravelledbyaparticle
The speed, v, of a particle is the rate of change of distance,s, with respect to timet,
thatis v = ds .Thespeedattimet isgivenby3+2t.ThisisillustratedinFigure13.1.
dt
Note that the speed of the particle is increasing linearly with time. Find the distance
intermsoft.
Solution
We aregiven that
v = ds
dt =3+2t
and arerequired tofinds. Therefore,
∫
s = 3+2tdt=3t+t 2 +c
Notethatthespeedoftheparticleismodelledbyalinearexpressionint.Thismeans
thatthespeedincreasesbythesameamountineachsubsequentsecond.Ontheother
hand,thedistance,s,ismodelledbyaquadraticint.Therefore,thedistancetravelled
ineachsubsequentsecondincreases.Figure13.1illustratesthespeed-timegraphand
Figure 13.2 illustratesthe distance-time graph.
y
s
3
c
O
Figure13.1
Graphofspeed ofthe particle, v,against
time,t.
t
O
Figure13.2
Graphofdistancetravelled by the
particle,s,against time,t.
t
13.2 Elementary integration 435
Engineeringapplication13.2
Voltageacrossacapacitor
Recall from Engineering application 10.2 that the current, i, through a capacitor
depends upon time,t, and isgiven by
i =C dv
dt
where v is the voltage across the capacitor andC is the capacitance of the capacitor.
Derive an expression for v.
Solution
If
i =C dv then
dt
Therefore,
∫ i
v =
C dt = 1 ∫
C
i dt
dv
dt
= i C
usinglinearity
Notethatwhereasthecapacitance,C,isconstant,thecurrent,i,isnotandsoitcannot
betakenthroughtheintegralsign.Inordertoperformtheintegrationweneedtoknow
i as a function oft.
13.2.2 Electronicintegrators
Often there is a requirement in engineering to integrate electronic signals. Various circuits
are available to carry out this task. One of the simplest circuits is shown in
Figure 13.3. The input voltage is v i
, the output voltage is v o
, the voltage drop across
theresistoris v R
andthecurrentflowinginthecircuitisi.ApplyingKirchhoff’svoltage
lawyields
v i
=v R
+v o
(13.1)
For the resistor with resistance,R,
v R
=iR (13.2)
For the capacitorwith capacitance,C,
i =C dv o
dt
Combining Equations (13.1) to(13.3) yields
(13.3)
v i
=RC dv o
+ v
dt o
(13.4)
In general, v i
will be a time-varying signal consisting of a range of frequencies. For
the case where v i
is sinusoidal we can specify a property of the capacitor known as the
capacitive reactance,X c
, given by
X c
= 1
2πfC
436 Chapter 13 Integration
C
R
i
R 1 i 1 i
y f
1
R 2 i y2
2
–
R
i A
y 3 i
X 3 3
+
y i
y R
C
y o
y o
Figure13.3
Simple integrator.
Figure13.4
Summingintegrator usingan operational
amplifier(op amp).
where f =frequencyofthesignal(Hz).ItcanbeseenthatX c
decreaseswithincreasing
frequency, f. For frequencies whereX c
is small compared withR, most of the voltage
droptakesplaceacrosstheresistor.Inotherwords, v o
issmallcomparedwith v R
.ExaminingEquation(13.1)forthecasewhenX
c
isverymuchlessthanR(writtenasX c
≪R),
and v o
≪ v R
, itcan be seen thatEquation (13.4) simplifies to
v i
=RC dv o
dt
(13.5)
Thisequationisonlyvalidfortherangeoffrequencies forwhichX c
≪R.Rearranging
Equation (13.5) yields
dv o
dt
= v i
RC
v o
= 1
RC
∫
v i
dt
The output voltage from the circuit is an integrated version of the input voltage with a
1
scaling factor
RC .
Anelectronicintegratorwhichperformsbettercanbemadefromanoperationalamplifier.
The circuit for an operational amplifier integrator is given in Figure 13.4. The
main advantage of this circuit is the low output impedance and high input impedance,
makingitusefulforelectroniccontrolapplicationsandanaloguesignalprocessing.The
function of the operational amplifier is to amplify the potential difference between the
invertingandnon-invertinginputs.Thesearelabelled −and +respectivelyinthecircuit
diagram.Usuallythegainoftheamplifierisextremelyhigh,soevenasmallvoltagedifferencebetweenthetwoinputswillgiveaverylargeoutputvoltage,whichislimitedby
the voltage supply attached to the device. Notice in this circuit that there is a capacitor
connectedfromtheoutputbacktotheinput.Thiscapacitorprovidesnegativefeedback.
This means that a proportion of the output voltage is fed back to the input and this in
turnservestoreducetheoutput.Asaconsequenceofthistheoverallgainofthecircuit
is limited and the amplifier reaches an equilibrium state where the voltage at point X is
the same as at the non-inverting input, which is connected to earth. For this reason the
point Xinthe circuitis sometimes referredtoas avirtual earth.
Assuming point Xisatzero volts, and using Ohm’s law, gives
i 1
= v 1
R 1
i 2
= v 2
R 2
i 3
= v 3
R 3
(13.6)
13.2 Elementary integration 437
4.00
3.00
2.00
1.00
y 0
0
–1.00
–2.00
–3.00
–4.00
290 291 292 293 294 295 296 297
t (ms)
y 1
298 299 300
Figure13.5
Output froman operational
amplifier integrator circuit with
v 1 being asquare wave input
andv 2 =v 3 =0.
Assumingi A
isnegligible, then
i f
=i 1
+i 2
+i 3
(13.7)
For the capacitor,
i f
=−C dv o
dt
(13.8)
Thenegativesignisaresultofthedirectionchosenfori f
.CombiningEquations(13.6)--
(13.8) yields
v 1
+ v 2
+ v 3
= −C dv o
R 1
R 2
R 3
dt
∫
v
v o
=− 1
R 1
C + v 2
R 2
C + v 3
R 3
C dt
The circuit therefore acts as an integrator. The minus sign in the integration is a result
of the circuit design which isknown as aninverting circuit. Atypical output from this
circuitisgiven inFigure 13.5.
13.2.3 Integrationoftrigonometricfunctions
The trigonometric identities given in Table 3.1 together with Table 13.1 allow us to
integrate a number of trigonometric functions.
Example13.5 Evaluate
(a) ∫ cos 2 t dt
(b) ∫ sin 2 t dt
438 Chapter 13 Integration
Solution Powersoftrigonometricfunctions,forexamplesin 2 t,donotappearinthetableofstandardintegrals.Whatwemustattempttodoisrewritetheintegrand
toobtainastandard
form.
(a) FromTable 3.1
cos 2 t = 1+cos2t
2
(b)
and so
∫ ∫ 1+cos2t
cos 2 t dt = dt
2
∫ ∫ 1 cos2t
=
2 dt + dt
2
= t 2 + sin2t +c
4
∫ ∫
sin 2 t dt = 1 −cos 2 t dt using the trigonometric identities
∫
=
∫
1dt−
cos 2 t dt
( t
= t −
2 + sin2t )
+c
4
= t 2 − sin2t
4
+k
using linearity
using part(a)
Example13.6 Find
(a) ∫ sin2tcost dt
(b) ∫ sinmtsinnt dt, wheremandnareconstants withm ≠n
Solution (a) Usingthe identitiesinTable3.1 wefind
2sinAcosB =sin(A +B) +sin(A −B)
hence sin2tcost can bewritten 1 (sin3t +sint). Therefore,
2
∫
∫ 1
sin2tcost dt = (sin3t +sint)dt
2
= 1 ( )
−cos3t
−cost +c
2 3
= − 1 6 cos3t − 1 2 cost+c
(b) Usingthe identity2sinAsinB = cos(A −B) −cos(A +B), we find
sinmtsinnt = 1 {cos(m −n)t −cos(m +n)t}
2
Therefore,
∫
∫ 1
sinmtsinnt dt = {cos(m −n)t −cos(m +n)t}dt
2
= 1 { }
sin(m −n)t sin(m +n)t
− +c
2 m −n m +n
13.2 Elementary integration 439
EXERCISES13.2
1 Integratethe following expressions usingTable 13.1:
1 1
(e) √ (f)
(a) x 10 (b) 9 (c) x 1.5
0.25 −x 2 0.01 + v 2
(d) √ 1 1
1
(g)
t (e) (f) −3.2
10 6 +r 2 (h)
10+t 2
z
1 1
1 1
(g)
x 3 (h) √ (i) (x 2 ) 3
(i) √ (j)
t 2−x 2 1
(j) x −2 (k) t 1/3
9 +x2
5 Integratethe followingexpressions:
2 Integratethe following expressions usingTable 13.1:
(a) e 5x (b) e 6x (c) e −3t (a)3+x+ 1 x
1
(d)
e x (e) e 0.5t (f) e z/3 (b)e 2x −e −2x
1
(g)
e 4t (h) e −2.5x
(c)2sin3x +cos3x
( )
t
(d)sec(2t + π) +cot
3 Integratethe following expressions usingTable 13.1:
2 − π
( )
(a) sin4x (b) sin9t
( )
( )
t
(e)tan +cosec(3t − π)
x 2t
2
(c) sin (d) sin
2 5
(e) cos7x (f) cos(−3x)
(f) sinx + x 3 + 1 e
( )
x
5t
1
(g) cos (h) tan9x
(g)
3
cos(3x)
( ) 2
(i) cosec2x (j) sec5t
(h) t + 1 (k) cot8y (l) cos(5t +1)
t
(m) tan(3x +4) (n) sin(3t − π)
1
( ) (i)
x
3e 2x
(o) cosec(5z +2) (p) sec
2 +1 (j) tan(4t −3) +2sin(−t −1)
( )
2t
(k)1+2cot3x
(q) sin
3 −1 (r) cot(5 −x)
( ) ( )
t t
(l) sin −3cos
(s) cosec(π −2x)
2 2
4 UseTable 13.1 to integrate the following expressions: (m)(t −2) 2
1 1
(n)3e −t −e −t/2
(a)
1+x 2 (b) √
1−x 2
(o)7 −7x 6 +e −x
1 1
(c) √ (d)
(p) (k +t) 2 k constant
4−x 2 9+z 2 (q)ksint −coskt k constant
440 Chapter 13 Integration
1
(r)
25+t 2
1
(s) √
25−t 2
(t)
6
1+x 2 + 1+x2
6
6 Theacceleration,a, ofaparticleis the rate ofchange
ofspeed, v,withrespecttotimet,thatisa = dv
dt .The
speed ofthe particle is the rate ofchange of
distance,s,that is v = ds .Ifthe acceleration isgiven
dt
by1+ t ,find expressionsforspeed anddistance.
2
7 Thespeed, v,ofaparticlevaries with time according
to
v(t) =2−e −t
(a) Obtainan expression forthe distancetravelled by
the particle.
(b) Calculate the distancetravelled bythe particle
betweent = 0 andt = 3.
8 Bywritingsinhax andcoshax in terms ofthe
exponential functionfind
∫
(a) sinhax dx
∫
(b) coshax dx
(c) Useyour results from (a)and(b)to find
∫
3sinh2x +cosh4x dx.
9 Acapacitor ofcapacitance10 −2 Fhas acurrenti(t)
through itwhere
i(t) =10−e −t
Find an expression forthe voltageacross the
capacitor.
10 Integratethe following:
1 1
(a)
x 2 (b)
+4 2x 2 +4
3
(c)
2x 2 +1
2
(e) √
4−x 2
1
(g) √
1 − (x 2 /2)
(d)
(f)
1
√
9−x 2
−7
√
2−3x 2
11 Bywriting 3 +x in the form 3 x x +1,
∫ 3 +x
find dx.
x
12 (a) Express
x 2 +2x+1
x(x 2 +1)
asitspartial fractions.
∫ x 2 +2x+1
(b) Hence find
x(x 2 dx.
+1)
13 Thevelocity, v,ofaparticleisgiven by
v=2+e −t/2
(a) Given distance,s,and v are relatedby ds
dt = v
findan expression fordistance.
(b) Acceleration isthe rate ofchange ofvelocity
with respect tot.Determine the acceleration.
14 (a) Usethe product ruleofdifferentiationto verify
d
dx (xe2x ) = e 2x +2xe 2x
(b) Hence show
∫
xe 2x dx = xe2x
2 − e2x
4 +c
15 Integrate
2−t 2
(a)
t 3
(c)
(e)
cos4x
sin4x
4+e 2t
(b)
e 3t
1
(d)
2sin3x
2+x 2
1+x 2 (f) sin 2 t +cos 2 t
Solutions
x 11
1 (a) +c (b) 9x+c
11
2 2
(c)
5 x2.5 +c (d)
3 t1.5 +c
(e) ln |z| +c (f) −3.2x +c
(g) − 1
2x 2 +c (h) 2√ t +c
x 7
(i)
7 +c (j) −1 x +c
3
(k)
4 t4/3 +c
13.2 Elementary integration 441
2 (a)
e 5x
5 +c (b) e 6x
6 +c
(c) − e−3t
3 +c (d) −e−x +c
(e) 2e 0.5t +c (f) 3e z/3 +c
(g) − e−4t
4
3 (a) − cos4x +c
4
(b) − cos9t +c
9 ( )
x
(c) −2cos +c
2
(d) − 5 2 cos (
(e)
sin7x
7
+c (h) −2e−2.5x
5
+c
2t
5
)
+c
(f) − 1 sin(−3x) +c
3 ( )
3
(g)
5 sin 5t
+c
3
ln|sec9x|
(h) +c
9
1
(i) (ln|cosec2x −cot2x|) +c
2
1
(j) (ln|sec5t +tan5t|)+c
5
1
(k)
8 ln|sin8y|+c
1
(l) sin(5t +1) +c
5
(m) 1 ln|sec(3x +4)| +c
3
(n) − 1 cos(3t −π)+c
3
1
5
+c
(o) ln|cosec(5z +2) −cot(5z +2)| +c
( ) ( )∣ (p) 2ln
∣ sec x
2 +1 x ∣∣∣∣
+tan
2 +1 +c
( )
(q) − 3 2 cos 2t
3 −1 +c
(r) −ln|sin(5 −x)| +c
(s) − 1 ln|cosec(π −2x) −cot(π −2x)| +c
2
4 (a) tan −1 x +c
(b) sin −1 x +c
( )
(c) sin −1 x
+c
2
(d)
( )
1 z
3 tan−1 +c
3
(e) sin −1 (2x) +c
(f) 10tan −1 (10v) +c
( )
(g) 10 −3 tan −1 x
10 3 +c
(h)
( )
1
√ tan −1 t
√ +c
10 10
( )
(i) sin −1 x
√ +c
2
(j) 3tan −1 (3x) +c
5 (a) 3x+ x2
2 +ln|x|+c
(b) e2x
2 + e−2x +c
2
(c) − 2 sin3x
cos(3x) + +c
3 3
(d) 0.5ln|sec(2t + π) +tan(2t + π)|
( )∣ +2ln
∣ sin t ∣∣∣∣
2 − π +c
( )∣ (e) 2ln
∣ sec t ∣∣∣∣
+ 1 ln|cosec(3t − π)
2 3
−cot(3t −π)|+c
(f) −cosx+ x2
6 −e−x +c
(g) 1 3 ln|sec3x+tan3x|+c
(h) t3 3 +2t−1 t +c
(i) − e−2x +c
6
1
(j) ln|sec(4t −3)| +2cos(−t −1) +c
4
(k)x+ 2 3 ln|sin3x|+c
( ) ( )
t t
(l) −2cos −6sin +c
2 2
(m) t3 3 −2t2 +4t+c
(n) −3e −t +2e −t/2 +c
(o)7x−x 7 −e −x +c
(p) k 2 t +kt 2 + t3 3 +c
(q) −kcost − sinkt
k
+c
442 Chapter 13 Integration
( )
1 t
(r)
5 tan−1 +c
5
( )
(s) sin −1 t
+c
5
(t) 6tan −1 x + x 6 + x3
18 +c
6 Speed:t + t2 4 +c,distance: t2 2 + t3
12 +ct+d
7 (a) 2t +e −t +c (b) 5.0498
8 (a) coshax +c
a
(b) sinhax +c
a
(c) 3 2 cosh2x + 1 sinh4x +c
4
9 100(10t +e −t ) +c
( )
1 x
10 (a)
2 tan−1 +c
2
( )
1
(b)
2 √ x
2 tan−1 √ +c
2
(c)
3
√
2
tan −1 ( √ 2x)+c
( )
(d) sin −1 x
+c
3
( )
(e) 2sin −1 x
+c
2
(√ )
−7 3
(f) √ sin −1 √ x +c
3 2
(g) √ ( )
2sin −1 x
√ +c
2
11 3ln|x|+x+c
12 (a)
1
x + 2
x 2 +1
(b) ln|x| +2tan −1 x +c
13 (a) 2t −2e −t/2 +c (b) − 1 2 e−t/2
15 (a) −t −2 −ln|t|+c
(b) − 4 3 e−3t −e −t +c
1
(c)
4 ln|sin4x|+c
1
(d) ln|cosec3x −cot3x| +c
6
(e) x+tan −1 x+c
(f) t+c
13.3 DEFINITEANDINDEFINITEINTEGRALS
Alltheintegrationsolutionssofarencounteredhavecontainedaconstantofintegration.
Such integrals are known as indefinite integrals. Integration can be used to determine
the area under curves and this gives rise todefinite integrals.
To estimate the area under y(x), it is divided into thin rectangles. The sum of the
rectangularareasisanapproximationtotheareaunderthecurve.Severalthinrectangles
will give a better approximationthanafewwide ones.
ConsiderFigure13.6wheretheareaisapproximatedbyalargenumberofrectangles.
Suppose each rectangle has width δx. The area of rectangle 1 is y(x 2
)δx, the area of
rectangle2isy(x 3
)δxandsoon.LetA(x n
)denotethetotalareaunderthecurvefromx 1
tox n
. Then,
A(x n
) ≈ sum ofthe rectangularareas =
n∑
y(x i
)δx
i=2
Lettheareabeincreasedbyextendingthebasefromx n
tox.ThenA(x)isthetotalarea
under the curve fromx 1
tox(see Figure 13.7). Then,
increaseinarea = δA =A(x) −A(x n
) ≈y(x)δx
13.3 Definite and indefinite integrals 443
y
y
n
1 2 3 4 n–1
dx
x 1 x 2
dx
x 3 x 4 x 5
x n
x
x 1
x n
x
x
Figure13.6
The area isapproximated by (n −1) rectangles.
Figure13.7
Thearea isextended byadding an extrarectangle.
So,
δA
δx ≈y(x)
Inthe limitas δx → 0,we get
( ) δA
lim = dA
δx→0 δx dx =y(x)
Since differentiation isthe reverse of integration, we can write
∫
A = y(x)dx
To denote the limits of the area being considered we place values on the integral sign.
Thearea under the curve,y(x), betweenx =aandx =bisdenotedas
∫ x=b
x=a
ydx
ormorecompactlyby
∫ b
a
ydx
Theconstantsaandbareknownasthelimitsoftheintegral:lowerandupper,respectively.Sinceanareahasaspecificvalue,suchanintegraliscalledadefiniteintegral.The
area under the curve up to the vertical linex = b isA(b) (see Figure 13.8). Similarly
A(a) is the area up to the vertical linex = a. So the area betweenx = a andx = b is
A(b) −A(a), asshown inFigure 13.9.
Thearea betweenx =aandx =bis given by
Area =
∫ b
a
ydx =A(b) −A(a)
The integral is evaluated at the upper limit, b, and at the lower limit, a, and the
difference between thesegives the required area.
444 Chapter 13 Integration
y
y
y
A(b)
a
A(a)
Figure13.8
Thearea depends onthe limitsaandb.
x
b
x
a
Figure13.9
Thearea betweenx =aandx =bis
A(b) −A(a).
b
x
The expressionA(b) −A(a) isoften writtenas [A(x)] b a . Similarly[x2 +1]
2 3 isthe value
ofx 2 +1atx=3lessthevalueofx 2 +1atx=2.Thus
[x 2 +1] 3 2 =(32 +1)−(2 2 +1)=5
Ingeneral
[f(x)] b a =f(b)−f(a)
Note that since
∫ b
a
ydx =A(b) −A(a)
then, interchanging upper and lower limits,
∫ a
that is,
b
ydx =A(a) −A(b) = −{A(b) −A(a)}
∫ b
ydx=−
∫ a
a b
ydx
Interchanging the limitschanges the sign of the integral.
The evaluation of definite integrals isillustratedinthe following examples.
Example13.7 Evaluate
(a)
∫ 2
1
x 2 +1dx
(b)
∫ 1
∫ 2
Solution (a) LetI stand for x 2 +1dx.
I =
∫ 2
1
1
2
x 2 +1dx
[ ] x
x 2 3 2
+1dx=
3 +x 1
(c)
∫ π
0
sinx dx
Theintegralisnowevaluatedattheupperandlowerlimits.Thedifferencegivesthe
value required.
( ) ( ) 2
3 1
3
I =
3 +2 −
3 +1 = 8 3 +2− 4 3 = 10 3
13.3 Definite and indefinite integrals 445
(b) Because interchanging thelimitsofintegration changes thesignoftheintegral,we
(c)
find
∫ 1
x 2 +1dx=−
∫ π
0
∫ 2
2
1
x 2 +1dx=− 10 3
sinx dx = [−cosx] π 0 =(−cosπ)−(−cos0)=1−(−1)=2
Figure 13.10 illustratesthisarea.
sin x
0 p
x
Figure13.10
The area isgiven byadefinite integral.
Note that:
(1) Theintegratedfunctionisevaluatedattheupperandlowerlimits,andthedifference
found.
(2) No constantofintegration is needed.
(3) Any anglesaremeasured inradians.
Example13.8 Findthe area underz(t) = e 2t fromt = 1 tot = 3.
∫ 3 ∫ 3
[ e
Solution Area = zdt= e 2t 2t
dt =
1 1 2
[ ] [ ] e
6 e
2
= − = 198
2 2
] 3
1
If the evaluation of an area by integration yields a negative quantity this means that
some or all of the corresponding area is below the horizontal axis. This is illustrated in
Example 13.9.
Example13.9 Findthe area bounded byy =x 3 and thexaxisfromx = −3tox = −2.
Solution Figure 13.11illustratesthe required area.
∫ −2
[ ] x
x 3 4 −2
dx=
−3 4
−3
{ } { }
(−2)
4 (−3)
4
= − = − 65 4 4 4
The area is65/4 square units; the negative sign indicates thatitisbelow thexaxis.
446 Chapter 13 Integration
y
y = x 3
y
y = sin x
–3
–2
0 x
–p
p
x
Figure13.11
Areas below thexaxisare classed asnegative.
Figure13.12
Thepositive and negative areas cancel
each other out.
Example13.10 (a) Sketchy = sinx forx = −πtox = π.
(b) Calculate
∫ π
−π
sinxdx and comment onyourfindings.
(c) Calculatetheareaenclosedbyy = sinxandthexaxisbetweenx = −πandx = π.
Solution (a) Agraph ofy = sinx betweenx = −πandx = π isshown inFigure 13.12.
(b)
∫ π
−π
sinx dx = [−cosx] π −π = −cos(π) +cos(−π) = 0
Examining Figure 13.12 we see that the positive and negative contributions have
cancelledeachotherout;thatis,theareaabovethexaxisisequalinsizetothearea
below thexaxis.
(c) From(b)theareaabovethexaxisisequalinsizetotheareabelowthexaxis.From
Example 13.7(c) the area above the x axis is 2. Hence the total area enclosed by
y=sinxandthexaxisfromx=−πtox=πis4.
Ifanareacontainspartsbothaboveandbelowthehorizontalaxisthencalculatingan
integral will give the net area. If the total area is required, then the relevant limits must
first befound.Asketchofthe function oftenclarifiesthe situation.
Example13.11 Findthe area containedbyy = sinx fromx = 0 tox = 3π/2.
Solution Figure13.13illustratestherequiredarea.Fromthisweseethattherearepartsbothabove
and below thexaxisand the crossover pointoccurswhenx = π.
∫ π
0
∫ 3π/2
π
sinx dx = [−cosx] π 0
=−cosπ+cos0=2
sinx dx = [−cosx] 3π/2
π
( ) 3π
=−cos +cosπ=−1
2
13.3 Definite and indefinite integrals 447
y
0
p
3p ––2 y = sin x
x
Figure13.13
Thepositive andnegative areas are calculated
separately.
The total area is 3 square units. Note, however, that the single integral over 0 to 3π/2
evaluates to1;thatis, itgives the net value of 2 and −1.
∫ 3π/2
0
( ) 3π
sinx dx = [−cosx] 3π/2
0
=−cos +cos0=1
2
Theneedtoevaluatetheareaunderacurveisacommonrequirementinengineering.
Oftentherateofchangeofanengineeringvariablewithtimeisknownanditisrequired
to calculate the value of the engineering variable. This corresponds to calculating the
area under a curve.
Engineeringapplication13.3
Energyusedbyanelectricmotor
Consider a small d.c. electric motor being used to drive an electric screwdriver. The
amount of power that is supplied to the motor by the battery depends on the load on
the screwdriver. Therefore the power supplied to the screwdriver is a function that
varies with time.Figure 13.14shows a typicalcurve ofpower versus time.Now,
P = dE
dt
whereP = power (W),E = energy (J). Therefore, to calculate the energy used by
the motor between timest 1
andt 2
, wecan write
E =
∫ t2
t 1
Pdt
This is equivalent to evaluating the area under the curve, P(t), between t 1
and t 2
,
which is shown as the shaded region inFigure 13.14.
Power (watts)
t 1
t 2
Time (s)
Figure13.14
Shaded area representsthe energy usedto
drivethe motor duringthe time interval
t 1 tt 2 .
448 Chapter 13 Integration
Engineeringapplication13.4
Capacitanceofacoaxialcable
A coaxial cable has an inner conductor with a diameter of 1.02 mm and an outer
conductorwithaninternaldiameterof3mm,asshowninFigure13.15.Theinsulator
separatingthetwoconductorshasarelativepermittivityof1.55.Letuscalculatethe
capacitance ofthe cable per metre length.
Outer conductor
Cable sheath
r
a
b
1.02 mm 3 mm
Insulator
Inner conductor
Figure13.15
Cross-sectionofacoaxial
cable.
Before solving this problem it is instructive to derive the formula for the capacitance
of a coaxial cable. Imagine that the inner conductor has a charge of +Q per
metrelength and thatthe outer conductor has a charge of −Qper metrelength. Furtherassumethecableislongandacentralsectionisbeinganalysedinorderthatend
effects can be ignored.
Consideranimaginarycylindricalsurface,radiusrandlengthl,withintheinsulator(ordielectric).Gauss’stheoremstatesthattheelectricfluxoutofanyclosedsurfaceisequaltothechargeenclosedbythesurface.Inthiscase,becauseofsymmetry,
theelectricfluxpointsradiallyoutwardsandsonofluxisdirectedthroughtheends
of the imaginary cylinder; that is, end effects can be neglected. The curved surface
area of the cylinder is2πrl. Therefore, using Gauss’stheorem
D×2πrl=Ql
whereD =electricflux density.
WhenaninsulatorordielectricispresentthenD = ε r
ε 0
E,whereE istheelectric
field strength, ε r
is the relative permittivity, ε 0
is the permittivity of free space and
has a value of 8.85 ×10 −12 Fm −1 . Therefore,
thatis,
ε r
ε 0
E2πrl =Ql
E =
Q
2πε r
ε 0
r
(13.9)
This equation gives a value for the electric field within the dielectric. In order to
calculatethecapacitanceofthecableitisnecessarytocalculatethevoltagedifference
between thetwoconductors. LetV a
representthevoltageoftheinnerconductor and
V b
the voltage of the outer conductor.
13.3 Definite and indefinite integrals 449
The electric field is a measure of the rate of change of the voltage with position.
Inotherwords,ifthevoltageischangingrapidlywithpositionthenthiscorresponds
to a large magnitude of the electric field. This is illustrated in Figure 13.16. The
magnitude of the electric field at point A is larger than at point B. As a positive
electricfield,E,correspondstoadecreaseinvoltage,V,withpositiontherelationship
betweenE andV, ingeneral, is
E = − dV
(13.10)
dr
This expression can be used to calculate the voltage difference arising as a result of
an electric field. In practice, this is a simplified equation and is only valid provided
r is in the same direction as the electric field. If this is not the case, then a modified
vector formof Equation (13.10) isneeded.
V
A
B
r
Figure13.16
Thegradient ofthe curve, dV , isproportional to the
dr
magnitude ofthe electric field.
In the case of the coaxial cable, E is in the same direction as r and so
Equation (13.10) can be used to calculate the voltage difference between the two
conductors. From Equation (13.10)
dV
dr = −E
Therefore the voltage atanarbitrarypoint,r, isgiven by
∫
V = − E dr
Consequently, the voltage difference between pointsr =bandr =aisgiven by
V b
−V a
=−
∫ b
a
E dr
= − Q ∫ b
1
dr using Equation (13.9)
2πε r
ε 0
r
= − Q
2πε r
ε 0
[lnr] b a
= − Q
2πε r
ε 0
ln( b
a
a
)
Thisgivesthevoltageoftheouterconductorrelativetotheinnerone.Thusthevoltage
of the inner conductor relative tothe outer one is
V a
−V b
=
Q ) b
ln(
2πε r
ε 0
a
➔
450 Chapter 13 Integration
More generally, capacitance is defined asC = Q/V, whereV is the voltage difference.
Therefore
C =
Q = 2πε r ε 0
(
V a
−V b b
ln
a)
Note that this is the capacitance per unit length of cable. Using ε r
= 1.55, a =
5.1 ×10 −4 m,b= 1.5 ×10 −3 m, weget
2π ×1.55 ×8.85 ×10−12
C = ( ) = 7.99 ×10 −11 ≈ 80 pF m −1
1.5 ×10
−3
ln
5.1 ×10 −4
Engineeringapplication13.5
Characteristicimpedanceofacoaxialcable
Acommonlyquotedparameterofacoaxialcableisitscharacteristicimpedance,Z 0
.
Thecharacteristicimpedanceistheratioofthevoltagetothecurrentforapropagating
wave travelling on an electrical transmission line in the absence of reflections.
ThevalueofZ 0
iseasytoselectatthedesignstagebycarefullychoosingthedimensionsaandbtogether
with the type of insulating material within the cable. The two
mostcommoncharacteristicimpedancesforflexiblecablesareapproximately50
and 75 . The main reason for selecting 50 is that it represents a good compromise
between the ability to handle high power and the minimization of losses that
occur in thermoplastic dielectrics. The value of 75 is mainly considered optimal
for situations of low power transmission and where losses are the most important
consideration. Often these 75 cables are of the air-dielectric type where the inner
conductor is supported by a spacer rather than a solid plastic dielectric. An example
of an application for a 75 cable is the connection from a rooftop TV antenna to a
TV set.
Itcanbeshownfromfundamentaltransmissionlinetheorythatthecharacteristic
impedanceofaloss-freecableis
√
L
Z 0
=
C
Itcan be shown thatthe expression for the inductance of a coaxial cable isgiven by
L = µ ( )
0 b
2π ln a
As shown inEngineering application 13.4, the expression for the capacitance is
C = 2πε r ε 0
( b
ln
a)
13.3 Definite and indefinite integrals 451
Substituting forLandC inthe equation forZ 0
( )
µ 0 b Z 0
=
2π ln a
/ ( ) = 1 √ ( )
µ0 b
ln
√2πε r
ε 0 b 2π ε r
ε 0
a
ln
a
For the cable defined in Engineering application 13.4, and substituting for the permeability
of free space, µ 0
= 4π ×10 −7 ,
Z 0
= 1 √
( )
µ 0 1.5 ×10
−3
2π 1.55 ×8.85 ×10 ln =52
−12 0.51 ×10 −3
13.3.1 Useofadummyvariable
Consider the following integrals,I 1
andI 2
:
I 1
=
∫ 1
Then,
[ ] t
3 1
I 1
= =
3
0
0
[ x
3
I 2
=
3
t 2 dt I 2
=
] 1
0
=
∫ 1
0
x 2 dx
( 1
3)
−(0)= 1 3
( 1
3)
−(0)= 1 3
So clearlyI 1
= I 2
. The value ofI 1
does not depend upont, and the value ofI 2
does not
depend uponx. Ingeneral,
I =
∫ b
a
f(t)dt =
∫ b
a
f(x)dx
Because the value ofI is the same, regardless of what the integrating variable may be,
wesayxandt are dummy variables. Indeed we could write
I =
∫ b
a
f(z)dz=
∫ b
a
f(r)dr=
Thenz,randyaredummy variables.
∫ b
a
f(y)dy
EXERCISES13.3
1 Evaluate the following integrals:
∫ 3 ∫ 4
(a) x 3 1
dx (b)
1 1 x dx
∫ 1 ∫ 1
(c) 2 dx (d) e x dx
0
−1
∫ π/3
(e) sint dt
0
∫ π/2
(g) cos3t dt
0
∫ 1.2
(i) tanx dx
1
(f)
(h)
∫ π
sin(t +3)dt
0
∫ 2
cos πt dt
1
452 Chapter 13 Integration
2 Evaluate the followingintegrals:
∫ 1
(a) t 2 +1dt
0
∫ 1 1
(b)
0 1+t 2 dt
∫ 2
(c) 3e 2x −2e 3x dx
1
∫ 2
(d) (x +1)(x +2)dx
1
∫ 2
(e) 2sin4t dt
0
∫ ( ) π t
(f) 4cos dt
0 2
3 Evaluate the followingintegrals:
∫ 1
(a) t 2 +0.5t−6dt
0
∫ 3 3
(b)
2 2x + 2x
3 dx
∫ 2
(c) e −2x −3e −x dx
1
∫ 2
(d) 3sin(4t − π) +5cos(3t + π/2)dt
0
∫ 1
(e) 2tantdt
0
∫ 1.5 2
(f)
0 1+x 2 − 1
√ dx
4−x 2
4 Calculate the areabetween f (t) = cost andthet axis
ast variesfrom
(a) 0 to π (b) 0 to π 4 2
3π
(c) toπ (d) 0toπ
4
5 Calculate the total area between f (t) = cos2t andthe
t axisast variesfrom
(a) 0 to π 4
(b)
π
4 to π 2
(c) 0 to π 2
6 Calculate the area enclosed by the curvesy =x 2 and
y=x.
7 Calculate the area enclosed by the graphs ofthe
functionsy =t 2 +5 andy = 6.
8 Evaluate the following definiteintegrals:
∫ 1.5 1
(a)
1 t + 1 e t + 1
sint dt
∫ 4 5
(b)
1 8+3x 2 dx
∫ 1
(c) sinhx dx
−1
∫ 1
(d) coshx dx
−1
9 Calculate the area betweeny = 2tant andthet axis
for −1 t 1.4.
10 Findthe areabetweeny = sint,y = cost andthey
axis, fort 0.
11 Thevelocity, v,ofaparticleisgiven by
v=(1+t) 2
Findthe distancetravelled bythe particlefromt = 1
tot = 4; thatis,evaluate ∫ 4
1 vdt.
12 Evaluate the area under the functionx = 1/t for
1t10.
13 Evaluate
∫ 2
(a) x 1.4 dx
3
∫ π
(c) sinxcosx dx
0
∫ 1
(b) (e t ) 2 dt
0
∫ 2
(d) sinh 2 x dx
1
Solutions
1 (a) 20 (b) 1.3863 (c) 2
(d) 2.3504 (e) 0.5 (f) −1.9800
(g) − 1 3
4
2 (a)
3
53
(d)
6
(h) 0 (i) 0.3995
(b)
π
4
(e) 0.5728 (f) 8
(c) −184.75
3 (a)− 65
12
(b) 2.275 (c) −0.639
(d) −0.9255 (e) 1.2313 (f) 1.1175
4 (a) 0.7071 (b) 1
5 (a)
(c) 0.7071 (d) 2
1
2
(b)
1
2
(c) 1
Review exercises 13 453
6
7
1
6
4
3
8 (a) 1.0839 (b) 0.6468
(c) 0 (d) 2.3504
9 4.7756
10 0.4142
11 39
12 2.3026
13 (a) −3.6202 (b) 3.1945
(c) 0 (d) 5.4158
REVIEWEXERCISES13
1 Find the following integrals:
∫
∫ 2
(a) 3x 2 +xdx (b)
t +2t+2dt
∫
∫ √t 1
(c) (1 +z)(1 −z)dz (d) − √t dt
(e)
∫
2 Given
x 2/3 +4x 3 dx
dy
dx =x2 +sinx+cos2x+1
findan expression fory(x).
3 Find the following integrals:
∫
(a) 3e x + 3 e x dx
∫
(b) (1+e x )(1−e −x )dx
∫
(c) 2e 4t +1 dt
∫
(d) e x (1+e x )dx
∫
(e) 4e −t −e −2t dt
4 Find the integrals
∫
(a) sin2x +cos2x dx
∫
(b) 2sint−costdt
∫ ( )
t
(c) 4tan dt
2
∫
(d) sin(π −z) +cos(π −2z)dz
∫
(e) tan(t + π)dt
∫ ( )
t
(f) 2sin3t+2sin dt
3
5 Find the following integrals:
∫
(a) cosec(3t + π)dt
∫ ( )
x
(b) sec
2 +1 dx
∫ ( )
π +t
(c) cot dt
2
∫ ( )
y
(d) 3cosec
3 −2 dy
∫ 1
(e) cot(π −2z)dz
2
∫ 2
(f) sec(2t − π)dt
3
6 Find the following integrals:
∫
∫
4
(a) √ dv (b) 1−v 2
(c)
(e)
(g)
∫
∫
∫
∫
1
49+t 2 dt (d)
2
√
1−4t 2 dt
3
x 2 +3 dx
(f) ∫
1
2 √ 1−v dv 2
1
50+2t 2 dt
1
√
36 −9x 2 dx
7 The speed, v(t),ofaparticleis given by
v(t) =t +e −t
(a) Findthe distancetravelled by the particle.
(b) Calculate the distancetravelled betweent = 1
andt=3.
8 The capacitanceofacapacitor is 0.1F.The current,
i(t),through the capacitor is given by
i(t) = 50sinπt
Derive an expression forthe voltageacross the
capacitor.
454 Chapter 13 Integration
9 Byusingsuitable trigonometric identities find
∫
(a) sin 2 2t +cos 2 2t dt
∫
(b) sin2tcos2t dt
∫
1
(c)
sin2t dt
∫ cos2t
(d)
sin2t dt
∫ sin2t
(e)
cos2t dt
10 Byexpressing
2x 2 +x+2
x 3 +x
asits partialfractions, find
∫ 2x 2 +x+2
x 3 dx
+x
11 Evaluate the followingdefiniteintegrals:
∫ 1
(a) 2t 2 +t 3 dt
0
∫ 3 2
(b)
1 x − x 2 dx
∫ 1
(c) 7−t+7t 2 dt
0
∫ 2
(d) (z +1)(z +2)dz
0
∫ 4 √
(e) x dx
1
∫ −1 x −2
(f) dx
−2 x
12 Evaluate the followingdefiniteintegrals:
∫ 1
∫ 1
(a) e 3x+1 dx (b) e 2t −e t +1dt
0
−1
∫ 2
∫ 1
(c) (e z −1) 2 dz (d) e −2x +e 2x dx
1
0
∫ 0
(e) x+e x dx
−1
13 Evaluate the followingintegrals:
∫ π/2
(a) 2sin3t dt
0
∫ π
(b) sin2t −cos2t dt
π/2
∫ π/4
(c) tant+tdt
0
∫ ( ) ( )
π/4 t t
(d) sin +cos dt
0 2 2
∫ 0.1
(e) tan3t dt
0
14 Evaluate the following definiteintegrals:
∫ 2π/k
(a) sinkt dt
0
∫ 2π/k
(b) coskt dt
0
wherekis aconstant.
15 Evaluate the following definiteintegrals:
∫ 0.5
(a) cosec(2x +1)dx
0
∫ 0.1
(b) sec3t dt
0
∫ π/4
(c) tan(x + π)dx
−π/4
16 Evaluate the following definiteintegrals:
∫ 2 3
(a) √ dx
0 9−x 2
∫ 1 2
(b)
−1 9+x 2 dx
∫ 1 1
(c) √ dx
0 8−2x 2
∫ 3 1
(d)
1 10+4t 2 dt
17 (a) Calculate the area enclosed bythe curvey =x 3 ,
thexaxisandx = 2.
(b) Calculate the area enclosed bythe curvey =x 3 ,
theyaxisandy = 8.
18 Findthe total area between f (t) =t 2 −4 andthet
axisonthe followingintervals:
(a) [−4, −3] (b) [−3, −1]
(c) [0,3]
(d) [−3,3]
19 Calculate the area enclosed by the curve
y=x 2 −x−6andthexaxis.
20 Calculate the total area betweeny =x 2 −3x −4 and
thexaxisonthe followingintervals:
(a) [−2, −1] (b) [−2,1]
(c) [2,4] (d) [2,5]
21 Calculate the area enclosed by the curvey =x 3 and
theliney =x.
Review exercises 13 455
22 Calculate the area enclosed byy =x 2 +4 and
y=12−x 2 .
23 Calculate the area enclosed byy = sinx andy = 2x
π .
Solutions
1 (a) x 3 + x2
2 +c
2
(b) 2ln|t|+t 2 +2t+c
(c) z− z3 3 +c
2
(d)
3 t3/2 −2t 1/2 +c
3
(e)
5 x5/3 +x 4 +c
x 3
3 −cosx+1 2 sin2x+x+c
3 (a) 3e x −3e −x +c
(b) e x +e −x +c
(c)
e 4t
2 +t+c
(d) e x + e2x
2 +c
(e) −4e −t + e−2t
2 +c
4 (a) − 1 2 cos2x + 1 2 sin2x+c
(b) −2cost−sint+c
( )∣ (c) 8ln
∣ sec t ∣∣∣∣
+c
2
(d) cos(π −z) − 1 sin(π −2z) +c
2
(e) ln|sec(t +π)|+c
( )
(f) − 2 3 cos3t −6cos t
+c
3
1
5 (a) ln|cosec(3t + π) −cot(3t + π)| +c
3 ( ) ( )∣ (b) 2ln
∣ sec x
2 +1 x ∣∣∣∣
+tan
2 +1 +c
( )∣ (c) 2ln
∣ sin π +t ∣∣∣∣
+c
2
( ) ( )∣ (d) 9ln
∣ cosec y
3 −2 y ∣∣∣∣
−cot
3 −2 +c
(f)
(e) − 1 ln|sin(π −2z)| +c
4
1
ln|sec(2t −π)+tan(2t −π)|+c
3
6 (a) 4sin −1 v+c
(b) 1 2 sin−1 v +c
( )
1 t
(c)
7 tan−1 +c
7
( )
1 t
(d)
10 tan−1 +c
5
(e) sin −1 (2t) +c
( )
1 x
(f)
3 sin−1 +c
2
(g) √ ( )
3tan −1 x
√ +c
3
7 (a) t2 2 −e−t +c (b) 4.3181
8 − 500
π cosπt+c
9 (a) t+c
(b) − cos4t +c
8
1
(c) ln|cosec2t −cot2t|+c
2
1
(d)
2 ln|sin2t|+c
1
(e)
2 ln|sec2t|+c
10 2ln|x|+tan −1 x+c
11
11 (a)
12
38
(d)
3
(b) 0.1972
(e)
14
3
(c)
53
6
(f) 2.3863
12 (a) 17.2933 (b) 3.2765 (c) 15.2630
(d) 3.6269 (e) 0.1321
2
13 (a)
3
(b) −1 (c) 0.6550
(d) 0.9176 (e) 0.0152
14 (a) 0 (b) 0
456 Chapter 13 Integration
15 (a) 0.5238 (b) 0.1015 (c) 0
16 (a) 2.1892 (b) 0.4290
(c) 0.3702 (d) 0.0825
17 (a) 4 (b) 12
18 (a)
25
3
(b) 4
(c)
23
3
(d)
46
3
19 125
6
20 (a)
21 0.5
22 64
3
17
6
23 0.4292
(b)
61
6
(c)
22
3
(d)
61
6
14 Techniques
ofintegration
Contents 14.1 Introduction 457
14.2 Integrationbyparts 457
14.3 Integrationbysubstitution 463
14.4 Integrationusingpartialfractions 466
Reviewexercises14 468
14.1 INTRODUCTION
The previous chapter showed us how to integrate functions which matched the list of
standard integrals given in Table 13.1. Clearly, it is impossible to list all possible functions
in the table and so some general techniques are required. Integration techniques
maybeclassifiedasanalytical,thatisexact,ornumerical,thatisapproximate.Wewill
now study threeanalytical techniques:
(1) integration by parts;
(2) integration by substitution;
(3) integration usingpartialfractions.
14.2 INTEGRATIONBYPARTS
This technique is used to integrate a product, and is derived from the product rule for
differentiation. Let u and v be functions of x. Then the product rule of differentiation
states:
d du
(uv) =
dx dx v +udv dx
Rearranging we have
u dv
dx = d dx (uv)−vdu dx
458 Chapter 14 Techniques of integration
Integrating thisequation yields
∫
u dv ∫ ∫ d
dx dx = dx (uv)dx−
v du
dx dx
Recognizing thatintegration and differentiation areinverse processes allows
∫ d(uv)
dx
dx
tobe simplified touv. Hence,
∫
( ) ∫ dv
u dx=uv−
dx
( du
v
dx
)
dx
This isthe formulafor integration by parts.
∫
Example14.1 Find
xsinxdx.
Solution We recognize the integrand as a product of the functions x and sinx. Let u = x,
dv du
=sinx. Then =1, v = −cosx. Using the integration by parts formula we get
dx dx
∫
∫
xsinxdx=x(−cosx)− (−cosx)1dx
=−xcosx+sinx+c
Whendealingwithdefiniteintegralsthecorrespondingformulaforintegrationbyparts
is
∫ b
a
( ∫ dv b
( ) du
u
)dx =[uv] b a
dx
− v dx
dx
a
Example14.2 Evaluate
Solution We let
Then
∫ 2
0
u=x
xe x dx
and
dv
dx = ex
du
=1 and v=ex
dx
14.2 Integration by parts 459
Usingthe integration by parts formula fordefinite integrals we have
∫ 2
∫ 2
xe x dx = [xe x ] 2 0 − e x·1dx
0
0
= 2e 2 −[e x ] 2 0
=2e 2 −[e 2 −1]
=e 2 +1
Sometimes integration by parts needs tobe used twice,as the next example illustrates.
Example14.3 Evaluate
∫ 2
0
x 2 e x dx
Solution We let
Then
u=x 2
and
dv
dx = ex
du
=2x and v=ex
dx
Usingthe integration by parts formula wehave
Now
∫ 2
0
∫ 2
0
∫ 2
0
∫ 2
x 2 e x dx = [x 2 e x ] 2 0 − 2xe x dx
=4e 2 −2
0
∫ 2
0
xe x dx
xe x dx has been evaluated usingintegration by parts inExample 14.2. So
x 2 e x dx = 4e 2 −2[e 2 +1] = 2e 2 −2 = 12.78
The next example illustrates a case in which the integral to be found reappears after
repeated application ofintegration by parts.
Example14.4 Find
∫
e t sintdt
460 Chapter 14 Techniques of integration
Solution We let
and so
u=e t
and
dv
dt
=sint
du
dt =et and v=−cost
Applying integration by parts yields
∫
∫
e t sintdt =−e t cost− (−cost)e t dt +c
∫
=−e t cost+
e t costdt +c (14.1)
We now apply integration by parts to ∫ e t costdt. We let
Then
u=e t
and
dv
dt
=cost
du
dt =et and v =sint
So
∫ ∫
e t costdt =e t sint−
e t sintdt (14.2)
Substituting Equation (14.2) into Equation (14.1) yields
∫
∫
e t sintdt =−e t cost+e t sint− e t sintdt+c
Rearranging the equation gives
∫
2 e t sintdt =−e t cost+e t sint+c
from which we see that
∫
e t sintdt = −et cost+e t sint+c
2
Example14.5 Evaluate
∫ 2
0
x n e x dx
forn=3,4,5.
Solution The integral may be evaluated by using integration by parts repeatedly. However, this
is slow and cumbersome. Instead it is useful to develop a reduction formula as is now
illustrated.
Letu =x n and dv
dx = ex . Then du
dx =nxn−1 and v = e x .
Usingintegration by parts we have
∫ 2
0
Writing
I n
=
wesee that
Hence
I n−1
=
∫ 2
x n e x dx = [x n e x ] 2 0 − nx n−1 e x dx
∫ 2
0
∫ 2
=2 n e 2 −n
x n e x dx
0
x n−1 e x dx
0
∫ 2
0
x n−1 e x dx
14.2 Integration by parts 461
I n
= 2 n e 2 −nI n−1
(14.3)
Equation (14.3) iscalled areduction formula.
We have already evaluatedI 1
, thatis
I 1
=e 2 +1
∫ 2
Usingthe reduction formula withn = 2 gives
∫ 2
0
x 2 e x dx =I 2
=2 2 e 2 −2I 1
= 4e 2 −2(e 2 +1)
=2e 2 −2
Note that thisisinagreement with Example 14.3.
Withn = 3 the reduction formulayields
∫ 2
0
x 3 e x dx =I 3
=2 3 e 2 −3I 2
Withn = 4 we have
∫ 2
0
= 8e 2 −3(2e 2 −2)
=2e 2 +6
x 4 e x dx =I 4
=2 4 e 2 −4I 3
Withn = 5 we have
∫ 2
= 16e 2 −4(2e 2 +6)
=8e 2 −24
0
xe x dx inExample 14.2, and found
0
x 5 e x dx =I 5
=2 5 e 2 −5I 4
= 32e 2 −5(8e 2 −24)
= 120 −8e 2
462 Chapter 14 Techniques of integration
EXERCISES14.2
1 Useintegration by partsto findthe following:
∫
∫
(a) xsin(2x)dx (b) te 3t dt
(c)
(e)
∫
xcosxdx
∫ x
e x dx
(d)
∫
( )
v
2vsin dv
2
2 Useintegration by partsto find
∫
(a) tlntdt
∫
(b) lntdt
∫
(c) t n lntdt (n ≠ −1)
∫
(d) tsin(at +b)dt a,bconstants
∫
(e) te at+b dt a,bconstants
3 Evaluate the followingdefiniteintegrals:
∫ 1
∫ π/2
(a) xcos2xdx (b) xsin2xdx
0
0
∫ 1
∫ 3
(c) te 2t dt (d) t 2 lntdt
−1
1
∫ 2 2x
(e) dx
0 e2x 4 Find
∫
(a) t 2 e 2t dt
(b)
(c)
∫
t 2 cos3tdt
∫ ( )
t 2 t
sin dt
2
5 Evaluate the following definiteintegrals:
∫ 2
(a) t 2 e t dt
0
∫ 1
(b) t 2 sintdt
−1
∫ 1
(c) t 2 cos3tdt
0
6 Obtainareduction formulafor
∫
I n = t n e kt dt n,k constants
∫
Hencefind
∫
t 2 e 3t dt,
∫
t 3 e 3t dt and
t 4 e 3t dt.
7 Useintegration byparts twiceto obtain areduction
formulafor
∫ π/2
I n = t n sintdt
0
∫ π/2 ∫ π/2
Hencefind t 3 sintdt, t 5 sintdt
0 0
∫ π/2
and t 7 sintdt.
0
8 Useintegration byparts to find
∫ π/2
e 2x cosxdx
0
Solutions
sin2x
1 (a) − xcos2x
4 2
( )
(b) e 3t t
3 − 1 +c
9
+c
(c) cosx+xsinx+c
( ) ( )
v v
(d) 8sin −4vcos +c
2 2
(e) −e −x (x +1) +c
2 (a)
t 2 lnt
2
− t2 4 +c
(b) tlnt−t+c
(c)
(d)
(lnt)t n+1
−
tn+1
n +1 (n +1) 2 +c
sin(at +b)
a 2
−
(e) e at+b (
t
a − 1 a 2 )
+c
tcos(at +b)
a
+c
14.3 Integration by substitution 463
3 (a) 0.1006 (b) 0.7854 (c) 1.9488
4 (a)
(d) 6.9986 (e) 0.4542
(b)
e 2t (2t 2 −2t +1)
4
+c
)
2
(t 2
9 tcos3t+ 3 − 2 sin3t+c
27
( ) ( )
t
(c) 8tsin −2(t 2 t
−8)cos +c
2 2
5 (a) 12.7781 (b) 0 (c) −0.1834
6 I n = tn e kt
− n k k I n−1 , e3t (9t 2 −6t +2)
,
27
e 3t (9t 3 −9t 2 +6t −2)
,
27
e 3t (27t 4 −36t 3 +36t 2 −24t +8)
81
7 I n =n
8 4.2281
{(
π
2
) n−1
− (n −1)I n−2
}
1.4022,2.3963,4.5084
14.3 INTEGRATIONBYSUBSTITUTION
∫
Example14.6 Find
This technique is the integral equivalent of the chain rule. It is best illustrated by
examples.
(3x +1) 2.7 dx.
Solution Letz = 3x + 1, so that dz
dx
becomes
∫
z 2.71 3 dz = 1 ∫
z 2.7 dz = 1 ( z
3.7
3 3 3.7
dz
= 3, that is dx = . Writing the integral in terms ofz, it
3
)
+c = 1 (3x +1) 3.7
+c
3 3.7
Example14.7 Evaluate
∫ 3
Solution Let v =t 2 so dv
dt
2
dt = 1 2t dv
tsin(t 2 )dt.
= 2t, thatis
When changing the integral from one in terms oft to one in terms of v, the limits must
alsobe changed. Whent = 2, v = 4;whent = 3, v = 9.Hence, the integral becomes
∫ 9
4
sinv
2 dv = 1 2 [−cos v]9 4 = 1 [−cos9 +cos4] = 0.129
2
Sometimesthe substitution can involve a trigonometric function.
464 Chapter 14 Techniques of integration
Example14.8 Evaluate
∫ 2
1
sint cos 2 tdt.
Solution Putz = cost so that dz = −sint, that is sintdt = −dz. Whent = 1,z = cos1; when
dt
t = 2,z = cos2.Hence
∫ 2
∫ cos2
[ ] z
sint cos 2 tdt =− z 2 3 cos2
dz=−
3
∫
Example14.9 Find
1
e
tanx
cos 2 x dx.
cos1
cos1
= cos3 1 −cos 3 2
3
Solution Putz = tanx. Then dz
dx = sec2 x,dz =
dx
cos 2 x . Hence,
∫ e
tanx
∫
cos 2 x dx = e z dz=e z +c=e tanx +c
∫
Example14.10 Find
Integration by substitution allows functions ofthe form df/dx
f
3x 2 +1
x 3 +x+2 dx.
= 0.0766
tobe integrated.
Solution Putz =x 3 +x+2,then dz
dx = 3x2 +1,thatisdz = (3x 2 +1)dx. Hence,
∫
3x 2 ∫
+1 dz
x 3 +x+2 dx = z =ln|z|+c=ln|x3 +x+2|+c
∫ df/dx
Example14.11 Find dx.
f
Solution Putz = f.Then dz
dx = df
dx , thatis
dz = df
dx dx.
Hence,
∫ df/dx
f
dx =
∫ dz
z =ln|z|+c=ln|f|+c
and so
∫
df/dx
f
dx=ln|f|+c
14.3 Integration by substitution 465
The resultofExample 14.11 isparticularly important.
∫ df/dx
dx=ln|f|+c
f
∫ 4
3t
Example14.12 2 +2t
Evaluate
2 t 3 +t 2 +1 dt.
Solution The numerator isthe derivative ofthe denominator and so
∫ 4
2
3t 2 +2t
t 3 +t 2 +1 dt = [ln|t3 +t 2 +1|] 4 2 =ln81−ln13=1.83
Example14.13 Find
∫
(a)
∫
(b)
∫
(c)
4
5x−7 dx
t
t 2 +1 dt
e t/2
e t/2 +1 dt
Solution The integrands arerewritten so thatthe numerator isthe derivative of the denominator.
∫
4
(a)
5x−7 dx = 4 ∫
5
5 5x−7 dx = 4 5 ln|5x−7|+c
∫
t
(b)
t 2 +1 dt = 1 ∫
2t
2 t 2 +1 dt
(c)
∫
= 1 2 ln|t2 +1|+c
e t/2 ∫ 1
e t/2 +1 dt=2 2 et/2
e t/2 +1 dt =2ln|et/2 +1|+c
EXERCISES14.3
1 Usethe given substitutionsto findthe following
integrals:
∫
(a) (4x+1) 7 dx, z=4x+1
∫
(b) t 2 sin(t 3 +1)dt, z =t 3 +1
∫
(c) 4te −t2 dt, z=t 2
∫
(d) (1−z) 1/3 dz, t=1−z
∫
(e) cost(sin 5 t)dt, z =sint
2 Evaluate the following definiteintegrals:
∫ 2
∫ π/2
(a) (2t +3) 7 dt (b) sin2tcos 4 2tdt
1
0
∫ 1
∫ 2
(c) 3t 2 √
e t3 dt (d) 4+3xdx
0
0
(
∫ √x )
2 sin
(e) √ dx
1 x
3 Find the area betweeny =x(3x 2 +2) 4 andthexaxis
fromx=0tox=1.
466 Chapter 14 Techniques of integration
4 Find
∫
(a)
∫
(c)
∫
(e)
4x+1
2x 2 +x+3 dx
3
9−2x dx
1
tlnt dt
(b) ∫
(d) ∫
2sin2x
cos2x+7 dx
t
t 2 +1 dt
5 Evaluate the following:
∫ 1
∫
2
π/2
(a)
0 (1 +3x) 2 dx (b) sint √ costdt
0
∫ 2
∫
3 +x 2
(c)
1 x 2 +6x+1 dx (d) e √ t
√ dt
1 t
∫ 2
(e) xsin(π −x 2 )dx
0
Solutions
1 (a)
(4x +1) 8
32
+c (b) − cos(t3 +1)
3
+c
(c) −2e −t2 +c (d) − 3 4 (1 −z)4/3 +c
(e)
1
6 sin6 t+c
2 (a) 3.3588 ×10 5 (b) 0.2 (c) 1.7183
(d) 5.2495 (e) 0.7687
3 103.098
4 (a) ln(2x 2 +x+3)+c
(b) −ln(cos2x +7) +c
(c) − 3 ln(9 −2x) +c
2
1
(d)
2 ln(t2 +1)+c
(e) ln(lnt) +c
5 (a) 0.5 (b) 0.6667 (c) 0.3769
(d) 2.7899 (e) 0.8268
14.4 INTEGRATIONUSINGPARTIALFRACTIONS
The technique of expressing a rational function as the sum of its partial fractions has
been covered in Section 1.7. Some expressions which at first sight look impossible to
integrate mayinfactbeintegrated when expressed astheirpartialfractions.
Example14.14 Find
∫
(a)
∫
(b)
1
x 3 +x dx
13x−4
6x 2 −x−2 dx
Solution (a) Firstexpress the integrand inpartialfractions:
Then,
1
x 3 +x = 1
x(x 2 +1) = A x + Bx+C
x 2 +1
1=A(x 2 +1)+x(Bx+C)
Equating the constant terms: 1 =Aso thatA = 1.
Equating the coefficients ofx: 0 =C so thatC = 0.
Equating the coefficients ofx 2 : 0 =A +B and henceB = −1.
14.4 Integration using partial fractions 467
(b)
Then, ∫ ∫
1 1
x 3 +x dx = x − x
x 2 +1 dx
=ln|x|− 1 2 ln|x2 +1|+c=ln
x
∣√ x2 +1
∣ +c
∫
∫
13x−4
6x 2 −x−2 dx = 13x−4
(2x +1)(3x −2) dx
∫
=
= 3 2
3
2x+1 + 2
3x−2 dx
∫
2
2x+1 dx + 2 ∫
3
using partialfractions
3
3x−2 dx
= 3 2 ln|2x+1|+2 3 ln|3x−2|+c
∫ 1
4t
Example14.15 3 −2t 2 +3t−1
Evaluate dt.
0 2t 2 +1
Solution Usingpartialfractions we maywrite
4t 3 −2t 2 +3t−1 t
=2t−1+
2t 2 +1 2t 2 +1
Hence,
∫ 1
0
4t 3 −2t 2 +3t−1
2t 2 +1
dt =
=
∫ 1
0
2t−1+
[
t
2t 2 +1 dt = t 2 −t + 1 ] 1
4 ln |2t2 +1|
0
[1 −1+ 1 ]
4 ln3 −
[0 −0+ 1 ]
4 ln1 = 0.275
EXERCISES14.4
1 Bywriting the integrand asits partialfractions find
∫ x +3
(a)
x 2 +x dx
∫ t −3
(b)
t 2 −1 dt
∫
8x+10
(c)
4x 2 +8x+3 dx
∫ 2t 2 +3t+3
(d) dt
2(t +1)
∫ 2x 2 +x+1
(e)
x 3 +x 2 dx
2 Evaluate the following integrals:
∫ 3 5x+6
(a)
1 2x 2 +4x dx
∫ 1 3x+5
(b)
0 (x +1)(x +2) dx
∫ 2 3−3x
(c)
1 2x 2 +6x dx
∫ 0 4x+1
(d)
−1 2x 2 +x−6 dx
∫ 3 x 2 +2x−1
(e)
2 (x 2 +1)(x −1) dx
468 Chapter 14 Techniques of integration
3 Find the area betweeny =
axisfromx=0tox=1.
4 Usepartialfractions to find
∫ 3 3x+2
(a)
2 x 2 −1 dx
4x+7
4x 2 and thex
+8x+3
(b)
(c)
(d)
∫
t +3
t 2 +2t+1 dt
∫ 2t 2 +3t+1
t 3 dt
+t
∫
6t+3
2t 2 −5t+2 dt
Solutions
1 (a) 3lnx−2ln(x+1)+c
(b) 2ln(t+1)−ln(t−1)+c
1
(c)
2 ln(2x +3) + 3 ln(2x +1) +c
2
t(t +1)
(d) ln(t +1) + +c
2
(e) 2ln(x +1) − 1 x +c
2 (a) 2.1587
(b) 1.7918
(c) −0.09971
(d) 0.1823
(e) 0.9769
3 1.2456
4 (a) 1.8767
(b) ln(t +1) − 2
t +1 +c
(c) 3tan −1 t + 1 2 ln(t2 +1)+lnt+c
(d) 5ln(t−2)−2ln(2t−1)+c
REVIEWEXERCISES14
1 Usethe given substitutionto findthe following
integrals:
∫ 1
(a) (9t+2) 10 dt z=9t+2
0
∫ 5
(b) (−t+1) 6 dt z=−t+1
3
∫ 3
(c) (4x−1) 27 dx z=4x−1
∫6
√3t+1dt
(d) z=3t+1
∫
(e) (9y−2) 17 dy z=9y−2
∫ 2 3
(f) dz
0 (2z +5) 6
∫
y=2z+5
(g) t 2 sin(t 3 )dt z =t 3
∫
(h) x 2 e x3 +1 dx z =x 3 +1
∫ 0.5
(i) sin(2t)e cos(2t) dt z = cos(2t)
∫0
π
(j) sintcos 2 tdt z=cost
∫0
(k) cost √ sintdt z=sint
2 Useintegration byparts to find
∫ π/2
∫ π/2
(a) e 2x cosxdx (b) e 2x sinxdx
0
0
3 Find
∫ lnt
dt
t
using
(a)integrationby parts
(b)the substitutionz = lnt.
4 TheintegralI n isgiven by
∫ π/2
I n = sin n θ dθ
0
(a) StateI n−2 .
(b) Show
I n = n −1
n I n−2
(c) EvaluateI 0 ,I 1 ,I 2 andI 3 .
5 Evaluate
∫
t 2
(a)
t 3 +1 dt
Review exercises 14 469
∫ π/3
(b) sintcostdt
0
∫ 3 4
(c) dt
1 e2t ∫
3x−7
(d)
(x −2)(x −3)(x −4) dx
∫
e x
(e)
e x +1 dx
6 Evaluate
∫ 1 3
(a)
0 (e t ) 2 +sintcos2 tdt
∫ 1
(b) 4t 2 e t3 +t(1+t 2 ) 12 dt
0
∫ π
(c) sin 2 ωt +cos 2 ωt + ωdt
0
ω constant
∫ 2 1+t+t 2
(d)
1 t(1+t 2 ) dt
∫ 1
(e) (t+e t )sintdt
0
∫ 3 1+4x
(f)
1 2x +4x 2 dx
7 Calculatethe area undery(x) = 1 +2e2x
x+e 2x fromx=1
tox=3.
8 Evaluate the following integrals:
(a)
∫
(−2t +0.1) 4 dt
∫
(b) (1+x)sinxdx
∫ 2
(c) xsin(1 +x)dx
1
∫ 6 t
(d) √
3 t 2 +1 dt
∫
1
(e)
t 3 +2t 2 +t dt
∫ 5 1
(f)
1 1+e tdt
9 Find
∫
(a)
cost
10+sint dt
∫
1
(c)
t(1+lnt) dt
∫ 1+lnx
(e)
xlnx dx
∫
10 Find
x 3 e x2 dx.
∫ 2sintcost
(b)
1+sin 2 dt
t
∫
(d) 1
e t (1+e −t ) dt
Solutions
1 (a) 2.8819 ×10 9
(b) 2322
(c) −1.2 ×10 36
(d) 2 9 (3t +1)3/2 +c
(9y −2) 18
(e) +c
162
(f) 9.092 ×10 −5
(g) − 1 3 cos(t3 )+c
+1
(h) ex3 +c
3
(i) 0.5009
2
(j)
3
(k) 2 3 (sint)3/2 +c
2 (a) 4.2281 (b) 9.4563
3
(lnt) 2
+c
2
∫ π/2
4 (a) I n−2 = sin n−2 θ dθ
0
π
(c)
2 ,1,0.7854,0.6667
1
5 (a)
3 ln|t3 +1|+c
3
(b)
8
(c) 0.2657
(d) − 1 2 ln|x−2|−2ln|x−3|+5 2 ln|x−4|+c
(e) ln|e x +1|+c
6 (a) 1.5778 (b) 317.3 (c) π(1 + ω)
(d) 1.0149 (e) 1.2105 (f) 0.9730
7 3.8805
(−2t +0.1)5
8 (a) − +c
10
(b) −(1+x)cosx+sinx+c
(c) 0.7957
470 Chapter 14 Techniques of integration
(d) 2.9205
(e) ln|t|−ln|t+1|+ 1
t +1 +c
(f) 0.3066
9 (a) ln(sint +10) +c
(b) ln(sin 2 t +1) +c
(c) ln(1+lnt)+c
(d) −ln(1+e −t )+c
(e) ln(xlnx) +c
10 ex2 (x 2 −1)
2
+c
15 Applicationsof
integration
Contents 15.1 Introduction 471
15.2 Averagevalueofafunction 471
15.3 Rootmeansquarevalueofafunction 475
Reviewexercises15 479
15.1 INTRODUCTION
Currents and voltages often vary with time. Engineers may wish to know the average
valueofsuchacurrentorvoltageoversomeparticulartimeinterval.Theaveragevalue
of a time-varying function is defined in terms of an integral. An associated quantity is
therootmeansquare(r.m.s.)valueofafunction.Ther.m.s.valueofacurrentisusedin
the calculation of the power dissipated by a resistor.
15.2 AVERAGEVALUEOFAFUNCTION
Suppose f (t)isafunction defined ona t b. The area,A, under f isgiven by
A =
∫ b
a
f dt
A rectangle with base spanning the interval [a,b] and heighthhas an area ofh(b −a).
Supposetheheight,h,ischosensothattheareaunder f andtheareaoftherectangleare
equal. This means
h(b−a)=
h =
∫ b
a
f dt
∫ b
a f dt
b −a
Thenhiscalledtheaverage valueofthefunctionacrosstheinterval[a,b] andisillustrated
inFigure 15.1:
average value =
∫ b
a f dt
b −a
472 Chapter 15 Applications of integration
f (t)
h
a b t
Figure15.1
Thearea underthe curve fromt =atot =bandthe
area ofthe rectangle are equal.
Example15.1 Findthe average value of f (t) =t 2 across
(a) [1,3]
(b) [2,5]
Solution (a) average value =
(b) average value =
∫ 3
1 t2 dt
3 −1 = 1 [ ] t
3 3
2 3
1
∫ 5
2 t2 dt
5 −2 = 1 [ ] t
3 5
3 3
2
= 13
3
= 13
Example15.1showsthatiftheintervalofintegrationchangesthentheaveragevalueof
a functioncan change.
Engineeringapplication15.1
Saw-toothwaveform
Recall from Engineering application 2.2 that engineers frequently make use of the
saw-toothwaveform. Consider the saw-toothwaveform shown inFigure 15.2.
y
5
–2 0 2
4 6 8 t
Figure15.2
Asaw-tooth waveform.
Calculate the average value of thiswaveform over a complete period.
15.2 Average value of a function 473
Solution
We first need to obtain an equation for the waveform. We choose the interval
0t<2.
The general equation for a straightlineis
v=mt+c
Whent=0thenv=0.So
0=0+c
c = 0
Whent=2thenv=5.So
Hence
5 =m(2)
m=2.5
v = 2.5t
The average value isgiven by
∫ 2
[ 2.5t
2
] 2
0
v av
= 1 2.5tdt = 1 2 0 2 2
v av
= 1 ( ) 2.5×4
−0 = 10 2 2 4 =2.5V
Engineeringapplication15.2
Athyristorfiringcircuit
Figure 15.3 shows a simple circuit to control the voltage across a load resistor,R L
.
This circuit has many uses, one of which is to adjust the level of lighting in a room.
Thecircuithasana.c.powersupplywithpeakvoltage,V S
.Themaincontrolelement
is the thyristor. This device is similar in many ways to a diode. It has a very high
resistance when it is reverse biased and a low resistance when it is forward biased.
However,unlikeadiode,thislowresistancedependsonthethyristorbeing‘switched
on’ by the application of a gate current. The point at which the thyristor is switched
on can be varied by varying the resistor,R G
. Figure 15.4 shows a typical waveform
ofthe voltage, v L
, across the loadresistor.
The point at which the thyristor is turned on in each cycle is characterized by
the quantity αT, where 0 α 0.25 and T is the period of the waveform. This
restrictionon αreflectsthefactthatifthethyristorhasnotturnedonwhenthesupply
voltage has peaked inthe forward direction thenitwill never turnon.
Calculate the average value of the waveform over a period and comment on the
result.
➔
474 Chapter 15 Applications of integration
y L
~
V S
y L R L
Thyristor Gate R G
V S
aT aT aT
t
Figure15.3
A thyristorfiringcircuit.
Figure15.4
Load voltage waveform.
Solution
The average value of load voltage is
∫
1 T
v
T L
dt= 1 ∫ T/2
( ) 2πt
V
T S
sin dt
T
0
= V S
T
αT
T
2π
[
−cos
( 2πt
T
)] T/2
αT
= V S
(1 +cos2πα)
2π
If α = 0, then the average value is V S
/π, the maximum value for this circuit. If
α = 0.25,thentheaveragevalueisV S
/2πwhichillustratesthatdelayingtheturning
on ofthe thyristor reduces the average value of the load voltage.
EXERCISES15.2
1 Calculate the averagevalue ofthe given functions
across the specified interval:
(a) f (t) = 1 +t across [0,2]
(b) f (x) = 2x −1 across [−1,1]
(c) f (t) =t 2 across [0,1]
(d) f (t) =t 2 across [0,2]
(e) f(z) =z 2 +zacross[1,3]
2 Calculate the averagevalue ofthe given functions
over the specified interval:
(a) f (x) =x 3 across[1,3]
(b) f(x) = 1 x across[1,2]
(c) f(t) = √ t across [0,2]
3
(d) f (z) =z 3 −1 across [−1,1]
(e) f(t) = 1 across[−3, −2]
t2 Calculate the average value ofthe following:
[ ]
(a) f (t) = sint across
0, π 2
(b) f (t) = sint across[0, π]
(c) f (t) = sin ωt across [0,π]
[ ]
(d) f (t) = cost across
0, π 2
(e) f (t) = cost across [0,π]
(f) f (t) = cos ωt across [0, π]
(g) f (t) = sin ωt +cos ωt across [0,1]
4 Calculate the average value ofthe following
functions:
(a) f(t) = √ t +1 across[0,3]
(b) f (t) = e t across[−1,1]
(c) f (t) = 1 +e t across[−1,1]
15.3 Root mean square value of a function 475
Solutions
1 (a) 2 (b) −1 (c)
1
3
(d)
2 (a) 10 (b) 0.6931 (c) 0.9428
1
(d) −1 (e)
6
2 2
3 (a) (b)
π π
4
3
(e)
19
3
1
(c)
πω [1 −cos(πω)] (d) 2
π
sin(πω)
(e) 0 (f)
πω
1+sinω−cosω
(g)
ω
14
4 (a) (b) 1.1752 (c) 2.1752
9
15.3 ROOTMEANSQUAREVALUEOFAFUNCTION
If f (t)isdefinedon [a,b], theroot mean square (r.m.s.)value is
r.m.s. =
√ ∫b
a (f(t))2 dt
b −a
Example15.2 Findthe r.m.s. valueof f (t) =t 2 across[1, 3].
Solution r.m.s. =
√ ∫ 3
1 (t2 ) 2 dt
3 −1
=
√ ∫ 3
1 t4 dt
2
=
√ √
[t 5 /5]
1
3 242
=
2 10 = 4.92
Example15.3 Calculate the r.m.s. valueof f (t) =Asint across [0,2π].
√ ∫ 2π
A
0 2 sin 2 tdt
Solution r.m.s. =
2π
√
A ∫ 2 2π
(1 −cos2t)/2dt
0
=
2π
√
[
A
=
2
t − sin2t ] 2π
4π 2
0
√
A2 2π
=
4π = √ A = 0.707A
2
Thus the r.m.s.value is0.707 ×the amplitude.
476 Chapter 15 Applications of integration
Example15.4 Calculate the r.m.s.value of f (t) =Asin(ωt + φ) across [0,2π/ω].
√ ∫ 2π/ω
A
0
Solution 2 sin 2 (ωt + φ)dt
r.m.s. =
2π/ω
√
∫
A
=
2 ω 2π/ω
1−cos2(ωt +φ)dt
4π 0
√
[
]
A
=
2 ω sin2(ωt + φ) 2π/ω
t −
4π 2ω
0
=
√
(
A 2 ω 2π
4π ω
−
sin2(2π + φ)
2ω
+ sin2φ )
2ω
Now sin2(2π + φ) = sin(4π + 2φ) and since sin(t + φ) has period 2π we see that
sin(4π +2φ) = sin2φ. Hence,
√ √
A2 ω 2π A
r.m.s. =
4π ω = 2
2 = √ A = 0.707A
2
Notethatsin(ωt + φ)hasperiod2π/ω.TheresultofExample15.4illustratesageneral
result:
The r.m.s. value of any sinusoidal waveform taken across an interval of length one
periodis
0.707 ×amplitudeofthe waveform
Root mean square value is an effective measure of the energy transfer capability of a
time-varyingelectricalcurrent.Toseewhythisisso,considerthefollowingEngineering
application.
Engineeringapplication15.3
Average power developed across a resistor by a time-varying
current
Consideracurrenti(t)whichdevelopsapowerp(t)inaloadresistorR.Thiscurrent
flowsfromtimet =t 1
totimet =t 2
.LetP av
betheaveragepowerdissipatedbythe
resistor during the time interval [ ]
t 1
,t 2 . We require that total energy transfer,E,be
the same inboth cases. So wehave
E =P av
(t 2
−t 1
)=
∫ t2
t 1
p(t)dt
15.3 Root mean square value of a function 477
Now
and so
p(t) = (i(t)) 2 R
P av
(t 2
−t 1
)=
∫ t2
t 1
i 2 Rdt
Ifwenowconsidertheaveragepowerdissipatedbytheresistortobetheresultofan
effective currentI eff
then wehave
I 2 eff R(t 2 −t 1 )= ∫ t2
I 2 eff (t 2 −t 1 )= ∫ t2
t 1
i 2 Rdt
t 1
i 2 dt
∫ t2
i 2 dt
I 2 eff = t 1
t 2
−t 1
∫ t2
√
i
t 2 dt
I eff
=
1
t 2
−t 1
We see that the equivalent direct current is the r.m.s. value of the time-varying
current.
Engineeringapplication15.4
Averagevalueandr.m.s.valueofaperiodicwaveform
Consider the periodicwaveform shown inFigure 15.5.
Thecurrenti(t) is
i(t) =20e −t 0 t <10, period T =10
(a) Calculate the average valueofthe currentover a complete period.
(b) Calculate the r.m.s. valueofthe currentover a complete period.
i(t)
20
0 10 20
30 t
Figure15.5
Waveform forEngineering application
15.4.
➔
478 Chapter 15 Applications of integration
Solution
(a) As the waveform isperiodic, we need only consider the interval 0 t < 10.
I av
= 1 T
∫ T
∫ t2
√
i
t 2 dt
(b) I eff
=
1
t 2
−t 1
Here
0
i(t)dt = 1
10
∫ 10
0
20e −t dt
t 1
=0 t 2
=10 i(t)=20e −t
Firstwe evaluate
So
∫ t2
t 1
i 2 dt=
∫ 10
0
[ e
−2t
= 400
−2
√
200.00
I eff
=
10−0
= 4.4721 A
= 1
10 [−20e−t ] 10
0
= 1
10 (−20e−10 +20e 0 )
= 20
10 (1 −e−10 ) = 2 ×0.99995 = 2.000A
400e −2t dt
] 10
= 400
−2 (e−20 −e 0 )
= 200(1 −e −20 ) = 200.00
0
EXERCISES15.3
1 Calculate the r.m.s.values ofthe functions in
Question1in Exercises15.2.
2 Calculate the r.m.s.values ofthe functions in
Question2in Exercises15.2.
3 Calculate the r.m.s.values ofthe functionsin
Question3in Exercises15.2.
4 Calculate the r.m.s.values ofthe functionsin
Question4in Exercises15.2.
Solutions
1 (a) 2.0817 (b) 1.5275 (c) 0.4472
(d) 1.7889 (e) 6.9666
2 (a) 12.4957 (b) 0.7071 (c) 1
(d) 1.0690 (e) 0.1712
3 (a) 0.7071
(b) 0.7071
√
1
(c)
2 − sinπωcosπω
2πω
Review exercises 15 479
(d) 0.7071
(e) 0.7071
√
1
(f)
2 + sinπωcosπω
2πω
(g)
√
1 + sin2 ω
ω
4 (a) 1.5811 (b) 1.3466 (c) 2.2724
REVIEWEXERCISES15
1 Find the average value ofthe following functions
acrossthe specified interval:
(a) f (t) = 3 −t across[0,4]
(b) f(t) =t 2 −2across[1,3]
(c) f(t)=t+ 1 across [1,4]
t
(d) f(t) = √ t +1 across [0,4]
(e) f (t) =t 2/3 across [0,1]
2 Calculatethe average value ofthe following:
[ ]
(a) f (t) = 2sin2t across 0, π 2
[ ]
(b) f (t) =Asin4t across
0, π 2
(c) f (t) = sint +cost across [0,π]
( ) [ ]
t
(d) f(t) =cos across 0, π 2 2
(e) f (t) = sintcost across [0, π]
3 Calculate the averagevalue ofthe following
functions:
(a) f (t) =Ae kt across [0,1]
(b) f(t) = 1
e 3t across [0,2]
(c) f (t) = 3 −e −t across [1,3]
(d) f (t) = e t +e −t across[0,2]
(e) f (t) =t +e t across[0,2]
4 Find the average andr.m.s.values of
Acost +Bsint across
(a) [0,2π]
(b) [0, π]
5 Find the r.m.s.values ofthe functions
in Question1.
6 Find the r.m.s.values ofthe functionsin
Question2.
7 Find the r.m.s.values ofthe functions
in Question3.
Solutions
1 (a) 1
7
(b)
3
(c) 2.9621
7 3
(d) (e)
3 5
4
2
2 (a) (b) 0 (c)
π
π
(d) 0.9003
(
(e) 0
)
e k −1
3 (a)A
k
(b) 0.1663
(c) 2.8410 (d) 3.6269
(e) 4.1945
4 (a) average = 0, r.m.s. =
√
A 2 +B 2
2
(b) average = 2B π ,r.m.s. = √
A 2 +B 2
5 (a) 1.5275 (b) 3.2965 (c) 3.0414
(d) 2.3805 (e) 0.6547
6 (a) 1.4142 (b)
2
A
√
2
(c) 1
(d) 0.9046 (e) 0.3536
√
e 2k −1
7 (a)A (b) 0.2887
2k
(c) 2.8423 (d) 3.9554
(e) 4.8085
16 Furthertopicsin
integration
Contents 16.1 Introduction 480
16.2 Orthogonalfunctions 480
16.3 Improperintegrals 483
16.4 Integralpropertiesofthedeltafunction 489
16.5 Integrationofpiecewisecontinuousfunctions 491
16.6 Integrationofvectors 493
Reviewexercises16 494
16.1 INTRODUCTION
Thischapterexaminessomefurthertopicsinintegration.Orthogonalfunctionsareintroduced
in Section 16.2. These functions are used extensively in Fourier analysis
(see Chapter 23). Some integrals have one or two infinite limits of integration, or have
an integrand which becomes infinite at particular points in the interval of integration.
Such integrals are termed ‘improper’ and require special treatment. They are used extensivelyinthetheoryofLaplaceandFouriertransforms.TheDiracdeltafunction,
δ(t),
hasbeenintroducedinChapter2.TheintegralpropertiesareexaminedinSection16.4.
The chapter concludes with the integration of piecewise continuous functions and the
integration of vectors.
16.2 ORTHOGONALFUNCTIONS
Two functions f (x)andg(x) aresaidtobe orthogonal over the interval [a,b] if
∫ b
a
f(x)g(x)dx =0
Toshowthattwofunctionsareorthogonalwemustdemonstratethattheintegraloftheir
product over the interval of interest iszero.
[
Example16.1 Show that f (x) =xandg(x) =x −1 areorthogonal on 0, 3 ]
.
2
16.2 Orthogonal functions 481
Solution
] 3/2
∫ 3/2 ∫ 3/2
[ x
x(x−1)dx= x 2 3
−xdx=
0
0 3 − x2
2
0
[
Hence f andgare orthogonal over the interval 0, 3 ]
.
2
= 9 8 − 9 8 = 0
Clearlyfunctionsmaybeorthogonaloveroneintervalbutnotorthogonaloverothers.
For example,
∫ 1
0
x(x−1)dx≠0
and soxandx −1 arenotorthogonal over [0,1].
Example16.2 Show f (t) = 1,g(t) = sint andh(t) = cost aremutually orthogonal over [−π,π].
Solution We are required toshow thatany pairoffunctions isorthogonal over [−π,π].
∫ π
−π
1sintdt =[−cost] π −π = −cosπ +cos(−π)
∫ π
−π
= −(−1) + (−1) =0
1costdt =[sint] π −π = sinπ −sin(−π) =0
Usingthe trigonometricidentitysin2A = 2sinAcosA, wecan write
∫ π
−π
sintcostdt =
∫ π
−π
[
1 cos(2t)
2 sin(2t)dt = − 4
cos(2π) −cos(−2π)
= − = 0
4
] π
−π
Hence the functions 1,sint, cost form an orthogonal set over [−π,π].
ThesetofExample 16.2 maybeextendedto
{1,sint,cost,sin(2t),cos(2t),sin(3t),cos(3t),...,sin(nt),cos(nt)}
n ∈ N
Example16.3 Verifythat {1,sint,cost,sin(2t),cos(2t),...} formsanorthogonal setover [−π,π].
Solution Supposen,m ∈ N. We must show that all combinations of 1,sinnt, sinmt, cosnt and
cosmt are orthogonal.
∫ π
[ −cos(nt) −cos(nπ) +cos(−nπ)
1sin(nt)dt =
= = 0
n n
−π
] π
−π
482 Chapter 16 Further topics in integration
Inasimilarmanner, itiseasytoshow
∫ π
1cos(nt)dt = 0
−π
Also,usingthe trigonometric identities inSection 3.6
∫ π
=
−πcos(nt)sin(mt)dt 1 ∫ π
sin(n +m)t −sin(n −m)tdt
2 −π
We have seen that ∫ π
−πsinnt dt = 0 for any n ∈ N. Noting that (n +m) ∈ N and
(n −m) ∈ N,weseethat
∫ π
−π
sin(n +m)t −sin(n −m)tdt =0
Itis left as anexercise for the reader toshow that
∫ π
−π
∫ π
−π
sinntsinmtdt =0 n≠m
cosntcosmtdt =0
n≠m
The functions thus form an orthogonal set across [−π,π].
TheresultofExample 16.3 can beextended:
{1,sint,cost,sin2t,cos2t,...}isanorthogonalsetoveranyintervaloflength2π.
More generally:
{
1,sin
( 2πt
T
)
,cos
( 2πt
T
)
,sin
( 4πt
T
)
,cos
( 4πt
T
) }
,...
isanorthogonalsetoveranyintervaloflengthT.Inparticular,thesetisorthogonal
[
over [0,T] and − T ]
2 ,T .
2
These results areused extensively inFourier analysis.
Example16.4 Find
(a)
(b)
∫ π
−π
∫ π
−π
sin 2 (nt)dt n∈ Z n ≠0
cos 2 (nt)dt n∈ Z n ≠0
Solution (a) We use the trigonometric identity
(b)
sin 2 nt = 1 −cos2nt
2
toget
∫ π
sin 2 (nt)dt =
−π
∫ π
−π
1 −cos2nt
2
dt =
using the orthogonal properties of cos(nt).
∫ π
−π
cos 2 (nt)dt =
∫ π
−π
1 −sin 2 (nt)dt
=2π−π=π
∫ π
−π
16.3 Improper integrals 483
1
2 dt=π
It is a simple extension to show that integrating sin 2 (nt) or cos 2 (nt) over any interval
of length 2π yields the same result, namely π. It is also possible to extend the result of
Example 16.4 toshow
∫ T/2
−T/2
sin 2 ( 2nπt
T
)
dt =
Finally, integrating sin 2 ( 2nπt
T
the same result,thatis T 2 .
∫ T/2
−T/2
)
( ) 2nπt
cos 2 dt = T T 2
)
and cos 2 ( 2nπt
T
n∈Z
n≠0
over any interval of lengthT gives
EXERCISES16.2
1 Show f (x) =x 2 andg(x) = 1 −xareorthogonal
[ ]
across
0, 4 3
.
2 Show f(x) = 1 x andg(x) =x2 are orthogonal over
[−k,k].
3 (a)Showf(t)=1−tandg(t)=1+tare
orthogonalover [0, √ 3].
(b)Findanotherinterval over which f (t)andg(t)are
orthogonal.
4 Show f (t) = e t andg(t) = 1 −e −2t are orthogonal
across [−1,1].
5 Show f(x) = √ xandg(x)=1− √ x are orthogonal
[ ]
on 0, 16 .
9
Solutions
3 (b) [− √ 3,0]
16.3 IMPROPERINTEGRALS
There aretwo caseswhen evaluation ofanintegral needs specialcare:
(1) one,orboth,ofthe limits ofanintegral areinfinite;
(2) the integrand becomesinfiniteatone,ormore,points ofthe interval ofintegration.
484 Chapter 16 Further topics in integration
If either (1) or (2) is true the integral is called an improper integral. Evaluation of
improper integrals involves the use of limits.
∫ ∞
1
Example16.5 Evaluate
2 t dt. 2
Solution
∫ ∞
2
1
[−
t dt = 1 ] ∞ 2 t
2
To evaluate − 1 atthe upper limitwe consider lim
t −1 . Clearly the limitis0.Hence,
t→∞ t
∫ ∞
1
(−
t dt=0− 1 )
= 1 2 2 2
2
∫ 1
Example16.6 Evaluate e 2x dx.
−∞
Solution
∫ 1
−∞
[ e
e 2x 2x
dx =
2
] 1
−∞
We need toevaluate lim
x→−∞
∫ 1
−∞
e 2x
. This limitis 0.So,
2
[ ] e
e 2x 2x 1
dx = = e2
2
−∞
2 −0=3.69
Engineeringapplication16.1
Capacitorsinseries
Engineers are often called upon to simplify an electronic circuit in order to make
it easier to analyse. One of the arrangements frequently met is that of two or more
capacitors connected together in series. It is useful to be able to replace this configuration
by a single capacitor with a capacitance value equivalent to that of the
original capacitors in series. Derive an expression for the equivalent capacitance of
two capacitors connected togetherinseries (see Figure 16.1).
Solution
In Engineering application 13.2 we obtained an expression for the voltage across a
capacitor. Thiswas
v = 1 ∫
idt
C
16.3 Improper integrals 485
i
C 1 C 2
y 1
y
y 2
Figure16.1
Two capacitors
connected in series.
This can be written as a definite integral to give the voltage expression across the
capacitor atageneral pointintime,t. The expression is
v = 1 C
∫ t
−∞
idt
Now consider the situation depicted in Figure 16.1. Writing an equation for each of
the capacitors gives
v 1
= 1 C 1
∫ t
−∞
idt
v 2
= 1 C 2
∫ t
By Kirchhoff’s voltage law, v = v 1
+ v 2
and so
v = 1 ∫ t
idt+ 1 ∫ t
( 1
idt = + 1 ) ∫ t
idt
C 1 −∞ C 2 −∞ C 1
C 2 −∞
−∞
idt
Therefore,thetwocapacitorscanbereplacedbyanequivalentcapacitance,C,given
by
so that
1
C = 1 C 1
+ 1 C 2
= C 1 +C 2
C 1
C 2
C = C 1 C 2
C 1
+C 2
The result is easily generalised for more than two capacitors. For example, for three
capacitors wehave
1
C = 1 + 1 + 1 C 1
C 2
C 3
so that
C
C = 1
C 2
C 3
C 2
C 3
+C 1
C 3
+C 1
C 2
You may wish toprove this result.
∫ ∞
2
Example16.7 Evaluate
3 2t+1 − 1 t dt.
Solution
∫ ∞
3
2
2t+1 − 1 t dt =[ln|2t +1| −ln |t|]∞ 3
[
] = ln
2t+1
∞ [
∣ t ∣ = ln
∣ 2 + 1 3
t ∣
[ (
= lim ln 2 + 1 )]
−ln 7
t→∞ t 3
] ∞
3
=ln2−ln 7 3 = ln 6 7 = −0.1542
486 Chapter 16 Further topics in integration
Example16.8 Evaluate
∫ ∞
1
sintdt.
Solution
∫ ∞
1
sintdt =[−cost] ∞ 1
Nowlim t→∞
(−cost)doesnotexist,thatisthefunctioncost doesnotapproachalimit
ast → ∞, and so the integral cannot be evaluated. We say the integraldiverges.
∫ 1
1
Example16.9 Evaluate √ dx.
0 x
1
Solution Theintegrand, √ ,becomesinfinitewhenx = 0,whichisintheintervalofintegration.
x
The point x = 0 is ‘removed’ from the interval. We consider
slightly greater than 0,and then letb→0 + . Now,
∫ 1
b
Then,
∫ 1
0
1
√ x
dx = [2 √ x] 1 b =2−2√ b
∫
1 1
√ dx = lim x b→0 + b
1
√ x
dx = lim
b→0 +(2−2√ b)=2
The improper integral exists and has value 2.
∫ 1
b
1
√ x
dx where b is
∫ 2
1
Example16.10 Determinewhether the integral dx exists ornot.
0 x
Solution As in Example 16.9 the integrand is not defined atx = 0, so we consider
b >0andthenletb →0 + .
So,
∫ 2
b
1
x dx = [ln|x|]2 b =ln2−lnb
(∫ 2
)
1
lim
b→0 b x dx = lim(ln2−lnb)
b→0
Since lim b→0
lnbdoes notexistthe integral diverges.
∫ 2
b
1
dx for
x
∫ 2
1
Example16.11 Evaluate dx ifpossible.
−1 x
Solution We ‘remove’
∫
the point x = 0 where the integrand becomes infinite and consider two
b
∫
1
2
integrals:
−1 x dx wherebis slightly smaller than 0, and 1
dx wherecis slightly
c x ∫ 2
largerthan0.Iftheseintegralsexistasb→0 − andc → 0 + 1
then
x dxconverges.If
−1
∫ 2
either of the integrals failstoconverge then
∫ b
−1
1
dx = ln |b| −ln |−1| = ln|b|
x
∫ b
1
lim dx = lim
b→0 − −1 x b→0 −(ln|b|)
This limitfailstoexist and so
∫ 2
−1
−1
1
dx diverges.
x
16.3 Improper integrals 487
1
dx diverges. Now,
x
Engineeringapplication16.2
Energystoredinacapacitor
Acapacitorprovidesausefulmeansofstoringenergy.Thisenergycanbedischarged
by connecting the capacitor in series with a load resistor and closing a switch. The
stored electrical energy is converted into heat energy as a result of electrical current
flowing through the resistor. Consider the circuit shown in Figure 16.2 which
consists of a capacitor,C F, connected in series with a resistor with valueRand
isolatedbymeansofaswitch,S.Wewishtocalculatetheamountofenergystoredin
the capacitor. The switch is closed att = 0 and a current,i, flows in the circuit. We
have already seen in Chapter 2 that for such a case the time-varying voltage across
the capacitordecaysexponentially and isgiven by
v =Ve −t/RC
So,usingOhm’s law
i = v R = Ve−t/RC
R
S
i
C
R
Figure16.2
The capacitor is dischargedbyclosingthe switch.
Now the effect of closing the switch is to allow the energy stored in the capacitor to
be dissipated in the resistor. Therefore, if the total energy dissipated in the resistor
is calculated then this will allow the energy stored in the capacitor to be obtained.
However, the energy dissipation rate, that is power dissipated, is not a constant for
theresistorbutdependsonthecurrentflowingthroughit.Thetotalenergydissipated,
E, isgiven by
E =
∫ ∞
0
P(t)dt
whereP(t) is the power dissipated in the resistor at timet. This equation has been
discussedinEngineeringapplication 13.3.Now,
P=i 2 R= RV2 e −2t/RC
R 2
= V2 e −2t/RC
R
➔
488 Chapter 16 Further topics in integration
Now
E =
∫ ∞
0
= V2 RC
−2R
V 2 e −2t/RC
R
lim
t→∞ e−2t/RC = 0
[
e
−2t/RC ] ∞
0
dt = V2
R
∫ ∞
0
e −2t/RC dt
and so the energy storedinthe capacitor isgiven by
E = CV2
2
Example16.12 Find
∫ ∞
0
e −st sintdt s>0
Solution Using integration by parts, withu = e −st and dv = sint, wehave
dt
∫ ∞
0
e −st sintdt = [ −e −st cost ] ∫ ∞
∞
−s e −st costdt
0
Considerthefirsttermonther.h.s.Weneedtoevaluate [ −e −st cost ] ast → ∞andwhen
t = 0.Notethat −e −st cost → 0ast → ∞becausewearegiventhatsispositive.When
t = 0, −e −st cost evaluates to −1, and so
∫ ∞
0
e −st sintdt =1−s
∫ ∞
0
0
e −st costdt
Integrating by parts for a second time yields
∫ ∞
{ [e
e −st sintdt =1−s
−st sint ] ∫ ∞
∞
+s 0
0
∫ ∞
=1−s 2 e −st sintdt
0
0
}
e −st sintdt
because [ e −st sint ] ∞
0 evaluatestozeroatbothlimits.Atthisstagethereadermightsuspectthatwehavegonearoundinacircleandstillneedtoevaluatetheoriginalintegral.
However, somealgebraic manipulation yields the required result.We have
∫ ∞
0
∫ ∞
e −st sintdt +s 2 e −st sintdt = 1
(1+s 2 )
0
∫ ∞
0
∫ ∞
0
e −st sintdt = 1
e −st sintdt = 1
1+s 2
16.4 Integral properties of the delta function 489
EXERCISES16.3
1 Evaluate, ifpossible,
∫ ∞
(a) e −t dt
0
∫ ∞
(b) e −kt dt k is aconstant,k > 0
0
∫ ∞ 1
(c)
1 x dx
∫ ∞ 1
(d)
1 x 2 dx
∫ 3 1
(e)
1 x −2 dx
2 Evaluate the following integrals wherepossible:
∫ 4
∫
3 3
(a)
0 x −2 dx (b) 1
0 x −1 + 1
x −2 dx
∫ 2
∫
1
∞
(c)
0 x 2 −1 dx (d) sin3tdt
0
∫ 3
(e) xe x dx
−∞
3 Find
∫ ∞
e −st costdt s>0
0
Solutions
1
1 (a) 1 (b) (c) does not exist
k
(d) 1 (e) does not exist
2 (a) does notexist (b) does notexist
3
(c) does notexist
(e) 2e 3
s
s 2 +1
(d) doesnot exist
16.4 INTEGRALPROPERTIESOFTHEDELTAFUNCTION
Thedeltafunction, δ(t −d),wasintroducedinChapter2.Thefunctionisdefinedtobea
rectanglewhoseareais1inthelimitasthebaselengthtendsto0andastheheighttends
toinfinity.Sometimesweneedtointegratethedeltafunction.Inparticular,weconsider
the improper integral
∫ ∞
−∞
δ(t −d)dt
Theintegral gives the area under the functionand this isdefinedtobe1.Hence,
∫ ∞
−∞
δ(t−d)dt =1
InChapter21 weneedtoconsiderthe improper integral
∫ ∞
−∞
f(t)δ(t −d)dt
where f (t) is some known function of time. The delta function δ(t −d) is zero everywhere
except att =d. Whent =d, then f (t)has a value f (d).Hence,
∫ ∞
−∞
f(t)δ(t −d)dt =
∫ ∞
−∞
f(d)δ(t −d)dt = f(d)
∫ ∞
−∞
δ(t −d)dt
490 Chapter 16 Further topics in integration
∫ ∞
since f (d)is a constant. But
∫ ∞
−∞
∫ ∞
−∞
The result
∫ ∞
−∞
−∞
f(t)δ(t −d)dt = f(d)
∫ ∞
−∞
δ(t−d)dt=1
f(t)δ(t −d)dt=f(d)
f(t)δ(t −d)dt = f(d)
δ(t −d)dt =1andhence
is known as the sifting property of the delta function. By multiplying a function, f (t),
by δ(t −d) and integrating from −∞ to ∞ we siftfrom the function the value f (d).
Example16.13 Evaluate the following integrals:
(a)
∫ ∞
−∞
Solution (a) We use
∫ ∞
t 2 δ(t −2)dt
−∞
(b)
∫ ∞
f(t)δ(t −d)dt = f(d)
withf(t) =t 2 andd = 2.Hence
∫ ∞
−∞
0
e t δ(t −1)dt
t 2 δ(t −2)dt=f(2)=2 2 =4
(b) We note thatthe expression e t δ(t −1) is0everywhere except att = 1.Hence
∫ ∞
0
e t δ(t−1)dt =
∫ ∞
−∞
e t δ(t −1)dt
EXERCISES16.4
Using ∫ ∞
f(t)δ(t −d)dt = f(d)
−∞
with f (t) = e t andd = 1 gives
∫ ∞
0
e t δ(t−1)dt =f(1)=e 1 =e
1 Evaluate
∫ ∞
(a) e t δ(t)dt
−∞
∫ ∞
(b) e t δ(t −4)dt
−∞
∫ ∞
(c) e t δ(t +3)dt
−∞
∫ ∞
(d)4 t 2 δ(t −3)dt
−∞
16.5 Integration of piecewise continuous functions 491
∫ ∞ (1+t)δ(t −1)
(e) dt
−∞ 2
∫ ∞
(f) e −kt δ(t)dt
−∞
∫ ∞
(g) e −kt δ(t −a)dt
−∞
∫ ∞
(h) e −k(t−a) δ(t −a)dt
−∞
2 Evaluate the following integrals:
∫ ∞
(a) (sint)δ(t −2)dt
−∞
∫ ∞
(b) e −t δ(t +1)dt
−∞
∫ 0
(c) e −t δ(t +3)dt
−∞
∫ 10
(d) x 3 δ(x−2)dx
0
∫ 1
(e) x 2 δ(x+2)dx
−1
3 Evaluate the following:
∫ ∞
(a) t(sin2t)δ(t −3)dt
−∞
∫ ∞
(b) δ(t+1)−δ(t−1)dt
0
∫ ∞
(c) δ(t −d)dt
0 ∫ a
(d) δ(t −d)dt
−a
Solutions
1 (a) 1 (b) 54.5982
(c) 4.9787 ×10 −2 (d) 36
(e) 1 (f) 1
(g) e −ak (h) 1
2 (a) 0.9093
(b) 2.7183
(c) 20.0855
(d) 8
(e) 0
3 (a) −0.8382
(b) −1
(c) 1 ifd 0,0otherwise
(d) 1 if −a d a,0otherwise
16.5 INTEGRATIONOFPIECEWISECONTINUOUSFUNCTIONS
Integration of piecewise continuous functions is illustrated in Example 16.14. If a discontinuity
occurs within the limits of integration then the interval is divided into subintervals
so thatthe integrand iscontinuouson eachsub-interval.
Example16.14 Given
f(t)=
{
2 0 t<1
t 2
evaluate ∫ 3
0 f(t)dt.
1<t3
Solution The function, f (t), is piecewise continuous with a discontinuity att = 1. The function
is illustrated in Figure 16.3. The discontinuity occurs within the limits of integration.
We split the interval ofintegration atthe discontinuitythus:
∫ 3
0
f(t)dt =
∫ 1
0
f(t)dt+
∫ 3
1
f(t)dt
492 Chapter 16 Further topics in integration
f (t)
2
1 3 t
Figure16.3
Thefunction f (t) =
{ 2 0t<1
t 2 1<t 3.
On the intervals (0, 1) and (1, 3), f (t)iscontinuous, so
∫ 3
0
fdt=
=
∫ 1
0
[
2t
2dt+
] 1
0
∫ 3
1
t 2 dt
[ ] t
3 3
+ = 2 +
3
1
[
9 − 1 ]
= 32 3 3
EXERCISES16.5
1 Given
⎧
⎪⎨
3 −1t<1
f(t)= 2t 1t2
⎪⎩
t 2 2<t3
evaluate
∫ 1 ∫ 1.5
(a) f dt (b) f dt
−1
−0.5
∫ 2.5 ∫ 3
(c) f dt (d) f dt
0
−1
2 Given
⎧
⎨3t
0t<3
g(t) = 15−2t 3t<4
⎩
6 4t6
evaluate
∫ 2 ∫ 4
(a) g(t)dt (b) g(t)dt
0 2
∫ 5
∫ 6
(c) g(t)dt (d) g(t)dt
3
0
∫ 4.5
(e) g(t)dt
3.5
3 Givenu(t)isthe unitstepfunction, evaluate
∫ 4
(a) u(t)dt
0
∫ 2
(b) u(t)dt
−3
∫ 4
(c) 2u(t +1)dt
−2
∫ 2
(d) tu(t)dt
−1
∫ 4
(e) e kt u(t −3)dt k constant
0
Solutions
1 (a) 6 (b) 5.75
(c) 8.5417 (d) 15.3333
2 (a) 6 (b) 15.5 (c) 14
(d) 33.5 (e) 6.75
3 (a)4 (b)2
(c) 10 (d) 2
e 4k −e 3k
(e)
k
16.6 Integration of vectors 493
16.6 INTEGRATIONOFVECTORS
Ifavectordependsupontimet,itisoftennecessarytointegrateitwithrespecttotime.
Recall that i, j and k are constant vectors and are treated thus in any integration. Hence
the integral
∫
I = (f(t)i+g(t)j+h(t)k)dt
issimplyevaluated as three scalar integrals, and so
( ∫ ) ( ∫ ) ( ∫ )
I = f(t)dt i + g(t)dt j + h(t)dt k
Example16.15 Ifr = 3ti +t 2 j + (1 +2t)k, evaluate ∫ 1
0 rdt.
Solution
∫ 1
0
( ∫ ) (
1 ∫ ) (
1 ∫ )
1
rdt= 3tdt i + t 2 dt j + 1+2tdt k
0
[ 3t
2
=
2
0
] 1 [ ] t
3 1
i + j +
0
3
0
0
[
t +t 2 ] 1
0k = 3 2 i + 1 3 j+2k
EXERCISES16.6
1 Givenr = 3sinti−costj+(2−t)k,evaluate ∫ π
0 rdt.
2 Given v = i −3j +k,evaluate
(a)
∫ 1
vdt
0
(b)
∫ 2
vdt
0
3 Thevector, a,is defined by
a=t 2 i+e −t j+tk
Evaluate
∫ 1
(a) adt
0
(b)
∫ 3
adt
2
(c)
∫ 4
adt
1
4 Letaandbbe two three-dimensional vectors. Isthe
following true?
∫ t2
∫ t2
∫ t2
adt× bdt = a×bdt
t 1 t 1 t 1
Recallthat ×denotes the vectorproduct.
Solutions
1 6i +1.348k
2 (a) i−3j+k (b) 2i−6j+2k
3 (a)0.333i +0.632j +0.5k
4 no
(b)6.333i +0.0855j +2.5k
(c)21i +0.3496j +7.5k
494 Chapter 16 Further topics in integration
REVIEWEXERCISES16
1 Show f (x) =x n andg(x) =x m areorthogonalon
[−k,k] ifn +mis an odd number.
2 Show f (t) = sint andg(t) = cost are orthogonal on
[a,a +nπ].
3 Show f (t) = sinht andg(t) = cosht are orthogonal
over [−k,k].
4 Determine the values ofkforwhich ∫ ∞
0 t k dt exists.
5 Evaluate ifpossible
∫ ∞
(a) u(t)dt
∫0
∞
(b) e −1000t dt
−∞
∫ 0 1
(c)
−2 x +1 dx
∫ ∞
(d) u(t)e −t dt
−∞
∫ 2 1
(e)
−2 x 2 −1 dx
(Note thatu(t)is the unitstepfunction.)
6 Find the values ofkforwhich ∫ ∞
0 e kt dt exists.
7 Evaluate
∫ ∞
(a) (t 2 +1)δ(t −1)dt
−∞
∫ ∞
(b) te t δ(t −2)dt
−∞
∫ ∞
(c) (t 2 +t +2)δ(t −1)dt
−∞
∫ ∞ δ(t +2)
(d)
−∞ t 2 +1 dt
∫ ∞
(e) δ(t +1)δ(t +2)dt
−∞
8 (a) Evaluate
∫ ∞
(t 2 +1)δ(2t)dt
−∞
[Hint: substitutez = 2t.]
(b) Show
∫ ∞
f(t)δ(nt)dt = 1 f(0), n>0
−∞ n
9 Evaluate
∫ ∞
(a) (1+t)δ(t −2)dt
0
∫ ∞
(b) tδ(1+t)dt
−∞
∫ ∞
(c) (1+t)δ(−t)dt
−∞
∫ 0
(d) δ(t−6)+δ(t+6)dt
−∞
∫ 4k
(e) δ(t +k)+δ(t +3k)+δ(t +5k)dt
−2k
k > 0
10 Thefunctiong(t)ispiecewise continuous and
defined by
{ 2t 0t1
g(t) =
3 1<t2
Evaluate
∫ 1
(a) g(t)dt
0
∫ 1.5
(b) g(t)dt
0
∫ 1.7
(c) g(t)dt
0.5
∫ 2
(d) g(t)dt
1
∫ 1.5
(e) g(t)dt
1.3
11 Given
⎧
⎨1+t −1t<3
f(t)= t−1 3t4
⎩
0 otherwise
evaluate
∫ 2
(a) f(t)dt
−1
∫ 4
(b) f(t)dt
−1
∫ 5
(c) f(t)dt
0
∫ ∞
(d) f(t)δ(t −2)dt
−∞
∫ ∞
(e) f(t)u(t)dt
−∞
Review exercises 16 495
12 Given
14 Given
⎧
⎨t 2 v(t) =2ti+(3−t 2 )j+t 3 k
−2t2
h(t) = t
⎩
3 find
2<t3
∫ 1
4 3<t5
(a) vdt
0
evaluate
∫
∫ 2
2
(b) vdt
(a) h(t)dt
1
−1
∫
∫ 2
2.5
(c) vdt
(b) h(t)dt
0
0
∫ 4
15 Evaluate the following integrals:
(c) h(t)+1dt
∫
2
3
∫ 4
(a) |t|dt
−1
(d) h(t +1)dt
∫
0
2
∫ 2
(b) |t+2|dt
−3
(e) h(2t)dt
∫
−1
2
13 Ifa =ti−2tj+3tk,find ∫ 1
0 adt. (c) |3t −1|dt
0
Solutions
4 k<−1
5 (a) does notexist
(b) does notexist
(c) does notexist
(d) 1
(e) does notexist
6 k<0
7 (a) 2 (b) 14.7781 (c) 4
(d) 0.2 (e) 0
8 (a) 1 2
9 (a) 3 (b) −1 (c) 1
(d) 1 (e) 1
5
10 (a) 1 (b)
2
(d) 3 (e) 0.6
(c) 2.85
11 (a) 4.5 (b) 10.5 (c) 10
(d) 3 (e) 10
12 (a) 3
(b) 8.4323
(c) 22.25
(d) 26.5833
(e) 12.7917
13 0.5i −j+1.5k
14 (a) i+ 8 3 j + 1 4 k
(b) 3i + 2 3 j + 15
4 k
(c) 4i + 10
3 j+4k
15 (a) 5 (b) 8.5 (c)
13
3
17 Numericalintegration
Contents 17.1 Introduction 496
17.2 Trapeziumrule 496
17.3 Simpson’srule 500
Reviewexercises17 505
17.1 INTRODUCTION
Many functions, for example sinx 2 and ex
, cannot be integrated analytically. Integrationofsuchfunctionsmustbeperformednumerically.Thissectionoutlinestwosimple
x
numericaltechniques--thetrapeziumruleandSimpson’srule.Moresophisticatedones
exist and there are many excellent software packages available which implement these
methods.
17.2 TRAPEZIUMRULE
We wish to find the area undery(x), fromx = a tox = b, that is we wish to evaluate
∫ b
ydx. The required area is divided intonstrips, each of widthh. Note that the width,
a
h, of each strip is given byh = b −a . Each strip is then approximated by a trapezium.
n
AtypicaltrapeziumisshowninFigure17.1.Theareaofthetrapeziumis 1 2 h[y i +y i+1 ].
Summing the areas of all the trapezia will yield an approximation tothe total area:
area oftrapezia = h 2 (y 0 +y 1 )+h 2 (y 1 +y 2 )+···+h 2 (y n−1 +y n )
= h 2 (y 0 +2y 1 +2y 2 +2y 3 +···+2y n−1 +y n )
17.2 Trapezium rule 497
y
{ {
y 0 y n
y i y i +1
x i
a
x i +1 b
h
{
{
x
Figure17.1
Each stripis approximated by atrapezium.
If the number of strips is increased, that ishis decreased, then the accuracy of the approximation
is increased.
Table17.1
Example17.1 Usethe trapezium ruletoestimate
(a) a strip widthof0.2
∫ 1.3
0.5
e (x2) dx using
(b) a strip widthof0.1.
Solution (a) Table 17.1 lists values ofxand correspondingvalues ofe (x2) .
We notethath = 0.2.Usingthe trapeziumrulewefind
x
0.5
y=e (x2 )
1.2840 =y 0
sum ofareasoftrapezia = 0.2 {1.2840 +2(1.6323) +2(2.2479)
2
+2(3.3535) +5.4195}
0.7 1.6323 =y 1
= 2.117
0.9 2.2479 =y 2
Table17.2
1.1 3.3535 =y 3 Hence
1.3 5.4195 =y 4 ∫ 1.3
x y=e (x2 )
0.5
e (x2) dx ≈ 2.117
x 0 = 0.5 1.2840 =y 0
x 1 = 0.6 1.4333 =y 1
x 2 = 0.7 1.6323 =y 2
x 3 = 0.8 1.8965 =y 3
x 4 = 0.9 2.2479 =y 4
x 5 = 1.0 2.7183 =y 5
x 6 = 1.1 3.3535 =y 6
x 7 = 1.2 4.2207 =y 7
x 8 = 1.3 5.4195 =y 8
(b) Table 17.2 listsxvalues and correspondingvalues ofe (x2) .
Using Table17.2, wefind
Hence
sum ofareasoftrapezia = 0.1 {1.2840 +2(1.4333) +2(1.6323) + ···
2
+2(4.2207) +5.4195}
∫ 1.3
0.5
e (x2) dx ≈ 2.085.
= 2.085
Dividingtheinterval[0.5,1.3]intostripsofwidth0.1resultsinamoreaccurateestimate
thanusingstripsofwidth0.2.
498 Chapter 17 Numerical integration
Engineeringapplication17.1
Distancetravelledbyarocket
A rocket is released and travels at a variable speed v. A motion sensor on the rocket
measures this speed and the value is sampled by an onboard computer at 1 second
intervals. The computer is required to calculate the distance travelled by the rocket
andrelaythevaluetoagroundstationatregularintervals.Table17.3recordsvalues
of the measurements taken by the computer during the first 10 seconds of flight.
Assumingthatthecomputerusesthetrapeziumruletoestimatethedistancetravelled
by the rocket, calculate the value that the computer will relay to the ground station
after 10 seconds.
Table17.3
Time(s) Speed(ms −1 )
0 10.1
1 17.2
2 24.4
3 29.2
4 34.6
5 41.2
6 50.9
7 57.8
8 60.3
9 61.2
10 62.1
Solution
We know that v = ds where v =speed ands = distance travelled intimet. So
dt
s =
∫ t
0
We estimate the value of this integral using the trapezium rule. We choose a strip of
widthh = 1 as this is the time interval at which data is collected by the computer.
Therefore
s =
∫ 10
0
vdt
vdt≈ 1 {10.1 +2(17.2 +24.4 +29.2 +34.6
2
+41.2 +50.9 +57.8 +60.3 +61.2) +62.1}
= 1 2 (825.8)
= 412.9
The distance travelled after 10 seconds isapproximately 412.9 m.
17.2.1 Errorduetothetrapeziumrule
Suppose we wish to estimate ∫ b
f dx using the trapezium rule. The interval [a,b] is
a
dividedintonequal strips, eachofwidthh = b −a
n .
17.2 Trapezium rule 499
The difference between the estimated value of the integral and the true value of the
integral istheerror. So
error = estimated value −truevalue
Weareabletofindthemaximumvalueof |error|.Firstlywecalculatethesecondderivative
of f, that is f ′′ . Suppose that |f ′′ | is never greater than some value,M, throughout
the interval [a,b], thatis
|f ′′ | M forallxvalues on [a,b]
WesaythatM isanupperboundfor |f ′′ |on[a,b].Thentheerrorduetothetrapezium
ruleissuch that
|error | (b−a) h 2 M
12
The expression (b−a) h 2 M is an upper bound for the error. Note that the error
12
depends upon h 2 : if the strip width, h, is halved the error reduces by a factor of 4; if
hisdivided by 10 the erroris divided by 100.
Example17.2 Find an upper bound for the error in the estimates calculated in Example 17.1. Hence
find upper and lower boundsforthe truevalue of ∫ 1.3
0.5 e(x2) dx.
Solution We have f = e (x2) . Then
f ′ = 2xe (x2 )
and f ′′ = 2e (x2) (1+2x 2 )
We note that f ′′ is increasing on [0.5,1.3] and so its maximum value is obtained at
x = 1.3. Thus the maximum value of f ′′ on [0.5,1.3] is 2e (1.3)2 [1 + 2(1.3) 2 ]. Noting
that2e (1.3)2 [1 +2(1.3) 2 ] = 47.47weseethat48isanupperboundfor f ′′ on[0.5,1.3],
thatisM = 48.
We notethata = 0.5,b= 1.3.Thus
|error | (0.8)h2 (48)
= 3.2h 2
12
(a) InExample 17.1(a),h = 0.2 and so
| error | 3.2(0.2) 2 = 0.128
Thus an upper bound for the error is0.128. We have
−0.128 error 0.128
The estimated value of the integral is2.117 and so
that is
2.117 −0.128 truevalue of integral 2.117 +0.128
1.989
∫ 1.3
0.5
e (x2) dx 2.245
An upper bound for ∫ 1.3
0.5 e(x2) dx is2.245 and a lower bound is 1.989.
500 Chapter 17 Numerical integration
(b) InExample 17.1(b),h = 0.1 and so
|error | 3.2(0.1) 2 = 0.032
Hence an upper bound forthe erroris 0.032. Now
−0.032 error 0.032
The estimated value of the integral is2.085 and so
2.085 −0.032
thatis
2.053
∫ 1.3
0.5
∫ 1.3
0.5
e (x2) dx 2.117
e (x2) dx 2.085 +0.032
EXERCISES17.2
1 Estimate the followingdefinite integrals usingthe
trapezium rule:
∫ 1
(a) sin(t 2 )dt useh = 0.2
0
Solutions
∫ 1.2 e x
(b)
1 x dx
use five strips
1 (a) 0.3139 (b) 0.5467
TechnicalComputingExercises17.2
1 (a) Plotthe second derivative of f (t) = sin(t 2 ) for
0 t 1. Useyour graph to findan upperbound
for f ′′ (t)for0 t 1. Hence findan upper
bound forthe error in Question1(a) in
Exercises17.2.Stateupperandlower boundsfor
the integral given in the question.
(b) Repeat(a)with f (x) = ex
x for1x1.2.
Hence findan upper bound forthe errorin
Question1(b) in Exercises17.2. Stateupper and
lower bounds forthe integral given in the
question.
17.3 SIMPSON’SRULE
Inthetrapeziumrulethecurvey(x)isapproximatedbyaseriesofstraightlinesegments.
InSimpson’srulethecurveisapproximatedbyaseriesofquadraticcurvesasshownin
Figure 17.2. The area is divided into an even number of strips of equal width. Consider
thefirstpairofstrips.AquadraticcurveisfittedthroughthepointsA,BandC.Another
y
A
B
C
D
E
17.3 Simpson’s rule 501
quadraticcurveisfittedthroughthepointsC,DandE.Aftersomeanalysisanexpression
forapproximating the area isfound.
Simpson’s rulestates:
Figure17.2
InSimpson’s rulean
even number ofstrips
isused. Thecurve is
approximated by
quadratic curves.
x
area ≈ h 3 (y 0 +4y 1 +2y 2 +4y 3 +2y 4 +···+2y n−2 +4y n−1 +y n )
= h 3 {y 0 +4(y 1 +y 3 +···)+2(y 2 +y 4 +···)+y n }
wherenisan even number.
Example17.3 Estimate ∫ 1.3
0.5 e(x2) dx usingSimpson’srulewith (a)fourstrips, (b)eight strips.
Solution (a) We use Table 17.1 and notethath = 0.2.UsingSimpson’s rulewehave
estimatedvalue = 0.2
3 {1.2840+4(1.6323+3.3535)+2(2.2479)+5.4195}
= 2.0762
Using Simpson’s rulewe have found ∫ 1.3
0.5 e(x2) dx ≈ 2.0762.
(b) We use Table 17.2 and note thath = 0.1:
estimated value = 0.1 {1.2840 +4(1.4333 +1.8965 +2.7183 +4.2207)
3
+2(1.6323 +2.2479 +3.3535) +5.4195}
= 2.0749
Using Simpson’s rulewe have found ∫ 1.3
0.5 e(x2) dx ≈ 2.0749.
Example17.4 Estimate ∫ 2
1
√
1 +x3 dx using Simpson’s rulewith 10 strips.
Solution With10 stripsh = 0.1. UsingTable 17.4, wefind
estimatedvalue ≈ 0.1
3 {1.4142+4(1.5268+1.7880+2.0917+2.4317+2.8034)
+2(1.6517 +1.9349 +2.2574 +2.6138) +3.000}
= 2.130
In some cases the numerical values are not derived from a function but from actual
measurements.Numericalmethods can still beappliedinanidentical manner.
502 Chapter 17 Numerical integration
Table17.5
3.0000 =y 10
Table17.4
x 10 = 2.0
Using Simpson’s rule to estimate
∫ 2
√ Measurementsusedto
1 1+x 3 dx.
estimatethe integral.
x y= √ 1+x 3 t Measurement (f )
x 0 = 1.0 1.4142 =y 0
0 4
x 1 = 1.1 1.5268 =y 1
1 4.7
x 2 = 1.2 1.6517 =y 2
2 4.9
x 3 = 1.3 1.7880 =y 3
3 5.3
x 4 = 1.4 1.9349 =y 4
4 6.0
x 5 = 1.5 2.0917 =y 5
5 5.3
x 6 = 1.6 2.2574 =y 6
6 5.9
x 7 = 1.7 2.4317 =y 7
x 8 = 1.8 2.6138 =y 8
x 9 = 1.9 2.8034 =y 9
Example17.5 Measurements of a variable, f, were made at 1 second intervals and are given in
Table 17.5. Estimate ∫ 6
f dt using
0
(a) the trapezium rule
(b) Simpson’srule.
Solution (a) Thesum ofthe areasofthe trapezia is
1
[4 +2(4.7 +4.9 +5.3 +6.0 +5.3) +5.9] = 31.2
2
(b) The area has been divided into sixstripsand so Simpson’s rule can beapplied:
approximate value of integral = 1 [4 +4(4.7 +5.3 +5.3)
3
+2(4.9 +6.0) +5.9] = 31.0
Engineeringapplication17.2
Energydissipationinaresistor
Aresistorisbeingusedtodissipateenergyfromavariabled.c.supply.Acalculation
is needed of how much energy has been dissipated over a period of time. Table 17.6
contains values of current,I, through the resistor, and voltage,V, across the resistor
forthefirst100secondssinceelectricalpowerwasfirstapplied.Calculatetheenergy
dissipation during this time period using Simpson’s rule with a step interval of 10
seconds.
Solution
Theenergy dissipated,E, isgiven by
E =
∫ t
0
Pdt
17.3 Simpson’s rule 503
Table17.6
Time(s) Voltage(V) Current(A)
0 50.5 10.1
10 101.0 20.2
20 67.5 13.5
30 80.5 16.1
40 92.0 18.4
50 96.0 19.2
60 78.5 15.7
70 82.0 16.4
80 90.5 18.1
90 107.0 21.4
100 86.0 17.2
Table17.7
Time(s) Voltage(V) Current(A) Power(W)
0 50.5 10.1 510.05
10 101.0 20.2 2040.20
20 67.5 13.5 911.25
30 80.5 16.1 1296.05
40 92.0 18.4 1692.80
50 96.0 19.2 1843.20
60 78.5 15.7 1232.45
70 82.0 16.4 1344.80
80 90.5 18.1 1638.05
90 107.0 21.4 2289.80
100 86.0 17.2 1479.20
wherePis the power. AlsoP =IV, and so
E =
∫ t
0
IV dt
We first need toevaluatePas shown inTable 17.7.
Then using Simpson’s rulewithh = 10 wehave
E =
∫ 100
0
IVdt≈ 10 {510.05 +4(2040.20 +1296.05 +1843.20 +1344.80
3
+2289.80) +2(911.25 +1692.80 +1232.45 +1638.05)
+1479.20}
= 160648.5 J
= 160.649 kJ
The energy dissipated istherefore approximately 160.6 kJ.
17.3.1 ErrorduetoSimpson’srule
Simpson’sruleprovidesanestimatedvalueofadefiniteintegral.Thedifferencebetween
theestimatedvalueandthetrue(exact)valueistheerror.Justaswiththetrapeziumrule,
wecan calculateanupper bound forthiserror.
We need to calculate the fourth derivative of f, that is f (4) . Suppose |f (4) | is never
greaterthansome value,M, throughoutthe interval [a,b], thatis
|f (4) | M forallxvalues on [a,b]
ClearlyM isanupperboundfor |f (4) |on[a,b].TheerrorduetoSimpson’sruleissuch
that
|error | (b−a)h4 M
180
Note that the error isproportional toh 4 .
504 Chapter 17 Numerical integration
Example17.6 Find an upper bound for the error in the estimates calculated in Example 17.3. Hence
find upper and lower bounds for ∫ 1.3
0.5 e(x2) dx.
Solution Here wehave f = e (x2) . Calculating the fourthderivative gives
f (4) = 4e (x2) (4x 4 +12x 2 +3)
Weseekanupperboundfor |f (4) |on[0.5,1.3].Wenotethat f (4) increasesasxincreases
and soits maximum value inthe interval occurs whenx = 1.3:
f (4) (1.3) = 752.319 < 753
Hence 753 is anupper bound for |f (4) |.
Inthis examplea = 0.5 andb=1.3 and so
| error | (0.8)h4 (753)
180
(a) Hereh = 0.2 and so
= 3.347h 4
|error | 3.347(0.2) 4 = 0.0054
An upper bound forthe erroris0.0054. Now
−0.0054 error 0.0054
The estimated value of the integral is2.0762 and so
2.0762 −0.0054
∫ 1.3
0.5
e (x2) dx 2.0762 +0.0054
thatis
2.0708
∫ 1.3
0.5
e (x2) dx 2.0816
(b) Hereh = 0.1 and so
|error | 3.347(0.1) 4 = 3.347 ×10 −4
An upper bound forthe erroris3.347 ×10 −4 . Now
−3.347 ×10 −4 error 3.347 ×10 −4
Noting thatthe estimated value of the integral is2.0749 we have
2.0749 −3.347 ×10 −4
∫ 1.3
0.5
e (x2) dx 2.0749 +3.347 ×10 −4
thatis
2.0746
∫ 1.3
0.5
e (x2) dx 2.0752
Review Exercises 17 505
EXERCISES17.3
1 Estimate the values ofthe following integrals using
Simpson’s rule:
∫ 3
(a) ln(x 3 +1)dx use10strips
2
∫ 2.6 √
(b) te t dt use eightstrips
1
2 Evaluate,usingthe trapeziumruleandSimpson’s
rule,
∫ 1
(a) (x 2 +1) 3/2 dx use four strips
0
∫ 1.6 sin2t
(b) dt use sixstrips
1 t
Solutions
1 (a) 2.7955 (b) 15.1164
2 (a) trapeziumrule: 1.5900,Simpson’s rule: 1.5681
(b) trapeziumrule:0.2464,Simpson’s rule:
0.2460
TechnicalComputingExercises17.3
1 (a) Plotthe fourthderivative of f (x) =
ln(x 3 +1)for2 x3.Useyourgraph
to findan upper bound for f (4) (x)for2x3.
Hence findan upperbound forthe error in
Question1(a) in Exercises17.3. Stateupper and
lower boundsforthe integral given in the
question.
(b) Repeat (a)with f (t) = √ te t for1 t 2.6.
Hence findan upper bound forthe error in Question1(b)
in Exercises17.3. Statelower and upper
boundsforthe integralgiven in the question.
2 Use MATLAB ® orasimilar technicalcomputing
language to findupper andlower boundsforthe
integrals in Question2in Exercises17.3.
REVIEWEXERCISES17
1 Iff(t)= √ t 2 +1find ∫ 2
1 f (t)dt using
(a) the trapezium rulewithh = 0.25
(b) Simpson’s ruleusingeightstrips.
2 Estimate the followingdefiniteintegrals usingthe
trapeziumrulewith sixstrips:
∫ 4 √
∫ 0.6
(a) x 3 +1dx (b) sin(t 2 )dt
1
0
∫ 0.8
∫ 0.3
(c) e (t2) 3
dt (d)
0.2
0 x 3 +2 dx
∫ 2.5 e t
(e) dt
1 t
3 Estimate the definiteintegrals in Question2using
Simpson’s rulewith sixstrips.
4 Estimate the followingdefiniteintegrals usingthe
trapeziumrule with eightstrips:
∫ 4
(a) cos( √ t)dt
0
∫ 6
(b) (t 2 +1) 3/2 dt
−2
∫ 3
(c) e √x dx
1
∫ 0.8
(d) tan(t 2 )dt
0
∫ 5
(e) ln(x 2 +1)dx
3
506 Chapter 17 Numerical integration
Solutions
1 (a) 1.8111
(b) 1.8101
2 (a) 12.9113
(b) 0.07227
(c) 0.80860
(d) 0.44845
(e) 5.19384
3 (a) 12.87181
(b) 0.071334
(c) 0.80643
(d) 0.44849
(e) 5.17879
4 (a) 0.81058
(b) 370.25
(c) 8.27686
(d) 0.18390
(e) 5.63032
TechnicalComputingExercises17
1 UseMATLAB ® orasimilartechnical computing
language to findupper boundsforthe errorsin
Questions1to 4 in Review exercises17. Hence find
upperand lower boundsforthe integrals given in
these questions.
18 Taylorpolynomials,
Taylorseriesand
Maclaurinseries
Contents 18.1 Introduction 507
18.2 Linearizationusingfirst-orderTaylorpolynomials 508
18.3 Second-orderTaylorpolynomials 513
18.4 Taylorpolynomialsofthenthorder 517
18.5 Taylor’sformulaandtheremainderterm 521
18.6 TaylorandMaclaurinseries 524
Reviewexercises18 532
18.1 INTRODUCTION
Often the value of a function and the values of its derivatives are known at a particular
pointandfromthisinformationitisdesiredtoobtainvaluesofthefunctionaroundthat
point.TheTaylorpolynomialsandTaylorseriesallowengineerstomakesuchestimates.
Oneapplicationofthisisinobtaininglinearizedmodelsofnon-linearsystems.Thegreat
advantageofalinearmodelisthatitismucheasiertoanalysethananon-linearone.Itis
possibletomakeuseoftheprincipleofsuperposition:thisallowstheeffectsofmultiple
inputstoasystemtobeconsideredseparately,andtheresultantoutputtobeobtainedby
summing the individual outputs.
Oftenasystemmaycontainonlyafewcomponentsthatarenon-linear.Bylinearizing
theseitisthenpossibletoproducealinearmodelforthesystem.Wesawanexampleof
thiswhenweanalysedafluidsysteminEngineeringapplication10.6.Althoughelectricalsystemsareoftenlinear,mechanical,thermalandfluidsystems,orsystemscontainingamixtureofthese,arelikelytocontainsomenon-linearcomponents.Unfortunately
itmaynotbepossibletoobtainasufficientlyaccuratelinearmodelforeverynon-linear
systemas weshall see inthischapter.
Taylorpolynomialsofhigherdegreecanbefoundwhichapproximatetoagivenfunction.ThisisdealtwithinSections18.3and18.4.Thedifferencebetweenagivenfunction
andthecorrespondingTaylorpolynomialiscoveredinSection18.5.Thechaptercloses
with a treatment of Taylor and Maclaurin series.
508 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
18.2 LINEARIZATIONUSINGFIRST-ORDERTAYLOR
POLYNOMIALS
Suppose we know that y is a function of x and we know the values of y and y ′ when
x =a,thatisy(a)andy ′ (a)areknown.Wecanusey(a)andy ′ (a)todeterminealinear
polynomial which approximates toy(x). Let this polynomial be
p 1
(x) =c 0
+c 1
x
We choose the constantsc 0
andc 1
so that
p 1
(a) =y(a)
p ′ 1 (a) =y′ (a)
that is, the values of p 1
and its first derivative evaluated atx = a match the values ofy
and itsfirst derivative evaluated atx =a. Then,
p 1
(a) =y(a) =c 0
+c 1
a
p ′ 1 (a) =y′ (a) =c 1
Solving forc 0
andc 1
yields
Thus,
c 0
=y(a) −ay ′ (a)
c 1
=y ′ (a)
p 1
(x) =y(a) −ay ′ (a) +y ′ (a)x
p 1
(x) =y(a) +y ′ (a)(x −a)
p 1
(x) isthefirst-order Taylor polynomialgenerated byyatx =a.
y
O
Q
Figure18.1
Graphical representation
ofafirst-orderTaylor
polynomial.
a
x
The function,y(x), isoften referred toas thegenerating function. Note that p 1
(x) and
its first derivative evaluated atx =a agree withy(x) and its first derivative evaluated at
x=a.
First-order Taylor polynomials can also be viewed from a graphical perspective.
Figure 18.1 shows the function, y(x), and a tangent at Q where x = a. Let the equation
ofthe tangent atx =abe
p(x)=mx+c
The gradient of the tangent is, by definition, the derivative ofyatx = a, that isy ′ (a).
So,
p(x) =y ′ (a)x +c
The tangent passes through the point (a,y(a)), and so
y(a) =y ′ (a)a +c
that isc =y(a) −y ′ (a)a. The equation of the tangent isthus
p(x) =y ′ (a)x +y(a) −y ′ (a)a
p(x) =y(a) +y ′ (a)(x −a)
18.2 Linearization using first-order Taylor polynomials 509
This is the first-order Taylor polynomial. We see that the first-order Taylor polynomial
issimplythe equation of the tangent toy(x) wherex =a.
Clearly,forvaluesofxneartox =athevalueofp 1
(x)willbeneartoy(x);p 1
(x)isa
linearapproximationtoy(x).Intheneighbourhoodofx =a,p 1
(x)closelyapproximates
y(x), but being linear isamuch easier function todeal with.
Example18.1 Afunction,y, and its first derivative areevaluated atx = 2.
y(2)=1 y ′ (2)=3
(a) State the first-order Taylor polynomial generated byyatx = 2.
(b) Estimatey(2.5).
Solution (a) p 1
(x) =y(2) +y ′ (2)(x −2) =1+3(x −2) = −5 +3x
(b) We use the first-order Taylor polynomial toestimatey(2.5):
p 1
(2.5) = −5 +3(2.5) = 2.5
Hence,y(2.5) ≈ 2.5.
Example18.2 Find a linear approximation toy(t) =t 2 neart = 3.
Solution We require the equation of the tangent toy = t 2 att = 3, that is the first-order Taylor
polynomial aboutt = 3.Note thaty(3) = 9 andy ′ (3) = 6.
p 1
(t) =y(a) +y ′ (a)(t −a) =y(3) +y ′ (3)(t −3)
=9+6(t−3)
=6t−9
Att = 3, p 1
(t) and y(t) have an identical value. Near tot = 3, p 1
(t) and y(t) have
similarvalues, forexample p 1
(2.8) = 7.8,y(2.8) = 7.84.
18.2.1 Linearization
It is a frequent requirement in engineering to obtain a linear mathematical model of a
systemwhichisbasicallynon-linear.Mathematicallyandcomputationallylinearmodels
arefareasiertodealwiththannon-linearmodels.Themainreasonforthisisthatlinear
modelsobeytheprincipleofsuperposition.Itfollowsthatiftheapplication,separately,
of inputs u 1
(t) and u 2
(t) to the system produces outputs y 1
(t) and y 2
(t), respectively,
thentheapplicationofaninputu 1
(t) +u 2
(t)willproduceanoutputy 1
(t) +y 2
(t).This
isonly trueforlinear systems.
The value of this principle is that the effect of several inputs to a system can be calculated
merely by adding together the effects of the individual inputs. This allows the
effect of simple individual inputs to the system to be analysed and then combined to
evaluate the effect of more complicated combinations of inputs. A few examples will
help clarifythese points.
510 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
Engineeringapplication18.1
Ad.c.electricalnetwork
Consider the d.c. network of Figure 18.2. This network is a linear system. This is
because the voltage/current characteristic of a resistor is linear provided a certain
voltage isnot exceeded. Recall Ohm’s lawwhich isgiven by
V=IR
whereV is the voltage across the resistor,I is the current through the resistor andR
istheresistance.Thismakestheanalysisofthenetworkrelativelyeasy.Thevoltage
sources,V 1
,V 2
,V 3
,V 4
, can be thought of as the inputs to the system. It is possible to
analyse the effect of each of these sources separately, for example the voltage drop
across the resistor,R 5
, resulting from the voltage sourceV 1
, and then combine these
effects to obtain the total effect on the system. The voltage drop acrossR 5
when all
sources are considered would be the sum of each of the voltage drops due to the
individual sourcesV 1
,V 2
,V 3
andV 4
.
R V 1 2 V 3
+ – + –
–
R 2 R 3 V 4 R 4 R 5
+
+ –
V R 6 R 1 7
Figure18.2
Ad.c. electrical network.
Engineeringapplication18.2
Agravityfeedwatersupply
Consider the water supply network of Figure 18.3. The network consists of three
sourcereservoirsandaseriesofconnectingpipes.Wateristakenfromthenetworkat
twopoints,S 1
andS 2
.Inapracticalnetwork,reservoirsareusuallyseveralkilometres
away from the points at which water is taken from the network and so the effect of
pressure drops along the pipes is significant. For this reason many networks require
pumps toboostthe pressure.
The main problem with analysing this network is that it is non-linear. This is because
the relationship between pressure drop along a pipe and water flow through a
pipeisnotlinear:adoublingofpressuredoesnotleadtoadoublingofflow.Forthis
R 1
P 1 P 2
R 2 R 3
Figure18.3
P 3 P 4 P 5 P 6
S 1 S 2
Awater supply network.
18.2 Linearization using first-order Taylor polynomials 511
reason, it is not possible to use the principle of superposition when analysing the
network. For example, if the effect of the pressure at S 1
for a given flow rate was
calculatedseparatelyforeachoftheinputstothesystem--thereservoirsR 1
,R 2
,R 3
--
then the effect of all the reservoirs could not be obtained by adding the individual
effects. It is not possible to obtain a linear model for this system except under very
restrictive conditions and so the analysis of water networks isvery complicated.
Having demonstrated the valueoflinear models itisworthanalysing howand when
a non-linear system can be linearized. The first thing to note is that many systems may
containamixtureoflinearandnon-linearcomponentsandsoitisonlynecessarytolinearizecertainpartsofthesystem.AsystemofthistypehasbeenstudiedinEngineering
application 10.6. Thereforelinearization involves deciding which components ofasystemarenon-linear,decidingwhetheritwouldbevalidtolinearizethecomponentsand,
ifso, then obtaining linearized models of the components.
Consider again Figure 18.1. Imagine it illustrates a component characteristic. The
actual component characteristic is unimportant for the purposes of this discussion. For
instance, it could be the pressure/flow relationship of a valve or the voltage/current
relationship of an electronic device. The main factor in deciding whether a valid linear
model can be obtained is the range of values over which the component is required to
operate. If an operating point Q were chosen and deviations from this operating point
were small then it is clear from Figure 18.1 that a linear model -- corresponding to the
tangent tothe curve atpointQ-- would be an appropriate model.
Obtaining a linear model is relatively straightforward. It consists of calculating the
first-order Taylor polynomial centred around the operating point Q.This isgiven by
p 1
(x) =y(a) +y ′ (a)(x −a)
Then p 1
(x)isusedasthelinearizedmodelofthecomponentwithcharacteristicy(x).It
isvalidprovidedthatitisonlyusedforvaluesofxsuchthat |x−a|issufficientlysmall.
Asstated, p 1
(x)isalsotheequationofthetangenttothecurveatpointQ.Therangeof
values for which the model is valid depends on the curvature of the characteristic and
the accuracy required.
Engineeringapplication18.3
Powerdissipationinaresistor
The power dissipated in a resistor varies with the current. Derive a linear model for
thispowervariationvalidforanoperatingpointof0.5A.Theresistorhasaresistance
of 10 .
Solution
For a resistor
where
P=I 2 R
P = power dissipated (W)
I = current (A)
R = resistance ().
➔
512 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
The first-order polynomial, valid around an operating pointI = 0.5,is
p 1
(I) =P(0.5) +P ′ (0.5)(I −0.5)
NowP(0.5) = (0.5) 2 10 = 2.5,P ′ (0.5) = 2(0.5)(10) = 10,andso
p 1
(I) =2.5 +10(I −0.5) =10I −2.5
It is interesting to compare this linear approximation with the true curve for values
ofI around the operating point. Table 18.1 shows some typical values. Figure 18.4
shows a graph of the power dissipated in the resistor,P, against the current flowing
intheresistor,I.Noticethatthelinearapproximationisquitegoodwhenclosetothe
operating pointbut deteriorates further away.
Table18.1
A comparison oflinear approximations with truevalues.
I (A) TruevalueofP (W) ApproximatevalueofP (W)
0.5 2.5 2.5
0.499 2.49001 2.49
0.501 2.51001 2.51
0.49 2.401 2.4
0.51 2.601 2.6
0.4 1.6 1.5
0.6 3.6 3.5
1.0 10 7.5
P
10
8
6
4
Operating point
P = 10 I 2
p 1 (I) = 10 I – 2.5
2
0
0.2 0.4 0.6 0.8 1
I
Figure18.4
We see that the tangential approximation is
good when close to the operating point.
EXERCISES18.2
1 Calculate the first-orderTaylor polynomial generated
byy(x) = e x about
(a) x=0 (b) x=2 (c) x=−3
2 Calculate the first-orderTaylor polynomial generated
byy(x) = sinxabout
(a) x=0 (b) x=1 (c) x=−0.5
3 Calculate the first-orderTaylor polynomial generated
byy(x) = cosxabout
(a)x=0
(b)x=1
(c)x = −0.5
18.3 Second-order Taylor polynomials 513
4 (a) Find alinear approximation, p 1 (t),toh(t) =t 3
aboutt = 2.
(b) Evaluateh(2.3)and p 1 (2.3).
5 (a) Findalinearapproximation, p 1 (t),toR(t) = 1 t
aboutt = 0.5.
(b) EvaluateR(0.7)and p 1 (0.7).
Solutions
1 (a) p 1 (x)=x+1
(b) p 1 (x) =e 2 (x −1)
(c) p 1 (x) =e −3 (x +4)
2 (a) p 1 (x) =x
(b) p 1 (x) = 0.5403x +0.3012
(c) p 1 (x) = 0.8776x −0.0406
4 (a) p 1 (t) =12t −16
(b) h(2.3) = 12.167,p 1 (2.3) = 11.6
5 (a) p 1 (t) = −4t +4
(b) R(0.7) = 1.4286,p 1 (0.7) = 1.2
3 (a) p 1 (x) =1
(b) p 1 (x) = −0.8415x +1.3818
(c) p 1 (x) = 0.4794x +1.1173
18.3 SECOND-ORDERTAYLORPOLYNOMIALS
Suppose that in addition to y(a) and y ′ (a), we also have a value of y ′′ (a). With this
informationasecond-orderTaylorpolynomialcanbefound,whichprovidesaquadratic
approximationtoy(x). Let
p 2
(x) =c 0
+c 1
x+c 2
x 2
We require
p 2
(a) =y(a)
p ′ 2 (a) =y′ (a)
p ′′ 2 (a) =y′′ (a)
thatis,thepolynomialanditsfirsttwoderivativesevaluatedatx =amatchthefunction
and its first two derivatives evaluated atx =a. Hence
p 2
(a) =c 0
+c 1
a +c 2
a 2 =y(a) (18.1)
p ′ 2 (a) =c 1 +2c 2 a =y′ (a) (18.2)
p ′′ 2 (a) = 2c 2 =y′′ (a) (18.3)
Solvingforc 0
,c 1
andc 2
yields
c 2
= y′′ (a)
from Equation (18.3)
2
c 1
=y ′ (a) −ay ′′ (a) fromEquation (18.2)
c 0
=y(a) −ay ′ (a) + a2
2 y′′ (a) from Equation (18.1)
514 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
Hence,
Finally,
p 2
(x) =y(a) −ay ′ (a) + a2
2 y′′ (a)
+{y ′ (a) −ay ′′ (a)}x + y′′ (a)
2 x2
p 2
(x) =y(a) +y ′ (a)(x −a) +y ′′ (x −a)2
(a)
2
p 2
(x) isthesecond-order Taylor polynomialgenerated byyaboutx =a.
Example18.3 Giveny(1) = 0,y ′ (1) = 1,y ′′ (1) = −2,estimate
(a) y(1.5)
(b) y(2)
(c) y(0.5)
usingthe second-order Taylor polynomial.
Solution Thesecond-order Taylor polynomialis p 2
(x):
p 2
(x) =y(1) +y ′ (1)(x −1) +y ′′ (x −1)2
(1)
2
(x −1)2
=x−1−2 =x−1−(x−1) 2 =−x 2 +3x−2
2
We use p 2
(x) as an approximation toy(x).
(a) The value ofy(1.5) isapproximated by p 2
(1.5):
y(1.5) ≈ p 2
(1.5) = 0.25
(b) The value ofy(2) isapproximated by p 2
(2):
y(2) ≈ p 2
(2) =0
(c) The value ofy(0.5) isapproximated by p 2
(0.5):
y(0.5) ≈ p 2
(0.5) = −0.75
Example18.4 (a) Calculatethe second-order Taylor polynomial, p 2
(x), generatedby
y(x)=x 3 +x 2 −6aboutx=2.
(b) Verifythaty(2) = p 2
(2),y ′ (2) = p ′ 2 (2)andy′′ (2) = p ′′ 2 (2).
(c) Comparey(2.1) and p 2
(2.1).
Solution (a) We needtocalculatey(2),y ′ (2) andy ′′ (2). Now
and so
y(x)=x 3 +x 2 −6,y ′ (x)=3x 2 +2x,y ′′ (x)=6x+2
y(2) =6,y ′ (2) =16,y ′′ (2) =14
18.3 Second-order Taylor polynomials 515
The required second-order Taylor polynomial, p 2
(x), isthus given by
p 2
(x) =y(2) +y ′ (2)(x −2) +y ′′ (x −2)2
(2)
2
(x −2)2
=6+16(x−2)+14
2
=6+16x−32+7(x 2 −4x+4)
=7x 2 −12x+2
(b) Using (a)we can see that
and so
Hence
(c) We have
p 2
(x) =7x 2 −12x +2,p ′ 2 (x) = 14x −12,p′′ 2
(x) =14
p 2
(2) =6,p ′ 2 (2) = 16,p′′ 2
(2) =14
y(2) = p 2
(2),y ′ (2) = p ′ 2 (2),y′′ (2) = p ′′ 2 (2)
y(2.1) = (2.1) 3 + (2.1) 2 −6 = 7.671
p 2
(2.1) = 7(2.1) 2 −12(2.1) +2 = 7.67
Clearly there is a very close agreement between values of y(x) and p 2
(x) near to
x=2.
Engineeringapplication18.4
Quadraticapproximationtoadiodecharacteristic
In Engineering application 10.5 we derived a linear approximation to a diode characteristic
suitable for small signal variations around an operating point. Sometimes
it is not possible to use a linear approximation because the variations are too large
to maintain sufficient accuracy. Even so, an approximate model may be desirable.
In general, a higher order Taylor polynomial will give a more accurate model than
that of a lower order polynomial. We will consider a quadratic model for a diode
characteristic. TheV −I characteristic of typical diode at room temperature can be
modelledby the equation
I =I(V) =I s
(e 40V −1)
Given anoperating point,V a
, the second-order Taylor polynomialis
p 2
(V)=I(V a
)+I ′ (V a
)(V −V a
)+I ′′ (V a
) (V −V a )2
2
Now
I ′ (V) = 40I s
e 40V I ′′ (V) = 1600I s
e 40V ➔
516 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
so
p 2
(V) =I s
(e 40V a −1) +40Is e 40V a (V −Va ) +1600I s
e 40V (V −V
a a
) 2
2
Thecoefficientsneedtobecalculatedonlyonce.Afterthatthecalculationofacurrent
value only involves evaluating a quadratic.
EXERCISES18.3
1 (a) Obtainthe second-order Taylor polynomial,
p 2 (x),generatedbyy(x) = 3x 4 +1aboutx = 2.
(b) Verifythaty(2) = p 2 (2),y ′ (2) = p ′ 2 (2)and
y ′′ (2) = p ′′ 2 (2).
(c) Evaluate p 2 (1.8)andy(1.8).
2 (a) Calculate the second-order Taylor polynomial,
p 2 (x),generated byy(x) = sinx aboutx = 0.
(b) Calculate the second-order Taylor polynomial,
p 2 (x),generated byy(x) = cosx aboutx = 0.
(c) Compare your resultsfrom (a) and(b)with the
small-angle approximations given in Section6.5.
3 Afunction,y(x),issuchthaty(−1) = 3,y ′ (−1) = 2
andy ′′ (−1) = −2.
(a) Statethe second-order Taylor polynomial
generated byyaboutx = −1.
(b) Estimatey(−0.9).
4 Afunction,y(x),satisfiesthe equation
y ′ =y 2 +x
y(1)=2
(a) Estimatey(1.3)usingafirst-orderTaylor
polynomial.
(b) Bydifferentiatingthe equationwith respect tox,
obtain an expression fory ′′ .Hence evaluate
y ′′ (1).
(c) Estimatey(1.3)usingasecond-order Taylor
polynomial.
5 Afunction,x(t),satisfiesthe equation
ẋ=x+ √ t+1
x(0)=2
(a) Estimatex(0.2)usingafirst-orderTaylor
polynomial.
(b) Differentiate the equation w.r.t.t and hence
obtain an expression forẍ.
(c) Estimatex(0.2)usingasecond-order Taylor
polynomial.
6 Afunction,h(t),isdefined by
h(t) = sin2t +cos3t
Obtainthe second-order Taylor polynomial generated
byh(t)aboutt = 0.
7 Thefunctionsy 1 (x),y 2 (x)andy 3 (x)are defined by
y 1 (x) =Ae x ,y 2 (x) =Bx 3 +Cx,
y 3 (x) =y 1 (x) +y 2 (x)
(a) Obtainasecond-order Taylor polynomial for
y 1 (x)aboutx = 0.
(b) Obtainasecond-order Taylor polynomial for
y 2 (x)aboutx = 0.
(c) Obtainasecond-order Taylor polynomial for
y 3 (x)aboutx = 0.
(d) Canyou draw any conclusionsfrom your
answers to (a),(b)and(c)?
Solutions
1 (a) p 2 (x) = 72x 2 −192x +145
(c) p 2 (1.8) = 32.68,y(1.8) = 32.4928
2 (a) p 2 (x) =x (b) p 2 (x) =1− x2
2
3 (a) p 2 (x) = −x 2 +4
(b) p 2 (−0.9) = 3.19.This isan approximation to
y(−0.9).
4 (a) 3.5
(b) y ′′ = 2yy ′ +1,y ′′ (1) = 21
(c) 4.445
5 (a) 2.6
1
(b) ẍ=ẋ +
2 √ t +1
(c) 2.67
18.4 Taylor polynomials of the nth order 517
6 p 2 (t) =1+2t −4.5t 2
( )
7 (a)A1 +x+ x2
2
(b) Cx
(c) A+(A+C)x+ Ax2
2
TechnicalComputingExercises18.3
Many technicalcomputing languageshave the
capability ofproducingaTaylor polynomial ofa
function.Insome languagesthisis available asa
defaultwhereasin others, such asMATLAB ® ,thisis
offered via a toolboxfunctionwhichmayneedto be
loaded orpurchased separately. Ifthe software you
are usingdoes not have the capability to produce the
Taylor seriesautomaticallyyou maychoose to dothis
partmanuallyandthenplot the resultsusingthe
technicalcomputing language.
1 (a) Calculate the second-order Taylor polynomial,
p 2 (x),generatedbyy(x) =x 3 aboutx = 0.
(b) Drawy(x)andp 2 (x)for −2 x2.
2 (a) Calculate the second-order Taylor polynomial,
p 2 (x),generated byy(x) = sinx aboutx = 0.
(b) Drawy(x)andp 2 (x)for −2 x 2.
3 (a) Calculate the second-order Taylor polynomial,
( )
1
p 2 (x),generated byy(x) = sin aboutx = 3.
x
(b) Drawy(x)andp 2 (x)for1 x 5.
4 (a) Calculate the second-order Taylor polynomial,
p 2 (x),generated byy(x) = e cosx aboutx = 0.
(b) Drawy(x)andp 2 (x)for −2 x 2.
18.4 TAYLORPOLYNOMIALSOFTHEnTHORDER
If we know y and its first n derivatives evaluated at x = a, that is y(a), y ′ (a),
y ′′ (a), ...,y (n) (a), thenthenth-orderTaylorpolynomial, p n
(x), maybewrittenas
p n
(x)=y(a) +y ′ (a)(x −a) +y ′′ (x −a)2
(a) +y (3) (x −a)3
(a)
2! 3!
+···+y (n) (x −a)n
(a)
n!
This provides an approximation toy(x). The polynomial and its firstnderivatives evaluated
atx = a match the values ofy(x) and its firstnderivatives evaluated atx = a,
thatis
p n
(a) =y(a)
p ′ n (a) =y′ (a)
p ′′ n (a) =y′′ (a)
.
p (n)
n (a) =y(n) (a)
518 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
Example18.5 Giveny(0) = 1,y ′ (0) = 1,y ′′ (0) = 1,y (3) (0) = −1,y (4) (0) = 2,obtainafourth-order
Taylor polynomial generated byyaboutx = 0.Estimatey(0.2).
Solution Inthis examplea = 0 and hence
p 4
(x) =y(0) +y ′ (0)x +y ′′ (0) x2
2! +y(3) (0) x3
3! +y(4) (0) x4
4!
=1+x+ x2
2 − x3
6 + x4
12
The Taylor polynomial can be used toestimatey(0.2):
p 4
(0.2) = 1 +0.2 + (0.2)2
2
y(0.2) ≈ 1.2188
− (0.2)3
6
+ (0.2)4
12
= 1.2188
Example18.6 (a) Calculatethe first-,second-,third-and fourth-orderTaylor polynomialsgenerated
byy(x) = e x aboutx = 0.
(b) Ploty = e x and the Taylor polynomials for −2 x2.
Solution (a) We have
and
y(x) =y ′ (x) =y ′′ (x) =y (3) (x) =y (4) (x) =e x
y(0) =y ′ (0) =y ′′ (0) =y (3) (0) =y (4) (0) =1
Thus the Taylor polynomials, p 1
(x),p 2
(x),p 3
(x) and p 4
(x), are given by
p 1
(x) =y(0)+y ′ (0)x =1+x
p 2
(x) =y(0) +y ′ (0)x +y ′′ (0) x2
2 =1+x+x2 2
p 3
(x) =y(0) +y ′ (0)x +y ′′ (0) x2
2 +y(3) (0) x3
3! =1+x+x2 2 + x3
6
p 4
(x) =y(0) +y ′ (0)x +y ′′ (0) x2
2 +y(3) (0) x3
3! +y(4) (0) x4
4!
=1+x+ x2
2 + x3
6 + x4
24
(b) The graphs ofy = e x and the Taylor polynomials are shown in Figure 18.5. Note
that the Taylor polynomials become better and better approximations to e x as the
order of the polynomial increases.
Example18.7 Given thatysatisfiesthe equation
y ′′ −(y ′ ) 2 +2y=x 2 (18.4)
and alsothe conditions
y(0)=1 y ′ (0)=2
18.4. Taylor polynomials of the nth order 519
y(x), p i (x)
8
7
6
5
4
3
2
p
1 2 (x)
0 y(x)
p 1 (x)
–1
–2.0 –1.5 –1.0 –0.5 0
x
(a)
y(x)
p 2 (x)
p 1 (x)
0.5 1.0 1.5 2.0
y(x), p i (x)
Figure18.5
Agraph ofy = e x andfour Taylor polynomials.
8
7
6
5
4
3
2
1
p 4 (x)
y(x)
0 p 3 (x)
–1
–2.0 –1.5 –1.0 –0.5 0
(b)
x
p 4 (x)
y(x)
p 3 (x)
0.5 1.0 1.5 2.0
use a third-order Taylor polynomial toestimatey(0.5).
Solution To write down the third-order Taylor polynomial about x = 0 we require y(0), y ′ (0),
y ′′ (0) andy (3) (0). FromEquation (18.4),
So,
y ′′ (x) =x 2 + {y ′ (x)} 2 −2y(x) (18.5)
y ′′ (0) =0+ {y ′ (0)} 2 −2y(0) =2
To findy (3) (0), Equation (18.5) is differentiated w.r.t.x:
y (3) (x) = 2x +2y ′ (x)y ′′ (x) −2y ′ (x)
y (3) (0) = 2y ′ (0)y ′′ (0) −2y ′ (0) =4
The Taylor polynomial may now be written as
p 3
(x) =y(0) +y ′ (0)x +y ′′ (0) x2
2 +y(3) (0) x3
6
=1+2x+x 2 + 2x3
3
We use p 3
(x) as anapproximation toy(x):
thatis,
p 3
(0.5) = 1 +1+0.25 +0.0833 = 2.333
y(0.5) ≈ 2.333
EXERCISES18.4
1 Afunction,y(x),hasy(0) = 3,y ′ (0) = 1,
y ′′ (0) = −1andy (3) (0) =2.
(a) Obtainathird-orderTaylor polynomial, p 3 (x),
generated byy(x) aboutx = 0.
(b) Estimatey(0.2).
2 Afunction,h(t),hash(2) = 1,h ′ (2) = 4,
h ′′ (2) = −2,h (3) (2) =1andh (4) (2) =3.
(a) Obtainafourth-orderTaylor polynomial, p 4 (t),
generated byh(t)aboutt = 2.
(b) Estimateh(1.8).
520 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
3 Giveny(x) = sinx,obtain the third-, fourth-and
fifth-orderTaylor polynomials generated byy(x)
aboutx = 0.
4 Giveny(x) = cosx, obtain the third-, fourth- and
fifth-orderTaylor polynomials generated byy(x)
aboutx = 0.
5 (a) Giveny(x) = sin(kx),k a constant, obtain the
third-, fourth-and fifth-orderTaylor polynomials
generated byy(x)aboutx = 0.
(b) Writedown the third-, fourth-and fifth-order
Taylor polynomials generated byy = sin(−x)
aboutx = 0.
6 (a) Giveny(x) = cos(kx),k aconstant, obtain the
third-, fourth-and fifth-orderTaylor polynomials
generated byy(x)aboutx = 0.
(b) Writedown the third-, fourth-and fifth-order
Taylor polynomials generated byy = cos(2x)
aboutx = 0.
7 If p n (x) isthenth-order Taylor polynomial generated
byy(x) aboutx = 0, showthat p n (kx) is thenth-order
Taylor polynomial generated byy(kx) aboutx = 0.
8 Thefunction,y(x),satisfiesthe equation
y ′′ =y+3x 2 y(1) =1,y ′ (1) =2
(a) Obtain athird-orderTaylor polynomial generated
byyaboutx = 1.
(b) Estimatey(1.3)using(a).
(c) Obtain afourth-orderTaylor polynomial
generated byyaboutx = 1.
(d) Estimatey(1.3)using(c).
9 Afunction,y(x),satisfiesthe equation
y ′′ +y 2 =x 3
y(0) =1,y ′ (0) = −1
(a) Estimatey(0.25)usingathird-orderTaylor
polynomial.
(b) Estimatey(0.25)usingafourth-orderTaylor
polynomial.
10 Afunction,y(x),hasy(1) = 3,y ′ (1) = 6,y ′′ (1) = 1
andy (3) (1) = −1.
(a) Estimatey(1.2)usingathird-orderTaylor
polynomial.
(b) Estimatey ′ (1.2)usingan appropriate
second-order Taylor polynomial.
[Hint: define anew variable,z,given byz =y ′ .]
Solutions
1 (a) 3+x− x2
2 + x3
3
2 (a)
t 4 8 − 5t3
6 +t2 +6t− 31
3
(b) 0.1589
3 p 3 (x)=x− x3
3! ,
p 4 (x) =x− x3
3! ,
p 5 (x) =x− x3
3! + x5
5!
4 p 3 (x)=1− x2
2! ,
p 4 (x) =1− x2
2! + x4
4! ,
p 5 (x) =1− x2
2! + x4
4!
(b) 3.1827
5 (a) p 3 (x) =kx− k3 x 3
3! ,
p 4 (x) =kx− k3 x 3
3! ,
p 5 (x) =kx− k3 x 3
3!
(b) p 3 (x) = −x + x3
3! ,
p 4 (x) = −x+ x3
3! ,
+ k5 x 5
5!
p 5 (x) = −x+ x3
3! − x5
5!
6 (a) p 3 (x) =1− k2 x 2
2! ,
p 4 (x) =1− k2 x 2
2!
p 5 (x) =1− k2 x 2
2!
+ k4 x 4
4! ,
+ k4 x 4
4!
18.5 Taylor’s formula and the remainder term 521
(b) p 3 (x) =1−2x 2 ,
p 4 (x) =1−2x 2 + 2x4
3 ,
p 5 (x) =1−2x 2 + 2x4
3
4x 3
8 (a)
3 −2x2 +2x− 1 3
(b) 1.816
5x 4
(c)
12 − x3
3 + x2
2 + x 3 + 1 12
(d) 1.8194
9 (a) 0.7240 (b) 0.7240
10 (a) 4.2187 (b) 6.18
TechnicalComputingExercises18.4
1 Drawy(x) = 1 for1 x8. On the same axesdraw
x
the fourth-orderTaylor polynomial generated byy(x)
aboutx = 3.
3 Drawy(x) = lnx for0.5 x10.On the same axes,
draw the third-, fourth- andfifth-orderTaylor
polynomials generatedbyy(x)aboutx = 1.
2 Drawy = tanx for −1 x1. Using the same axes,
drawthe fifth-orderTaylor polynomial generatedby
y(x)aboutx = 0.
18.5 TAYLOR’SFORMULAANDTHEREMAINDERTERM
So far we have found Taylor polynomials of orders 1, 2, 3, 4 and so on. Example 18.6
suggeststhatthegeneratingfunction,y(x),andtheTaylorpolynomialsareincloseagreementforvalues
ofxneartothe pointwherex =a. Itisreasonable toask:
‘HowaccuratelydoTaylorpolynomialsgeneratedbyy(x)atx =aapproximateto
y atvaluesofxotherthana?’
‘IfmoreandmoretermsareusedintheTaylorpolynomialwillthisproduceabetter
and better approximation toy?’
To answerthesequestionsweintroduceTaylor’s formulaand the remainder term.
Suppose p n
(x) is an nth-order Taylor polynomial generated by y(x) about x = a.
ThenTaylor’s formulastates:
y(x) = p n
(x) +R n
(x)
whereR n
(x) iscalled theremainderofordernand isgiven by
remainder ofordern =R n
(x) = y(n+1) (c)(x −a) n+1
for somenumbercbetweenaandx.
(n +1)!
Theremainderofordernisalsocalledtheerrorterm.Theerrortermineffectgives
thedifferencebetweenthefunction,y(x),andtheTaylorpolynomialgeneratedbyy(x).
ForaTaylorpolynomialtobeacloseapproximationtothegeneratingfunctionrequires
the error termtobesmall.
522 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
Example18.8 Calculate the error term of order 5 due to y(x) = e x generating a Taylor polynomial
aboutx = 0.
Solution Heren = 5 and son +1 = 6.Inthisexamplea = 0.We see thaty = e x and so
y ′ (x) =y ′′ (x) = ··· =y (5) (x) =y (6) (x) =e x
and soy (6) (c) = e c . The remainder termoforder 5,R 5
(x), isgiven by
R 5
(x) = ec x 6
6!
forsome numbercbetween 0 andx
Example18.9 (a) Calculatethe fourth-orderTaylor polynomial, p 4
(x), generatedbyy(x) = sin2x
aboutx = 0.
(b) State the fourth-order errorterm,R 4
(x).
(c) Calculate anupper bound forthis errortermgiven |x| < 1.
(d) Comparey(0.5) and p 4
(0.5).
Solution (a) y(x) =sin2x, y(0) = 0
y ′ (x)=2cos2x, y ′ (0) =2
y ′′ (x)=−4sin2x, y ′′ (0) =0
y (3) (x)=−8cos2x,
y (3) (0) = −8
y (4) (x)=16sin2x, y (4) (0) =0
Thefourth-orderTaylor polynomial, p 4
(x), is
p 4
(x) =y(0) +y ′ (0)x +y ′′ (0) x2
2! +y(3) (0) x3
3! +y(4) (0) x4
4!
=0+2x+0− 8x3
6 +0
=2x− 4x3
3
(b) Wenotethaty (5) (x) = 32cos2xandsoy (5) (c) = 32cos(2c).Theerrorterm,R 4
(x),
isgiven by
R 4
(x) =y (5) (c) x5
5!
= 32cos(2c)x5
120
= 4
15 cos(2c)x5 wherecisanumber between 0 andx
(c) In order to calculate an upper bound for this error term we note that |cos(2c)| 1
forany value ofc. Hence an upper bound forR 4
(x) isgiven by
|R 4
(x)|
∣
4x 5
15
∣
We know |x| < 1, and so 4x5
15
18.5 Taylor’s formula and the remainder term 523
4
is never greater than . Hence an upper bound for
15
the error term is 4 15 . If we use p 4
(x) to approximatey = sin2x the error will be no
greater than 4 provided |x| < 1.
15
(d) We letx = 0.5.
y(0.5) = sin1 = 0.8415
p 4
(0.5) = 2(0.5) − 4 3 (0.5)3 = 0.8333
Thedifferencebetweeny(0.5)andp 4
(0.5)canneverbegreaterthananupperbound
of the errortermevaluated atx = 0.5. This isverified numerically.
and
y(0.5) −p 4
(0.5) = 0.8415 −0.8333 = 0.0082
|R 4
(0.5)| 4
15 (0.5)5 = 0.0083
EXERCISES18.5
1 Thefunction,y(x),is given byy(x) = sinx.
(a) Calculate the fifth-orderTaylor polynomial
generated byyaboutx = 0.
(b) Find an expression forthe remainder term of
order 5.
(c) Statean upper bound foryour expression in (b).
2 RepeatQuestion1withy(x) = cosx.
3 Thefunctiony(x) = e x maybe approximated by the
quadraticexpression1 +x+ x2
2 .Findanupperbound
forthe error term given |x| < 0.5.
4 (a) Find the third-orderTaylor polynomial generated
byh(t) = 1 aboutt = 2.
t
(b) State the error term.
(c) Find an upper bound forthe errorterm given
1t4.
5 The functiony(x) =x 5 +x 6 isapproximated bya
third-orderTaylor polynomial aboutx = 1.
(a) Find an expression forthe third-ordererror term.
(b) Find an upper boundforthe errorterm given
0x2.
Solutions
1 (a) x− x3
3! + x5
5!
(b) R 5 =− (sinc)x6
6!
whereclies between 0 andx
x 6
(c)
∣6!
∣
2 (a) 1− x2
2! + x4
4!
(b) R 5 =− (cosc)x6
6!
whereclies between 0 andx
x 6
(c)
∣6!
∣
524 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
3 3.435 ×10 −2
4 (a) − t3
16 + t2 2 − 3t
2 +2
(t −2) 4
(b)
c 5 whereclies between 2 andt
(c) 0.5
5 (a) 5c(1 +3c)(x −1) 4 forcbetween1andx
(b) 70
18.6 TAYLORANDMACLAURINSERIES
We have seen that y and its first n derivatives evaluated at x = a match p n
(x) and its
firstnderivatives evaluated atx = a. We have studied the difference betweeny(x) and
p n
(x), thatisthe errorterm,R n
(x).
AsmoreandmoretermsareincludedintheTaylorpolynomial,weobtainaninfinite
series,called aTaylorseries. We denotethisinfiniteseries by p(x).
Taylor series:
p(x) =y(a) +y ′ (a)(x −a) +y ′′ (x −a)2
(a)
2!
+···+y (n) (x −a)n
(a) +···
n!
+y (3) (x −a)3
(a)
3!
ForsomeTaylorseries,thevalueoftheseriesequalsthevalueofthegeneratingfunction
for every value ofx. For example, the Taylor series for e x , sinx and cosx equal the
valuesofe x ,sinxandcosxforeveryvalueofx.However,somefunctionshaveaTaylor
series which equals the function only for a limited range of x values. Example 18.14
gives a case of a function which equals its Taylor series only when −1 <x<1.
To determine whether a Taylor series, p(x), is equal to its generating function,y(x),
we need to examine the error term of ordern, that isR n
(x). We examine this error term
as more and more terms are included in the Taylor polynomial, that is as n tends to
infinity. If this error term approaches 0 asnincreases, then the Taylor series equates to
thegeneratingfunction,y(x).Sometimestheerrortermapproaches 0asnincreasesfor
all values ofx, sometimes it approaches 0 only whenxlies in some specified interval,
say, for example, (−1,1). Hence wehave:
IfR n
(x) → 0 asn → ∞ for all values ofx, then the Taylor series, p(x), and the
generatingfunction,y(x), are equal forallvalues ofx.
IfR n
(x) → 0asn → ∞forvaluesofxintheinterval (α,β),thentheTaylor
series,p(x),andthegeneratingfunction,y(x),areequalforxvaluesontheinterval
(α,β). For values ofxoutside the interval (α,β), the values of p(x) andy(x) are
different.
By examining the error terms associated withy = e x ,y = sinx andy = cosx it is
possible to show that these errors all approach 0 asn → ∞ for all values ofx. Hence
the functionsy = e x ,y = sinx andy = cosx are allequal totheircorrespondingTaylor
series forall values ofx.
18.6 Taylor and Maclaurin series 525
Aspecial,commonlyused,caseofaTaylorseriesoccurswhena = 0.Thisisknown
as theMaclaurin series.
Maclaurin series:
p(x) =y(0) +y ′ (0)x +y ′′ (0) x2
2! +y(3) (0) x3
3! +···+y(n) (0) xn
n! +···
Example18.10 Determinethe Maclaurinseries fory = e x .
Solution Inthisexampley(x) = e x and clearlyy ′ (x) = e x too.Similarly,
y ′′ (x) =y (3) (x) = ··· =y (n) (x) =e x
forall values ofn. Evaluating atx = 0 yields
and so
y(0)=y ′ (0)=y ′′ (0)=···=y (n) (0)=1
p(x)=1+x+ x2
2! + x3
3! +···+xn n! +···
Asmentionedearlier,theseriesandthegeneratingfunctionareequalforallvaluesofx.
Hence,
e x =1+x+ x2
2! + x3
3! +···
forall values ofx, that is
∞∑
e x x n
=
n!
n=0
Example18.11 Obtain the Maclaurinseries fory(x) = sinx.
Solution y(x)=sinx, y(0)=0
y ′ (x)=cosx, y ′ (0)=1
y ′′ (x)=−sinx, y ′′ (0)=0
y (3) (x)=−cosx,
y (3) (0)=−1
y (4) (x)=sinx, y (4) (0)=0
p(x) =y(0) +y ′ (0)x + y′′ (0)x 2
2!
=x − x3
3! + x5
5! − x7
7! +···
∞∑ (−1) i x 2i+1
=
(2i +1)!
i=0
+ y(3) (0)x 3
3!
+···
526 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
Since the generating function and series are equal forall values ofxwehave
∞∑ (−1) i x 2i+1
sinx= =x − x3
(2i +1)! 3! + x5
5! − x7
7! +···
i=0
Example18.12 Obtain the Maclaurinseries fory(x) = cosx.
Solution y(x)=cosx, y(0)=1
y ′ (x)=−sinx, y ′ (0)=0
y ′′ (x)=−cosx,
y ′′ (0)=−1
y (3) (x)=sinx, y (3) (0)=0
y (4) (x)=cosx, y (4) (0)=1
and soon. Therefore
p(x) =y(0) +y ′ (0)x +y ′′ (0) x2
2! +y(3) (0) x3
3! +y(4) (0) x4
4! +···
= 1 − x2
2! + x4
4! −···
∞∑ (−1) i x 2i
=
(2i)!
i=0
Since the series and the generating function areequal for all values ofxthen
∞∑ (−1) i x 2i
cosx= = 1 − x2
(2i)! 2! + x4
4! − x6
6! +···
i=0
FromExamples18.10, 18.11and 18.12wenotethreeimportantMaclaurinseries:
e x =1+x+ x2
2! + x3
3! + x4
+ ··· forall values ofx
4!
sinx=x− x3
3! + x5
5! − x7
+ ··· forall values ofx
7!
cosx=1− x2
2! + x4
4! − x6
+ ··· forall values ofx
6!
Example18.13 Findthe Maclaurinseries forthe following functions:
( x
(a)y = e 2x (b)y = sin3x (c)y = cos
2)
Solution We use the previously stated series.
(a) We note that
e z =1+z+ z2
2! + z3
3! + z4
4! +···
18.6 Taylor and Maclaurin series 527
Substitutingz = 2x we obtain
(b) We note that
e 2x =1+2x+ (2x)2
2!
+ (2x)3
3!
+ (2x)4
4!
=1+2x+2x 2 + 4x3
3 + 2x4
3 +···
sinz=z− z3
3! + z5
5! − z7
7! +···
By putting z = 3x we obtain
(c) We note that
sin3x=3x− (3x)3
3!
+ (3x)5
5!
− (3x)7
7!
+···
+···
=3x− 9 2 x3 + 81
40 x5 − 243
560 x7 +···
cosz=1− z2
2! + z4
4! − z6
6! +···
Puttingz = x we obtain
2
( x
cos = 1 −
2) (x/2)2 + (x/2)4 − (x/2)6 +···
2! 4! 6!
= 1 − x2
8 + x4
384 − x6
46080 +···
Example18.14 Determinethe Maclaurinseries fory(x) = 1
1 +x .
Solution The value ofyand its derivatives atx = 0 arefound.
y(x) = 1
1 +x , y(0)=1
y ′ (x) =
−1
(1 +x) 2, y′ (0)=−1
y ′′ 2!
(x) =
(1 +x) 3, y′′ (0) =2!
y ′′′ (x) = −3!
(1 +x) 4, y′′′ (0) = −3!
.
.
y (n) (x) = (−1)n n!
(1 +x) n+1, y(n) (0) = (−1) n n!
Hence usingthe formulafor the Maclaurin series wefind
p(x) = 1 −1(x) +2! x2
2! −3!x3 3! +···+ (−1)n n! xn
n! +···
=1−x+x 2 −x 3 +x 4 −···+(−1) n x n +···
528 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
1
Itcan beshown thatthisseries converges to for |x| < 1.Hence,
1 +x
1
∞∑
1 +x =1−x+x2 −x 3 +···= (−1) n x n for |x| < 1
0
1
Forvaluesofxoutside (−1,1)thevaluesof
1 +x and ∑ (−1) n x n aresimplynotequal;
try evaluating the l.h.s. and r.h.s.with,say,x = −2.
Example18.15 Findthe Taylor series fory(x) = e −x aboutx = 1.
Solution y=e −x , y(1)=e −1
y ′ = −e −x ,
y ′′ =e −x ,
and soon. Hence,
y ′ (1) = −e −1
y ′′ (1) =e −1
e −x = e −1 − (e −1 −1
(x −1)2
)(x −1) +e
2!
{
= e −1 (x −1)2
1 −(x−1)+ −
2!
∑ ∞
e −x = e −1 n
(x −1)n
(−1)
n!
0
−1
(x −1)3
−e
3!
}
(x −1)3
3!
+···
+···
Example18.16 Findthe Maclaurinseries fory(x) =xcosx.
Solution TheMaclaurinseries, p(x), forcosx is
cosx=p(x)=1− x2
2! + x4
4! − x6
6! +···
So the Maclaurin series forxcosx isxp(x), that is
xcosx=xp(x)=x− x3
2! + x5
4! − x7
6! +···
An alternative form of Taylor series is often used in numerical analysis. We know
that the Taylor series fory(x) generatedaboutx =aisgiven by
y(x) =y(a) +y ′ (a)(x −a) +y ′′ (x −a)2
(a) +y (3) (x −a)3
(a) +···
2! 3!
Replacingabyx 0
we obtain
y(x) =y(x 0
) +y ′ (x 0
)(x −x 0
) +y ′′ (x 0
) (x−x 0 )2
2!
If we now letx−x 0
=h, wesee that
+y (3) (x 0
) (x−x 0 )3
3!
y(x 0
+h) =y(x 0
) +y ′ (x 0
)h +y ′′ (x 0
) h2
2! +y(3) (x 0
) h3
3! +···
To interpret thisformof Taylor series werefer toFigure 18.6.
+···
18.6 Taylor and Maclaurin series 529
y
y(x)
x 0 x 0 +h
x
Figure18.6
Thevaluey(x 0 +h) can be estimated usingvalues ofy
and itsderivatives atx =x 0 .
Ifyand its derivatives are known whenx =x 0
, then the Taylor series can be used to
findyatanearbypoint,wherex =x 0
+h.ThisformofTaylorseriesisusedinnumerical
methods of solving differential equations.
Engineeringapplication18.5
Electricfieldofanelectrostaticdipole
Engineersoftenneedtocalculatetheelectricalfieldduetoseveralstationaryelectric
charges.Theseareknownaselectrostaticproblems.Oneofthesimplestelectrostatic
configurations is that of two charges of opposite polarity separated by a distance,d.
Thisarrangementisknownasanelectrostaticdipole.ItisillustratedinFigure18.7
forcharges +Q and −Q.
The origin of the x axis is located midway between the two charges so that the
charge −Q has coordinate −d/2 and the charge +Qhas coordinated/2.
d
–Q 0 +Q
x
Figure18.7
Electrostatic dipole with two point charges of −Q
and +Qcoulombs respectively.
We wish to calculate the combined electric field of the two charges as a function
of the distance along thexaxis.
The electricfield,E, ofasinglecharge isgiven by
E =
q
4πε 0
r 2
where q is the charge in coulombs, ε 0
is the permittivity of free space (a constant
of approximately 8.85 ×10 −12 F m −1 ), andris the distance from the charge to the
measuring point.
The electricfield atpointxdue tothe left-hand charge is
E LEFT
=
−Q
(
4πε 0
x + d ) 2
2
and forthe right-hand charge the electric field atpointxis
E RIGHT
=
+Q
(
4πε 0
x − d ) 2
2
➔
530 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
Thetotalelectricfield,E T
,canbeobtainedbyaddingthetwocontributionstogether:
E T
=E LEFT
+E RIGHT
=
⎡
= Q ⎣
4πε 0
1
(
x − d 2
−Q Q
(
4πε 0
x + d ) 2
+ (
4πε
2 0
x − d ) 2
2
) 2
−
1
⎤
(
x + d ) 2 ⎦
2
It is possible to gain further insight into the properties of the electrostatic dipole by
carrying out a power series expansion. To prepare the equation we take a factor of
1/x 2 outside of the brackets
E T
=
⎡
Q ⎣
4πε 0
x 2
1
(
1 − d 2x
) 2
−
1
(
1 + d 2x
) 2
⎤
⎦
1
Now consider the Maclaurin series of
(1−α) 2.Following
the process explained in
the previous examples, the first five termsare calculated:
1
y(α) =
(1−α) 2, y(0)=1
y ′ 2!
(α) =
(1−α) 3, y′ (0)=2!
y ′′ 3!
(α) =
(1−α) 4, y′′ (0) =3!
y (3) 4!
(α) =
(1−α) 5, y(3) (0) =4!
y (4) 5!
(α) =
(1−α) 6, y(4) (0) =5!
1
So, = 1+2!α+3! α2
(1−α) 2 2! +4!α3 3! +5!α4 4! +··· = 1+2α+3α2 +4α 3 +
5α 4 +···.
Following a similarprocess for the Maclaurin series of 1/(1 + α) 2 ,
1
y(α) =
(1+α) 2, y(0)=1
y ′ (α) = −2!
(1+α) 3, y′ (0) = −2!
y ′′ 3!
(α) =
(1+α) 4, y′′ (0) =3!
y (3) (α) = −4!
(1+α) 5, y(3) (0) = −4!
y (4) 5!
(α) =
(1+α) 6, y(4) (0) =5!
1
weobtain =1+ (−2!)α +3!α2
(1+α)
2
2! + (−4!)α3 3! +5!α4 4! +···
=1−2α+3α 2 −4α 3 +5α 4 +···.
These two expansions may be used to approximate the total electric field by substituting
α =d/(2x).
18.6 Taylor and Maclaurin series 531
The total electricfield isthus
E T
=
Q
+4α 3 +5α 4 +···)
4πε 0
x 2[(1+2α+3α2
−(1−2α+3α 2 −4α 3 +5α 4 +···)]
E T
=
Q +···]
4πε 0
x 2[4α+8α3
Providing x is much larger than d, then d/(2x), that is α is small, and the higher
order terms in the series become increasingly small. Thus we can approximate the
field using only the first term,
E T
∼ =
Q
4πε 0
x 2[4α] =
Q
4πε 0
x 24 d 2x =
Qd
2πε 0
x 3
It can now be seen that the total electric field of the dipole decays asx −3 rather than
x −2 , as would be the case for a singlepoint charge.
EXERCISES18.6
1 Usethe Maclaurin series forsinx to write down the
Maclaurinseries forsin5x.
2 Usethe Maclaurin series forcosx to write down the
Maclaurinseries forcos3x.
3 Usethe Maclaurin series fore x to write down the
Maclaurinseries for 1 e x.
4 Find the Taylor seriesfory(x) = √ x aboutx = 1.
5 (a) Findthe Maclaurin seriesfor
y(x) =x 2 +sinx.
(b) Deduce the Maclaurin series for
y(x) =x n +sinx forany positiveintegern.
6 (a) Obtainthe Maclaurinseries fory(x) =xe x .
(b) Statethe range ofvalues ofxforwhichy(x)
equals itsMaclaurin series.
7 Find the Taylor series fory(x) =x +e x aboutx = 1.
8 Find the Maclaurin seriesfory(x) = ln(1 +x).
Solutions
1 5x− (5x)3
3!
2 1 − (3x)2
2!
+ (5x)5
5!
+ (3x)4
4!
3 1−x+ x2
2! − x3
3! + x4
4! −···
(x −1)2
8
4 1 + x −1 −
2
5(x −1)4
− +···
128
5 (a) x+x 2 − x3
3! + x5
5! − x7
−···
(b) x n +x− x3
3! + x5
5! − x7
7! +···
− (3x)6 +··· 6 (a) x+x 2 + x3
6!
2! + x4
3! + x5
4! +···
(b) Forallvalues ofx
7 p(x) = (1 +e)x
(x −1)3
{
+ (x −1)
16
2 (x −1)3
+e + +
2! 3!
7! +··· 8 x − x2
2 + x3
3 − x4
4 + x5
5 −···
(x −1)4
4!
+···
}
532 Chapter 18 Taylor polynomials, Taylor series and Maclaurin series
REVIEWEXERCISES18
1 Afunction, f (t),and itsfirstand second derivatives (a) Calculatey ′ (1),y ′′ (1)andy (3) (1).
dy
dx + y2
2 =xy y(1)=2 sin(kx)
(c) lim 1 −cos(kx)
are evaluated att = 3. Find the second-order Taylor
polynomial generated by f aboutt = 3.
(b) Statethe third-orderTaylor polynomial generated
byy(x)aboutx = 1.
f(3)=2 f ′ (3)=−1 f ′′ (3)=1 (c) Estimatey(1.25).
2 Afunction,s(t),andits firstderivative are evaluated
att =3;s(3) =4,s ′ (3) = −1.
(a) Find the first-orderTaylor polynomial generated
bysatt=3.
(b) Estimates(2.9)ands(3.2).
(c) Ifadditionally weknows ′′ (3) = 1, use a
second-order Taylor polynomial to estimate
s(2.9)ands(3.2).
3 Obtain alinear approximation to
y(t) =at 4 +bt 3 +ct 2 +dt +earoundt =1.
4 Find a quadratic approximation toy =x 3 nearx = 2.
5 Find linear approximations to
8 (a) Findthe third-orderTaylor polynomial generated
byy(x) = e −x aboutx = 1.
(b) Statethe third-ordererror term.
(c) Findan upperbound forthe error termgiven
|x| < 1.
9 (a) Findaquadraticapproximation toy(x) = sin 2 x
aboutx = 0.
(b) Statethe remainder termoforder2.
(c) Statean upper boundforthe remainderterm
given |x| < 0.5.
10 (a) Findacubicapproximation toy(x) =xcosx
aboutx = 0.
(b) Statethe error termoforder3.
(a) z(t) =e t neart =1
(c) Statean upper boundforthe errorterm given
(b) w(t) =sin3t neart =1
|x| < 0.25.
(c) v(t) =e t +sin3tneart =1
6 Thefunction,y(x),satisfiesthe equation
y ′′ +xy ′ −3y=x 2 +1
11 Giventhaty(x) =x 2 ,
(a) Calculate the Taylor seriesofy(x) aboutx =a.
(b) Calculate the Maclaurin series ofy(x).
y(0) =1 y ′ (0) =2
12 Findthe Maclaurinexpansion ofy(x) = ln(2 +x) up
(a) Evaluatey ′′ (0).
to andincluding the terminx 4 .
(b) Differentiate the equation.
(c) Evaluatey (3) (0).
13 Byconsideringthe Maclaurinexpansions ofsin(kx)
andcos(kx),kconstant,evaluate ifpossible
(d) Write down acubic approximation fory(x).
sin(kx)
(a) lim
(e) Estimatey(0.5).
x→0 x
7 Given thatysatisfiesthe equation
cos(kx) −1
(b) lim x→0 x
Solutions
1
t 2 2 −4t+19 2
2 (a) −t +7 (b) 4.1, 3.8 (c) 4.105, 3.82
3 (4a+3b+2c+d)t−3a−2b−c+e
4 6x 2 −12x+8
5 (a) et (b) −2.97t +3.11
(c) −0.25t +3.11
6 (a) 4 (b) y ′′′ +xy ′′ −2y ′ = 2x
(c) 4 (d) 1+2x+2x 2 + 2 3 x3
(e) 2.5833
Review exercises 18 533
7 (a) 0, 2, −2 (b) − x3
3 +2x2 −3x+ 10
3
(c) 2.0573
{
}
8 (a)e −1 − x3
6 +x2 − 5x
2 + 8 3
(b) e−c (x −1) 4
4!
(c) 2e
3
9 (a) x 2
whereclies between 1 andx
(b) − 2sin(2c)x3 forcbetween 0 andx
3
(c) 0.07
10 (a) x− x3
2!
(b) (4sinc +ccosc) x4 forcbetween 0 andx
4!
(c) 2 ×10 −4
11 (a) x 2 (b) x 2
12 ln2+ x 2 − x2
8 + x3
24 − x4
64 +···
13 (a) k (b) 0 (c) notdefined
19 Ordinarydifferential
equationsI
Contents 19.1 Introduction 534
19.2 Basicdefinitions 535
19.3 First-orderequations:simpleequationsandseparationofvariables 540
19.4 First-orderlinearequations:useofanintegratingfactor 547
19.5 Second-orderlinearequations 558
19.6 Constantcoefficientequations 560
19.7 Seriessolutionofdifferentialequations 584
19.8 Bessel’sequationandBesselfunctions 587
Reviewexercises19 601
19.1 INTRODUCTION
Thesolutionofproblemsconcerningthemotionofobjects,theflowofchargedparticles,
heat transport, etc.,often involves discussion of relations of the form
d 2 x
dt 2 +6dx dt +2x=3t or dq
dt +8q=sint
In the first equation,xmight represent distance. For this case dx is the rate of change
dt
of distance with respect to timet, that is speed, and d2 x
represents acceleration. In the
dt2 second equation,qmight be charge and dq the rate of flow of charge, that is current.
dt
These are examples of differential equations, so called because they are equations involving
the derivatives of various quantities. Such equations arise out of situations in
whichchangeisoccurring.Tosolvesuchadifferentialequationmeanstofindthefunctionx(t)orq(t)whenwearegiventhedifferentialequation.Solutionstotheseequations
maybeanalyticalinthatwecanwritedownananswerintermsofcommonelementary
functions such as e t , sint and so on. Alternatively, the problem may be so difficult that
only numerical methodsare available, which produce approximate solutions.
19.2 Basic definitions 535
In engineering, differential equations are most commonly used to model dynamic
systems. These are systems which change with time. Examples include an electronic
circuit with time-dependent currents and voltages, a chemical production line in which
pressures, tank levels, flow rates, etc., vary with time, and a semiconductor device in
which hole and electron densities change with time.
19.2 BASICDEFINITIONS
Inordertosolveadifferentialequationitisimportanttoidentifycertainfeatures.Recall
fromChapter2thatinafunctionsuchasy =x 2 +3xwesayxistheindependentvariable
andyis the dependent variable since the value ofydepends upon the choice we have
made forx.
Inadifferential equation suchas
dy
dx −2y=3x2
x isthe independent variable, andyisthe dependent variable.
Similarly, forthe differential equation
dx
dt +7x=et
t isthe independent variable andxisthe dependent variable.
We see thatthe variable being differentiated isthe dependent variable.
Before classifying differential equations, wewill derive one.
Engineeringapplication19.1
AnRC chargingcircuit
Consider the RC circuit of Figure 19.1. Suppose we wish to derive a differential
equation which models the circuit so that we can determine the voltage across the
capacitor at any time,t. Clearly there are two different cases corresponding to the
switchbeingopenandtheswitchbeingclosed.Wewillconcentrateonthelatterand
for convenience assume that the switch is closed att = 0. From Kirchhoff’s voltage
law wehave
v S
=v R
+v C
thatis v R
=v S
−v C
where v S
isthevoltageofthesupply, v C
isthevoltageacrossthecapacitor,and v R
is
the voltageacrossthe resistor.Using Ohm’slaw forthe resistor thengives
i = v S −v C
R
whereiisthecurrentflowinginthecircuitaftertheswitchisclosed.Forthecapacitor
i =C dv C
dt
Combining these equations gives
C dv C
dt
= v S −v C
R
➔
536 Chapter 19 Ordinary differential equations I
y R
y S
+
–
i
R
C
y C
Figure19.1
AnRC charging circuit.
and hence
RC dv C
+v
dt C
=v S
This isthedifferential equation which models the variation involtageacross thecapacitorwithtime.Here
v C
isthedependentvariableandt istheindependentvariable,
and when we are required to solve this differential equation we must attempt to find
v C
as a function oft.
19.2.1 Order
Differential equations which have features in common are often grouped together and
givencertainclassificationsanditisusuallythecasethatappropriatemethodsofsolution
depend upon the classifications. Someimportantterminology isnow given.
Theorder ofadifferential equation isthe orderofits highestderivative.
Example19.1 State the orderof
(a) d2 y
dx 2 + dy
dx =x
dx
(b)
dt = (xt)5
Solution (a) The highest derivative is d2 y
dx2, a second derivative. The order istherefore two.
(b) The only derivative appearing is dx , a first derivative. The order istherefore one.
dt
19.2.2 Linearity
Recall that in a differential equation such as dy
dx +3y =x2 , the independent variable is
x and the dependent variable isy.
Adifferential equation issaidtobelinearif:
(1) the dependent variable and itsderivatives occur tothe first power only,
(2) thereare no products involving the dependent variable with its derivatives, and
(3) there are no non-linear functions of the dependent variable such as sine, exponential,etc.
Ifanequationisnotlinear,thenitissaidtobenon-linear.Notefrom(2)thataproduct
of terms involving the dependent variable such asy dy is non-linear. Note from (3) that
dx
the existence of termssuch asy 2 , siny and e y causes anequation tobe non-linear.
19.2 Basic definitions 537
Note also that the conditions for linearity are conditions on the dependent variable.
The linearity of a differential equation is not determined or affected by the presence of
non-linear terms involving the independent variable.
The distinction between a linear and a non-linear differential equation is important
because the methods of solution depend upon whether an equation is linear or nonlinear.
Furthermore, it is usually the case that a linear differential equation is easier to
solve.
Example19.2 Decide whether or notthe following equations are linear:
(a) sinx dy
dx +y=x
dx
(b)
dt +x=t3
d 2 y
(c)
dx 2 +y2 =0
dy
(d)
dx +siny=0
Solution In (a), (c) and (d) the dependent variable isy, and the independent variable isx. In (b)
the dependent variable isxand the independent variable ist.
(a) This equation islinear.
(b) Thisequationislinear.Itdoesnotmatterthatthetermint,theindependentvariable,
israised tothe power 3.
(c) This equation isnon-linear, the non-linearity arising through the termy 2 .
(d) This equation isnon-linear, the non-linearity arising through the termsiny.
19.2.3 Thesolutionofadifferentialequation
The solution of a differential equation is a relationship between the dependent and independent
variables such that the differential equation is satisfied for all values of the
independentvariable over a specified domain.
Example19.3 Verifythaty = e x isasolution ofthe differential equation
dy
dx =y
Solution If y = e x then dy
dx = ex . For all values of x, we see that dy
dx =yandsoy=ex isa
solution.Note alsothatthisequation isfirst order and linear.
Therearefrequentlymanydifferentfunctionswhichsatisfyadifferentialequation;that
is,therearemanysolutions.Thegeneralsolutionembracesalloftheseandallpossible
solutionscan beobtained from it.
538 Chapter 19 Ordinary differential equations I
Example19.4 Verifythaty =Ce x is a solution of dy =y, whereC is any constant.
dx
Solution If y = Ce x , then dy
dx = Cex . Therefore, for all values of x, dy = y and the equation
dx
is satisfied for any constantC;C is called an arbitrary constant and by varying it, all
possible solutions can be obtained. For example, by choosingC to be 1, we obtain the
solution of the previous example. Infact,y =Ce x isthe general solution of dy
dx =y.
More generally, to determine C we require more information given in the form of a
condition.Forexample,ifwearetoldthat,atx = 0,y = 4thenfromy =Ce x wehave
4=Ce 0 =C
so thatC = 4. Therefore y = 4e x is the solution of the differential equation which
additionally satisfies the condition y(0) = 4. This is called a particular solution. In
general, application of conditions to the general solution yields the particular solution.
Toobtainaparticularsolution,thenumberofgivenindependentconditionsmustbethe
same asthe number ofconstants.
Consider the following example.
Example19.5 Consider the second-order differential equation
d 2 y
dx 2 +y=0
The general solution of this equation can be shown tobe
y=Acosx+Bsinx
whereAandBarearbitraryconstants.
Find the particular solution which satisfies the conditions
(a) whenx = 0,theny = 0,and
(b) whenx = 3π 2 ,theny=1.
Solution Wenotethatbecausethegeneralsolutionhastwoarbitraryconstants,AandB,thentwo
conditions are necessary to obtain a particular solution. Applying the first condition to
the general solution gives
0=Acos0+Bsin0
=A
ThereforeA = 0,andthesolutionreducestoy =Bsinx.Applyingthesecondcondition
we find
1=Bsin 3π 2
19.2 Basic definitions 539
fromwhich
1 =B(−1)
B=−1
The particular solution becomesy = −sinx.
Sometimes the conditions involve derivatives.
Example19.6 ForthedifferentialequationofExample19.5findtheparticularsolutionwhichsatisfies
the conditions
(a) whenx = 0,theny = 0,and
(b) whenx = 0,then dy
dx = 5.
Solution Application ofthe first condition tothe general solutiony =Acosx +Bsinx gives
0=Acos0+Bsin0
=A
ThereforeA = 0 and the solution becomesy = Bsinx. To apply the second condition
wemustdifferentiatey:
dy
dx =Bcosx
Then applying the second condition weget
5=Bcos0
=B
so thatB = 5.Finallythe required particular solution isy = 5sinx.
In this example both conditions have been specified atx = 0, and are often referred
toas initialconditions.
EXERCISES19.2
1 Verifythaty = 3sin2xisasolutionof d2 y
dx 2 +4y=0.
2 Verifythat 3e x ,Axe x ,Axe x +Be x ,whereA,Bare
constants,allsatisfythe differentialequation
d 2 y
dx 2 −2dy dx +y=0
3 Verifythatx =t 2 +Alnt +Bis asolution of
t d2 x
dt 2 + dx
dt = 4t
4 Verify thaty =Acosx +Bsinx satisfiesthe
differentialequation
d 2 y
dx 2 +y=0
Verify alsothaty =Acosx andy =Bsinx each
individually satisfythe equation.
5 Ify =Ae 2x is the general solution of dy = 2y, find
dx
the particular solution satisfyingy(0) = 3.What is
the particular solution satisfying dy
dx =2whenx=0?
540 Chapter 19 Ordinary differential equations I
6 Identifythe dependent andindependent variables of
the followingdifferential equations. Give the order of
the equations and state which are linear.
(a)
(b)
(c)
dy
dx +9y=0
( )( )
dy d 2 y
dx dx 2 +3 dy
dx = 0
d 3 x
dt 3 +5dx dt =sinx
7 Show thatx(t) = 7cos3t −2sin2t isasolution of
d 2 x
dt 2 +2x=−49cos3t+4sin2t
8 Thegeneral solution ofthe equation
d 2 x
dt 2 −3dx +2x = 0 is given by
dt
x=Ae t +Be 2t
Findthe particularsolution whichsatisfiesx = 3 and
dx
dt =5whent=0.
9 Thegeneral solution of d2 y
dx 2 −2dy dx +y=0is
y =Axe x +Be x .Findthe particular solution
satisfyingy(0) = 0, dy (0) =1.
dx
10 Thegeneral solution of d2 x
dt 2 = −ω2 xis
x =Ae jωt +Be −jωt ,where j 2 = −1.Verify that this
isindeed asolution. What is the particular solution
satisfyingx(0) = 0, dx (0) = 1? Expressthe general
dt
solution and the particular solution in terms of
trigonometric functions.
Solutions
5 y(x) =3e 2x ,y(x) =e 2x
6 (a) y isthe dependentvariable;xisthe independent
variable;firstorder, linear
(b) y isthe dependentvariable;xisthe independent
variable;second order,non-linear
(c) x is the dependent variable;t is the independent
variable;thirdorder,non-linear.
8 x=e t +2e 2t
9 y=xe x
10 particularsolution: sin ωt/ω;general solution:
(A +B)cosωt + (A −B)jsinωt
19.3 FIRST-ORDEREQUATIONS:SIMPLEEQUATIONS
ANDSEPARATIONOFVARIABLES
19.3.1 Simpleequations
Thesimplest first-orderequationstodealwith arethose ofthe form
dy
dx =f(x)
where the r.h.s. is a function of the independent variable only. No special treatment is
necessary and direct integration yieldsyas a function ofx, thatis
∫
y = f(x)dx
19.3 First-order equations: simple equations and separation of variables 541
Example19.7 Find the general solution of dy
dx = 3cos2x.
Solution Given that dy
dx = 3cos2x, theny = ∫ 3cos2xdx = 3 sin2x +C. This is the required
2
general solution.
If dy
dx =f(x)theny= ∫
f(x)dx.
19.3.2 Separationofvariables
When the function f on the r.h.s. of the equation depends upon both independent and
dependentvariablestheapproachofSection19.3.1isnotpossible.However,first-order
equations which can be writteninthe form
dy
= f(x)g(y) (19.1)
dx
forman important class known asseparable equations. For example,
dy
dx = 3x2 e −4y
isaseparable equation forwhich
f(x) = 3x 2 and g(y) = e −4y
To obtain a solution wefirst divide both sides of Equation (19.1) byg(y) togive
1 dy
g(y) dx =f(x)
Integrating both sides with respecttoxyields
∫ ∫ ∫
1 dy
g(y) dx dx = 1
g(y) dy = f(x)dx
Theequationisthensaidtobeseparated.Ifthelasttwointegralscanbefound,weobtain
arelationshipbetweenyandx,althoughitisnotalwayspossibletowriteyexplicitlyin
termsofxasthe following examples will show.
Separationofvariables:
Thesolution of dy = f (x)g(y) isfound from
dx
∫ ∫
1
g(y) dy = f(x)dx
542 Chapter 19 Ordinary differential equations I
Example19.8 Solve dy
dx = e−x
y .
Solution Here f (x) = e −x andg(y) = 1 . Multiplication through byyyields
y
y dy
dx = e−x
Integration of both sides with respect toxgives
∫ ∫
ydy= e −x dx
so that
y 2
2 =−e−x +C
Note that the constants arising from the two integrals have been combined to give a
single constant,C. Finally we can rearrange this expression togiveyinterms ofx:
that is,
y 2 =−2e −x +2C
y=± √ D−2e −x
whereD = 2C.
It is important to stress that the constant of integration must be inserted at the stage at
which the integration is actually carried out, and not simply added to the answer at the
end.
Example19.9 Solve dy
dx = 3x2 e −y subjecttoy(0) = 1.
Solution Hereg(y) = e −y and f (x) = 3x 2 . Separating the variables and integrating wefind
∫ ∫
e y dy= 3x 2 dx
so thate y =x 3 +C. Imposing the initial conditiony(0) = 1 wefind
e 1 =(0) 3 +C
so thatC = e.Therefore,
e y =x 3 +e
Note that since the exponential function is always positive, the solution will be valid
only forx 3 +e > 0.Taking natural logarithms gives the particular solution explicitly:
y =ln(x 3 +e)
19.3 First-order equations: simple equations and separation of variables 543
Example19.10 Solve
dx
dt = t2 +1
x 2 +1
Solution Separating the variables and integrating we find
∫ ∫
x 2 +1dx= t 2 +1dt
Therefore,
x 3
3 +x = t3 3 +t+C
which is the general solution. Here we note that x has not been obtained explicitly in
terms oft, although we have found a relationship between x andt which satisfies the
differential equation. To obtain the value of x at any givent it would be necessary to
solve the cubic equation.
Sometimes,equationswhicharenotimmediatelyseparablecanbereducedtoseparable
formby anappropriatesubstitution asthe following example shows.
Example19.11 Bymeansofthe substitutionz = y , solve the equation
x
dy
dx = y2
x + y +1 (19.2)
2 x
Solution Ifz = y theny = zx. Because the solution,y, is a function ofxthe variablezdepends
x
uponxalso. The product rulegives dy
dx =z+xdz , so thatEquation (19.2) becomes
dx
z +x dz
dx =z2 +z+1
thatis,
x dz
dx =z2 +1
Thisnewequationhasindependentvariablexanddependentvariablez,andisseparable.
We find
∫
∫
dz dx
z 2 +1 = x
so thattan −1 z = ln|x| +C. WritingC = ln |D|wehave
tan −1 z = ln|x| +ln|D| = ln|Dx|
so that z = tan(ln|Dx|). Returning to the original variables we see that the general
solution is
y =zx =xtan(ln|Dx|)
544 Chapter 19 Ordinary differential equations I
Engineeringapplication19.2
RLcircuitwithstepinput
Writedownthedifferentialequationgoverningthecurrent,i,flowingintheRLcircuit
shown in Figure 19.2 when a stepvoltage of magnitudeE isapplied tothe circuitat
t = 0.Solvethisdifferentialequationtoobtaini(t).Assumethatwhent = 0,i = 0.
Solution
Applying Kirchhoff’s voltage law and Ohm’s law tothe circuitwefind
iR+L di =E
dt
for t0
thatis,
L di
dt =E−iR
so that
∫ ∫
L
E−iR di = dt
R
L
i
E/R
i
Closed
at t = 0
+ –
E
Figure19.2
Astepvoltage isapplied to the circuitat
t=0.
0.63E/R
t
Figure19.3
Responseofthe circuitofFigure 19.2 to
astepinput.
NoteinparticularthatinthisequationL,E andRareconstantsandsothevariablesi
andt have been separated. If the applied voltage,E, varied with time this would not
havebeenthecasesincethel.h.s.wouldcontaintermsdependentupont.Integrating,
wefind ∫
∫
L −L
di =
E−iR R
−R
E−iR di=−L R ln(E−iR)=t+C
Tofindtheconstantofintegration,C,aconditionisrequired.Thephysicalcondition
i = 0 att = 0 provides this. Applyingi = 0 whent = 0 we find
C = − L R lnE
Substituting thisvalue gives
− L R ln(E−iR)=t−L R lnE
Thus,
L
R [lnE−ln(E−iR)]=t
t
19.3 First-order equations: simple equations and separation of variables 545
so that
L
R ln E
E−iR =t
Then
E
ln
E−iR = Rt
L
hence
E
E−iR = eRt/L
Rearranging toobtainE gives
hence
and so
so
E =(E−iR)e Rt/L
E =Ee Rt/L −iRe Rt/L
iRe Rt/L =E ( e Rt/L −1 )
iR=E ( 1 −e −Rt/L)
Finally we have
i = E ( )
1 −e
−Rt/L
R
ThegraphofthiscurrentagainsttimeisshowninFigure19.2.Wenotethatast → ∞,
i → E R .Therateatwhichthecurrentincreasestowardsitsfinalvaluedependsupon
thevaluesofthecomponentsRandL.Itiscommontodefineatimeconstant, τ,for
the circuit. Inthis case τ = L and the equation forthe current can be written as
R
i = E R
(
1 −e
−t/τ )
The smaller the value of τ, the more rapidly the current reaches its final value. It is
possibletoestimateavalueof τ fromalaboratorytestcurvebynotingthatafterone
time constant, thatist = τ,ihas reached 1 −e −1 ≈ 0.63 of its final value.
EXERCISES19.3
1 Find the general solution ofthe followingequations:
(a)
(c)
dy
dx =3
dy
dx = 2x
(b) dx
dt = 5
(d) dy
dt = 6t
(e)
(g)
(i)
dy
dx = 8x2
dy
dx = x2
y
dx
dt = et
x
(f)
(h)
(j)
dx
dt =3t3
dx
dt = t3
x 2
dy
dx = e−2x
y 2
546 Chapter 19 Ordinary differential equations I
dy
(k)
dx = 6sinx dx
(l)
y dt = 9cos4t
dy
x 2
(b)
dx = −ky, k constant
dx
(m)
dt = 3cos2t+8sin4t
dy
(c)
x 2 dx =y2
+x
(d) y dy
2 Find the particular solution ofthe following
dx =sinx
equations:
(e) y dy
dx
dx =x+2
(a) =3t, x(0) =1
dt (f) x 2dy
dy
(b)
dx = 6x2
dx =2y2 +yx
y , y(0)=1
dx
(g)
dy
(c)
dt = 3sint
dt = t4
x 5
, y(0)=2
y
5 Findthe general solutionsofthe followingequations:
dy
(d)
dx = e−x
y , y(0)=3
dx
(a)
dt =xt (b) dy
dx = x y
dx
(e)
dt = 4sint+6cos2t , x(0)=2 (c) t dx
x
dt =tanx
3 Find the general solution of dx
dx
= lnt.Find the (d)
dt dt = x2 −1
t
particular solution satisfyingx(1) = 1.
6 Findthe general solution ofthe equation
4 Find the general solutionsofthe following equations: dx
=t(x −2).Findthe particular solution which
dy
(a)
dx =kx, k constant dt
satisfiesx(0) = 5.
Solutions
1 (a) y=3x+c (b) x=5t+c 3 tln|t|−t+c,tln|t|−t+2
(c) y=x 2 +c (d) y=3t 2 +c
(e) y= 8 3 x3 +c (f) x= 3 4 (a) kx2
2 +c
4 t4 +c
(b)Ae −kx
y 2
(g)
2 = x3
3 +c (h) x 3
3 = t4 4 +c (c) − 1
x +c
x 2 y 3 (i)
2 =et e−2x
+c (j) =c − (d)y 2 =2(c −cosx)
3 2
y 2
(k)
2 =c−6cosx (l) x 3
3 = 9 (e)y 2 =x 2 +4x+c
4 sin4t+c x
(f)
x 3
(m)
3 + x2
2 = 3 A−2ln|x|
2 sin2t−2cos4t+c
2 (a) x= 3 (g) x6
2 t2 +1 (b) y 2 =4x 3 6 = t5 5 +c
+1
5 (a) x=Ae t2 /2 (b) y 2 =x 2 +c
y 2
(c)
2 =5−3cost (d) y2 =11−2e −x
(c) x = sin −1 (kt) (d) x = 1+At2
x 2
1−At 2
(e)
2 =3sin2t−4cost+6 6 2+Ae t2 /2 ,2+3e t2 /2
19.4 First-order linear equations: use of an integrating factor 547
19.4 FIRST-ORDERLINEAREQUATIONS:USEOFAN
INTEGRATINGFACTOR
Inthissection wedevelop a method for solvingfirst-order linear differential equations.
19.4.1 Exactequations
Consider the differential equation
dy
dx = 3x2
This can be solved very easily by simplyintegrating both sidestogive
∫
y = 3x 2 dx
=x 3 +c
wherecis the constant of integration. An equation which can be solved by integrating
both sides is said to be an exact equation. A more complicated example which, nevertheless,can
besolved inthe same way is
d
(xy) = 3x2
dx
Integrating both sides wefind
∫
xy = 3x 2 dx
=x 3 +c
and dividing through byxgives the general solution
y=x 2 + c x
The differential equation we have justsolved isan exact equation.
Example19.12 Solve the equation
d (
x 2 y ) =cosx
dx
Solution Integrating both sides wefind
∫
x 2 y = cosxdx
so that
=sinx+c
y = sinx
x 2 + c x 2
548 Chapter 19 Ordinary differential equations I
Consider again the differential equation of the previous example:
d
dx (x2 y) =cosx
Using the product rulefordifferentiation we can expand the l.h.s. as follows:
d
dx (x2 y) =x 2dy
dx +2xy
Doing this, the differential equation can bewritten inthe equivalent form
x 2dy
dx +2xy=cosx
Suppose we had posed the question in this form. A method of solving this equation
would be to recognize that the equation is exact and that the l.h.s. could be written as
d
dx (x2 y).
Itiseasy torecognize an exact equation because itwill always take the form
µ dy
dx +µ′ y=f(x)
where µ is some function ofx. That is, the coefficient ofyis the derivative of the coefficient
of dy
dx . When thisisthe case the l.h.s. can be written d dx (µy).
Example19.13 The following equations are exact. Note in each case that the coefficient of y is the
derivative ofthe coefficient of dy . Solve them.
dx
(a) x 3dy
dx +3x2 y =e 2x
(b) cosx dy − (sinx)y =1
dx
Solution (a) The equation can be written
d
dx (x3 y) =e 2x
and so, upon integrating,
∫
x 3 y = e 2x dx
so that
= e2x
2 +c
y = e2x
2x 3 + c x 3
(b) The equation can be written
d
((cosx)y) =1
dx
and so, upon integrating,
so that
(cosx)y=x+c
y =
19.4 First-order linear equations: use of an integrating factor 549
x
cosx + c
cosx
EXERCISES19.4.1
1 Eachofthe followingequations is exact.Solvethem.
(a) x 2dy
dx +2xy=x3
(b)
1 dy
x 2 dx − 2 x 3y=5x3
(c) e x (
y + dy
dx
)
=cosx
Solutions
1 (a) y= x2
4 + C x 2 (b) y= 5x6
4 +Cx2 (c) y=e −x sinx+Ce −x
19.4.2 Apreliminaryresultinvolvingseparationofvariables
Consider the following differential equation forthe dependentvariable µ:
dµ
dx
= µP (19.3)
wherePissome function ofxonly. Usingseparation ofvariables wehave
1 dµ
µ dx =P
and integrating both sides
∫ 1
µ dµ = ∫
Pdx
∫
lnµ= Pdx
µ = e ∫ Pdx
Inthisdevelopmenttheconstantofintegrationhasbeenomitted.Thereasonforthiswill
be apparent in what follows. So the solution of Equation (19.3), for any functionP(x),
isµ=e ∫ Pdx .
For example, if P(x) = 1 dµ
, then Equation (19.3) is
x dx = µ and its solution is
x
µ=e ∫ (1/x)dx = e lnx =x.
550 Chapter 19 Ordinary differential equations I
19.4.3 First-orderlinearequations
First-order linear differential equations can always bewritten inthe ‘standard’ form
dy
+P(x)y =Q(x) (19.4)
dx
wherePandQare both functions of the independent variable,x, only. In some cases,
either of these may be simplyconstants.
An example of suchan equation is
dy
+3xy = 7x2
dx
Comparing this with the standard form in Equation (19.4), note that P(x) = 3x and
Q(x) = 7x 2 . As a second example consider
dy
dx − 2y
x =4e−x
HereP(x) = − 2 x (note inparticular the minus sign) andQ(x) = 4e−x .
Finally, inthe equation
dy
dx −5y=sinx
note thatP(x) issimplythe constant −5.
Variables other thanyandxmay be used. So, forexample,
dx
dt +8tx=3t2 −5t
isafirst-orderlinearequationintheformofEquation(19.4)butwithindependentvariablet
and dependent variablex. HereP(t) = 8t andQ(t) = 3t 2 −5t.
In what follows it will be important that you can distinguish between the dependent
and independent variables, and alsothatyou areable toidentifythe functionsPandQ.
Equations such as these arise naturally when modelling many engineering applications.Forexample,theequationwhichdeterminesthecurrentflowinaseriesRLcircuit
when the applied voltage takes the form of a ramp isgiven by
di
dt + R L i = t L
This is a first-order linear equation in whichP(t) is the constant R L andQ(t) = t L . You
will learnhow tosolve such equations inthe following section.
19.4.4 Theintegratingfactormethod
All first-order linear equations, even when they are not exact, can be made exact by
multiplying them through by a function known as an integrating factor. As we have
seen, the solution then follows by performing an integration. For example, the linear
equation
dy
dx + 3 x y = e2x
x 3
is not exact, but multiplying it through by x 3 produces the differential equation in
Example 19.13(a) which isexact.
19.4 First-order linear equations: use of an integrating factor 551
Consider again a first-order linear equation instandard form:
dy
+Py=Q (19.5)
dx
where P and Q are functions of x only. We are aiming to solve this and express the
dependentvariableyintermsoftheindependentvariablex.Theaimistomultiply(19.5)
through by a function µ to make the equation exact. That is, so that the l.h.s. can be
written inthe form
d
dx (µy)
At thisstage the function µisnot known. Multiplying (19.5) through by µyields
µ dy +µPy=µQ (19.6)
dx
If the l.h.s. istoequal
d
(µy) = µdy
dx dx + µPy
d
(µy) then wemusthave
dx
Expanding the l.h.s. using the product rulegives
µ dy
dx +ydµ dx = µdy dx + µPy
which simplifies to
y dµ
dx = µPy
and consequently
dµ
dx = µP
Usingthe resultinSection 19.4.2 wesee that thisequation has solution
µ = e ∫ Pdx
The function µ is called an integrating factor and is a function of x only. With this
choice of µ, the l.h.s. of(19.6) isthe same as d (µy) and hence (19.6) can bewritten
dx
d
(µy) = µQ
dx
This exact equation can be solved by integration togive
∫
µy = µQdx
and consequently
y = 1 ∫
µQdx
µ
552 Chapter 19 Ordinary differential equations I
Insummary:
Given any first-order linear equation instandard form
dy
dx +Py=Q
wherePandQarefunctions ofx, the integrating factor µ isgiven by
µ = e ∫ Pdx
and the solution of the equation isthen obtained from
∫
µy = µQdx
It is important when working on a particular differential equation to rewrite the standard
formulae in the correct form before use. Essentially this means using the correct
dependentandindependentvariablesintheequations.Forexample,ifxisthedependent
variable andt isthe independent variable then the equations areas follows:
Theintegrating factorfor
dx
dt +Px=Q
wherePandQarefunctions oft, isgiven by
µ = e ∫ Pdt
and the solution of the equation isobtained from
∫
µx = µQdt
These results areillustratedinthe examples which follow.
Example19.14 Solve the differential equation dy
dx + y = 1 using the integrating factor method.
x
Solution Referring tothe standard first-order linear equation
dy
+P(x)y =Q(x)
dx
weseethatP(x) = 1 andQ(x) = 1.Usingthe previous formula for µ(x), we find
x
µ(x) = e ∫ (1/x)dx
= e lnx
=x
19.4 First-order linear equations: use of an integrating factor 553
Then fromthe first key pointon page 552 with µ =xandQ = 1 wehave
∫
xy = xdx= x2
2 +C
and finallyy = x 2 +C x
isthe required general solution.
Engineeringapplication19.3
RLcircuitwithrampinput
Recall from Section 2.4.7 that a standard ramp function has value 0 fort < 0 and a
valuect fort 0. This is shown in Figure 2.46. A voltage ramp signal with a value
c = 1 is applied to an RL circuit. The arrangement is shown in Figure 19.4. The
differential equation governing the currentflow,i(t), inthiscircuitisgiven by
iR+L di =t for t0 i(0)=0
dt
ShowthatthisequationcanbewrittenintheformofEquation(19.4).Henceusethe
integrating factor method tofindi(t).
i
y (t)
R
0
t
L
Figure19.4
A ramp input signal applied to aseriesRL
circuit.
Solution
This equation can bewritten as
di
dt + R L i = t for t0
L
which is a first-order linear equation.
Noteinthiscasethattheindependentvariableist andthedependentvariableisi.
Instandard form, wehave
di
+P(t)i =Q(t)
dt
whereP(t) = R L andQ(t) = t . The integrating factor, µ, isgiven by
L
µ = e ∫ P(t)dt
= e ∫ (R/L)dt
= e Rt/L ➔
554 Chapter 19 Ordinary differential equations I
Then
ie Rt/L = 1 L
= 1 L
∫
te Rt/L dt
( te
Rt/L
R/L − ∫ e
Rt/L
R/L dt )
= teRt/L
R − L R 2 eRt/L +K
whereK isthe constant of integration. The general solution is
i = t R − L R 2 +Ke−Rt/L
When t = 0, i = 0. This gives the initial condition required to find K. Applying
i(0) = 0 givesK =L/R 2 so thatthe particular solution is
i = t R + L R 2 (
e −Rt/L −1 )
In many engineering applications the terms transient response, steady-state response,
zero-input response, and zero-state response arise through the solution of differential
equations.These termsare explained inthe following example.
Engineeringapplication19.4
RC circuit:zero-inputresponseandzero-stateresponse
The differential equation which is used to model the variation in voltage across a
capacitor, v C
(t), inaseriesRC circuitwasderived inEngineeringapplication 19.1:
RC dv C
+v
dt C
=v S
(t)
where v S
(t) is the applied voltage. Suppose that this applied voltage takes the form
v S
(t) = V cos ωt. Suppose also that when the switch is closed att = 0, the initial
voltage across the capacitor isV 0
, thatis v C
(0) =V 0
.
(a) Show that this equation can be written in the form of Equation (19.4); that is, it
is a first-order linear equation.
(b) Usetheintegratingfactormethodtoobtaintheparticularsolutionofthisequation
which satisfies the given initial condition.
(c) Obtain the particular solution of the equation subject instead to the initial condition
v C
(0) = 0. It corresponds to the response of the system when there is no
initialenergyinthecircuit.Thissolutionisreferredtoasthezero-stateresponse.
(d) Obtainthesolutionoftheequationinthecasewhenthesupplyvoltageisidenticallyzero,
v S
(t) = 0,subjecttothegivencondition v C
(0) =V 0
.Thissolutionis
often referred to as the zero-input response, and represents the response of the
systemwhen the input iszero.
19.4 First-order linear equations: use of an integrating factor 555
(e) Showthatthesolutionin(b)canbewrittenasthesumofthezero-stateresponse
and the zero-input response.
(f) Identifythe transient and steady-state termsinthe solution topart(e).
Solution
(a) The given differential equation can berewritten as
dv C
+ 1
dt RC v C = 1
RC v S (t)
Comparing the form of this equation with (19.4) we see that it is a first-order
linearequation,withindependentvariablet,anddependentvariable v C
,inwhich
P(t) = 1
RC andQ(t) = 1
RC v S (t).
(b) The integrating factor isgiven by
µ = e ∫ (1/RC)dt = e t/(RC)
Itfollows from the second key point on page 552 that
e t/(RC) v C
= V ∫
e t/(RC) cosωtdt
RC
Thisintegralcanbeevaluatedusingintegrationbypartstwicefollowingthetechnique
inExample 14.4. You should verify that
∫
e t/(RC) cosωtdt = R2 C 2 e t/(RC) [
cos ωt
]
ωsinωt +
R 2 C 2 ω 2 +1 RC
Then
e t/(RC) v C
= VRCet/(RC)
R 2 C 2 ω 2 +1
+constant of integration
[
ωsinωt +
cos ωt
]
+K
RC
from which
VRC
[
v C
= ωsinωt + cosωt ]
+Ke −t/(RC)
R 2 C 2 ω 2 +1 RC
Thisisthegeneralsolutionofthedifferentialequation.Applyingtheinitialcondition
gives us a value forK. Whent = 0, v C
=V 0
, so
[ ]
VRC 1
V 0
=
+K
R 2 C 2 ω 2 +1 RC
from which
Finally,
K=V 0
−
v C
=
V
R 2 C 2 ω 2 +1
VRC
[
ωsinωt+
R 2 C 2 ω 2 +1
cos ωt
] (
+ V
RC 0
−
)
V
e −t/(RC)
R 2 C 2 ω 2 +1
This is the particular solution which satisfies the given initial condition. It tells
us the voltage across the capacitor as a function of timet.
➔
556 Chapter 19 Ordinary differential equations I
(c) To find the particular solution subject to the condition v C
(0) = 0 we need only
replaceV 0
by0intheparticularsolutionobtainedinpart(b).Hencethezero-state
response is
v C
(t) =
VRC
[
ωsin ωt +
R 2 C 2 ω 2 +1
cos ωt
]
−
RC
V
R 2 C 2 ω 2 +1 e−t/(RC)
(d) In the case of zero input we have v S
(t) = 0. In part (b) we solved the equation
with v S
(t) =V cos ωt.Hence,puttingV = 0inthesolutiontopart(b)willyield
the solution when v S
(t) = 0.So,
v C
(t) =V 0
e −t/(RC)
Alternatively we can note thatwhen v S
(t) = 0 the originalequation becomes
dv C
+ 1
dt RC v C =0
subject to v C
(0) =V 0
. This can be solved usingseparation of variables. So
1 dv C
= − 1
v C
dt RC
Integration yields
∫ 1
v C
dv C
=−∫
1
RC dt
lnv C
=− t
RC +k
v C
= e −t/(RC)+k
= e k e −t/(RC)
=Ae −t/(RC)
(k a constant)
where A is the constant e k . Applying the initial condition v C
(0) = V 0
we find
V 0
=Aand so finally
as before.
v C
(t) =V 0
e −t/(RC)
(e) Inspectionofpart(b)showsthatitisthesumofthezero-inputresponseandthe
zero-state response:
zero-input zero-state response
response
{ }} { { }} {
v C
(t)=V 0
e −t/(RC) +
VRC [
cos ωt
]
V
ωsinωt+ −
R 2 C 2 ω 2 +1 RC R 2 C 2 ω 2 +1 e−t/(RC)
(f) In this example, the terms involving e −t/(RC) tend to zero ast increases, and are
termed transients. Once the system has settled down their contribution will not
be important. The remaining terms represent the longer term behaviour of the
systemor the so-calledsteady state.
Hence the transient termsare
V 0
e −t/(RC) V
−
R 2 C 2 ω 2 +1 e−t/(RC)
19.4 First-order linear equations: use of an integrating factor 557
and the steady-state terms are
[
ωsinωt +
VRC
R 2 C 2 ω 2 +1
cos ωt
]
RC
EXERCISES19.4.4
1 Find the general solution ofthe followingequations:
dy
(a)
dx +y=1 (b) dy
dx +2y=6
(c)
(e)
dx
dt +6x=4
dy
dx =6y+9
(d) dy
dx −3y=2
(f) dx
dt =3x−8
2 Find the particular solution ofthe following
equations:
(a) dy
dx +4y=7, y(0)=1
(b) dx
dt −x=4,
(c) dy
dt =3y+2,
(d) dy
dx =4y−8,
x(0)=2
y(0)=2
y(1)=2
3 Find the general solution of dx
dt =2x+4t.Whatis
the particular solution which satisfiesx(1) = 2?
4 Find the general solution of dy
dx +y=2x+5.
5 Solve dx
dt =t−tx,x(0)=0.
6 Useanintegratingfactortoobtainthegeneralsolution
ofiR+L di = sin ωt,whereR,Land ω areconstants.
dt
7 Solvex dy
dx +y=x4 .
8 Use an integrating factorto findthe general solution
oft dx
dt +x=3t.
9 Find the general solution of dx +2xt =t.Findthe
dt
particular solution satisfyingthe condition
x(0) = −1.
10 Find the general solution of
tẋ+3x= et
t 2
Solutions
1 (a) y=1+ce −x
(b) y=3+ce −2x
(c) x= 2 3 +ce−6t
(d) y=ce 3x − 2 3
(e) y=ce 6x − 3 2
(f) x= 8 3 +ce3t
2 (a)y= 7 4 − 3 4 e−4x
(b)x=6e t −4
(c)y= 8 3 e3t − 2 3
(d)y=2
3 −2t−1+ce 2t ,−2t−1+5e 2(t−1)
4 2x+3+ce −x
5 1−e −t2 /2
6
7
8
9
L((R/L)sinωt − ωcosωt)
R 2 +L 2 ω 2
x 4
5 + c x
3t
2 + c t
1
2 − 3 2 e−t2
10 et +c
t 3
+ce −Rt/L
558 Chapter 19 Ordinary differential equations I
19.5 SECOND-ORDERLINEAREQUATIONS
The general form ofasecond-order linear ordinary differential equation is
p(x) d2 y
dx +q(x)dy +r(x)y = f(x) (19.7)
2 dx
where p(x),q(x),r(x) and f (x)arefunctions ofxonly.
An important relative of thisequation is
p(x) d2 y
dx +q(x)dy +r(x)y = 0 (19.8)
2 dx
whichisobtainedfromEquation(19.7)byignoringthetermwhichisindependentofy.
Equation (19.8) is said to be a homogeneous equation -- all its terms contain y or its
derivatives. Equation (19.7) issaidtobeinhomogeneous.
For example,
x 2d2 y
dx 2 +xdy dx + (x2 −1)y =e −x
isaninhomogeneoussecond-orderlinearequationinwhichp(x) =x 2 ,q(x) =x,r(x) =
x 2 −1 and f (x) = e −x . The associated homogeneous equation is
x d2 y
dx +xdy 2 dx +(x2 −1)y=0
The following properties of linear equations are necessary for finding solutions of
second-order linear equations.
19.5.1 Property1
Ify 1
(x) andy 2
(x) are any two linearly independent solutions of a second-order homogeneous
equation then the general solution,y H
(x), is
y H
(x) =Ay 1
(x) +By 2
(x)
whereA,Bare constants.
We see that the second-order linear ordinary differential equation has two arbitrary
constantsinitsgeneralsolution.Thefunctionsy 1
(x)andy 2
(x)arelinearlyindependent
ifone isnotsimplyamultiple ofthe other.
19.5.2 Property2
Lety P
(x) be any solution of an inhomogeneous equation. Lety H
(x) be the general solution
of the associated homogeneous equation. The general solution of the inhomogeneous
equation isthengiven by
y(x) =y H
(x) +y P
(x)
Inotherwords,tofindthegeneralsolutionofaninhomogeneousequationwemustfirst
findthegeneralsolutionofthecorrespondinghomogeneousproblem,andthenaddtoit
any solutionofthe inhomogeneousequation.
Thefunctiony H
(x)isknownasthecomplementaryfunctionandy P
(x)iscalledthe
particularintegral.Clearlythecomplementaryfunctionofahomogeneousproblemis
19.5 Second-order linear equations 559
thesameasitsgeneralsolution;weshalloftenwritey(x)forboth.Ifconditionsaregiven
theyareappliedtothegeneralsolutionoftheinhomogeneousequationtodetermineany
unknown constants. This yields the particular solution satisfyingthe given conditions.
Example19.15 Verify that y 1
(x) = x and y 2
(x) = 1 both satisfy d2 y
= 0. Write down the general
dx2 solution of thisequation and verifythatthis indeed satisfies the equation.
Solution Ify 1
(x) = x then dy 1
y 1
dx =1andd2 = 0, so thaty
dx 2 1
satisfies d2 y
dx = 0.Ify 2 2
(x) = 1,
then dy 2
y 2
dx =0andd2 = 0, so thaty
dx 2 2
satisfies d2 y
= 0. From Property 1, the general
dx2 solution of d2 y
dx =0is 2
y H
(x) =Ax +B(1)
=Ax+B
To verifythat thissatisfies the equation proceed as follows:
dy H
dx =A
d 2 y H
dx 2 = 0
and soy H
(x) satisfies d2 y
dx 2 = 0.
Example19.16 Given
d 2 y
+y=x (19.9)
dx2 (a) Show thaty H
= Acosx +Bsinx is a solution of the corresponding homogeneous
equation.
(b) Verifythaty P
=xisaparticular integral.
(c) Verifythaty H
+y P
does indeed satisfythe inhomogeneous equation.
Solution (a) Ify H
=Acosx +Bsinx, then
y ′ H =−Asinx+Bcosx
y′′ H =−Acosx−Bsinx
We see immediately thaty H
+y ′′ H = 0 so thaty H
is a solution of the homogeneous
equation.
(b) Ify P
= x theny ′ P = 1 andy′′ P
= 0. Substitution into the inhomogeneous equation
shows thaty P
satisfies Equation (19.9),thatisy P
=xisaparticular integral.
(c) Writing
we have
y=Acosx+Bsinx+x
y ′ =−Asinx+Bcosx+1
y ′′ =−Acosx−Bsinx
560 Chapter 19 Ordinary differential equations I
Substitution into the l.h.s. of Equation (19.9) gives
(−Acosx−Bsinx)+(Acosx+Bsinx+x)
which equals x, and so the complementary function plus the particular integral is
indeed a solution ofthe inhomogeneous equation, as required by Property 2.
19.6 CONSTANTCOEFFICIENTEQUATIONS
Wenowproceedtostudyindetailthosesecond-orderlinearequationswhichhaveconstant
coefficients. The general formof such an equation is
a d2 y
dx +bdy +cy = f(x) (19.10)
2 dx
wherea,b,careconstants. The homogeneous formof Equation (19.10) is
a d2 y
dx +bdy +cy=0 (19.11)
2 dx
Equationsofthisformariseintheanalysisofcircuits.Considerthefollowingexample.
Engineeringapplication19.5
TheLCRcircuit
WritedownthedifferentialequationgoverningthecurrentflowingintheseriesLCR
circuitshown inFigure 19.5.
Solution
UsingKirchhoff’s voltage lawand the individual laws foreachcomponent wefind
L di ∫
dt +iR+1 idt = v(t)
C
If thisequation isnow differentiated w.r.t.t wefind
L d2 i
dt +Rdi 2 dt + 1 C i = dv(t)
dt
Thisisaninhomogeneoussecond-orderdifferentialequation,withtheinhomogeneity
arising from the term dv . When the circuit componentsL,RandC are constants
dt
wehavewhatistermedalineartime-invariantsystem,andthedifferentialequation
then has constant coefficients.
L C R
i
y (t)
Figure19.5
AnLCRcircuit.
19.6 Constant coefficient equations 561
A linear time-invariant system has components whose properties do not vary with
time and as such can be modelled by a linear constant coefficient differential
equation.
19.6.1 Findingthecomplementaryfunction
AsstatedinProperty2(Section19.5.2),findingthegeneralsolutionofay ′′ +by ′ +cy = f
is a two-stage process. The first task is to determine the complementary function. This
is the general solution of the corresponding homogeneous equation, that isay ′′ +by ′ +
cy = 0.We now focus attention on the solution of such equations.
Example19.17 Verify that y 1
= e 4x and y 2
= e 2x both satisfy the constant coefficient homogeneous
equation
d 2 y
dx −6dy +8y=0 (19.12)
2 dx
Writedown the general solution of thisequation.
Solution Ify 1
= e 4x , differentiation yields
dy 1
dx =4e4x
and similarly,
d 2 y 1
dx 2 = 16e4x
Substitution into Equation (19.12) gives
16e 4x −6(4e 4x )+8e 4x =0
so that y 1
= e 4x is indeed a solution. Similarly if y 2
= e 2x , then dy 2
dx = 2e2x and
d 2 y 2
dx 2 = 4e2x . Substitution into Equation (19.12) gives
4e 2x −6(2e 2x )+8e 2x =0
so that y 2
= e 2x is also a solution of Equation (19.12). Now e 2x and e 4x are linearly
independent functions. So,from Property 1 we have
y H
(x)=Ae 4x +Be 2x
as the general solution of Equation (19.12).
Example19.18 Findvalues ofkso thaty = e kx isasolution of
d 2 y
dx − dy
2 dx −6y=0
Hence statethe general solution.
562 Chapter 19 Ordinary differential equations I
Solution As suggested we try a solution of the formy = e kx . Differentiating we find dy
dx =kekx
and d2 y
dx 2 =k2 e kx . Substitution into the given equation yields
that is,
k 2 e kx −ke kx −6e kx =0
(k 2 −k−6)e kx =0
The only way this equation can be satisfied forall values ofxisif
that is,
k 2 −k−6=0 (19.13)
(k−3)(k+2)=0
so thatk = 3 ork = −2. That is to say, ify = e kx is to be a solution of the differential
equationkmustbe either 3 or −2. We therefore have found two solutions
y 1
(x) = e 3x and y 2
(x) = e −2x
These two functions are linearly independent and we can therefore apply Property 1 to
give the general solution:
y H
(x) =Ae 3x +Be −2x
Equation (19.13) fordeterminingkiscalled theauxiliary equation.
Example19.19 Findthe auxiliary equation ofthe differential equation
a d2 y
dx 2 +bdy dx +cy=0
Solution We try a solution of the formy = e kx so that dy
dx =kekx and d2 y
dx = 2 k2 e kx . Substitution
into the given differential equation yields
that is,
ak 2 e kx +bke kx +ce kx =0
(ak 2 +bk+c)e kx =0
Since thisequation istobe satisfied for all values ofx, then
ak 2 +bk+c=0
isthe required auxiliary equation.
19.6 Constant coefficient equations 563
The auxiliary equation ofa d2 y
dx 2 +bdy dx +cy=0is
ak 2 +bk+c=0
Solution of this quadratic equation gives the values of k which we seek. Clearly the
nature of the roots will depend upon the values ofa,bandc. Ifb 2 > 4ac the roots will
be real and distinct. The two values ofkthus obtained,k 1
andk 2
, will allow us to write
down two independent solutions:
y 1
(x) =e k 1 x
y 2
(x) =e k 2 x
and so the general solution of the differential equation will be
y(x) =Ae k 1 x +Be k 2 x
Iftheauxiliaryequationhasreal,distinctrootsk 1
andk 2
,thecomplementaryfunction
will be
y(x) =Ae k 1 x +Be k 2 x
On the other hand, if b 2 = 4ac the two roots of the auxiliary equation will be equal
andthismethodwillthereforeonlyyieldoneindependentsolution.Inthiscase,special
treatmentisrequired.Ifb 2 <4acthetworootsoftheauxiliaryequationwillbecomplex,
that isk 1
andk 2
will be complex numbers. The procedure for dealing with such cases
will become apparentinthe following examples.
Example19.20 Findthe generalsolutionof
d 2 y
dx 2 +3dy dx −10y=0
Find the particular solution which satisfies the conditionsy(0) = 1 andy ′ (0) = 1.
Solution By lettingy = e kx , so that dy
dx =kekx and d2 y
dx = 2 k2 e kx , the auxiliary equation is found
tobe
k 2 +3k−10=0
Therefore,
(k−2)(k+5)=0
sothatk = 2andk = −5.Thusthereexisttwosolutions,y 1
= e 2x andy 2
= e −5x .From
Property 1 we can writethe general solution as
y=Ae 2x +Be −5x
564 Chapter 19 Ordinary differential equations I
To find the particular solution we mustnow imposethe given conditions:
y(0)=1 gives 1=A+B
y ′ (0)=1 gives 1=2A−5B
from which A = 6 7 andB = 1 . Finally, the required particular solution is
7
y = 6 7 e2x + 1 7 e−5x .
Example19.21 Findthe generalsolution of
d 2 y
dx 2 +4y=0
Solution As before, lety = e kx so that dy
dx = kekx and d2 y
dx = 2 k2 e kx . The auxiliary equation is
easily found tobe
that is
so that
k 2 +4=0
k 2 =−4
k=±2j
that is,we have complex roots.The two independent solutions ofthe equation arethus
y 1
(x) = e 2jx and y 2
(x) = e −2jx
so thatthe general solution can be written inthe form
y(x) =Ae 2jx +Be −2jx
However, in cases such as this, it is usual to rewrite the solution in the following way.
Recall from Chapter 9 thatEuler’s relations give
and
so that
e 2jx = cos2x +jsin2x
e −2jx = cos2x −jsin2x
y(x) =A(cos2x +jsin2x) +B(cos2x −jsin2x)
If we now relabel the constants such that
A+B=C and Aj−Bj=D
we can write the general solution inthe form
y(x) =Ccos2x +Dsin2x
19.6 Constant coefficient equations 565
Engineeringapplication19.6
Oscillatingmass--springsystem
Consider the simple mechanical problem of a mass resting on a smooth frictionless
table. A spring is attached to the mass and an adjacent anchor point as shown in
Figure 19.6. The spring is capable of being compressed as well as stretched. When
the spring isneither compressed nor stretched the mass islocated atx = 0.
m
0
x
Figure19.6
Mass--spring system.
If the mass is pulled in thexdirection and let go, it will oscillate about thex = 0
position. We wish to find the position of the mass,x, as a function of time,t. Differential
equations are needed todescribe thisproblem fully.
Newton’s second law states that if a forceF is applied to a body of massmthen
themotionofthebodyisgovernedbyF =ma,whereaistheacceleration.Applying
Newton’s second law, and noting thata = d2 x
dt2,we obtain
F =m d2 x
dt 2
The force, F, is provided by the spring. The force exerted by a spring is given by
Hooke’slaw,whichstatesthattheforceisproportionaltotheextensionorcompression
of the spring,
F=−kx
wherekisthespringconstantforthespringinuse.Theminussignisrequiredsothat
when the spring is stretched (x > 0) the force is in the negativexdirection. When
the spring iscompressed (x < 0) the force isinthe positivexdirection.
If the table is sufficiently smooth and if there are no other external forces acting,
thenm d2 x
dt 2 = −kx.
Therefore thedifferentialequationthatgoverns motioninthe system is
m d2 x
dt 2 +kx=0
We write thisas
d 2 x
dt 2 + k m x = 0
Let k m = ω2 ,giving
d 2 x
dt 2 +ω2 x=0
➔
566 Chapter 19 Ordinary differential equations I
Usingthe technique illustratedinExample 19.21 we obtain
x(t) =Acosωt +Bsinωt
whereA,Bareconstants. Note this solution may alsobe expressed as a singlewave
x =Ccos(ωt +φ)
whereC and φ are constants, using the technique described in Section 3.7.1, Combining
waves.
We note that this solution is sinusoidal and oscillates with time. It gives the position
of the mass at a given point in time. The constantsC and φ depend on the
position and velocity of the mass when it is released. These are known as theinitial
conditionsofthedifferentialequation(seeExample19.6onpage539).Itisintuitive
thatanequationthatdescribesthepositionofthemassatagiventimemusttakethese
into account.
Mathematically describing or modelling systems like this one using differential
equationsisanextremelyimportantdiscipline.Mathematicalmodelsofmechanical,
electricalandothersubsystemscanbelinkedtogetherandasaresultwholesystems
can be accurately characterized.
Example19.22 Givenay ′′ +by ′ +cy = 0,writedowntheauxiliaryequation.Iftherootsoftheauxiliary
equation arecomplex and aredenoted by
k 1
=α+βj
k 2
=α−βj
show thatthe general solution is
y(x) =e αx (Acosβx+Bsinβx)
Solution Substitution ofy = e kx into the differential equation yields
(ak 2 +bk+c)e kx =0
and so
ak 2 +bk+c=0
This isthe auxiliary equation. Ifk 1
= α + βj,k 2
= α − βj thenthe general solution is
y =Ce (α+βj)x +De (α−βj)x
whereC andDarearbitraryconstants. Usingthe laws ofindices thisisrewritten as
y=Ce αx e βjx +De αx e −βjx =e αx (Ce βjx +De −βjx )
Then, usingEuler’s relations, weobtain
y =e αx (Ccosβx+Cjsinβx+Dcosβx−Djsinβx)
=e αx {(C +D)cosβx+ (Cj−Dj)sinβx}
19.6 Constant coefficient equations 567
WritingA =C +DandB =Cj −Dj,wefind
y =e αx (Acosβx+Bsinβx)
This isthe required solution.
Iftheauxiliaryequationhascomplexroots α +βjand α −βj,thenthecomplementaryfunction
is
y =e αx (Acosβx+Bsinβx)
Note that Example 19.21 isaspecial case of Example 19.22 with α = 0 and β = 2.
Example19.23 Find the general solution ofy ′′ +2y ′ +4y = 0.
Solution The auxiliary equation isk 2 +2k +4 = 0.This equation has complex roots given by
k = −2 ± √ 4−16
2
= −2 ± √ 12j
2
=−1± √ 3j
Using the result of Example 19.22 with α = −1 and β = √ 3 we find the general
solution is
y = e −x (Acos √ 3x+Bsin √ 3x)
Example19.24 Theauxiliaryequationofay ′′ +by ′ +cy = 0isak 2 +bk +c = 0.Supposethisequation
has equal rootsk =k 1
. Verifythaty =xe k 1 x isasolution ofthe differential equation.
Solution We have
y =xe k 1 x y ′ =e k 1 x (1+k 1
x) y ′′ =e k 1 x (k 2 1 x +2k 1 )
Substitutioninto the l.h.s. ofthe differential equation yields
e k x{ 1 a ( k 2 1 x +2k )
1 +b(1+k1 x)+cx } =e k x{( 1 ak 2 1 +bk 1 +c) x +2ak 1
+b }
Butak1 2 +bk 1 +c = 0 sincek 1
satisfies the auxiliary equation. Also,
k 1
= −b ± √ b 2 −4ac
2a
butsincetherootsareequal,thenb 2 −4ac = 0andhencek 1
= − b
2a .So2ak 1 +b=0.
We conclude thaty = xe k 1 x is a solution ofay ′′ +by ′ +cy = 0 when the roots of the
auxiliary equation areequal.
568 Chapter 19 Ordinary differential equations I
If the auxiliary equation has two equal roots,k 1
, the complementary function is
y =Ae k 1 x +Bxe k 1 x
Example19.25 Obtain the general solution of the equation
d 2 y
dx 2 +8dy dx +16y=0
Solution As before, a trialsolution of the formy = e kx yields an auxiliary equation:
k 2 +8k+16=0
This equation factorizes so that
(k+4)(k+4)=0
and we obtain equal roots, that is k = −4 (twice). If we proceed as before, writing
y 1
(x) = e −4x , y 2
(x) = e −4x , it is clear that the two solutions are not independent. To
apply Property 1 we need to find a second independent solution. Using the result of
Example 19.24 we conclude that, because the roots of the auxiliary equation are equal,
the second independent solution isy 2
=xe −4x . The general solution isthen
y(x) =Ae −4x +Bxe −4x
EXERCISES19.6.1
1 Obtainthe general solutions,that isthe
complementary functions,ofthe following
homogeneous equations:
(a)
(b)
(c)
(d)
(e)
(f)
(g)
d 2 y
dx 2 −3dy dx +2y=0
d 2 y
dx 2 +7dy dx +6y=0
d 2 x
dt 2 +5dx dt +6x=0
d 2 y
dt 2 +2dy dt +y=0
d 2 y
dx 2 −4dy dx +4y=0
d 2 y
dt 2 + dy
dt +8y=0
d 2 y
dx 2 −2dy dx +y=0
(h)
(i)
(j)
(k)
(l)
d 2 y
dt 2 + dy
dt +5y=0
d 2 y
dx 2 + dy
dx −2y=0
d 2 y
dx 2 +9y=0
d 2 y
dx 2 −2dy dx = 0
d 2 x
dt 2 −16x=0
2 Findthe auxiliaryequation forthe differential
equation
L d2 i
dt 2 +Rdi dt + 1 C i = 0
Hence write down the complementary function.
3 Findthe complementaryfunctionofthe equation
d 2 y
dx 2 + dy
dx +y=0
19.6 Constant coefficient equations 569
Solutions
1 (a) y=Ae x +Be 2x
(b)y=Ae −x +Be −6x
(c) x =Ae −2t +Be −3t
(d)y=Ae −t +Bte −t
(e) y=Ae 2x +Bxe 2x
(f) y = e −0.5t (Acos2.78t +Bsin2.78t)
(g)y=Ae x +Bxe x
(h) y = e −0.5t (Acos2.18t +Bsin2.18t)
(i) y=Ae −2x +Be x
(j) y=Acos3x+Bsin3x
(k)y=A+Be 2x
(l) x=Ae 4t +Be −4t
2 Lk 2 +Rk+ 1 C =0 i(t) =Aek 1 t +Be k 2 t
where √
R 2 C −4L
−R ±
C
k 1 ,k 2 =
2L
√ √ )
3x 3x
3 e
(Acos −x/2 +Bsin
2 2
19.6.2 Findingaparticularintegral
We stated in Property 2 (Section 19.5.2) that the general solution of an inhomogeneous
equation is the sum of the complementary function and a particular integral. We have
seenhowtofindthecomplementaryfunctioninthecaseofaconstantcoefficientequation.
We shall now deal with the problem of finding a particular integral. Recall that
theparticularintegralisanysolutionoftheinhomogeneous equation.Thereareanumber
of advanced techniques available for finding such solutions but these are beyond
the scope of this book. We shall adopt a simpler strategy. Since any solution will do
we shall try to find such a solution by a combination of educated guesswork and trial
and error.
Example19.26 Findthe generalsolutionofthe equation
d 2 y
dx − dy
2 dx −6y=e2x (19.14)
Solution ThecomplementaryfunctionforthisequationhasalreadybeenshowninExample19.18
tobe
y H
=Ae 3x +Be −2x
We shall attempt to find a solution of the inhomogeneous problem by trying a function
of the same form as that on the r.h.s. In particular, let us tryy P
(x) = αe 2x , where α is a
constant that we shall now determine. Ify P
(x) = αe 2x then dy P
dx = 2αe2x and d2 y P
=
dx 2
4αe 2x . Substitution inEquation (19.14) gives
thatis,
4αe 2x −2αe 2x −6αe 2x = e 2x
−4αe 2x = e 2x
570 Chapter 19 Ordinary differential equations I
sothaty P
willbeasolutionif αischosensothat −4α = 1,thatis α = − 1 4 .Thereforethe
particularintegralisy P
(x) = − e2x
4 .FromProperty2thegeneralsolutionoftheinhomogeneous
equation is found by summing this particular integral and the complementary
function
y(x) =Ae 3x +Be −2x − 1 4 e2x
Example19.27 Obtain a particularintegral ofthe equation
d 2 y
dx 2 −6dy dx +8y=x
Solution In the last example, we found that a fruitful approach was to assume a solution in the
same form as that on the r.h.s. Suppose we assume a solutiony P
(x) = αx and proceed
to determine α. This approach will actually fail, but let us see why. Ify P
(x) = αx then
dy P
y P
dx =αandd2 = 0.Substitution into the differential equation yields
dx2 0−6α+8αx=x
and α ought now to be chosen so that this expression is true for allx. If we equate the
coefficients ofxwe find 8α = 1 so that α = 1 , but with this value of α the constant
8
terms are inconsistent. Clearly a particular integral of the form αx is not possible. The
problemarisesbecausedifferentiationoftheterm αxproducesconstanttermswhichare
unbalanced on the r.h.s.So, wetry a solution of the form
y P
(x)=αx+β
with α, β constants. Proceeding as before, dy P
dx = α, d2 y P
= 0. Substitution in the
dx
differential equation now gives
2
0−6α+8(αx+β)=x
Equating coefficients ofxwefind
8α=1 (19.15)
and equating constant termswe find
−6α+8β=0 (19.16)
From Equation (19.15), α = 1 and then from Equation (19.16)
8
( 1
−6 +8β=0
8)
so that
8β = 3 4
thatis,
β = 3
32
The required particular integral isy P
(x) = x 8 + 3
32 .
19.6 Constant coefficient equations 571
Experienceleadstothe trialsolutionssuggestedinTable 19.1.
Table19.1
Trial solutionsto find the particular
integral.
f (x)
constant
polynomial inx
ofdegreer
coskx
sinkx
ae kx
Trialsolution
constant
polynomial inx
ofdegreer
acoskx +bsinkx
acoskx +bsinkx
αe kx
Example19.28 Findaparticular integralforthe equation
d 2 y
dx 2 −6dy dx +8y=3cosx
Solution We shall try a solution ofthe form
y P
(x)=αcosx+βsinx
Differentiating,we find
dy P
dx =−αsinx+βcosx
d 2 y P
dx 2 =−αcosx−βsinx
Substitution into the differential equation gives
(−αcosx−βsinx)−6(−αsinx+βcosx)+8(αcosx+βsinx)
=3cosx
Equating coefficients of cosxwefind
−α−6β+8α=3 (19.17)
whilethose of sinx give
−β+6α+8β=0 (19.18)
572 Chapter 19 Ordinary differential equations I
Solving Equations (19.17) and (19.18) simultaneously we find α = 21
85 andβ=−18 85 ,
so thatthe particular integral is
y P
(x) = 21
85 cosx−18 85 sinx
Engineeringapplication19.7
AnLC circuitwithsinusoidalinput
The differential equation governing the flow of current in a seriesLC circuit when
subject toanapplied voltage v(t) =V 0
sinωt is
L d2 i
dt + 1 2 C i = ωV 0cos ωt
Derive this equation and then obtain its general solution.
Solution
Kirchhoff’s voltage law and the component laws give
L di
dt + 1 ∫
idt =V
C 0
sinωt
To avoid processes of differentiation and integration in the same equation let us differentiate
thisequation w.r.t.t. This yields
L d2 i
dt + 1 2 C i = ωV 0cos ωt
as required.
The homogeneous equation isL d2 i
dt + i 2 C = 0. Lettingi = ekt we find the auxiliary
equation isLk 2 + 1 C =0sothatk=± √ j . Therefore, using the result of
LC
Example 19.22, with α = 0 and β = 1 √
LC
, the complementary function is
t t
i=Acos√ +Bsin√ LC LC
Tofindaparticularintegral,tryi =Ecos ωt+F sin ωt,whereE andF areconstants.
We find
di
= −ωEsinωt + ωFcosωt
dt
d 2 i
dt 2 = −ω2 Ecosωt −ω 2 Fsinωt
Substitution into the inhomogeneous equation yields
L(−ω 2 Ecosωt − ω 2 Fsinωt) + 1 C (Ecosωt +Fsinωt) =V 0 ωcosωt
Equating coefficients of sinωt gives
−ω 2 LF + F C = 0
19.6 Constant coefficient equations 573
Equating coefficients of cosωt gives
−ω 2 LE + E C =V 0 ω
CV
sothatF = 0andE = 0
ω
. It follows that the particular integral is
1−ω 2 LC
i =
CV 0 ω cos ωt. Finally, the general solution is
1−ω 2 LC
t t
i=Acos√ +Bsin√ + CV 0ωcos ωt
LC LC 1−ω 2 LC
The terms zero-input response and zero-state response were introduced in Section 19.4
inconnectionwithfirst-orderequations.Thisterminologyisidenticalwhenwedealwith
second-order equationsasthe following example illustrates.
Engineeringapplication19.8
ParallelRLC circuit
Figure19.7showsaparallelRLCcircuitwhichhasacurrentsourcei S
(t).Theinductor
current,i, can befound bysolvingthe second-order differential equation
LC d2 i
dt + L di
2 Rdt +i=i S (t) fort0
It would be a useful exercise for you to derive this equation. Note that this equation
issecond order, and inhomogeneous due tothe source termi S
(t).
i S (t)
R L C
i
Figure19.7
AparallelRLC circuit.
SupposeL = 10 H,R = 10 ,C = 0.1 F andi S
(t) = e −2t and that the initial
conditions arei = 1 and di
dt =2whent=0.
(a) Obtain the solution of this equation subject to the given initial conditions and
hence statethe inductor currenti.
(b) Obtain the zero-input response. This isthe solution wheni S
(t) = 0.
(c) Obtain the zero-state response. This is the solution of the inhomogeneous equation
subject to the conditionsi = 0 and di = 0 att = 0. It corresponds to there
dt
being no initial energy inthe circuit.
(d) Show that the solution in (a) is the sum of the zero-input response and the zerostateresponse.
➔
574 Chapter 19 Ordinary differential equations I
Solution
(a) With the given parameter values the differential equation becomes
d 2 i
dt + di
2 dt +i=e−2t
Itisfirstnecessarytofind thecomplementary function. Lettingi = e kt , theauxiliaryequation
isk 2 +k+1 = 0 which has complex solutions
k = − 1 √
3
2 ± 2 j
The complementary function istherefore
√ √ ) 3 3
i =e
(Acos
−t/2 2 t+Bsin 2 t
For a particular integral we try a solution of the form i = αe −2t . Substitution
into the differential equation gives
so that
4αe −2t −2αe −2t + αe −2t = e −2t
3α=1, thatis α= 1 3
Hence a particular integral isi = 1 3 e−2t . The general solution is the sum of the
complementary function and the particular integral:
√ √ ) 3 3
i =e
(Acos
−t/2 2 t+Bsin 2 t + 1 3 e−2t
We now apply the initial conditions to find the constantsAandB. Giveni = 1
whent = 0 means
1=A+ 1 so that A = 2 3 3
To apply the second condition weneed tofind di
dt :
( √ √ √ √ )
di 3 3 3 3
dt = e−t/2 −
2 Asin 2 t + 2 Bcos 2 t
− 1 2 e−t/2 (Acos
Given di = 2 whent = 0 means
dt
√
3
2 =
2 B − 1 2 (A) − 2 3
√
3
2 =
2 B − 1 ( ) 2
− 2 2 3 3
√
3
2 t+Bsin √
3
2 t )
− 2 3 e−2t
2 =
3 =
√
3
2 B −1
√
3
2 B
19.6 Constant coefficient equations 575
B = 6 √
3
= 2 √ 3
Finally, the required particular solution which gives the current through the
inductor atany time is
( √ √ )
2 3 3
i =e −t/2 3 cos 2 t+2√ 3sin
2 t + 1 3 e−2t
(b) The zero-input response is obtained by ignoring the source term i S
(t). From
(a)we see thatthis isjustthe complementary function:
√ √ ) 3 3
i =e
(Acos
−t/2 2 t+Bsin 2 t
Applying the given initial conditions to this solution givesA = 1 andB = √ 5 3
and so the zero-input response is
√
3
i =e
(cos
−t/2 2 t + √ 5 √ ) 3
sin
3 2 t
(c) Thezero-stateresponseoftheinhomogeneousequationisfoundbyapplyingthe
conditionsi = 0 and di = 0 att = 0 tothe general solution already obtained in
dt
(a). It is straightforward to show thatA = − 1 3 andB= √ 1 . So the zero-state
3
response is
(
i =e −t/2 − 1 √
3
3 cos 2 t + √ 1
√ ) 3
sin
3 2 t + 1 3 e−2t
(d) Inspectionofthepreviousworkingshowsthattheparticularsolutionobtainedin
(a)isthe sum of the zero-state response and zero-input response.
19.6.3 Inhomogeneoustermappearsinthecomplementaryfunction
In some examples, terms which form part of the complementary function also appear
in the inhomogeneous term. This gives rise to an additional complication. Consider
Example 19.29.
Example19.29 Considertheequationy ′′ −y ′ −6y = e 3x .Itisstraightforwardtoshowthatthecomplementaryfunctionis
y=Ae 3x +Be −2x
Findaparticular integraland deducethe generalsolution.
Solution Suppose we try to find a particular integral by using a trial solution of the form
y P
= αe 3x . Substitutioninto the l.h.s. ofthe inhomogeneous equation yields
9αe 3x −3αe 3x −6αe 3x whichsimplifies to0
so that αe 3x is clearlynotasolution of the inhomogeneousequation. Thereason is that
e 3x is part of the complementary function and so causes the l.h.s. to vanish. To obtain a
576 Chapter 19 Ordinary differential equations I
particularintegralinsuchacase,wecarryouttheprocedurerequiredwhentheauxiliary
equation has equal roots.Thatis, we tryy P
= αxe 3x . We find
y ′ =αe 3x (3x+1)
y ′′ =αe 3x (9x+6)
Substitution into the inhomogeneous equation yields
αe 3x (9x +6) − αe 3x (3x +1) −6αxe 3x =e 3x
Mosttermscancel, leaving
5αe 3x = e 3x
sothat α = 1 5 .Finally,therequiredparticularintegralisy P =xe3x 5 .Thegeneralsolution
istheny=Ae 3x +Be −2x + xe3x
5 .
Engineeringapplication19.9
Transmissionlines
A transmission line is an arrangement of electrical conductors for transporting
electromagnetic waves. Although this definition could be applied to most electrical
cables,itisusuallyrestrictedtocablesusedtotransporthigh-frequencyelectromagneticwaves.Thereareseveraldifferenttypesoftransmissionlines.Themostfamiliar
one is the coaxial cable which is used to carry the signal from a television aerial to
atelevisionset(seeFigure19.8).Whenahigh-frequencywaveisbeingcarriedbya
cableseveraleffectsbecomeimportantwhichcanusuallybeneglectedwhendealing
with a low-frequency wave. These are:
(1) the capacitance,C, between the two conductors,
(2) the series inductance,L, ofthe two conductors,
(3) theleakagecurrentthroughtheinsulationlayerthatseparatesthetwoconductors.
The electrical parameters of a coaxial cable are evenly distributed along its length.
Thisistruefortransmissionlinesingeneralandsoitisusualtospecifyperunitlength
values for the parameters. When constructing a mathematical model of a transmission
line it is easier to think in terms of lumped components spanning a distance δz
and then allow δz to tend to zero (see Figure 19.9). The leakage between the two
conductors is conventionally modelled by a conductance, G, as this simplifies the
mathematics and avoids confusion with the line resistance,R. Note thatC,G,Land
Rareper unit length values forthe transmissionline.
×
Outer conductor
Inner conductor
Protective coating
Insulator
Figure19.8
Acoaxial cable.
19.6 Constant coefficient equations 577
For many transmission lines of interest the signal that is being carried varies sinusoidally
with time. Therefore the voltage and current depend on both position along
theline,z,andtime,t.However,itiscommontoseparatethetimedependencefrom
the voltage and current expressions in order to simplify the analysis. It must be remembered
that any voltage and current variation with position has superimposed
uponitasinusoidalvariationwithtime.Therefore,ignoringthetime-dependentelement
we write the voltage as v and the current asi, knowing that they are functions
ofz.
y
i
i + di
di
Cdz
y L
Ldz
Gdz
y R
Rdz
y + dy
i C
iG
z
dz
Figure19.9
A section ofatransmissionline.
ConsiderthecircuitofFigure19.9whichrepresentsasectionofthetransmission
line oflength δz. Applying Kirchhoff’s voltage law tothe circuityields
v+δv−v+v L
+v R
=0
δv=−v L
−v R
where v L
is the voltage across the inductor and v R
is the voltage across the resistor.
Using the individual component laws for the inductor and resistorgives
δv = −ijωLδz −iRδz
= −i(R +jωL)δz
Notethat δihasbeenignoredbecauseitissmallcomparedtoi.Nowconsidertheparallelcombinationofthecapacitorandresistor(withunitsofconductance).Applying
Kirchhoff’s current law tothiscombination yields
δi=i C
+i G
wherei C
isthecurrentthroughthecapacitorandi G
isthecurrentthroughtheresistor.
Using the individual component laws for the capacitor and resistor gives
δi = −vjωCδz − vGδz
= −v(G +jωC)δz
Dividing these two circuitequations by δzyields
δv
δz
= −i(R +jωL)
δi
δz = −v(G+jωC)
➔
578 Chapter 19 Ordinary differential equations I
Inordertomodelacontinuoustransmissionlinewithevenlydistributedparameters,
δz isallowed totend tozero. Inthe limitthe two circuitequations become
dv
= −i(R +jωL) (19.19)
dz
di
= −v(G+jωC) (19.20)
dz
Differentiating Equation (19.19) yields
d 2 v
= −(R +jωL)di
dz2 dz
Substituting for di fromEquation (19.20) yields
dz
d 2 v
= (R +jωL)(G +jωC)v
dz2 This isusually writtenas
d 2 v
dz 2 = γ2 v where γ 2 = (R +jωL)(G +jωC) (19.21)
This is the differential equation that describes the variation of the voltage, v, with
position,z,alongthetransmissionline.Thegeneralsolutionofthisequationiseasily
shown tobe
v=v 1
e −γz +v 2
e γz (19.22)
where v 1
and v 2
areconstantsthatdependontheinitialconditionsforthetransmission
line. It is useful to write γ = α +jβ thus separating the real and imaginary parts of
γ. Equation (19.22) can then be writtenas
v = v 1
e −αz e −jβz + v 2
e αz e jβz (19.23)
The quantity v 1
e −αz e −jβz represents the forward wave on the transmission line. It
consists of a decaying exponential multiplied by a sinusoidal term. The decaying
exponentialrepresentsagradualattenuationofthewavecausedbylossesasittravels
along the transmission line. The quantity v 2
e αz e jβz represents the backward wave
produced by reflection. Reflection occurs if the transmission line is not matched
withitsload.Asthewaveistravellingintheoppositedirectiontotheforwardwave,
e αz stillrepresents an attenuation butinthiscase an attenuation aszdecreases.
Alosslesslineisoneinwhichtheattenuationisnegligible.Thiscasecorresponds
toα=0,andsoγ =jβ.Ifγ =jβthenγ 2 = −β 2 so that, from Equation (19.21),
(R + jωL)(G + jωC) must be real and negative. We see that this is the case when
R = 0 andG = 0. This agrees with what would be expected in practice as it is the
resistive and conductive termsthatlead toenergy dissipation.
Engineeringapplication19.10
Voltagereflectioncoefficient
Consider a lossless transmission line of the type already described in Engineering
application19.9.Weknowingeneralthattheforwardwaveatpositionzisgivenby
v 1
e −αz e −jβz . For the lossless line this simplifies to v 1
e −jβz . The reverse term, by the
samereasoning,is v 2
e jβz .
19.6 Constant coefficient equations 579
A definition used regularly when analysing transmission lines is the voltage
reflection coefficient. Usually this is denoted by ρ and can be defined either at a
specific position on a lineor as a function ofdistance
ρ(z) = v 2 ejβz
v 1
e −jβz = v 2
v 1
e 2jβz
Itisadimensionlessquantityandistheratioofthereversetoforwardwavecomponents.
Inmanysystemsencounteredinradio-frequency(RF)engineeringitisdesirable
to minimize the relative amplitude of the backward wave component and hence the
reflection coefficient. An example of this can be found in the transmit circuit of a
mobilehandsetwhereatransmissionlinecarryingthesignalisattachedtoanantenna.
Duringtransmission,forwardwavespropagatetowardstheantennaterminals.Atthis
point they are either radiated from or dissipated within the antenna, or they reflect
back.Ifreflectedbacktheymayreturntotheamplifiercircuitwhichgeneratedthem,
and are wasted in the form of heat energy. As a consequence, battery life can be
reduced due to wasted power. Hence the minimization of the reflection coefficient,
usually by carefully designing the antenna, at the working frequency of the handset
isan important activity.
Fortheantennaitisdesirablefor ρ(z)tobeassmallaspossibleandfor v 1
≫ v 2
.
Note that in a system such as this |ρ(z)| < 1 and that ρ(z) is in general a complex
number.
We now consider a transmission line of length l with characteristic impedance
Z 0
terminated at z = l with a load having impedance Z L
. The setup is shown in
Figure 19.10.
z = 0
y 2 e jbz
Z 0
y 1 e –jbz
z = l
Z L
z
Figure19.10
Lossless transmissionline with
termination.
The total voltage at the load end of the line, v L
, is the sum of the forward and
backward wave componentsatz = l, thatis
v L
= v 1
e −jβl +v 2
e jβl
Using the definition ofthe reflection coefficient atz = l,
ρ(l) = v 2
v 1
e 2jβl
we can rewrite the voltage atthe load as
v L
= v 1
e −jβl e jβl
+ρ(l)v 1
e = v j2βl 1 e−jβl + ρ(l)v 1
e −jβl = v 1
e −jβl [1 + ρ(l)]
Thecurrentattheload,i L
,canbefoundfromfirstprinciplesbyasimilaranalysisto
the voltage, butwill be statedhere for simplicity
i L
= 1 Z 0
(v 1
e −jβl − v 2
e jβl )
➔
580 Chapter 19 Ordinary differential equations I
This equation can alsobe written intermsof the reflection coefficient
i L
= v 1 e−jβl
[1 − ρ(l)]
Z 0
The voltage and currentatthe load arerelated by the simple equation
Z L
= v L
i L
and hence
Z L
= v 1 e−jβl [1 + ρ(l)]
v 1
e −jβl
[1 − ρ(l)]
Z 0
=Z 0
[1 + ρ(l)]
[1 − ρ(l)]
Itcan be shown by further manipulation that
ρ(l) = Z L −Z 0
Z L
+Z 0
Thereflectioncoefficientattheloadisthereforedependentonlyonthecharacteristic
impedance of the line, which is usually known, and the load impedance. Some RF
measuringinstrumentssuchasnetworkanalyserscanmeasuretheamountofforward
and backward waves. From thisthey areable todetermine the load impedance.
Engineeringapplication19.11
Standingwavesontransmissionlines
Thedifferentialequationsthatmodelvoltageandcurrentwavesontransmissionlines
alsodescribethepresenceof standingwaves.Astandingwave issocalledbecause
it appears to remain stationary in space. Standing waves are an effect caused by the
interference pattern created when two waves propagate in opposite directions in the
sametransmissionmedium.
Thepreviousexamplesuggestedthatminimizingbackwardwavesisoftendesired.
In this case the standing wave component on the line is also minimized. In order to
studythisinmoredetailwewishtoplotthevoltagestandingwavepatternforagiven
loadimpedance,Z L
.
Recall from Equation (19.23) (with α = 0 for a lossless line) that the voltage at
any pointzon the lineisgiven by
v(z) = v 1
e −jβz +v 2
e jβz
This voltage is the sum of the forward and reflected waves. The modulus of this
function can be plotted withzas the independent variable. Once the amplitude v 1
is
known, then the amplitude v 2
can be determined using the reflection coefficient at
the load
ρ(l) = v 2
v 1
e j2βl
from which v 2
= v 1
ρ(l)e −j2βl so that
v(z) = v 1
e −jβz + v 1
ρ(l)e −j2βl e jβz
19.6 Constant coefficient equations 581
which simplifies to
v(z) = v 1
e −jβz[ 1 + ρ(l)e j2β(z−l)]
The modulus of v(z) is now plotted against z for several different values of the
reflection coefficient at the load ρ(l). Consider a transmission line of total length
l = 10 m. Another quantity weneed toknow inorder toplot the voltage on the line
is β, which is the phase constant. For a transmission line of given construction and
ataparticular frequency, β isconstantand represents thephase change per metreof
transmissionline. Herewe take β = π/2 rad m −1 .
Case1: ρ(l)= −1
Consider the case when the transmission line is terminated in a short circuit. The
reflection coefficient atthe load is
ρ(l) = Z L −Z 0
Z L
+Z 0
= 0−Z 0
0+Z 0
= −1
Taking the amplitude v 1
= 1, which represents a 1 V peak sine wave, and plotting
the modulus of v(z), gives the resultshown inFigure 19.11.
y(z)
2
1
0
2 4 6 8 10 z
Figure19.11
Voltage standing wave pattern forashort-circuitloadwith a 1 Vinput wave.
Noticethatthevoltagepeakonthelineis2Vwhereasthevoltageinputwasonly
1V.Thisisduetotheforward-going1Vinputwavereachingtheendofthelineand
reflectingbackuponitself.Atsomevaluesofzitconstructivelyinterfereswithitself
giving double the input; atothers itdestructively interferesgiving zero volts.
Case2: ρ(l)=1
Asimilareffectisseenforanopen-circuitload.HerebyconsideringZ L
→ ∞,ρ(l)
can be shown to equal +1, giving rise to the standing wave pattern shown in
Figure 19.12. Note here that the voltage maximum is at the load, unlike the case
ofthe shortcircuitwhere the minimum wasatthe load.
y(z)
2
1
0
2 4 6 8 10 z
Figure19.12
Voltage standing wave pattern foran open-circuit loadwith a1Vinput wave.
➔
582 Chapter 19 Ordinary differential equations I
Case3: ρ(l)=0
Consider now the case in which the load impedance is equal to the characteristic
impedance, thatis
Z L
=Z 0
This time, ρ(l) = 0 and hence v(z) = v 1
e −jβz . The magnitude of this complex
exponential is shown plotted in Figure 19.13 and is a horizontal straight line with
|v(z)| = 1 for all values ofz.
y(z)
2
1
0
2 4 6 8 10 z
Figure19.13
Voltage onatransmissionline with a reflection coefficient of ρ(l) = 0.
No standing wave component is present and the line is said to be matched. This
is the ideal case for power transfer because it represents the case in which all of the
input wave isdelivered tothe load and nothing isreflected back.
Case4:ageneralcase
Oftenitisdifficulttoarrangeforaperfectlymatchedtransmissionline.MoregenerallythepeakvoltageonthelineappearsliketheoneshowninFigure19.14.Herean
assumed reflection coefficient of ρ(l) = 0.187 −j 0.015 isused.
y(z)
2
1
0
2 4 6 8 10 z
Figure19.14
Voltage onatransmissionline with a reflection coefficient of ρ(l) = 0.187 − j0.015.
The standing wave component is still present although it is of smaller magnitude
than seen for the open- or short-circuit loads. Most of the input wave is transferred
tothe load but a proportion isreflected back and interferes.
The standing wave pattern of a transmission line can be measured by a special
piece of apparatus known as a slotted line. The device consists of a length of transmissionlinewithaslotintowhichasmallprobeisinserted.Theprobeisusedtofind
thevoltageasafunctionofdistanceasitismovedalongthelinetowardsanunknown
load. The equations presented above can be used with the results of the slotted line
experiment todetermine the reflection coefficient ofthe load.
19.6 Constant coefficient equations 583
EXERCISES19.6.3
1 Find the general solution ofthe followingequations:
(a) d2 x
dt 2 −2dx dt −3x=6
(b) d2 y
dx 2 +5dy dx +4y=8
(c) d2 y
dt 2 +5dy dt +6y=2t
(d) d2 x
dt 2 +11dx dt +30x=8t
(e) d2 y
dx 2 +2dy dx +3y=2sin2x
d 2 y
(f)
dt 2 + dy
dt +y=4cos3t
(g) d2 y
dx 2 +9y=4e8x
(h) d2 x
dt 2 −16x=9e6t
2 Find aparticular integral forthe equation
d 2 x
dt 2 −3dx dt +2x=5e3t
3 Find aparticular integral forthe equation
d 2 x
dt 2 −x=4e−2t
4 Obtainthe general solution of
y ′′ −y ′ −2y=6.
5 Obtainthe general solution ofthe equation
d 2 y
dx 2 +3dy dx +2y=10cos2x
Find the particularsolution satisfying
y(0) =1, dy (0) =0.
dx
6 Find aparticularintegralforthe equation
d 2 y
dx 2 + dy
dx +y=1+x
7 Find the general solution of
d 2 x
(a)
dt 2 −6dx dt +5x=3
d 2 x
(b)
dt 2 −2dx dt +x=et
8 Forthe circuitshown in Figure19.15showthat
RCL d2 i 2
dt 2 +Ldi 2
+Ri
dt 2 =E(t)
IfL=1mH,R=10,C=1Fand
E(t) = 2sin100πt,findthe complementary function.
9 Find the general solution of
i
d 2 i
dt 2 +8di +25i = 48cos3t −16sin3t
dt
R
Figure19.15
i 1
i 2
E(t)
C
L
Solutions
1 (a) x=Ae −t +Be 3t −2
(f) y = e −0.5t (Acos0.866t +Bsin0.866t) −
0.438cos3t +0.164sin3t
(b) y=Ae −x +Be −4x +2
(c) y=Ae −2t +Be −3t + t 3 − 5 (g) y =Acos3x +Bsin3x +0.0548e 8x
18
(d) x =Ae −6t +Be −5t +0.267t −0.0978 (h) x=Ae 4t +Be −4t + 9
20 e6t
(e) y = e −x [Asin √ 2x+Bcos √ 2x] − 8 17 cos2x − 2 x=2.5e 3t
2
17 sin2x 3 x = 4 3 e−2t
584 Chapter 19 Ordinary differential equations I
4 Ae 2x +Be −x −3
5 Ae −2x +Be −x + 3 2 sin2x − 1 2 cos2x,
3
2 e−2x + 3 2 sin2x − 1 2 cos2x
6 x
7 (a) Ae t +Be 5t + 3 5
(b) Ae t +Bte t + 1 2 t2 e t
8 Ae −11270t +Be −88730t
9 e −4t (Asin3t +Bcos3t)+
14sin3t +18cos3t
13
19.7 SERIESSOLUTIONOFDIFFERENTIALEQUATIONS
Wenowintroduceatechniqueforfindingsolutionsofdifferentialequationsthatcanbe
∞∑
expressed in the form of a power seriesy = a m
x m =a 0
+a 1
x +a 2
x 2 + ....Here,
a m
,m = 0,1,2,3... areconstants whosevalues needtobefound.
With the given power series form of y, we can differentiate term by term to obtain
power series for dy
dx and d2 y
dx2. By substituting these series into certain classes of differentialequationwecandeterminetheconstantsa
m
,form = 0,1,2,3...andthusobtain
the power series solution fory. This technique of representing a solution in the form of
a power series paves the way for the introduction of an important family of differential
equations,knownasBessel’sequations,whichcanbesolvedbythismethod.Thepower
series solutions so formed are known as Bessel functions which will be introduced in
Section 19.8.
Consider the following example:
Example19.30 The general solution of the differential equation d2 y
+y = 0 was shown in Example
dx2 19.16 tobe
Solution We let
y=Acosx+Bsinx
whereAandBarearbitraryconstants.Obtainthisresultbylookingforasolutionofthe
∞∑
equation inthe form ofapower seriesy = a m
x m .
y =
m=0
m=0
∞∑
a m
x m =a 0
+a 1
x+a 2
x 2 +a 3
x 3 +a 4
x 4 +a 5
x 5 +...
m=0
from which, by differentiating with respecttox,
and
∞
dy
dx = ∑
ma m
x m−1 =a 1
+2a 2
x+3a 3
x 2 +4a 4
x 3 +5a 5
x 4 +...
m=0
d 2 y
dx = ∑ ∞
(m −1)ma 2 m
x m−2 = 2a 2
+ (2)(3)a 3
x + (3)(4)a 4
x 2 + (4)(5)a 5
x 3 + ...
m=0
19.7 Series solution of differential equations 585
These expressions for y and d2 y
are substituted into the given differential equation
dx2 d 2 y
dx +y=0.Thus 2
(2a 2
+ (2)(3)a 3
x + (3)(4)a 4
x 2 + (4)(5)a 5
x 3 + ...) +
(a 0
+a 1
x+a 2
x 2 +a 3
x 3 +a 4
x 4 +a 5
x 5 +...) =0.
We now use the technique of equating coefficients. This was first introduced in
Section 1.7 Partial fractions. Equating the constant terms:
2a 2
+a 0
=0 sothat a 2
=− 1 2 a 0 .
Equating coefficients ofx:
(2)(3)a 3
+a 1
=0 sothat a 3
= − 1
(2)(3) a 1 .
Equating coefficients ofx 2 :
(3)(4)a 4
+a 2
=0 sothat a 4
= − 1
(3)(4) a 2 = 1
(2)(3)(4) a 0 .
Equating coefficients ofx 3 :
(4)(5)a 5
+a 3
=0 sothat a 5
= − 1
(4)(5) a 3 = 1
(2)(3)(4)(5) a 1 .
Note that by using factorial notation we can express these coefficients concisely as:
a 2
=− 1 2! a 0 , a 3 =−1 3! a 1 , a 4 = 1 4! a 0 , a 5 = 1 5! a 1 .
Byequatinghigherordertermsinthesamewayfurthercoefficientsa m
canbedetermined
∞∑
and the power seriesy = a m
x m can then be written out explicitly. Observe that all
m=0
coefficientsa m
are multiples ofa 0
whenmis even, and multiples ofa 1
whenmis odd.
We can therefore collect terms together as follows:
∞∑
y = a m
x m
m=0
=a 0
+a 1
x+a 2
x 2 +a 3
x 3 +a 4
x 4 +a 5
x 5 +...
=a 0
+a 1
x− 1 2! a 0 x2 − 1 3! a 1 x3 + 1 4! a 0 x4 + 1 5! a 1 x5 +...
=a 0
(1− 1 2! x2 + 1 4! x4 −...)+a 1
(x− 1 3! x3 + 1 5! x5 − ...) (19.24)
ObservethattheseriesinthebracketsinEquation19.24arethepowerseriesexpansions
of cosx and sinx as discussed inSection 18.6. Hence wecan writethe solution as
y=a 0
cosx+a 1
sinx.
We see that this is simply the general solution of the differential equation as given
in the question but where the coefficients a 0
and a 1
are the arbitrary constants. Had
we not recognised the series expansions for the sine and cosine functions we would
still have obtained the general solution as expressed in the form of the power series in
Equation 19.24.
586 Chapter 19 Ordinary differential equations I
This method of solution iswidely applicable and involves the following steps:
• given a differential equation, we seek a solution in the form of the power series
∞∑
y = a m
x m ,
m=0
• thispower series issubstitutedinto the differential equation
• the coefficientsa m
are then found by the technique of equating coefficients.
Once the values of a m
are found, the solution follows in the form of a power series.
Some differential equations and their solutions have such importance in engineering
and scientific applications that the series obtained are used to introduce new functions
in much the same way as the sine and cosine functions appeared in Equation 19.24.
We shall see in the following section how an important class of functions known as
Besselfunctionsarisesinthisway.Thereisofcoursethequestionofwhetheranypower
series expansion converges but such discussion is beyond the scope of the introductory
treatment given here.
EXERCISES19.7
1 Thegeneral solution ofthe equation d2 y
dx 2 −y=0is
y =Ae x +Be −x whereAandBarearbitraryconstants.
Obtain thisresultby lookingforasolution ofthe
∞∑
equation in the form ofapower seriesy = a m x m .
2 Thegeneral solution ofthe equation
m=0
d 2 y
dx 2 −3dy dx +2y = 0 isy =Aex +Be 2x whereAand
Barearbitraryconstants.Obtainthisresultbylooking
forasolution ofthe equation in the formofapower
∞∑
seriesy = a m x m .
m=0
3 Verifyusingthemethodofseparationofvariablesthat
the generalsolutionof dy
dx = 2xyisy =Kex2 whereK
isan arbitraryconstant. Obtainthe same resultby
seeking apower seriessolution ofthe differential
∞∑
equationin the formy = a m x m .
m=0
4 Obtainthe first fournon-zeroterms in the power
seriessolution ofthe initialvalue problem
dy
dx +xy=0,y(0)=1.
5 Obtainthe power series solution ofthe equation
d 2 y
dx 2 +xy = 0,upto termsinvolvingx7 . This
differentialequation isknown asanAiryequation.
Solutions
4 y=1− x2
2 + x4
8 − x6
48 +.... 5
y=a 0
(
1 − x3
6 + x6
180 −... )
(
)
+a 1 x − x4
12 + x7
504 −... .
19.8 Bessel’s equation and Bessel functions 587
19.8 BESSEL’SEQUATIONANDBESSELFUNCTIONS
Alineardifferentialequationthathasimportantapplicationsinthetheoryofwavepropagation,
particularly when using cylindrical or spherical polar coordinates, is known as
Bessel’sequation.InfactthetermBessel’sequationrepresentsawholefamilyofequations
having the form
x 2d2 y
dx +xdy 2 dx +(x2 −v 2 )y=0.
Herexandyaretheindependentvariableanddependentvariablerespectively.Thethird
quantity v,termedaparameter,isknownastheorderoftheequation. 1 Aswevarythe
parameter v we obtain the family of differential equations. Note that v can be any real
number. In Sections 19.8.1 and 19.8.2 we will solve Bessel’s equation when v = 0 and
v = 1.Intheengineeringapplicationswhichfollowlaterinthechapteryouwillseethat
v may be other integers including those that are negative. Observe that the equation is
linearandhomogeneous,butitdoesnothaveconstantcoefficients:thecoefficientof d2 y
dx 2
isx 2 , a function of the independent variable. In what follows we show that solutions of
Bessel’sequationcanbefoundintheformofinfinitepowerserieswhichwecallBessel
functions.
Bessel functions have many applications in physics and engineering. In electrical
engineeringtheyarecommonlyusedinthefieldofcommunications.Theyareimportant
in finding the bandwidth and frequency content of some radio signals. They are also
essentialintheanalysisofpropagatingmodesincylindricalwaveguidesandcylindrical
cavityresonators,bothofwhichareimportantdevicesusedinhighfrequencymicrowave
engineering(seeEngineeringapplication19.12).Besselfunctionsarealsoutilisedinthe
design of filters.
19.8.1 Bessel’sequationoforderzero
To introduce the solution of Bessel’s equation and thereby introduce a Bessel function
the following example deals with the specialcase when v = 0.
Example19.31 By seeking a power series solution in the form
∞∑
a m
x m obtain the first four non-zero
termsinapower series solution ofBessel’s equation inthe specialcase when v = 0.
Solution When v = 0 Bessel’s equation is:
x 2d2 y
dx +xdy 2 dx +x2 y=0.
which fornon-zeroxbecomes
x d2 y
dx + dy +xy=0. (19.25)
2 dx
m=0
1 Note that in this context the word ‘order’ means something different from the order of a differential
equation, which wasdefined in 19.2.1.
588 Chapter 19 Ordinary differential equations I
As was done inSection 19.7 we seekapower series solution inthe form
y =
∞∑
a m
x m =a 0
+a 1
x+a 2
x 2 +a 3
x 3 +a 4
x 4 +a 5
x 5 +...
m=0
The strategy is to substitutey, and its first and second derivatives into Equation 19.25
and choose valuesa m
so that the equation is satisfied.yand its first two derivatives can
be written:
y=a 0
+a 1
x+a 2
x 2 +a 3
x 3 +a 4
x 4 +a 5
x 5 +...
dy
dx =a 1 +2a 2 x+3a 3 x2 +4a 4
x 3 +5a 5
x 4 +...
d 2 y
dx 2 =2a 2 +6a 3 x+12a 4 x2 +20a 5
x 3 +...
Substitution intox d2 y
dx + dy +xy = 0gives
2 dx
(2a 2
x +6a 3
x 2 +12a 4
x 3 +20a 5
x 4 ...) + ( a 1
+2a 2
x+3a 3
x 2 +4a 4
x 3 ... )
+ (a 0
x+a 1
x 2 +a 2
x 3 +a 3
x 4 +a 4
x 5 ...) =0
We now equate coefficients on both sides of this equation starting with the constant
term:
a 1
=0.
Equating coefficients ofx:
4a 2
+a 0
=0 sothat a 2
=− 1 4 a 0 .
Equating coefficients ofx 2 :
9a 3
+a 1
=0 sothat a 3
=− 1 9 a 1 = 0 sincea 1 = 0.
Equating coefficients ofx 3 :
16a 4
+a 2
=0 sothat a 4
=− 1 (
16 a 2 = − 1 ) (
− 1 )
a
16 4 0
= 1
64 a 0 .
Itiseasytoverifythatallsubsequenttermsa m
,whenmisodd,arezero.Youshouldverify,inasimilarway,thata
6
= − 1
2304 a 0 .Whenmiseven,alla m aremultiplesofa 0 .We
can now write down a power series that satisfies Bessel’s differential
Equation 19.25:
(
y=a 0
1 − 1 4 x2 + 1
64 x4 − 1 )
2304 x6 +...
(19.26)
We have succeeded in obtaining a power series solution of Bessel’s Equation 19.25
wherea 0
isan arbitraryconstant.
As we have already noted, a 0
is an arbitrary constant. However, historically its value
is chosen to be 1 in Equation 19.26. The resulting power series is used to define a
19.8 Bessel’s equation and Bessel functions 589
J o (x)
1
0
–0.4
10 20 30
x
Figure19.16
TheBesselfunction ofthe firstkind oforder
zero,J 0 (x).
function called a Bessel function of the first kind of order zero that is conventionallydenotedJ
0
(x). Itispossible toshow thatitcan beexpressed concisely usingsigma
notation as
J 0
(x) =
∞∑
m=0
(−1) m x 2m
2 2m (m!) 2 . (19.27)
You shouldverifythisbywritingoutthefirstfewtermsofthisinfiniteseries.Note that
this method of solving the differential equation produces a power series solution and is
usedtointroduceanewfunction,J 0
(x).ItisnotpracticaltotrytodrawagraphofJ 0
(x)
by hand but this can be obtained readily using a technical computing language such as
MATLAB ® .Figure19.16showsagraphobtainedinthisway.ObservethatJ 0
hassome
similar characteristics to a cosine wave: its value whenx = 0,J 0
(0), equals 1, and the
functionoscillates,albeitwithdecreasingamplitude.Thevaluesofxwherethefunction
crosses the horizonal axis are known as zeros of the Bessel function and these values
arise in important applications as we shall see in Engineering application 19.12: The
modes of a cylindrical microwave cavity.
WhenaspecificvalueofaBesselfunctionisrequireditisusualpracticetolookthis
up rather than work directly with the power series definition. Both printed tables and
computer packages are available forthispurpose.
Recall from Section 19.5 that the general solution of a second order linear equation
requires two independent solutions. The function J 0
(x) is one such solution. Another
independent solution is called a Bessel function of the second kind of order
zero and is denoted byY 0
(x). Derivation of this function is beyond the scope of this
book but, like J 0
(x), values ofY 0
(x) can be obtained from tables and by using computer
software. The general solution of Bessel’s equation of order zero can then be
written
y =AJ 0
(x) +BY 0
(x)
whereAandBare arbitraryconstants.
19.8.2 Bessel’sequationoforderone
Here we adopt the previous approach to obtain a solution of Bessel’s equation of order
one, and thereby introduce the Bessel function of the first kind of order one.
590 Chapter 19 Ordinary differential equations I
Example19.32 By seeking a power series solution in the form
∞∑
a m
x m obtain the first three non-zero
m=0
terms inthe solution of Bessel’s equation inthe case when v = 1:
x 2d2 y
dx 2 +xdy dx + (x2 −1)y = 0. (19.28)
Solution As before weseek a power series solution inthe form
∞∑
y = a m
x m =a 0
+a 1
x+a 2
x 2 +a 3
x 3 +a 4
x 4 +a 5
x 5 ...
m=0
The strategy is to substitutey, and its first and second derivatives into Equation 19.28
and choose valuesa m
so that the equation is satisfied.yand its first two derivatives can
be written:
y=a 0
+a 1
x+a 2
x 2 +a 3
x 3 +a 4
x 4 +a 5
x 5 ...
dy
dx =a 1 +2a 2 x+3a 3 x2 +4a 4
x 3 +5a 5
x 4 ...
d 2 y
dx 2 =2a 2 +6a 3 x+12a 4 x2 +20a 5
x 3 ...
Substitution intox 2d2 y
dx 2 +xdy dx + (x2 −1)y =0gives
2a 2
x 2 +6a 3
x 3 +12a 4
x 4 +20a 5
x 5 ... + ( a 1
x +2a 2
x 2 +3a 3
x 3 +4a 4
x 4 +5a 5
x 5 ... )
Removing the brackets
+ (x 2 −1)(a 0
+a 1
x+a 2
x 2 +a 3
x 3 +a 4
x 4 +a 5
x 5 ...) =0
2a 2
x 2 +6a 3
x 3 +12a 4
x 4 +20a 5
x 5 ...+ ( a 1
x +2a 2
x 2 +3a 3
x 3 +4a 4
x 4 +5a 5
x 5 ... )
+a 0
x 2 +a 1
x 3 +a 2
x 4 +a 3
x 5 +a 4
x 6 +a 5
x 7 ...
−(a 0
+a 1
x+a 2
x 2 +a 3
x 3 +a 4
x 4 +a 5
x 5 ...) =0
We now equate coefficients on both sides of this equation starting with the constant
term:
−a 0
= 0 so that a 0
= 0.
Equating coefficients ofx:
a 1
−a 1
= 0
sothata 1
isarbitrary.
Equating coefficients ofx 2 :
2a 2
+2a 2
+a 0
−a 2
= 0 sothat 3a 2
= −a 0
, from which a 2
= 0.
Equating coefficients ofx 3 :
6a 3
+3a 3
+a 1
−a 3
= 0 sothat 8a 3
= −a 1
, from which a 3
= − 1 8 a 1 .
It is easy to verify that all subsequent terms a m
when m is even, are zero. Equating
coefficients ofx 5 :
20a 5
+5a 5
+a 3
−a 5
= 0 so that 24a 5
= −a 3
fromwhich
a 5
=− 1
24 a 3 = 1
192 a 1
using a 3
= − 1 8 a 1 .
19.8 Bessel’s equation and Bessel functions 591
Note that all non-zero coefficients are multiples ofa 1
. We can now write down the first
threenon-zero terms inthe power series solution:
(
y=a 1
x − 1 8 x3 + 1 )
192 x5 −...
(19.29)
Herea 1
isanarbitraryconstant.Wehavesucceededinobtainingapowerseriessolution
of Bessel’s Equation 19.28.
Although a 1
is an arbitrary constant, historically its value is chosen to equal 1 2 in
Equation19.29andtheresultingpowerseriesisusedtodefineafunctioncalledaBessel
functionofthefirstkindoforderonethatisconventionallydenotedJ 1
(x).Itispossible
toshow thatitcan beexpressed concisely using sigmanotation as
J 1
(x) =
∞∑
m=0
(−1) m x 2m+1
2 2m+1 m!(m +1)! . (19.30)
You should verify this by writing out the first few terms of this infinite series. Figure
19.17 shows a graph of J 1
(x) obtained using computer software. Observe that the
graph is similar to that of the sine function but has decreasing amplitude. Here,
J 1
(0) =0.
J 1 (x)
0.5
0
10 20 30
x
–0.3
Figure19.17
The Besselfunction ofthe firstkind of
order 1,J 1 (x).
A second independent solution is called a Bessel function of the second kind of
orderoneandisdenotedbyY 1
(x).Derivationofthisfunctionisbeyondthescopeofthis
bookbut,likeJ 1
(x),valuesofY 1
(x)canbeobtainedfromtablesandbyusingcomputer
software. The general solution ofBessel’s equation of order one can thenbe written
y =AJ 1
(x) +BY 1
(x)
whereAandBare arbitraryconstants.
592 Chapter 19 Ordinary differential equations I
EXERCISES19.8.2
1 Write outexplicitly the firstfiveterms in the series
expansion ofJ 0 (x) andthe firstfour terms in the
expansion ofJ 1 (x).BydifferentiatingJ 0 (x) verify
that d dx (J 0 (x)) = −J 1 (x).
2 (Method ofFrobenius) Substituteapower series in
∞∑
the more general formy = a m x m+r ,whereris to
m=0
be determined, intoBessel’sequationoforder 1:
x 2d2 y
dx 2 +xdy dx +(x2 −1)y=0.
(i) By equatingcoefficients ofx r show thatr
satisfies theindicialequationr 2 −1 = 0 and
hence deducer = ±1.
(ii) Continuethe solution forthe caser = 1 to
obtain a seriessolution ofthe differential
equation. Findthe first 3 non-zeroterms.
(iii) Confirm that the solution is equivalent to that
obtained previously in Equation19.29.
19.8.3 Bessel’sequationandBesselfunctionsofhigherorder
Tocopewithmoregeneralcases,includingthoseofanon-integerorder,theusualpractice
is to seek solutions in the form a m
∞∑
x m+r where r is an unknown real number
m=0
that is chosen during the course of the calculation in order to generate a solution (see
Exercise 19.8.2, question 2). For details on how to proceed in such cases you should
consultanadvanced texton differential equations.
When the order, v, is a positive integer, it is conventional to label it asninstead. A
solution of Bessel’s equation of positive integer ordernis a Bessel function of the first
kind ofordern, denotedJ n
(x), and can beshown tobe
J n
(x) =
∞∑
m=0
(−1) m x 2m+n
2 2m+n m!(m +n)! , forn=0,1,2,...
Graphs of these functions forn = 0,1,2,3,4 are shown inFigure 19.18.
Whilst to this point we have only discussed cases where the order is non-negative,
in fact Bessel functions are defined for any order v, −∞ < v < ∞. Later, in Engineering
application 19.13 we shall make use of Bessel functions with negative integer
order. In such cases, careful inspection of the above definition shows that it will fail.
When n is negative, then m + n will be negative for some values of m and in these
case (m +n)! is not defined. For a complete treatment, and one which is beyond the
scope of this book, it is necessary to define a new function - the gamma function Ŵ(x)
- which can be thought of as a generalisation of factorials to deal with negative arguments.
Nevertheless, this is possible and it can be shown that Bessel functions with
negativeintegerorderarerelatedtotheircounterpartswithpositiveintegerorderbythe
result
J −n
(x) = (−1) n J n
(x).
(See Exercise 19.8.3)
n 5 2
n 5 3n
5 4
19.8 Bessel’s equation and Bessel functions 593
J n (x)
1
n 5 0
n 5 1
0
2 4 6 8 10 12 x
–0.4
Figure19.18
BesselfunctionsJ n (x),forn = 0 . . .4.
Engineeringapplication19.12
Modesofacylindricalmicrowavecavity
Circularcross-sectionwaveguidesandcylindricalmicrowavecavitiesarebothcomponents
that are found in high power microwave systems such as radar transmitters.
Bessel functions are used in the mathematical description of electromagnetic wave
behaviour in such devices. By way of example, consider the microwave cavity depictedinFigure
19.19.
r
l
Figure19.19
Acylindrical microwave cavity.
➔
594 Chapter 19 Ordinary differential equations I
Thewallsofmicrowavecavitiesareusuallymadeofmetalswithgoodconductivity
such as copper, steel or aluminium but in some applications the internal surfaces
areplatedwithpreciousmetalssuchasgoldtopreventoxidationwhichwouldreduce
the surface conductivity. When used at high power levels the conductivity is important.Lowconductivity
canleadtolossofpower andheatingofthecavityduetothe
currents flowing on the internal walls. The space inside the cavity can be empty or
it can contain a dielectric insulating material such as PTFE. The analysis here will
consider an empty cavity.
The purpose of the device is to act as a narrow band filter which selects a specific
frequency. In fact, there is a set of frequencies at which the microwave cavity
will work, known as the resonant frequencies. The electromagnetic waves inside
the cavity are governed by the propagation modes within the cylinder. These modes
are described mathematically by Bessel functions. It can be shown that the resonant
frequencies, f mnp
, associated with the transverse magnetic modes of the cavity are
given by
f mnp
= c
2π
√ (Xmn
r
) 2 ( pπ
+
l
) 2
wherem,nand paremodeindexnumbers,0,1,2,...andX mn
denotesthen-thzero
oftheBesselfunctionofthefirstkindoforderm,J m
(x),wherecisthespeedofwave
propagation inthe medium.
The zeros can be observed in Figure 19.18 as the points where the graph crosses
thexaxis.UsuallysoftwaresuchasMATLAB ® hasthecapabilitytocalculatenumericallythezerosofafunction.IfweobservethatthefirstzeroofJ
0
(x)isapproximately
x = 2.5 (see Figure 19.18) we can type the following at the MATLAB ® command
prompt:
X = fzero(@(x)besselj(0,x),2.5)
which gives the resultX = 2.4048.
The second zero ofJ 0
(x) isclose to5.5,thus wetype,
X = fzero(@(x)besselj(0,x),5.5)
which gives the resultX = 5.5201.
The first zero ofJ 1
(x) isapproximately 4,so,
X = fzero(@(x)besselj(1,x),4)
which gives the resultX = 3.8317.
Table19.2showsasummaryoftheresultsofapplying thismethodform = 0,1,
and 2.
Consideramicrowavecavityresonatormadefromanaluminiumfizzydrinkscan
with dimensionsr = 33 mm and l = 120 mm. The transverse magnetic modes are
labelledTM nmp
andthefirstcommonlyusedmodeisTM 011
.Theresonantfrequency
would be
19.8 Bessel’s equation and Bessel functions 595
Table19.2
Besselfunction zeros foramicrowave
cavity.
m X m1 X m2 X m3
0 2.4048 5.5201 8.6537
1 3.8317 7.0156 10.173
2 5.1356 8.4172 11.620
f 011
= 3×108
2π
√ (2.4048 ) 2
+
0.033
( ) 1 × π 2
= 3.70GHz
0.120
EXERCISES19.8.3
1 WehaveseenthatforpositiveintegerordertheBessel
functionsofthe firstkind are defined by
∞∑ (−1) m x 2m+n
J n (x) =
2 2m+n m!(m +n)! ,
m=0
for n=0,1,2,...
(a)Now suppose that weare interestedin Bessel
functionswithnegativeintegerorder.Replacenby
−nin thisdefinition to showthat
∞∑ (−1) m x 2m−n
J −n (x) =
2 2m−n m!(m −n)! ,
m=0
withn=1,2,...
(b)Now make the assumption that (m −n)!,which
appears in the denominator ofeach term,becomes
infinite whenm = 0,1,2, . . . ,n −1.Itis possible
to prove thisusingproperties ofthe Gamma
functionbutisbeyondthescopeofthisbook.With
thisassumption showthat
∞∑ (−1) m x 2m−n
J −n (x) =
2
m=n
2m−n m!(m −n)! ,
withn=1,2,...
(c)Byintroducing anew dummy variable,s,where
m =s +n, showthat
∞
J −n (x) = (−1) n ∑ (−1) s x 2s+n
2 2s+n m!s!
s=0
for n=0,1,2,...
and hence deduceJ −n (x) = (−1) n J n (x).
19.8.4 Besselfunctionsandthegeneratingfunction
A different set of modes known asTE modes may also be calculated using Bessel
functions. Rather than using zeros of the Bessel functions directly it is necessary to
usethederivativesoftheBesselfunctions,atopicbeyondthescopeofthisintroductorytreatment.
SeveralimportantpropertiesofBesselfunctionsusedinphysicsandengineeringapplications
can be deduced from knowledge of the so-called generating function. In this
sectionweintroducethisgeneratingfunctionanduseittoderivetheJacobi-Angeridentitieswhich
wethen employ inEngineering application 19.13.
Firstly, recall the power series expansion of e x introduced inSection 6.5.
∞
e x =1+x+ x2
2! + x3
3! + x4
4! +...= ∑ 1
n! xn . (19.31)
n=0
596 Chapter 19 Ordinary differential equations I
Recallalsothatthereisnothingspecialaboutlabellingthedummyvariablen.Anyother
letter could have been used and the statement of the expansion would be equivalent.
Now replacexfirstly by xt
2 and then by − x in Equation 19.31 to produce the following
expansions where we have usedr andsasthe dummy
2t
variables:
and
e xt
2 =
e − x 2t =
∞∑
r=0
∞∑
s=0
1
r!
( xt
) r
2
1
(
− x ) s
s! 2t
Multiplying these two expansions together wefind
e xt
2 e
− x 2t = e x 2 (t−1 t ) =
∞∑
r=0
1
r!
( xt
) r ∑ ∞
1
(
− x ) s
2 s! 2t
s=0
or, more explicitly,
[
]
e x 2 (t−1 t ) = 1 + xt
1!2 + x2 t 2
2!2 2 + x3 t 3
3!2 3 + x4 t 4
4!2 4 +... ×
[
]
1 − x
1!2t + x2
2!2 2 t 2 − x3
3!2 3 t 3 + x4
4!2 4 t 4 −...
(19.32)
Now imagine multiplying out the brackets on the right and observe that the resulting
expression can be considered as the sum or difference of terms involving powers oft,
that is, t, t 2 ,...t n ... and t −1 ,t −2 ,...t −n ,.... In fact we can express Equation 19.32
concisely as
e x 2 (t−1 t ) =
∞∑
n=−∞
y n
(x)t n (19.33)
where eachy n
(x) is a function ofxthat we can regard as the coefficient of the termt n
in the expansion. For example, let us focus first on the those terms in Equation 19.32
whichwhenmultipliedoutresultinmultiplesoft 0 ,aswillhappenbyformingproducts
such as
( )( )
1 1 ,
( xt
) (
− x )
,
1!2 1!2t
( )( )
x 2 t 2 x 2
2!2 2 2!2 2 t 2 ,
(
x 3 t 3
3!2 3 )(
− x3
3!2 3 t 3 )
Then together, these termswill formy 0
(x)t 0 inEquation 19.33, that is
y 0
(x)=1−
or more concisely,
y 0
(x) =
∞∑
n=0
x2
(1!) 2 2 + x 4
2 (2!) 2 (2 2 ) − x 6
2 (3!) 2 (2 3 ) +... 2
and soon.
(−1) n x 2n
(n!) 2 2 2n (19.34)
Comparing Equation 19.34 with Equation 19.27 we see thaty 0
(x) is simply the Bessel
function oforder zero,J 0
(x).
19.8 Bessel’s equation and Bessel functions 597
In a similar manner, suppose we focus attention on the terms in Equation 19.32 that
when multiplied outresultint 1 as will happen when weform products such as
( xt
)
(1) ,
1!2
( x 2 t 2 ) (
− x )
,
2!2 2 1!2t
( x 3 t 3
3!2 3 )( x
2
2!2 2 t 2 )
and so on.
Then, together these will formy 1
(x)t 1 inEquation 19.33, so that
y 1
(x) = x
1!2 − x3
2!2 3 + x5
3!2!2 5 ...
or more concisely,
y 1
(x) =
∞∑
m=0
(−1) m x 2m+1
(m +1)!m!2 2m+1 (19.35)
Comparing Equation 19.35 with 19.30 we see thaty 1
(x) is simply the Bessel function
of order one,J 1
(x).
Inthesameway,itispossibletoshowthatalltheothercoefficientsy n
(x)inEquation
19.33 are the Bessel functions J n
(x). The function e x 2 (t−1 t ) in Equation 19.33 is called
a generating function for the Bessel functions because the coefficient oft n in the expansion
isthe Bessel functionJ n
(x). Thus the expansion isused togenerate the Bessel
functions.We have the following important result:
Generatingfunctionforthe Bessel functions:
∞∑
e x 2 (t−1 t ) = J n
(x)t n (19.36)
n=−∞
Ifthefunctionontheleftisexpandedinpowersoft thenthecoefficientoft n isthe
Bessel function of ordern.
EXERCISES19.8.4
1 Usethe generating functionto show thatifnis an
even integer, thenthe BesselfunctionJ n (x)is an even
function, thatisJ n (−x) =J n (x).Ifnisan odd
integer, show thatJ n (x) isan odd function, thatis
J n (−x) = −J n (x).
19.8.5 TheJacobi-Angeridentities
Two results which relate to Bessel functions and which are important in communicationsengineering
are known as the Jacobi-Anger identities. We first statethem before
deriving them from the generatingfunction.
598 Chapter 19 Ordinary differential equations I
Jacobi-Anger identities
∞∑
e jxcos θ = J n
(x)j n e jnθ
n=−∞
∞∑
e jxsinθ = J n
(x)e jnθ
n=−∞
To obtain the first identity we let t = je jθ , so that t n = j n e jnθ , in Equation 19.36.
Noting that
t − 1 t = jejθ − 1
je jθ = jejθ +je −jθ = 2jcos θ
(where we have used the Euler relation cosθ = ejθ +e −jθ
, Section 9.7, p 339) then the
2
left hand sideof Equation 19.36 is
e x 2 (t−1 t ) = e x 2 (2jcosθ) = e jxcosθ
and sofrom Equation 19.36 we have
∞∑
e jxcosθ = J n
(x)j n e jnθ .
n=−∞
Inasimilarfashion, welett = e jθ , sothatt n = e jnθ , inEquation 19.36. Noting that
t − 1 t =ejθ − 1
e jθ = ejθ −e −jθ = 2jsin θ
(where we have used the Euler relation sinθ = ejθ −e −jθ
, Section 9.7 p 339) then the
2j
left hand sideof Equation 19.36 is
e x 2 (t−1 t ) = e x 2 (2jsinθ) = e jxsin θ
and sofrom Equation 19.36 we have
∞∑
e jxsinθ = J n
(x)e jnθ .
n=−∞
Engineeringapplication19.13
UseofBesselfunctionsinfrequencymodulation
A common type of modulation scheme used in electrical engineering is frequency
modulation(FM).Thepurposeofmodulationistoprepareasignalfortransmission,
such as when a signal is to be sent from a transmitter to a remote receiver during a
radiobroadcast.Atthereceiverthesignalisrecoveredfromitsmodulatedformbya
device called a demodulator.
Even though the process of electronically producing an FM signal is fairly
straightforward, the mathematics underpinning frequency modulation can become
complicatedandrequirestheuseofBesselfunctionsandaJacobi-Angeridentity.In
19.8 Bessel’s equation and Bessel functions 599
order to explain how FM works it is helpful to start with the electronic process by
which the FM signal is generated and to then derive the properties of the modulated
signal mathematically.
The FM signal is usually produced by a device called a voltage controlled oscillator
(VCO). The VCO generates a sinusoidal signal on its output, the frequency
of which is controlled by a voltage on its signal input. If we were to connect a d.c.
voltage to the input then we would observe a simple sine wave on the output. The
frequencyofthesinewavecouldbechangedbychangingthed.c.voltage.Inananalogueradiobroadcastsystemhoweverwewishtotransmitatime-varyingsignaland
this, rather thanad.c. voltage, isused as inputtothe VCO.
Consider the case of a sinusoidal modulating signal, sin(2πf m
t), that we wish
totransmitsuchasthatshowninFigure19.20.Here f m
isthefrequencyofthismodulating
signal. Another sinusoidal signal, termed the carrier, is used to carry the
modulatedsignal.Thefrequencyofthecarrierisdenoted f c
andisthefrequencyyou
wouldtunetoonaradiodialtoreceivetheradiostation.Letussuppose,forexample,
that the carrier isAcos(2πf c
t) as illustratedinFigure 19.21.
VCO input voltage
1
–1
Figure19.20
A sinusoidalmodulating signal: the signal we wishto transmit.
Time, t
Carrier signal
A
–A
Time, t
Figure19.21
A sinusoidalcarrier signal.
Itis usefultodefine a term, β, which isknown as themodulation index:
β = f
f m
where f isthemaximumchangeinthefrequencyoftheoutputfromtheVCOwhen
the modulating signal is applied. In other words the largest instantaneous frequency
of the carrier is f c
+ f and the smallest is f c
− f. It may seem intuitive that the
frequencieswithintheoutputsignalwouldbebetween f c
+f and f c
−f,butthe
situation is actually more complex. The changing signal produces a different set of
output frequencies. Consider the following analysis.
We can describe this modulation process mathematically as
v(t) =Acos [ 2πf c
t + βsin(2πf m
t) ]
➔
600 Chapter 19 Ordinary differential equations I
and thisoutput waveform isshown inFigure 19.22.
VCO output y(t)
A
–A
Time, t
Figure19.22
Theoutput signal.
We may rewrite thisequation using phasor notation as:
v(t) =Re
[Ae j[2πf c t+βsin(2πf t)]] m
= Re [ Ae j2πf c t e jβsin(2πf m t)]
where Re means the real part of what follows. Notice that the second exponential
∞∑
term is in the same form as the Jacobi-Anger identity, e jxsinθ = J n
(x)e jnθ , and
n=−∞
can be expanded outinterms ofBessel functions of the first kind.Thus,
[ ∞
]
∑
v(t) =Re Ae j2πf c t J n
(β)e jn2πf m t
n=−∞
This can be rearranged by combining the exponentials:
[
]
∞∑
v(t) =Re A J n
(β)e jn2πf m t e j2πf c t
v(t) =Re
[
A
n=−∞
∞∑
n=−∞
J n
(β)e j2π(f c +nf m )t ]
Convertingbackfromthephasornotation,andforsimplicitytakingA = 1,weobtain:
v(t) =J 0
(β)cos(2πf c
t)
+J −1
(β)cos[2π(f c
− f m
)t] +J 1
(β)cos[2π(f c
+ f m
)t]
+J −2
(β)cos[2π(f c
−2f m
)t] +J 2
(β)cos[2π(f c
+2f m
)t]
+J −3
(β)cos[2π(f c
−3f m
)t] +J 3
(β)cos[2π(f c
+3f m
)t]
+ ...
We sawinSection 19.8.3 thatforinteger values ofn,J −n
(x) = (−1) n J n
(x), hence,
v(t) =J 0
(β)cos(2πf c
t)
−J 1
(β)cos[2π(f c
− f m
)t] +J 1
(β)cos[2π(f c
+ f m
)t]
+J 2
(β)cos[2π(f c
−2f m
)t] +J 2
(β)cos[2π(f c
+2f m
)t]
−J 3
(β)cos[2π(f c
−3f m
)t] +J 3
(β)cos[2π(f c
+3f m
)t]
+ ...
WecanseethattheoutputfromtheVCOisactuallymadeupfromasetofsinusoidal
waveformswithfrequenciesf c
−nf m
,...,f c
−3f m
,f c
−2f m
,f c
−f m
,f c
,f c
+f m
,f c
+
2f m
, f c
+ 3f m
,..., f c
+nf m
. This is known as the frequency content of the signal
andwillbediscussedinmoredetailinChapters23&24.Theequationsuggeststhat
a broadcast station with a carrier at f c
would have an infinite number of other fre-
Review exercises 19 601
quencies, known as side bands associated with it. The amplitudes of the side bands
are given by the values of the Bessel functions. When these are evaluated, the values
ofJ n
(β) generally diminish with increasingnhence only the sidebands near to
the carrier frequency have significant amplitude. Therefore frequencies that are a
long way away from f c
can be safely ignored when designing the communications
system.
TabulatedvaluesofBesselfunctionsareavailablesimilartothoseshowninTable
19.3. It can be seen how the signal amplitudes generally decrease with increasing
order of the Bessel function. A larger value of β also implies a larger number of
side bands are significant and need to be considered. The use of Bessel functions
whendesigningradiocommunicationssystemsisveryimportantbecauseeachradio
channel has an allocated bandwidth outside of which it should not stray. To do so
would create the possibility of different radio channels interfering with each other.
Thismathematicalmethodallowstheamplitudesofthesidebandstobecalculatedto
ensuretheyaresmallenoughsoastonotinterferesignificantlywithotherchannels.
Table19.3
Table ofBesselfunctionsJ n (β)ofinteger order,n,fordifferent values of
modulation index, β.
β n=0 n=1 n=2 n=3 n=4 n=5 n=6 n=7
0.00 1.00 – – – – – – –
0.25 0.98 0.12 – – – – – –
0.50 0.94 0.24 0.03 – – – – –
0.75 0.86 0.35 0.07 – – – – –
1.00 0.77 0.44 0.11 0.02 – – – –
1.25 0.65 0.51 0.17 0.04 – – – –
1.50 0.51 0.56 0.23 0.06 0.01 – – –
1.75 0.37 0.58 0.29 0.09 0.02 – – –
2.00 0.22 0.58 0.35 0.13 0.03 – – –
2.25 0.08 0.55 0.40 0.17 0.05 0.01 – –
2.50 -0.05 0.50 0.45 0.22 0.07 0.02 – –
2.75 -0.16 0.43 0.47 0.26 0.10 0.03 – –
3.00 -0.26 0.34 0.49 0.31 0.13 0.04 0.01 –
3.25 -0.33 0.24 0.48 0.35 0.17 0.06 0.02 –
3.50 -0.38 0.14 0.46 0.39 0.20 0.08 0.03 –
3.75 -0.40 0.03 0.42 0.41 0.24 0.10 0.04 0.01
4.00 -0.40 -0.07 0.36 0.43 0.28 0.13 0.05 0.02
REVIEWEXERCISES19
1 Find the general solution ofthe followingequations:
(a)
(c)
dx
dt =2x (b) (1+t)dx
dt = 3
dy
dx =y2 cosx
3
2 Solve dy = 2,subjecttoy(0) = 3.
dx
Find the general solution oftẋ+x=2t.
4 Solve dx
dt +2x=e2t cost
(a)byusingan integrating factor
(b)byfinding itscomplementary function anda
particular integral.
602 Chapter 19 Ordinary differential equations I
5 Find the general solution ofy ′′ +16y =x 2 .
6 Find the particular solution of
y ′′ +3y ′ −4y =e x ,y(0) =2,y ′ (0) =0.
7 Aparticle moves in astraightline such that its
displacement from the origin Oisx,wherexsatisfies
the differentialequation
d 2 x
dt 2 +16x=0
(a) Findthe general solution ofthisequation.
( ) ( )
π
π
(b) Ifx = −12, andẋ = 20,findthe
4 4
displacement ofthe particlewhent = π 2 .
8 Usean integrating factorto solvethe differential
equation
dx
+xcott =cos3t
dt
Solutions
1 (a) x=Ae 2t
(b) x=3ln|1+t|+c
1
(c) y=
A−sinx
2 y=2x+3
3 x=t+ c t
4 x = e2t (4cost +sint)
17
+ce −2t
5 y=Acos4x+Bsin4x+ x2
16 − 1
128
6 y = 39ex +11e −4x
25
+ xex
5
7 (a) Asin4t +Bcos4t (b) 12
cos2t − 1
8 sint;x(t) =
2 cos4t+c
4sint
20 Ordinarydifferential
equationsII
Contents 20.1 Introduction 603
20.2 Analoguesimulation 603
20.3 Higherorderequations 606
20.4 State-spacemodels 609
20.5 Numericalmethods 615
20.6 Euler’smethod 616
20.7 ImprovedEulermethod 620
20.8 Runge--Kuttamethodoforder4 623
Reviewexercises20 626
20.1 INTRODUCTION
Engineerssometimesfinditconvenienttorepresentadifferentialequationusinganelectronic
circuit. This is known as analogue simulation. Analogue controllers are also frequentlyusedincontrolsystemsandthesearebasedonsimilarcircuits.Complicatedengineering
systems need to be modelled using a set of differential equations. This leads
tothetopicofstate-spacemodelling.Suchmodelsfindwidespreaduseintheaerospace
industries as well as in other areas that have complex systems. Numerical techniques
are very important as they allow differential equations to be solved using computers.
Often this is the only way to obtain a solution of a differential equation, owing to its
complexity. State-space models arealmost always solved using computers.
20.2 ANALOGUESIMULATION
Itispossibletosolvedifferentialequationsusingelectroniccircuitsbasedonoperational
amplifiers. The advantage of this approach is the ease with which the coefficients of
the differential equation can be adjusted and the effect on the solution observed. The
604 Chapter 20 Ordinary differential equations II
y ic
y 1
10
y 1
1
y 2
1
y o
y 2
1
y o
y 3
1
y 3
10
Figure20.1
An integrator.
Figure20.2
A summer.
technique is known as analogue simulation. Prior to the mass availability of powerful
digital computers this was a common approach in engineering design because it providedtheengineerwithanautomatedmethodtosolvearangeofmathematicalmodels.
Special-purposecomputers,knownasanaloguecomputers,werehistoricallyusedwhich
had the electronic circuits already incorporated, thus making iteasier tosimulate a particular
differential equation.
Three basic types of circuit are required to enable ordinary differential equations
with constant coefficients to be simulated. The first type is an integrator which has already
been discussed in Section 13.2. The usual symbol for such a circuit is shown in
Figure 20.1.
If an analogue computer is used it is common for there to be a gain of only 1 or 10
ontheinputvoltage.Againabove10isbetterachievedbylinkingtogethertwocircuits
in series. In addition there is usually a facility to allow initial conditions to be set. The
equation forthe circuitof Figure 20.1 is
v o
=−
∫ t
0
(10v 1
+ v 2
+ v 3
)dt − v ic
(20.1)
where v ic
indicatestheinitialoutputvoltage.Notethatthegainisusuallywrittenonthe
input line.
The second type of circuit is the summer. It has the symbol shown in Figure 20.2.
The equation forthe circuitof Figure 20.2 is
v o
= −(v 1
+ v 2
+10v 3
) (20.2)
Finally, a circuit is required to allow gains to be varied. This is simply a potentiometer
andisillustratedtogetherwithitssymbolinFigure20.3.Theequationforthecircuitof
Figure 20.3 issimply
v o
=kv i
0k1 (20.3)
The potentiometer only allows the gain to be varied between 0 and 1. If a variable gain
greater than 1 is required, the potentiometer must be placed in series with a circuit that
increases gain, for example a one-input summer withagain of 10.
Thesecircuitscanbecombinedtoobtainthesolutionsofdifferentialequations.This
isbestillustratedbymeansofanexample.Considerthegeneralformofasecond-order
differential equation with constant coefficients:
a d2 y
dt +bdy +cy=f(t) (20.4)
2 dt
The general approach to obtaining the circuit for a particular differential equation is to
assume a certain point in the circuit corresponds to a particular term and then arrange
20.2 Analogue simulation 605
y i
y i y o
k
y o
d
a —
2 y
dt 2
1—
a
d 2 y
—
dt 2
dy
– —
1 dt 1
y
Figure20.3
Apotentiometer.
Figure20.4
Firststage in synthesizing Equation(20.4).
toconnectthatpointtothecorrectlysynthesizedvalue.Itismorestraightforwardifthe
assumed point corresponds to the highest derivative. Rearranging Equation (20.4) we
find
a d2 y
dt =f(t)−bdy −cy (20.5)
2 dt
Assuminga d2 y
dt isalreadyavailable,apotentiometercanbeusedtoobtain d2 y
2 dt 2.Integratorscanthenbeusedtoobtainthevariables
dy
dt andy.ThisisshowninFigure20.4.The
nextstageistoobtaintheexpressioncorrespondingtother.h.s.ofEquation(20.5).This
requiresasummertoaddtogethertheindividualtermsandpotentiometerstoallowindividualcoefficientstobeobtained.Aninvertorisalsorequiredtoobtainthecorrectsign
forthevariable dy
dt .Thissimplyconsistsofasummerwithoneinputandagainof1.The
final circuit is shown in Figure 20.5. The three potentiometers allow different values of
a,bandctobeobtained.Ifgainsgreaterthan1wererequiredthenextrasummerswould
be needed, or alternatively the input gains of the existing summers could be adjusted.
The input to the circuit, f (t), can be simulated by applying a signal at the appropriate
point in the circuit. The output of the circuit, y(t), corresponds to the solution of the
differential equation. It may be necessary to include initial conditions corresponding to
y(0)and dy
dt (0).
Although analogue computers remain an interesting historical development in the
simulation of control systems it is now much more common to use digital computers.
Thesystemtobesimulatedcanberepresentedeitherdirectlyintheformofmathematical
equations or by block diagrams similar to that shown in Figure 20.5. The response to
various inputs can be readily obtained.
f(t)
1
–f(t)
1
1
1
a
b dy — dt
d 2 y
—
dt 2
1
1—
a
–b dy — dt
d 2 y
—
dt 2
– dy —
1 dt 1 y
b
cy
c
Figure20.5
Thecomplete circuit to synthesize Equation(20.4).
606 Chapter 20 Ordinary differential equations II
20.3 HIGHERORDEREQUATIONS
Inthissectionweshallconsidersecond-andhigherorderequationsandshowhowthey
can be represented as a set of simultaneous first-order equations. The main reason for
doingthisisthatwhenacomputersolutionisrequireditisusefultoexpressanequation
in this form. Details of the analytical solution of such systems are not considered here
although one technique isdiscussed inSection 21.10.
It is possible to express a second-order differential equation as two first-order equations.Thus
ifwehave
d 2 ( )
y dy
dx = f 2 dx ,y,x (20.6)
we can introduce the new dependent variablesy 1
andy 2
such thaty 1
= y andy 2
= dy
dx .
Equation (20.6) thenbecomes
dy 1
dx =y 2
dy 2
dx = f(y 2 ,y 1
,x) (20.7)
These first-order simultaneous differential equations are often referred to as coupled
equations.
Example20.1 Expressthe equation
d 2 y
dx 2 −7dy dx +3y=0
as a set of first-order equations.
Solution Lettingy 1
= y, andy 2
= dy
dx , we find dy 2
dx = d2 y
dx2. Therefore the differential equation
becomes
dy 1
dx =y 2
dy 2
dx −7y 2 +3y 1 =0
We note thatthese equations can alsobewritten as
dy 1
dx =y 2
dy 2
dx = −3y 1 +7y 2
or, inmatrix form,
( ) ( ) ( ) y
′
1
0 1 y1
=
−3 7 y 2
y ′ 2
where ′ denotes d dx .
20.3 Higher order equations 607
Higher order differential equations can be reduced to a set of first-order equations in a
similarway.
Example20.2 Express the equation
d 3 x
dt 3 −7d2 x
dt 2 +3dx dt +2x=0
as a set of first-order equations.
Solution Lettingx 1
=x,x 2
= dx
dt andx 3 = d2 x
dt 2 wefind
dx 1
dt
dx 2
dt
=x 2
=x 3
dx 3
dt
−7x 3
+3x 2
+2x 1
=0
isthe set of first-order equations representing the given differential equation.
Example20.3 (a) Expressthe coupledfirst-order equations
dy 1
dx =y 1 +y 2
dy 2
dx = 4y 1 −2y 2
as a second-order ordinary differential equation, and obtain its general solution.
(b) Express the given equations inthe form
( ) ) y
′
1
y1
=A(
y 2
y ′ 2
whereAisa2 ×2 matrix.
Solution (a) Differentiating the second of the given equations we have
d 2 y 2
dx = 4dy 1
2 dx −2dy 2
dx
and then, using the first, we find
d 2 y 2
dx =4(y 2 1 +y 2 )−2dy 2
dx
But fromthe second given equation 4y 1
= dy 2
dx +2y 2
, and therefore
d 2 y 2
dx = dy 2
2 dx +2y 2 +4y 2 −2dy 2
dx
608 Chapter 20 Ordinary differential equations II
thatis,
d 2 y 2
dx 2 + dy 2
dx −6y 2 =0
Writingyfory 2
wefind
d 2 y
dx 2 + dy
dx −6y=0
To solve thiswe lety = e kx toobtain
k 2 +k−6=0
(k−2)(k+3)=0
Therefore,
k=2,−3
The general solution is theny = y 2
= Ae 2x +Be −3x . It is straightforward to show
thaty 1
satisfiesthesamesecond-orderdifferentialequationandhencehasthesame
general solution, butwith different arbitraryconstants.
(b) The first-order equations can be written as
( ) ( ) ( ) y
′
1
1 1 y1
y ′ =
2
4 −2 y 2
( ) 1 1
ThereforeA = .
4 −2
EXERCISES20.3
1 Expressthe followingequations asasetoffirst-order
equations:
(a) d2 y
dx 2 +2dy dx +3y=0
(b) d2 y
dx 2 +8dy dx +9y=0
(c) d2 x
dt 2 +4dx dt +6x=0
(d) d2 y
dt 2 +6dy dt +7y=0
(e) d3 y
dx 3 +6d2 y
dx 2 +2dy dx +y=0
(f) d3 x
dt 3 +2d2 x
dt 2 +4dx dt +2x=0
2 Expressthe followingcoupled first-orderequations as
a single second-order differentialequation:
(a)
dy 1
dx =y 1 +y 2 ,dy 2
dx = 2y 1 −2y 2
(b) dy 1
dx =2y 1 −y 2 ,dy 2
dx =4y 1 +y 2
(c)
dx 1
dt
(d) dy 1
dt
3 Express
=3x 1 +x 2 , dx 2
dt
=2y 1 +4y 2 , dy 2
dt
dy 1
dt
= 2x 1 −3x 2
= 2y 1 +6y 2 and
= 6y 1 −7y 2
dy 2
= −2y
dt 1 −5y 2
asasingle second-order equation. Solve thisequation
and hence findy 1 andy 2 .Expressthe equations in the
formy ′ =Ay.
20.4 State-space models 609
Solutions
1 (a)
(b)
(c)
(d)
(e)
(f)
dy 1
dx =y dy 2
2
dx = −3y 1 −2y d 2 y
2 2 (a)
dx 2 + dy
dx −4y=0
dy 1
dx =y dy 2
2
dx = −9y 1 −8y 2
d 2 y
(b)
dx 2 −3dy dx +6y=0
dx 1 dx
=x 2
dt 2 = −6x
dt 1 −4x 2
d 2 x
(c)
dy 1 dy dt 2 −11x=0
=y 2
dt 2 = −7y
dt 1 −6y 2
d 2 y
(d)
dy 1
dx =y dy 2
2
dx =y dt 2 +5dy dt −38y=0
3
3 y ′′ +3y ′ +2y=0
dy 3
dx = −y 1 −2y 2 −6y 3
y 1 (t) =Ae −2t +Be −t
dx 1 dx
=x 2
y
dt 2 =x 2 (t) = − 2
dt 3
3 Ae−2t − 1 2 Be−t
( ) ( )
y ′ (y1 ) 2 6
dx 1
3
= −2x
dt 1 −4x 2 −2x 3 y 2
′ =
−2 −5 y 2
20.4 STATE-SPACEMODELS
Thereareseveralwaystomodellineartime-invariantsystemsmathematically.Oneway,
which we have already examined, is to use linear differential equations with constant
coefficients. A second method is to use transfer functions which will be discussed in
Chapter 21. A third type of model is the state-space model. The state-space technique
is particularly useful for modelling complex engineering systems in which there are
several inputs and outputs. It also has a convenient form for solution by means of a
digital computer.
Thebasisofthestate-spacetechniqueistherepresentationofasystembymeansofa
setoffirst-ordercoupleddifferentialequations,knownasstateequations.Thenumber
of first-order differential equations required to model a system defines the order of the
system. For example, if three differential equations are required then the system is a
third-ordersystem.Associatedwiththefirst-orderdifferentialequationsareasetofstate
variables, the samenumber astherearedifferential equations.
The concept of a state variable lies at the heart of the state-space technique. A system
is defined by means of its state variables. Provided the initial values of these state
variables are known, it is possible to predict the behaviour of the system with time by
means of the first-order differential equations. One complication is that the choice of
statevariablestocharacterizeasystemisnotunique.Manydifferentchoicesofasetof
state variables for a particular system are often possible. However, a system of ordern
only requires n state variables to specify it. Introducing more state variables than this
only introducesredundancy.
Thechoiceofstatevariablesforasystemis,tosomeextent,dependentonexperience
buttherearecertainguidelinesthatcanbefollowedtoobtainavalidchoiceofvariables.
One thing that is particularly important is that the state variables are independent of
610 Chapter 20 Ordinary differential equations II
y C
C
R
i
~
L
Figure20.6
Thecircuitcould be modelled using v C andiasthe state
variables.
each other. For example, for the electrical system of Figure 20.6, the choice of v C
and
3v C
wouldleadtothetwostatevariablesbeingdependentoneachother.Avalidchoice
wouldbe v C
andiwhicharenotdirectlydependentoneachother.Thisisasecond-order
systemand so only requires two statevariables tomodel itsbehaviour.
Many engineering systems may have a high order and so require several differential
equations to model their behaviour. For this reason, a standard way of laying out these
equations has evolved to reduce the chance of making errors. There is also an added
advantage in that the standard layout makes it easier to present the equations to a digital
computer for solution. Before introducing the standard form, an example will be
presented toillustratethe statevariable method.
Engineeringapplication20.1
State-spacemodelforaspring-mass-dampersystem
ConsiderthemechanicalsystemillustratedinFigure20.7.Amassrestsonafrictionlesssurfaceandisconnectedtoafixedwallbymeansofanidealspringandanideal
damper. Aforce, f (t), isapplied tothe mass and the position ofthe mass is w.
K
B
M
w
f(t)
Figure20.7
Asecond-order mechanicalsystem.
By considering the forces acting on the mass,M, it is possible to devise a differential
equation that models the behaviour of the system. The force produced by the
spring isKw (K is the spring stiffness) and opposes the forward motion. The force
produced by the damper isB dw , where B is the damping coefficient, and this also
dt
opposestheforwardmotion.Sincethemassisconstant,Newton’ssecondlawofmotion
states that the net force on the mass is equal to the product of the mass and its
acceleration. Thus wefind
f(t)−B dw −Kw=M d2 w
(20.8)
dt dt 2
Note that this is a second-order differential equation and so the system is a secondorder
system.
Inordertoobtainastate-spacemodelthestatevariableshavetobechosen.There
areseveralpossiblechoices.Anobviousoneistheposition, w,ofthemass.Asecond
20.4 State-space models 611
statevariableisrequiredasthesystemissecondorder.Inthiscasethevelocityofthe
masswillbechosen.Thevelocityofthemassisnotdirectlydependentonitsposition
and so the two variables are independent. Another possible choice would have been
dw
−w.Thismayseemaclumsychoicebutforcertainproblemssuchchoicesmay
dt
lead tosimplifications inthe statevariable equations.
Itiscustomarytousethesymbolsx 1
,x 2
,x 3
,...torepresentvariablesforreasons
that will become clear shortly. So,
x 1
=w
(20.9a)
x 2
= dw
(20.9b)
dt
Because of the particular choice of state variables, it is easy to obtain the first of
thefirst-orderdifferentialequations--thusillustratingtheneedforexperiencewhen
choosing statevariables.
Differentiating Equation (20.9a) gives
dx 1
= dw =x
dt dt 2
This isthe first stateequation although itisusually written as
ẋ 1
=x 2
where ẋ 1
denotes dx 1
. The second of the first-order equations is obtained by rearranging
Equation
dt
(20.8):
d 2 w
= − K dt 2 M w − B dw
+ 1 f (t) (20.10)
M dt M
However, differentiating Equation (20.9b) we get
d 2 w
dt 2 =ẋ 2
Then, usingEquation (20.10) weobtain
ẋ 2
=− K M x 1 − B M x 2 + 1 M f(t)
Finally, itisusual toarrange the stateequations inaparticular way:
ẋ 1
= + x 2
ẋ 2
=− K M x 1 − B M x 2 + 1 M f(t)
w=x 1
Note that it is conventional to relate the output of the system to the state variables.
Assume that for this system the required output variable is the position of the mass,
that is w.
Itisstraightforward torewrite these equations inmatrix form:
) ( )( ) ( )
(ẋ1 0 1 x1 0
=
+ f(t)
ẋ2 −K/M −B/M x 2
1/M
w = ( 1 0 ) ( )
x 1
x 2
612 Chapter 20 Ordinary differential equations II
More generally, the standard form ofthe stateequations for a linear system isgiven by
ẋ(t) =Ax(t) +Bu(t)
y(t) =Cx(t) +Du(t)
For a systemwithnstatevariables,rinputs and poutputs:
x(t) is ann-component column vector representing the states of thenth-order system.
It is usuallycalled thestatevector.
u(t) is an r-component column vector composed of the input functions to the
system. It is usuallycalled theinputvector.
y(t) is a p-component column vector composed of the defined outputs of the
system. It is referred toas theoutputvector.
A = (n ×n)matrix, knownas the statematrix.
B = (n ×r)matrix, knownas the inputmatrix.
C = (p ×n)matrix, knownas the outputmatrix.
D = (p ×r)matrix, knownas the directtransmission matrix.
A,B,C andDhave constant elements if the system is time invariant. When presented
in this form the equations appear to be extremely complicated. In fact, the problem is
only one of notation. The general nature of the notation allows any linear system to be
specified but in many cases the matrices are zero or have simple coefficient values. For
example,thematrixDisoftenzeroforasystemasitisunusualtohave directcoupling
betweentheinputandtheoutputofasystem.Inthisformatitwouldbestraightforward
topresent the equations toadigital computer for solution.
Engineeringapplication20.2
Anarmature-controlledd.c.motor
Deriveastate-spacemodelforanarmature-controlledd.c.motorconnectedtoamechanicalloadwithcombinedmomentofinertiaJ,andviscousfrictioncoefficientB.
The arrangement isshown inFigure 20.8.
v a
= applied armature voltage
i a
= armature current
R a
= armature resistance
L a
= armature inductance
e b
= back e.m.f.of the motor
ω m
= angular speed of the motor
T = torque generated by the motor.
Solution
Letusassumethatthesysteminputisthearmaturevoltage, v a
,andthesystemoutput
istheangularspeedofthemotor, ω m
.Thisisasecond-ordersystemandsotwostate
variablesarerequired.Wewillchoosethearmaturecurrentandtheangularspeedof
the motor. So,
x 1
=i a
x 2
=ω m
20.4 State-space models 613
R a L a
T
i a
Fixed field
y a
e b
v m
J
Bv m
Figure20.8
An armaturecontrolled
d.c. motor.
The next stage is to obtain a mathematical model for the system. Using Kirchhoff’s
voltage law and the component laws for the resistor and inductor we obtain, for the
armature circuit,
v a
=i a
R a
+L a
di a
dt
+e b
Now for a d.c. motor the back e.m.f. is proportional to the speed of the motor and is
given by e b
=K e
ω m
,K e
constant. So,
di
v a
=i a
R a
+L a
a
+K
dt e
ω m
di a
= − R a
i
dt L a
− K e
ω
a
L m
+ 1 v
a
L a
(20.11)
a
Let us now turn to the mechanical part of the system. If G is the net torque about
theaxisofrotationthentherotationalformofNewton’ssecondlawofmotionstates
G = J dω
dω
, whereJ is the moment of inertia, and is the angular acceleration. In
dt dt
this example, the torques are that generated by the motor,T, and a frictional torque
Bω m
which opposes the motion, so that
T−Bω m
=J dω m
dt
For a d.c. motor, the torque developed by the motor is proportional to the armature
current and isgiven byT =K T
i a
, whereK T
isaconstant. So,
K T
i a
−Bω m
=J dω m
dt
dω m
dt
= K T
J i a − B J ω m
(20.12)
Equations (20.11) and (20.12) are the state equations for the system. They can be
arranged inmatrix form togive
( ) ( )( ) ( )
˙i a −Ra /L
= a
−K e
/L a ia 1/La
+ v
˙ω m
K T
/J −B/J ω m
0 a
( ) ( )
x1 ia
Alternatively the notation x = = andu = v
x 2
ω a
can be used. However,
m
whenthereisnoconfusionitisbettertoretaintheoriginalsymbolsbecauseitmakes
iteasier tosee ataglance what the statevariables are.
➔
614 Chapter 20 Ordinary differential equations II
Finally,anoutputequationisneeded.Inthiscaseitistrivialastheoutputvariable
isthe same as one of the statevariables. So,
ω m
= ( 0 1 )( )
i a
ω m
Engineeringapplication20.3
State-spacemodelforacoupled-tankssystem
Derive a state-space model for the coupled tank system shown in Figure 20.9. The
tankshavecross-sectionalareasA 1
andA 2
,valveresistancesR 1
andR 2
,fluidheights
h 1
and h 2
, input flows q 1
and q 2
. Additionally, there is a flow, q o
, out of tank 2.
Assumethatthevalvescanbemodelledaslinearelementsandletthedensityofthe
fluid inthe tanks be ρ. Letq i
be the intermediate flowbetween the two tanks.
q 1
q 2
h 1
q i
qo
h 2
Tank 1, R 1 Tank 2, R 2 Figure20.9
area A 1 area A 2 Coupled tanks.
Solution
A convenient choice of state variables is the height of the fluid in each of the tanks,
although a perfectly acceptable choice would be the volume of fluid in each of the
tanks.For tank 1,conservation of mass gives
q 1
−q i
=A 1
dh 1
dt
Fortheresistanceelement,R 1
,thepressuredifferenceacrossthevalveisequaltothe
productoftheflowthroughthevalveandthevalveresistance.Thiscanbethoughtof
asafluidequivalentofOhm’slaw.Notethatatmosphericpressurehasbeenignored
as itisthe same on both sides of the valve. So,
ρgh 1
−ρgh 2
=q i
R 1
q i
= ρg (h
R 1
−h 2
) (20.13)
1
Combining these two equations gives
dh 1
= − ρg h
dt R 1
A 1
+ ρg h
1
R 1
A 2
+ 1 q
1
A 1
(20.14)
1
For tank 2,
q i
+q 2
−q o
=A 2
dh 2
dt
ρgh 2
=R 2
q o
(20.15)
20.5 Numerical methods 615
Combining these equations and usingEquation (20.13) toeliminateq i
andq o
gives
dh 2
= ρg ( ρg
h
dt R 1
A 1
− + ρg )
h
2
R 1
A 2
R 2
A 2
+ 1 q
2
A 2
(20.16)
2
Equations (20.14) and (20.16) are the state-space equations for the system and can
be written inmatrix formas
⎛
−ρg ρg
⎞ ⎛ ⎞
1
)
(ḣ1
R 1
A 1
R 1
A 1
( )
0
h1
A ( )
1 = ⎜
ḣ 2 ⎝ ρg
− ρg − ρg
⎟ +
q1
⎠ h ⎜ ⎟
2 ⎝ 1 ⎠
q 2
0
R 1
A 2
R 1
A 2
R 2
A 2 A 2
Note that in this case the input vector is two-dimensional as there are two inputs to
the system.The output equation isgiven by
q o
= ( ) ( )
h
0 ρg/R 1
2
h 2
and isobtained directly from Equation (20.15).
20.5 NUMERICALMETHODS
All the techniques we have so far met for solving differential equations are known as
analytical methods, and these methods give rise to a solution in terms of elementary
functionssuchassinx,e x ,x 3 ,etc.Inpractice,mostengineeringproblemsinvolvingdifferential
equations are too complicated to be solved easily using analytical techniques.
It is therefore frequently necessary to make use of computers. One such approach is to
use analogue simulation, which we discussed in Section 20.2. Another, more common,
approachistouse digital computers.
There are a variety of computer software packages available for solving differential
equations,includingMATLAB ® whichhasalreadybeenintroduced.Oftenthedetailof
howthecomputersolvestheequationishiddenfromtheuseranditusesoneofarange
of numericalmethods.Wewillexaminesomeofthemethodsthatmightbeappliedby
the computer inthissection.
Whenusingnumericalmethodsitisimportanttonotethattheyresultinapproximate
solutions to differential equations and solutions are only calculated at discrete intervals
oftheindependentvariable,typicallyxort.Weshallbeginbyexaminingthefirst-order
differential equation
dy
dx =f(x,y)
subject to the initial condition y(x 0
) = y 0
. Usually the solution is obtained at equally
spaced values of x, and we call this spacing the step size, denoted by h. By choosing
a suitable value ofhwe can control the accuracy of the approximate solution obtained.
Weshallwritey n
forthisapproximatesolutionatx =x n
,whereaswewritey(x n
)forthe
true solution atx = x n
. Generally these values will not be the same although we try to
ensure the difference issmallfor obvious reasons.
The simplest numerical method forthe solution of the differential equation
dy
dx =f(x,y)
616 Chapter 20 Ordinary differential equations II
with initial condition
y(x 0
) =y 0
isEuler’s method which weshall study inthe next section.
20.6 EULER’SMETHOD
You will recall, from Chapter 12, that given a functiony(x), the quantity dy
dx represents
thegradientofthatfunction.Soif dy
dx = f (x,y)andweseeky(x),weseethatthedifferential
equation tells us the gradient of the required function. Given the initial condition
y =y 0
whenx =x 0
wecanpicturethissinglepointasshowninFigure20.10.Moreover,
we know the gradient of the solution here. Because dy
dx = f(x,y)weseethat
dy
dx∣ =f(x 0
,y 0
)
x=x0
which equals the gradient of the solution at x = x 0
. Thus the exact solution passes
through (x 0
,y 0
)andhasgradient f (x 0
,y 0
)there.Wecandrawastraightlinethroughthis
point with the required gradient to approximate the solution as shown in Figure 20.10.
Thisstraightlineapproximatesthetruesolution,butonlynear (x 0
,y 0
)because,ingeneral,
the gradient is not constant but changes. So, in practice we only extend it a short
distance,h,alongthexaxistowherex =x 1
.Theycoordinateatthispointisthentaken
asy 1
.Wenowdevelopanexpressionfory 1
.Thestraightlinehasgradient f (x 0
,y 0
)and
passes through (x 0
,y 0
). Itcan beshown thatits equation istherefore
y=y 0
+(x−x 0
)f(x 0
,y 0
)
Whenx =x 1
theycoordinate isthen given by
y 1
=y 0
+(x 1
−x 0
)f(x 0
,y 0
)
and sincex 1
−x 0
=hwe find
y 1
=y 0
+hf(x 0
,y 0
)
Thisequationcanbeusedtofindy 1
.Wethenregard (x 1
,y 1
)asknown.Fromthisknown
point the whole process isthen repeated using the formula
y i+1
=y i
+hf(x i
,y i
)
y
y(x 1 )
y 1
True solution
Tangent line
approximation
y 0
x 0 x 1 x
0
Figure20.10
Approximation usedin Euler’smethod.
20.6 Euler’s method 617
and we can therefore generate a whole sequence of approximate values ofy. Naturally,
theaccuracyofthesolutionwilldependuponthestepsize,h.Infact,forEuler’smethod,
theerrorincurredisroughlyproportionaltoh,sothatbyhalvingthestepsizeweroughly
halve the error.
An alternative way of deriving Euler’s method is to use a Taylor series expansion.
Recall from Chapter 18 that
y(x 0
+h) =y(x 0
) +hy ′ (x 0
) + h2
2! y′′ (x 0
)+···
If wetruncate afterthe second termwe find
thatis,
y(x 0
+h) ≈y(x 0
) +hy ′ (x 0
)
y 1
=y 0
+hy ′ (x 0
)=y 0
+hf(x 0
,y 0
)
sothatEuler’smethodisequivalenttotheTaylorseriestruncatedafterthesecondterm.
Example20.4 UseEuler’s methodwithh = 0.25 toobtain a numericalsolution of
dy
dx = −xy2
subject toy(0) = 2, giving approximate values ofyfor 0 x 1. Work throughout
tothreedecimal places and determine the exact solution for comparison.
Solution We need to calculatey 1
,y 2
,y 3
andy 4
. The correspondingxvalues arex 1
= 0.25,x 2
=
0.5,x 3
= 0.75 andx 4
= 1.0.Euler’s method becomes
We find
y i+1
=y i
+0.25(−x i
y 2 i ) with x 0 =0 y 0 =2
y 1
= 2 −0.25(0)(2 2 ) = 2.000
y 2
= 2 −0.25(0.25)(2 2 ) = 1.750
y 3
= 1.750 −0.25(0.5)(1.750 2 ) = 1.367
y 4
= 1.367 −0.25(0.75)(1.367 2 ) = 1.017
The exact solution can be found by separating the variables:
∫ ∫ dy
y = − xdx
2
so that
− 1 y = −x2 2 +C
Imposingy(0) = 2 givesC = − 1 so that
2
Finally,
− 1 y = −x2 2 − 1 2
y = 2
x 2 +1
618 Chapter 20 Ordinary differential equations II
Table20.1
ComparisonofnumericalsolutionbyEuler’s
method with exact solution.
i x i y i y(x i )
numerical exact
0 0.000 2.000 2.000
1 0.250 2.000 1.882
2 0.500 1.750 1.600
3 0.750 1.367 1.280
4 1.000 1.017 1.000
Table20.2
The solution to Example 20.5
byEuler’smethod.
i x i y i
0 1.00 1.00
1 1.20 1.20
2 1.40 1.40
3 1.60 1.60
4 1.80 1.80
5 2.00 2.00
Table 20.1 summarizes the numerical and exact solutions. In this example only one
correct significant figure is obtained. In practice, for most equations a very small step
size is necessary which means that computation is extremely time consuming. An improvement
to Euler’s method, which usually yields more accurate solutions, is given in
Section 20.7.
Example20.5 Obtain a solution forvalues ofxbetween 1 and 2 of
dy
dx = y x
subject toy = 1whenx = 1usingEuler’smethod.Useastepsizeofh = 0.2,working
throughouttotwodecimalplacesofaccuracy.Compareyouranswerwiththeanalytical
solution. Comment upon the approximate and exact solutions.
Solution Here we have f (x,y) = y x , y 0 = 1, x 0
= 1 and h = 0.20. With a step size of 0.2,
x 1
= 1.2,x 2
= 1.4,...,x 5
= 2.0. We need to calculatey 1
,y 2
,...,y 5
. Euler’s method,
y i+1
=y i
+hf (x i
,y i
), reduces to
( )
yi
y i+1
=y i
+0.2
x i
Therefore,
Similarly,
y 1
=y 0
+0.2
y 2
=y 1
+0.2
(
y0
(
y1
x 0
)
( 1
=1+0.2 = 1.20
1)
) ( ) 1.20
= 1.20 +0.2 = 1.40
x 1
1.20
ContinuinginasimilarfashionweobtaintheresultsshowninTable20.2.Toobtainthe
analytical solution we separate the variables togive
∫ ∫ dy dx
y = x
so that
lny=lnx+lnD=lnDx
Therefore,
y=Dx
20.6 Euler’s method 619
Whenx = 1,y = 1 so thatD = 1, and the analytical solution is thereforey = x. We
see that in this example the numerical solution by Euler’s method produces the exact
solution. This will always be the case when the exact solution is a linear function and
exact arithmeticisemployed.
EXERCISES20.6
1 Using Euler’s method estimatey(3) given
y ′ = x +y
x
y(2) =1
Useh = 0.5 andh = 0.25. Solvethisequation
analytically and compare your numerical solutions
with the true solution.
2 Findy(0.5)ify ′ =x +y,y(0) = 0.Useh = 0.25and
h = 0.1.Find the truesolution forcomparison.
3 Use Euler’s methodto find v(0.01)given
10 −2dv +v =sin100πt v(0) =0
dt
Takeh = 0.005 andh = 0.002. Find the analytical
solution forcomparison.
Solutions
1 Exact:y =xln |x| −0.1931x
x i y i y i y
(h = 0.5) (h = 0.25) (exact)
2.00 1.0000 1.0000 1.0000
2.25 -- 1.3750 1.3901
2.50 1.7500 1.7778 1.8080
2.75 -- 2.2056 2.2509
3.00 2.6000 2.6561 2.7165
3 Exact:
sin100πt − πcos100πt + πe−100t
v =
π 2 +1
t i v i v
(h = 0.005) (exact)
0 0.0000 0.0000
0.005 0.0000 0.2673
0.010 0.5000 0.3954
2 Exact:y=−x−1+e x
x i y i y
(h = 0.25) (exact)
0 0.0000 0.0000
0.25 0.0000 0.0340
0.50 0.0625 0.1487
x i y i y
(h = 0.1) (exact)
t i v i v
(h = 0.002) (exact)
0 0.0000 0.0000
0.002 0.0000 0.0569
0.004 0.1176 0.1919
0.006 0.2843 0.3354
0.008 0.4176 0.4178
0.010 0.4517 0.3954
0 0.0000 0.0000
0.1 0.0000 0.0052
0.2 0.0100 0.0214
0.3 0.0310 0.0499
0.4 0.0641 0.0918
0.5 0.1105 0.1487
620 Chapter 20 Ordinary differential equations II
20.7 IMPROVEDEULERMETHOD
YouwillrecallthatEuler’smethodisobtainedbyfirstfindingtheslopeofthesolutionat
(x 0
,y 0
) and imposing a straight line approximation (often called a tangent line approximation)
there as indicated in Figure 20.10. If we knew the gradient of the solution at
x =x 1
in addition to the gradient atx =x 0
, a better approximation to the gradient over
the whole interval might be the mean of the two. Unfortunately, the gradient atx = x 1
which is
dy
dx∣ =f(x 1
,y 1
)
x=x1
cannot be obtained until we knowy 1
. What we can do, however, is use the value ofy 1
obtained by Euler’s method in estimating the gradient atx = x 1
. This gives rise to the
improved Euler method:
y 1
=y 0
+h×(average of gradients atx 0
andx 1
)
{ } f(x0 ,y
=y 0
+h× 0
)+f(x 1
,y 1
)
2
=y 0
+ h 2 {f(x 0 ,y 0 )+f(x 1 ,y 0 +hf(x 0 ,y 0 ))}
Then, knowingy 1
the whole process isstartedagain tofindy 2
, etc. Generally,
y i+1
=y i
+ h 2 {f(x i ,y i )+f(x i+1 ,y i +hf(x i ,y i ))}
It can be shown that, like Euler’s method, the improved Euler method is equivalent to
truncating the Taylor series expansion, inthis case afterthe third term.
Example20.6 Use the improved Euler method to solve the differential equationy ′ = −xy 2 ,y(0) = 2,
inExample20.4.Asbeforetakeh = 0.25,butworkthroughouttofourdecimalplaces.
Solution Here f(x,y) = −xy 2 ,y 0
= 2,x 0
= 0.Wefind f(x 0
,y 0
) = f(0,2) = −0(2 2 ) = 0.The
improved Euler method
yields
y 1
=y 0
+ h 2 {f(x 0 ,y 0 )+f(x 1 ,y 0 +hf(x 0 ,y 0 ))}
y 1
=2+ 0.25 {0 + f(0.25,2)}
2
Now f (0.25,2) = −0.25(2 2 ) = −1,so that
y 1
= 2 +0.125(−1) = 1.875
WeshallsetoutthecalculationsrequiredinTable20.3.Youshouldworkthroughthenext
fewstagesyourself.Comparingthevaluesobtainedinthiswaywiththeexactsolution,
we see that, over the interval of interest, our solution is usually correct to two decimal
places (Table 20.4).
20.7 Improved Euler method 621
Table20.3
Applying the improved Eulermethod to Example 20.6.
i x i y i f(x i ,y i ) y i +hf(x i ,y i ) f(x i+1 ,y i y i+1
+hf(x i ,y i ))
0 0 2 0 2 −1 1.8750
1 0.25 1.8750 −0.8789 1.6553 −1.3700 1.5939
2 0.5 1.5939 −1.2703 1.2763 −1.2217 1.2824
3 0.75 1.2824 −1.2334 0.9741 −0.9489 1.0096
Table20.4
Comparison of the improved Euler
method with the exact solution.
i x i y i y(x i )
0 0.000 2.0000 2.0000
1 0.2500 1.8750 1.8824
2 0.5000 1.5939 1.6000
3 0.7500 1.2824 1.2800
4 1.0000 1.0096 1.0000
Example20.7 Apply both Euler’s method and the improved Euler method tothe solution of
dy
dx =2x y=1whenx=0
for 0 x 0.5 using h = 0.1. Compare your answers with the analytical solution.
Work throughout tothreedecimal places.
Solution We have dy
dx = 2x,x 0 = 0,y 0
= 1.Therefore, usingEuler’s method we find
y i+1
=y i
+hf(x i
,y i
)
Therefore,
=y i
+0.1(2x i
)
=y i
+0.2x i
y 1
=y 0
+0.2x 0
=1+ (0.2)(0) =1
y 2
=y 1
+0.2x 1
= 1 + (0.2)(0.1) = 1.02
The complete solution appears in Table 20.5. Check the values given in the table for
yourself.Usingthe improved Euler method we have
y i+1
=y i
+ h 2 {f(x i ,y i )+f(x i+1 ,y i +hf(x i ,y i ))}
=y i
+0.05(2x i
+2x i+1
)
=y i
+0.1(x i
+x i+1
)
Table20.5
ThesolutionofExample20.7usingEuler’smethod,
the improved Eulermethod and the exact solution.
x i y i (Euler) y i (improved y(x i )
Euler) (exact)
0 1.000 1.000 1.000
0.1 1.000 1.010 1.010
0.2 1.020 1.040 1.040
0.3 1.060 1.090 1.090
0.4 1.120 1.160 1.160
0.5 1.200 1.250 1.250
622 Chapter 20 Ordinary differential equations II
Therefore,
y 1
=y 0
+0.1(x 0
+x 1
) = 1 +0.1(0 +0.1) = 1.01
y 2
=y 1
+0.1(x 1
+x 2
) = 1.01 +0.1(0.1 +0.2) = 1.04
and so on. The complete solution appears in Table 20.5. Check this for yourself. The
analytical solution is
∫
y = 2xdx=x 2 +c
Applyingtheconditiony(0) = 1givesc = 1,andsoy =x 2 +1.Valuesofthisfunction
arealsoshowninthetableandwenotethemarkedimprovementgivenbytheimproved
Euler method.
EXERCISES20.7
1 Apply the improved Eulermethod to Question 1 in
Exercises20.6.
2 Apply the improved Eulermethod to Question 2 in
Exercises20.6.
3 Apply the improved Eulermethod to Question 3 in
Exercises20.6.
Solutions
1
x i
y i (h = 0.5) y i (h = 0.25) y(exact)
3
t i v i (h = 0.005) v(exact)
2
2.00 1.0000 1.0000 1.0000
2.25 -- 1.3889 1.3901
2.50 1.8000 1.8057 1.8080
2.75 -- 2.2476 2.2509
3.00 2.7017 2.7123 2.7165
x i y i (h = 0.25) y(exact)
0 0.0000 0.0000
0.25 0.0313 0.0340
0.50 0.1416 0.1487
0 0.0000 0.0000
0.005 0.2500 0.2673
0.010 0.2813 0.3954
t i v i (h = 0.002) v(exact)
0 0.0000 0.0000
0.002 0.0588 0.0569
0.004 0.1903 0.1919
0.006 0.3273 0.3354
0.008 0.4032 0.4178
0.010 0.3777 0.3954
x i y i (h = 0.1) y(exact)
0 0.0000 0.0000
0.1 0.0050 0.0052
0.2 0.0210 0.0214
0.3 0.0492 0.0499
0.4 0.0909 0.0918
0.5 0.1474 0.1487
20.8 Runge--Kutta method of order 4 623
20.8 RUNGE--KUTTAMETHODOFORDER4
The‘Runge--Kutta’methods arealargefamilyofmethods usedforsolvingdifferential
equations. The Euler and improved Euler methods are special cases of this family. The
orderofthemethodreferstothehighestpowerofhincludedintheTaylorseriesexpansion.Weshallnowpresentthefourth-orderRunge--Kuttamethod.Thederivationofthis
isbeyond the scope ofthis book.
To solve the equation dy
dx = f (x,y) subject toy = y 0 whenx = x 0
we generate the
sequence ofvalues,y i
, from the formula
where
y i+1
=y i
+ h 6 (k 1 +2k 2 +2k 3 +k 4 ) (20.17)
k 1
= f(x i
,y i
)
(
k 2
=f x i
+ h )
2 ,y i +h 2 k 1
(
k 3
=f x i
+ h )
2 ,y i +h 2 k 2
k 4
=f(x i
+h,y i
+hk 3
)
Example20.8 UsetheRunge--Kuttamethodtosolve dy
dx = −xy2 ,for0 x 1,subjecttoy(0) = 2.
Useh = 0.25 and work tofour decimal places.
Solution We shall show the calculations required to findy 1
. You should follow this through and
then verifythe results inTable 20.6. The exact solution is shown forcomparison.
f(x,y)=−xy 2 h=0.25 x 0
=0 y 0
=2
Takingi = 0 inEquation (20.17), wehave
k 1
= f(x 0
,y 0
) = −0(2) 2 =0
k 2
= f (0.125,2) = −0.125(2) 2 = −0.5
(
k 3
=f 0.125,2 + 0.25 )
2 (−0.5) = f (0.125,1.9375)
= −0.125(1.9375) 2 = −0.4692
k 4
= f (0.25,2 +0.25(−0.4692)) = f (0.25,1.8827)
Therefore,
= −0.25(1.8827) 2 = −0.8861
y 1
=2+ 0.25 (0 +2(−0.5) +2(−0.4692) + (−0.8861)) = 1.8823
6
624 Chapter 20 Ordinary differential equations II
Table20.6
Comparison of the Runge--Kutta method
with exact solutions.
i x i y i y(x i )
0 0.0 2.0 2.0
1 0.25 1.8823 1.8824
2 0.5 1.5999 1.6000
3 0.75 1.2799 1.2800
4 1.0 1.0000 1.0000
Table20.7
The solution to
Example 20.9.
x i
y i
0.0 0.0
0.2 0.0213
0.4 0.0897
0.6 0.2112
Example20.9 Usethe fourth-order Runge--Kutta method toobtain a solution of
Solution We have
dy
dx =x2 +x−y
subject toy = 0 whenx = 0, for 0 x 0.6 withh = 0.2. Work throughout to four
decimal places.
where
y i+1
=y i
+ h 6 (k 1 +2k 2 +2k 3 +k 4 )
k 1
= f(x i
,y i
)
(
k 2
=f x i
+ h )
2 ,y i +h 2 k 1
(
k 3
=f x i
+ h )
2 ,y i +h 2 k 2
k 4
=f(x i
+h,y i
+hk 3
)
Inthisexample, f (x,y) =x 2 +x−yandx 0
= 0,y 0
= 0.Thefirststageinthesolution
isgiven by
k 1
=f(x 0
,y 0
)=0
k 2
= f(0.1,0) = 0.11
k 3
= f(0.1,0 + (0.1)(0.11)) = f(0.1,0.011) = 0.099
k 4
= f (0.2,0 + (0.2)(0.099)) = f (0.2,0.0198) = 0.2202
Therefore,
y 1
=0+ 0.2 (0 +2(0.11) +2(0.099) +0.2202) = 0.0213
6
which, in fact, is correct to four decimal places. Check the next stage for yourself. The
complete solution isshown inTable 20.7.
20.8.1 Higherorderequations
20.8 Runge--Kutta method of order 4 625
Thetechniquesdiscussedforthesolutionofsinglefirst-orderequationsgeneralizereadilytohigherorderequationssuchasthosedescribedinSection20.3andSection20.4.It
isobviousthatacomputersolutionisessentialandthereareawidevarietyofcomputer
packages available tosolve such equations.
EXERCISES20.8
1 Findy(0.4)ify ′ = (x +y) 2 andy(0) = 1usingthe
Runge--Kutta method oforder 4. Take (a) h = 0.2
and(b) h = 0.1.
2 Repeat Question 1 in Exercises20.6 usingthe
fourth-orderRunge--Kutta method.
3 Repeat Question 2 in Exercises20.6 usingthe
fourth-orderRunge--Kutta method.
4 Repeat Question 3 in Exercises20.6 usingthe
fourth-orderRunge--Kutta method.
Solutions
1 (a)
2
x i
y i
0 1.0000
0.2 1.3085
0.4 2.0640
(b)
x i
y i
0 1.0000
0.1 1.1230
0.2 1.3085
0.3 1.5958
0.4 2.0649
x i y i y i y
(h = 0.5) (h = 0.25) (exact)
2.00 1.0000 1.0000 1.0000
2.25 -- 1.3900 1.3901
2.50 1.8078 1.8079 1.8080
2.75 -- 2.2507 2.2509
3.00 2.7163 2.7164 2.7165
4
x i y i y
(h = 0.1) (exact)
0.0 0.0000 0.0000
0.1 0.0052 0.0052
0.2 0.0214 0.0214
0.3 0.0499 0.0499
0.4 0.0918 0.0918
0.5 0.1487 0.1487
t i v i v
(h = 0.005) (exact)
0 0.0000 0.0000
0.005 0.2675 0.2673
0.010 0.3959 0.3954
3
x i y i y
(h = 0.25) (exact)
0.00 0.0000 0.0000
0.25 0.0340 0.0340
0.50 0.1487 0.1487
t i v i v
(h = 0.002) (exact)
0 0.0000 0.0000
0.002 0.0569 0.0569
0.004 0.1919 0.1919
0.006 0.3352 0.3354
0.008 0.4178 0.4178
0.010 0.3954 0.3954
626 Chapter 20 Ordinary differential equations II
REVIEWEXERCISES20
1 UseEuler’smethodwithh = 0.1 to estimatex(0.4)
given dx
dt =x2 −2xt,x(0) =2.
2 Thegeneralization ofEuler’smethodto the two
coupledequations
dy 1
dx = f(x,y 1 ,y 2 )
dy 2
dx =g(x,y 1 ,y 2 )
is given by
y 1(i+1)
= y 1(i)
+hf(x i ,y 1(i)
,y 2(i)
)
y 2(i+1)
= y 2(i)
+hg(x i ,y 1(i)
,y 2(i)
)
Given the coupled equations
dy 1
dx =xy 1 +y dy 2
2
dx =xy 2 +y 1
estimatey 1 (0.3)andy 2 (0.3),ify 1 (0) = 1 and
y 2 (0) = −1.Takeh = 0.1.
3 Expressthe followingequations asasetoffirst-order
equations:
(a)
d 2 y
dt 2 +6dy dt +8y=0
(b) 3 d2 x
dt 2 +5dx dt +4x=0
(c) 4 d3 x
dt 3 +8d2 x
dt 2 +6dx dt +5x=0
4 Expressthe followingcoupledfirst-order equationsas
asingle second-order differentialequation:
(a)
(b)
(c)
dy 1
dx =3y 1 +4y 2 ,2dy 2
dx = 6y 1 +8y 2
dx 1
dt
dy 1
dt
=6x 1 −5x 2 ,2 dx 2
dt
=8y 1 +4y 2 ,2 dy 2
dt
= 4x 1 −3x 2
= 4y 1 −6y 2
Solutions
1 4.7937
4 (a)
2 0.7535, −0.7535
dy
3 (a) 1 dy (b)
=y 2
dt 2 = −8y
dt 1 −6y 2
dx
(b) 1
=x
dt 2 3 dx 2
(c)
= −4x
dt 1 −5x 2
dx
(c) 1 dx
=x 2
dt 2 =x
dt 3
4 dx 3
= −5x
dt 1 −6x 2 −8x 3
d 2 y
dx 2 −7dy dx = 0
d 2 x
dt 2 − 9 dx
2 dt +x=0
d 2 y
dt 2 −5dy dt −32y=0
21 TheLaplacetransform
Contents 21.1 Introduction 627
21.2 DefinitionoftheLaplacetransform 628
21.3 Laplacetransformsofsomecommonfunctions 629
21.4 PropertiesoftheLaplacetransform 631
21.5 Laplacetransformofderivativesandintegrals 635
21.6 InverseLaplacetransforms 638
21.7 UsingpartialfractionstofindtheinverseLaplacetransform 641
21.8 FindingtheinverseLaplacetransformusingcomplexnumbers 643
21.9 Theconvolutiontheorem 647
21.10 Solvinglinearconstantcoefficientdifferentialequationsusing
theLaplacetransform 649
21.11 Transferfunctions 659
21.12 Poles,zerosandthesplane 668
21.13 Laplacetransformsofsomespecialfunctions 675
Reviewexercises21 678
21.1 INTRODUCTION
TheLaplacetransformisusedtosolvelinearconstantcoefficientdifferentialequations.
This is achieved by transforming them to algebraic equations. The algebraic equations
are solved, then the inverse Laplace transform is used to obtain a solution in terms of
the original variables. This technique can be applied to both single and simultaneous
differentialequationsandsoisextremelyusefulgiventhatdifferentialequationmodels
arecommon as we sawinChapters 19 and 20.
TheLaplacetransformisalsousedtoproducetransferfunctionsfortheelementsofan
engineeringsystem.Thesearerepresentedindiagrammaticformasblocks.Thevarious
628 Chapter 21 The Laplace transform
blocksofthesystem,correspondingtothesystemelements,areconnectedtogetherand
the result is a block diagram for the whole system. By breaking down a system in this
way it is much easier to visualize how the various parts of the system interact and so
a transfer function model is complementary to a time domain model and is a valuable
way of viewing an engineering system. Transfer functions are useful in many areas of
engineering, butareparticularly important inthe design of control systems.
21.2 DEFINITIONOFTHELAPLACETRANSFORM
Let f (t) be a function of time t. In many real problems only values of t 0 are of
interest.Hence f (t)is given fort 0,and forallt < 0, f (t) istaken tobe 0.
The Laplace transform of f (t)isF(s), defined by
F(s) =
∫ ∞
0
e −st f (t)dt
Note that to find the Laplace transform of a function f (t), we multiply it by e −st and
integrate between the limits 0 and ∞.
TheLaplacetransformchanges,ortransforms,thefunction f (t)intoadifferentfunctionF(s).Notealsothatwhereas
f (t)isafunctionoft,F(s)isafunctionofs.Todenote
the Laplace transform of f (t) we write L{f (t)}. We use a lower case letter to represent
the time domain function and an upper case letter to represent thesdomain function.
Thevariablesmayberealorcomplex.Astheintegralisimproperrestrictionsmayneed
tobe placed onstoensure thatthe integral does not diverge.
Example21.1 Find the Laplace transformsof
(a) 1
(b) e −at
∫ ∞
[ ] e
Solution (a) L{1} = e −st −st ∞
1dt = = 1
0 −s
0
s =F(s)
This transformexists provided the real partofs,Re(s), ispositive.
(b) L{e −at } =
=
∫ ∞
0
∫ ∞
0
e −st e −at dt
e −(s+a)t dt
[ e
−(s+a)t ∞
=
−(s +a)]
0
( )
1
= 0 −
−(s +a)
= 1
s +a =F(s)
This transformexists provideds +a > 0.
Throughoutthe chapteritisassumed thatshas a valuesuchthatall integrals exist.
21.3 Laplace transforms of some common functions 629
21.3 LAPLACETRANSFORMSOFSOMECOMMON
FUNCTIONS
Determining the Laplace transform of a given function, f (t), is essentially an exercise
inintegration.Inordertosaveeffortalook-uptableisoftenused.Table21.1listssome
common functions and their corresponding Laplace transforms.
Table21.1
TheLaplace transformsofsome common functions.
Function, f (t) Laplacetransform,F(s) Function, f (t) Laplacetransform,F(s)
1
t
1
s
1
s 2
t 2 2
s 3
t n n!
s n+1
e at 1
s −a
e −at 1
s +a
e −at cosbt
sinhbt
coshbt
e −at sinhbt
e −at coshbt
tsinbt
t n e −at n!
(s +a) n+1 tcosbt
sinbt
cosbt
e −at sinbt
b
s 2 +b 2
s
s 2 +b 2
u(t)unitstep
u(t −d)
b
(s+a) 2 +b 2 δ(t) 1
δ(t −d)
s +a
(s+a) 2 +b 2
b
s 2 −b 2
s
s 2 −b 2
b
(s+a) 2 −b 2
s +a
(s+a) 2 −b 2
2bs
(s 2 +b 2 ) 2
s 2 −b 2
(s 2 +b 2 ) 2
1
s
e −sd
s
e −sd
Example21.2 UseTable 21.1 todetermine the Laplace transformofeachofthe following functions:
(a) t 3 (b) t 7
(c) sin4t (d) e
( −2t
t
(e) cos (f) sinh3t
2)
(g) cosh5t
(i) e −t sin2t
(h) tsin4t
(j) e 3t cost
630 Chapter 21 The Laplace transform
Solution From Table 21.1 wefind the results listedinTable 21.2.
Table21.2
f(t) F(s) f(t) F(s) f(t) F(s)
(a) t 3 6
s 4
(b) t 7 7!
s 8
(c) sin4t
4
s 2 +16
(d) e −2t 1
s +2
(e) cos(t/2)
(f) sinh3t
(g) cosh5t
s
s 2 +0.25
3
s 2 −9
s
s 2 −25
(h) tsin4t
(i) e −t sin2t
(j) e 3t cost
8s
(s 2 +16) 2
2
(s+1) 2 +4
s −3
(s−3) 2 +1
EXERCISES21.3
1 Determine the Laplacetransformsofthe following
functions:
(a) sin6t
( )
2t
(c) sin
3
(b) cos4t
( )
4t
(d) cos
3
(e) t 4 (f) t 2 t 3
(g) e −3t
(i)
1
e 4t
(h) e 3t
(j) tcos3t
(k) tsint
cos7t
(m)
e 5t
(o) cosh5t
(q) e −2t cosh7t
(s) e 7t cosh9t
(l) e −t sin3t
(n) sinh6t
(p) e −3t sinh4t
(r) e 4t sinh3t
2 ShowfromthedefinitionoftheLaplacetransformthat
L{u(t −d)} = e−sd ,d>0
s
Solutions
6
1 (a)
s 2 +36
6
(c)
9s 2 +4
24
(e)
s 5
1
(g)
s +3
(i)
1
s +4
(b)
(d)
(f)
(h)
(j)
s
s 2 (k)
+16
9s
9s 2 (m)
+16
120
s 6
(o)
1
s −3
(q)
s 2 −9
(s 2 +9) 2 (s)
2s
(s 2 +1) 2 (l)
s +5
(s+5) 2 +49
s
s 2 −25
s +2
(s+2) 2 −49
s −7
(s−7) 2 −81
(n)
(p)
(r)
3
(s+1) 2 +9
6
s 2 −36
4
(s+3) 2 −16
3
(s−4) 2 −9
21.4 Properties of the Laplace transform 631
21.4 PROPERTIESOFTHELAPLACETRANSFORM
There are some useful properties of the Laplace transform that can be exploited. They
allow us to find the Laplace transforms of more difficult functions. The properties we
shall examine are:
(1) linearity;
(2) shift theorems;
(3) final value theorem.
21.4.1 Linearity
Let f andgbe two functions oft and letkbe a constant which may benegative. Then
L{f +g}= L{f}+L{g}
L{kf}=kL{f}
The first property states that to find the Laplace transform of a sum of functions, we
simply sum the Laplace transforms of the individual functions. The second property
says that if we multiply a function by a constant k, then the corresponding transform
is also multiplied by k. Both of these properties follow directly from the definition of
the Laplace transform and linearity properties of integrals, and mean that the Laplace
transformisalinearoperator.UsingthelinearitypropertiesandTable21.1,wecanfind
the Laplace transformsof more complicated functions.
Example21.3 Find the Laplace transforms ofthe following functions:
(a) 3 +2t (b) 5t 2 −2e t
Solution (a) L{3 +2t} = L{3} + L{2t}
= 3L{1} +2L{t}
= 3 s + 2 s 2
(b) L{5t 2 −2e t }= L{5t 2 } + L{−2e t }
= 5L{t 2 } −2L{e t }
= 10
s − 2
3 s −1
With a littlepractice, someofthe intermediatesteps maybeexcluded.
Example21.4 Findthe Laplace transforms ofthe following:
(a) 5cos3t+2sin5t−6t 3
Solution (a) L{5cos3t +2sin5t −6t 3 } =
(b) −e −t + 1 (sint +cost)
2
5s
s 2 +9 + 10
s 2 +25 − 36
s 4
632 Chapter 21 The Laplace transform
{
(b) L −e −t + sint+cost }
= −1
2 s +1 + 1
2(s 2 +1) + s
2(s 2 +1)
= −1
s +1 + 1 +s
2(s 2 +1)
21.4.2 Firstshifttheorem
If L{f(t)} =F(s)then
L{e −at f (t)} =F(s +a)
aconstant
We obtainF(s +a) by replacing every ‘s’ inF(s) by ‘s +a’. The variableshas been
shifted by anamounta.
Example21.5 (a) UseTable 21.1 tofind the Laplace transform of
f(t)=tsin5t
(b) Usethe first shift theoremtowritedown
L{e −3t tsin5t}
Solution (a) FromTable 21.1 wehave
L{f(t)} = L{tsin5t} =
10s
(s 2 +25) 2 =F(s)
(b) Fromthe first shift theorem witha = 3 we have
L{e −3t f(t)} =F(s +3)
10(s +3)
=
((s +3) 2 +25) 2
=
10(s +3)
(s 2 +6s +34) 2
Example21.6 TheLaplace transformofafunction, f (t), isgiven by
F(s) = 2s+1
s(s +1)
State the Laplace transformof
(a) e −2t f (t)
(b) e 3t f (t)
Solution (a) Usethe first shift theorem witha = 2.
L{e −2t f(t)} =F(s +2)
=
=
2(s+2)+1
(s+2)(s+2+1)
2s+5
(s +2)(s +3)
(b) Usethe first shift theorem witha = −3.
L{e 3t f(t)} =F(s−3)
=
=
2(s−3)+1
(s−3)(s−3+1)
2s−5
(s −3)(s −2)
21.4 Properties of the Laplace transform 633
21.4.3 Secondshifttheorem
If L{f(t)} =F(s)then
L{u(t −d)f(t −d)} =e −sd F(s) d >0
The function, u(t − d)f (t − d), is obtained by moving u(t)f (t) to the right by an
amount d. This is illustrated in Figure 21.1. Note that because f (t) is defined to be
0fort < 0,then f (t −d) = 0fort <d.Itmayappearthatu(t −d)isredundant.However,
it is necessary for inversion of the Laplace transform, which will be covered in
Section21.6.
u(t) f(t)
u(t – d) f(t – d)
(a)
t
Figure21.1
Shifting the functionu(t)f (t)to the rightbyan amountd yields the function
u(t−d)f(t−d).
(b)
d
t
Example21.7 Given L{f (t)} =
2s ,find L{u(t −2)f(t −2)}.
s +9
Solution Usethe second shift theorem withd = 2.
L{u(t −2)f(t −2)} = 2se−2s
s +9
Example21.8 TheLaplace transform ofafunctionis e−3s
. Find the function.
s2 Solution Theexponentialterminthetransformsuggeststhatthesecondshifttheoremisused.Let
e −sd F(s) = e−3s
s 2
634 Chapter 21 The Laplace transform
sothatd =3andF(s) = 1 s2. If we let
L{f(t)} =F(s) = 1 s 2
then f (t) =t and so f (t −3) =t −3.Now, usingthe second shift theorem
L{u(t −3)f(t −3)} = L{u(t −3)(t −3)} = e−3s
s 2
Hence the required function isu(t −3)(t −3) as shown inFigure 21.2.
f(t)
3 t
Figure21.2
Thefunction: f(t) =u(t −3)(t −3).
21.4.4 Finalvaluetheorem
This theorem applies only to real values of s and for functions, f (t), which possess a
limitast → ∞.
Thefinal valuetheorem states:
limsF(s) = lim f(t)
s→0 t→∞
Some care is needed when applying the theorem. The Laplace transform of some functions
exists only for Re(s) > 0 and for these functions taking the limit ass → 0 is not
sensible.
Example21.9 Verifythe final value theoremfor f (t) = e −2t .
Solution ∫ ∞
0
e −st e −2t dt exists provided Re(s) > −2 and so
F(s) = 1
s +2
Re(s) > −2
Since weonly require Re(s) > −2, itispermissibletolets → 0.
s
limsF(s) =lim
s→0 s→0 s +2 = 0
Furthermore
lim
t→∞ e−2t = 0
So lim s→0
sF(s) = lim t→∞
f (t)and the theorem isverified.
21.5 Laplace transform of derivatives and integrals 635
EXERCISES21.4
1 Find the Laplace transformsofthe following
functions:
(a)3t 2 −4
(c)2−t 2 +2t 4
( )
(e) 1 3 sin3t −4cos t
2
(f) 3t 4 e 5t +t
(g)sinh2t +3cosh2t
(h)e −t sin3t +4e −t cos3t
(b) 2sin4t+11−t
(d) 3e 2t +4sint
2 TheLaplacetransform of f (t)is given as
F(s) = 3s2 −1
s 2 +s+1
Find the Laplace transformof
(a)e −t f (t) (b) e 3t f (t) (c) e −t/2 f (t)
3 Given
find
L{f(t)} =
4s
s 2 +1
(a) L{u(t −1)f(t −1)}
(b) L{3u(t −2)f(t −2)}
{
}
(c) L u(t−4) f(t−4)
2
4 Find the final value ofthe followingfunctionsusing
the final value theorem:
(a) f(t) =e −t sint
(b) f(t)=e −t +1
(c) f(t) =e −3t cost +5
Solutions
1 (a)
(c)
(e)
(g)
6
s 3 − 4 s
2
s − 2 s 3 + 48
s 5
1
s 2 +9 − 16s
4s 2 +1
3s+2
s 2 −4
(b)
(d)
(f)
(h)
8
s 2 +16 + 11
s − 1 s 2
3
s −2 + 4
s 2 +1
72
(s −5) 5 + 1 s 2
4s+7
(s+1) 2 +9
2 (a)
(c)
3 (a)
3s 2 +6s+2
s 2 +3s+3
12s 2 +12s −1
4s 2 +8s+7
4se −s
s 2 (b)
+1
(b)
12se −2s
s 2 +1
4 (a)0 (b)1 (c)5
3s 2 −18s +26
s 2 −5s+7
(c)
2se −4s
s 2 +1
21.5 LAPLACETRANSFORMOFDERIVATIVES
ANDINTEGRALS
In later sections we shall use the Laplace transform to solve differential equations. In
order to do this we need to be able to find the Laplace transform of derivatives of functions.Let
f (t)beafunctionoft,and f ′ and f ′′ thefirstandsecondderivativesof f.The
Laplace transform of f (t)isF(s). Then
L{f ′ } =sF(s) − f(0)
L{f ′′ } =s 2 F(s)−sf(0)−f ′ (0)
where f(0) and f ′ (0) are the initial values of f and f ′ . The general case for the
Laplace transform ofannthderivative is
L{f (n) } =s n F(s) −s n−1 f(0) −s n−2 f ′ (0)−···−f (n−1) (0)
636 Chapter 21 The Laplace transform
Another usefulresultis
{ ∫ }
t
L f(t)dt = 1 s F(s)
0
Example21.10 The Laplace transform ofx(t) isX(s). Givenx(0) = 2 andx ′ (0) = −1, write expressionsforthe
Laplace transformsof
(a) 2x ′′ −3x ′ +x (b) −x ′′ +2x ′ +x
Solution L{x ′ } =sX(s) −x(0) =sX(s) −2
L{x ′′ } =s 2 X(s)−sx(0)−x ′ (0) =s 2 X(s)−2s+1
(a) L{2x ′′ −3x ′ +x} = 2(s 2 X(s) −2s +1) −3(sX(s) −2) +X(s)
= (2s 2 −3s+1)X(s)−4s+8
(b) L{−x ′′ +2x ′ +x} = −(s 2 X(s) −2s +1) +2(sX(s) −2) +X(s)
= (−s 2 +2s+1)X(s)+2s−5
Engineeringapplication21.1
Voltageacrossacapacitor
Thevoltage, v(t), acrossacapacitorofcapacitanceC isgiven by
v(t) = 1 C
∫ t
0
i(t)dt
Taking Laplace transforms yields
V(s)= 1 Cs I(s)
whereV(s) = L{v(t)}andI(s) = L{i(t)}.
Engineeringapplication21.2
Frequencyresponseofasystem
The Laplace variablesis sometimes referred to as the generalized or complex frequency
variable. It consists of a real and an imaginary part, wheres = σ + jω. If
onlythesinusoidalsteady-stateresponsetoasinewaveinputforasystemisrequired
then we can obtain this by putting σ = 0 into the expression for s and sos = jω.
Thusitispossibletomakethissubstitutioninanytransferfunctiongivenintermsof
the Laplace variablesto obtain the sinusoidal steady-state frequency response. We
willnotprovethisassertionhereforreasonsofbrevity.Insteadwewilldemonstrate
its usefulness.
21.5 Laplace transform of derivatives and integrals 637
Consider the Laplace transformof theequation forthe voltage across acapacitor
of capacitanceC. Recall fromEngineering application 21.1 that
V(s)= 1 Cs I(s)
Rearrangingthisexpressiongivesaformulafortheimpedanceofthecapacitor,Z(s),
inthesdomain
Z(s) = V(s)
I(s) = 1 Cs
Thisformoftheequationcanbeusedtodeterminethebehaviourofthecapacitorin
a variety of situations where the voltages and currents can take a number of forms,
for example step inputs, sine waves, triangular waves, etc. However, if we are only
interested in the sinusoidal steady-state response of the system when all transients
have decayed, then itispossible tosubstitutes = jω into the expression forZ(s)
Z(jω) = V(jω)
I(jω) = 1
jωC
This form of the impedance is the one regularly used in a.c. circuit theory. It gives
theimpedanceofthecapacitorintermsofthecapacitanceandtheangularfrequency
ω,where ω = 2πf.(Thisresulthasalreadybeendiscussedinthecontextofphasors
on page 342).
Thesamesubstitutioncan bemade toobtain thefrequency responseofasystem,
givenitstransferfunction.Thesystemfrequencyresponseissomethingthatcanusuallybeobtainedeasilybyexperiment.Allthatisrequiredisasignalsourceofaknown
amplitude and a means of measuring the output amplitude and phase relative to the
original signal. Measurements of this sort can reveal some of the properties of the
systeminquestionreflectingthecloserelationshipbetweenthetransferfunctionand
the frequency response.
EXERCISES21.5
1 TheLaplacetransform ofy(t)isY(s),y(0) = 3,
y ′ (0) = 1. Find the Laplacetransformsofthe
followingexpressions:
(a) y ′ (b) y ′′
(c) 3y ′′ −y ′
(d) y ′′ +2y ′ +3y
(e) 3y ′′ −y ′ +2y
(f) −4y ′′ +5y ′ −3y
(g) 3 d2 y
dt 2 +6dy dt +8y
(h) 4 d2 y
dt 2 −8dy dt +6y
2 Given the Laplacetransform of f (t)isF(s),
f(0)=2,f ′ (0)=3andf ′′ (0) = −1,findthe
Laplace transformsof
(a) 3f ′ −2f
(c) f ′′′
(b) 3f ′′ − f ′ + f
(d) 2f ′′′ − f ′′ +4f ′ −2f
3 (a) IfF(s) = L{f(t)} = ∫ ∞
0 e −st f (t)dt,showusing
integration byparts that
(i) L{f ′ (t)} =sF(s)−f(0)
(ii) L{f ′′ (t)} =s 2 F(s)−sf(0)−f ′ (0)
(b) IfF(s) = L{f (t)}prove that
L{e −at f(t)} =F(s +a)
Deduce L{te −t },given L{t} = 1 s 2.
638 Chapter 21 The Laplace transform
Solutions
1 (a) sY−3
(b) s 2 Y−3s−1
(c) 3s 2 Y −sY −9s
(d) (s 2 +2s+3)Y−3s−7
(e) (3s 2 −s +2)Y −9s
(f) (−4s 2 +5s −3)Y +12s −11
(g) (3s 2 +6s +8)Y −9s −21
(h) (4s 2 −8s +6)Y −12s +20
2 (a) (3s−2)F−6
(b) (3s 2 −s+1)F−6s−7
(c) s 3 F−2s 2 −3s+1
(d) (2s 3 −s 2 +4s−2)F−4s 2 −4s−3
1
3 (b)
(s +1) 2
21.6 INVERSELAPLACETRANSFORMS
As mentioned in Section 21.5, the Laplace transform can be used to solve differential
equations. However, before such an application can be put into practice, we must study
theinverseLaplacetransform.Sofarinthischapterwehavebeengivenfunctionsoftand
foundtheirLaplacetransforms.Wenowconsidertheproblemoffindingafunction f (t),
havingbeengiventheLaplacetransform,F(s).ClearlyTable21.1andthepropertiesof
Laplace transformswill helpus todothis. If L{f (t)} =F(s)wewrite
f(t) = L −1 {F(s)}
and call L −1 the inverse Laplace transform. Like L, L −1 can be shown to be a linear
operator.
Example21.11 Findthe inverse Laplace transformsofthe following:
(a)
2
s 3
(b)
16
s 3
(c)
s
s 2 +1
(d)
1
s 2 +1
(e)
s +1
s 2 +1
Solution (a) We need to find a function of t which has a Laplace transform of
Table 21.1 wesee L −1 { 2
s 3 }
=t 2 .
{ } { } 16 2
(b) L −1 = 8L −1 = 8t 2
s 3 s 3
{ } s
(c) L −1 =cost
s 2 +1
{ } 1
(d) L −1 =sint
s 2 +1
{ } s +1
(e) L −1 s 2 +1
{ } { } s 1
= L −1 + L −1 =cost+sint
s 2 +1 s 2 +1
2
s3. Using
In parts (a), (c) and (d) we obtained the inverse Laplace transform by referring directly
to the table. In (b) and (e) we used the linearity properties of the inverse transform, and
then referredtoTable 21.1.
21.6 Inverse Laplace transforms 639
Example21.12 Find the inverse Laplace transformsof the following functions:
10 (s+1)
(a) (b) (c)
(s +2) 4 (s+1) 2 +4
{ }
Solution (a) L −1 10
= 10 { }
6
(s +2) 4 6 L−1 (s +2) 4
15
(s−1) 2 −9
= 5t3 e −2t
3
{ } { }
(b) L −1 s +1
= L −1 s +1
= e −t cos2t
(s+1) 2 +4 (s+1) 2 +2 2
{ } { }
(c) L −1 15
= 5L −1 3
= 5e t sinh3t
(s−1) 2 −9 (s−1) 2 −3 2
The function is written to match exactly the standard forms given in Table 21.1, with
possibly a constant factor being present. Often the denominator needs to be written in
standard form asillustratedinthe next example.
Example21.13 Findthe inverse Laplace transformsofthe following functions:
(a)
s +3
s 2 +6s+13
(b)
2s+3
s 2 +6s+13
Solution (a) By completing the square we can write
(c)
s 2 +6s+13=(s+3) 2 +4=(s+3) 2 +2 2
s −1
2s 2 +8s+11
(b)
Hence we may write
{ } { }
L −1 s +3
= L −1 s +3
= e −3t cos2t
s 2 +6s+13 (s+3) 2 +2 2
2s+3
s 2 +6s+13 = 2s+3
(s+3) 2 +2 = 2s+6
2 (s+3) 2 +2 − 3
2 (s+3) 2 +2 2
(
(
= 2
s +3
(s+3) 2 +2 2 )
− 3 2
)
2
(s+3) 2 +2 2
The expressions in brackets are standard forms so their inverse Laplace transforms
can befound fromTable 21.1.
{ }
L −1 2s+3
= 2e −3t cos2t − 3e−3t sin2t
s 2 +6s+13
2
(c) We write the expression usingstandard forms:
s −1
2s 2 +8s+11 = 1 s −1
2s 2 +4s+5.5 = 1 s −1
2 (s +2) 2 +1.5
= 1 (
)
s +2
2 (s +2) 2 +1.5 − 3
(s +2) 2 +1.5
= 1 (
s +2
2 (s +2) 2 +1.5 − √ 3
√ ) 1.5
1.5 (s +2) 2 +1.5
640 Chapter 21 The Laplace transform
Havingwrittentheexpressionintermsofstandardforms,theinverseLaplacetransform
can now be found:
{ }
L −1 s −1
= 1 (
e −2t cos √ 1.5t − √ 3 e −2t sin √ )
1.5t
2s 2 +8s+11 2 1.5
In every case the given function ofsis written as a linear combination of standard
formscontained inTable 21.1.
EXERCISES21.6
1 Byusingthe standard formsin Table 21.1 findthe
inverseLaplacetransformsofthefollowingfunctions:
1 2
(a) (b)
s s 3
8 1
(c) (d)
s s +2
6 5
(e) (f)
s +4 s −3
7 2
(g) (h)
s +8 s 2 +4
s 6s
(i)
s 2 (j)
+9 s 2 +4
20 2
(k)
s 2 (l)
+16 (s +1) 2
8 9
(m)
(s +2) 2 (n)
(s +3) 3
e −3s 4e −6s
(o) (p)
s s
(q) e −9s (r) 4e −8s
4 s
(s)
s 2 (t)
−16 s 2 −9
12 6s
(u)
s 2 (v)
−9 s 2 −8
2
(w)
(s+1) 2 −4
2
(y)
(s+3) 2 −4
s +3
(x)
(s+3) 2 −4
6s
(z)
s 2 −5
2 Findthe inverse Laplacetransformsofthe following
functions:
3 4
(a) (b)
2s s − 1 s 3
30
1
(c)
s 2 (d)
3(s +2)
3s−7 s −6
(e)
s 2 (f)
+9 s −4
s +4 5
(g)
(s+4) 2 (h)
+1 (s+4) 2 +1
(i)
(k)
(m)
6s+17
(s+4) 2 +1
(j)
0.5
(s +0.5) 2 (l)
6s+9
s 2 +2s+10
(n)
s
s 2 +2s+7
s +5
s 2 +8s+20
7s+3
s 2 +4s+8
Solutions
1 (a) 1 (b) t 2
(o) u(t −3) (p) 4u(t −6)
(c) 8 (d) e −2t
(q) δ(t −9) (r) 4δ(t −8)
(e) 6e −4t (f) 5e 3t
(s) sinh4t (t) cosh3t
(g) 7e −8t (h) sin2t
(u) 4sinh3t (v) 6cosh √ 8t
(i) cos3t (j) 6cos2t
(w) e −t sinh2t (x) e −3t cosh2t
(k) 5sin4t (l) 2te −t
(y) e −3t sinh2t (z) 6cosh √ 5t
(m) 8te −2t 9
(n)
2 t2 e −3t
21.7 Using partial fractions to find the inverse Laplace transform 641
2 (a)
3
2
(c) 30t
(b) 4− t2 2
(d)
(e) 3cos3t − 7 3 sin3t
(f) δ(t)−2e 4t
(g) e −4t cost
(h) 5e −4t sint
e −2t
3
(i) 6e −4t cost −7e −4t sint
(j) e −t cos √ 6t − 1 √
6
e −t sin √ 6t
(k)
te −t/2
2
(l) e −4t cos2t + 1 2 e−4t sin2t
(m) 6e −t cos3t +e −t sin3t
(n) 7e −2t cos2t − 11
2 e−2t sin2t
21.7 USINGPARTIALFRACTIONSTOFINDTHEINVERSE
LAPLACETRANSFORM
TheinverseLaplacetransformofafractionisoftenbestfoundbyexpressingitasitspartialfractions,andfindingtheinversetransformofthese.(SeeSection1.7foratreatment
ofpartialfractions.)
Example21.14 Findthe inverse Laplace transform of
4s−1 6s+8
(a) (b)
s 2 −s s 2 +3s+2
Solution (a) We express 4s−1
s 2 −s
as its partialfractions:
4s−1
s 2 −s = 4s−1
s(s −1) = A s + B
s −1
Using the technique of Section 1.6 wefind thatA = 1,B = 3 and hence
4s−1
s 2 −s = 1 s + 3
s −1
The inverse Laplace transform of each partialfraction isfound:
( ) ( )
1 3
L −1 = 1, L −1 =3e t
s s −1
Hence
L −1 ( 4s−1
s 2 −s
)
=1+3e t
(b) The expression isexpressed as its partialfractions. This was done inSection 1.7.
6s+8
s 2 +3s+2 = 2
s +1 + 4
s +2
The inverse Laplace transform of each partialfraction isnoted:
( )
( )
2
4
L −1 = 2e −t L −1 = 4e −2t
s +1 s +2
642 Chapter 21 The Laplace transform
and so
( ) 6s+8
L −1 s 2 +3s+2
=2e −t +4e −2t
Example21.15 Findthe inverse Laplace transformof 3s2 +6s+2
s 3 +3s 2 +2s .
Solution Following fromthe work inSection 1.7.1,we have
3s 2 +6s+2
s 3 +3s 2 +2s = 1 s + 1
s +1 + 1
s +2
The inverse Laplace transform of the partialfractions iseasily found:
{ } 3s
L −1 2 +6s+2
=1+e −t +e −2t
s 3 +3s 2 +2s
Example21.16 Findthe inverse Laplace transformof
Solution From Example 1.33 wehave
3s 2 +11s +14
s 3 +2s 2 −11s−52 .
3s 2 +11s +14
s 3 +2s 2 −11s−52 = 2
s −4 + s +3
s 2 +6s+13
We find the inverse Laplace transformsof the partialfractions:
{ } 2
L −1 =2e 4t
s −4
{ }
L −1 s +3
= e −3t cos2t
s 2 +6s+13
So
{ } 3s
L −1 2 +11s +14
= 2e 4t +e −3t cos2t
s 3 +2s 2 −11s−52
EXERCISES21.7
1 Expressthe followingaspartialfractions andhence
findtheir inverse Laplacetransforms:
(a)
(b)
(c)
5s+2
(s +1)(s +2)
3s+4
(s +2)(s +3)
4s+1
(s +3)(s +4)
(d)
(e)
(f)
(g)
6s−5
(s +5)(s +3)
4s+1
s(s +2)(s +3)
7s+3
s(s +3)(s +4)
6s+7
s(s +2)(s +4)
21.8 Finding the inverse Laplace transform using complex numbers 643
(h)
(i)
(j)
8s−5
(s +1)(s +2)(s +3)
3s+5
(s +1)(s 2 +3s +2)
2s−8
(s +2)(s 2 +7s +6)
2 Expressthe following fractions aspartialfractions
andhence findtheir inverse Laplacetransforms:
(a)
(b)
3s+3
(s −1)(s +2)
5s
(s +1)(2s −3)
(c) 2s+5
s +2
s 2 +4s+4
(d)
s 3 +2s 2 +5s
1 −s
(e)
(s +1)(s 2 +2s +2)
s +4
(f)
s 2 +4s+4
(g)
(h)
2(s 3 −3s 2 +s−1)
(s 2 +4s +1)(s 2 +1)
3s 2 −s+8
(s 2 −2s +3)(s +2)
Solutions
1 (a) 8e −2t −3e −t
(b) 5e −3t −2e −2t
(c) 15e −4t −11e −3t
35
(d)
2 e−5t − 23
2 e−3t
1
(e)
6 + 7 2 e−2t − 11
3 e−3t
1
(f)
4 +6e−3t − 25
4 e−4t
7
(g)
8 + 5 4 e−2t − 17
8 e−4t
(h) 21e −2t − 29
2 e−3t − 13
2 e−t
(i) e −t +2te −t −e −2t
(j) 3e −2t −e −6t −2e −t
2 (a) 2e t +e −2t
(b) e −t + 3e3t/2
2
(c) 2δ(t) +e −2t
(d) 1 5 (4 +e−t cos2t + 11 2 e−t sin2t)
(e) e −t (2−2cost −sint)
(f) e −2t (1 +2t)
(
(g) e −2t 3cosh √ 3t − √ 8 sinh √ )
3t −cost
3
(h) e t (cos √ 2t + √ 2sin √ 2t) +2e −2t
21.8 FINDINGTHEINVERSELAPLACETRANSFORMUSING
COMPLEXNUMBERS
InSections21.6and21.7wefoundtheinverseLaplacetransformusingstandardforms
and partial fractions. We now look at a method of finding inverse Laplace transforms
usingcomplexnumbers.Essentiallythemethodisoneusingpartialfractions,butwhere
all the factors in the denominator are linear -- that is, there are no quadratic factors. We
illustrate the methodusingExample21.13.
Example21.17 Findthe inverse Laplace transformsofthe following functions:
(a)
s +3
s 2 +6s+13
(b)
2s+3
s 2 +6s+13
(c)
s −1
2s 2 +8s+11
644 Chapter 21 The Laplace transform
Solution (a) We first factorize the denominator. To do this we solves 2 +6s +13 = 0 using the
formula
s = −6 ± √ 36 −4(13)
2
= −6 ± √ −16
2
= −6±4j
2
=−3±2j
It then follows that the denominator can be factorized as (s − a)(s − b) where
a = −3 +2jandb=−3 −2j. Then, usingthe partialfractions method
s +3
s 2 +6s+13 = s +3
(s −a)(s −b) = A
s −a + B
s −b
The unknown constantsAandBcan now be found:
s+3=A(s−b)+B(s−a)
Puts=a=−3+2j
2j =A(−3 +2j −b) =A(4j)
A = 1 2
Equate the coefficients ofs
1=A+B
B = 1 2
So,
s +3
s 2 +6s+13 = 1 ( 1
2 s −a + 1 )
s −b
The inverse Laplace transform can now be found:
{ }
L −1 s +3
= 1 { 1
s 2 +6s+13 2 L−1 s −a + 1 }
s −b
= 1 2 (eat +e bt ) = 1 2 (e(−3+2j)t +e (−3−2j)t )
= 1 2 e−3t (e 2jt +e −2jt )
= 1 2 e−3t (cos2t +jsin2t +cos2t −jsin2t)
= e −3t cos2t
21.8 Finding the inverse Laplace transform using complex numbers 645
(b)
2s+3
s 2 +6s+13 = 2s+3
(s −a)(s −b) = A
s −a + B
s −b
wherea = −3 +2jandb= −3 −2j. Hence,
2s+3=A(s−b)+B(s−a)
Puts=a=−3+2j
−3 +4j =A(4j)
A=1+0.75j
Equate the coefficients ofs
Hence,
2=A+B
B=1−0.75j
2s+3
s 2 +6s+13 = 1 +0.75j + 1 −0.75j
s −a s −b
(c)
Taking the inverse Laplace transform yields
{ }
L −1 2s+3
= (1 +0.75j)e at + (1 −0.75j)e bt
s 2 +6s+13
= (1 +0.75j)e (−3+2j)t + (1 −0.75j)e (−3−2j)t
s −1
2s 2 +8s+11 = 1 (
2
= (1 +0.75j)e −3t (cos2t +jsin2t)
+ (1 −0.75j)e −3t (cos2t −jsin2t)
= e −3t (2cos2t −1.5sin2t)
)
= 1 (
2
s −1
s 2 +4s+5.5
s −1
(s −a)(s −b)
)
wherea = −2 + √ 1.5j,b = −2 − √ 1.5j.Applyingthemethodofpartialfractions
produces
Hence,
s −1
(s −a)(s −b) = A
s −a + B
s −b
s−1=A(s−b)+B(s−a)
By lettings =a, thens =binturn, gives
A=0.5+ √ 1.5j B = 0.5 − √ 1.5j
Hence we may write
s −1
(s −a)(s −b) = 0.5 + √ 1.5j
+ 0.5 − √ 1.5j
s −a s −b
646 Chapter 21 The Laplace transform
Taking the inverse Laplace transformyields
Hence
{ }
L −1 s −1
= (0.5 + √ 1.5j)e at + (0.5 − √ 1.5j)e bt
(s −a)(s −b)
= (0.5 + √ 1.5j)e (−2+√ 1.5j)t
+ (0.5 − √ 1.5j)e (−2−√ 1.5j)t
= e −2t {(0.5 + √ 1.5j)e √ 1.5jt
+ (0.5 − √ 1.5j)e −√ 1.5jt }
= e −2t {(0.5 + √ 1.5j)(cos √ 1.5t +jsin √ 1.5t)
+ (0.5 − √ 1.5j)(cos √ 1.5t −jsin √ 1.5t)}
= e −2t (cos √ 1.5t −2 √ 1.5sin √ 1.5t)
{ }
L −1 s −1
= e−2t
2s 2 +8s+11 2 (cos √ 1.5t −2 √ 1.5sin √ 1.5t)
EXERCISES21.8
AsseenfromExample21.17,whencomplexnumbersareallowed,allthefactorsinthe
denominatorarelinear.Theunknown constantsareevaluatedusingparticularvaluesof
s orequating coefficients.
1 Expressthe followingexpressions aspartialfractions,
usingcomplex numbers ifnecessary. Hence findtheir
inverse Laplacetransforms.
(a)
3s−2
s 2 +6s+13
(b)
2s+1
s 2 −2s+2
(c)
(e)
s 2
(s 2 /2)−s+5
2s+3
−s 2 +2s−5
(d)
s 2 +s+1
s 2 −2s+3
Solutions
3
1 (a) 2 +11j/4
+
s −a
3
2 −11j/4
s −b
wherea=−3+2j,b=−3−2j,
e −3t[ 3cos2t − 11
2 sin2t ]
(b) 1 −3j/2 + 1 +3j/2
s −a s −b
wherea=1+j,b=1−j,
e t (2cost+3sint)
(c) 2 +8j/3 + 2 −8j/3 +2
s −a s −b
wherea=1+3j,b=1−3j,
(
e t 4cos3t − 16 )
3 sin3t +2δ(t)
3
(d)1+ 2 −j/(2 √ 2)
+
s −a
(e)
3
2 +j/(2√ 2)
s −b
wherea=1+ √ 2j,b=1− √ 2j,
)
δ(t)+e t (3cos √ 2t + 1 √
2
sin √ 2t
−1 +5j/4
s −a
+
−1 −5j/4
s −b
wherea=1+2j,b=1−2j,
e t (−2cos2t − 5 2 sin2t)
21.9 The convolution theorem 647
21.9 THECONVOLUTIONTHEOREM
Let f (t) andg(t) be two piecewise continuous functions. The convolution of f (t)
andg(t), denoted (f ∗g)(t), isdefined by
(f ∗g)(t)=
∫ t
0
f(t −v)g(v)dv
Example21.18 Find the convolution of2t andt 3 .
Solution f (t) = 2t,g(t) =t 3 , f(t −v) =2(t −v),g(v) = v 3 .Then
2t∗t 3 =
∫ t
0
2(t −v)v 3 dv =2
∫ t
0
tv 3 −v 4 dv
[ tv
4
= 2
4 − v5
5
= t5
10
] t
0
[ ] t
5
= 2
4 − t5 5
Itcan beshown that
f∗g=g∗f
butthe proof isomitted. Instead, thispropertyisillustratedby anexample.
Example21.19 Show that f ∗g=g ∗ f where f (t) = 2t andg(t) =t 3 .
Solution f ∗g= 2t ∗t 3 =t 5 /10 by Example 21.18. Fromthe definitionofconvolution
(g∗f)(t)=
∫ t
0
g(t −v)f(v)dv
We haveg(t) =t 3 ,sog(t −v) = (t −v) 3 ,and f(v) =2v.Therefore
g∗f=t 3 ∗2t=
=
= 2
∫ t
0
∫ t
0
∫ t
(t −v) 3 2vdv
(t 3 −3t 2 v +3tv 2 −v 3 )2vdv
0
t 3 v −3t 2 v 2 +3tv 3 −v 4 dv
[ t 3 v 2
] t
= 2
2 −t2 v 3 + 3tv4
4 − v5
= t5
5
0
10
648 Chapter 21 The Laplace transform
For any functions f (t)andg(t)
f∗g=g∗f
21.9.1 Theconvolutiontheorem
Let f (t) and g(t) be piecewise continuous functions, with L{f (t)} = F(s) and
L{g(t)} = G(s). The convolution theorem allows us to find the inverse Laplace transform
of a product oftransforms,F(s)G(s):
L −1 {F(s)G(s)} = (f ∗g)(t)
Example21.20 Use the convolution theorem to find the inverse Laplace transforms of the following
functions:
1 3
(a) (b)
(s +2)(s +3) s(s 2 +4)
Solution (a) LetF(s) = 1
s +2 ,G(s) = 1
s +3 .
Then f(t) = L −1 {F(s)} = e −2t ,g(t) = L −1 {G(s)} = e −3t .
{ }
L −1 1
= L −1 {F(s)G(s)} = (f ∗g)(t)
(s +2)(s +3)
=
=
∫ t
0
∫ t
0
e −2(t−v) e −3v dv =
e −2t e −v dv
∫ t
0
e −2t e 2v e −3v dv
= e −2t [−e −v ] t 0 = e−2t (−e −t +1) = e −2t −e −3t
(b) LetF(s) = 3 s ,G(s) = 1
s 2 +4 .Thenf(t) =3,g(t) = 1 sin2t. So,
2
{ }
L −1 3
= L −1 {F(s)G(s)}
s(s 2 +4)
=(f ∗g)(t)
∫ t
∫ t
= 3 sin2v dv = 3 sin2vdv
0 2 2 0
= 3 [ −cos2v
= 3 2 2 4 (1−cos2t)
] t
0
21.9 Solving linear constant coefficient differential equations 649
EXERCISES21.9
1 Find
(a) e −2t ∗e −t
(b) t 2 ∗e −3t
2 Find f ∗gwhen
(a) f =1,g=t
(b) f =t 2 ,g=t
(c) f =e t ,g=t
(d) f =sint,g=t
Ineach caseverify that
L{f}×L{g}=L{f ∗g}.
3 IfF(s) = 1
s −1 ,G(s) = 1 s andH(s) = 1
2s+3
use the convolution theoremto find the inverse
Laplace transformsof
(a) F(s)G(s)
(b) F(s)H(s)
(c) G(s)H(s)
4 Use the convolution theoremto determine the inverse
Laplace transformsof
(a)
1
s 2 (s+1)
(b)
1
(s +3)(s −2)
(c)
1
(s 2 +1) 2
Solutions
1 (a) e −t −e −2t
t 2 (b)
3 − 2t
9 + 2 27 − 2e−3t
27
t 2 t 4
2 (a) (b)
2 12
(c) −t−1+e t (d)t−sint
3 (a)e t −1 (b) et −e −3t/2
4 (a) t−1+e −t (b) e2t −e −3t
(c) sint−tcost
2
5
5
(c) 1 −e−3t/2
3
21.10 SOLVINGLINEARCONSTANTCOEFFICIENT
DIFFERENTIALEQUATIONSUSINGTHELAPLACE
TRANSFORM
SofarwehaveseenhowtofindtheLaplacetransformofafunctionoftimeandhowto
find the inverse Laplace transform. We now apply this to finding the particular solution
ofdifferentialequations.Theinitialconditionsareautomaticallysatisfiedwhensolving
an equation using the Laplace transform. They are contained in the transform of the
derivative terms.
TheLaplacetransformoftheequationisfound.Thistransformsthedifferentialequationintoanalgebraicequation.Thetransformofthedependentvariableisfoundandthen
the inverse transform iscalculated toyield the required particular solution.
Example21.21 Solve the differential equation
dx
dt +x=0
x(0)=3
usingLaplace transforms.
650 Chapter 21 The Laplace transform
Solution The Laplace transformof each termisfound:
L(x) =X(s),
( ) dx
L =sX(s)−x(0)=sX(s)−3
dt
Note that we write X(s) to draw attention to the fact that X is a function of s; X(s)
does not mean X multiplied by s. Taking the Laplace transform of the equation
yields
sX(s)−3+X(s)=0
The equation isrearranged forX(s):
sX(s)+X(s) =3
(s+1)X(s) =3
X(s) = 3
s +1
Taking the inverse Laplace transform of both sides of the equation gives
x(t) =3e −t
Example21.22 Solve
dx
dt +x=9e2t
x(0)=3
using the Laplace transform.
Solution The Laplace transformof both sides of the equation isfound:
sX(s) −x(0) +X(s) = 9
s −2
sX(s)−3+X(s) = 9
s −2
(s +1)X(s) = 9
s −2 +3
3(s +1)
(s +1)X(s) =
s −2
X(s) = 3
s −2
Taking the inverse Laplace transform yields
x(t) =3e 2t
21.10 Solving linear constant coefficient differential equations 651
Engineeringapplication21.3
RLcircuitwithrampinput
Recall from Engineering application 19.3 the differential equation for an RL circuit
with a unit ramp input isgiven by
iR+L di
dt
=t t0 i(0)=0
Usethe Laplace transform tosolve thisequation.
Solution
Let
L(i(t)) =I(s)
Then
( ) di
L =sI(s)−i(0) =sI(s)−0 =sI(s)
dt
From Table 21.1 wesee that
L(t) = 1 s 2
We note thatRandLare constants. We can now take the Laplace transform of the
given equation. This gives
I(s)R +LsI(s) = 1 s 2
This equation issolved forI(s):
I(s)(R +Ls) = 1 s 2
1
I(s) =
s 2 (R +Ls)
InordertotaketheinverseLaplacetransformandhencefindi(t)weexpressI(s)as
the sum of itspartial fractions. The expression
1
s 2 (R +Ls)
has a repeated linear factor,s 2 inthe denominator, giving risetopartialfractions
A
s + B s 2
The linear factor,R+Ls, gives risetoapartialfraction of the form
Hence
C
R+Ls
I(s) =
1
s 2 (R +Ls) = A s + B s + C
2 R+Ls
➔
652 Chapter 21 The Laplace transform
The unknown constants, A, B andC, need to be expressed in terms of the known
constantsRandL. Multiplying the above equation bys 2 (R +Ls) yields
1 =As(R +Ls) +B(R +Ls) +Cs 2 (1)
Lettings = 0 inEquation (1)gives
1=BR
andsoB= 1 R .
Wenotethatwhens = − R L thenR+Lsis0.Hencelettings = −R L inEquation(1)
gives
(
1 =C − R ) 2
fromwhichC = L2
L R 2
Comparing the coefficient ofson both sides of Equation (1) gives
0=AR+BL
A = − BL
R = − L R 2
Substituting the expressions forA,BandC into the expression forI(s)gives
I(s) = − L
R 2 s + 1
Rs 2 + L 2
R 2 (R +Ls)
InreadinessfortakinginverseLaplacetransformswewritethefinaltermasfollows:
L 2
R 2 (R +Ls) = L 2
R 2 L (R/L +s) = L
R 2 (R/L +s)
Hence
I(s) = − L
R 2 s + 1
Rs + L
2 R 2 (R/L +s)
Taking the inverse Laplace transformyields
i(t) = − L R 2 + t R + L R 2 e−(R/L)t t 0
This may berearranged as
i(t) = t R + L R 2 (e−(R/L)t −1)
t 0
Example21.23 Solve
y ′′ −y=−t 2 y(0) =2, y ′ (0) =0
using the Laplace transform.
Solution Let L(y) =Y(s). Thenusingthe resultstated inSection21.5 wehave
L(y ′′ ) =s 2 Y(s) −sy(0) −y ′ (0) =s 2 Y(s) −2s
21.10 Solving linear constant coefficient differential equations 653
We also notethat
L(−t 2 ) = − 2 s 3
The Laplace transform of the differential equation is taken. This yields
s 2 Y(s)−2s−Y(s)=− 2 s 3
The equation isrearranged forY(s):
(s 2 −1)Y(s) = − 2 s 3 +2s
= 2(s4 −1)
s 3
= 2(s2 −1)(s 2 +1)
s 3
Bydividing the equation by (s 2 −1) weobtain
Y(s)= 2(s2 −1)(s 2 +1)
s 3 (s 2 −1)
= 2(s2 +1)
s 3
= 2s2
s 3 + 2 s 3 = 2 s + 2 s 3
Taking the inverse Laplace transform gives
y(t)=2+t 2
Example21.24 Solve
y ′′ +y ′ −2y=−2 y(0)=2,y ′ (0)=1
Solution Let L(y) =Y(s). Then
L(y ′ ) =sY(s) −y(0) =sY(s) −2
L(y ′′ ) =s 2 Y(s) −sy(0) −y ′ (0) =s 2 Y(s) −2s −1
We also notethat
L(−2) = − 2 s
We can now take the Laplace transform of the differential equation toget
and so
s 2 Y(s)−2s−1+sY(s)−2−2Y(s)=− 2 s
(s 2 +s−2)Y(s)=− 2 s +2s+3
(s −1)(s +2)Y(s) = 2s2 +3s−2 (2s −1)(s +2)
=
s s
Dividing the equation by (s −1)(s +2) gives
Y(s)=
(2s −1)(s +2)
s(s −1)(s +2) = 2s−1
s(s −1)
654 Chapter 21 The Laplace transform
The expression,
Y(s)= 1 s + 1
s −1
2s−1
, iswritten as itspartialfractions:
s(s −1)
Taking the inverse Laplace transform yields
y(t)=1+e t
Example21.25 Solve
x ′′ +2x ′ +2x =e −t x(0) =x ′ (0) =0
usingLaplace transforms.
Solution Taking the Laplace transform ofboth sides:
s 2 X(s) −sx(0) −x ′ (0) +2(sX(s) −x(0)) +2X(s) = 1
s +1
Therefore,
(s 2 +2s +2)X(s) = 1 sincex(0) =x ′ (0) =0
s +1
1
X(s)=
(s +1)(s 2 +2s +2) = 1
(s +1)(s −a)(s −b)
wherea = −1 +j,b = −1 −j.Usingpartialfractions gives
X(s)= 1
s +1 − 1 ( 1
2 s −a + 1 )
s −b
Taking the inverse Laplace transform
x(t) =e −t − 1 2 (eat +e bt )
=e −t − 1 2 (e(−1+j)t +e (−1−j)t )
=e −t − e−t
2 (ejt +e −jt )
=e −t − e−t
2 (cost+jsint+cost−jsint)
= e −t −e −t cost
Example21.26 Solve
x ′′ −5x ′ +6x=6t−4 x(0)=1,x ′ (0)=2
Solution TheLaplace transformofboth sides ofthe equation isfound.Let L{x} =X(s).
s 2 X(s) −sx(0) −x ′ (0) −5(sX(s) −x(0)) +6X(s) = 6 s 2 − 4 s
(s 2 −5s +6)X(s) = 6 s 2 − 4 s +s−3=s3 −3s 2 −4s+6
s 2
X(s)= s3 −3s 2 −4s+6
s 2 (s 2 −5s+6)
21.10 Solving linear constant coefficient differential equations 655
= A s + B s + C
2 s −2 + D
s −3
= s3 −3s 2 −4s+6
s 2 (s −2)(s −3)
The constantsA,B,C andDare evaluated inthe usual way:
X(s)= 1 6s + 1 s 2 + 3
2(s −2) − 2
3(s −3)
Taking the inverse Laplace transform yields
x(t) = 1 6 +t + 3 2 e2t − 2 3 e3t
Engineeringapplication21.4
Dischargeofacapacitorthroughaloadresistor
InEngineeringapplication2.8,weexaminedthevariationinvoltageacrossacapacitor,C,
when it was switched in series with a resistor,R, at timet = 0. We stated a
relationshipforthetime-varyingvoltage, v,acrossthecapacitor.Provethisrelationship.
Refer tothe example fordetails ofthe circuit.
Solution
Firstwemustderiveadifferentialequationforthecircuit.UsingKirchhoff’svoltage
law and denoting the voltageacrossthe resistorby v R
weobtain
v+v R
=0
Using Ohm’slaw and denoting the currentinthe circuitbyiweobtain
v+iR=0
For the capacitor,
i =C dv
dt
Combining these equations gives
v+RC dv = 0
dt
We now take the Laplace transformof this equation. Using L{v} =V(s)weobtain
V(s) +RC(sV(s) − v(0)) =0
V(s)(1 +RCs) =RCv(0)
V(s)= RCv(0)
1+RCs = v(0)
1
RC +s
Taking the inverse Laplace transform ofthe equation yields
v = v(0)e −t/(RC)
t 0
This isequivalent tothe relationship statedinEngineering application 2.8.
656 Chapter 21 The Laplace transform
Engineeringapplication21.5
Electronicthermometermeasuringoventemperature
Many engineering systems can be modelled by a first-order differential equation.
Thetimeconstantisameasureoftherapiditywithwhichthesesystemsrespondto
a change in input. Suppose an electronic thermometer is used to measure the temperature
of an oven. The sensing element does not respond instantly to changes in
the oven temperature because it takes time for the element to heat up or cool down.
Provided the electronic circuitry does not introduce further time delays then the differential
equation thatmodels the thermometer isgiven by
τ dv m
+v
dt m
=v o
where v m
=measuredtemperature, v o
=oventemperature, τ =timeconstantofthe
sensor. For convenience the temperature is measured relative to the ambient room
temperature, which forms a ‘baseline’ for temperature measurement.
Suppose the sensing element of an electronic thermometer has a time constant
of 2 seconds. If the temperature of the oven increases linearly at the rate of 3 ◦ Cs −1
startingfromanambientroomtemperatureof20 ◦ Catt = 0,calculatetheresponseof
the thermometer to the changing oven temperature. State the maximum temperature
error.
Solution
Taking Laplace transforms ofthe equation gives
τ(sV m
(s) − v m
(0)) +V m
(s) =V o
(s)
v m
(0) = 0 as the oven temperature and sensor temperature are identical att = 0.
Therefore,
τsV m
(s) +V m
(s) =V o
(s)
V m
(s) = V o (s)
1+τs
(21.1)
Forthisexample,theinputtothethermometerisatemperaturerampwithaslopeof
3 ◦ Cs −1 . Therefore, v o
= 3t fort 0:
V o
(s) = L{v o
(t)} = 3 s 2 (21.2)
Combining Equations (21.1) and (21.2) yields
V m
(s) =
3
s 2 (1+τs) = 3
s 2 (1 +2s)
Then using partialfractions, wehave
V m
(s) = 3 s 2 − 6 s + 12
1+2s
Taking the inverse Laplace transformyields
since τ = 2
v m
=3t−6+6e −0.5t
t0
21.10 Solving linear constant coefficient differential equations 657
This response consists of threeparts:
(1) a decaying transient, 6e −0.5t , which disappears with time;
(2) a ramp,3t, with the same slope as the oven temperature;
(3) a fixed negative temperature error, −6.
Therefore,afterthetransienthasdecayedthemeasuredtemperaturefollowstheoven
temperaturewithafixednegativeerror.Itisinstructivetoobtainthetemperatureerror
by an alternative method. Given thatthe temperature error is v e
, then
and
v e
=v m
−v o
V e
(s) =V m
(s) −V o
(s) (21.3)
Combining Equations (21.1) and (21.3) yields
V e
(s) = V o (s)
1+τs −V o (s)= −τsV o (s)
1+τs
Combining Equations (21.2) and (21.4) yields
V e
(s) = −τs 3
1+τss = −3τ
2 s(1+τs)
The final value theorem can beused tofind the steady-state error:
[ ] −3τs
lim v
t→∞
e
(t) =lim sV e
(s) = lim =−3τ=−6
s→0 s→0 s(1+τs)
(21.4)
that is, the steady-state temperature error is −6 ◦ C. It is important to note that the
final value theorem can only be used if it is known that the time function tends to a
limitast → ∞. Inmany cases engineers know thisisthe case fromexperience.
The Laplace transform technique can also be used to solve simultaneous differential
equations.
Example21.27 Solve the simultaneousdifferential equations
x ′ +x+ y′
=1 x(0)=y(0)=0
2
x ′
2 +y′ +y=0
Solution Take the Laplace transforms of both equations:
sY(s) −y(0)
sX(s) −x(0) +X(s) + = 1 2 s
sX(s) −x(0)
+sY(s)−y(0)+Y(s)=0
2
These arerearranged togive
(s +1)X(s) + sY(s) = 1 2 s
sX(s)
+(s+1)Y(s)=0
2
658 Chapter 21 The Laplace transform
These simultaneous algebraic equations need to be solved for X(s) and Y(s). By
Cramer’s rule(see Section 8.7.2)
∣ s +11/s
s/2 0 ∣ −1/2
Y(s)=
∣ s +1 s/2
=
(s+1) 2 −s 2 /4
s/2 s+1∣
= − 1 { }
1
2 3s 2 /4+2s+1
= − 1 { }
4
2 3s 2 +8s+4
= − 1 { }
4
2 (3s +2)(s +2)
Using partialfractions wefind
Y(s)=− 1 { 3
2 3s+2 − 1 }
s +2
= − 1 { 1
− 1 }
2 s + 2 s +2
3
and hence
Similarly,
and so
y(t) = 1 2 (e−2t −e −2t/3 )
X(s)=
4(s +1)
s(3s +2)(s +2) = 1 s − 1
2 ( 1
) −
s + 2 2(s +2)
3
x(t)=1− 1 2 (e−2t/3 +e −2t )
EXERCISES21.10
1 UseLaplace transformsto solve
(a) x ′ +x=3,
x(0) =1
(b) 3 dx
dt +4x=2,
x(0) =2
(c) 2 dy
dt +4y=1,
y(0) =4
(d) 4 dy
dt +8y=7,
y(0) =6
(e) y ′′ +y ′ +y=1,
y(0) =1,y ′ (0) =3
2 UseLaplacetransformsto solve
(a) x ′′ +x =2t,
x(0) =0,x ′ (0) =5
(b) 2x ′′ +x ′ −x =27cos2t +6sin2t,
x(0) = −1,x ′ (0) = −2
21.11 Transfer functions 659
(c) x ′′ +x ′ −2x=1−2t,
x(0) =6,x ′ (0) = −11
(d) x ′′ −4x = 4(cos2t −1),
x(0) =1,x ′ (0) =0
(e) x ′ −2x−y ′ +2y=−2t 2 +7,
x ′
2 +x+3y′ +y=t 2 +6
x(0) =3,y(0) =6
(f) x ′ +x+y ′ +y=6e t ,
x ′ +2x−y ′ −y=2e −t
x(0) =2,y(0) =1
3 Using Laplace transformsfindthe particular
solution of
d 2 y
dt 2 −5dy dt −6y=14e−t
satisfyingy = 3 and dy
dt =8whent=0.
Solutions
1 (a) x(t)=3−2e −t
(b) 3 2 e−4t/3 + 1 2
1
(c)
4 + 15 4 e−2t
(d) y(t) = 7 8 + 41
8 e−2t
(e) y(t) = 1 +3.464e −0.5t sin0.866t
2 (a) 3sint+2t
(b) −3cos2t +2e −t
(c) t +6e −2t
(d)
e −2t
4 + e2t
4 − 1 2 cos2t+1
(e) x=t 2 +3,y=6−t
(f) x(t) = 6et
5 +2e−t − 6e−3t/2 ,
5
y(t) = 6e−3t/2
5
3 −2te −t + 8e−t
7
−2e −t + 9et
5
+ 13e6t
7
21.11 TRANSFERFUNCTIONS
It is possible to obtain a mathematical model of an engineering system that consists of
one or more differential equations. This approach was introduced in Section 20.4. We
have already seen that the solution of differential equations can be found by using the
Laplace transform. This leads naturally to the concept of a transfer function which will
bedeveloped inthissection.Consider the differential equation
dx(t)
+x(t) = f(t) x(0) =x
dt
0
(21.5)
andassumethatitmodelsasimpleengineeringsystem.Then f (t)representstheinputto
thesystemandx(t)representstheoutput,orresponseofthesystemtotheinput f (t).For
reasonsthatwillbeexplainedbelowitisnecessarytoassumethattheinitialconditions
associatedwiththedifferentialequationarezero.InEquation(21.5)thismeanswetake
x 0
tobe zero.Taking the Laplace transform of Equation (21.5) yields
sX(s)−x 0
+X(s) =F(s)
(1 +s)X(s) =F(s) assumingx 0
=0
so that
X(s)
F(s) = 1
1 +s
The function, X(s)/F(s), is called a transfer function. It is the ratio of the Laplace
transform of the output to the Laplace transform of the input. It is often denoted by
660 Chapter 21 The Laplace transform
G(s). Therefore, forEquation (21.5),
G(s) = 1
1 +s
The concept of a transferfunction isvery usefulinengineering. Itprovides a simple
algebraic relationship between the input and the output. In other words, it allows the
analysisofdynamicsystemsbasedonthedifferentialequationtoproceedinarelatively
straightforward manner. Earlier we noted that it was necessary to assume zero initial
conditions in order to form the transfer function. Without such an assumption, the relationship
between the input and the output would have been more complicated. What
is more, the relationship would vary depending on how much energy is stored in the
system at t = 0. By assuming zero initial conditions, the transfer function depends
purely on the system characteristics. Such an approach, whilst very convenient, does
have its limitations. The transfer function approach is useful for analysing the effect of
aninputtothesystem.However,ifonerequirestheeffectofthecombinationofasystem
inputand initial conditions,thenitisnecessarytocarryoutafullsolution ofthedifferential
equation as we did in Section 21.10. In practice there are many cases where the
simplified treatment provided by the transfer function is perfectly acceptable. An alternativeapproachisprovidedbystate-spacemodels,whichweexaminedinSection20.4.
Theseprovideanaturalwayofhandlinginitialconditions.Inordertosolveastate-space
model wherethereareinitialconditions,allthatisnecessaryistouseanon-zeroinitial
valueofthestatevector.RecallthatinSection20.4weonlysetupthestate-spacemodels
and did not solve them. However, the method for the standard solution of a state-space
model can befound inmany advanced textbooks on control and systems engineering.
Example21.28 Findthetransferfunctionsofthefollowingequationsassumingthat f (t)representsthe
inputandx(t) represents the output:
(a)
(b)
dx(t)
dt
d 2 x(t)
dt 2
−4x(t) =3f(t), x(0) =0
+3 dx(t)
dt
−x(t) = f(t),
dx(0)
dt
=0, x(0)=0
Solution (a) Taking Laplace transforms ofthe differential equation gives
sX(s) −x(0) −4X(s) =3F(s)
(s −4)X(s) =3F(s) asx(0) =0
X(s)
F(s) =G(s) = 3
s −4
(b) Taking Laplace transforms ofthe differential equation gives
s 2 X(s) −sx(0) − dx(0) +3(sX(s) −x(0)) −X(s) =F(s)
dt
(s 2 +3s −1)X(s) =F(s) as dx(0) =0 and x(0)=0
X(s)
F(s) =G(s) = 1
s 2 +3s−1
dt
Whencreatingamathematicalmodelofanengineeringsystemitisoftenconvenientto
thinkofthevariableswithinthesystemassignalsandelementsofthesystemasmeans
21.11 Transfer functions 661
R(s)
G(s)
Y(s)
Figure21.3
Therelationship
Y(s) =G(s)R(s)holds
forasingle block.
by which these signals are modified. The word signal is used in a very general sense
and is not restricted to, say, voltage. On this basis each of the elements of the system
can be modelled by a transfer function. A transfer function defines the relationship between
an input signal and an output signal. The relationship is defined in terms of the
Laplace transforms of the signals. The advantage of this is that the rules governing the
manipulation of transfer functions are then of a purely algebraic nature. Consider
Figure 21.3. If
then
R(s) = L{r(t)} = Laplace transform of the input signal
Y(s) = L{y(t)} = Laplace transformof the output signal
G(s) = transferfunction
Y(s) =G(s)R(s)
Transfer functions are represented schematically by rectangular blocks, while signals
are represented as arrows. Engineers often speak of the time domain and the s domain
in order to distinguish between the two mathematical representations of an engineering
system. However, it is important to emphasize the equivalence between the
two domains.
Oftenwhenconstructingamathematicalmodelofasystemusingtransferfunctions,
it is convenient first to obtain transfer functions of the elements of the system and then
combinethem.Beforetheoveralltransferfunctioniscalculatedablockdiagramisdrawn
which shows the relationship between the various transfer functions. Block diagrams
consistof threebasic components. These are shown inFigure 21.4.
R(s)
G(s)
Y(s)
R(s)
+
–
R(s) – X(s)
Y(s)
Y(s)
X(s)
(a) (b) (c)
Figure21.4
Thethree components ofblock diagrams. (a)Abasic block; the block contains a
transferfunction which relatesthe input and outputsignals. (b)Asummingpoint.
(c) Atake-off point.
We have already examined the basic block which is governed by the relationship
Y(s) =G(s)R(s).Asummingpointaddstogethertheincomingsignalstothesumming
point and produces an outgoing signal. The polarity of the incoming signals is denoted
bymeansofapositiveornegative sign.Therecanbeseveral incomingsignalsbutonly
one outgoing signal. A take-off point is a point where a signal is tapped. This process
of tapping the signal has no effect on the signal value; that is, the tap does not load the
original signal. There are several rules governing the manipulation of block diagrams.
Only two will beconsidered here.
21.11.1 Rule1.Combiningtwotransferfunctionsinseries
Consider Figure 21.5. The following relationships hold:
X(s) =G 1
(s)R(s)
Y(s) =G 2
(s)X(s)
Y(s)
662 Chapter 21 The Laplace transform
R(s)
G 1 (s)
X(s)
G 2 (s)
Y(s)
R(s)
G 1 (s) G 2 (s)
Y(s)
Figure21.5
Two blocksin series.
Figure21.6
Figure21.5issimplifiedtoasingleblock.
EliminatingX(s) from these equations yields
Y(s) =G 1
(s)G 2
(s)R(s)
Y(s)
R(s) =G 1 (s)G 2 (s)
Finally the overall transfer function, G(s), is given by G 1
(s)G 2
(s) as shown in
Figure 21.6.
Fortwo transferfunctions inseries the overall transferfunction isgiven by
G(s) =G 1
(s)G 2
(s)
21.11.2 Rule2.Eliminatinganegativefeedbackloop
Consider Figure 21.7 which shows a negative feedback loop. It is so called because the
outputsignalis‘fedback’andsubtractedfromtheinputsignal.Suchloopsarecommon
inavarietyofengineeringsystems.ThequantitiesX 1
(s)andX 2
(s)representintermediate
signals in the system. We wish to obtain an overall transfer function for this system
relatingY(s)andR(s). For the two transferfunctions the following hold:
Y(s) =G(s)X 2
(s) (21.6)
X 1
(s) =H(s)Y(s) (21.7)
For the summingpoint
X 2
(s) =R(s) −X 1
(s) (21.8)
Combining Equations(21.7)and (21.8) gives
X 2
(s) =R(s) −H(s)Y(s) (21.9)
Combining Equations(21.6)and (21.9) gives
Y(s) =G(s)(R(s) −H(s)Y(s)) =G(s)R(s) −G(s)H(s)Y(s)
Y(s)(1 +G(s)H(s)) =G(s)R(s)
Y(s)=
G(s)R(s)
1 +G(s)H(s)
Theoverall transferfunctionforanegative feedback loop isgiven by
Y(s)
R(s) = G(s)
1 +G(s)H(s)
21.11 Transfer functions 663
R(s) +
–
X 2 (s)
G(s)
Y(s)
X 1 (s)
H(s)
R(s)
G(s)
1 + G(s)H(s)
Y(s)
Figure21.7
Block diagram foranegative feedback loop.
Figure21.8
Simplifiedblock diagram foranegative
feedback loop.
The simplified block diagram for a negative feedback loop isshown inFigure 21.8.
A complicated engineering system may be represented by many differential equations.
The output from one part of the system may form the input to another part. Consider
the following example.
Example21.29 Asystemismodelled bythe differential equations
x ′ +2x=f(t) (21.10)
2y ′ −y =x(t) (21.11)
In Equation (21.10) the input is f (t) and the output is x(t). In Equation (21.11), x(t)
is the input andy(t) is the final output of the system. Find the overall system transfer
functionassumingzero initial conditions.
Solution The output from Equation (21.10) isx(t); this forms the input to Equation (21.11). The
blockdiagramsforEquations(21.10)and(21.11)arecombinedintoasingleblockdiagramasshown
inFigure 21.9.
UsingRule 1,the overall systemtransferfunction can thenbefound:
G(s) = Y(s)
F(s) = 1
(s +2)(2s −1)
This transferfunction relatesY(s)andF(s)(see Figure 21.10).
F(s)
1
X(s)
1
Y(s)
s + 2
2s – 1
F(s)
1
(s + 2)(2s – 1)
Y(s)
Figure21.9
Combinedblock diagram for
Equations(21.10)and (21.11).
Figure21.10
Theoverall systemtransfer function.
Example21.30 Asystemisrepresentedby the differential equations
2x ′ −x=f(t)
y ′ +3y =x(t)
z ′ +z=y(t)
The initial input is f (t) and the final output is z(t). Find the overall system transfer
function, assumingzeroinitial conditions.
664 Chapter 21 The Laplace transform
Solution The transfer function for each equation is found and combined into one block diagram
(see Figure 21.11). The three blocks are simplified to a single block as shown in Figure
21.12. The overall systemtransfer function is
G(s) = Z(s)
F(s) = 1
(2s −1)(s +3)(s +1) .
F(s)
1
2s – 1
X(s)
1
s + 3
Y(s)
1
s + 1
Z(s)
F(s)
1
(2s – 1)(s + 3)(s + 1)
Z(s)
Figure21.11
Block diagram forthe system given in
Example 21.30.
Figure21.12
Simplified block diagram for
Example 21.30.
Engineeringapplication21.6
Transportlaginasystemtosupplyfueltoafurnace
Transportlagisatermusedtodescribethetimedelaythatmaybepresentincertain
engineering systems. A typical example would be a conveyor belt feeding a furnace
with coal supplied by a hopper (see Figure 21.13). The amount of fuel supplied to
the furnace can be varied by varying the opening at the base of the hopper but there
is a time delay before this changed quantity of fuel reaches the furnace. The time
delaydependsonthespeedandlengthoftheconveyor.Mathematically,thefunction
describing the variation in the quantity of fuel entering the furnace is a time-shifted
version of the function describing the variation in the quantity of fuel placed on the
conveyor (see Figure 21.14).
Let
u(t)q(t) = quantity offuelplacedon the conveyor,
whereu(t) isthe unit stepfunction,
Coal
Conveyor velocity, y
l
Conveyor
Furnace
Figure21.13
Coalisfed intothe furnacevia the conveyor
belt.
21.11 Transfer functions 665
Fuel variation
(mass of fuel
per unit length
of conveyor,
kgm –1 )
d
q(t) q(t – d)
Q(s)
e –sd
Q d (s)
Time (s)
Figure21.14
Thetimedelay,d,isintroducedbytheconveyorbelt.
Figure21.15
Block diagram forthe
conveyor belt.
u(t −d)q(t −d) = quantity of fuelentering the furnace,
d = time delay introduced by the conveyor,
l = length of conveyor (m),
v = speed of conveyor (ms −1 ).
The input to the conveyor is u(t)q(t). The output from the conveyor is
u(t −d)q(t −d). If the conveyor is moving at a constant speed then v = l d and
sod= l . The transfer function that models the conveyor belt can be obtained by
v
using the second shift theorem:
Q d
(s) = L{u(t −d)q(t −d)} = e −sd L{q(t)} = e −sd Q(s)
The transfer function forthe conveyor isshown inFigure 21.15.
Transport lags can cause difficulty when trying to control a system because of
the delay between taking a control action and its effect being felt. In this example,
increasingthequantityoffuelontheconveyordoesnotleadtoanimmediateincrease
in fuel entering the furnace. The difficulty of controlling the furnace temperature
increases as the transport lag introduced by the conveyor increases.
Engineeringapplication21.7
Positioncontrolsystem
There are many examples of position control systems in engineering, for example
controlofthepositionofaplotterpen,andcontrolofthepositionofaradiotelescope.
Thecommon termforthesesystems is servo-systems.
Consider the block diagram of Figure 21.16 which represents a simple servosystem.
The actual position of the motor is denoted by a
(s) in the s domain and
θ a
(t)inthetimedomain.Thedesiredpositionisdenotedby d
(s)and θ d
(t),respectively.
The system is a closed loop with negative feedback. The difference between
the desired and the actual position generates an error signal which is fed to a controllerwithgainK.Theoutputsignalfromthecontrollerisfedtoaservo-motorand
its associated drive circuitry. The aim of the control system is to maintain the actual
position of the motor at a value corresponding to the desired position. In practice, if ➔
666 Chapter 21 The Laplace transform
Q d (s)
+
–
K
0.5
s(s + 1)
Controller Servo-motor +
drive amplifier
Q a (s)
Figure21.16
Position control system.
anewdesiredpositionisrequestedthenthesystemwilltakesometimetoattainthis
new position. The engineer can choose a value of the controller gain to obtain the
besttypeofresponsefromthecontrolsystem.Wewillexaminetheeffectofvarying
K on the response of the servo-system.
We can use Rules 1 and 2 to obtain an overall transfer function for the system.
The forward transfer function is
G(s) = 0.5K by Rule 1
s(s +1)
The overall transferfunction a (s) isobtained by Rule 2 withH(s) = 1.So,
d
(s)
0.5K
a
(s)
d
(s) = s(s +1) 0.5K
1 + 0.5K(1) =
s(s +1) +0.5K = 0.5K
s 2 +s+0.5K
s(s +1)
Let us now examine the effect of varying K. We will consider three values,
K = 0.375,K = 0.5,K = 5, and examine the response of the system to a unit step
input ineach case.
ForK = 0.375
a
(s)
d
(s) = 0.1875
s 2 +s+0.1875
With d
(s) = 1 s , then
a
(s) =
usingpartialfractions. So,
0.1875
(s 2 +s+0.1875)s = 1 s + 0.5
s +0.75 − 1.5
s +0.25
θ a
(t) = 1 +0.5e −0.75t −1.5e −0.25t t 0
This is shown in Figure 21.17. Engineers usually refer to this as an overdamped
response. The response does notovershoot the final value.
ForK = 0.5
a
(s)
d
(s) = 0.25
s 2 +s+0.25
a
(s) =
0.25
s(s 2 +s +0.25) = 1 s − 1
s+0.5 − 0.5
(s +0.5) 2
θ a
(t) = 1 −e −0.5t −0.5te −0.5t t 0
21.11 Transfer functions 667
u a (t)
K = 5
K = 0.5
K = 0.375
t
Figure21.17
Time response forvariousvalues ofK.
This is shown in Figure 21.17 and is termed acritically damped response. It corresponds
tothe fastestrisetime of the systemwithoutovershooting.
ForK=5
a
(s)
d
(s) = 2.5
s 2 +s+2.5
2.5
a
(s) =
s(s 2 +s +2.5)
Rearranging toenable standard formstobe inverted gives
a
(s) = 1 s −
s+0.5
(s +0.5) 2 +1.5 − 0.5
2 (s +0.5 2 ) +1.5 2
θ a
(t) = 1 −e −0.5t cos1.5t − 1 3 e−0.5t sin1.5t
The trigonometric terms can be expressed as a single sinusoid using the techniques
given inSection 3.7.Thus,
θ a
(t) = 1 −1.054e −0.5t sin(1.5t +1.249) for t 0
ThisisshowninFigure21.17andistermedanunderdampedresponse.Thesystem
overshoots its final value.
Inapracticalsystemitiscommontodesignforsomeovershoot,provideditisnot
excessive,asthisenablesthedesiredvaluetobereachedmorequickly.Itisinteresting
tocomparethesystemresponseforthethreecaseswiththenatureoftheirrespective
transfer function poles. For the overdamped case the poles are real and unequal, for
the critically damped case the poles are real and equal, and for the underdamped
casethepolesarecomplex.Engineersrelyheavilyonpolepositionswhendesigning
a system to have a particular response. By varying the value of K it is possible to
obtain a range of system responses and corresponding pole positions.
EXERCISES21.11
1 Find the transfer functionforeachofthe following
equationsassuming zeroinitialconditions:
(a) x ′′ +x=f(t)
(b) 2 d2 x
dt 2 + dx
dt −x=f(t)
(d)
(c) 2 d2 y
dt 2 +3dy +6y =p(t)
dt
3 d3 y y
dt 3 +6d2 dt 2 +8dy +4y =g(t)
dt
(e) 6 d2 y
dt 2 +8dy dt +4y=3df dt +4f,
f isthe systeminput
(f) 6x ′′′ +2x ′′ −x ′ +4x = 4f ′′ +2f ′ +6f,fisthe
systeminput
668 Chapter 21 The Laplace transform
Solutions
1 (a)
(c)
1
s 2 +1
1
2s 2 +3s+6
1
(b)
2s 2 +s−1
1
(d)
3s 3 +6s 2 +8s+4
(e)
(f)
3s+4
6s 2 +8s+4
4s 2 +2s+6
6s 3 +2s 2 −s+4
21.12 POLES,ZEROSANDTHEsPLANE
Mosttransferfunctionsforengineeringsystemscanbewrittenasrationalfunctions:that
is, asratiosoftwo polynomials ins, with a constantfactor,K:
G(s) =K P(s)
Q(s)
P(s) is of order m, and Q(s) is of order n; for a physically realizable system m < n.
HenceG(s) may bewritten as
G(s) = K(s−z 1 )(s−z 2 )...(s−z m )
(s−p 1
)(s−p 2
)...(s−p n
)
The values ofsthat makeG(s) zero are known as the system zeros and correspond to
the roots ofP(s) = 0, that iss =z 1
,z 2
,...,z m
. The values ofsthat makeG(s) infinite
are known as the system poles and correspond to the roots of Q(s) = 0, that is s =
p 1
,p 2
,...,p n
. As we have seen, poles may be real or complex. Complex poles always
occur incomplex conjugate pairs whenever the polynomialQ(s)has real coefficients.
Engineersfinditusefultoplotthesepolesandzerosonansplanediagram.Acomplex
planeplotisusedwith,conventionally,arealaxislabelof σ andanimaginaryaxislabel
of jω. Poles are marked as crosses and zeros are marked as small circles. Figure 21.18
shows ansplane plot for the transfer function
G(s) =
3(s −3)(s +2)
(s+1)(s+1+3j)(s+1−3j)
Thebenefitofthisapproachisthatitallowsthecharacterofalinearsystemtobedetermined
by examining thesplane plot. In particular, the transient response of the system
can easilybe visualized by the number and positions of the systempoles and zeros.
jv
3 3
s plane
3
–2–1 3 s
3–3
Figure21.18
Polesand zerosplotted forthe transferfunction:
G(s) =
3(s −3)(s +2)
(s+1)(s+1+3j)(s+1−3j) .
21.12 Poles, zeros and thesplane 669
Example21.31 Find the poles of
s −2
(s 2 +2s +5)(s +1) .
Solution The denominator isfactorized into linear factors:
(s 2 +2s+5)(s+1)= (s−p 1
)(s−p 2
)(s−p 3
)
where p 1
= −1 +2j, p 2
= −1 −2j, p 3
= −1.The poles are −1 +2j,−1 −2j,−1.
IfX(s)= 1
s−p 1
where the pole p 1
isgiven bya +bj,then
x(t) =e p 1 t =e at e btj = e at (cosbt +jsinbt)
Hence the real part of the pole,a, gives rise to an exponential term and the imaginary
part,b,givesrisetoanoscillatoryterm.Ifa < 0theresponse,x(t),willdecreasetozero
ast→∞.
ConsidertheLaplacetransforminExample21.25.Therearethreepoles: −1, −1+j
and −1 − j. The real pole is negative, and the real parts of the complex poles are also
negative.Thisensurestheresponse,x(t),decreaseswithtime.Theimaginarypartofthe
complex poles gives rise to the oscillatory term, cost. The characteristics of poles and
the corresponding responses arenow discussed.
GivenasystemwithtransferfunctionG(s),inputsignalR(s)andoutputsignalC(s),
then
C(s)
R(s) =G(s)
thatis,
and so
C(s) =G(s)R(s)
C(s) = K(s−z 1 )(s−z 2 )...(s−z m )R(s)
(s−p 1
)(s−p 2
)...(s−p n
)
The poles and zeros of the system are independent of the input that is applied. All that
R(s)contributes tothe expression forC(s)isextra poles and zeros.
Consider the case whereR(s) = 1 , corresponding toaunit stepinput:
s
C(s) = K(s−z 1 )(s−z 2 )...(s−z m )
(s−p 1
)(s−p 2
)...(s−p n
)s
= A 1
+ A 2
+···+
A n
+ B 1
s−p 1
s−p 2
s−p n
s
whereA 1
,A 2
,...,A n
andB 1
areconstants. Taking inverse Laplace transformsyields
c(t) =A 1
e p 1 t +A 2
e p 2 t +···+A n
e p n t +B 1
If the system is stable then p 1
,p 2
,...,p n
will have negative real parts and their contribution
toc(t) will vanish ast → ∞.
Theresponsecausedbythesystempolesisoftencalledatransientresponsebecause
itdecreaseswithtimeforastablesystem.Thecomponent ofthetransientresponsedue
to a particular pole is often termed its transient. Notice that the form of the transient
responseisindependentofthesysteminputandisdeterminedbythenatureofthesystem
poles. It is now possible to derive a series of rules relating the transient response of the
systemtothe positions of the systempoles inthesplane.
670 Chapter 21 The Laplace transform
jv
s plane
Transient
response
3
3
s
t
Figure21.19
A polewith anegative real part leads to a decaying transientwhile a
pole with a zeroreal partleads to atransientthat doesnotdecay with
time.
jv
s plane
Transient
response
3
3
s
t
21.12.1 Rule1
Figure21.20
The furtherapole isfrom the imaginary axis, the quicker the decay of
its transient.
Thepolesmaybeeitherrealorcomplexbutforaparticularpoleitisnecessaryforthereal
parttobenegativeifthetransientcausedbythatpoleistodecaywithtime.Otherwisethe
transient response will increase with time and the system will be unstable, a condition
engineers usually design to avoid. In simple terms this means that the poles of a linear
system must all lie in the left half of the s plane for stability. Poles on the imaginary
axisleadtomarginal stabilityasthetransientsintroduced by suchpoles donotgrowor
decay. This isillustratedinFigure 21.19.
21.12.2 Rule2
21.12.3 Rule3
The further a pole is to the left of the imaginary axis the faster its transient decays (see
Figure 21.20). This is because its transient contains a larger negative exponential term.
Forexample,e −5t decaysfasterthane −2t .Thepolesneartotheimaginaryaxisaretermed
the dominant poles as their transients take the longest to decay. It is quite common for
engineers to ignore the effect of poles that are more than five or six times further away
from the imaginary axis than the dominant poles.
For a real system, poles with imaginary components occur as complex conjugate pairs.
The transient resulting from this pair of poles has the form of a sinusoidal term
21.12 Poles, zeros and thesplane 671
3
jv
s plane
Transient
response
3
s
t
Figure21.21
Apairofstable complex poles gives riseto a decaying sinusoid
transient.
multiplying an exponential term. For a pair of stable poles, that is negative real part,
the transient can be sketched by drawing a sinusoid confined within a ‘decaying exponentialenvelope’(seeFigure21.21).Thereasonforthisisthatwhenthesinusoidalterm
has a value of 1 the transient touches the decaying exponential. When the sinusoidal
term has a value of −1 the transient touches the reflection in thet axis of the decaying
exponential. The larger the imaginary component of the pair of poles, the higher the
frequency ofthe sinusoidal term.
It should now be clear how useful a concept thesplane plot is when analysing the
response of a linear system. A complex system may have many poles and zeros but by
plotting them on the s plane the engineer begins to get a feel for the character of the
system. The form of the transients relating to particular poles or pairs of poles can be
obtainedusingtheaboverules.Themagnitudeofthetransients,thatisthevaluesofthe
coefficientsA 1
,A 2
,...,A n
, depends on the systemzeros.
Itcanbeshownthathavingazeroneartoapolereducesthemagnitudeofthetransient
relatingtothatpole.Engineersoftendeliberatelyintroducezerosintoasystemtoreduce
the effect of unwanted poles. If a zero coincides with the pole, it cancels it and the
transient corresponding tothatpole iseliminated.
Engineeringapplication21.8
Asymptotic Bode plot of a transfer function with real poles
andzeros
Recall that the Bode magnitude plot of a simple linear circuit was examined in
Engineeringapplication2.14.Itispossibletoconstructaplotofamorecomplicated
transferfunctionby following somesimple steps.
Hereweconsideratransferfunction with only real polesand zeros
( ) ( )···( )
s −z1 s −z2 s −zm
G(s) =K
s ( ) ( )···( )
s −p 1 s −p2 s −pn
wherez 1
,z 2
... are the systemzeros and p 1
,p 2
... are the system poles.
Thefirststageistoobtainanexpressionforthefrequencyresponseofthesystem
by substitutings = jω
( )( ) )
jω −z1 jω −z2 ···(
jω −zm
G (jω) =K
jω ( )( ) )
jω −p 1 jω −p2 ···(
jω −pn
➔
672 Chapter 21 The Laplace transform
We note that ( ( )
) jω
jω −z 1 can be written as −z1 +1 . Applying this process
−z 1
toeach bracket inthe numerator and denominator yields
( )( )···( )
−z1 −z2 −zm
G (jω) =K ( )( ) )
−p1 −p2 ···(
−pn
( ) ( ) ( )
jω jω jω
+1 +1 ... +1
−z
× 1
−z 2
−z
( ) ( ) ( m
)
jω jω jω
jω +1 +1 ... +1
−p 1
−p 2
−p n
The Bode plot isalogarithmic plot of20 log 10
|G(jω)|.So we wishtoplot
( ) ( ) )
−z1 −z2 ···(
−zm
∣ ∣∣∣∣
20log 10
|G (jω)| = 20log 10
K ( )( ) )
−p1 −p2 ···(
−pn
( ) ( ) ( )
jω jω jω
+1 +1 ... +1
−z
· 1
−z 2
−z
( m ) ( ) ( )
jω jω jω
jω +1 +1 ... +1
∣
−p 1
−p 2
−p n
Using the laws of logarithms (page 86) this can be rewritten as
20log 10
|G (jω)|
∣ ∣∣∣∣
( ) ( ) )
−z1 −z2 ···(
−zm = 20log 10
K ( )( ) )
−p1 −p2 ···(
−pn ∣ +20log jω
10 ∣ +1
−z ∣
1
∣ ∣∣∣ jω
+20log 10
+1
−z ∣ +···+20log jω
10∣
+1
2
−z ∣
m
∣ ∣∣∣ jω
−20log 10
|jω| −20log 10
+1
−p ∣
1
∣ ∣∣∣ jω
−20log 10
+1
−p ∣ −···−20log jω
10∣
+1
2
−p ∣
n
Inthisform the contribution ∣ of individual poles and zeros can bestudied.
∣∣∣∣
( ) ( ) )
−z1 −z2 ···(
−zm Theterm20log 10
K ( )( ) )
−p1 −p2 ···(
−pn ∣ isconstant.Ithasnodependenceon
frequency, ω.
Theterm20log 10
|jω|hasavalueof0at ω = 1.At ω = 10ithasavalueof20,at
ω = 100 it has a value of 40, and so on. At ω = 0.1 it has a value of −20, at 0.01, a
value of −40, and so on. If we plot this term by itself using a logarithmic frequency
scaleitwouldthereforebeastraightlinepassingthroughthepointwhere ω = 1with
a gradient of 20 dBfor ∣ each decade, or factor of 10, increase infrequency.
∣∣∣ jω
Theterm20log 10
+1
−z ∣ canbeexaminedbyconsideringtheinfluenceatdif-
1
∣ ∣ ferent frequencies. If
ω ∣∣∣ ∣∣∣ jω
∣ ≪ 1, then 20log
z 10
+1
1
−z ∣ ≈ 20log 10
|1| = 0. So the
1
termhas very littleinfluence.
21.12 Poles, zeros and thesplane 673
∣ ∣ Now consider
ω ∣∣∣ ∣∣∣ jω
∣ = 10. Then 20log
z 10
+1
1
−z ∣ ≈ 20log 10
|j10| = 20. It can
∣ 1 be shown by successively considering
ω ∣∣∣
∣ = 100, 1000, 10000,... that for every
z 1
decademultiplicationinfrequency(forfixedz 1
)thetermincreasesbyanother20dB.
Graphically,itapproximatestoastraightlinethatincreasesat20dBforeverydecade
(×10)changeinfrequency.Thepointatwhichthisstraightlineintersectsthe ω axis
iscalled thebreakpoint. ∣ ∣∣∣ jω
The term −20log 10
+1
−p ∣ and others like it have similar properties to those
1
discussed for the term involving zeros. The only difference is the straight line falls
by 20 dBfor every decade multiplication infrequency away from the breakpoint.
By considering the addition of the terms discussed above we can produce an approximation
to the Bode plot. These approximate lines are asymptotes to which the
actual plots tend as one moves away from the breakpoints. Hence the term asymptoticBode
diagramisused.
Consider a practical example of finding the poles and zeros, and sketching the
transfer function
G(s) =
5000 (s +3)
s (s +100)(s +500)
Firstlywenote the position of the poles and zeros by observing that
G(s) =
5000 (s − (−3))
s (s − (−100)) (s − (−500))
There is a single zero atz 1
= −3. There are three poles at p 1
= 0,p 2
= −100,
p 3
= −500. The pole at0isusually termed thepole at the origin.
The substitutions = jω is made togive
G(jω) =
5000 (jω +3)
jω (jω +100)(jω +500)
The equation isrearranged as
G(jω) =
5000 ×3
100 ×500 ·
( jω
3 +1 )
( ) ( ) jω jω
jω
100 +1 500 +1
The constant termcontributes a gain of
∣ ∣∣∣∣
( )( ) )
−z1 −z2 ···(
−zm 20log 10
K ( )( )···( )
−p1 −p2 −pn ∣ = 20log 5000 ×3
10 ∣100 ×500∣ = 20log 10 |0.3|
The zero contributes a gain of
∣ ∣ ∣∣∣ jω
20log 10
+1
−z ∣ = 20log jω ∣∣∣
10 ∣
1
3 +1
This is approximated by a straight line on the Bode diagram starting at ω = 3 and
increasing at20dB per decade.
➔
674 Chapter 21 The Laplace transform
The pole atthe origincontributes a gain of
−20log 10
|jω|
This is plotted as a straight line with no start or end points, with a slope of −20dB
per decade passing through ω = 1.
The two other poles contribute a gain of
∣ ∣ ∣∣∣ jω
−20log 10
+1
−p ∣ = −20log jω ∣∣∣
10 ∣
1
100 +1
and
∣ ∣ ∣∣∣ jω
−20log 10
+1
−p ∣ = −20log jω ∣∣∣
10 ∣
2
500 +1
respectively.Thefirstoftheseisapproximatedbyalinewithslope−20dBperdecade
startingat ω = 100, the second has the same gradient and startsat ω = 500.
TheindividualcontributionstotheoverallplotareshowninFigure21.22.Notice
thestartingpointsoftheasymptotes--thebreakpoints--andtheircorrespondenceto
the positions of the poles and zeros.
60
40
20 log 10 jv
3
dB
20
0
–20
–40
–20 log 10 |jv|
20 log 10 |0.3|
+ 1 –20 log 10 jv
500 + 1
–20 log 10 jv
100 + 1
–60
–80
0.1 1 10 100 1000 10 4
Figure21.22
Contributionsofthe individual terms to the Bode plot.Dottedlinesare the asymptotic
approximations.
v
The asymptotes can now be added together to form a combined graph which is
shown in Figure 21.23. Also plotted on the same graph is an exact Bode plot of the
transferfunction obtained usingacomputer.
Notice that the largest error in the approximation occurs at the breakpoints,
whereas the two graphs agree well at other frequencies. The reason for the error
is most apparent when we consider the case where ω = −z 1
, for example. The gain
atthisbreak frequency according tothe lineplottedshould be
20log 10
∣ ∣∣∣ −jz 1
−z 1
∣ ∣∣∣
= 20log 10
|j| = 0
21.13 Laplace transforms of some special functions 675
0
20
dB
40
60
80
0.1 1 10 100 1000 10 4
v
Figure21.23
Comparisonbetween the asymptotic Bodeplotand the exact frequency response. Thedotted
line isthe result ofaddingthe asymptotic approximations together at each frequency point.
whereas the exact value is
∣ ∣∣∣ −jz
20log 1
10
+1
−z ∣ = 20log 10 |j +1| = 20log 10 ∣ √ 2∣ ∼ = −3.01
1
Fortunately this is the worst-case error for a single pole or zero and if a more
accuratesketchisrequireditispossibletousethisfacttoimproveontheasymptotic
approximationofthetransferfunction.EngineersfinditusefultosketchaBodeplot
inordertogeta‘feel’forthefrequencyresponseofasystem.Ifamoreaccurateplot
isneeded then itispossible touse a computer package toobtain it.
21.13 LAPLACETRANSFORMSOFSOMESPECIALFUNCTIONS
In this section we apply the Laplace transform to the delta function and periodic functions.These
functions wereintroducedinChapter2.
21.13.1 Thedeltafunction, δ(t−d)
Recallthe integral propertyofthe deltafunction(see Section16.4),
∫ ∞
−∞
f(t)δ(t −d)dt = f(d)
The Laplace transform follows from this integral property. Let f (t) = e −st so that
f(d) = e −sd .Then
thatis,
∫ ∞
−∞
e −st δ(t −d)dt =
L{δ(t −d)} = e −sd
∫ ∞
0
e −st δ(t −d)dt = f(d) =e −sd
676 Chapter 21 The Laplace transform
The Laplace transformof δ(t) follows by settingd = 0:
L{δ(t)} =1
Example21.32 For a particular circuititcan be shown that the transferfunction,G(s), isgiven by
G(s) = V o (s)
V i
(s) = 1
s +2
where V i
(s) and V o
(s) are the Laplace transforms of the input and output voltages
respectively.
(a) Find v o
(t) when v i
(t) = δ(t).
(b) Usethe convolution theorem tofind v o
(t) when
{
e
−t
t0
v i
(t) =
0 t<0
Solution (a) We aregiven the transfer function
G(s) = V o (s)
V i
(s) = 1
s +2
When v i
(t) = δ(t),V i
(s) = 1andhence
and
V o
(s) = 1
s +2
{ } 1
v o
(t) = L −1 = e −2t
s +2
Thisisknownastheimpulseresponseofthesystem,g(t).Iftheimpulseresponse,
g(t),isknownthentheresponse, v o
(t),toanyotherinput, v i
(t),canbeobtainedby
convolution, thatis v o
(t) =g(t) ∗ v i
(t).
(b) The response toan input, v i
(t) =u(t)e −t , isgiven by
v o
(t) =g(t)∗v i
(t) =
=
∫ t
0
∫ t
0
e −2λ e −(t−λ) dλ
∫ t
= e −t e −λ dλ
0
[ ] e
= e −t −λ t
−1
0
=e −t (1−e −t )
= e −t −e −2t
g(λ)v i
(t − λ)dλ
21.13.2 Periodicfunctions
21.13 Laplace transforms of some special functions 677
Recallthedefinitionofaperiodicfunction, f (t).GivenT > 0, f (t)isperiodicif f (t) =
f (t +T ) for all t in the domain. If f (t) is periodic and we know the values of f (t)
over a period, then we know the values of f (t) over its entire domain. Hence, it seems
reasonable that the Laplace transform of f (t) can be found by studying an appropriate
integral over an interval whose length is just one period. This is indeed the case and
formsthe basis of the following development.
Let f (t)be periodic, with periodT. The Laplace transformof f (t)is
L{f(t)} =F(s) =
=
∫ T
0
∫ ∞
0
e −st f (t)dt
∫ 2T ∫ 3T
e −st f(t)dt + e −st f(t)dt + e −st f(t)dt+···
T
2T
Lett =xinthefirstintegral,t =x +T inthesecond,t =x +2T inthethirdandsoon.
F(s) =
∫ T
0
∫ T
+
0
∫ T
e −sx f(x)dx +
0
e −s(x+T ) f(x+T)dx
e −s(x+2T ) f(x+2T)dx+···
Since f isperiodic with periodT then
So,
f(x)=f(x+T)=f(x+2T)=···
F(s) =
∫ T
0
∫ T
∫ T
e −sx f(x)dx + e −sT e −sx f(x)dx + e −2sT e −sx f(x)dx + ···
0
0
=(1+e −sT +e −2sT +···)
∫ T
0
e −sx f(x)dx
We recognize the terms in brackets as a geometric series whose sum to infinity
1
is . Hence,
1 −e−sT F(s) =
∫ T
0 e−st f (t)dt
1 −e −sT
Example21.33 Awaveform, f (t), isdefinedasfollows:
f(t)=
{ 2 0<t1.25
0 1.25<t1.5
and f (t)isperiodicwith periodof1.5.Findthe Laplace transformofthe waveform.
678 Chapter 21 The Laplace transform
Solution L{f (t)} =
=
∫ 1.5
e −st f (t)dt
0
1 −e −1.5s
∫ 1.25
2e −st dt + ∫ 1.5
0 1.25 0e−st dt
1 −e −1.5s
= 2(1 −e−1.25s )
s(1 −e −1.5s )
EXERCISES21.13
1 Find the Laplacetransformsof
(a) 3u(t) + δ(t) (b) −6u(t) +4δ(t)
(c) 3u(t −2)+δ(t −2)
(d) u(t −3)−δ(t −4)
1
(e) 2 u(t −4)+3δ(t −4)
whereu(t)is the unitstepfunction.
2 Find the inverse Laplace transformsof
(a) 2 s −1 (b) 2
3s + 1 2
(d) 4s−3
s
3 Aperiodicwaveform is defined by
{
t 0t1
f(t)=
2−t 1<t2
(c) 3−2s
s
andhas aperiod of2.
(a) Sketch two cycles of f (t).
(b) Find L{f (t)}.
4 Awaveform, f (t),is defined by
{
2 0t1.5
f(t)=
−2t+5 1.5<t<2.5
andhas aperiod of2.5.
(a) Sketch f (t)on [0,5].
(b) Find L{f (t)}.
5 Ifthe impulseresponse ofanetwork isg(t) = 10e −4t
findthe outputwhenthe input is f (t) = e −t cos2t,
t0.
Solutions
3
1 (a)
s +1 (b) −6 s +4
3e −2s
(c) +e −2s (d) e−3s −e −4s
s s
e −4s
(e) +3e −4s
2s
2 (a)2u(t)−δ(t) (b) 2u(t) + δ(t)
3 2
3 (b)
4 (b)
5
(c)3u(t) −2δ(t)
1−e −s
s 2 (1+e −s )
2(s +e −2.5s −e −1.5s )
s 2 (1 −e −2.5s )
(30cos2t +20sin2t)e −t
13
(d) 4δ(t) −3u(t)
− 30e−4t
13
REVIEWEXERCISES21
1 GivenF(s) = s +1
s 2 isthe Laplace transformof
+2
f (t),findthe Laplace transformsofthe following:
(a) f(t)
e 2t
(b) 3e 2t f(t)
(c)2e −t (f(t)+1) (d) u(t−1)f(t−1)
(e)4u(t −3)f(t −3)
(f) e −2t u(t −2)f(t −2)
Review exercises 21 679
2 Find the Laplace transformsofthe following:
(a) IfV o (s) = L{v o (t)}andV i (s) = L{v i (t)},show
seriesRC network are relatedbythe differential (d) Findi(t)if
equation
{
2e
CR dv −2t t 0
o
v(t) =
+v
dt o =v i 0 otherwise
(a) 2tsin3t (b) 1 2 (−3tcos2t)
that the transfer function, V o (s) , isgiven by
V i (s)
1
(c) e −t u(t) (d) e −t u(t −1)
sCR+1 .
(e) 3t 2 u(t −1) (f) e −t δ(t −2)
(b) IfC = 0.1 µF,R = 100 k find,and sketch a
3 Find the inverse Laplacetransformsof
graph ofthe response, v o (t),when
2s+3
{
(a)
5volts t0
(s +1)(s +2)
v i (t) =
0 t<0
4s
(b)
s 2 −9
(c) Using the component values given in (b)findthe
(c) 2(s3 +4s 2 +4s +64)
response when the input is aunitimpulse, δ(t).
(s 2 +4)(s 2 +16)
(d) Using the component values given in (b)use the
(d)e −2s 6 s 4 (e) e−ss +1
convolution theoremto determine the response
s 2
when v i (t) = 5e −100t .
4 Solvethe followingdifferentialequations usingthe 7 Express the squarewave
Laplacetransformmethod:
{
1 0<t<a
(a) x ′′ +2x ′ −3x =2t, x(0) =1 x ′ (0) =2 f(t)=
−1 a <t < 2a period 2a
(b) x ′′ −2x ′ +5x =cost, x(0) =0 x ′ (0) =1 in terms ofunitstepfunctions. Hencededuce its
(c) ẋ+ẏ+x+y= 3 2 (1+t)
Laplace transform.
ẋ−2ẏ+x+2y=2t
8 Consider the circuitshown in Figure 21.24.
x(0) =0 y(0) =0
(d) ẋ−ẏ+x=−1,x(0)=0 y(0)=2
1–
3 F 1 H 4 V
2ẋ−ẏ + y 2 −x=1
5 Thecurrent,i(t),in a seriesLC circuitis governed by
i(t)
L di
dt + 1 ∫ t
idt = v(t)
y(t)
C 0
Figure21.24
where v(t)is the applied voltage.
(a) Assumingzero initialconditions showthat
(a) Show that
LsI + 1 Cs I =V(s)
∫
di t
dt +4i+3 idt = v(t)
0
(b) If v(t) = δ(t)show that
(b) Assumingzero initialconditions, showthat
i(t) = 1 L cos t
√ s
LC
I(s) =
(s+1)(s+3) V(s)
6 Theinput andoutputvoltages, v i (t)and v o (t),ofa (c) Findi(t)if v(t) = δ(t).
680 Chapter 21 The Laplace transform
Solutions
s +3
1 (a)
s 2 +4s+6
3(s −1)
(b)
s 2 −4s+6
(c)
(d)
(e)
2(2s 2 +5s +5)
(s +1)(s 2 +2s +3)
e −s (s +1)
s 2 +2
4e −3s (s +1)
s 2 +2
e −2(s+2) (s +3)
(f)
s 2 +4s+6
12s
2 (a)
(s 2 +9) 2
(b) − 3(s2 −4)
2(s 2 +4) 2
(c)
1
s +1
(d) e−(s+1)
s +1
( )
(e) 3e −s 2
s 3 + 2 s 2 + 1 s
(f) e −2(s+1)
3 (a) e −t +e −2t
(b) 2e −3t +2e 3t
(c) 2(2sin2t +cos4t)
(d) u(t −2)(t −2) 3
(e) u(t −1)t
4 (a) − 11e−3t
36
(b) e t [ − 1 5
+ 7et
4 − 2t
3 − 4 9
cos2t +
13
20 sin2t ]
+ 1 5 cost− 1 10 sint
(c) x=t,y=t/2
(d) x=e t −1,y=2e t
6 (b) v o (t) =5−5e −100t
7
8 (c)
(c) 100e −100t
(d) 500te −100t
1 −e −as
s(1 +e −as )
3e −3t
2
− e−t
2
(d) −e −t +4e −2t −3e −3t
22 Differenceequations
andtheztransform
Contents 22.1 Introduction 681
22.2 Basicdefinitions 682
22.3 Rewritingdifferenceequations 686
22.4 Blockdiagramrepresentationofdifferenceequations 688
22.5 Designofadiscrete-timecontroller 693
22.6 Numericalsolutionofdifferenceequations 695
22.7 Definitionoftheztransform 698
22.8 Samplingacontinuoussignal 702
22.9 TherelationshipbetweentheztransformandtheLaplacetransform 704
22.10 Propertiesoftheztransform 709
22.11 Inversionofztransforms 715
22.12 Theztransformanddifferenceequations 718
Reviewexercises22 720
22.1 INTRODUCTION
Difference equations are the discrete equivalent of differential equations. The terminology
is similar and the methods of solution have much in common with each other.
Differenceequationsarisewheneveranindependentvariablecanhaveonlydiscretevalues.
They are of growing importance in engineering in view of their association with
discrete-timesystems based on the microprocessor.
InChapter21theLaplacetransformwasshowntobeausefultoolforthesolutionof
ordinary differential equations, and for the construction of transfer functions in circuit
analysis,controltheory,etc.Generally,Laplacetransformmethodsapplywhenthevariables
being measured are continuous. Theztransform plays a similar role for discrete
682 Chapter 22 Difference equations and the z transform
systemstothatplayedbytheLaplacetransformforcontinuousones.Inthischapteryou
will be introduced to theztransform and one of its applications -- the solution of linear
constant coefficient difference equations. The z transform is of increasing importance
as more and more engineering systems now contain a microprocessor or computer and
sohaveoneormorediscrete-timecomponents.Forexample,mostindustrialcontrollers
nowhaveanembeddedmicroprocessor,andoverallcontrolofafactoryisoftenbymeans
ofasupervisoryprocesscontrolcomputer.Manyfactoriesalsohaveaproductioncontrol
computer toschedule production.
22.2 BASICDEFINITIONS
Before classifying difference equations wewill first derive an example of one.
Suppose a microprocessor system is being used to capture and analyse images. The
number of instruction cycles,i, of the microprocessor needed to process an image depends
on the number of pixels, n, that the image is broken down into. Clearly n is a
non-negative integer, that is n ∈ N. Since i depends upon n we write i = i[n]. The
squarebracketsnotationreflectsthefactthatnisadiscretevariable(seeSection6.2).If
there arenitemsof data, the number of instruction cycles used isi[n].
Suppose that if there aren + 1 items of data, the number of instruction cycles used
increases by 10n +1.Then,
i[n+1]=i[n]+10n+1
Thisisanexampleofadifferenceequation.Thedependentvariableisi;theindependent
variable isn.
Puttingn = 0 inthe difference equation gives
i[1] =i[0] +10(0) +1 =1
Similarly,i[2] = 12,i[3] = 33,i[4] = 64andsoon.Weseethatthedifferenceequation
gives risetoasequence of values.
There are strong similarities between difference and differential equations. The important
point to note is that with difference equations, the independent variable is discrete,
not continuous. In the above example,nis the number of pixels; it can have only
integer values. This discrete property of the independent variable is an essential and
distinguishing feature of difference equations. Much of the terminology of differential
equations isapplied, withidentical meaning, todifference equations.
22.2.1 Dependentandindependentvariables
Consider a simple difference equation:
x[n +1] −x[n] = 10
The dependent variable isx; the independent variable isn. Inthe difference equation
y[k+1]−y[k]=3k+5
the dependent variable isyand the independent variable isk.
22.2 Basic definitions 683
22.2.2 Thesolutionofadifferenceequation
A solution is obtained when the dependent variable is known for each value of interest
of the independent variable. Thus the solution takes the form of a sequence. There are
frequently many different sequences which satisfy a difference equation; that is, there
aremanysolutions.Thegeneralsolutionembracesalloftheseandallpossiblesolutions
can be obtained from it.
Example22.1 Showx[n] =A2 n , whereAisaconstant, isasolution of
x[n +1] −2x[n] = 0
Solution x[n] =A2 n x[n +1] =A2 n+1 = 2A2 n
Hence,
x[n +1] −2x[n] = 2A2 n −2A2 n = 0
Hencex[n] =A2 n isasolutionofthegivendifferenceequation.Infactx[n]isthegeneral
solution.
If additionally we are given a condition, sayx[0] = 3, the constantAcan be found. If
x[n] =A2 n , thenx[0] =A2 0 =Aand hence
A = 3
The solution is thus x[n] = 3(2 n ). This is the specific solution and satisfies both the
difference equation and the given condition.
22.2.3 Linearandnon-linearequations
Anequationislinearifthedependentvariableoccursonlytothefirstpower.Ifanequation
isnot linear itisnon-linear. For example,
3x[n +1] −x[n] = 10
y[n+1]−2y[n−1]=n 2
kz[k +2] +z[k] =z[k −1]
arealllinearequations.Notethatthepresenceofthetermn 2 doesnotmaketheequation
non-linear, sincenisthe independent variable. However,
(x[n +1]) 2 −x[n] = 10
y[k+1]= √ y[k] +1
areboth non-linear. Also,
z[n +1]z[n] =n 2 +100
sinx[n] =x[n −1]
arenon-linear.Theproducttermz[n +1]z[n]andthetermsinx[n]arethecausesofthe
non-linearity.
684 Chapter 22 Difference equations and the z transform
22.2.4 Order
Theorderisthedifferencebetweenthehighestandlowestargumentsofthedependent
variable. The equation
3x[n +2] −x[n +1] −7x[n] =n
issecond order because the difference betweenn+2 andnis2.
x[n +1]x[n −1] = 7x[n −2]
isthirdorderbecausethedifferencebetweenn +1andn −2is3.Ingeneral,thehigher
the order of an equation, the more difficultitistosolve.
Insomedifferenceequationsthedependentvariableoccursonlyonce.Theseareclassified
as zero order. Engineers refer to them as non-recursive difference equations because
calculation of the value of the dependent variable does not require knowledge of
thepreviousvalues.Incontrast,differenceequationsoforder1orgreaterarereferredto
asrecursivedifferenceequationsbecausetheirsolutionrequiresknowledgeofprevious
values of the dependent variable. The difference equation
x[n]=n 2 +n+1
has zero order and so isanon-recursive difference equation.
22.2.5 Homogeneousandinhomogeneousequations
Themeaningsofhomogeneousandinhomogeneousasappliedtolineardifferenceequationsareanalogoustothosemeaningswhenappliedtodifferentialequations.Todecide
whether a linear equation is homogeneous or inhomogeneous it is written in standard
form, with all the dependent variable terms on the l.h.s. Any remaining independent
variable termsare writtenon the r.h.s.For example,
3nx[n +1] −2n 3 =x[n −1]
iswritten as
3nx[n +1] −x[n −1] = 2n 3 (22.1)
If the r.h.s. is 0, the equation is homogeneous; otherwise it is inhomogeneous. Equation
(22.1) isinhomogeneous but
3nx[n+1]−x[n−1]=0
ishomogeneous.
Engineeringapplication22.1
Signalprocessingusingamicroprocessor
In engineering, an increasing number of products contain a microprocessor or computer
which is solving one or more difference equations. The input to the microprocessorisasequenceofsignalvalues,inmanycasesformedasaresultofsamplinga
continuous inputsignal.Theoutput fromthemicroprocessorisasequence ofsignal
values whichmaybesubsequently convertedintoacontinuous signal.Forexample,
22.2 Basic definitions 685
an inhomogeneous difference equation could be of the form
y[n] −2y[n −1] = 0.1s[n] +0.2s[n −1] −0.5s[n −2]
where y[n] is the output sequence or dependent variable and s[n] is the input sequence.Notethattheinputsequencecanstillbethoughtofastheindependentvariable
but instead of being expressed analytically in terms ofnit arises as a result of
sampling. The corresponding homogeneous equation is
y[n] −2y[n −1] = 0
22.2.6 Coefficient
Thetermcoefficientreferstothecoefficientofthedependentvariable.InEquation(22.1)
the coefficients are 3nand −1.
Example22.2 (a) State the order ofeach of the following equations (i)--(vii).
(b) State whether each equation islinear or non-linear.
(c) For each linear equation, statewhether itishomogeneous or inhomogeneous.
(i) 2x[n] −3nx[n −1] +x[n −2] +n 2 = 0
1
(ii) (x[n +1] −x[n −1]) =x[n]
3
(iii) z[n +2](2n −z[n −1]) =n +1
7x[n −1]
(iv)
x[n −2] = n +1
n −1
(v) w[n +3]w[n +1] =n 3 −1
(vi) y[n +2] +2y[n +1] = 6s[n +2] −2s[n +1] +s[n]whereyisthedependent
variable
(vii) x[k+3]−2x[k+2]+x[k] =e[k+2]−e[k]wherexisthedependentvariable.
Solution (a) (i) Second order
(ii) Second order
(iii) Third order
(iv) Firstorder
(v) Second order
(vi) Firstorder
(vii) Third order
(b) (i) Linear
(ii) Linear
(iii) Non-linear
(iv) Linear
(v) Non-linear
(vi) Linear
(vii) Linear
(c) Equations (iii) and (v) are non-linear. The linear equations are written in standard
form:
(i) 2x[n] −3nx[n −1] +x[n −2] = −n 2
(ii) x[n +1] −3x[n] −x[n −1] = 0
686 Chapter 22 Difference equations and the z transform
(iv) 7(n −1)x[n −1] − (n +1)x[n −2] =0
(vi) and (vii)are already instandard form.
Hence wefind the following:
(i) Inhomogeneous
(ii) Homogeneous
(iv) Homogeneous
(vi) Inhomogeneous
(vii) Inhomogeneous
EXERCISES22.2
1 Foreach ofthe followingequations (a)--(e): (c)y[n −2] +y[n −1] +y[n] =n 2
(i) Statethe order ofthe equation. (ii)State whether
each equation islinear ornon-linear. (iii)Foreach
linear equation, state whether itis homogeneous or
inhomogeneous.
(a)n(3n +x[n]) =x[n −1]
2z[k −4]
(b)
z[k −3] =z[k−2]
(d) √ n +x[n] =x[n −2] +e[n], wherexis the
dependent variable.
(e) (2w[n−1]+1) 2 = w[n−2]+s[n−1]−s[n−2],
where w isthe dependent variable.
Solutions
1 (a) First order; linear;inhomogeneous
(b) Second order; non-linear
(c) Second order; linear;inhomogeneous
(d) Second order; non-linear
(e) Firstorder; non-linear
22.3 REWRITINGDIFFERENCEEQUATIONS
Sometimes an equation or expression can be written in different ways. At first sight, it
may appear there are two independent equations when in fact there is only one. Thus
weneedtobeabletorewriteequationssothatcomparisonscanbemade.Whengeneral
solutions of equations are to be found, usually the equation is first written in a standard
form. Soonceagain thereisaneedtorewrite equations.
Example22.3 Rewrite the equation sothat the highestargumentofthe dependentvariable isn +1.
x[n+3]−x[n+2]=2n
x[2]=7
Solution Thehighestargumentinthegivenequationisn +3;thismustbereducedby2ton +1.
To do thisnisreplaced byn −2.Theequation becomes
x[n +1] −x[n] =2(n −2) x[2] =7
Note,however,thattheinitialcondition,x[2] = 7,isnotchanged.Thisissimplystating
thatxhas a valueof7when the independentvariablehas a value of2.
22.3 Rewriting difference equations 687
Example22.4 Write the following equations so that the highest argument of the dependent variable is
n+2.
(a) 3x[n +4] −2nx[n +2] = (n −1) 2 x[3] = 6 x[4] = −7
(b) z[n−2]+z[n−1]+z[n]=1+n z[0]=1 z[1]=0
Solution (a) Thehighestargument,n+4,mustbereplacedbyn+2,thatisnisreplacedbyn−2
throughout the equation.
3x[n +2] −2(n −2)x[n] = (n −3) 2 x[3] = 6 x[4] = −7
(b) The highest argument,n, isincreased ton +2,thatisnisreplaced byn+2.
z[n]+z[n+1]+z[n+2]=n+3 z[0]=1 z[1]=0
Example22.5 Write the following equations so that the highest argument of the dependent variable
isk:
s[k +2] −2s[k +1] +s[k]
(a) a[k +2] =
15
whereaisthe dependent variable.
(b) a[k +3] = l[k+3]+l[k+2]+l[k+1]+l[k]+l[k−1]
5
whereaisthe dependent variable.
Solution (a) The highest argument,k+2, must be replaced byk, that iskis replaced byk−2
throughout the equation:
a[k] =
s[k] −2s[k −1] +s[k −2]
15
(b) The highest argument,k+3, must be replaced byk, that iskis replaced byk−3
throughout the equation:
a[k] = l[k]+l[k−1]+l[k−2]+l[k−3]+l[k−4]
5
EXERCISES22.3
1 Writeeachequation sothat the highestargument of
the dependent variableisasspecified:
(a) p[k] −3p[k +1] = p[k −2],highestargumentof
the dependent variableisto bek+2.
(b) R[n−1]−R[n−2]−R[n−3]=n,R[0]=1,
R[1] = −2, highestargumentofthe dependent
variableisto ben.
(c) q[t] +tq[t −1] = 3q[t +1],q[1] = 0,
q[2] = −2,highestargumentofthe dependent
variableis to bet −1.
(d) T[m]+(m−1)T[m−2]=m 2 ,
T[0] =T[1] = 1,highestargumentofthe
dependent variableis to bem +2.
(e) y[k +1] −y[k +3] = (s[k +2] −s[k +4])/2,
whereyisthe dependentvariable andthe highest
argumentofthe dependent variableisto bek.
688 Chapter 22 Difference equations and the z transform
Solutions
1 (a) p[k+1]−3p[k+2]=p[k−1]
(b) R[n]−R[n−1]−R[n−2]=n+1
R[0] = 1,R[1] = −2
(c) q[t −2] + (t −2)q[t −3] =3q[t −1]
q[1] = 0,q[2] = −2
(d) T[m+2]+(m+1)T[m]= (m+2) 2
T[0]=T[1]=1
(e) y[k −2] −y[k] = s[k−1]−s[k+1]
2
22.4 BLOCKDIAGRAMREPRESENTATIONOFDIFFERENCE
EQUATIONS
Manyengineeringsystemscanbemodelledbymeansofdifferenceequations.Itispossibletorepresentadifferenceequationpictoriallybymeansofablockdiagram.Theuse
ofablockdiagramrepresentationhelpsanengineertovisualizeasystemandmayoften
behelpfulinsuggestingtherequiredhardwareorsoftwaretoimplementaparticulardifferenceequation.Thisisparticularlyimportantinareassuchasdigitalsignalprocessing
and digital controlengineering.
Before discussing block diagrams it is necessary to review the topic of sampling.
Difference equations operate on discrete-time data and therefore a continuous signal
needstobesampledbeforeuse.Inthemostcommonformofsampling,asampleistaken
at regular intervals,T. A continuous signal and the sequence produced by sampling it
areshowninFigure22.1.Someauthorswritethesequenceasx[nT]toindicatethatthe
sequencehasbeenobtainedbysamplingacontinuouswaveformatintervalsT.Wewill
notuse thisconvention butsimplyrefertothe sampledsequenceasx[n].
Several components are used in a block diagram. The delay block is shown in
Figure22.2.Theeffectofthiselementistodelaythesequencebyonesamplinginterval,
T. For example,if
x[n]=6,4,3,−2,0,2,5,0,...
n=0,1,2,...
wecan write thisasx[0] = 6,x[1] = 4,x[2] = 3,x[3] = −2,..., and then
x[n−1]=0,6,4,3,−2,0,2,5,0,...
n=0,1,2,...
x(t)
x[n]
(a)
t
Figure22.1
(a) Continuoussignal; (b)sequence produced asaresult ofsampling.
(b)
T
n
22.4 Block diagram representation of difference equations 689
x[n] x[n – 1]
T
Figure22.2
Adelay block delaysasequence
byatime interval,T.
x[n] x[n – 2]
T
T
Figure22.3
Two delay blocks in series.
x[n]
x[n]
p[n]
x[n] + a
1
kx[n]
3
q[n]
p[n] + q[n] + r[n]
S
a
k
r[n]
Figure22.4
Adding aconstant
to a sequence.
Figure22.5
Scalingasequence
byaconstant.
Figure22.6
Adding sequences together
usingasummer.
Notethatx[n −1]isundefinedwhenn = 0andsothisisassignedavalueof0.Adelay
of two sampling intervals results inthe sequencex[n −2]as shown inFigure 22.3.
Another block diagram element represents the addition of a constant to a sequence
and is shown in Figure 22.4. A sequence can be scaled by a constant. This is shown in
Figure 22.5. Finally, sequences can be added together using a summer. This is shown
inFigure 22.6.
Engineeringapplication22.2
Discrete-timefilter
The term filter is used by engineers to describe an electronic device that selectively
processessignalsofdifferentfrequencies.So,forexample,alowpassfilterisafilter
that allows low frequency signals to pass with little attenuation of their amplitude.
However, high frequency signals tend to be severely attenuated. In contrast, a high
passfilterattenuateslowfrequencysignalswhileallowinghighfrequencysignalsto
pass with relative ease. A discrete-time filter or, as more commonly termed, a digital
filter is an electronic device that uses digital methods to selectively filter digital
signals.
A simple example of a discrete-time filter is one described by the difference
equation
y[n] −ay[n −1] =x[n]
wherex[n]istheinputsequence,y[n]istheoutputsequenceandaisaconstant.Ifa
is positive then the filter behaves as a low-pass filter which rejects high frequencies
but allows low frequencies to pass. If a is negative then the filter behaves as a
high-pass filter. A block diagram for the filter is shown in Figure 22.7. Note that
this is a recursive filter because calculation ofy[n] requires knowledge of previous
values of the output sequence. Note also that the block diagram contains a feedback
path. This isafeatureofrecursive difference equations.
➔
690 Chapter 22 Difference equations and the z transform
x[n]
S
y[n]
ay[n – 1]
3 T
a
y[n – 1]
Figure22.7
Discrete-time filter.
Engineeringapplication22.3
Developmentofadifferenceequationtocalculatethe
accelerationofanobject
Engineersoftenneedtocalculatetheaccelerationofanobjectwhenallthatisknown
isthepositionofthatobject.Thiscanbeachievedbydevelopingadifferenceequation
that can be solved at regular time intervals by a computer. It is first necessary to
use a device that converts the position signal into a form suitable for analysis by a
computer. Usually the position signal will be continuous and therefore in analogue
form.Itisthereforenecessarytouseananalogue-to-digitalconvertertotransform
the analogue signal into a digital signal. Consider the following problem. Develop
a difference equation to convert a digital position signal into a digital acceleration
signal given that this position signal varies with time. Derive an associated block
diagramforthisdifference equation.
Lets =position, v =speed anda =acceleration:
v = ds a = dv = d2 s
dt dt dt 2
Therefore, in order to obtain an acceleration signal the position signal must be differentiatedtwice.ForasmalltimeintervalT,thederivative,y(t),ofasignalx(t)can
be approximated by
y(t) ≈ x(t)−x(t−T)
T
This follows directly from the definition of differentiation. If the signalx(t) is sampled
to givex[n] then the process of differentiation is represented by the difference
equation
x[n] −x[n −1]
y[n] =
T
Figure 22.8 shows a block diagram for the differentiator. It is important to note that
this difference equation is only an approximation to the process of differentiation.
This could be implemented using special-purpose hardware or by software on a microprocessor.
Itfollows thatthe speed of the object isgiven by
s[n] −s[n −1]
v[n] =
T
The problem of finding the acceleration, a[n], can now be solved by coupling two
differentiators together as shown in Figure 22.9. An alternative approach to this
22.4 Block diagram representation of difference equations 691
x[n]
x[n]
S
3
y[n]
T
3
–x[n – 1]
1
— T
–1
Figure22.8
Block diagram ofadifferentiator.
s[n]
y[n]
s[n]
3
y[n]
3
a[n]
T 3
—T
1
3
—T
1
–s[n – 1] –y[n – 1]
Figure22.9
Two differentiators in series.
–1 –1
problemistoobtainadifferenceequationfortheprocessoffindingasecondderivative.
Given that
s[n] −s[n −1]
v[n] = (22.2)
T
and
v[n] − v[n −1]
a[n] = (22.3)
T
then substituting Equation (22.2) into Equation (22.3) gives
a[n] =
(s[n] −s[n −1]) − (s[n −1] −s[n −2])
T 2
s[n] −2s[n −1] +s[n −2]
a[n] =
T 2
TheblockdiagramforthisdifferenceequationisshowninFigure22.10.Notethatthe
difference equation is non-recursive and so there are no feedback paths in the block
diagram.Theoutputsequenceisa[n]andtheinputsequenceiss[n].Theindependent
variable isn.
–2
T
3
–2s[n – 1]
s[n]
T
T
s[n]
S
s[n – 2]
3
1—
T 2
a[n]
Figure22.10
a[n] is obtained by
calculating the second
derivative ofs[n].
692 Chapter 22 Difference equations and the z transform
Engineeringapplication22.4
Development of a moving average filter to calculate the height
ofacidinachemicaltank
A common problem in chemical engineering is that the height of liquid in a tank is
difficulttomeasureaccuratelybecauseoftheliquidswillingaround.Onesolutionto
thisproblemistouseamovingaveragefiltertosmoothoutanyvariationsinheight
obtainedfromthetankpositionsensor.Akeydesigndecisionishowmanypastdata
points to use when constructing the moving average filter. In practice this depends,
amongst other things, on how much the liquid moves around. It is important not to
usetoomanypointsorsignificantvariationsinheightmaynotbepickedupintime.
However,iftoofewpointsareusedthentheoutputfromthefiltermaybetoospikyto
beofuse.Considerthecaseofamovingaveragefilterthatmakesuseofthefivemost
recentlymeasuredvaluesoftheleveloftheliquidinthetank.Formulateadifference
equation tocarryout thisaveraging and draw an equivalent block diagram.
Leta[n] represent the average level of the acid in the tank and letl[n] represent
the sampled values of the level measurements received from the transducer. Then,
a[n] = l[n]+l[n−1]+l[n−2]+l[n−3]+l[n−4]
5
Theblockdiagramforthisdifference equationisshowninFigure22.11. Theaction
of taking a moving average of sampled values is equivalent to passing the sampled
valuesthroughalow-passfilterbecauseitfiltersouthigh-frequencyvariationsinthe
sampledvalues.Thisprocessistermeddigitalfilteringordigitalsignalprocessing.
T
l[n – 1]
l[n]
l[n]
T
T
l[n – 2]
S
3
a[n]
T
T
T
T
T
T
l[n – 3]
T
l[n – 4]
1—
5
Figure22.11
Block diagram ofamoving averager.
EXERCISES22.4
1 Design adigital filterbasedontaking a moving
average ofthe last threevalues ofasampledsignal.
2 Acomputerisfedasignalrepresentingthevelocityof
an objectasafunctionoftime.Prior to enteringthe
computer the signal is sampled usingan
analogue-to-digitalconverter. Derive adifference
equationandassociatedblock diagramto obtain the
accelerationofthe objectasafunctionoftime.
22.5 Design of a discrete-time controller 693
Solutions
1 SeeFigure S.23.
2 a[n] =
v[n] − v[n −1]
.SeeFigure S.24.
T
x[n]
T
S 3 a[n]
y[n]
S 3
a[n]
FigureS.23
T
T
1–
3
FigureS.24
T
3
–1
1 –
T
22.5 DESIGNOFADISCRETE-TIMECONTROLLER
Figure22.12showsablockdiagramofasingleloopindustrialcontrolsystem.Thecontrol
system employs feedback to compare the desired value of a process variable with
the actual value. Any difference between the two generates an error signal e(t). This
signalisprocessedbythecontrollertoproduceacontrollersignalm(t).Anamplifieris
oftenpresenttomagnifythissignaltomakeitsuitablefordrivingtheplantthatisbeing
controlled.
Thecontrollercanbeimplementedbymeansofananalogueelectroniccircuit.However,
digital computers are being used increasingly as controllers. The signalse(t) and
m(t) are both continuous in time and so it is necessary to samplee(t) before it can be
used by the computer and to reconstruct the signal generated by the computer to producethecontrolleroutput,m(t).Thearrangementforimplementingadigitalcontroller
isshown inFigure 22.13.
r(t) +
e(t)
Controller
m(t)
Amplifier
Plant
c(t)
–
Sensor
Figure22.12
Asingle loop industrial controlsystem.
e(t)
Analogue-todigital
converter
Digital
computer
Digital-toanalogue
converter
m(t)
Figure22.13
Block diagramofadigital controller.
694 Chapter 22 Difference equations and the z transform
The most common type of controller used in industry is the proportional/integral/
derivative (p.i.d.)controller. Itcanbeshown thattheanalogue formofthiscontrolleris
modelled by the equation
m(t) =K p
e(t)+K i
∫ t
0
e(t)dt +K d
de(t)
dt
(22.4)
whereK p
,K i
andK d
are constants. In order to implement a discrete-time (digital) controlleritisnecessarytoconvertthisequationintoanequivalentdifferenceequation.The
approximation for the process of differentiation has already been examined in
Engineering application 22.3, and isgiven by
de(t)
dt
≈
e[n] −e[n −1]
T
Thereareseveralpossiblewaysofapproximatingtheprocessofintegration.Onemethod
is illustrated in Figure 22.14. Here the area under the curve is approximated by a series
of rectangles, each of widthT. If the approximate area under the curve fromt = 0 to
t =nT isdenoted byx[n], then
x[n] =x[n −1] +Te[n] (22.5)
The discrete form ofEquation (22.4) can now be formulated. Itisgiven by
m[n] =K p
e[n] +K i
x[n] +K d
e[n] −e[n −1]
T
(22.6)
Equations (22.5) and (22.6) form a set of equations toimplement a discrete form of the
p.i.d.controlleronadigitalcomputerormicroprocessor.Thesetwoequationsaretermed
coupled difference equations because both are needed to calculate m[n]. In addition,
Equation(22.5)isrecursive.Aflowchartforimplementingtheseequationsisshownin
Figure 22.15.
Initialization
Time to
sample?
YES
NO
Get e[n]
e(t)
Calculate
m[n]
Output m[n]
t
Figure22.14
Approximating the areaunder the curve by a
seriesofrectangles.
Figure22.15
Flow chartforap.i.d. controller.
22.6 Numerical solution of difference equations 695
EXERCISES22.5
1 Atransduceris used to measure the speed ofamotor
car.Design adigital filterto calculate the distance
travelled bythe car.
2 Draw a block diagram forEquations(22.5)
and (22.6).
Solutions
1 s[n] =s[n −1] +Tv[n]. SeeFigure S.25.
y[n]
3 S
s[n]
T
T
FigureS.25
22.6 NUMERICALSOLUTIONOFDIFFERENCEEQUATIONS
Having seen how difference equations are formulated we now proceed to methods of
solution. The numerical method illustrated may be applied to all classes of difference
equation.
Example22.6 Given
x[n+1]−x[n]=n
x[0]=1
determinex[1],x[2] andx[3].
Solution Thetermsinthe equation areevaluated forvarious values ofn:
n = 0
x[1] −x[0] = 0
x[1] = 1
n = 1
x[2] −x[1] = 1
x[2] = 2
n = 2
x[3] −x[2] = 2
x[3] = 4
Example22.7 Determinex[4]given
2x[k +2] −x[k +1] +x[k] = −k 2 x[0] = 1 x[1] = 3
696 Chapter 22 Difference equations and the z transform
Solution k =0 k =1
2x[2] −x[1] +x[0] = 0 2x[3] −x[2] +x[1] = −1
x[2] = 1 x[3] = − 3 2
k =2
2x[4] −x[3] +x[2] = −4
x[4] = − 13 4
Asthepreviousexamplesillustrate,todetermineauniquesolutiontoafirst-orderequation
requires one initial condition; for a second-order equation, two initial values are
required.
Engineeringapplication22.5
Numericalsolutionoftheoutputfromadigitallow-passfilter
Recall from Engineering application 22.2 that a low-pass filter is a filter that is designedtoattenuatehighfrequencysignals.Asimpledifferenceequationforadigital
filter isgiven by:
y[n] −ay[n −1] =x[n]
Ifaispositive thenthe filter isalow-passfilter. We will choosea = 0.5 and so
y[n] = 0.5y[n −1] +x[n]
Letusexaminetheresponseofthisfiltertoaunitstepinputappliedatn = 0,thatis
{ 0 n<0
x[n] = n ∈ Z
1 otherwise
Assume the output of the filter is zero prior to the application of the step input, that
isy[n] = 0 forn −1.Fromthe difference equation,wefind
Similarly,
y[0] = 0.5y[−1] +x[0]
= 0.5(0) +1
= 1
y[1] = 0.5y[0] +x[1]
= 0.5(1) +1
= 1.5
andsoon.Whennumericallysolvingadifferenceequation,itisoftenusefultoform
a table with intermediateresults. Table 22.1 shows suchatable.
Figure 22.16 shows the input and output sequences superimposed on the same
graph. The sequence values have been joined to illustrate their trends. Note that the
input signal reaches its final value immediately whereas the output signal takes several
sample intervals to reach its final value. Engineers often refer to this process as
‘smoothing’theinputsignal.WewillseeinChapter23thatrapidlychangingsignals
22.6 Numerical solution of difference equations 697
Table22.1
Numerical solution of a difference
equation.
n x[n] y[n −1] y[n]
−1 0 0 0
0 1 0 1
1 1 1 1.5
2 1 1.5 1.75
3 1 1.75 1.88
4 1 1.88 1.94
5 1 1.94 1.97
6 1 1.97 1.98
7 1 1.98 1.99
8 1 1.99 2.00
9 1 2.00 2.00
10 1 2.00 2.00
2
1
0 12345678910
Figure22.16
Input and output sequences forthe
low-pass filter.
y[n]
x[n]
n
tend to be richer in high frequencies than those that change more slowly. The effect
of the low-pass filter is to filter out these high frequencies and so the output from
the filter changes more slowly than the input. In Chapter 23 we will also examine a
continuous low-pass filter which does the same jobforcontinuous signals.
Numerical solution of difference equations is often the only feasible method of obtainingasolutionformanypracticalengineeringsystems.Thisisbecausetheinputterms
are usually obtained as a result of sampling an input signal and so cannot be expressed
analytically. However, in some cases it is possible to express the input analytically and
then an analytical solution to the difference equation may be feasible. While analytical
methods analogous to those applied to differential equations are available,ztransform
techniquesaremorepopularwithengineersandsotheseareintroducedinthefollowing
sections.
EXERCISES22.6
1 Given
x[n+2]+x[n+1]−x[n]=2
x[0] = 3 x[1] = 5
findx[2],x[3],x[4]andx[5].
2 If
z[n]z[n −1] =n 2 z[1] = 7
findz[2],z[3]andz[4].
3 Determinex[2]andx[3]given
(a) 2x[n +2] −5x[n +1] = 4n,x[1] = 2
(b) 6x[n] −x[n −1] +2x[n −2]
=n 2 −n,x[0]=1,x[1]=2
(c) 3x[n −1] +x[n −2] −9x[n −3]
= (n−1) 2 ,x[0] =3,x[1] =2
4 Calculate the firstfive termsofthe following
difference equations with the given initialconditions:
(a) x[n +1] −x[n] = 2,x[0] = 3
(b) x[n +2] +x[n +1] −x[n] = 4,
x[0] = 5,x[1] = 7
(c) x[p+2]−x[p+1]+2x[p] =2,
x[0] = 1,x[1] = 1
698 Chapter 22 Difference equations and the z transform
Solutions
1 0, 7, −5, 14
4
2
7 , 63
4 , 64
63
3 (a) 5, 29
2
(b)
1
3 , 7 18
(c)
29
3 , 52
9
4 (a) 3,5,7, 9,11
(b) 5,7,2, 9, −3
(c) 1,1,1, 1,1
TechnicalComputingExercises22.6
1 Writeacomputer program to implementthe
difference equations given in Engineeringapplication
22.5. Verifythat your resultsagreepreciselywith
those given in Table 22.1.
2 Modifyyour program sothat the input sequence is
x[n] = [0 1 0 0 0 0 0 0 0 0 0 0].Theoutput
is known astheimpulseresponse ofthe system.
Observe the shape ofthe plot andcompareitto the
output sequence in the Example.
3 Now change your program sothatitwill generate a
total of100 samples ofoutputandchange the input
sequence to:
x[n] = sin(2πn/100)
You should observe asinewave on the outputy(n).
Makeanoteofits amplitude.
4 Investigatewhat happensto the amplitudeofthe
outputwhenyou increasethe frequency ofthe input
sinewave. The frequency can be increased by
reducing the value ofthe denominatorin the sine
function, forexample,
x[n] = sin(2πn/10)
5 Does the result agreewith the description ofthe
operation ofthe discrete-time filterdescribedin the
example?
22.7 DEFINITIONOFTHEzTRANSFORM
Supposewehaveasequence f[k],k ∈ N.Suchasequencemayhavearisenbysampling
a continuoussignal. We define itsztransformtobe
∞∑
F(z) = Z{f[k]} = f[k]z −k (22.7)
k=0
Weseefromthedefinitionthattheztransformisaninfiniteseriesformedfromtheterms
ofthe sequence.Explicitly, wehave
Z{f[k]}=f[0]+ f[1]
z
+ f[2]
z 2 + f[3]
z 3 +···
Inmostengineeringapplicationswedonotactuallyneedtoworkwiththeinfiniteseries
since it is often possible to express this in a closed form. The closed form is generally
valid for values ofzwithin a region known as the radius of absolute convergence as
will become apparent from the following examples.
22.7 Definition of the z transform 699
Example22.8 Find theztransformof the sequence defined by
{
1 k=0
f[k]=
0 k≠0
Solution Z{f[k]} =
This sequence is sometimes called the Kronecker delta sequence, often denoted by
δ[k].
∞∑
f[k]z −k
k=0
=f[0]+ f[1]
z
=1 + 0 z + 0 z 2 + 0 z 3 +···
=1
HenceF(z) = 1.
+ f[2]
z 2 + f[3]
z 3 +···
Example22.9 Findtheztransformofthe sequencedefined by
f[k]=1
k∈N
Solution Z{f[k]} =
This istheunit stepsequence, oftendenotedbyu[k].
∞∑
f[k]z −k
k=0
=1 + 1 z + 1 z 2 + 1 z 3 +···
This is a geometric progression with first term 1 and common ratio 1 . The progression
z
converges if |z| > 1 inwhich case the sum toinfinity is
1
1−1/z = z
z −1
We see thatF(z)has the convenient closed form
F(z) =
for |z| > 1.
z
z −1
Note that the process of taking theztransform converts the sequence f[k] into the continuousfunctionF(z).
Example22.10 Findtheztransformofthesequencedefinedby f[k] =k,k ∈ N.Thissequenceiscalled
theunit ramp sequence.
700 Chapter 22 Difference equations and the z transform
Solution F(z) = Z{f[k]} =
∞∑
k=0
kz −k
= 1 z + 2 z 2 + 3 z 3 +···
= 1 z
{1 + 2 z + 3 z 2 +···}
Ifweusethebinomialtheorem(Section6.4)toexpress
(
1− 1 z
) −2
asaninfiniteseries,
we find that
(
1 − 1 ) −2
= 1 + 2 z z + 3 ∣ ∣∣∣
z + ··· provided 1
2 z ∣ < 1,that is |z| > 1
Using thisresultwe see that Z{f[k]}can be written as
F(z) = 1 (
1 − 1 ) −2
z z
and so
F(z) = 1 z
1
(1 −1/z) = z
for |z| > 1
2 (z −1)
2
Example22.11 Findtheztransform ofthe sequencedefinedby
f[k] =Ak
A constant
Solution We find
F(z) = Z{f[k]} =
=A
=
∞∑
Akz −k
k=0
∞∑
k=0
kz −k
Az
(z −1) 2 usingExample 22.10
In the same way as has been done for Laplace transforms, we can build up a library of
sequences and theirztransforms. Some common examples appear in Table 22.2. Note
that inTable22.2aandbareconstants.
Example22.12 UseTable 22.2 tofind theztransformsof
(a) sin 1 2 k
(b) e3k cos2k
Solution Directly fromTable 22.2 wefind
(a) Z{sin 1 2 k} = zsin 1 2
z 2 −2zcos 1 2 +1
(b) Z{e 3k cos2k} =
z 2 −ze 3 cos2
z 2 −2ze 3 cos2 +e 6
22.7 Definition of the z transform 701
Table22.2
Theztransformsofsome common functions.
f[k] F(z) f[k] F(z)
{ 1 k=0
δ[k] =
0 k≠0
{ 1 k0
u[k] =
0 k<0
k
e −ak
a k
ka k
k 2 a k
1 k 2 z(z +1)
(z −1) 3
z
z −1
z
(z −1) 2 sinak
z
z−e −a
z
z −a
k 3 z(z 2 +4z +1)
(z −1) 4
cosak
e −ak sinbk
az
(z −a) 2 e −ak cosbk
az(z +a)
(z −a) 3
zsina
z 2 −2zcosa+1
z(z −cosa)
z 2 −2zcosa+1
ze −a sinb
z 2 −2ze −a cosb +e −2a
z 2 −ze −a cosb
z 2 −2ze −a cosb +e −2a
EXERCISES22.7
1 Using the definition oftheztransform, find
closed-form expressions fortheztransformsofthe
followingsequences f[k] where
(a) f[0]=0,f[1]=0,f[k]=1fork 2
{ 0 k=0,1,...,5
(b) f[k] =
4 k>5
(c) f[k]=3k,k 0
(d) f[k]=e −k ,k=0,1,2,...
(e) f[0]=1,f[1]=2,f[2]=3,f[k]=0,k 3
(f) f[0]=3,f[k]=0,k ≠0
{ 2 k0
(g) f[k] =
0 k<0
2 UseTable 22.2 to find theztransformsof
(a) cos3k (b) e k (c) e −2k cosk
(d) e 4k sin2k (e) 4 k (f) (−3)
( ) ( )
k
kπ kπ
(g) sin (h) cos
2 2
3 Find, fromTable 22.2, the sequences whichhave the
followingztransforms:
z 2z 3z
(a) (b) (c)
z +4 2z−1 3z+1
z z
(d)
z−e 3 (e)
z 2 +1
4 Byconsidering the Taylor seriesexpansion ofe 1/z ,
find theztransformofthe sequence
f[k]= 1 k!
k 0
Solutions
1 (a)
(c)
1 4
(b)
z(z −1) z 5 (z−1)
3z ez
(z −1) 2 (d)
ez−1
(e)
2 (a)
z 2 +2z+3
z 2 (f) 3 (g)
z(z −cos3)
z 2 −2zcos3+1
(b)
z
z −e
2z
z −1
702 Chapter 22 Difference equations and the z transform
(c)
(d)
(e)
z 2 −ze −2 cos1
z 2 −2ze −2 cos1 +e −4
ze 4 sin2
z 2 −2ze 4 cos2 +e 8
z z
(f)
z −4 z +3
(g)
z
z 2 +1
(h)
z 2
z 2 +1
3 (a) (−4) k (b) (1/2) k (c) (−1/3) k
(d) e 3k
4 e 1/z
(e) sin(πk/2)
22.8 SAMPLINGACONTINUOUSSIGNAL
WehavealreadyintroducedsamplinginSection22.4.Wenowreturntothetopic.Most
of the signals that are encountered in the physical world are continuous in time. This
meansthattheyhaveasignallevelforeveryvalueoftimeoveraparticulartimeinterval
of interest. An example is the measured value of the temperature of an oven obtained
usinganelectronicthermometer.Thistypeofsignalcanbemodelledusingacontinuous
mathematical function in which for each value oft there is a continuous signal level,
f (t). Several engineering systems contain signals whose values are important only at
particularpointsintime.Thesepointsareusuallyequallyspacedandseparatedbyatime
interval, T. Such signals are referred to as discrete time, or more compactly, discrete
signals. They are modelled by a mathematical function that is only defined at certain
points in time. An example of a discrete system is a digital computer. It carries out
calculationsatfixed intervals governed by anelectronic clock.
Supposewehaveacontinuoussignal f (t),definedfort 0,whichwesample,that
is measure, at intervals of time, T. We obtain a sequence of sampled values of f (t),
thatis f[0], f[1], f[2],..., f[k],....Returningtotheexampleoftheoventemperature
signal, a discrete signal with a time interval of 5 seconds can be obtained by noting
the value of the electronic thermometer display every 5 seconds. Some textbooks use
the notation f[kT] as a reminder that the sequence has been obtained by sampling at
an interval T. We will not use this notation as it can become clumsy. However, it is
important to note that changing the value ofT changes theztransform as we shall see
in Example 22.13. It can be shown that sampling a continuous signal does not lose the
essence of the signal provided the sampling rate is sufficiently high, and it is in fact
possible to recreate the original continuous signal from the discrete signal, if required.
Itisoftenconvenienttorepresentadiscretesignalasaseriesofweightedimpulses.The
strengthofeachimpulseisthelevelofthesignalatthecorrespondingpointintime.We
write
f ∗ (t) =
∞∑
f[k]δ(t −kT) (22.8)
k=0
the * indicating that f (t) has been sampled. This representation is discussed in
Appendix I. This is a useful mathematical way of representing a discrete signal as the
propertiesoftheimpulsefunctionlendthemselvestoavaluethatonlyexistsforashort
interval of time. In practice, no sampling method has zero sampling time but provided
the sampling time is much smaller than the sampling interval, then this is a valid mathematical
model ofadiscrete signal.
22.8 Sampling a continuous signal 703
Wecanapplytheztransformdirectlytoacontinuousfunction, f (t),ifweregardthe
function as having been sampled at discrete intervals of time. Consider the following
example.
Example22.13 (a) Find theztransformof f (t) = e −t sampled att = 0,0.1,0.2,....
(b) Find theztransform of f (t) = e −t sampled att = 0,0.01,0.02,....
(c) Express the sequences obtained in(a)and (b) as series of weighted impulses.
Solution (a) The sequence of sampled values is
that is,
1,e −0.1 ,e −0.2 ,...
e −0.1k k ∈ NandT = 0.1
From Table 22.2 wefind theztransform ofthis sequence is
(b) The sequence of sampled values is
that is,
1,e −0.01 ,e −0.02 ,...
z
z −e −0.1.
e −0.01k k ∈ N and T = 0.01
z
From Table 22.2 we find the z transform of this sequence is
z −e−0.01. We note
that modifying the sampling interval,T, alters theztransform even though we are
dealing with the same function f (t).
(c) WhenT = 0.1 we have, from Equation (22.8),
∞∑
f ∗ (t) = e −0.1k δ(t −0.1k)
k=0
=1δ(t)+e −0.1 δ(t −0.1)+e −0.2 δ(t −0.2)+···
Notethatanadvantageofexpressingthesequenceasaseriesofweightedimpulsesis
thatinformationconcerningthetimeofoccurrenceofaparticularvalueiscontained
inthe corresponding δ term.
WhenT = 0.01, we have
∞∑
f ∗ (t) = e −0.01k δ(t −0.01k)
k=0
= 1δ(t) +e −0.01 δ(t −0.01) +e −0.02 δ(t −0.02) + ···
Example22.14 (a) Findtheztransform ofthe continuousfunction f (t) = cos3t sampledatt =kT,
k∈N.
(b) WritedownthefirstfourtermsofthesampledsequencewhenT = 0.2,andexpress
the sequenceasaseries ofweighted impulses.
704 Chapter 22 Difference equations and the z transform
Solution (a) The sampled sequence is
f[k] = cos3kT = cos((3T)k)
The z transform of this sequence can be obtained directly from Table 22.2 from
which we have
Z{cosak} =
Writinga = 3T wefind
Z{cos3kT} =
z(z −cosa)
z 2 −2zcosa+1
z(z −cos3T)
z 2 −2zcos3T +1
(b) WhenT = 0.2,the first four termsare
1 cos0.6 cos1.2 cos1.8
From Equation (22.8), the sequence of sampled values can be represented as the
following series of weighted impulses:
f ∗ (t) =
=
∞∑
cos3kTδ(t −kT)
k=0
∞∑
cos0.6kδ(t −0.2k)
k=0
= δ(t) +cos0.6δ(t −0.2) +cos1.2δ(t −0.4) +cos1.8δ(t −0.6) + ···
22.9 THERELATIONSHIPBETWEENTHEzTRANSFORM
ANDTHELAPLACETRANSFORM
We have defined theztransform quite independently of any other transform. However,
there is a close relationship between the z transform and the Laplace transform, the z
being regarded as the discrete equivalent of the Laplace. This can be seen from the following
argument.
Ifthecontinuoussignal f (t)issampledatintervalsoftime,T,weobtainasequence
of sampled values f[k], k ∈ N. From Section 22.8 we note that this sequence can be
regarded asatrainofimpulses.
f ∗ (t) =
∞∑
f[k]δ(t −kT)
k=0
Taking the Laplace transform,wehave
L{f ∗ (t)} =
=
∫ ∞
0
∑ ∞
e −st f[k]δ(t −kT)dt
∞∑
f[k]
k=0
k=0
∫ ∞
0
e −st δ(t −kT)dt
22.9 The relationship between the z transform and the Laplace transform 705
assuming that it is permissible to interchange the order of summation and integration.
NotingfromTable21.1thattheLaplacetransformofthefunction δ(t −kT )ise −skT ,we
can write
L{f ∗ (t)} =
∞∑
f[k]e −skT (22.9)
k=0
Now, making the change of variablez = e sT , we have
L{f ∗ (t)} =
∞∑
f[k]z −k
k=0
whichisthedefinitionoftheztransform.Theexpression L{f ∗ (t)}iscommonlywritten
asF ∗ (s).
In Appendix I it is shown that the continuous function f (t) can be approximated by
multiplying the function f ∗ (t) by the sampling intervalT. Correspondingly,TF ∗ (s)is
an approximation to the Laplace transform of f (t). This is illustrated in the following
example.
Example22.15 Considerthefunction f (t) =u(t)e −t whichhasLaplacetransformF(s) = 1
s +1 .Suppose
f (t) is sampled at intervals T to give the sequence f[k] = e −kT , for
k=0,1,2....
(a) UseTable 22.2 tofind theztransform of f[k].
(b) Make the change of variablez = e sT toobtainF ∗ (s).
(c) ShowthatprovidedthesampleintervalT issufficientlysmall,TF ∗ (s)approximates
the Laplace transformF(s).
Solution (a) From Table 22.2 wefind Z{f[k]} =F(z) =
(b) Lettingz = e sT givesF ∗ (s) =
e sT givesF ∗ (s) =
1
1−e −T(1+s).
z
z−e −T.
e sT
. Dividing numerator and denominator by
e sT −e−T (c) Usingthepowerseriesexpansionfore x wecanwritee x = 1 +x+ x2
2! +···.Sowe
can approximate e −T(1+s) for sufficiently smallT as1 −T(1 +s). Hence
F ∗ (s) ≈
=
1
1−(1−T(1+s))
1
T(1+s)
andsoTF ∗ (s) ≈ 1 , thatisthe Laplace transform of f (t).
1 +s
We have illustrated the connection between the two transforms and shown how the z
transformcan beregarded as the discrete equivalent of the Laplace transform.
706 Chapter 22 Difference equations and the z transform
22.9.1 Mappingthesplanetothezplane
When designing an engineering system it is often useful to consider ansplane representationofthesystem.Thecharacteristicsofasystemcanbequicklyidentifiedbythe
positionsofthepolesandzerosaswesawinChapter21.Engineerswilloftenmodifythe
system characteristics by introducing new poles and zeros or by changing the positions
ofexistingones.Unfortunately,itisnotconvenienttousethesplanetoanalysediscrete
systems. For a sampled signalEquation (22.9) yields
∞∑
F ∗ (s)=L{f ∗ (t)} = f[k]e −skT
k=0
ThecontinuoussignalsandsystemsthatwereanalysedinChapter21hadLaplacetransformsthatweresimpleratiosofpolynomials
ins.Thiswasoneofthemainreasonsfor
usingLaplacetransformstosolvedifferentialequations;theproblemwasreducedtoone
ofreasonablystraightforwardalgebraicmanipulation.HerewehaveaLaplacetransform
thatisverycomplicated.Infactitcanhaveaninfinitenumberofpolesandzeros.Tosee
this consider the following example.
( ) πt
Example22.16 The continuous signal f (t) = cos is sampled at 1 second intervals starting from
2
t=0.
(a) Find the Laplace transformof the sampled signal f ∗ (t).
(b) Show thatF ∗ (s)has an infinity of poles.
(c) Find theztransformof the sampled signaland show thatthis has justtwo poles.
( ) πt
Solution (a) The continuous signal f (t) = cos sampled at 1 second intervals gives rise to
2
the sequence 1,0, −1,0,1,0, −1,.... Consequently, fromEquation (22.9)
∞∑
∞∑
F ∗ (s)=L{f ∗ (t)} = f[k]e −skT = f[k]e −sk sinceT = 1
thatis,
k=0
k=0
F ∗ (s)=1+0−e −2s +0+e −4s +0−e −6s +···
This is a geometric progression with common ratio −e −2s and hence its sum to
infinity is
thatis,
1
1 − (−e −2s )
F ∗ 1
(s) =
1 +e −2s
(b) PolesofF ∗ (s)willoccurwhen1 +e −2s = 0.Writings = σ +jω weseethatpoles
will occur when e −2(σ+jω) = −1. Since −1 can be written as e j(2n−1)π ,n ∈ Z (see
Chapter 9), wesee thatpoles will occur when
e −2σ−2jω = e j(2n−1)π
22.9 The relationship between the z transform and the Laplace transform 707
jv
5p —2
3
s plane
3p —2
p —2
3
3
– p —2
–3p —2
3
3
s
jy
1 3
z plane
–5p —2
3
Figure22.17
Thesampled signal has an infinite
number ofpoles.
–1 3
Figure22.18
There are two polesatz = ±j.
x
thatis,when σ = 0and ω = −(2n −1)π/2.Thusthereexistaninfinitenumberof
poles occurring when
s=−(2n−1) π 2 j
n∈Z
Some of these areillustratedinFigure 22.17.
(c) Theztransformof the sampled signalis
Z{f ∗ (t)}=1+ 0 z − 1 z 2 + 0 z 3 + 1 z 4 +···
=
1
1 − (−1/z 2 )
= z2
z 2 +1
which has justtwo poles atz = ±j asshown inFigure 22.18.
It is possible to show in general that the result of sampling a continuous signal is
to convert each simple pole of the Laplace transform into an infinite set of poles. Suppose
that the Laplace transform of the signal f (t) can be broken down by using partial
fractions into a series of n + 1 terms with simple poles a 0
,a 1
,...,a n
. For simplicity,
repeatingpoleswill notbeconsideredbut the proof forsuchacase issimilar. So,
F(s) = A 0
s−a 0
+ A 1
s−a 1
+ A 2
s−a 2
+···+
Inthe time domain thiscorresponds to
f(t) =A 0
e a 0 t +A 1
e a 1 t +···+A n
e a n t
A n
s−a n
708 Chapter 22 Difference equations and the z transform
Now we consider the Laplace transformof the sampled signal f ∗ (t):
F ∗ (s)=L{f ∗ (t)} =
=
∞∑
f[k]e −skT
k=0
∞∑
(A 0
e a 0 kT +A 1
e a 1 kT + ··· +A n
e a n kT )e −skT
k=0
∑ ∞
∑ ∞
=A 0
e −kT(s−a 0 ) +A 1
e −kT(s−a 1 ) +···
k=0
+A n
∞
∑
k=0
e −kT(s−a n )
Now each of the summations can be converted into a closed form. For example,
∞∑
e −kT(s−a 0 ) =1+e −T(s−a 0 ) +e −2T(s−a 0 ) +e −3T(s−a 0 ) +···
k=0
Therefore,
F ∗ (s) =
1
=
1−e −T(s−a 0 )
k=0
A 0
1−e + A 1
−T(s−a 0 ) 1−e +···+ A n
−T(s−a 1 ) 1−e −T(s−a n )
It is possible to show that for each simple pole in F(s) there is now an infinite set of
poles. Consider the pole ats =a 0
. This contributes the term
A 0
1−e −T(s−a 0 )
toF ∗ (s). This term has poles whenever 1 − e −T(s−a 0 ) = 0, that is e −T(s−a 0 ) = 1. This
corresponds toT(s −a 0
) = 2πmj,m ∈ Z. Therefore,
T(s−a 0
)=2πmj
s−a 0
= 2πm
T j
s=a 0
+ 2πm
T j
m∈Z
The effect of sampling is to introduce an infinite set of poles. Each one is equal to the
pole of the original continuous signal but displaced by an imaginary component. This
is illustrated in Figure 22.19 for a real polea 0
. However, the proof is equally valid for
a complex conjugate pair of poles but the diagram is more cluttered and has, therefore,
not been shown.
Clearly discrete systems are not amenable tosplane design techniques. Fortunately,
thezplane can be used for analysing discrete systems in the same way that thesplane
can be used when analysing continuous systems. It is possible to map points from thes
plane to thezplane using the relationz = e sT which gives rise to the definition of thez
transform as described previously.
22.10 Properties of the z transform 709
jv
s plane
8p
3 a 0 + —– j T
6p
3 a 0 + —– j T
4p
3 a 0 + —– j T
2p
3 a 0 + —– j T
3 a 0
s
jv
s plane jy z plane
2p
3 a 0 – —– j T
4p
3 a 0 – —– j T
6p
3 a 0 – —– j T
8p
3 a 0 – —– j T
Figure22.19
The effect ofsampling isto introduce an
infinite set ofpoles.
s
Figure22.20
Theleft half ofthesplane maps to the inside ofthe unit
circle ofthezplane.
1
1
x
The great advantage of thezplane is that it eliminates the problem of infinitely repeatingpolesandzeroswhenanalysingdiscretesystems.Thiscanbeillustratedbyconsideringhow
the imaginary axis of thesplane maps tothezplane.
Referring to Figure 22.20, we see that s = σ + jω. On the imaginary axis σ = 0
andthereforez = e sT = e jωT .As ω variesbetween − π T and π T ,thelocusofzisacircle
of radius 1, centred at the origin (see Section 9.10). As ω is increased from 0 to π T the
upper half of the unit circle is traced out, while as ω is decreased from 0 to − π T , the
lower half of the unit circle is traced out. Increasing ω above π T
or decreasing it below
− π T leadstoaretracingoftheunitcircle.Inotherwords,therepeatedsplanepointsare
superimposedontopofeachother.Thisisthereasonwhythezplaneapproachismuch
simplerthan thesplane approach when analysing discrete systems.
Theztransforms of discrete signals and systems are, in many cases, simple ratios of
polynomials. We shall see shortly that this means the process of analysing difference
equations which model these signals and systems is reduced to relatively simple algebraicmanipulations.
22.10 PROPERTIESOFTHEzTRANSFORM
Because of the relationship between the two transforms we would expect that many of
the properties of the Laplace transform would be mirrored by properties of theztransform.
This isindeedthe caseand someoftheseproperties aregiven now. These are:
(1) linearity;
(2) shift theorems;
(3) the complex translation theorem.
710 Chapter 22 Difference equations and the z transform
22.10.1 Linearity
If f[k] andg[k] are two sequences then
Z{f[k] +g[k]} = Z{f[k]} + Z{g[k]}
This statement simply says that to find theztransform of the sum of two sequences we
canaddtheztransformsofthetwosequences.Ifcisaconstant,whichmaybenegative,
and f[k]isasequence, then
Z{cf[k]} =cZ{f[k]}
Together these two properties mean thattheztransform isalinear operator.
Example22.17 Find theztransform of e −k +k.
Solution From Table 22.2 wehave
Z{e −k z
} =
z−e −1
and
z
Z{k} =
(z −1) 2
Therefore,
Z{e −k +k} =
z
z−e −1 +
z
(z −1) 2
Example22.18 Findtheztransform of3k.
Solution FromTable 22.2 wehave
z
Z{k} =
(z −1) 2
Therefore,
Z{3k} =3Z{k} =3×
z
(z −1) = 3z
2 (z −1) 2
Example22.19 Findtheztransform ofthe function f (t) = 2t 2 sampledatt =kT,k ∈ N.
Solution Thesequenceofsampledvalues is
f[k] = 2(kT) 2 = 2T 2 k 2
TheztransformofthissequencecanbereaddirectlyfromTable22.2usingthelinearity
properties. We have
Z{2T 2 k 2 } =2T 2 Z{k 2 } = 2T2 z(z +1)
(z −1) 3
22.10 Properties of the z transform 711
22.10.2 Firstshifttheorem
If f[k]isasequence andF(z)isitsztransform,then
Z{f[k+i]} =z i F(z)−(z i f[0]+z i−1 f[1]+···+zf[i−1]),i ∈ N + (22.10)
Inparticular, ifi = 1 we have
Z{f[k +1]} =zF(z) −zf[0] (22.11)
Ifi=2wehave
Z{f[k+2]}=z 2 F(z)−z 2 f[0]−zf[1]
Example22.20 The sequence f[k]isdefined by
{ 0 k=0,1,2,3
f[k]=
1 k=4,5,6,...
Writedown the sequence f[k +1] and verify that
Z{f[k+1]}=zF(z)−zf[0]
whereF(z)istheztransform of f[k].
Solution The graph of f[k] is illustrated in Figure 22.21. The sequence f[k + 1] is defined as
follows:
Whenk = 0, f[k +1] = f[1]which is0.
Whenk = 1, f[k +1] = f[2]which is0.
Whenk = 2, f[k +1] = f[3]which is0.
Whenk = 3, f[k +1] = f[4]which is1,and soon.
Consequently,
{ 0 k=0,1,2
f[k+1]=
1 k=3,4,5,...
as illustrated in Figure 22.22. We see that the graph of f[k + 1] is simply that of f[k]
shiftedoneplacetotheleft.Moregenerally, f[k+i]isthesequence f[k]shiftediplaces
tothe left.Now
f [k]
1
f [k + 1]
1
0 1 2 3 4 5 6 7 8 9 k
Figure22.21
The sequence ofExample 22.25.
0 1 2 3 4 5 6 7 8 9 k
Figure22.22
AshiftedversionofthesequenceofFigure22.21.
712 Chapter 22 Difference equations and the z transform
F(z) = Z{f[k]} =
=
∞∑
f[k]z −k
k=0
∞∑
k=4
z −k
= 1 z 4 + 1 z 5 + 1 z 6 +···
= 1 z 4 (
1 + 1 z + 1 z 2 +···)
= 1 z 4 1
1−1/z
= 1 z 4 z
z −1
= 1 z 3 1
z −1
The same argument shows that
Z{f[k+1]}= 1 z 2 1
z −1
Itthen follows that
Z{f[k+1]}= 1 z 2 1
z −1 =zF(z)−zf[0]
since f[0] = 0.This illustratesthe first shift theorem.
22.10.3 Secondshifttheorem
Thefunction f (t)u(t)isdefined by
{ f(t) t0
f(t)u(t) =
0 t<0
whereu(t)istheunitstepfunction.Thefunction f (t−iT)u(t−iT ),whereiisapositive
integer,representsashifttotherightofisampleintervals.Supposethisshiftedfunction
issampled; thenweobtain
f[k−i]u[k−i]
k∈N
Thesecondshift theoremstates:
Z{f[k−i]u[k−i]} =z −i F(z) i ∈ N +
whereF(z)istheztransform of f[k].
22.10 Properties of the z transform 713
Example22.21 The functiont u(t) is sampled at intervals T = 1 to give k u[k]. This sample is then
shifted tothe right by one sampling interval togive
(k −1)u[k −1]
Find itsztransform.
Solution Figure 22.23(a) shows t u(t) and Figure 22.23(b) shows the sampled function. Figures
22.23(c) and (d) show (t − 1)u(t − 1) and (k − 1)u[k − 1], respectively. From
Table 22.2, we have
z
Z{k} =
(z −1) 2
and so, fromthe second shift theorem withi = 1,wehave
Z{(k −1)u[k −1]} =z −1 z
(z −1) = 1
2 (z −1) 2
2
2
1
1
1
1
1
t
1 2 3 k
1 2 3 t
(a) (b) (c) (d)
1 2 3 k
Figure22.23
GraphsforExample 22.21:(a)t u(t);(b)ku[k];(c) (t −1)u(t −1);(d) (k −1)u[k −1].
Example22.22 Find theztransform of the unit step functionu(t) and the shifted unit stepu(t − 2T),
sampledatintervals ofT seconds.
Solution If the function u(t) is sampled at intervals T then we are concerned with finding the
z transform of the sequence u[k]. This has been derived earlier: Z{u[k]} =
z
z −1 . If
u(t −2T) issampled, we have
{
1 k=2,3,4,...
u[k−2]=
0 otherwise
Therefore, by the second shift theorem,
Z{u[k −2]} =z −2 Z{u[k]}
=z −2 z
z −1
=
1
z(z −1)
714 Chapter 22 Difference equations and the z transform
Example22.23 Find the sequence whoseztransformis
Solution
1
z −1 = 1 z
z z
z −1 =z−1
z −1
From Table 22.2 wehave
Z{u[k]} =
z
z −1
So from the second shift property, wehave
Z{u[k −1]} =z −1 z
z −1
1
z −1 .
The required sequence isthereforeu[k −1].
1
Example22.24 Findthe sequencewhoseztransformis
z 2 (z −1) 2.
1
Solution The expression does not appear in the table of transforms, but we observe
z 2 (z −1)
2
that
1
z 2 (z −1) = 1 z
2 z 3 (z −1) 2
z
and does appear. Itfollows from Table 22.2 that
(z −1)
2
z
Z{k} =
(z −1) 2
From the second shift property,z −3 z
istheztransform of (k −3)u[k −3].
(z −1)
2
22.10.4 Thecomplextranslationtheorem
Z{e −bk f[k]} =F(e b z) whereF(z)istheztransform of f[k].
Example22.25 Given thattheztransform ofcos(ak) is
z(z −cosa)
z 2 −2zcosa+1
find theztransform of e −2k cos(ak).
Solution Since b = 2, the complex translation theorem states that we replace z by e 2 z in the z
transformF(z).
F(e 2 z) =
e2 z(e 2 z −cosa)
e 4 z 2 −2e 2 zcosa+1
istherefore the required transform.
22.11 Inversion of z transforms 715
EXERCISES22.10
1 UseTable 22.2 to find theztransformsof
(a)3(4) k +7k 2 ,k 0
(b)3e −k sin4k−k,k 0
2 Find theztransformsofthe following continuous
functionssampledatt =kT,k ∈ N:
(a) t 2 (b) 4t (c) sin2t
(d) u(t −4T )
(e) e 3t
3 Find theztransformof (k −3)u[k −3]bydirectuse
ofthe definition oftheztransform.Hence verifythe
resultofExample 22.24.
4 Provethat theztransformofe −at f (t)isF(e aT z).
5 Prove the firstandsecond shifttheorems.
6 Use the complex translation theoremto findthez
transformsof
(a) ke −bk
(b) e −k sink
7 If f[k] = 4(3) k find Z{f[k]}.Usethefirstshift
theorem to deduce Z{f[k +1]}.Show that
Z{f[k+1]}−3Z{f[k]}=0.
8 Write down the firstfive terms ofthe sequence
definedby f[k] = 4(2) k−1 u[k −1],k 0.Finditsz
transformdirectly, andalso byusingthe second shift
theorem.
Solutions
1 (a)
(b)
2 (a)
(c)
(e)
3z 7z(z +1)
1
+
z −4 (z −1) 3
3
z 2 (z −1) 2
3ze −1 sin4
z 2 −2ze −1 cos4 +e −2 − z
ze b
(z −1) 2 6 (a)
(ze b −1) 2
T 2 z(z +1) 4Tz
ezsin1
(z −1) 3 (b)
(z −1) 2 (b)
e 2 z 2 −2ezcos1+1
zsin2T 1
z 2 (d)
4z
−2zcos2T +1 z 3 (z−1) 7
z −3 , 12z
z −3
z
z−e 3T 8 0, 4,8,16,32.
4
z −2
22.11 INVERSIONOFzTRANSFORMS
Just as it is necessary to invert Laplace transforms we need to be able to invertztransforms
and as before we can make use of tables of transforms, partial fractions and the
shift theorems.Invery complicatedcasesmore advanced techniquesarerequired.
Example22.26 IfF(z) = z +3 ,find f[k].
z −2
Solution Note that wecan writeF(z)as
z +3
z −2 = z
z −2 + 3
z −2
Thereasonforthischoiceisthatquantitieslikethefirstonther.h.s.appearinTable22.2.
Fromthis table we find
Z{2 k } = z
z −2
716 Chapter 22 Difference equations and the z transform
and write
{ } z
Z −1 z −2
= 2 k
where Z −1 denotes the inverseztransform.Also,
3
z −2 = 3 z
zz −2
= 3z −1 z
z −2
Using the second shift property wesee that
Z{3(2) k−1 u[k −1]} = 3 z
so that
{ } z +3
Z −1 z −2
z
z −2
{ } { z
= Z −1 + Z −1
z −2
3z −1
}
z
z −2
= 2 k +3(2) k−1 u[k −1]
{ 1 k=0
=
2 k +3(2) k−1 k=1,2,...
{ 1 k=0
=
2(2) k−1 +3(2) k−1 k = 1,2,...
{ 1 k=0
=
5(2) k−1 k =1,2,...
Often it is necessary to split a complicated expression into several simpler ones, using
partialfractions,beforeinversioncanbecarriedout.Also,ifweexaminetheztransform
table, Table 22.2, wenoticethat nearlyall ofthe entrieshave azterminthe numerator.
For this reason it is convenient to divide a complicated expression byzbefore splitting
it into partialfractions. We illustrate this techniqueby meansofanexample.
Example22.27 Findthe sequencewhoseztransformis
F(z) =
2z 2 −z
(z −5)(z +4)
Solution The first stage istodivide the expression forF(z)byz. We have
F(z) =
F(z) =
F(z)
z
=
2z 2 −z
(z −5)(z +4)
z(2z −1)
(z −5)(z +4)
2z−1
(z −5)(z +4)
22.11 Inversion of z transforms 717
We now split the r.h.s. expression into partial fractions using the standard techniques
discussed inSection 1.7.This gives
F(z)
z
= 1
z −5 + 1
z +4
Multiplying byzgives
F(z) =
z
z −5 + z
z +4
It is now possible to invert this expression for F(z) using the z transform table,
Table 22.2. This gives
f[k] =5 k + (−4) k
22.11.1 Directinversion
Sometimes it is possible to invert a transform F(z) directly by reading off the coefficients.
Example22.28 Find f[k]if
F(z)=1+z −1 +z −2 +z −3 +···
Solution Usingthe definitionoftheztransform wesee that
f[k]=1,1,1,...
Occasionallyitispossible torewriteF(z)toobtain the required form.
Example22.29 Use the binomial theorem to expand
sequence withztransformF(z) =
Solution Usingthe binomial theorem, wehave
(
1 − 1 ) −3 (
=1+(−3) − 1 z z
(
1 − 1 ) −3
up to the term 1 z z4. Hence find the
z 3
(z −1) 3.
)
+ (−3)(−4)
2!
+ (−3)(−4)(−5) (
− 1 3! z
= 1 + 3 z + 6 z + 10
2 z + 15
3 z +··· 4
provided |z| > 1.Since
z 3 ( ) z −1 −3 (
F(z) =
(z −1) = = 1 − 1 ) −3
3 z z
(
− 1 ) 2
z
) 3
+ (−3)(−4)(−5)(−6)
4!
(
− 1 z
) 4
+···
718 Chapter 22 Difference equations and the z transform
we have
F(z)=1+ 3 z + 6 z 2 + 10
z 3 + 15
z 4 +···
ThusF(z) can beinverted directly togive
f[k] = 1,3,6,10,15,... thatis,f[k] =
(k +2)(k +1)
2
k 0
EXERCISES22.11
1 Find the inverseztransformsofthe following:
4z z 2 +2z
(a) (b)
z −4 3z 2 −4z−7
z +1 2z 3 +z
(c)
(z −3)z 2 (d)
(z−3) 2 (z−1)
2 Find the inverseztransformsof
(a)
2z
(z −2)(z −3)
(c) 1− 2 z +
z
(z −3)(z −4)
(b)
(d)
3 Express
(e)
ez
(ez −1) 2 4 If
z 2
(z 2 − 1 9 )
2z 2
(z −1)(z −0.905)
F(z) =
(z +1)(2z −3)(z −2)
z 3
in partial fractions andhence obtain its inversez
transform.
F(z) =
find f[k].
10z
(z −1)(z −2)
Solutions
1 (a) 4(4 k )
(b)
13(7/3) k
30
− (−1)k
10
(c) u[k −2] 4(3)k−2 − δ[k −2]
3
May be written as:
(d)
3 k−2 u[k −2] +3 k−3 u[k −3]
19k(3 k )
6
+ 3u[k]
4
2 (a) −2(2 k ) +2(3 k )
+ 5(3k )
4
(b) e −k k
(c) δ[k] −2δ[k −1] +4 k −3 k
(1/3) k + (−1/3) k
(d)
2
(e) 21.05u[k] −19.05(0.905) k
3 2 − 5 z − 1 z 2 + 6 z 3
f[0]=2,f[1]=−5,f[2]=−1,
f[3]=6,f[k]=0 k4
4 10(2 k −u[k])
22.12 THEzTRANSFORMANDDIFFERENCEEQUATIONS
InChapter21wesawhowusefultheLaplacetransformcanbeinthesolutionoflinear,
constantcoefficient,ordinarydifferentialequations.Similarlytheztransformhasarole
toplayinthe solutionofdifference equations.
22.12 The z transform and difference equations 719
Example22.30 Solve the difference equationy[k +1] −3y[k] = 0,y[0] = 4.
Solution Taking theztransformof both sides of the equation we have
Z{y[k +1] −3y[k]} = Z{0} = 0
since Z{0} = 0.Usingthe properties of linearitywefind
Z{y[k +1]} −3Z{y[k]} = 0
Usingthe first shift theorem on the first of the terms on the l.h.s. weobtain
zZ{y[k]} −4z −3Z{y[k]} = 0
Writing Z{y[k]} =Y(z), thisbecomes
(z −3)Y(z) =4z
so that
Y(z)=
4z
z −3
Thefunctiononther.h.s.istheztransformoftherequiredsolution.Invertingthis,from
Table 22.2 we findy[k] = 4(3) k .
Higherorderequationsaretreatedinthe same way.
Example22.31 Solve the second-order difference equation
y[k +2] −5y[k +1] +6y[k] = 0 y[0] = 0 y[1] = 2
Solution Takingtheztransformofbothsidesoftheequationandusingthepropertiesoflinearity
wehave
Z{y[k +2]} −5Z{y[k +1]} +6Z{y[k]} = 0
Fromthe first shift theoremwehave
where
z 2 Y(z) −z 2 y[0] −zy[1] −5(zY(z) −zy[0]) +6Y(z) = 0
Y(z) = Z{y[k]}
Substitutingvalues forthe conditionsgives
Hence
so that
z 2 Y(z) −0z 2 −2z −5(zY(z) −0z) +6Y(z) = 0
(z 2 −5z +6)Y(z) =2z
Y(z) = Z{y[k]} =
z 2 Y(z) −5zY(z) +6Y(z) = 2z
2z
(z −2)(z −3)
720 Chapter 22 Difference equations and the z transform
Dividing both sides byzgives
Y(z)
z
=
2
(z −2)(z −3)
Expressing the r.h.s.inpartialfractions yields
Y(z)
z
Y(z)=
= 2
z −3 − 2
z −2
2z
z −3 − 2z
z −2
Inverting gives the solution tothe difference equation:
y[k] = 2(3 k ) −2(2 k )
EXERCISES22.12
1 Useztransformsto solvethe following difference
equations:
(a) x[k +1] −3x[k] = −6,x[0] = 1
(b) 2x[k +1] −x[k] = 2 k ,x[0] = 2
(c) x[k +1] +x[k] = 2k +1,x[0] = 0
(d) x[k +2] −8x[k +1] +16x[k] = 0,
x[0] = 10,x[1] = 20
(e) x[k +2] −x[k] = 0,x[0] = 0,x[1] = 1
2 Solvethe difference equation
x[k +2] −3x[k +1] +2x[k] = δ[k]
subjectto the conditionsx[0] =x[1] = 0.
3 Solvethe difference equation
y[k +2] +3y[k +1] +2y[k] = 0
subjectto the conditionsy[0] = 0,y[1] = 1.
4 Solvethe difference equation
x[k +2] −7x[k +1] +12x[k] =k
subjectto the conditionsx[0] = 1,x[1] = 1.
Solutions
1 (a) x[k] = 3 −2(3 k )
2 k
(b)
3 + 5(1/2)k
3
(c) k (d) 10(4 k ) −5(k4 k )
(e)
u[k] − (−1) k
2
May be expressed asx[k] =
{ 0 k even
1 kodd
2 (2 k−1 −1)u[k −1]
3 (−1) k − (−2) k
4
k
6 + 5 11
u[k] +
36 4 (3)k − 17
9 (4)k
REVIEWEXERCISES22
1 State
(i) the order
(ii) the independentvariable
(iii) the dependentvariable
(iv) whether linearornon-linear
foreachofthe following equations:
(a) x[n] +x[n −2] = 6
(b) y[k+1]+ky[k−1]−k=0
(c) (y[z] +1)y[z +1] =z 2
(d) z[n] −z[n −1] =n 2 z[n −2]
Review exercises 22 721
(e) q[k +3] + √ q[k+2]=q[k]−1
Foreach linearequation state whether itis
homogeneous orinhomogeneous.
2 Given
3(x[n +1]) 2 −2x[n] =n 2 x[0] = 2
findx[1],x[2]andx[3].
3 Rewrite eachequationsothatthe highestargumentof
the dependent variableisasspecified.
(a) 3ny[n +1] −y[n −1] =n 2 ,highest argumentof
the dependent variableisto ben.
(b) z[k +2] + (3 +k/2)z[k] = √ kz[k −1],highest
argumentofthedependentvariableistobek+1.
(c) x[3]x[n] −x[2]x[n −1] = (n +1) 2 ,highest
argumentofthedependentvariableistoben +1.
4 Find f[k]if
F(z) =
z(1 −a)
(z −1)(z −a)
5 Find the inverseztransform of
(a)
(b)
3z(z +2)
(z −2)(z −3) 2
z 2 +3z
3z 2 +2z−5
6 Thesequence δ[k −i]istheKroneckerdeltasequence
shiftediunits to the right. Finditsztransform.
7 Show thatsinak canbe writtenas
Given that
show that
e akj −e −akj
2j
Z{e −ak } =
Z{sinak} =
z
z−e −a
zsina
z 2 −2zcosa+1
Solutions
1 (a) Second order,n,x,linear, inhomogeneous (b) z[k +1] +
(3 + 1 )
2 (k−1) z[k −1]
(b) Second order,k,y,linear, inhomogeneous
= √ k−1z[k−2]
(c) First order,z,y, non-linear
(c) x[3]x[n +1] −x[2]x[n] = (n +2) 2
(d) Second order,n,z,linear, homogeneous
4 u[k] −a k
(e) Third order,k,q,non-linear
5 (a) 12(2 k ) −12(3 k ) +5k3 k
2 1.1547,1.0503,1.4260
u[k] − (−5/3) k /3
(b)
3 (a) 3(n −1)y[n] −y[n −2] = (n −1) 2 2
1
6
z i
23 Fourierseries
Contents 23.1 Introduction 722
23.2 Periodicwaveforms 723
23.3 Oddandevenfunctions 726
23.4 Orthogonalityrelationsandotherusefulidentities 732
23.5 Fourierseries 733
23.6 Half-rangeseries 745
23.7 Parseval’stheorem 748
23.8 Complexnotation 749
23.9 Frequencyresponseofalinearsystem 751
Reviewexercises23 755
23.1 INTRODUCTION
The ability to analyse waveforms of various types is an important engineering skill.
Fourieranalysisprovidesasetofmathematicaltoolswhichenabletheengineertobreak
down a wave into its various frequency components. It is then possible to predict the
effect a particular waveform may have from knowledge of the effects of its individual
frequency components. Often an engineer finds it useful to think of a signal in terms of
its frequency components rather than in terms of its time domain representation. This
alternative view is called a frequency domain representation. It is particularly useful
when trying to understand the effect of a filter on a signal. Filters are used extensively
inmanyareasofengineering.Inparticular,communicationengineersusetheminsignal
receptionequipmentforfilteringoutunwantedfrequenciesinthereceivedsignal;thatis,
removing the transmission signal to leave the audio signal. We shall begin this chapter
by reviewing the essential properties of waves before describing how breaking down
into frequency components isachieved.
23.2 Periodic waveforms 723
23.2 PERIODICWAVEFORMS
Inthischapterweshallbeconcernedwithperiodicfunctions,especiallysineandcosine
functions. Let us recall some important definitions and properties ( already discussed in
Section 3.7. The function f (t) = Asin(ωt + φ) = Asin ω t + φ )
is a sine wave of
ω
amplitudeA, angular frequency ω, frequency ω 2π , periodT = 2π ω
and phase angle φ.
The time displacement isdefined tobe φ . These quantities areshown inFigure 23.1.
ω
SimilarremarkscanbemadeaboutthefunctionAcos(ωt + φ)andtogetherthesine
andcosinefunctionsformaclassoffunctionsknownassinusoidsorharmonics.Itwill
beparticularlyimportantforwhatfollowsthatyouhavemasteredtheskillsofintegrating
these functions. The following results can be found inTable 13.1 (see page 431):
∫
sinnωtdt = − cosnωt
nω +c ∫
cosnωtdt = sinnωt
nω +c
forn=±1,±2,...
Sometimesafunctionoccursasthesumofanumberofdifferentsineorcosinecomponents
such as
f (t) = 2sin ω 1
t +0.8sin2ω 1
t +0.7sin4ω 1
t (23.1)
The r.h.s.ofEquation (23.1) isalinear combinationof sinusoids.
Note in particular that the angular frequencies of all components in Equation (23.1)
are integer multiples of the angular frequency ω 1
. Functions like these can easily be
plottedusingagraphicscalculatororcomputergraph-plottingpackage.Thecomponent
with the lowest frequency, or largest period, is 2sinω 1
t. The quantity ω 1
is called the
fundamentalangularfrequencyandthiscomponentiscalledthefundamentalorfirst
harmonic. The component with angular frequency 2ω 1
is called the second harmonic
andsoon.Inwhatfollowsallangularfrequenciesareintegermultiplesofthefundamental
angular frequency as in Equation (23.1). A consequence of this is that the resulting
function, f (t), is periodic and has the same frequency as the fundamental. Some harmonicsmaybemissing.Forexample,inEquation(23.1)thethirdharmonicismissing.
Insomecases the first harmonic may be missing.For example, if
f(t) =cos2ω 1
t +0.5cos3ω 1
t +0.4cos4ω 1
t + ···
f(t)
A
2p
T = —
v , period
Amplitude
– f v
t
–A
Figure23.1
The function: f (t) =Asin(ωt + φ).
724 Chapter 23 Fourier series
all angular frequencies are integer multiples of the fundamental angular frequency ω 1
,
which is missing. Nevertheless, f (t) has the same angular frequency as the fundamental.
A common value for ω 1
is 100π as this corresponds to a frequency of 50 Hz, the
frequency of the UK mains supply.
Example23.1 Describe the frequency and amplitude characteristics of the different harmonic components
of the function
f (t) = cos20πt +0.6cos60πt −0.2sin140πt
Solution Thefundamentalangularfrequencyis20πarisingthroughthetermcos20πt.Thiscorresponds
to a frequency of 10 Hz. This term has amplitude 1. The second, fourth, fifth
andsixthharmonicsaremissing,whilethethirdandseventhhaveamplitudes0.6and0.2,
respectively.
Example23.2 If f (t) = 2sint + 3cost, express f (t) as a single sinusoid and hence determine its
amplitude and phase.
Solution Both terms have angular frequency ω = 1. Recalling the trigonometric identity (Section
3.7)
Rcos(ωt−θ)=acosωt+bsinωt
whereR = √ a 2 +b 2 ,tanθ = b a , we see that in this caseR = √ 3 2 +2 2 = √ 13 and
tanθ = 2 , that is θ = 0.59 radians. Therefore we can express f (t)inthe form
3
f(t)= √ 13cos(t −0.59)
We see immediately that this is a sinusoid of amplitude √ 13 and phase angle −0.59
radians.
Example23.3 Findthe amplitudeand phase ofthe fundamentalcomponent ofthe function
f(t) =0.5sinω 1
t +1.5cosω 1
t +3.5sin2ω 1
t −3cos3ω 1
t
Solution Contributions to the fundamentalcomponent -- that is, that with the lowest frequency --
come from the terms 0.5sin ω 1
t and 1.5cos ω 1
t only. To find the amplitude and phase
wemustexpress theseasasinglecomponent.Usingthe trigonometricidentity
Rcos(ωt−θ)=acosωt+bsinωt
whereR = √ a 2 +b 2 ,tanθ = b a , wefind
R = √ 1.5 2 +0.5 2 = 1.58
tanθ = 0.5
1.5 = 1 thatis, θ = 0.32 radians
3
Therefore the fundamental can bewritten1.58cos(ω 1
t −0.32) and has amplitude1.58
and phase angle −0.32 radians.
23.2 Periodic waveforms 725
f(t)
1
f(t)
1
–1 0 1 2 3 t
Figure23.2
Graph forExample 23.4.
–1 0 1
Figure23.3
Graph forExample 23.5.
2 3 4 5 6 t
Many other periodic functions arise in engineering applications as well as the more familiar
harmonic waves. Remember, to be periodic the function values must repeat at
regularintervalsknownastheperiod,T.Theangularfrequency ω isgivenby ω = 2π
T .
To describe a periodic function mathematically it is sufficient to give its equation over
one full period and state that period. From this information the complete graph can be
drawn as Examples 23.4 and 23.5 show.
Example23.4 Sketch the graph ofthe periodicfunctiondefined by
f(t)=t 0t<1 period1
Solution To proceed we first sketch the graph on the given interval 0 t < 1 (Figure 23.2), and
thenusethefactthatthefunctionrepeatsregularlywithperiod1tocompletethepicture.
Example23.5 Write down a mathematical expression for the function whose graph is shown in
Figure 23.3.
Solution We first note that the interval over which the function repeats itself is 2; that is, period
= 2. It is then sufficient to describe the function over any interval of length 2. The
simplest interval to take is 0 t < 2. We note in this example that a single formula is
insufficienttodescribethefunctionfor0 t < 2sincedifferentbehaviourisexhibited
inthetwointervals0 t < 1and1 t < 2.For0 t < 1thefunctionisarampwith
slope1andpassesthroughtheorigin,thatisithasequation f (t) =t.For1 t < 2the
function value remains constant at 1. Therefore this periodic function can be described
by the expression
{
t 0t<1 period2
f(t)=
1 1t<2
EXERCISES23.2
1 Describethe frequency andamplitudecharacteristics
ofthedifferentharmoniccomponentsofthefollowing
waveforms:
(a)f(t) =3sin100πt −4sin200πt
+0.7sin300πt
(b) f (t) = sin40t −0.5cos120t
+0.3cos240t
Use agraph-plottingcomputer packageorgraphics
calculator to graphthese waveforms.
726 Chapter 23 Fourier series
2 Expresseach ofthe following functionsasasingle
sinusoidandhence findtheir amplitudes and phases.
(a) f(t)=2cost−3sint
(b) f(t) =0.5cost +3.2sint
(c) f(t) =3cos3t
(d) f(t)=2cos2t+3sin2t
3 Sketch the graphs ofthe following functions:
(a) f(t) =t 2 ,−1t 1,period2
{
0 0 t < π/2 period π
(b) f(t) =
sint π/2tπ
{
−t −2t<0 period3
(c) f(t) =
t 0t<1
f(t)
f(t)
1
1
–2 –1 1 2 t –2 –1 0 1 2 t
(a)
(b)
f(t)
1
–3 –2 1 3 4 t
(c)
Figure23.4
4 Write down mathematical expressions to describe the
functionswhose graphs are shown in Figure23.4.
Solutions
1 (a) Fundamental frequency is50 Hz, amplitude3.
Second harmonic has frequency of100 Hz,
amplitude4. Thirdharmonic has frequency of
150 Hz,amplitude0.7.
(b) Fundamental frequency is 20 Hz, amplitude 1.
π
Second harmonic is missing.Third harmonic has
frequency of 60 Hz, amplitude 0.5. Fourth and
π
fifth harmonics are missing.Sixth harmonic has
frequency of 120 Hz, amplitude 0.3.
π
√ √
2 (a) 13cos(t +0.983);amplitude = 13,
phase = 0.983
(b) 3.24cos(t +4.867);amplitude = 3.24,
phase = 4.867
(c) 3cos3t;amplitude = 3,phase = 0
√ √
(d) 13cos(2t −0.983);amplitude = 13,
phase = −0.983
{
1 0 t1
4 (a) f(t)=
0 1<t<2 period =2
{
2t 0t1/2
(b) f(t) =
2−2t 1/2<t<1
period =1
{
0 0 t1
(c) f(t) =
t/2−1/2 1<t<3
period =3
23.3 ODDANDEVENFUNCTIONS
Thefunctionssint andcost eachpossesscertainpropertieswhichcanbegeneralizedto
otherfunctions.Figure23.5showsthegraphof f (t) = cost.Itisobviousfromthegraph
that the function value at a negativet value, say − π , will be the same as the function
4
value at the corresponding positivet value, in this case + π . This is true because the
4
graph is symmetrical about the vertical axis. We can therefore state that for any value
oft,cos(−t) =cost.
23.3 Odd and even functions 727
f(t)
1
–2p
–p
– p –4 p –4
p
2p
t
Figure23.5
Thefunction: f (t) = cost.
f(t)
f(t)
3
(a)
Figure23.6
Examples ofeven functions:(a) f (t) =t 2 ;(b) f (t) = 3.
t
(b)
t
More generally, any function with the property that f (t) = f (−t) for any value of
its argument,t, issaidtobe aneven function.
If f (t) = f (−t) then f isaneven function.
In particular, the set of functions cosnωt, for any integern, is even. The graphs of all
even functions are symmetrical about the vertical axis -- or equivalently, the graph on
the left of the origin can be obtained by reflecting in the vertical axis that on the right.
Someother examples ofeven functions areshown inFigure 23.6.
Sketchingagraphshowsuptherequiredsymmetryimmediately.However,evenfunctions
can be identified by an algebraic approach as shown in Example 23.6. Given any
function f (t), weexamine f (−t)tosee if f (t) = f (−t).
Example23.6 Show that f (t) =t 2 is even.
Solution We can argue asfollows. If
f(t)=t 2
then
f(−t) = (−t) 2
=t 2
=f(t)
so that f (t)iseven, by definition.
728 Chapter 23 Fourier series
Example23.7 Test whether or notthe function f (t) = 4t 3 iseven.
Solution If
f(t)=4t 3
then
f(−t) = 4(−t) 3
= −4t 3
=−f(t)
so that f (−t) isnotequal to f (t)and therefore the given function isnot even.
Let us turn now to the graph of f (t) = sint in Figure 23.7. It is obvious from the
graph that the function value at a negativet value, say − π , will not be the same as the
4
function value at the corresponding positivet value, in this case + π . This graph is not
4
symmetricalabouttheverticalaxis.However,wecanstatesomethingelse.Thefunction
value at a negativet value ( is minus the function value at the corresponding positivet
value.Forexample,sin − π ) ( ) π
=−sin .Infact,forallvaluesoft wecanstatethat
4 4
sin(−t) = −sint. More generally, any function with the property that f (−t) = −f (t)
for all values of its argument,t, issaidtobe anodd function.
If f (−t) = −f (t) then f isanoddfunction.
In particular, the set of functions sinnωt, for any integer n, is odd. In Example 23.7,
f(t)=4t 3 , we found that f (−t) = −f (t) so this function is odd. The graph of an odd
functioncanbeobtainedbyreflectionfirstinthehorizontalaxisandtheninthevertical
axis. Somemoreexamples ofoddfunctions areshown inFigure 23.8.
Therearesomefunctionsthatareneitheroddnoreven--forexample,theexponential
function (see Chapter2).
Example23.8 Showthatanyfunction, f (t),canbeexpressedasthesumofanoddcomponentandan
even component.
Solution We can write
f(t)=f(t)+ f(−t)
2
Rearranging gives
− f(−t)
2
= f(t)
2 + f(t)
2 + f(−t) − f(−t)
2 2
f(t)= f(t)+f(−t)
2
+ f(t)−f(−t)
2
23.3 Odd and even functions 729
f(t)
t
f(t)
1
(a)
f(t)
1
– p –4
p –4
t
0
–1
t
–1
(b)
Figure23.7
The function: f (t) = sint.
Figure23.8
Examples ofodd functions:
(a)f(t) =t; { 1 t>0
(b)f(t) =
−1 t<0.
Now it is easy to check that the first term on the r.h.s. is even and the second term is
odd, so that we have expressed f (t) as the sum of an even and an odd component as
required.
Example23.9 Showthattheproductoftwoevenfunctionsisitselfanevenfunction.Determinewhether
the product of two odd functions is even or odd. Is the product of an even function and
anoddfunctioneven orodd?
Solution If f (t) andg(t) are even then f (−t) = f (t) andg(−t) = g(t). LetP(t) = f (t)g(t) be
the product of f andg. Then
P(−t) = f (−t)g(−t)
by definition
= f (t)g(t) since f andgare even
=P(t)
Therefore,P(−t) = P(t) and so the product f (t)g(t) is itself an even function. On the
otherhand, if f (t)andg(t) areboth oddwefind
P(−t) = f(−t)g(−t)
= (−f(t))(−g(t))
= f(t)g(t)
=P(t)
so thatthe product f (t)g(t)iseven.
730 Chapter 23 Fourier series
f(t)
f(t)
–a 0
a
t
Figure23.9
A typicalodd function, f (t).
–a 0
Figure23.10
Atypical even function, f (t).
a
t
If f (t)iseven andg(t) isodd, we find
P(−t) = f(−t)g(−t)
= f(t)(−g(t))
= −f(t)g(t)
= −P(t)
so the product is an odd function. These rules are obviously analogous to the rules for
multiplying positive and negative numbers.
Theresults ofExample 23.9 are summarizedthus:
(even) × (even) = even
(odd) ×(odd) =even
(even) × (odd) =odd
23.3.1 Integralpropertiesofevenandoddfunctions
Consider a typical odd function, f (t), such as that shown in Figure 23.9. Suppose we
wish to evaluate ∫ a
f (t)dt where the interval of integration [−a,a] is symmetrical
−a
abouttheverticalaxis.RecallfromChapter13thatadefiniteintegralcanberegardedas
theareaboundedbythegraphoftheintegrandandthehorizontalaxis.Areasabovethe
horizontalaxisarepositivewhilethosebelowarenegative.Weseethatbecausepositive
andnegativecontributionscancel,theintegralofanoddfunctionoveranintervalwhich
issymmetricalaboutthe vertical axiswill bezero.
Example23.10 Evaluate ∫ π
−π tcosnωtdt.
Solution The functiont is odd. The function cosnωt is even and hence the producttcosnωt is
odd. The interval [−π, π] is symmetrical about the vertical axis and hence the required
integral iszero.
Considernowatypicalevenfunction, f (t),suchasthatshowninFigure23.10.Suppose
we wish to evaluate ∫ a
f (t)dt. Clearly the area bounded by the graph and thet axis in
−a
23.3 Odd and even functions 731
the interval [−a,0] is the same as the corresponding area in the interval [0,a]. Hence
wecan write
∫ a
−a
f(t)dt =2
∫ a
0
f(t)dt
Example23.11 Evaluate ∫ π
−π tsintdt.
Solution Thefunctionst andsint arebothodd,andhencetheirproductiseven.Therefore,using
the factthatthe integrand iseven we can write
∫ π
−π
tsintdt=2
So,integrating by parts,
2
∫ π
0
∫ π
0
tsintdt
∫ π
)
tsintdt=2
([−tcost] π 0 + costdt
0
= 2((−πcosπ) − (0) +[sint] π 0 )
= 2π
EXERCISES23.3
1 Determine byinspection whether each ofthe (c)f(t) =t 3 ;
(b)f(t) = t 2 +1;
functionsin Figure23.11 isodd, even orneither. (d) f(t) =cost +0.1t 2 .
f(t)
f(t)
2 Byusingthe properties ofodd andeven functions
developed in Example23.9 statewhether the
following are odd, even orneither:
(a) t 3 sin ωt (b) tcos2t
t
t (c) sint sin4t (d) cos ωt sin2ωt
(e) e t sint
(a)
(b)
f(t)
f(t) 3 Evaluate the following integrals usingthe integral
propertiesofodd andeven functionswhere
appropriate:
t
t (a) ∫ 5
−5 t3 dt (b) ∫ 5
−5 t3 cos3tdt
(c)
(d)
(c) ∫ π
−π t2 sintdt (d) ∫ 2
−2 tcosh3tdt
Figure23.11
(a)The function: f (t) = −t;
(e) ∫ 1
−1 |t|dt (f) ∫ 1
−1 t|t|dt
732 Chapter 23 Fourier series
Solutions
1 (a) odd (b) neither
(c) odd (d) even
2 (a) even (b) odd (c) even
(d) odd (e) neither
3 (a)0 (b)0
(c) 0 (d) 0
(e) 1 (f) 0
23.4 ORTHOGONALITYRELATIONSANDOTHER
USEFULIDENTITIES
RecallfromChapter16thattwofunctions f (t)andg(t)aresaidtobeorthogonalonthe
intervala t bif
∫ b
a
f(t)g(t)dt =0
Example23.12 Show that the functions cosmωt and cosnωt withm,npositive integers andm ≠n are
orthogonal on the interval − π ω t π ω .
Solution We mustevaluate
∫ π/ω
−π/ω
cosmωt cosnωtdt
Usingthetrigonometricidentity2cosAcosB = cos(A +B) +cos(A −B),wefindthe
integral becomes
∫
1 π/ω
cos(m +n)ωt +cos(m −n)ωtdt
2 −π/ω
= 1 2
[ sin(m +n)ωt
(m +n)ω
]
sin(m −n)ωt π/ω
+
(m −n)ω
−π/ω
= 0
since sin(m ±n)π = 0 for all integersm,n. It was necessary to requirem ≠ n since
otherwise the second quantity inbrackets becomes undefined.
Hence cosmωt and cosnωt are orthogonal on the given interval.
AnumberofotherfunctionsregularlyappearinginworkconnectedwithFourieranalysis
are orthogonal. The main results together with some other useful integral identities are
given inTable 23.1.
23.5 Fourier series 733
Table23.1
Someusefulintegral identities.
∫ T
sin 2nπt dt = 0 forallintegersn
0 T
∫ T
0
∫ T
0
∫ T
0
∫ T
0
∫ T
0
cos 2nπt
T
cos 2nπt
T
cos 2mπt
T
sin 2mπt
T
sin 2mπt
T
dt =0
dt=T n=0
cos 2nπt
T
sin 2nπt
T
cos 2nπt
T
n=1,2,3,...
{ 0 m≠n
dt =
T/2 m=n≠0
{ 0 m≠n
dt =
T/2 m=n≠0
dt = 0
forall integersmandn
23.5 FOURIERSERIES
We have seen that the functions sinωt, sin2ωt, sin3ωt,...,cos ωt, cos2ωt,... are
periodic.Furthermore,linearcombinationsofthemarealsoperiodic.Theyarealsoconvenient
functions to deal with because they can be easily differentiated, integrated, etc.
Theyalsopossessanotherveryusefulproperty--thatof completeness.Thismeansthat
almost any periodic function can be expressed as a linear combination of them and no
additional functions are required to do this. In other words, they can be used as buildingblockstoconstructperiodicfunctionssimplybyaddingparticularmultiplesofthem
together.
We shall see, for example, that the sawtooth waveform with period 2π, shown in
Figure 23.12, isgiven by the particular combination
f(t)=2
(sint− 1 2 sin2t + 1 3 sin3t − 1 4 sin4t + 1 sin5t−···)
5
Thisisaninfiniteserieswhichcanbeshowntoconvergeforalmostallvaluesoft tothe
function f.Thismeansthatifanyvalueoft issubstitutedintotheinfiniteseriesandthe
series is summed, the result will be the same as the value of the sawtooth function at
thatvalueoft.Thereisanexception:ift isoneofthepointsofdiscontinuitytheinfinite
series will converge tothe mean of the values toitsleft and right,that is0.
Toobtainafeelforwhatishappening considerFigure23.13. Graphs(a),(b)and(c)
showtheeffectofincludingmoreandmoretermsintheseries.Asmoretermsaretaken
f(t)
p
–p
p
3p
t
Figure23.12
Sawtooth waveform.
734 Chapter 23 Fourier series
f(t)
f(t)
f(t)
(a)
t
(b)
Figure23.13
Fourier synthesisofasawtooth waveform: (a) f (t) = 2sint;
(b)f(t) =2(sint − 1 2 sin2t + 1 3 sin3t);
(c)f(t) =2(sint − 1 2 sin2t + 1 3 sin3t − 1 4 sin4t + 1 5 sin5t).
t
(c)
t
weseethattheseriesapproachesthedesiredsawtoothwaveform.Theprocessofadding
together sinusoids to form a new periodic function is called Fourier synthesis. We see
thatthesawtoothwaveformhasbeenexpressedasaninfiniteseriesofharmonicwaves,
sint beingthefundamentalorfirstharmonic,andtherestbeingwaveswithfrequencies
thatareintegermultiplesofthefundamentalfrequency.Thisinfiniteseriesiscalledthe
Fourier series representation of f (t) and what we have succeeded in doing is to break
down f (t) into its component harmonic waveforms. In this example, only sine waves
were required to construct the function. More generally we shall need both sine and
cosine waves.
Suppose the function f (t) is defined in the interval 0 < t < T and is periodic with
periodT. Then, under certainconditions, its Fourier series isgiven by
f(t)= a 0
2 + ∞
∑
orequivalently
n=1
(
a n
cos 2nπt
T
+b n
sin 2nπt )
T
f(t)= a ∞
0
2 + ∑
(a n
cosnωt +b n
sinnωt)
n=1
(23.2)
where a n
and b n
are constants called the Fourier coefficients. These are given by the
formulae
a 0
= 2 T
a n
= 2 T
b n
= 2 T
∫ T
0
∫ T
0
∫ T
0
f(t)dt (23.3)
f(t)cos 2nπt
T
f(t)sin 2nπt
T
dt fornapositive integer (23.4)
dt fornapositive integer (23.5)
The term a 0
represents the mean value or d.c. component of the waveform (see Section15.2).ThederivationoftheseformulaeappearsinExample23.16.Itisimportantto
2
23.5 Fourier series 735
point out that the integrals in Equations (23.3), (23.4) and (23.5) can be evaluated over
anycompleteperiod,forexamplefromt = − T 2 tot =T 2 .Prudentchoiceoftheinterval
ofintegrationcanoftensaveeffort.Theexpressionappearinginther.h.s.oftheFourier
representation, Equation (23.2), is an infinite series. We list conditions, often called the
Dirichlet conditions, sufficient for the series to converge to the value of the function
f (t). The integral ∫ |f (t)|dt over a complete period must be finite, and f (t) may have
no more than a finite number of discontinuities in any finite interval. Fortunately, most
signals of interest to engineers satisfy these conditions. At a point of discontinuity the
Fourier series converges to the average of the two function values at either side of the
discontinuity.
Example23.13 Findthe Fourier series representation ofthe function with periodT = 1 given by
50
{
1 0 t<0.01
f(t)=
0 0.01 t < 0.02
Solution Thefunction f (t)isshowninFigure23.14.Notethat f (t)isdefinedtobezerobetween
t = 0.01 and t = 0.02. This means we need only consider 0 t < 0.01. Using
Equations (23.3)--(23.5) wefind
∫ 0.02
a 0
= 100 f(t)dt =100
0
a n
= 100
b n
= 100
∫ 0.01
0
∫ 0.01
0
Noting thatcosnπ = (−1) n wefind
b n
= 1
nπ (1 − (−1)n )
∫ 0.01
0
= 100[t] 0.01
0
= 1
1dt+100
∫ 0.02
[ sin100nπt
cos100nπtdt = 100
100nπ
0.01
] 0.01
= 0
[ −cos100nπt
sin100nπtdt = 100
100nπ
0
0dt
] 0.01
= − 1 (cosnπ −cos0)
nπ
0
f(t)
1
1 –––
100
T =
1 ––
50
t
Figure23.14
Graph forExample 23.13.
736 Chapter 23 Fourier series
Ifnisevenb n
=0.Ifnisoddb n
= 2 . Therefore the Fourier series representation of
nπ
f(t)is
f(t)= 1 2 + 2 (
sin100πt + sin300πt + sin500πt )
+···
π 3 5
The average value of the waveform is 1 . This is the zero frequency component or d.c.
2
value. We notethat inthisexample only odd harmonics arepresent.
Example23.14 Findthe Fourier series representation of f (t) = 1 +t, −π <t π, period2π.
Solution As usual we sketch f (t) first as this often provides insight into what follows (see Figure
23.15). HereT = 2π, ω = 1, and for convenience we shall consider the period of
integration tobe[−π,π]. UsingEquation (23.3)wefind
a 0
= 1 ∫ π
π −π1+tdt= 1 [ ] π
t + t2
π 2 −π
= 1 ) ))
((π + π2
−
(−π + π2
π 2 2
= 1 π (2π)
= 2
Similarly, using Equation (23.4) wefind
a n
= 1 π
∫ π
−π
(1+t)cosntdt
Integrating by parts gives
a n
= 1 ([
(1+t) sinnt
π n
= 1 π
] π
−π
( [ ] cosnt π )
0 +
n 2 −π
= 1 (cosnπ −cos(−nπ))
πn2 ∫ π
)
sinnt
− dt
−π n
since sin±nπ = 0
f(t)
1
–p
p
t
Figure23.15
Graph forExample23.14.
23.5 Fourier series 737
but cos(−nπ) = cosnπ and hence a n
= 0, for all positive integers n. Using Equation
(23.5) we find
∫ π
b n
= 1 (1+t)sinntdt
π −π
= 1 ([
−(1+t) cosnt
π n
= 1 π
] π
−π
∫ π
)
cosnt
+ dt
−π n
(
−(1+π) cosnπ +(1−π) cos(−nπ) [ ] sinnt π )
+
n n n 2 −π
= 1 πn (−2πcosnπ)
since sin ±nπ = 0.Hence,
b n
=− 2 n cosnπ=−2 n (−1)n
Wefindb 1
=2,b 2
=−1,b 3
= 2 ,.... Thus the Fourier series representation is given
3
fromEquation (23.2) as
f(t)=1+2sint−sin2t+ 2 3 sin3t+···
which wecan writeconcisely as
f(t)=1−
∞∑
n=1
2
n (−1)n sinnt
Example23.15 Find the Fourier series representation of the function with period 2π defined by
f(t)=t 2 ,0<t2π.
Solution Asusualwesketch f (t),asshowninFigure23.16.HereT = 2πandweshallintegrate,
forconvenience, over the interval [0,2π]. UsingEquation (23.3)wefind
a 0
= 1 π
∫ 2π
0
t 2 dt = 1 π
[ t
3
3
] 2π
0
= 8π2
3
f(t)
4p 2
2p
t
Figure23.16
Graph forExample 23.15.
738 Chapter 23 Fourier series
Using Equation (23.4) wehave
a n
= 1 π
∫ 2π
0
t 2 cosntdt
Integrating by parts, wefind
a n
= 1 π
= − 2
nπ
([
t 2sinnt
n
∫ 2π
0
] 2π
0
−
tsinntdt
∫ 2π
= − 2 ([
−t cosnt ] 2π
nπ n
0
= − 2 ( −2πcos2nπ
+
nπ n
0
2t sinnt
n
)
dt
∫ 2π
cosnt
+
0 n
)
[ sinnt
n 2 ] 2π
0
)
dt
= 4 n 2
Hencea 1
= 4,a 2
= 1,a 3
= 4 ,.... Similarly,
9
b n
= 1 π
∫ 2π
0
t 2 sinntdt
= 1 ([
−t 2cosnt
] 2π
+
π n
0 0
= 1 (− 4π2
π n cos2nπ + 2 n
= 1 π
= 1 π
∫ 2π
(− 4π2
n + 2 ([ tsinnt
n n
(− 4π2
n − 2 [
− cosnt
n 2 n
∫ 2π
0
] 2π
0
2t cosnt
n
] 2π
0
)
dt
)
tcosntdt
∫ 2π
))
sinnt
− dt
0 n
)
= − 4π n
Thus b 1
= −4π, b 2
= −2π,.... Finally, the required Fourier series representation is
given by
f(t)= 4π2
3 + (
4cost+cos2t+ 4 9 cos3t+···)
(
−π 4sint+2sin2t+ 4sin3t
3
)
+···
23.5 Fourier series 739
Example23.16 Obtain the expressions for the Fourier coefficients a 0
, a n
and b n
in Equations (23.3),
(23.4) and (23.5).
Solution Assumethat f (t)can be expressed inthe form
f(t)= a ∞
0
2 + ∑
(
a n
cos 2nπt +b
T n
sin 2nπt )
T
n=1
(23.6)
Multiplying Equation (23.6) through by cos 2mπt and integrating from 0 toT wefind
T
∫ T
0
f(t)cos 2mπt
T
dt =
∫ T
a 0
2
0
∫ T
+
0
2mπt
cos dt
T
∞∑
(
a n
cos 2nπt
T
n=1
+b n
sin 2nπt )
cos 2mπt dt
T T
If we now assume that it is legitimate to interchange the order of integration and
summation weobtain
∫ T
0
f(t)cos 2mπt
T
dt =
∫ T
a 0
0
+
2
∞∑
n=1
2mπt
cos dt
T
∫ T
0
(
a n
cos 2nπt +b
T n
sin 2nπt
T
)
cos 2mπt dt
T
The first integral on the r.h.s.iseasily shown tobe zero unlessm = 0. Furthermore, we
can use the previously found orthogonality properties (Table 23.1) to show that the rest
of the integrals on the r.h.s. vanish except for the case whenn = m, in which case the
r.h.s.reduces to a m T
2 . Consequently,
a m
= 2 T
∫ T
0
f(t)cos 2mπt
T
dt
forall positive integersm
as required. Whenm = 0 all terms on the r.h.s.except the first vanish and weobtain
so that
∫ T
0
a 0
= 2 T
f(t)dt =
∫ T
0
∫ T
0
= a 0 T
2
f(t)dt
a 0
2 dt
To obtain the formula for the b n
multiply Equation (23.6) through by sin 2mπt and
T
integrate from 0 toT.
∫ T
0
f(t)sin 2mπt
T
dt =
∫ T
a 0
2
0
∫ T
+
0
2mπt
sin dt
T
∞∑
(
a n
cos 2nπt
T
n=1
+b n
sin 2nπt )
sin 2mπt dt
T T
740 Chapter 23 Fourier series
Again assuming that it is legitimate to interchange the order of integration and summation,
weobtain
∫ T
0
f(t)sin 2mπt
T
dt =
∫ T
a 0
0
+
2
∞∑
n=1
2mπt
sin dt
T
∫ T
0
(
a n
cos 2nπt +b
T n
sin 2nπt
T
)
sin 2mπt dt
T
The first integral on the r.h.s. is easily shown to be zero. Furthermore, we can use the
properties given in Table 23.1 to show that the rest of the integrals on the r.h.s. vanish
exceptforthecasewhenn =m,inwhichcasether.h.s.reducesto b m T
2 .Hencewefind
b m
= 2 T
as required.
∫ T
0
f(t)sin 2mπt
T
dt
23.5.1 Fourierseriesofoddandevenfunctions
LetusnowconsiderwhathappenswhenwedetermineFourierseriesoffunctionswhich
are either oddoreven.
Example23.17 Findthe Fourier series forthe functionwith period2π definedby
⎧
⎪⎨ 0 −π<t<−π/2
f(t)= 4 −π/2tπ/2
⎪⎩
0 π/2<t<π
Solution As usual we sketch the function first (Figure 23.17). Inspection of Figure 23.17 shows
that the Dirichlet conditions (page 735) are satisfied. We note from the graph that the
function is symmetrical about the vertical axis; that is, it is an even function. We shall
see shortly that this fact has important implications for the Fourier series representation.
For convenience we consider the period of integration to be
Equation (23.4) becomes
[
− T 2 ,T 2
]
. Hence
a n
= 2 T
∫ T/2
−T/2
f(t)cos 2nπt
T
dt
for all positive integersn
Inthis example the periodT equals 2π. The formulafora n
then simplifies to
a n
= 1 π
∫ π
−π
f(t)cosntdt
The interval of integration is fromt = −π tot = π. However, a glance at the graph
shows that the function is zero outside the interval − π 2 t π , and takes the value 4
2
23.5 Fourier series 741
y
4
– 3π –– –π π –2
π –2
2
–
π
3π ––2
t
Figure23.17
Graph forExample 23.17.
inside. The integral thus reduces to
a n
= 1 π
∫ π/2
−π/2
4cosntdt
= 4 π
∫ π/2
−π/2
cosntdt
[ sinnt
] π/2
= 4 π n
−π/2
= 4 [sin nπ (
nπ 2 −sin − nπ 2
= 8
nπ sinnπ 2
)]
We obtaina 1
= 8 π ,a 2 =0,a 3 =−8 3π , etc. We finda 0
using Equation (23.3), again
[
integrating over − T ]
2 ,T :
2
a 0
= 2 T
= 1 π
= 4
∫ T/2
−T/2
∫ π/2
−π/2
f(t)dt
4dt = 4 π [t]π/2 −π/2
Similarly, tofind the Fourier coefficients,b n
, we use Equation (23.5):
b n
= 2 T
∫ T/2
−T/2
which reduces to
∫ π/2
f(t)sin 2nπt
T
dt
b n
= 1 4sinntdt
π −π/2
= 4 [ ] −cosnt π/2
π n
−π/2
= 4 [
−cos nπ ]
nπ 2 +cosnπ 2
= 0
742 Chapter 23 Fourier series
that is, all the Fourier coefficients,b n
, are zero. Finally we can gather together all our
results and writedown the Fourier series representation of f (t):
f(t)=2+ 8 π cost− 8
3π cos3t + 8
5π cos5t−···
In this example we see that there are no sine terms at all. In fact, whenever a function
is even its Fourier series will possess no sine terms. To see this we note thatb n
can be
found from
b n
= 2 T
∫ T/2
−T/2
f(t)sin 2nπt
T
dt
Since f (t) is even and sin 2nπt is odd, the product f (t)sin 2nπt is odd also. Now the
T
T
integral of an odd function on an interval which is symmetrical about the vertical axis
wasshowninSection23.3tobezero.Hencewhenever f (t)isevenwecanimmediately
assumeb n
= 0 for alln.
Correspondingly, when a function is odd its Fourier series will contain no cosine or
constant terms.This isbecause the product
f(t)cos 2nπt
T
isodd alsoand so the integral
a n
= 2 T
∫ T/2
−T/2
f(t)cos 2nπt
T
dt
will equal zero. We conclude that when f (t) is odd,a n
= 0 for alln. These facts can
oftenbeusedtosavetimeandeffort.KnowingthefunctioninExample23.17waseven
before we started the Fourier analysis, we could have assumed that theb n
would all be
zero.
Example23.18 FindtheFourierseriesrepresentationofthesawtoothwaveformdescribedatthebeginning
ofthissection (see Figure 23.12).
Solution This function is defined by f (t) =t, −π <t < π, and has periodT = 2π. It is an odd
function and hencea n
= 0 foralln. To find theb n
wemustevaluate
b n
= 1 π
= 1 π
∫ π
−π
tsinntdt
{[ −tcosnt
n
] π
−π
= 1 {−πcosnπ − πcosnπ}
nπ
∫ π
}
cosnt
+ dt
−π n
since the lastintegral vanishes. Therefore,
b n
=− 2 n cosnπ=−2 n (−1)n
23.5 Fourier series 743
We conclude thatb 1
= 2,b 2
= −1,b 3
= 2 ,.... Therefore f (t)has Fourier series
3
f(t)=2
{sint − 1 2 sin2t + 1 3 sin3t − 1 4 sin4t + 1 sin5t−···}
5
Engineeringapplication23.1
Spectrumofapulsewidthmodulationcontrolledsolarcharger
Pulse width modulated (PWM) signals are found in a wide variety of electronics
applications. APWMsignalissimplyarectangular wave with a fixed timeperiod,
T,wherethe‘on’timeofthesignal,knownasthemarktime,m,canbevaried.The
ratio of the ‘on’ time,m, of the signal to the period,T, is known as the duty cycle,
denotedbyd:
duty cycle (d) = ‘on’time
period = m T
The frequency of the PWM signalis f = 1 T .
Solarchargecontrollers,whichoftenusepulsewidthmodulation,aredevicesthat
protect the batteries in the system from becoming overcharged or from becoming
fully discharged, both of which can cause permanent damage. Pulse width modulation
is used to vary the charge rate to the batteries. Charge controllers and other
devicesthatemploypulsewidthmodulationgeneratehigh-frequencyharmonicsdue
totheir switching.
Consider the PWM signal shown in Figure 23.18. It has an amplitude ofAvolts.
Ithas periodT and mark timem, so thatthe duty cycle isd = m T .
Note that the function is even because of the symmetry about the vertical axis.
This means that the Fourier series will contain only cosine terms, that is all theb n
values arezero. Thea n
values can beshown (see Exercises 23.5, Question 8) tobe
a n
= 2A
nπ sin(nπd) T t
m
A
m
T
Figure23.18
A PWM signal ofperiodT andduty cyclem/T.
➔
744 Chapter 23 Fourier series
Notethatthevalueofa n
istheamplitudeofthenthharmonic.Further,thefrequency
of the nth harmonic is nf, that is integer multiples of the frequency of the PWM
signal.Agraphof |a n
|against fnisoftenreferredtoasthefrequencyspectrum.This
concept isdiscussed further inChapter 24.
Consider the specific case of a 12 V signal at a frequency of f = 300 Hz.
Figures 23.19(a)--(d) show the frequency spectra when d = 0.01,0.15,0.30 and
0.50 respectively.
It can be seen that the frequency spectrum becomes narrower for higher values
of the duty cycle. The sinc function (see Section 3.5) forms the envelope for the
spectrum, which is most clearly evident for d = 0.15. For low values of the duty
cycletheharmonicsmayextenduptoradiofrequenciesandhencethereisapotential
forinterference tooccur.
Note thatwhend = 0.5
a n
= 2A ( nπ
)
nπ sin 2
( nπ
)
When n is even the value of sin is zero and so all the even harmonics are
2
absent. In other words, when d = 0.5 the PWM signal possesses only odd
harmonics.
a n (V)
—
—
0.24
0.22
0.20
0.18
0.16
0.14
a n (V) 3.5
—
—
3.0
2.5
2.0
1.5
1.0
0.5
a n (V)
—
—
6
(a)
2 4 6 8
f (kHz)
a n (V)
—
—
(b)
2 4 6 8
f (kHz)
5
6
4
3
4
2
1
2
(c)
2 4 6 8
f (kHz)
Figure23.19
Spectrum for(a)d = 0.01, (b)d = 0.15,(c)d = 0.30,(d)d = 0.50.
(d)
2 4 6 8
f (kHz)
23.6 Half-range series 745
EXERCISES23.5
1 Find the Fourier seriesrepresentation ofthe function
{
0 −5 <t < 0 period10
f(t)=
1 0<t<5
2 Find the Fourier seriesrepresentation ofthe function
{
−t −π <t < 0 period2π
f(t)=
0 0<t<π
3 Find the Fourier seriesrepresentation ofthe function
f(t)=t 2 +πt −π<t <π period2π
4 Find the Fourier seriesrepresentation ofthe function
{
−4 −π <t 0 period 2π
f(t)=
4 0<t<π
5 Find the Fourier seriesrepresentation ofthe function
{
2(1+t) −1<t0 period2
f(t)=
0 0<t<1
6 Find the Fourier seriesrepresentation ofthe function
with period2π given by
{
t 2 0t<π
f(t)=
0 πt<2π
7 Find the Fourier seriesrepresentation ofthe function
f(t)=2sint 0<t <2π
period2π
8 Forthe signal in Engineering application 23.1, show
that
a n = 2A
nπ sin(nπd)
Solutions
1
2
3
1
2 + 2 πt
sin
π 5 + 2 3πt
sin
3π 5 + 2 5πt
sin
5π 5
+ 2
4
7πt
sin
7π 5 · · ·
π
4 − 2 π cost−sint+1 2 sin2t− 2
5
9π cos3t−1 3 sin3t...
π 2
3 +2πsint−4cost−πsin2t+cos2t+ 6
2π
3 sin3t − 4 9 cos3t···
8
(2sint sin5t+···)
+ 3 2 sin3t + 5 2
π
1
{
2 +2 (2/π)cosπt−sinπt− 1 2 sin2πt
+(2/9π)cos3πt − 1 3 sin3πt+···}/π
π 2
6 + π2 −4
sint−2cost− π π 2 sin2t + 1 2 cos2t
+ 9π2 −4
27π
sin3t − 2 9 cos3t+···
7 2sint
23.6 HALF-RANGESERIES
Sometimes an engineering function is not periodic but is only defined over a finite
interval, 0 < t < T say, as shown in Figure 23.20. In cases like this Fourier analysis
can still be useful. Because the region of interest is only that between t = 0
2
and
t = T we may choose to define the function arbitrarily outside the interval. In particular,
we can make our choice so that the resulting function is periodic, with periodT.
2
Thereismorethanonewaytoproceed.Forexample,wecanreflecttheabovefunction
in the vertical axis and then repeat it periodically so that the result is the periodic even
function shown in Figure 23.21. We have performed what is called a periodic extension
of the given function. Note that within the interval of interest nothing has altered
746 Chapter 23 Fourier series
y
y
0 T –2
t
0 T –2
t
Figure23.20
Function defined over interval0 <t < T 2 .
Figure23.21
An even periodic extension.
y
y
0 T –2
t
0 T –2
t
Figure23.22
An odd periodicextension.
Figure23.23
Aperiodic extension that is neither even nor odd.
but we have now achieved our objective of finding a periodic function. We can find
the Fourier series of this periodic function and within the interval of interest this will
converge to the required function. What happens outside this interval is not important.
Moreover, since the periodic function is even the Fourier series will contain no sine
terms.
An alternative periodic extension is that shown in Figure 23.22, which has been obtainedbyreflectinginboththeverticalandt
axesbeforerepeatingitperiodicallytogive
a periodic odd function. Its Fourier series will contain no cosine terms and within the
interval of interest will converge tothe function required.
AthirdalternativeperiodicextensionisshowninFigure23.23.However,thisextensionisneitheroddnorevenandsoithasnoneofthedesirablepropertiesoftheothertwo.
Whicheverextensionwechoose,theresultingFourierseriesonlygivesarepresentation
of the original function in the interval 0 < t < T and as such is termed a half-range
2
Fourier series. Similarly we have the terminology half-range sine series for a series
containingonlysinetermsandhalf-rangecosineseriesforaseriescontainingonlycosine
terms. The Fourier series formulae then simplify to give the following half-range
formulae:
Half-range sineseries:
a 0
= 0,a n
= 0
b n
= 4 T
∫ T/2
0
fornapositive integer
f(t)sin 2nπt
T
and f (t)isgiven by
∞∑
f(t)= b n
sin 2nπt
T
n=1
dt fornapositive integer (23.7)
23.6 Half-range series 747
Half-range cosine series:
a 0
= 4 T
a n
= 4 T
b n
= 0
∫ T/2
0
∫ T/2
and then f (t)isgiven by
0
f(t)dt (23.8)
f(t)cos 2nπt
T
fornapositive integer
f(t)= a ∞
0
2 + ∑
a n
cos 2nπt
T
n=1
dt fornapositive integer (23.9)
Example23.19 BydefininganappropriateperiodicextensionofthefunctionillustratedinFigure23.24,
find the half-range cosineseries representation.
Solution The function illustrated in Figure 23.24 is given by the formula f (t) =t for 0 <t < π
and is undefined outside this interval. Since the cosine series is required an even periodic
extension must be formed. This is illustrated in Figure 23.25. Taking T = 2π in
Equations(23.8) and (23.9),wefinda 0
anda n
.
∫ π
a 0
= 2 tdt
π 0
= 2 [ ] t
2 π
π 2
0
= π
∫ π
a n
= 2 tcosntdt
π 0
= 2 {[ ] tsinnt π ∫ π
}
sinnt
− dt
π n
0 0 n
[ cosnt
= 2 π
n 2 ] π
0
y
π
y
π
π
t
–2π
–π 0
π
2π
t
Figure23.24
Graph forExample 23.19.
Figure23.25
GraphforExample23.19.
748 Chapter 23 Fourier series
Now cosnπ = (−1) n , so that
a n
= 2 ( (−1)
n
− 1 )
π n 2 n 2
n=1,2,...
Of course, all theb n
are zero.Therefore the half-range cosine series is
f(t)= π 2 − 4 π cost− 4
9π cos3t...
and thisseries converges tothe given function within the interval 0 <t < π.
EXERCISES23.6
1 Graph anappropriate periodicextension of
f(t)=3t
0<t<π
andhence findits half-range cosineseries
representation.
2 Find the half-range sineseriesrepresentation ofthe
functiongiven in Example 23.19.
3 Find the half-range cosineseries representingthe
function
f(t)=sint
0<t<π
4 Graph anappropriateperiodicextension of
f(t)=e t
0<t<1
andfinditshalf-range cosine series.
5 Findthe half-rangesineseries representation of
f(t)=2−t,0t2.
Solutions
( )
3π
1
2 + 6 ∞∑ cosnπ−1
π n 2 cosnt
1
( )
∞∑ cosnπ
2 −2 sinnt
n
1
( )
2
3
π − 2 ∞∑ cosnπ+1
π n 2 cosnt
−1
2
( )
∞∑ ecosnπ−1
4 e−1+2
n 2 π 2 cosnπt
+1
1
5
4
π
∞∑
1
sin(nπt/2)
n
23.7 PARSEVAL’STHEOREM
Ifthefunction f (t)isperiodicwithperiodT andhasFouriercoefficientsa n
andb n
,then
Parseval’s theoremstates:
2
T
∫ T
0
(f(t)) 2 dt = 1 2 a2 0 + ∞
∑
n=1
( )
a
2
n +b2 n
Itis frequently usefulinpower calculations as the following example shows.
23.8 Complex notation 749
Engineeringapplication23.2
Averagepowerofasignal
Find the average power developed across a 1 resistor by a voltage signal with
period 2πgiven by
v(t) =cost − 1 3 sin2t + 1 2 cos3t
Solution
We note that v(t) is periodic with period T = 2π; v(t) is already expressed as a
Fourierserieswitha 1
= 1,a 3
= 1 2 andb 2 =−1 3 .AllotherFouriercoefficientsare0.
The instantaneous power is (v(t)) 2 and hence the average power over one period is
given by
∫ 2π
P av
= 1 (v(t)) 2 dt
2π 0
Therefore, using Parseval’s theorem wefind
P av
= 1 ( (
1 2 + − 1 2 ( 1 2 )
+ = 0.68 W
2 3)
2)
23.8 COMPLEXNOTATION
AnalternativenotationforFourierseriesinvolvingcomplexnumbersisavailablewhich
leadsnaturallyintothemoregeneraltopicofFouriertransforms.RecallfromChapter9
the Euler relations
e ±jθ =cosθ±jsinθ
fromwhich wecan obtain expressions forcosθ and sinθ:
cosθ = ejθ +e −jθ
2
sinθ = ejθ −e −jθ
2j
which enable us torewrite the Fourier representation
f(t)= a ∞
0
2 + ∑
(
a n
cos 2nπt +b
T n
sin 2nπt )
T
as
n=1
f(t)= a ∞
0
2 + ∑ e
(a j2nπt/T +e −j2nπt/T
n
2
n=1
= a ∞
0
2 + ∑
(
an −jb n
2
n=1
e j2nπt/T + a n +jb n
2
+b n
e j2nπt/T −e −j2nπt/T
2j
e −j2nπt/T )
)
750 Chapter 23 Fourier series
which wecan write equivalently as
∞∑
f(t)= c n
e j2nπt/T
n=−∞
where
c n
= a n −jb n
c
2 −n
= a n +jb n
n=1,2,...
2
andc 0
=a 0
/2. Itcan be shown thatthe Fourier coefficients,c n
, arethen given by
c n
= 1 T
∫ T/2
−T/2
f (t)e −j2nπt/T dt
The integral can also be evaluated over any complete period as convenient. Further, if
we writeT = 2π
ω 1
then thiscomplex formcan be expressed as
where
f(t)=
c n
= ω 1
2π
∞∑
n=−∞
∫ π/ω1
c n
e jnω 1 t
−π/ω 1
f (t)e −jnω 1 t dt
Example23.20 FindthecomplexFourierseriesrepresentationofthefunctionwithperiodT definedby
{
1 −T/4 <t <T/4
f(t)=
0 otherwise
Solution We find
c n
= 1 T
∫ T/4
−T/4
1e −j2nπt/T dt
= 1 T
[ e
−j2nπt/T
−j2nπ/T
] T/4
−T/4
Therefore,
= −1
2nπ j (e−jnπ/2 −e jnπ/2 )
= 1 ( ) e jnπ/2 −e −jnπ/2
nπ 2j
= 1
nπ sinnπ 2
f(t)=
∞∑
n=−∞
1
nπ sinnπ 2 ej2nπt/T
23.9 Frequency response of a linear system 751
The observant reader will note that the expressions forc n
appear invalid whenn = 0,
since the denominator is then zero. We can computec 0
in either of two ways. Using an
integral expression we see that
c 0
= 1 T
∫ T/4
−T/41dt = 1 2
Also,using a Taylor series expansion itispossible toshow
[ ]
1 e
−j2nπt/T T/4
lim
= 1
n→0T
−j2nπ/T
−T/4
2
giving a consistentresult.
EXERCISES23.8
1 Find the complex Fourier seriesrepresentation of
(a) f(t) =
{
1 0 <t<2
0 2<t<4
period 4
(b) f(t)=e t −1<t<1 period2
{
Asinωt 0<t <π/ω
(c) f(t) =
0 π/ω<t<2π/ω
period 2π/ω
2 If
∞∑
f(t)= c n e j2nπt/T
−∞
show thatthe coefficients,c n ,are given by
c n = 1 ∫ T
f (t)e −j2nπt/T dt
T 0
[Hint: multiply both sidesbye −j2mπt/T and integrate
over [0,T].]
Solutions
1 (a)
(b)
∞∑ j
(cosnπ −1)ejnπt/2
2nπ
−∞
(
)
1
∞∑ e −jnπ+1 −e −1+jnπ
e jnπt
2 1−jnπ
−∞
(c)
∞∑ A(1 +e −jnπ )e jnωt
2π(1 −n
−∞
2 )
23.9 FREQUENCYRESPONSEOFALINEARSYSTEM
Linear systems have the property that the response to several inputs being applied to
thesystemcanbeobtainedbyaddingtheeffectsoftheindividualinputs.Anotheruseful
property of linear systems is that if a sinusoidal input is applied to the system then the
output will also be a sinusoid of the same frequency but with modified amplitude and
phase.This isillustratedinFigure 23.26.
InSection9.7wesawthatsinusoidalsignalscanberepresentedbycomplexnumbers
and that an a.c. electrical circuit can be analysed using complex numbers. This is true
752 Chapter 23 Fourier series
A o
f
v
A i
t
Linear
system
t
Figure23.26
The response ofalinear systemto a sinusoidalinput is alsosinusoidal.
for linear systems in general. It is possible to define a complex frequency function,
G(jω), where ω is the frequency of the input; G relates the output and the input of a
linear system.
If a sinewave ofamplitudeA i
isapplied tothe systemthen the amplitude,A o
, of the
output isgiven by
A o
= |G(jω)|A i
The phase shift, φ, isgiven by
φ = ̸
G(jω)
Note that A o
and φ depend upon ω. It is important to note that G(jω) is a frequencydependent
function. Although the notation for G(jω) may seem slightly odd it arises
becauseonemethodofobtainingthefrequencyfunctionforalinearsystemistosubstitutes
= jω inthe Laplace transform transferfunction,G(s), of the system.
Itisnowpossibletoanalysetheeffectofapplyingageneralizedperiodicwaveformto
alinearsystem.ThefirststageistocalculatetheFouriercomponentsoftheinputwaveform.Theamplitudeandphaseshiftofeachoftheoutputcomponentsisthencalculated
usingG(jω). Finally, the output components are added to obtain the output waveform.
This is only possible because of the additive nature of linear systems. An example will
help toclarifythese points.
Engineeringapplication23.3
Analoguelow-passfilter
Recall from Engineering application 22.2 that a low-pass filter is a filter that has a
tendencytoattenuatehighfrequencysignals.Ifthefilterconsistsofelectroniccomponentsandoperatesonanaloguesignalsdirectlythenitisknownasananaloguefilter.Asimplecircuitthatactsasananaloguelow-passfilterisshowninFigure23.27.
Derive the frequency response for this circuit and draw graphs of its amplitude and
phase characteristics.
Consider the circuit of Figure 23.27. Using Kirchhoff’s voltage law and Ohm’s
lawweobtain
v i
=iR+v o
For the capacitor,
v o
=
i
jωC
23.9 Frequency response of a linear system 753
Eliminatingiyields
v i
= v o
jωCR+v o
= v o
(1+jωRC)
v o
v i
=
1
1 +jωRC
(23.10)
Equation (23.10) relates the output of the system to the input of the system.
Therefore,
G(jω) =
1
1 +jωRC
Itis convenient toconvertG(jω) into polar form:
1̸ 0
G(jω) = √
1 + (ωRC)
2̸
=
1
√ ̸
1 + (ωRC)
2
tan −1 ωRC
−tan −1 ωRC
Therefore,
|G(jω)| =
1
√
1+ (ωRC)
2
(23.11)
̸ G(jω) = −tan −1 ωRC (23.12)
The amplitude and phase characteristics for the circuit of Figure 23.27 are shown in
Figure 23.28. These show the variation of |G(jω)| and ̸ G(jω) with angular
frequency ω.
|G(jv)|
1
0.707
0
1/RC
v
R
G(jv)
1
v
y i
Figure23.27
Circuit forEngineering
application 23.3.
i C y o
– p –4
– p –2
Figure23.28
Amplitude andphase characteristics forthe circuit
ofFigure23.27.
Note that the circuit is a low-pass filter; it allows low frequencies to pass easily
and rejects high frequencies. The cut-off point of the filter, that is the point at
which significant frequency attenuation begins to occur, can be varied by changing ➔
754 Chapter 23 Fourier series
thevaluesofRandC.ThequantityRC isusuallyknownasthetimeconstantforthe
system.Consider the case whenRC = 0.3. Equations (23.11) and (23.12) reduce to
|G(jω)| =
1
√
1 +0.09ω
2
(23.13)
̸ G(jω) = −tan −1 0.3ω (23.14)
Letusexaminetheresponseofthissystemtoasquarewaveinputwithfundamental
angular frequency 1 and amplitude 1. This waveform is shown in Figure 23.29(a).
We note that T = 2π. The waveform function is odd and so will not contain any
cosine Fourier components. It has an average value of 0 and so will not have a zero
frequencycomponent;thatis,therewillbenod.c.component.Thereforecalculating
the Fourier components reduces toevaluating
f(t)=
∞∑
n=1
b n
sin 2πnt
T
b n
= 2 T
∫ T/2
−T/2
f(t)sin 2πnt
T
dt
forpositive integersn
SinceT = 2π, wefind
b n
= 1 (∫ 0 ∫ π
)
−1sinntdt + sinntdt
π −π 0
= 1 ([ [ ] cosnt −cosnt π )
+
π n n
0
] 0
−π
= 1 (cos0 −cosπn −cosπn +cos0)
nπ
= 1 (2 −2cosπn)
nπ
= 2 (1 −cosπn)
nπ
y i
y o
t
t
(a)
Figure23.29
(a) Input to low-pass filter;(b)outputfrom low-pass filter.
(b)
̸
̸
̸
Review exercises 23 755
The values of the first few coefficients are
b 1
= 2 π (1−cosπ)= 4 π
b 2
= 2
2π (1−cos2π)=0
b 3
= 2 4
(1 −cos3π) =
3π 3π
The next stage is toevaluate the gain and phase changes of the Fourier components.
Using Equations (23.13) and (23.14):
n = 1
ω 1
=1
|G(jω 1
)| =
1
√
1+0.09×1
= 0.96
n = 3
n = 5
G(jω 1
) = −tan −1 0.3 = −16.7 ◦
ω 3
=3
|G(jω 3
)| =
1
√
1+0.09×9
= 0.74
G(jω 3
) = −tan −1 0.9 = −42.0 ◦
ω 5
=5
|G(jω 5
)| =
1
√
1+0.09×25
= 0.55
G(jω 5
) = −tan −1 1.5 = −56.3 ◦
It is clear that high-frequency Fourier components are attenuated and phase shifted
more than low-frequency Fourier components. The effect is to produce a rounding
of the rising and falling edges of the square wave input signal. This is illustrated
in Figure 23.29(b). The output signal has been obtained by adding together the
attenuated and phase-shifted output Fourier components. This is possible because
the systemislinear.
REVIEWEXERCISES23
1 Find the half-range Fourier sineseries representation
off(t)=tsint,0tπ.
2 Find the half-range sineseries representationof
f(t)=cos2t,0tπ.
3 Find (a) the half-range sineseries,and(b)the
half-rangecosine seriesrepresentation ofthe function
defined in the interval [0, τ]by
⎧
4t
⎪⎨ τ
f(t)= ( )
4
⎪⎩ 1 − t 3 τ
0t τ 4
τ
4 tτ
4 Find the Fourier series representation ofthe function
with periodT defined by
{
V(constant) |t| <T/6
f(t)=
0 T/6|t|T/2
756 Chapter 23 Fourier series
5 Theoutput from ahalf-wave rectifier isgiven by
{
Isinωt 0<t <T/2
i(t) =
0 T/2<t<T
and isperiodic with periodT = 2π ω .
Find its Fourier seriesrepresentation.
6 Find the complex Fourier series representation ofthe
functionwith periodT = 0.02defined by
{
V(constant) 0 t < 0.01
v(t) =
0 0.01 t < 0.02
7 Find the Fourier seriesrepresentation ofthe function
with period8given by
{
2 −t 0<t<4
f(t)=
t−6 4<t<8
8 Ther.m.s.voltage, v r.m.s. ,ofaperiodicwaveform,
v(t),with periodT, isgiven by
√
∫
1 T
v r.m.s. = (v(t))
T
2 dt
0
If v(t)has Fourier coefficientsa n andb n show, using
Parseval’stheorem,that
√
1
v r.m.s. =
4 a2 0 + 1 ∞∑
(an 2 2
+b2 n )
n=1
9 If f (t)has Fourier series
f(t)= a ∞ (
0
2 + ∑
a n cos 2nπt
T
n=1
)
+b n sin 2nπt
T
prove Parseval’stheorem.
[Hint: multiply both sidesby f (t)to obtain
(f(t)) 2 = a 0 f(t)
(
∞∑
+ a
2 n f(t)cos 2nπt
T
n=1
)
+b n f(t)sin 2nπt
T
and integrate both sidesover the interval[0,T]using
Equations(23.3)--(23.5).]
10 Findthe half-rangecosine seriesandthe half-range
sineseriesforthe function
f(t)=sinhπt 0<t <1
Solutions
1
2
π
∞ 2 − ∑ 4n(1 +cosnπ)
π(n +1) 2 (n −1) 2 sin(nt)
2
2
π
3 (a)
∞∑
1
n(1 −cosnπ)
(n 2 −4)
32 ∑ ∞
3π 2
1
sin(nt)
sin(nπ/4)sin(nπt/τ)
n 2
5
6
7
I
π + I 2 sin ωt − I π
v
2π
∞∑
−∞
8 ∑ ∞
π 2
1
j cosnπ−1
n
1−cosnπ
n 2
∞∑
2
cosnπ+1
n 2 −1
e 100nπjt
cos
(
)
nπt
4
cosnωt
4
(b)
1
2 + 8 π 2
×
v
3 + 2v π
∞∑
1
∞∑
1
4cos(nπ/4) −cosnπ −3
3n 2
sin(nπ/3)cos(2nπt/T)
n
cos
(
)
nπt
τ
10 cosineseries: 1 (coshπ −1)
π
+
∞∑
n=1
2
π(1+n 2 ) ((−1)n coshπ −1)cosnπt
sineseries:
∞∑ 2n
−
π(1+n 2 ) (−1)n sinh πsinnπt
n=1
24 TheFouriertransform
Contents 24.1 Introduction 757
24.2 TheFouriertransform--definitions 758
24.3 SomepropertiesoftheFouriertransform 761
24.4 Spectra 766
24.5 Thet−ω dualityprinciple 768
24.6 Fouriertransformsofsomespecialfunctions 770
24.7 TherelationshipbetweentheFouriertransformandthe
Laplacetransform 772
24.8 Convolutionandcorrelation 774
24.9 ThediscreteFouriertransform 783
24.10 Derivationofthed.f.t. 787
24.11 Usingthed.f.t.toestimateaFouriertransform 790
24.12 Matrixrepresentationofthed.f.t. 792
24.13 Somepropertiesofthed.f.t. 793
24.14 Thediscretecosinetransform 795
24.15 Discreteconvolutionandcorrelation 801
Reviewexercises24 821
24.1 INTRODUCTION
Wehaveseenthatalmostanyperiodicsignalcanberepresentedasalinearcombination
ofsineandcosinewavesofvariousfrequenciesandamplitudes.Allfrequenciesareintegermultiplesofthefundamental.However,manypracticalwaveformsarenotperiodic.
Examples are pulse signals and noise signals. The function shown in Figure 24.1 is an
example of a non-periodic signal.
758 Chapter 24 The Fourier transform
WeshallnowseehowFouriertechniquescanstillbeusefulbyintroducingtheFourier
transformwhichisusedextensivelyincommunicationsengineeringandsignalprocessing.Forexample,itcanbeusedtoanalysetheprocessesof
modulation,whichinvolves
superimposinganaudiosignalontoacarriersignal,anddemodulation,whichinvolves
removing the carrier signaltoleave the audio signal.
24.2 THEFOURIERTRANSFORM--DEFINITIONS
f(t)
1
0
a
Figure24.1
Anon-periodic
function.
b
t
Under certain conditions it can be shown that a non-periodic function, f (t), can be
expressed notasthe sum ofsineand cosinewaves but asanintegral. Inparticular,
where
f(t)=
Provided
∫ ∞
0
A(ω) = 1 π
A(ω)cosωt +B(ω)sinωtdω (24.1)
∫ ∞
−∞
f(t)cosωtdt and B(ω) = 1 π
∫ ∞
(1) f(t)andf ′ (t) are piecewise continuous inevery finiteinterval, and
(2) ∫ ∞
|f (t)|dt exists
−∞
−∞
f (t)sinωtdt (24.2)
then the above Fourier integral representation of f (t) holds. At a point of discontinuity
of f (t) the integral representation converges to the average value of the right- and lefthand
limits. As with Fourier series, an equivalent complex representation exists which
is, infact, more commonly used:
Fourier integral representationof f (t):
where
f(t)= 1
2π
F(ω) =
∫ ∞
∫ ∞
−∞
−∞
F(ω)e jωt dω (24.3)
f (t)e −jωt dt (24.4)
Thereisnouniversalconventionconcerningthedefinitionoftheseintegralsandanumber
of variants are still correct. For instance, some authors write the factor in the
1
2π
1
second integral rather than the first, while others place a factor √ in both, giving
2π
some symmetry to the equations. There is also variation in the location of the factors
e −jωt ande jωt .Weshallusedefinitions(24.3)and(24.4)throughoutbutitisimportantto
be aware of possible differences when consulting other texts.
Equations (24.3) and (24.4) form what is called a Fourier transform pair. The
Fourier transform of f (t) is F(ω) which is sometimes written F{f (t)}. Similarly
f (t) in Equation (24.3) is the inverse Fourier transform of F(ω), usually denoted
F −1 {F(ω)}.
24.2 The Fourier transform -- definitions 759
The Fourier transform of f (t)is defined tobe
F{f(t)}=F(ω) =
∫ ∞
−∞
f (t)e −jωt dt
YouwillalsonotethesimilaritybetweenEquation(24.4)andthedefinitionoftheLaplace
transformof f (t):
L{f(t)} =
∫ ∞
0
f (t)e −st dt (24.5)
We see that, apart from the limits of integration, the substitution jω = s in Equation(24.4)resultsintheLaplacetransforminEquation(24.5).ThereisindeedanimportantrelationshipbetweenthetwotransformswhichweshalldiscussinSection24.7.We
notethatEquation(24.3)providesaformulafortheinverseFouriertransformofF(ω),
although the integral isfrequently difficulttoevaluate.
Example24.1 Find the Fourier transform of the function f (t) = u(t)e −t , whereu(t) is the unit step
function.
f(t)
1
Figure24.2
Graph ofu(t)e −t .
Solution The functionu(t)e −t is shown in Figure 24.2. Using Equation (24.4), its Fourier transformis
given by
t
thatis,
F(ω) =
=
=
∫ ∞
−∞
∫ ∞
0
∫ ∞
0
f (t)e −jωt dt
e −t e −jωt dt since f (t) = 0 fort < 0
e −(1+jω)t dt
[ e
−(1+jω)t ∞
=
−(1+jω)]
0
= 1
1+jω
F(ω) = 1
1+jω
sincee −(1+jω)t → 0ast → ∞
Example24.2 Use Equation (24.3) to find the Fourier integral representation of the function defined
by
{ 1 −1t1
f(t)=
0 |t|>1
Solution UsingEquation (24.3)wefind
f(t)= 1
2π
∫ ∞
−∞
F(ω)e jωt dω
760 Chapter 24 The Fourier transform
where
F(ω) =
=
∫ ∞
−∞
∫ 1
−1
f (t)e −jωt dt
1e −jωt dt
[ ] e
−jωt 1
=
−jω
−1
= e−jω −e jω
−jω
= ejω −e −jω
jω
Using Euler’s relation (Section 9.6)
sinθ = ejθ −e −jθ
2j
we find
F(ω) = 2sinω
ω
so that
f(t)= 1 ∫ ∞
2sinω
2π ω
−∞
since f (t)iszero outside [−1,1]
ejωt dω
istherequiredintegralrepresentation.NotethatF(ω) = 2sinω istheFouriertransform
ω
of f (t).Thefunction sin ω
ω
occursfrequentlyandisoftenreferredtoasthesincfunction
(see Section 3.5).
As with Laplace transforms, tables have been compiled for reference. Such a table of
common transforms appearsinTable 24.1.
EXERCISES24.2
1 Find the Fourier transformsof
{ 1/4 |t| 3
(a)f(t) =
0 |t|>3
⎧
1 − t 0t2
⎪⎨ 2
(b)f(t) =
1 + t −2t0
⎪⎩ 2
0 otherwise
{ e −αt t0 α>0
(c)f(t) =
e αt t<0
{ e
(d)f(t) =
−t cost t 0
0 t<0
(e) f (t) =u(t)e −t/τ
where τ isaconstant
2 Find
(a) the Fourier transform, and
(b) the Laplace transformof
f(t) =u(t)e −αt α >0
Show thatmakingthe substitutions = jω in the
Laplacetransform of f resultsin the Fourier
transform.
{ 1 |t|2
3 Iff(t)=
0 otherwise
andg(t) = e jt ,find F{f (t)g(t)}.
24.3 Some properties of the Fourier transform 761
Table24.1
CommonFourier transforms.
f(t)
f(t) =Au(t)e −αt ,α >0
{ 1 −αtα
f(t)=
0 otherwise
F(ω)
A
α+jω
2sinωα
ω
f(t) =A constant 2πAδ(ω)
( )
f(t) =u(t)A A πδ(ω)− j ω
f(t) = δ(t) 1
f(t) = δ(t −a)
f(t) =cosat
f(t) =sinat
f(t) =e −α|t| ,α>0
{ 1 t>0
f(t) =sgn(t) =
−1 t<0
f(t)= 1 t
e −jωa
π(δ(ω +a)+δ(ω −a))
π
(δ(ω −a)−δ(ω +a))
j
2α
α 2 +ω 2
2
jω
−jπsgn(ω)
f(t) =e −at2 √ π
a e−ω2 /4a
Solutions
1 (a)
(d)
sin3ω
(b)
2ω
1+jω
(1+jω) 2 +1
1−cos2ω
ω 2
τ
(e)
1+jωτ
(c)
2α
α 2 +ω 2
2 (a)
3
1
α+jω
2sin2(1−ω)
1 − ω
(b)
1
s + α
24.3 SOMEPROPERTIESOFTHEFOURIERTRANSFORM
A number of the properties of Laplace transforms that we have already discussed hold
forFourier transforms. We considerlinearityand two shifttheorems.
24.3.1 Linearity
If f andgarefunctions oft andkisaconstant, then
F{f +g}=F{f}+F{g}
F{kf}=kF{f}
762 Chapter 24 The Fourier transform
Both of these properties follow directly from the definition and linearity properties of
integrals, and mean that F isalinear operator.
Example24.3 Find F{u(t)e −t +u(t)e −2t }.
Solution We sawinExample 24.1 that
F{u(t)e −t } = 1
1+jω
Furthermore,
F{u(t)e −2t } =
Therefore,
=
∫ ∞
−∞
∫ ∞
0
u(t)e −2t e −jωt dt
e −(2+jω)t dt
[ e
−(2+jω)t ∞
=
−(2+jω)]
0
= 1
2+jω
F{u(t)e −t +u(t)e −2t } = 1
1+jω + 1
2+jω
by linearity
= 2+jω+1+jω
(1+jω)(2+jω)
=
3+2jω
2−ω 2 +3jω
24.3.2 Firstshifttheorem
IfF(ω)isthe Fourier transform of f (t),then
F{e jat f (t)} =F(ω −a)
whereaisaconstant
Example24.4 (a) Show thatthe Fourier transform of
{ 3 −2t2
f(t)=
0 otherwise
isgivenbyF(ω) = 6sin2ω .
ω
(b) Usethe first shift theorem tofind the Fourier transform of e −jt f (t).
(c) VerifythefirstshifttheorembyobtainingtheFouriertransformofe −jt f (t)directly.
∫ 2
[ ] e
Solution (a) F(ω)=3 e −jωt −jωt 2
dt = 3
−2 −jω
−2
( e −2jω −e 2jω )
=3
−jω
( e 2jω −e −2jω )
=6
2jω
= 6 ω sin2ω
24.3 Some properties of the Fourier transform 763
(b) Wehave F{f(t)} =F(ω) = 6sin2ω . Using the first shift theorem witha = −1
ω
we have
(c) e −jt f (t) =
F{e −jt f(t)} =F(ω+1) = 6 sin2(ω +1)
ω +1
{
3e
−jt
−2t2
0 otherwise
So toevaluate its Fourier transformdirectly wemustfind
F{e −jt f(t)} =3
as required.
∫ 2
−2
e −jt e −jωt dt
∫ 2
= 3 e −(1+ω)jt dt
−2
[ e
−(1+ω)jt 2
= 3
−j(1+ω)]
−2
= 6
1 + ω
( ) e 2(1+ω)j −e −2(1+ω)j
2j
= 6
1 + ω sin2(1+ω)
Example24.5 UsethefirstshifttheoremtofindthefunctionwhoseFouriertransformis
given that F{u(t)e −mt } =
Solution Fromthe given resultwe have
Now
1
m+jω ,m>0.
F{u(t)e −3t } = 1
3+jω =F(ω)
1
3+j(ω−2) =F(ω−2)
1
3+j(ω−2) ,
764 Chapter 24 The Fourier transform
Therefore, fromthe first shift theorem witha = 2 wehave
F{e 2jt u(t)e −3t } =
1
3+j(ω−2)
Consequently the function whose Fourier transform is
1
3+j(ω−2) isu(t)e−(3−2j)t .
Example24.6 Findthe Fourier transformof
{ e
−3t
t0
f(t)=
e 3t t<0
Deducethe functionwhoseFourier transformisG(ω) =
∫ ∞
Solution F(ω) = f (t)e −jωt dt
−∞
Now
=
=
∫ 0
−∞
∫ 0
−∞
e 3t e −jωt dt +
e (3−jω)t dt +
∫ ∞
0
∫ ∞
0
e −3t e −jωt dt
e −(3+jω)t dt
[ e
(3−jω)t 0 [ e
−(3+jω)t ∞
= +
3−jω]
−∞
−(3+jω)]
0
= 1
3−jω + 1
3+jω
= 6
9+ω 2
G(ω) =
6
10+2ω+ω = 6
2 (ω+1) 2 +9 =F(ω+1)
6
10+2ω+ω 2.
Then, using the first shift theorem F(ω + 1) will be F{e −jt f (t)}; that is, the required
function is
{ e
(−3−j)t
t 0
g(t) =
e (3−j)t t < 0
24.3.3 Secondshifttheorem
IfF(ω)isthe Fourier transform of f (t)then
F{f(t −α)} =e −jαω F(ω)
24.3 Some properties of the Fourier transform 765
Example24.7 Giventhatwhen f (t) =
{ 1 |t|1
0 |t|>1
tofind the Fourier transform of
g(t) =
{ 1 1t3
0 otherwise
Verifyyour resultdirectly.
2sinω
,F(ω) =
ω
,applythesecondshifttheorem
g(t)
1
1 2 3
t
Figure24.3
The function
{ 1 1t3
g(t) =
0 otherwise.
Solution The functiong(t) is depicted in Figure 24.3. Clearlyg(t) is the function f (t)translated
2unitstotheright,thatisg(t) = f (t−2).NowF(ω) = 2sinω istheFouriertransform
ω
of f (t).Therefore, by the second shift theorem
F{g(t)} = F{f(t −2)} =e −2jω F(ω) = 2e−2jω sinω
ω
To verifythis resultdirectly wemustevaluate
∫ ∞ ∫ 3
[ ] e
F{g(t)} = g(t)e −jωt dt = e −jωt −jωt 3
dt =
−∞ 1 −jω
1
as required.
= e−3jω −e −jω
−jω
= 2e−2jω sinω
ω
= e −2jω ( e −jω −e jω
−jω
)
EXERCISES24.3
1 Provethe first shift theorem.
2 Find the Fourier transform of
{
1−t 2 |t|1
f(t)=
0 |t|>1
Usethe first shift theoremto deduce the Fourier
transformsof
{
e
(a)g(t) =
3jt (1−t 2 ) |t|1
0 |t|>1
{
e
(b)h(t) =
−t (1−t 2 ) |t|1
0 |t|>1
3 Find the inverse Fourier transformsof
(a)
(b)
1
(ω+7)j+1
2
1+2(ω−1)j
4 Prove the second shift theorem.
766 Chapter 24 The Fourier transform
5 Given F{u(t)e −t } = 1 ,use the second shift
1+jω
theorem to find
F{u(t +4)e −(t+4) }
Verify your result bydirect integration.
6 Find,usingthe secondshift theorem,
{ }
F −1 6e −4jωsin2ω
ω
Solutions
2
4cosω
−ω 2 + 4sinω
ω 3
(a)
(b)
−4cos(ω −3) 4sin(ω −3)
(ω −3) 2 +
(ω −3) 3
−4cos(ω −j) 4sin(ω −j)
(ω −j) 2 +
3 (a) u(t)e −t e −7jt (b) e jt u(t)e −t/2
e 4jω
5
1+jω
6 3 for2 t 6,0otherwise
(ω −j) 3
24.4 SPECTRA
IntheFourieranalysisofperiodicwaveformswestatedthatalthoughawaveformphysicallyexistsinthetime(orspatial)domainitcanberegardedascomprisingcomponents
with a variety of temporal (or spatial) frequencies. The amplitude and phase of these
components are obtained from the Fourier coefficients a n
and b n
. This is known as a
frequency domain description. Plots of amplitude against frequency and phase against
frequency are together known as the spectrum of a waveform. Periodic functions have
discrete or line spectra; that is, the spectra assume non-zero values only at certain frequencies.
Only a discrete set of frequencies is required to synthesize a periodic waveform.Ontheotherhandwhenanalysingnon-periodicphenomenaviaFouriertransform
techniqueswefindthat,ingeneral,acontinuousrangeoffrequenciesisrequired.Instead
ofdiscretespectrawehavecontinuousspectra.ThemodulusoftheFouriertransform,
|F(ω)|,givesthespectrumamplitudewhileitsargumentarg(F(ω))describesthespectrum
phase.
Example24.8 InExample 24.2 the Fourier transformof
f(t)=
{ 1 |t|1
0 |t|>1
wasfoundtobeF(ω) = 2sinω . Sketch the spectrum of f (t).
ω
Solution F(ω) is purely real. The spectrum of f (t) is depicted by plotting |F(ω)| against ω as
sinω
illustrated in Figure 24.4. Note that lim = 1. (See Review exercises in
ω→0 ω
Chapter 18.)
24.4 Spectra 767
uF(v)u
2
–4p –3p –2p –p p 2p 3p 4p
v
Figure24.4 { 1 |t|1
Spectrum of f (t) =
0 |t|>1.
Engineeringapplication24.1
Amplitudemodulation
Amplitudemodulationisatechniquethatallowsaudiosignalstobetransmittedas
electromagnetic radio waves. The maximum frequency of audio signals is typically
10 kHz. If these signals were to be transmitted directly then it would be necessary
to use a very large antenna. This can be seen by calculating the wavelength of an
electromagnetic wave offrequency 10 kHzusingthe formula
c=fλ
Herecisthevelocityofanelectromagneticwaveinavacuum (3 ×10 8 ms −1 ), f is
the frequency of the wave and λ is its wavelength, and hence λ = c = 30000 m.
f
It can be shown that an antenna must have dimensions of at least one-quarter of the
wavelengthofthesignalbeingtransmittedifitistobereasonablyefficient.Clearlya
verylargeantennawouldbeneededtotransmita10kHzsignaldirectly.Thesolution
istohaveacarriersignalofamuchhigherfrequencythantheaudiosignalwhichis
usuallytermedthemodulationsignal.Thisallowstheantennatobeareasonablesize
as a higher frequency signal has a smaller wavelength. The arrangement for mixing
the two signals isshown inFigure 24.5.
x(t)
3
x(t)cosv c t
Antenna
cosv c t
Figure24.5
Amplitudemodulation.
Let us now derive an expression for the frequency spectrum of an amplitudemodulated
signalgiven by
φ(t) =x(t)cosω c
t
where ω c
is the angular frequency of the carrier signal and x(t) is the modulation
signal. Now φ(t) =x(t)cos ω c
t can bewritten as
φ(t) =x(t) ejω c t +e −jω c t
2
➔
768 Chapter 24 The Fourier transform
uX(v)u
uF(v)u
(a)
v
(b)
–v c
Figure24.6
Amplitude modulation. (a)Spectrum ofthe modulation signal; (b)spectrum ofthe
amplitude-modulated signal.
Taking the Fourier transformand using the first shifttheorem yields
{ x(t)(e
jω t c +e −jω c t }
)
F{φ(t)} = (ω) = F
2
{ e
jω t } {
c x(t) e
−jω t }
c x(t)
= F + F
2 2
v c
v
= 1 2 (X(ω−ω c )+X(ω+ω c ))
whereX(ω) = F{x(t)}, the frequency spectrum of the modulation signal.
Let us consider the case where the frequency spectrum, |X(ω)|, has the profile
shown in Figure 24.6(a). The frequency spectrum of the amplitude-modulated signal,
|(ω)|, is shown in Figure 24.6(b). All of the frequencies of the amplitudemodulatedsignalaremuchhigherthanthefrequenciesofthemodulationsignalthus
allowing a much smaller antenna to be used to transmit the signal. This method of
amplitude modulation is known as suppressed carrier amplitude modulation because
the carrier signal is modulated to its full depth and so the spectrum of the
amplitude-modulated signalhas no identifiable carrier component.
EXERCISES24.4
1 Show thatthe Fourier transformofthe pulse
{ t+1 −1<t<0
f(t)=
1−t 0<t<1
can be written as
Plotagraph ofthe spectrum of f (t)for
−2π ω 2π.Writedown an integral expression
forthe pulse whichwould result ifthe signal f (t)
were passed through a filterwhicheliminatesall
angularfrequenciesgreaterthan2π.
Solutions
F(ω) = 2 ω 2 (1−cosω)
1
∫
1 2π
2(1−cosω) ejωt
2π −2π ω 2 dω
24.5 THEt−ω DUALITYPRINCIPLE
We have, fromthe definition ofthe Fourier integral,
f(t)= 1 ∫ ∞
F(ω)e jωt dω (24.6)
2π
−∞
The t−ω duality principle 769
f(t)
1
F(v)
2 sint
——— t
2pf(v)
2p
–1 1
t
v
t
–1 1
v
Figure24.7
Illustrating thet--ω duality principle.
Figure24.8
Illustrating thet--ω duality principle.
where
F(ω) =
∫ ∞
−∞
f (t)e −jωt dt (24.7)
is the Fourier transform of f (t). In Equation (24.6), ω is a dummy variable so, for example,
Equation (24.6) could be equivalently written as
f(t)= 1
2π
∫ ∞
−∞
F(z)e jzt dz (24.8)
Then, from Equation (24.8),replacingt by −ω wefind
f(−ω) = 1
2π
∫ ∞
−∞
F(z)e −jωz dz = 1
2π
∫ ∞
−∞
F(t)e −jωt dt
which werecognize as 1 timesthe Fourier transformofF(t).
2π
We have the following result:
IfF(ω)isthe Fourier transformof f (t)then
f(−ω)is 1 ×(theFourier transform ofF(t))
2π
which isknown as thet--ω duality principle.
We have seen inExample 24.2 that if
{ 1 |t|1
f(t)=
0 |t|>1
thenF(ω) = 2sinω
ω
.ThisisdepictedinFigure24.7.Fromthedualityprinciplewecan
immediately deduce that
{ } 2sint
F =2πf(−ω)=2πf(ω)
t
since f is an even function (Figure 24.8). Unfortunately it is very difficult to verify this
result in most cases because while one of the integrals is relatively straightforward to
evaluate, the other is usually very difficult. However, we can use the result to derive a
number of new Fourier transforms.
Example24.9 GiventhattheFouriertransformofu(t)e −t is
the transform of
1
1+jt .
1
1+jω usethedualityprincipletodeduce
770 Chapter 24 The Fourier transform
1
Solution We know F(ω) =
1+jω is the Fourier transform of f (t) = u(t)e−t . Therefore
2π(u(−ω)e ω 1
)is the Fourier transform of
1+jt .
24.6 FOURIERTRANSFORMSOFSOMESPECIALFUNCTIONS
WesawinSection24.4thattheFouriertransformtellsusthefrequencycontentofasignal.IfweweretofindtheFouriertransformofasignalcomposedofonlyonefrequency
component, for example f (t) = sint, we would hope that the exercise of finding the
Fourier transform would resultinaspectrum containing thatsinglefrequency.
Unfortunately if we try to find the Fourier transform of, say, f (t) = sint problems
arisesince the integral
∫ ∞
−∞
sinte −jωt dt
cannot be evaluated in the usual sense because sint oscillates indefinitely as |t| → ∞.
In particular, Condition (2) of Section 24.2 fails since ∫ ∞
|sint|dt diverges. There are
−∞
many other functions which give rise to similar difficulties, for instance the unit step
function, polynomials and so on. All these functions fail to have a Fourier transform
in its usual sense. However, by making use of the delta function it is possible to make
progress even with functions like these.
24.6.1 TheFouriertransformof δ(t −a)
Example24.10 Usethe propertiesofthe deltafunction todeduceits Fourier transform.
Solution Bydefinition
F{δ(t −a)} =
∫ ∞
−∞
δ(t −a)e −jωt dt
Next, recall the following propertyofthe delta function
∫ ∞
−∞
f(t)δ(t −a)dt = f(a) (24.9)
foranyreasonablywell-behavedfunction f (t).UsingEquation(24.9)with f (t) = e −jωt
wehave
F{δ(t −a)} =
∫ ∞
−∞
e −jωt δ(t −a)dt = e −jωa
In particular, ifa = 0 wehave F{δ(t)} = 1.This resultisdepictedinFigure 24.9.
24.6 Fourier transforms of some special functions 771
f(t)
1
F(v)
1
F(t)
1
2pd(v)
2p
t
v
t
v
Figure24.9
F{δ(t)} =1.
Figure24.10
F{1} = 2πδ(ω).
Example24.11 Apply thet--ω dualityprinciple tothe previous result.Interpretthe resultphysically.
Solution We have f (t) = δ(t) andF(ω) = 1.Thedualityprinciple tells us that
f (−ω) = δ(−ω) whichequals 1
2π F{1}
thatis,
F{1} =2πδ(ω)
(since δ(−ω) = δ(ω)). This is illustrated in Figure 24.10. PhysicallyF(t) = 1 can be
regardedasad.c.waveform.Thisresultconfirmsthatad.c.signalhasonlyonefrequency
component, namely zero.
Example24.12 Given that F{δ(t −a)} = e −jωa find F{e −jta }.
Solution We have f (t) = δ(t −a),F(ω) = e −jωa . Applyingthet--ω duality principle wefind
f(−ω) = δ(−ω −a) = 1
2π F{e−jta }
Therefore
F{e −jta } = 2πδ(−ω −a)
= 2πδ(−(ω +a))
=2πδ(ω +a)
since δ(ω)isan even function.
24.6.2 Fouriertransformsofsomeperiodicfunctions
From Example 24.12 we have F{e −jta } = 2πδ(ω +a) and also, replacing a by −a,
F{e jta } = 2πδ(ω −a). Addingthesetwo expressions wefind
F{e −jta }+ F{e jta } =2π(δ(ω +a)+δ(ω −a))
Recallingthe linearityproperties of F wecan write
F{e −jta +e jta } =2π(δ(ω +a)+δ(ω −a))
and usingEuler’srelations wefind
F{cosat} = π(δ(ω +a)+δ(ω −a))
772 Chapter 24 The Fourier transform
f(t) = cos at
F(v)
p
t
–a a
v
Figure24.11
The spectrum ofcosat.
We see that the spectrum of cosat consists of single lines at ω = ±a corresponding to
a singlefrequency component (Figure 24.11).
Example24.13 Find F{sinat}.
Solution Subtractingthepreviousexpressionsfor F{e jta }and F{e −jta }andusingEuler’srelations
wefind
F{e jta }− F{e −jta } =2π(δ(ω −a)−δ(ω +a))
that is,
{ e jta −e −jta }
F = π (δ(ω −a)−δ(ω +a))
2j j
so that
F{sinat} = π (δ(ω −a)−δ(ω +a))
j
24.7 THERELATIONSHIPBETWEENTHEFOURIER
TRANSFORMANDTHELAPLACETRANSFORM
Wehavealreadynoted(Section24.2)thesimilaritybetweentheLaplacetransformand
the Fourier transform.Let usnow look atthis a littlemoreclosely. We have
F{f(t)} =
∫ ∞
−∞
f (t)e −jωt dt and L{f (t)} =
∫ ∞
0
f (t)e −st dt
InthedefinitionoftheLaplacetransform,theparametersiscomplexandwemaywrite
s=σ+jω,sothat
L{f(t)} =
∫ ∞
0
f(t)e −σt e −jωt dt
Thus an additional factor, e −σt , appears in the integrand of the Laplace transform. For
σ > 0thisrepresentsanexponentiallydecayingfactor,thepresenceofwhichmeansthat
theintegralexistsforawidervarietyoffunctionsthanthecorrespondingFourierintegral.
Example24.14 Find,ifpossible,
(a) the Laplace transform
(b) the Fourier transform
of f (t) =u(t)e 3t . Comment uponthe result.
24.7 The relationship between the Fourier transform and the Laplace transform 773
Solution (a) Either by integration, orfrom Table 21.1, we find
L{u(t)e 3t } = 1
s −3
provideds > 3
(b) F{u(t)e 3t } =
∫ ∞
0
e 3t e −jωt dt =
∫ ∞
0
[ e
(3−jω)t ∞
e (3−jω)t dt = .
3−jω]
0
Now, as t → ∞, e 3t → ∞, so that the integral fails to exist. Clearly, u(t)e 3t has a
Laplace transform butno Fourier transform.
Suppose f (t)isdefined tobe0fort < 0.Thenits Fourier transform becomes
F{f(t)} =
∫ ∞
0
f (t)e −jωt dt
and its Laplace transform is
L{f(t)} =
∫ ∞
0
f (t)e −st dt
ByreplacingsbyjωintheLaplacetransformweobtaintheFouriertransformof f (t)if
itexists.CaremustbetakenheresincewehaveseenthattheFouriertransformmaynot
exist forafunctionthatnevertheless has a Laplace transform.
Example24.15 Findthe Laplace transforms of
(a) u(t)e −2t
(b) u(t)e 2t
Lets = jω and comment uponthe result.
Solution (a) L{u(t)e−2t } = 1
s +2 .
(b) L{u(t)e 2t } = 1
s −2 .
Replacingsby jω in (a) gives
Now
F{u(t)e −2t } = 1
jω+2
1
jω+2 . Similarly, replacingsby jω in (b) gives 1
jω−2 .
sothatreplacingsbyjωintheLaplacetransformresultsintheFouriertransform.However,
F{u(t)e 2t } does not exist and even though we can lets = jω in the Laplace transformand
obtain , wecannot interpretthis asaFourier transform.
1
jω−2
TheFouriertransformdoespossesscertainadvantagesovertheLaplacetransform.While
the Laplace transform can only be applied to functions which are zero fort < 0, the
774 Chapter 24 The Fourier transform
Fourier transform is applicable to functions with domain −∞ < t < ∞. In some applications
where, for example, t represents not time but a spatial variable, it is often
necessary towork with negative values.
The inverse Fourier transformisgiven by
F −1 {F(ω)} = 1 ∫ ∞
F(ω)e jωt dω (24.10)
2π −∞
ThecorrespondinginverseLaplacetransformrequiresadvancedtechniquesinthetheory
of complex variables which are beyond the scope of this book. The existence of Equation
(24.10) is not quite as advantageous as it may seem because it is often difficult to
perform the required integration analytically.
24.8 CONVOLUTIONANDCORRELATION
Convolution is an important technique in signal and image processing. It provides a
means of calculating the response or output of a system to an arbitrary input signal if
the impulse response is known. The impulse response is the response of the system to
animpulsefunction.Convolvingtheinputsignalandtheimpulseresponseresultsinthe
response to the arbitrary input. Correlation is a second important technique. It can be
used to determine the time delay between a transmitted signal and a received signal as
might occur inradarorsonar detection equipment.
24.8.1 Convolutionandtheconvolutiontheorem
If f (t) andg(t) are two real piecewise continuous functions, their convolution, which
wedenoteby f ∗g, isdefinedasfollows:
Theconvolution of f (t)andg(t):
f∗g=
∫ ∞
−∞
f(λ)g(t − λ)dλ
f ∗gisitselfafunctionoft, andtoshowthisexplicitly wesometimes write (f ∗g)(t).
Note that convolution is an integral with respect to the dummy variable λ. In general,
as can be seen from the limits of integration, λ varies from −∞ to ∞, but in particular
caseswewillseethatthisintervalofintegrationcanbereduced.Intheexampleswhich
follow the precisemeaning ofthe terms f (λ)andg(t − λ) will become apparent.
Because convolution is commutative, that is f ∗g = g ∗ f, the convolution can be
defined equivalently as
∫ ∞
−∞
f(t − λ)g(λ)dλ
Incaseswhen f (t)andg(t)arezerofort < 0thisexpressionreducestothatdefinedfor
the Laplace transform inSection21.9.
24.8 Convolution and correlation 775
Theconvolution theoremstatesthatconvolutioninthetimedomaincorresponds to
multiplication inthe frequency domain:
The convolution theorem:
If F{f(t)} =F(ω)and F{g(t)} =G(ω)then
F{f ∗g} =F(ω)G(ω)
Thistheoremgivesusatechniqueforcalculatingtheconvolutionoftwofunctionsusing
the Fourier transform,since
f∗g=F −1 {F(ω)G(ω)}
So,itis possible tofind the convolution of f (t)andg(t), by
(1) finding the corresponding Fourier transforms,F(ω)andG(ω),
(2) multiplying these together toformF(ω)G(ω),
(3) finding the inverse Fourier transform which then yields f ∗g.
This is a process often used for finding convolutions using a computer as will be explained
inSection 24.15.
Agraphicalrepresentationofconvolutionisusefulasitcanthrowlightontheunderlyingprocessandhelpustodetermineappropriatelimitsofintegration.Wewillillustrate
thisinthe following example.
Example24.16 (a) Usingthe definition ofconvolution, calculate the convolution f ∗gwhen
f (t) =u(t)e −t andg(t) =u(t)e −2t , whereu(t) isthe unit stepfunction.
(b) Verifythe convolution theorem for these functions.
Solution (a) The convolution of f andgisgiven by
1
f(t)=u(t)e –t
g(t) = u(t)e –2t t
Figure24.12
Graphsof
f(t) =u(t)e −t and
g(t) =u(t)e −2t .
f∗g=
∫ ∞
−∞
f(λ)g(t − λ)dλ
Graphs of f (t)andg(t) areshown inFigure 24.12.
Evaluatingaconvolutionintegralcanbedifficult,sowewilldevelopthesolutioninstages.Firstofallitisnecessarytobeclearaboutthemeaningofthedifferent
termsintheintegrand.Notethatif f (t) =u(t)e −t then f (λ) =u(λ)e −λ .Similarly
g(λ) =u(λ)e −2λ .Thefunctiong(−λ)isfoundbyreflectingg(λ)intheverticalaxis
asshowninFigure24.13(a).Insignalprocessingterminologythisisalsoknownas
folding.Thefoldedgraphcanbetranslatedadistancet totheleftortotherightby
changingtheargumentofgtog(t −λ).Ift isnegativethegraphmovestotheleftas
shown in Figure 24.13(b) whereas ift is positive it moves to the right as shown in
Figure 24.13(c). In Figure 24.13 we have superimposed the graphs of f (λ), shown
dashed, andg(t − λ), fort being negative, zero and positive. Where the graphs do
notoverlap,theproduct f (λ)g(t −λ),andhencetheconvolution,mustbezero.We
examineseparatelythedomainst < 0andt 0correspondingtowherethegraphs
do notoverlap and where they do overlap respectively.
776 Chapter 24 The Fourier transform
g(–l)
f( l)= u( l)e – l
(a)
l
g(t – l) when t < 0
(b)
t
l
g(t – l) when t > 0
(c)
t
l
Figure24.13
Thefunctiong(t − λ)forvariousvalues
oft.
Whent<0
Whent < 0there isclearlyno overlap and itfollows that f ∗g= 0.
Whent0
Whent 0thereisoverlapforvaluesof λbetween0andt,thatiswhen0 λ t,
and hence
f∗g=
=
∫ t
0
∫ t
0
e −λ e −2(t−λ) dλ
e −λ e −2t e 2λ dλ
∫ t
= e −2t e λ dλ
0
= e −2t [e λ ] t 0
= e −2t (e t −1)
= e −t −e −2t
Finally the complete expression for the convolution is
(f ∗g)(t)=
This may alsobe writtenas
{ e −t −e −2t whent 0
0 whent<0
(f ∗g)(t) =u(t)(e −t −e −2t )
(b) UsingTable 24.1 the Fourier transforms of f andgare
F(ω) = 1
1+jω
G(ω) = 1
2+jω
and theirproduct is
F(ω)G(ω) =
1
(1+jω)(2+jω)
24.8 Convolution and correlation 777
The Fourier transform of the convolution is, usinglinearityand Table 24.1,
F{u(t)(e −t −e −2t )} = 1
1+jω − 1
2+jω
= (2+jω)−(1+jω)
(1+jω)(2+jω)
1
=
(1+jω)(2+jω)
(24.11)
whichisthesameasEquation(24.11).Wehaveshownthat F{f ∗g} =F(ω)G(ω)
and sothe convolution theorem has been verified.
Example24.17 (a) Usingthe definition ofconvolution find the convolution, f ∗g, ofthe ‘top-hat’
function
{ 1 −1t1
f(t)=
0 otherwise
and the functiong(t) =u(t)e −t , whereu(t) isthe unit stepfunction.
(b) Verifythe convolution theoremforthesefunctions.
Solution (a) Thefunctions f (t)andg(t) are shown inFigure 24.14.
Theconvolution of f (t)andg(t) isgiven by
f∗g=
∫ ∞
−∞
f(λ)g(t − λ)dλ
Note that since g(t) = u(t)e −t , it follows that g(λ) = u(λ)e −λ as shown in Figure
24.15(a). The function g(−λ) is found by reflecting, or folding, g(λ) in the
vertical axis. This folding is shown in Figure 24.15(b). The folded graph can be
translated a distancet to the left or to the right by changing the argument of g to
g(t − λ).Ift isnegativethegraphinFigure24.15(b)movestotheleft,whereasift
ispositive itmoves tothe right.Study Figures24.15(c--g)toobserve this.
Convolution is the integral of the product of f (λ) andg(t − λ). We have superimposed
f (λ) on the graphs in Figure 24.15. For values of λ where the graphs do
notoverlap, thisproduct mustbezero.
Inspectionofthegraphsshowsthatwhent islessthan −1(Figure24.15(c))there
isno overlap and hence f (λ)g(t − λ) = 0.So:
ift<−1
f∗g=0
f(t)
1
g(t) =
u(t) e – t
–1 1
t
t
Figure24.14
The‘top-hat’ function f (t),
andg(t) =u(t)e −t .
778 Chapter 24 The Fourier transform
1
g( l)
(a)
l
g(– l)
1
(b)
l
g(t– l ) when t< –1
1
(c)
t
–1
1
l
g(t– l ) when t = –1
(d)
–1
1
l
g(t– l ) when – 1ø t ø 1
(e)
–1
t
1
l
g(t– l )whent= 1
(f)
–1
1
l
g(t– l ) when t . 1
(g)
–1
1
t
l
Figure24.15
Thefunctiong(t − λ) forvarious
values oft.
24.8 Convolution and correlation 779
Whent isgreaterthan −1butlessthan1,asinFigure24.15(e),thereisanoverlap,
andhenceanon-zeroproduct.Thisoccursforvaluesof λbetween −1andt,thatis
in the interval −1 λ t. Within this interval f (λ) = 1 andg(t − λ) = e −(t−λ) .
So:
if −1t<1
f∗g=
∫ t
−1
∫ t
e −(t−λ) dλ = e −t e λ dλ = e −t [e λ ] t −1 = e−t [e t −e −1 ] = 1 −e −1−t
−1
Whent is greater than 1 the graphs overlap, but only for values of λ between −1
and 1,thatisfor −1 λ < 1.So:
ift>1
f∗g=
∫ 1
−1
e −(t−λ) dλ = e −t [e λ ] 1 −1 = e−t (e 1 −e −1 )
Putting all these results together
⎧
⎨0 t<−1
(f ∗g)(t)= 1−e −1−t −1t<1
⎩
e −t (e 1 −e −1 ) t 1
(b) Using Table 24.1 the Fourier transformsof f andgare
F(ω) = 2sinω
ω
and theirproduct is
F(ω)G(ω) =
2sinω
ω(1+jω)
G(ω) = 1
1+jω
(24.12)
Since the convolution is defined differently on each part of the domain, then the
Fouriertransformoftheconvolutionmustbefoundbyintegratingovereachpartof
the domain separately. You will alsoneed torecall that
So,
sinω= ejω −e −jω
2j
F{(f ∗g)(t)} =
=
=
=
∫ ∞
−∞
∫ −1
−∞
∫ ∞
+
1
∫ 1
−1
∫ 1
−1
(f ∗g)(t)e −jωt dt
0 ·e −jωt dt +
∫ 1
−1
e −t (e 1 −e −1 )e −jωt dt
(1 −e −1−t )e −jωt dt
e −jωt −e −1−t−jωt dt + (e 1 −e −1 )
∫ ∞
e −jωt −e −1 e −t(1+jω) dt+(e 1 −e −1 )
1
e −t e −jωt dt
∫ ∞
1
e −t(1+jω) dt
[ ] e
−jωt 1 [ e
= −e −1 −t(1+jω) 1 [ e
+(e
−jω
−1
−(1+jω)] 1 −e −1 −t(1+jω) ∞
)
−1
−(1+jω)]
1
780 Chapter 24 The Fourier transform
( )
= e−jω
−jω + ejω e
−(1+jω)
jω −e−1 −(1+jω)
( ) ( )
e
+e −1 (1+jω) e
+(e 1 −e −1 −(1+jω)
)
−(1+jω) 1+jω
Thefirsttwotermssimplifyto 2sinω
ω
.Bysimplifyingandrearrangingtheremainder
becomes − ejω −e −jω
. Putting all thistogether we find
1+jω
F{(f ∗g)(t)} = 2sinω
ω
= 2sinω
ω
= 2sinω
ω(1+jω)
− ejω −e −jω
1+jω
− 2jsin ω
1+jω
whichisthesameasEquation(24.12).Wehaveshownthat F{f ∗g} =F(ω)G(ω)
and so the convolution theorem has been verified.
24.8.2 Correlationandthecorrelationtheorem
If f (t) and g(t) are two piecewise continuous functions their correlation (also called
their cross-correlation), whichwedenoteby f ⋆g, isdefined asfollows:
Thecorrelationof f (t)andg(t):
f⋆g=
∫ ∞
−∞
f(λ)g(λ −t)dλ
Thisformulaisthesameasthatforconvolutionexceptfortheimportantdifferencethat
the functiongisnotfolded; thatis, wehaveg(λ −t) hererather thang(t − λ).
It is possible to show that the correlation of f (t) andg(t) can be written in the alternative
form
f⋆g=
∫ ∞
−∞
f(t + λ)g(λ)dλ
Sometextsusethisform,butnotethatitistheargumentof f whichist + λ(seeQuestion
8 inExercises 24.8).
We can now statethecorrelation theorem.
Thecorrelationtheorem:
If F{f(t)} =F(ω)and F{g(t)} =G(ω)then
F{f ⋆g} =F(ω)G(−ω)
24.8 Convolution and correlation 781
Example24.18 (a) Usingthe definition ofcorrelation, calculate the correlation of f (t) =u(t)e −t and
g(t) =u(t)e −2t , whereu(t) isthe unit stepfunction.
(b) Verifythe correlation theorem for these functions.
Solution (a) The correlation of f andgisgiven by
f⋆g=
∫ ∞
−∞
f(λ)g(λ −t)dλ
1
f(t)= u(t) e –t
g(t) = u(t) e – 2t
Figure24.16
Graphsof f (t) =u(t)e −t
andg(t) =u(t)e −2t .
t
Graphs of f (t)andg(t) areshown inFigure 24.16.
InFigure24.17wehavesuperimposedthegraphsof f (λ),showndashed,and
g(λ −t), fort being negative, and then positive. Where the graphs do not overlap,
the product f (λ)g(λ −t), and hence the correlation, iszero.
Whent<0
Whent < 0 the graphs overlap for 0 λ < ∞. Hence
f⋆g=
=
∫ ∞
0
∫ ∞
0
f(λ)g(λ −t)dλ
e −λ e −2(λ−t) dλ
∫ ∞
= e 2t e −3λ dλ
0
= e 2t [ e
−3λ
−3
] ∞
0
= 1 3 e2t
f( l)= u( l)e –
l
(a)
l
g( l – t) when t < 0
(b)
t
l
(c)
g( l – t )when t > 0
t
l
Figure24.17
Graphsof f(λ),andg(λ −t)for(b)t < 0,
(c)t>0.
782 Chapter 24 The Fourier transform
Whent0
Whent 0 the graphs overlap fort λ < ∞. Hence
f⋆g=
=
∫ ∞
t
∫ ∞
t
f(λ)g(λ −t)dλ
e −λ e −2(λ−t) dλ
∫ ∞
= e 2t e −3λ dλ
= e 2t [ e
−3λ
−3
t
= e 2te−3t
3
= 1 3 e−t
] ∞
t
Finally the complete expression forthe correlation is
⎧
1
⎪⎨
(f ⋆g)(t)=
3 e2t whent < 0
⎪⎩ 1
3 e−t whent 0
(b) UsingTable 24.1 the Fourier transforms of f andgare
F(ω) = 1 G(ω) = 1
1+jω 2+jω
and hence
F(ω)G(−ω) =
1
(1+jω)(2−jω)
Furthermore, taking the Fourier transformof f ⋆g, obtained inpart(a), we have
∫ 0
∫ ∞
1
F{f⋆g}=
−∞ 3 e2t e −jωt 1
dt +
0 3 e−t e −jωt dt
[ e
t(2−jω) 0 [ e
t(−1−jω) ∞
=
+
3(2−jω)]
−∞
3(−1 −jω)]
0
=
1
3(2−jω) + 1
3(1+jω)
= 3+3jω+6−3jω
9(2−jω)(1+jω)
=
1
(2−jω)(1+jω)
We have shown that F{f ⋆g} = F(ω)G(−ω) and so the correlation theorem has
been verified.
24.9 The discrete Fourier transform 783
EXERCISES24.8
1 Find the convolution of
{ 2
f(t)= 3 t 0t3
0 otherwise
and
{ 4 −1t3
g(t) =
0 otherwise
2 Theconvolution ofafunctionwith itselfis known as
autoconvolution.Find the autoconvolution f ∗ f
when
{ 1 −1t1
f(t)=
0 otherwise
3 Find the correlationof f (t) = 1 for −1 t 1 and
zerootherwise, andg(t) =u(t)e −t .Verify the
correlationtheoremforthese functions.
4 Prove thatconvolution iscommutative, thatis
f ∗g=g ∗ f.Note thatcorrelationis not.
5 Show thatif f (t)andg(t) are both zero,fort < 0,
then (f ∗g)(t) = ∫ t
0 f(λ)g(t − λ)dλ.
6 Prove the convolution theorem.
7 Show that f (t) ⋆g(t) = f (t) ∗g(−t).Deduce thata
correlationcanbeexpressedintermsofaconvolution.
Showalsothat f(t) ∗g(t) = f(t) ⋆g(−t).
8 Prove thatthe correlationintegral
f⋆g= ∫ ∞
−∞ f (λ)g(λ −t)dλcanalsobe writtenin
the form ∫ ∞
−∞g(λ)f(t + λ)dλ.
9 Prove the correlationtheorem.
Solutions
⎧
0 t−1
4
⎪⎨ 3 (t+1)2 −1<t2
1 f∗g= 12 2<t3
12 − 4 ⎪⎩ 3 (t−3)2 3<t6
0 t>6
⎧
⎪⎨
t+2 −2t0
2 f∗f= 2−t 0<t2
⎪⎩
0 otherwise
⎧
⎪⎨
e t (e 1 −e −1 ) t <−1
3 f⋆g= 1−e
⎪⎩
t−1 −1t1
0 t>1
24.9 THEDISCRETEFOURIERTRANSFORM
FormostpracticalengineeringproblemsrequiringtheevaluationofaFouriertransform
it is necessary to use a computer and so some form of approximation is needed. In this
section we show how a function f (t), with Fourier transformF(ω), can be sampled at
intervalsofT togiveasequenceofvalues f (0), f (T), f (2T)....ThediscreteFourier
transform (d.f.t.) takes such a sequence and processes it to produce a new sequence
whichcanbethoughtofasasampledversionofF(ω).Thed.f.t.isimportantasitisthe
basis of most signal and image processing methods. The use of a computer is essential
because ofthe enormousnumberofcalculationsrequired.
We start by stating the transform and provide a simple example to show how it is
calculated. The interested reader should refer to Section 24.10 for a derivation which
shows the relationshipbetween the Fourier transform and the d.f.t.
784 Chapter 24 The Fourier transform
24.9.1 Definitionofthed.f.t.
We consider a sequence ofN terms, f[0], f[1], f[2],..., f[N −1].
Thed.f.t.ofasequence f[n],n = 0,1,2,...,N −1,isanothersequenceF[k],also
havingN terms,defined by
F[k] =
∑N−1
n=0
f[n]e −2jnkπ/N
fork=0,1,2,...,N−1
We writeF[k] = D{f[n]}todenote the d.f.t. of the sequence f[n].
There are a number of variations of this definition and you need to be aware that
different authors may use slightly different formulae. This can cause confusion when
thed.f.t.isfirstmet.Inparticular,someauthorsincludeafactor 1 N inthedefinition,and
others use a positive exponential term instead of the negative one given above. What is
crucial isthatyou know which formula isbeing used and you apply itconsistently.
We now give an example toshow the application of the formula.
Example24.19 Findthe d.f.t. ofthe sequence f[n] = 1,2,−5,3.
Solution We use the formula
D{f[n]} =F[k] =
∑N−1
n=0
f[n]e −2jnkπ/N
fork=0,1,2,...,N−1
Herethe number ofterms,N, isfour:
3∑
F[k] =
n=0
f[n]e −2jnkπ/4
fork=0,1,2,3
So,whenk = 0,
3∑
F[0] = f[n]e 0
n=0
= 1
Whenk = 1,
3∑
F[1] =
=1+2+(−5)+3
n=0
f[n]e −2jnπ/4
= 1 +2e −2jπ/4 + (−5)e −2j2π/4 +3e −2j3π/4
=1+2e −πj/2 −5e −πj +3e −3jπ/2
= 1 +2(−j) −5(−1) +3j
=6+j
24.9 The discrete Fourier transform 785
Whenk = 2,
3∑
F[2] = f[n]e −2jn2π/4
n=0
3∑
= f[n]e −πjn
n=0
= 1 +2e −jπ + (−5)e −2jπ +3e −3jπ
= 1 +2(−1) −5(1) +3(−1)
= −9
Finally, whenk = 3,
3∑
F[3] = f[n]e −2jn3π/4
n=0
3∑
=
n=0
f[n]e −3jnπ/2
= 1 +2e −3jπ/2 + (−5)e −3jπ +3e −9jπ/2
= 1 +2(j) −5(−1) +3(−j)
=6−j
So, the d.f.t. of the sequence 1,2,−5,3 isthe sequence 1,6 +j,−9,6 −j.
You will note from this simple example that a significant amount of calculation is necessary
even when there are just four points in the sampled sequence. This is why it is
necessary to use a computer program to find the transform.Even then, the computation
can be very time consuming. For this reason, much effort has gone into developing fast
algorithms known as fast Fourier transforms (f.f.t.s). For details of these algorithms
youshouldrefertooneofthemanyspecialisttextsonthesubject.Youshouldalsoinvestigatewhichpackagesareavailabletoenableyoutoperformthissortofcalculation.For
example, the computer package MATLAB ® has built-in commands for finding a d.f.t.
using an f.f.t. For example, the MATLAB ® command fft(f) calculates the d.f.t. of a
sequencefas shown below:
f=[1 2 -5 3];
y=fft(f)
y =
1.0000 6.0000+ 1.0000i -9.0000 6.0000- 1.0000i
InMATLAB ® either j or i can beused ininput torepresent √ −1,for example
1+5j
or
1+5i
786 Chapter 24 The Fourier transform
EXERCISES24.9.1
and they are both equivalent. However when variables are printed, i is always used.
Hence both of these inputs iftyped atthe command line would give
ans= 1+5i
this should beborne inmindwhen viewing the results of the f.f.t.
1 Usethe definition to findthe d.f.t. ofthe sequences
f[n] = 1,2,0, −1 andg[n] = 3,1, −1,1.
2 Calculate the d.f.t. ofthe sequence f[n] = 5,−1,2.
3 Useatechnical computing language suchas
MATLAB ® to verifyyour answers to Questions1
and 2.
4 Fromthe definition ofthe d.f.t. showthat if f[n] is a
sequence ofreal numberswith d.f.t.F[k], then
∑ N−1
n=0 f[n]e2jnkπ/N =F[k] where the overline
denotes the complex conjugate ofF[k].
5 Forasequence ofcomplex numbersF[k], letF[k]
represent the sequence obtained by taking the
complex conjugate ofeach termin the sequence.
Show that if f[n]is asequence ofreal numbers,then
F[N−k]=F[k]fork=0,1,2,...,N/2ifNis
even,andfork =0,1,2,...,(N −1)/2ifN isodd.
Thiscan be seen in Example 24.20 whereN = 4 and
F[3] =F[1].
Solutions
1 F[k]=2,1−3j,0,1+3j.G[k]=4,4,0,4 2 6,4.5 +2.5981j,4.5 −2.5981j
24.9.2 Theinversed.f.t.
Just as there is an inverse Fourier transform which transformsF(ω) back to f (t), there
isaninverse d.f.t. whichconvertsF[k] backto f[n].
Theinverse d.f.t. ofthe sequenceF[k],
k = 0,1,2,...,N −1,isthe sequence f[n], alsohavingN terms,given by
D −1 {F[k]} = f[n] = 1 N
∑
N−1
F[k]e 2jknπ/N
k=0
n=0,1,2,...,N−1
Example24.20 Using the definition, find the inverse d.f.t. of the sequence F[k], for k = 0,1,2,3,
given by
F[k] = −4,1,0,1
Solution Inthis sequencetherearefourtermsand soN = 4.Fromthe definition
D −1 {F[k]} = f[n] = 1 3∑
F[k]e 2jknπ/4 n=0,1,2,3
4
= 1 4
k=0
3∑
k=0
F[k]e jknπ/2
n=0,1,2,3
= 1 4 (−4 +1ejnπ/2 +0e jnπ +1e 3jnπ/2 )
24.10 Derivation of the d.f.t. 787
Then, takingn = 0,1,2,3 gives the sequence:
f[n]=− 1 2 ,−1,−3 2 ,−1
EXERCISES24.9.2
1 Usethe definition to findthe inverse d.f.t. ofthe
sequenceF[k] = 6, −2,2, −2.
2 Investigatewhether you have access to acomputer
packagewhichwill calculatean inverse d.f.t. Use the
packageto verifyyour answer to Question1.
3 Prove that f[n] = 1 ∑ N−1
N k=0 F[k]e2jknπ/N is indeed
the inverse ofF[k] = ∑ N−1
m=0 f[m]e−2jkmπ/N by
substituting the expression forF[k] fromthe second
formula intothe first,interchanging the orderof
summation andsimplifyingthe result.
Solutions
1 f[n]=1,1,3,1
24.10 DERIVATIONOFTHED.F.T.
24.10.1 Somepreliminaryresults
A number of results discussed earlier in this book will be required in the development
whichfollows. To assistinthisdevelopment weremind youofthesenow.
Euler’s relations have beendiscussedinSection9.7.Recallthat
e −jθ =cosθ−jsinθ
and inparticular that
e −2nπj = cos2nπ −jsin2nπ
Furthermore, whennisaninteger cos2nπ = 1 and sin2nπ = 0 and so e −2nπj = 1.
Thefollowing integral propertyofthe delta, orimpulse, functionwill beneeded:
∫ ∞
−∞
f(t)δ(t −a)dt = f(a)
So,multiplyinganyfunction, f (t),bythedeltafunction δ(t−a),representinganimpulse
occurring att = a, and integrating, results in f (a). This sifting property of the delta
function, so called because it sifts out the value of f (t) at the location of the impulse,
has beendiscussedinSection16.4. Inparticular, when f (t) = e −jωt wenote
∫ ∞
−∞
e −jωt δ(t −a)dt = e −jωa
When a continuous function f (t), which is defined for t 0, is evaluated at times
t = 0,T,2T,...,nT,... we obtain the sequence of values f(0),f(T),f(2T),...,
f (nT),..., which we denote more concisely by the sequence f[0], f[1], f[2],...,
f[n],…. This sequence can be expressed in the form of a continuous function, ˜f(t),
788 Chapter 24 The Fourier transform
inthe following way:
∞∑
˜f(t)=T f[n]δ(t −nT)
n=0
Notethatwhereas f (t)istheoriginalcontinuousfunctionoft, ˜f (t)isanapproximation
obtainedusingonlythesampledvalues.Thisrepresentationisdiscussedingreaterdetail
inAppendix I.
24.10.2 Derivation
Thisderivationisbaseduponthefactthatafunction f (t),definedfort 0,andhaving
been sampled atintervals ofT, can be represented by the function ˜f (t) where
∞∑
˜f(t)=T f[n]δ(t −nT)
n=0
In any real problem it is only possible to sample a signal over a finite time interval.
Suppose we obtainN samples of the signal at timest = 0,T,2T,..., (N −1)T. Then
thesampledsignalcanberepresentedbyamendingthelimitsofsummationandwriting
˜f(t)=T
∑N−1
n=0
f[n]δ(t −nT)
Taking the Fourier transform of both sides gives
F{ ˜f(t)}=T
∫ ∞
−∞
∑N−1
e −jωt
n=0
f[n]δ(t −nT)dt
If we make the assumption that it is permissible to interchange the order of integration
and summation wefind
F{ ˜f(t)}=T
=T
∑N−1
f[n]
∫ ∞
n=0
−∞
∑N−1
f[n]e −jωnT
n=0
e −jωt δ(t −nT )dt
where we have used the sifting property of the delta function. The quantity on the r.h.s.
isacontinuousfunctionof ωderivedusingthevaluesinthesequence f[n].Writethisas
F(ω)andnotethat ˜ F(ω)isanapproximationtotheFouriertransformF(ω) ˜
= F{f (t)}.
It can also be thought of as a Fourier transform for sequences. An important point to
note about this function of ω is that it is periodic with period 2π . This is proved in the
T
following example.
Example24.21 Show thatthe function
∑N−1
F(ω) ˜ =T
n=0
f[n]e −jωnT
isperiodic with period 2π
T .
(
Solution ConsiderF
˜ ω + 2π )
for any ω.
T
(
F˜ ω + 2π )
∑N−1
−j(ω+2π/T )nT
=T f[n]e
T
n=0
=T
=T
∑N−1
n=0
∑N−1
n=0
Now wesaw inSection 24.10.1 that
f[n]e −jωnT e −j2π T nT
f[n]e −jωnT e −2nπj
24.10 Derivation of the d.f.t. 789
usingthe laws of indices
e −2nπj =cos2nπ−jsin2nπ=1
Finally wehave
(
F˜ ω + 2π )
=T
T
∑N−1
n=0
f[n]e −jωnT
which equalsF(ω).
˜ (
We have shown that F ˜ ω + 2π )
= F(ω) ˜ for any ω. HenceF(ω) ˜ is periodic with
T
period 2π
T .
For computational purposes it is necessary to sampleF(ω) ˜ at discrete values. Suppose
we choose to sample at N points over the interval 0 ω < 2π . By choosing this
T
intervalwearesamplingoveracompleteperiodinthefrequencydomain.Sowechoose
the sample points given by ω =k 2π
(
fork = 0,1,2,...,(N −1).WritingF˜
k 2π )
NT NT
asF(ω ˜
k
)wehave
∑N−1
F(ω ˜
k
)=T
n=0
N−1
2π
−jk
f[n]e NT nT
∑
=T f[n]e −2jnkπ/N (24.13)
n=0
Thislastsum,withoutthefactorT,isusuallytakenasthedefinitionofthed.f.t.of f[n],
writtenF[k]. To denote the d.f.t. of the sequence f[n] wewill write D{f[n]}.
Thed.f.t. ofthe sequence f[n],n = 0,1,2,...,N −1,isthe sequencegiven by
D{f[n]} =F[k] =
∑N−1
n=0
f[n]e −2jnkπ/N
fork=0,1,2,...,N−1
790 Chapter 24 The Fourier transform
When we use this transform to approximate the Fourier transform of the function f (t)
sampled at intervalsT, it follows from Equation (24.13) that we must multiply by the
factor T to obtain our approximation. An example of this is given in the next
section.
24.11 USINGTHED.F.T.TOESTIMATEAFOURIERTRANSFORM
One application of the d.f.t. is to estimate the continuous Fourier transform of a signal
f (t). The following example shows how this can bedone.
Example24.22 The signal f (t) = u(t)e −2t , where u(t) is the unit step function, is shown in
Figure 24.18(a). Its Fourier transform, which can be found from Table 24.1, is
F(ω) = 1 and a graph of |F(ω)|isshown inFigure 24.18(b).
2+jω
Suppose we obtain 16 sample values of f (t) at intervals ofT = 0.1 fromt = 0 to
t = 1.5.
(a) Obtain the d.f.t. of the sampled sequence.
(b) Usethe d.f.t. toestimatethe trueFourier transform values.
(c) Plot a graph tocompare values of |F[k]| and |F(ω)|.
Solution (a) If f (t) =u(t)e −2t is sampled att =nT, that ist = 0,0.1,0.2,...,1.5, we obtain
the sequence f[n] = e −2nT = e −0.2n forn = 0,1,2,...,15. This is the sequence
given inthe second column of Table 24.2.
(b) Thecalculationofthed.f.t.isfartoolaborioustobedonebyhand.Insteadwehave
used the MATLAB ® command fft() to do the calculation and the results, F[k],
have been placed inthe third column.
(c) From Section 24.10 we know that the d.f.t. samples at intervals of 2π in the frequencydomain.SoforcomparisonthefourthcolumnshowsthetruevaluesofF(ω)
NT
1 f(t) = u(t)e – 2 t t
4 + 2 30
u F( v) u =
1
v
0.5
(a)
1
(b)
Figure24.18
The signal f (t) =u(t)e −2t and itsFourier transform.
–30
v
24.11 Using the d.f.t.to estimate a Fourier transform 791
Table24.2
n,k f[n] = e −0.2n F[k] F(ω k ) 0.1F[k]
0 1.0000 5.2918 0.5000 0.5292
1 0.8187 1.4835 −1.9082j 0.1030 −0.2022j 0.1484 −0.1908j
2 0.6703 0.7882 −1.0837j 0.0304 −0.1196j 0.0788 −0.1084j
3 0.5488 0.6311 −0.6952j 0.0140 −0.0825j 0.0631 −0.0695j
4 0.4493 0.5743 −0.4702j 0.0080 −0.0626j 0.0574 −0.0470j
5 0.3679 0.5485 −0.3159j 0.0051 −0.0504j 0.0548 −0.0316j
6 0.3012 0.5355 −0.1964j 0.0036 −0.0421j 0.0536 −0.0196j
7 0.2466 0.5293 −0.0944j 0.0026 −0.0362j 0.0529 −0.0094j
8 0.2019 0.5274 0.0020 −0.0317j 0.0527
9 0.1653 0.5293 +0.0944j 0.0016 −0.0282j 0.0529 +0.0094j
10 0.1353 0.5355 +0.1964j 0.0013 −0.0254j 0.0536 +0.0196j
11 0.1108 0.5485 +0.3159j 0.0011 −0.0231j 0.0548 +0.0316j
12 0.0907 0.5743 +0.4702j 0.0009 −0.0212j 0.0574 +0.0470j
13 0.0743 0.6311 +0.6952j 0.0008 −0.0196j 0.0631 +0.0695j
14 0.0608 0.7882 +1.0837j 0.0007 −0.0182j 0.0788 +0.1084j
15 0.0498 1.4835 +1.9082j 0.0006 −0.0170j 0.1484 +0.1908j
also sampled at intervals of 2π . Recall that to obtain an estimate of the Fourier
NT
transform of f (t) we must multiply the d.f.t. values byT = 0.1. This is shown in
the fifth column of the table.
Figure24.19showsgraphsof |0.1F[k]|and |F(ω)|.Valuesof |F[k]|obtainedbeyond
k = 8aremirrorimagesoftheearliervalues.ThisisbecauseF[N−k] =F[k]asproved
inQuestion5inExercises24.9.1.Thisisaphenomenon ofthed.f.t.andarisesbecause
ofitsperiodicityandsymmetryproperties,furtherdetailsofwhicharebeyondthescope
of this book. The interested reader is referred to a text on signal processing for further
details.
0.6
0.5
0.4
F ( k) = F
0.3
(
2 pk
u v u
N T )
0.1uF[k]u
0.2
0.1
0 2 4 6 8 10 12 14
k
Figure24.19
Comparisonoftruevalues ofaFourier
transformand approximate values obtained
usingad.f.t.
792 Chapter 24 The Fourier transform
24.12 MATRIXREPRESENTATIONOFTHED.F.T.
Whenitisnecessarytodevelopcomputercodeforperformingad.f.t.,anunderstanding
of the following matrix representation isuseful.
We have seen thatthe d.f.t. ofasequence f[n] isgiven by
F[k] =
∑N−1
n=0
f[n]e −2jnkπ/N
Considertheterme −2jnkπ/N whichcanbewrittenusingthelawsofindicesas (e −2jπ/N ) nk .
DefineW = e −2jπ/N so that
F[k] =
∑N−1
n=0
f[n]W nk
Note thatW does not depend uponnor k. For a fixed value of N we can calculateW
which isthen a constant. Writingout thissum explicitly we find
F[k] = f[0]W 0 + f[1]W k + f[2]W 2k + f[3]W 3k +···
+f[N−1]W (N−1)k
Writingthisout foreachkwefind
fork=0,1,2,...,N−1
F[0] = f[0]W 0 + f[1]W 0 + f[2]W 0 + f[3]W 0 +···+f[N−1]W 0
F[1] = f[0]W 0 + f[1]W 1 + f[2]W 2 + f[3]W 3 +···+f[N−1]W N−1
F[2] = f[0]W 0 + f[1]W 2 + f[2]W 4 + f[3]W 6 +···+f[N−1]W 2(N−1)
. = . .
F[N −1] = f[0]W 0 + f[1]W N−1 + f[2]W 2(N−1)
+ f[3]W 3(N−1) +···+f[N−1]W (N−1)(N−1)
These equations can be written inmatrix form asfollows:
⎛ ⎞ ⎛
⎞ ⎛
F[0] W 0 W 0 W 0 ... W 0
F[1]
W 0 W 1 W 2 ... W N−1
F[2]
=
W 0 W 2 W 4 ... W 2(N−1)
⎜
⎝
⎟ ⎜
. ⎠ ⎝
.
.
.
.
⎟ ⎜
. ⎠ ⎝
F[N −1] W 0 W N−1 W 2(N−1) ...W (N−1)(N−1)
whereW = e −2jπ/N .
f[0]
f[1]
f[2]
.
f[N−1]
⎞
⎟
⎠
Example24.23 (a) Find the matrix representing a three-point d.f.t.
(b) Usethe matrix tofind the d.f.t. of the sequence f[n] = 4,−7,11.
Solution (a) HereN = 3 and soW = e −2πj/3 . The required matrix is
⎛
⎞
1 1 1
⎝1 e −2πj/3 e −4πj/3 ⎠
1 e −4πj/3 e −8πj/3
24.13 Some properties of the d.f.t. 793
(b)
which usingthe Cartesian form can be written
⎛
⎞
1 1 1
⎜
1 − 1 √
3
⎝ 2 −j − 1 √
3
2 2 +j 2
1 − 1 √
3
2 +j − 1 √ ⎟
⎠ 3
2 2 −j 2
⎛
⎞
⎛ ⎞ 1 1 1
F[0]
⎝F[1]
⎠ = ⎜
1 − 1 √
3
F[2]
⎝ 2 −j − 1 √ ⎛ ⎞ ⎛ ⎞
3
2 2 +j 4 8
2
1 − 1 √
3
2 +j − 1 √ ⎟ ⎝−7⎠ = ⎝2 +15.5885j⎠
⎠ 3
2 2 −j 11 2 −15.5885j
2
So, the required d.f.t. isthe sequence
8,2 +15.5885j,2 −15.5885j
EXERCISES24.12
1 Using the definition ofthe d.f.t. show thatthe matrix
whichcan be used to implement the d.f.t. ofa
four-point signal is
⎛
⎞
1 1 1 1
W = ⎜1 −j −1 j
⎟
⎝1 −1 1 −1⎠
1 j −1 −j
Use thismatrix to findthe d.f.t. ofthe sequences
f[n] = {1,2,0, −1} andg[n] = {3,1, −1,1}.
Solutions
1 F[k]=2,1−3j,0,1+3j.G[k]=4,4,0,4
24.13 SOMEPROPERTIESOFTHED.F.T.
24.13.1 Periodicity
Thed.f.t.F[k] isperiodic with periodN, thatis
F[k+N] =F[k]
This is why we take onlyN terms in the frequency domain. Calculating further values
will only reproducethe earlier ones.Theinverse d.f.t. isalsoperiodicwith periodN.
24.13.2 Linearity
The d.f.t. is a linear transform. This means that the d.f.t. of the sequenceaf[n] +bg[n]
whereaandbareconstants isaF[k] +bG[k].
794 Chapter 24 The Fourier transform
24.13.3 Parseval’stheorem
If D{f[n]} =F[k] and D{g[n]} =G[k] then
∑N−1
n=0
f[n]g[n] = 1 ∑N−1
F[k]G[k]
N
k=0
where the overline indicates the complex conjugate.
24.13.4 Rayleigh’stheorem
This theorem isobtained fromParseval’s theorem by lettingg[n] = f[n].
∑N−1
|f[n]| 2 = 1 ∑N−1
|F[k]| 2
N
n=0 k=0
Example24.24 VerifyRayleigh’s theoremforthe sequence f[n] = 5,4.
Solution HereN = 2.We first calculateF[k].
F[k] =
1∑
n=0
f[n]e −jnkπ
= f[0] + f[1]e −jkπ
= 5 +4e −jkπ
Whenk =0,wehaveF[0] =5+4e 0 =9.
Whenk=1,wehaveF[1]=5+4e −jπ =5−4=1.
For brevity weoften writeF[k] = 9,1.
Then ∑ 1
n=0 |f[n]|2 =5 2 +4 2 =41.
Also, ∑ 1
k=0 |F[k]|2 = 9 2 +1 2 = 82.
We note that41 = 1 ×82 and this verifies Rayleigh’s theorem.
2
EXERCISES24.13
1 (a)Obtainthe d.f.t.sof f[n] = 1,2,2,1 and
g[n] = 2,1,1,2.
(b)Verify Rayleigh’stheoremforeachofthese
sequences.
2 Suppose f[n] = 3,1,5,4 andg[n] = 2, −1,9,5.
(a) Show that ∑ 3
n=0 f[n]g[n] = 70.
(b) Obtainthe d.f.t.sF[k] andG[k].
(c) CalculateF[k]G[k].
(d) HenceverifyParseval’stheoremforthese
sequences.
3 Show thatifthe d.f.t. of f[n] isF[k] thenthe d.f.t. of
f[n −i] is e −2πjki/N F[k]. This isknown asthe shift
theorem.
4 ProveParseval’stheorem.
5 ProveRayleigh’stheorem.
24.14 The discrete cosine transform 795
Solutions
1 (a) F[k]=6,−1−j,0,−1+j.
G[k]=6,1+j,0,1−j
(b) ∑ |f[n]| 2 = 10, ∑ |F[k]| 2 = 40.
∑ |g[n]| 2 = 10, ∑ |G[k]| 2 = 40
2 (b) F[k] = 13,−2 +3j,3,−2 −3j.
G[k] = 15,−7 +6j,7,−7 −6j
(c) 195,32 −9j,21,32 +9j
24.14 THEDISCRETECOSINETRANSFORM
There isanalternative method tothatofthe d.f.t.fortransformingaset oftimedomain
samples of the signal f (t) into the frequency domain,F[ω]. Itis known as the discrete
cosinetransformord.c.t.WehavealreadyseenthatthediscreteFouriertransformtakes
N samples, f[0], f[1], f[2], ..., f[N − 1] and producesN output samples in the frequencydomainF[0],F[1],F[2],
...,F[N −1].Thed.c.t.iscalculatedinasimilarway.
However, thed.c.t.ofasetofsampleshas specialpropertieswhichmakeitparticularly
suitable for image and audio processing. These will become apparent in the following
examples.
A detailed derivation will not be presented here for the d.c.t., although much of the
mathematics of the d.c.t. and the d.f.t. is very similar. It should also be mentioned that
very efficient methods for computing the d.c.t. are possible; this is beyond the scope of
thisbook which will focus on the importantproperties of the d.c.t. as a transformation.
24.14.1 Definitionofthed.c.t.anditsinverse
We consider a sequence ofN terms, f[0], f[1], f[2], ..., f[N −1].
Thed.c.t.ofasequence f[n],n = 0,1,2,...,N −1,isanothersequenceF[k],also
havingN terms,defined by
F[k] = √ 1 ∑N−1
[ ( π
f[n]cos n + 1 ) ]
k fork=0,1,2, ... ,N−1
N N 2
n=0
Thereareseveralvariantsofthedefinitionofthed.c.t.Theonegivenaboveisprobably
themostcommonlyusedandisusuallytermedd.c.t.-II.Themainmathematicaldifference
between this and the d.f.t. is that in the d.c.t. only real numbers are produced in
F[k], assuming real-valued samples in f[n]. The same cannot be said for the d.f.t. becauseevenwithreal-valuedsamplesin
f[n]theoutputF[k]ingeneralcontainssamples
which have both a real and imaginary part.
The d.c.t. and its variants are popular choices for image and audio compression.
Compression is the process of reducing the volume of data associated with a signal by
removing redundant, partially redundant or repeated data. We will see in the following
examples thatthe d.c.t. isvery usefulforcompressing certaintypes of signal.
Inordertomakeuseofthed.c.t.intheseapplicationsitisnecessarytousetheinverse
operationtorecovertheoriginalsamples.Theinverseofd.c.t.-IIisanotherd.c.t.termed
d.c.t.-III.
796 Chapter 24 The Fourier transform
The inverse d.c.t. of a sequenceF[k],k = 0,1,2,...,N −1, is another sequence
f[n],alsohavingN terms,defined by
f[n] = √ 1 ∑N−1
[ ( π
{F[0] +2 F[k]cos
N N k n + 1 )] }
2
k=1
forn=0,1,2, ... ,N−1
Again,aderivationofthisinversewillnotbepresentedhere.However,wewilldemonstrate
the inverse property for a specific example.
Example24.25 (a) Findthe d.c.t.,F[k], ofthe sequence f[n] = 2,4,6.
(b) Apply the inverse d.c.t. to F[k] and show that the original sequence, f[n], is
obtained.
Solution (a) We use the formula
F[k] = √ 1 ∑N−1
N
n=0
[ π
f[n] cos
N
Herethe number of terms,N, isthree.
Whenk=0
F[0] = 1 √
3
2∑
Whenk=1
n=0
(
n + 1 ) ]
k
2
[ ( π
f[n]cos n + 1 ) ]
×0
3 2
= 1 √
3
[2cos0 +4cos0+6cos0]= 12 √
3
F[1] = 1 √
3
2∑
= √ 1 [
3
Whenk=2
n=0
[ ( π
f[n]cos n + 1 ) ]
×1
3 2
]
fork=0,1,2, ... ,N−1
2cos π 6 +4cosπ 2 +6cos5π 6
√ )]
3
= 1 [√ √ ]
√ 3−3 3 = −2
2 3
= √ 1
√ (
3
[2 ×
3 2 +0+6× −
F[2] = 1 √
3
2∑
= √ 1 [
3
n=0
[ ( π
f[n]cos n + 1 ) ]
×2
3 2
]
2cos π 3 +4cosπ+6cos5π 3
= √ 1 [
2 × 1
3 2 +4× (−1) +6× 1 ]
=1−4+3=0
2
So the d.c.t. of the sequence 2,4,6is 12 √
3
, −2, 0.
24.14 The discrete cosine transform 797
(b) We now apply the inverse d.c.t. toF[k].
The inverse d.c.t. is
f[n] = √ 1 ∑N−1
[ ( π
{F[0] +2 F[k]cos
N N k n + 1 )] }
2
k=1
forn=0,1,2, ... ,N−1
Whenn=0
{
f[0] = √ 1 12
2∑
√ +2
3 3
k=1
= √ 1 [ 12
√ +2× (−2) ×cos
3
[ 3
= √ 1 12
√ −4×
3 3
[ ( π
F[k]cos
3 k 0 + 1 )] }
2
( π
) ]
+2×0
6
√
3
2 +0 ]
=4−2=2
Whenn=1
{
f[1] = √ 1 12
2∑
√ +2
3 3
= 1 √
3
[ 12
k=1
√ +2× (−2) ×cos
3
]
= 4
= 1 √
3
[ 12
√
3
−4×0+0
[ ( π
F[k]cos
3 k 1 + 1 )] }
2
( π
) ]
+2×0
2
Whenn=2
{
f[2] = √ 1 12
2∑
√ +2
3 3
k=1
[ ( π
F[k]cos
3 k 2 + 1 )] }
2
( ) ] 5π
+2×0
6
= √ 1 [ 12
√ +2× (−2) ×cos
3
[ 3
(
= √ 1
√ ) ]
12 3
√ −4× − +0 =4+2=6
3 3 2
So f[n] = 2,4,6, which was the originalsequence as expected.
If a d.f.t. had been performed on the sequence f[n], the output,F[k], would have been
the sequence of complex terms 12, −3 + j1.7321,−3 − j1.7321. Compare this with
the d.c.t. which isF[k] = 12 √
3
,−2, 0. Note that this latter sequence contains only realvalued
terms.
798 Chapter 24 The Fourier transform
Engineeringapplication24.2
Effectoftruncatingthed.c.t.andd.f.t.ofasetofsamples
Now we turn our attention to a longer sequence and consider the effect of deleting
samples inthe d.c.t.
In this example, we produce both a d.c.t. and a d.f.t. of a given set of samples
andthensetsomeofthesamplesinthelatterpartofthesequencetozero.Following
this we carry out an inverse d.c.t. to return an approximation to the original sample
set.
We consider the sequence of samples
f[n]=0,1,2,3,4,5,6,7,8,9
Usingthe methods described inthe previous example wecan show that the d.c.t. is
F DCT
[k] = 14.2302,−6.3815,0.0000,−0.6835,0.0000,
−0.2236,0.0000,−0.0904,0.0000,−0.0254
ThenotationF DCT
hasbeenusedtodistinguishthissequencefromthed.f.t.sequence
thatitwill be compared to. The d.f.t. is
F DFT
[k] = 45.0000,−5.0000 +j15.3884,−5.0000 +j6.8819,−5.0000
+j3.6327,−5.0000 +j1.6246,−5.0000,−5.0000 −j1.6246,
−5.0000 −j3.6327,−5.0000 −j6.8819,−5.0000 −j15.3884
Bydefinition,applyingtheappropriateinverseoperationstoeithersetofdatawould
recover the originalinput samples.
If instead we delete samples from the end of each sequence by setting the values
to zero and then perform the inversion then the results are different. In this example
weset the lastfive samples tobezero. So weperform the inversion on
F DCT
[k] = 14.2302,−6.3815,0.0000,−0.6835,0.0000,0,0,0,0,0
F DFT
[k] = 45.0000,−5.0000 +j15.3884,−5.0000 +j6.8819,−5.0000
+j3.63271,−5.0000 +j1.6246,0,0,0,0,0
The results of the inversion are shown plotted in Figure 24.20. The values obtained
from the inverse d.c.t. are all real values. The values plotted on the graph for the
inversed.f.t.arethemagnitudesofthecomplexvaluesthatarereturned.Thisisnecessary
because now samples have been removed from the d.f.t., it can no longer be
guaranteed thatthe solution consists of wholly real numbers.
Notice that the samples in both cases are not exactly the same as the ones we
startedwith.Thisisexpectedbecauseinsettingsomeofthesamplevaluestozeroin
thed.c.t.wehavedestroyedsomedata.Infactwehavedestroyedhalfofthesamples
in this example. However, notice also that the d.c.t. appears to perform much better
than the d.f.t. in recovering the input data. The reason the d.c.t. looks qualitatively
better than the d.f.t. is that the information content or energy is concentrated in the
lowest order samples. Hence when we set the later samples to zero they affect the
d.c.t. far lessthan the d.f.t.
24.14 The discrete cosine transform 799
f(n)
8
6
4
2
2 4 6 8
Figure24.20
Lossycompression usingthe d.c.t. andthe d.f.t. Markers:circle = original data; square =
d.c.t. derived values; diamond =d.f.t. derived values.
n
Thisisanexampleof lossycompression,whichisusedextensivelyinaudio,image
and video compression. Because some of the samples are known to be zero they do
not need to be stored and hence the file or data storage requirement is much lower. An
approximation of the original data can be recovered even though not all of the original
d.c.t. samples are provided. It can be shown that the more samples we retain, the better
thequalityofthereproduction.Itiswellknownthatthed.c.t.performswellonstraight
line data and is used extensively in compression standards for images such as JPEG. In
image data which is viewed qualitatively a certain amount of data loss is tolerable, and
thiswill be illustratedinthe following example.
Engineeringapplication24.3
Two-dimensionald.c.t.andimagecompression
Sofarwehaveonlyseenthed.c.t.appliedtoaone-dimensional dataset.Thiscould
be used to handle the compression of audio data, for example. When applying the
d.c.t. to a two-dimensional problem such as an image a slightly different form is
required:
F[k,l] = √ 1 ∑N−1
NM
n=0
{ M−1 ∑
[ ( π
f[n,m]cos m + 1 )
l] } [ ( π
cos n + 1 ) ]
k
M 2 N 2
m=0
forn=0,1,2, ... ,N−1
m=0,1,2, ... ,M−1
This formula can be applied to a 2D data set. We take the greyscale image given
in Figure 24.21 as the source data. Each pixel in the image has assigned to it a
numerical brightness value and itistothese values that weapply the d.c.t.
➔
800 Chapter 24 The Fourier transform
f(0, 0)
f(124, 124)
Figure24.21
Source imageford.c.t.
application.
We use the transform given above to produce an N ×M matrix containing the
real d.c.t. values,F[k,l]. Figure 24.22 is a visualization ofF[k,l]. Dark pixels represent
large values, white pixels represent zero or close to zero elements. Notice the
concentration of largevalues close toF[0,0], which isinthe top-left corner.
F(0, 0)
Figure24.22
Thed.c.t. matrix.
F(124, 124)
Inthenextstepwecompresstheimagebysettingthearrayelementstozeroatthe
right-handsideandbottom.Sincethecoefficientsaresmallinthisregiontheeffects
on the image after inversion are minor, unless a significant number of values on the
right-hand sideand bottom are set tozero.
The results of applying the inverse transform are shown in Figure 24.23. We can
seethattheimageontherightwhichcontainsthelowestnumberofd.c.t.coefficients
hasmoredistortionduetothecompression.Suchdistortionsareoftentermedvisual
artefacts. The advantage of having a highly compressed file is that it requires less
space for data storage.
In practice, the image is often broken into smaller blocks with the d.c.t. being
carried out on these blocks. If a highly compressed image is viewed on a computer
atalowresolutionitisoftenpossibletoseetheseblocks,whichtypicallymeasure8
by 8 pixels.
24.15 Discrete convolution and correlation 801
Figure24.23
Effect ofsetting elements to zero in the d.c.t.matrix. Moving from left to right agreater
number ofelements in the matrix are setto zero, resulting in more imageartefacts.
24.15 DISCRETECONVOLUTIONANDCORRELATION
As stated at the beginning of Section 24.8, convolution and correlation are important
techniquesinsignalandimageprocessing.Inthissectionwewilldescribetheirdiscrete
representations. Both ofthese can be implemented efficiently usingthe d.f.t.
24.15.1 Linearconvolution
The linear convolution of two real sequences f[n] andg[n] is another sequence,h[n]
say, which we denote by f ∗g, which isdefined as follows:
Linear convolution of f[n] andg[n]:
∞∑
h[n]=f∗g= f[m]g[n−m] forn=...−3,−2,−1,0,1,2,3,...
m=−∞
Notice the similaritybetween this definition and thatof convolution defined inthe context
of the continuous Fourier transform in Section 24.8. Notice also that in this formulathesequencegisfoldedand
shifted.Thiswillbeillustratedintheexample which
follows.
Frequently the sequences being considered will be finite.
Assume now that f[n] is a finite sequence ofN 1
terms, so that all terms other than
f[0], f[1],..., f[N 1
−1] arezero.
Suppose also that g[n] is a finite sequence of N 2
terms, with all terms other than
g[0],g[1],...,g[N 2
−1] being zero.
Itcanbeshownthatthesequenceh[n]willhavelengthN 1
+N 2
−1,andtheconvolution
sum simplifies tothe following:
The linear convolution of the two finite sequences f[n] andg[n] isdefined as
n∑
h[n]=f∗g= f[m]g[n−m] forn=0,1,2,3,...,(N 1
+N 2
−2)
m=0
802 Chapter 24 The Fourier transform
The terms of this sequence can be calculated by brute force as the following example
will show, although in practice a convolution can be calculated much more efficiently
using a d.f.t.
Example24.26 Suppose f[n]isthe sequence 3,9,2,−1, andg[n] isthe sequence −4,8,5.
(a) Find the linear convolutionh[n] = f ∗gusing the previous formula.
(b) Develop a graphical interpretation of thisprocess.
Solution (a) Notethatboth f andgarefinitesequences oflength4and3respectively. Theconvolution
f ∗gwill be a sequence of length 4 +3−1 = 6.By definition,
h[n] =
n∑
f[m]g[n−m]
m=0
forn=0,1,2,...,5
The first terminh[n] isobtained by lettingn = 0:
h[0] =
0∑
f[m]g[0 −m]
m=0
= f[0]g[0]
= (3)(−4)
= −12
The second termisobtained by lettingn = 1:
h[1] =
1∑
f[m]g[1 −m]
m=0
= f[0]g[1] + f[1]g[0]
= (3)(8) + (9)(−4)
=24−36
= −12
Subsequent terms are calculated in a similar manner. You should obtain these for
yourself toensure thatyou understand the process.The complete sequence is
h[n] = f ∗g= −12,−12,79,65,2,−5
(b) The graphical interpretation is developed along the same lines as was done for the
continuous convolution inSection 24.8.
The sequences f[m]andg[m] areshown inFigure 24.24.
The sequence g[−m] is found by reflection in the vertical axis, that is folding as
shown in Figure 24.25(a). Then the folded graph can be translatednunits to the left or
therightbychangingtheargumentofgfromg[m]tog[n −m].Ifnispositivethegraph
moves to the right. Study Figures 24.25(b--g) to observe this. Convolution is the sum
of products of f[m] with g[n −m]. The graph of f[m] is superimposed. We are only
interested in values ofnfor which the graphs overlap -- otherwise each product is zero.
For each value ofnthe superimposed graphs make it easy to see which values must be
24.15 Discrete convolution and correlation 803
f [m]
g[m]
10
8
6
4
2
–2
–4
1 2
m
10
8
6
4
2
–2
–4
Figure24.24
Thesequences f[m] = 3,9,2, −1 andg[m] = −4,8,5.
1 2
m
multipliedtogetherandadded.Ifrequired,atablecanbeconstructedwhichsummarizes
all the necessary information as shown below.
m −2 −1 0 1 2 3 4 5
f[m] -- -- 3 9 2 −1 -- --
n=0 g[−m] 5 8 −4 -- -- -- -- --
n=1 g[1−m] -- 5 8 −4 -- -- -- --
n=2 g[2−m] -- -- 5 8 −4 -- -- --
n=3 g[3−m] -- -- -- 5 8 −4 -- --
n=4 g[4−m] -- -- -- -- 5 8 −4 --
n=5 g[5−m] -- -- -- -- -- 5 8 −4
EXERCISES24.15.1
1 Given f[n] = 1,2,2,1 andg[n] = 2,1,1,2, findthe
linearconvolution f ∗g.
2 (a) Thelinearconvolution ofthe sequences
3,9,2, −1 and −4,8,5 was obtainedin
Example24.27. Show thatthisconvolution is
equivalent to multiplying the two polynomials
3+9x+2x 2 −x 3 and−4+8x+5x 2 .
(b) Byusingpolynomial multiplicationfind the
linearconvolution ofthe sequences f[n] = 9, −8
andg[n] = 1,2, −4.
(c) Use the polynomial methodto findthe linear
convolution of f[n] = 9, −1,3 and
g[n] = 7,2, −4.
3 Find the linearconvolution of f[n] = 1, −1,1,3 and
g[n] = 7,2,0,1.
4 Prove from the definition thatlinearconvolution is
commutative, that is f ∗g=g ∗ f.
Solutions
1 2,5,7,8,7,5,2
3 7, −5,5,24,5,1,3
2 (b) 9,10, −52,32 (c) 63,11, −17,10, −12
24.15.2 Circularconvolution
InthissectionweconsiderperiodicsequenceshavingperiodN.Wecanselectoneperiod
forexaminationbylookingattheterms f[0], f[1], f[2],..., f[N−1],say.Forexample,
the sequence
f[n]=...−7,11,2,−7,11,2,−7,11,2...
804 Chapter 24 The Fourier transform
(a)
(b)
(c)
(d)
(e)
g[ – m]
10
6 8
4
2
– 4
g[ – m]
10
6 8
4
2
– 4
g[ 1– m]
10
6 8
4
2
– 4
g[ 2 – m]
10
6 8
4
2
– 4
g[ 3 – m]
10
6 8
4
2
– 4
1 2
1 2
1 2
1 2
1 2
m
m
m
m
m
sum of products:
(3)( – 4) = –12
sum of products:
(3)(8) + (9)( – 4) = –12
sum of products:
(3)(5) + (9)(8) + (2)( – 4) = 79
sum of products:
(9)(5) + (2)(8) + ( – 1)( – 4) = 65
(f)
(g)
g[ 4 – m]
10
6 8
4
2
– 4
g[ 5 – m]
10
6 8
4
2
– 4
1 2
1 2
m
m
sum of products:
(2)(5) + (– 1)(8) = 2
sum of products:
(– 1)(5) = –5
Figure24.25
The effect oftranslatingg[−m] bynunits,forn = 0,1, . . . ,5.
24.15 Discrete convolution and correlation 805
f [0]= f [3]
–7
f [0]
–7
11 2
f [1]= f [4] f [2]= f [5]
(a)
11 2
f [–2] f [–1]
(b)
Figure24.26
Aperiodic sequence can be visualized
bylisting its termsaround acircle.
isaperiodicsequencewithperiodN = 3.Wecanselecttheterms f[0], f[1], f[2],thatis
terms −7,11,2, and use these tostudythe entire sequence.
Suchasequencecanberepresentedgraphicallybylistingitstermsaroundacircleas
shown in Figure 24.26. Doing this allows us to calculate further terms in the sequence
as we require them. Rotation anticlockwise represents increasingnas in (a). Rotation
clockwise represents decreasingnas in(b).
Suppose f[n] andg[n] are two periodic sequences having periodN. Their circular
convolution,orperiodicconvolution,isthesequenceh[n],whichwedenoteby f ○∗ g
and which isdefined as follows:
The circular convolution of two periodic sequences each of periodN isdefined as
h[n]=f○∗g=
∑N−1
m=0
f[m]g[n−m] forn=0,1,2,...,N−1
The sequence is periodic with period N and so we can state it for n = 0,1,2,...,
N−1.
Example24.27 (a) Calculate the circular convolution,h[n] = f ○∗ g, of the two periodic sequences
f[n] = 9,−1,3 andg[n] = 7,2,−4.
(b) Develop a graphical representation of thisprocess.
Solution (a) The sequenceg[n] isdepicted inFigure 24.27.
We use the formulagiven above. InthisExample,N = 3.First letn = 0.
g[0]
2∑
h[0] = f[m]g[0 −m]
7
2 – 4
g[1] g[2]
g[–2]
g[–1]
Figure24.27
Theperiodic sequence
g[n] = 7,2, −4.
m=0
= f[0]g[0] + f[1]g[−1] + f[2]g[−2]
= (9)(7) + (−1)(−4) + (3)(2)
= 73
Next letn = 1.
2∑
h[1] = f[m]g[1 −m]
m=0
= f[0]g[1] + f[1]g[0] + f[2]g[−1]
= (9)(2) + (−1)(7) + (3)(−4)
= −1
806 Chapter 24 The Fourier transform
n=0 n=1 n=2
7
2
–4
9
9
9
–4
– 1 3
2
7
– 1 3
–4
2
– 1 3
7
Sums of products:
(9)(7) + ( –1)(– 4) + (3)(2) = 73
(9)(2) + ( –1)(7) + (3)( – 4) = –1
(9)( – 4) + ( –1)(2) + (3)(7) = –17
Figure24.28
The sequences f[m]drawn around the inner circle,andg[n −m] drawn around the outer circle,
forn=0,1,2.
Finally, letn = 2.
h[2] =
2∑
f[m]g[2 −m]
m=0
= f[0]g[2] + f[1]g[1] + f[2]g[0]
= (9)(−4) + (−1)(2) + (3)(7)
= −17
So the circular convolution f ○∗ g = 73,−1,−17.
If required, a table can be constructed which summarizes all the necessary informationaswasshowninExample24.26.Thesequencesmustbeextendedtoshow
their periodicity, and this time we are only interested in generating the convolution
sequence over one period, namely thatsection of the table form = 0,1,2.
m −2 −1 0 1 2 3 4 5
f[m] −1 3 9 −1 3 9 −1 3
n=0 g[−m] −4 2 7 −4 2 7 −4 2
n=1 g[1−m] 7 −4 2 7 −4 2 7 −4
n=2 g[2−m] 2 7 −4 2 7 −4 2 7
(b) A graphical representation can be developed by listing the fixed sequence f[m],
for m = 0,1,2, anticlockwise around an inner circle as shown in Figure 24.28.
We list g[−m] around an outer circle but do so clockwise to take account of the
folding.Byrotatingtheoutercircleanticlockwiseweobtaing[1−m]andg[2−m].
Bymultiplyingneighbouringtermsandaddingweobtaintherequiredconvolution.
The resultis73,−1,−17 as obtained inpart(a).
In the following section you will see how circular convolution can be performed using
the d.f.t.
24.15 Discrete convolution and correlation 807
EXERCISES24.15.2
1 Calculate the circular convolution of f[n] = 1, −1,1,3 andg[n] = 7,2,0,1.
Solutions
1 12, −4,8,24
24.15.3 The(circular)convolutiontheorem
Suppose f[n]andg[n]areperiodicsequencesofperiodN.Supposefurtherthatthed.f.t.s
of f[n] andg[n] forn = 0,1,2,...,N −1 are calculated and are denoted byF[k] and
G[k]. The convolution theorem states that the d.f.t. of the circular convolution of f[n]
andg[n] isequal tothe product of the d.f.t.sof f[n]andg[n].
The convolution theorem:
D{f ○∗ g} =F[k]G[k]
This is important because it provides a technique for finding a circular convolution. It
follows fromthe theorem that
f ○∗ g = D −1 {F[k]G[k]}
So,tofind the circular convolution of f[n] andg[n] weproceed as follows:
(1) Find the corresponding d.f.t.s,F[k] andG[k].
(2) Multiply these together toobtainF[k]G[k].
(3) Find the inverse d.f.t. togive f ○∗ g.
Whilst this procedure may seem complicated, it is nevertheless an efficient way of calculating
a convolution.
Example24.28 Usethe convolution theorem tofind f ○∗ gwhen f[n] = 5,4 andg[n] = −1,3.
Solution Firstwefind the corresponding d.f.t.s,F[k] andG[k]. Using
F[k] =
∑N−1
n=0
withN = 2 gives
F[0] =
f[n]e −2jnkπ/N
1∑
f[n] =9, F[1] =
n=0
1∑
f[n]e −jnπ = 5 +4e −jπ = 1
n=0
808 Chapter 24 The Fourier transform
and similarly,
and so
G[0] =
1∑
g[n] = 2, G[1] =
n=0
F[k] =9,1
G[k] =2,−4
1∑
g[n]e −jnπ = −1 +3e −jπ = −4
These transformsaremultiplied together, termby term, togive
n=0
H[k] =F[k]G[k] = (9)(2), (1)(−4) = 18,−4
Finally the inverse d.f.t. of the sequence 18,−4, isfound using
togive
and so
h[n] = 1 N
∑
N−1
H[k]e 2jnkπ/N
k=0
(f ○∗ g)[0] = 1 2 (18−4)=7
(f ○∗ g)[1] = 1 2 (18) + 1 2 (−4ejπ ) = 11
f○∗g=7,11
TheconvolutioncouldalsobeevaluateddirectlyusingthetechniqueofSection24.15.2.
You should trythistoconfirm the resultobtained usingthe theorem.
Example24.29 Usetheconvolutiontheoremandacomputerpackagewhichcalculatesd.f.t.stofindthe
circularconvolution ofthe sequences f[n] = 1,2,−1,7 andg[n] = −1,3,2,−5.
Solution You will need access to a computer package such as MATLAB ® to work through this
example.Thed.f.t.of f[n]canbecalculatedeitherdirectly,whichislaborious,orusing
the MATLAB ® commandfft().
F=fft([1 2 -1 7])
ans =
9.0000 2.0000+ 5.0000i -9.0000 2.0000- 5.0000i
HenceF[k] = 9,2 +5j,−9,2 −5j.
Similarly,
G=fft([-1 3 2 -5])
ans =
-1.0000 -3.0000- 8.0000i 3.0000 -3.0000+ 8.0000i
HenceG[k] = −1,−3 −8j,3,−3 +8j. Then, the product of these d.f.t.s is calculated
by multiplyingcorrespondingtermstogether:
F[k]G[k] = −9,34 −31j,−27,34 +31j
24.15 Discrete convolution and correlation 809
InMATLAB ® ,
product = F.*G
product =
-9.0000 34.0000-31.0000i -27.0000 34.0000+31.0000i
Finally, taking the inverse d.f.t.,using the MATLAB ® commandifft(), gives
ifft([-9 34-31i -27 34+31i])
ans =
8 20 -26 -11
This isthe circular convolution of f[n]andg[n], thatis
f[n] ○∗ g[n] = 8,20,−26,−11
This example has illustrated how circular convolution can be achieved through the use
of the d.f.t.
Theconvolutiontheoremappliestocircularconvolutionbutnotlinearconvolution.However,bymodifyingtheprocedureslightly,thelinearconvolutionoftwofinitesequences
can alsobe found.
If f[n] is a finite sequence of lengthN 1
andg[n] is a finite sequence of lengthN 2
we
know from Section 24.15.1 that their linear convolution is a sequence of length N 1
+
N 2
−1.
Firstweextendboththesequences f[n]andg[n]tomakeeachhavelengthN 1
+N 2
−1.
This extension is done by adding zeros. This process is known as ‘padding’ with zeros.
Then the d.f.t.s of the padded sequences are calculated to give F[k] and G[k]. It can
be shown that the linear convolution of the original sequences is equal to the circular
convolution of the padded sequences. Hence the linear convolution f ∗gis then found
from
f ∗g = D −1 {F[k]G[k]}
Consider the following example.
Example24.30 If f[n] is the finite sequence 7,−1 andg[n] is the finite sequence 4,2,−7 use circular
convolution with padded zeros toobtain the linear convolution f ∗g.
Solution Their linear convolution isasequence of length 2 +3−1 = 4.
We pad f andgtogive sequences of length 4.
f[n] = 7,−1,0,0 andg[n] = 4,2,−7,0
Then, either by direct calculation or by using a computer package you can verify that
F[k]=6,7+j,8,7−j and G[k]=−1,11−2j,−5,11+2j
810 Chapter 24 The Fourier transform
Then
F[k]G[k] = −6,79 −3j,−40,79 +3j
Then taking the inverse d.f.t. gives
D −1 {F[k]G[k]} = 28,10,−51,7
whichistherequiredlinearconvolution.Youshouldverifythisbycalculatingthelinear
convolution directly.
Engineeringapplication24.4
Convolutionreverb
Convolutioncanbeusedonanaudiosignaltosimulatetheechoingorreverberation
of a real room or space. An application of such a method can be found in recording
studioswhereitmaybedesirabletosimulatetheacousticsofalargeroomsuchasa
concert hall on a recording.
The first stage involves capturing the impulse response of the room to be simulated.Animpulsivesoundsourcesuchasapistolorasmallexplosioncanbeusedto
exciteabroadrangeoffrequencies.Itcanbeshownthatashort-durationimpulsehas
a very broad spectrum. A popular alternative is to use a sinusoidal signal source of
time-varyingfrequency.Inthelattercasetheoutputoftheconvolutionprocess,h[n],
is known because the spectrum can be measured directly. The second stage consists
ofapplyingtheinverseprocess,termeddeconvolution,tothisdatainordertoobtain
the impulse response of the room.
When the impulse response has been found, whichever method has been used, it
becomes a case of convolving the impulse response of the room or space with the
signal. It is necessary to carry out a linear convolution process to obtain the desired
effect.Wedonotwishtousecircularconvolutionbecausethesignalisnotperiodic.
However, we may wishtomake use of the circular convolution theorem
f ○∗ g =D −1 {D{f[n]} · D{g[n]}}
Recall, however, that this theorem relates only to circular convolution, not to linear
convolution.Wemaystillmakeuseoftheconvolutiontheoremifwepadthesignals
withzerovaluesinordertopreventunwantedoverlap.Thisisbestillustratedbyuse
of anexample.
Theproblemofcalculatingthecircularconvolutionh[n] = f ○∗ gforthesignals
f[n] = 9, −1,3andg[n] = 7,2, −4hasalreadybeenexplained(seeExample24.27).
Theresultofh[n] = 73,−1,−17couldhavebeenobtainedeithergraphicallyorusingtheconvolution
theorem.Thelinearconvolution ish[n] = 63,11, −17,10, −12
and can be found using the direct method, and this result is presented here for reference
(see Exercises 24.15.1, Question 2(c)).
To use the convolution theorem it is first necessary to take the original signals
and pad them with zero values so that the overall length of each is equal to the sum
24.15 Discrete convolution and correlation 811
of the two − 1. So in the simple example, both signals start with length 3, and after
padding they musthave a length of 3 +3−1 = 5.The padded signals are
f ′ [n] =9,−1,3,0,0
g ′ [n] = 7, 2, −4, 0, 0
The calculation of the first term in the output signal is shown graphically in Figure
24.29.
7
n = 0
0
–1
9
0
2
0
3 0
–4
Figure24.29
The padded signals f ′ [m] (inner circle)andg ′ [m] (outer
circle).
The first element ofh[n] is (9)(7) = 63. Notice how the padding is sufficient to
ensurethatallofthevaluesinthelinearconvolutioncanbeproducedwithoutoverlap.
In this position all of the other terms are multiplied by 0 and hence can be removed
from the calculation. By rotating the outer circle anticlockwise four more times the
other values can be calculated. It can be seen that this method is exactly equivalent
tocarrying out the linear convolution.
Hence wecan usethe convolution theoremon thepadded signals f ′ [n],g ′ [n]and
f ○∗ g= D −1 {D{f ′ [n]} · D{g ′ [n]}}
Thismethodissometimespreferredbecausethed.f.t.valuesofthesignalsarereadily
and quickly calculated on a computer.
The signals may be padded with additional zeros if desired with no effect on the
result. In order to optimize the computation speed for the d.f.t. it is often desirable
to use the f.f.t. algorithm mentioned earlier. One restriction of the f.f.t. is that the
numberofsamplesintheinputsignalmustbeapowerof2,soithastocontain2,4,
8,16,32,... samples.Sincepaddingispossiblewithoutaffectingtheresultthenthe
f.f.t.can be used.
As an example we consider adding reverb to a signalg[n], which is a 30 second
duration piece of music played on an electric guitar. The signalg[n] is very ‘clean’,
obtained bypluggingtheguitardirectlyintotherecording equipment. Asaresult,it
contains no reverberant room effects that would have been present if a microphone
andamplifierhadbeenusedinstead.Animpulseresponse f[n]isobtainedbybursting
a balloon in an environment to be simulated. The two signals prior to introducing
padding areshown inFigure 24.30.
The two padded Fourier transformed signals are multiplied together and the inversed.f.t.takentoproducethesignalinFigure24.31.Thesignalh[n]hasaduration
of 32 s due to the effects of linear convolution. When the signal is played through
speakersithasadistinctiveechoorreverbeffectthatisnotpresenting[n].Thereare
visible differences inthe signalifitis examined carefully.
812 Chapter 24 The Fourier transform
f [n]
Scale:
1s
g[n]
Figure24.30
Two audio signals:a2simpulseresponse ofan environment to be simulated, f[n], anda30s
music trackto be processed,g[n]. Bothsignals are sampled atarate of44100 samples per
second.
Figure24.31
The output signalh[n] = f ○∗ gincluding reverb.
EXERCISES24.15.3
1 Thecircular convolution of f[n] = 1, −1,1,3 and
g[n] = 7,2,0,1 wascalculated in Question 1 in
Exercises24.15.2. Verify the convolution theorem for
these sequences.
2 Usecircular convolution and padding with zerosto
obtain the linear convolution of f[n] = 9,0,1 and
g[n] = 5,4,5,2,1.Further, verifythe convolution
theorem forthese sequences.
3 Provethe circular convolution theorem.
Solutions
1 F[k] =4,4j,0,−4j.G[k] =10,7−j,4,7+j 2 45,36,50,22,14,2,1
24.15.4 Linearcross-correlation
The linear cross-correlation of two real sequences f[n] andg[n] is another sequence,
c[n] say, which wedenote by f ⋆g, which isdefined asfollows:
Linear cross-correlation of f[n]andg[n]:
∞∑
c[n]=f⋆g= f[m]g[m−n] forn=...−3,−2,−1,0,1,2,3...
m=−∞
24.15 Discrete convolution and correlation 813
Note the similarity between this definition and that of linear convolution defined in
Section 24.15.1. In the formula for cross-correlation the sequence g is not folded. If
the two sequences are finite and of length N, that is f[n] and g[n] are non-zero only
when 0 nN−1, there are 2N −1 terms in the cross-correlation sequence and the
formulacan be written as follows:
Linear cross-correlation of two finite sequences f[n] andg[n]:
and
c[n]=f⋆g=
c[n]=f⋆g=
∑N−1
m=n
N+n−1
∑
m=0
f[m]g[m−n]
f[m]g[m−n]
forn=0,1,2,...,N−1
forn=0,−1,−2,...,−(N−1)
Example24.31 Suppose f[n] = 7,2,−3 andg[n] = 1,9,−1. Assume both sequences f andgstart at
n=0.
(a) Find the linear cross-correlationc[n] = f ⋆gusing the formulae above.
(b) Develop a graphical interpretation of thisprocess.
Solution (a) Both f and g are finite sequences of length N = 3. Their cross-correlation is a
sequencec[n], forn = −2,−1,0,1,2, of length 5.
Usingthe formulae above withn = −2 gives
c[−2] =
0∑
f[m]g[m +2]
m=0
= f[0]g[2]
= (7)(−1)
= −7
Whenn = −1 we have
c[−1] =
1∑
f[m]g[m +1]
m=0
= f[0]g[1] + f[1]g[2]
= (7)(9) + (2)(−1)
= 61
The remaining terms are calculated in a similar fashion. You should calculate one
or two terms yourself toverifythat the fullsequence is
c[n] = −7,61,28,−25,−3
n = −2,−1,0,1,2
(b) The graphical interpretation is developed along the same lines as for linear convolution
in Example 24.26. Figure 24.32 shows the sequence f[m], form = 0,1,2,
denoted by the symbols ◦. Also shown isthe sequenceg[m] denoted by •.
814 Chapter 24 The Fourier transform
(a)
(b)
(c)
(d)
(e)
g[m ]
g[m – 1]
g[m – 2]
g[m + 1]
g[m + 2]
10
6 8
4
2
– 4
10
6 8
4
2
– 4
10
6 8
4
2
– 4
10
6 8
4
2
– 4
10
6 8
4
2
– 4
m
m
m
m
m
Figure24.32
The effect oftranslatingg[m].
n = 0
sum of products:
(7)(1) + (2)(9) + (–3)(–1) = 7 + 18 + 3 = 28
n = 1
sum of products:
(2)(1) + (–3)(9) = 2 – 27 = –25
n = 2
sum of products:
(–3)(1) = –3
n=–1
sum of products:
(7)(9) + (2)( –1) = 63 – 2 = 61
n= –2
sum of products:
(7)( –1) = –7
The graph ofg[m] can be translatednunits to the left or right by changing the
argument ofgfromg[m] tog[m −n]. Ifnis positive the graph moves to the right.
Study the figure to observe this. Correlation is the sum of products of f[m] and
g[m − n]. For each value of n, the graph of f[m] is superimposed. We are only
interested in values ofnfor which the graphs overlap -- otherwise each product is
zero.Foreachvalueofnthesuperimposedgraphsmakeiteasytoseewhichvalues
must be multiplied together and added. Placing these results in order confirms the
resultobtained inpart(a), thatisthe correlation is −7,61,28,−25,−3.
Whenasequenceiscross-correlatedwithitselftheprocessisknownasautocorrelation.
Autocorrelationisusedtosearchforpossibleperiodicitiesinsignals,becauseifasignal
isperiodicwithperiodN itsautocorrelationsequencewillshowpeaksatintervalsofN.
EXERCISES24.15.4
1 Find the linearcross-correlationofthe sequences
f[n] = 4,5,9 andg[n] = 3,1,1.
2 Show thatthe linearcross-correlationof f[n] andg[n]
can be written in the alternative form
f⋆g= ∑ ∞
m=−∞ f[m +n]g[m].
24.15 Discrete convolution and correlation 815
3 There are variants ofthe definition ofcorrelation.
Showthat if f ⋆gisredefined to be
∑ ∞m=−∞
f[m −n]g[m] then the corresponding
correlation ofthe sequences in Question1is
27,24,26,9,4.
4 Find the linear autocorrelation ofthe sequence
f[n]=3,2,1.
Solutions
1 4,9,26,24,27 4 3,8,14,8,3
24.15.5 Circularcross-correlation
The circular cross-correlation of two periodic sequences of period N is defined in a
similarmanner totheir circular convolution. Itisasequencec[n] oflengthN.
The circular cross-correlation of two periodic sequences, f[n] and g[n], each of
periodN, isdefined as
c[n]=f○⋆ g =
∑N−1
m=0
f[m]g[m−n]
forn=0,1,2,...,N−1
Whenasequenceiscross-correlatedwithitselftheprocessisknownasautocorrelation.
Example24.32 (a) Find the circular autocorrelation of the sequence f[n] = 3,2,1 using the formula.
(b) Develop a graphical method forperforming thiscalculation.
Solution (a) HereN = 3.From the definition
2∑
c[n]=f○⋆ f = f[m]f[m−n]
m=0
forn=0,1,2
2∑
c[0] = f[m]f[m]
m=0
= (3)(3) + (2)(2) + (1)(1)
= 14
2∑
c[1] = f[m]f[m−1]
m=0
= f[0]f[−1]+f[1]f[0]+f[2]f[1]
= (3)(1) + (2)(3) + (1)(2)
= 11
816 Chapter 24 The Fourier transform
n=0 n=1 n=2
3
1
2
3
3
3
2
2 1
1
3
2 1
2
1
2 1
3
Sums of products:
(3)(3) + (2)(2) + (1)(1) = 14 (3)(1) + (2)(3) + (1)(2) = 11 (3)(2) + (2)(1) + (1)(3) = 11
Figure24.33
Graphical calculation ofthe autocorrelation of f[n] = 3,2,1.
c[2] =
2∑
f[m]f[m−2]
m=0
= (3)(2) + (2)(1) + (1)(3)
= 11
Hencec[n] = 14,11,11.
(b) The graphical method involves listing the sequence f[m], m = 0,1,2, around an
inner circle. Around an outer circle we listitagain. This method is identical to that
used in Example 24.27 for circular convolution, but because now there is no folding,
the sequence on the outer circle is not reversed. The calculation can be seen in
Figure 24.33.
24.15.6 (Circular)correlationtheorem
For real sequences f[n]andg[n] the correlation theorem states:
D{f ○⋆ g} =F[k]G[k]
whereG[k] denotes the complex conjugate ofG[k].
This provides a technique forcalculating a correlation usingthe d.f.t.
Example24.33 Findthe circularcorrelation f ○⋆ gwhen f[n] = 8,−9,3,2 andg[n] = 11,4,−1,−5.
Solution Either directly from the definition of the d.f.t., or by using a computer package, we can
show that
F[k] = 4,5 +11j,18,5 −11j G[k] = 9,12 −9j,11,12 +9j
Theconjugate ofG[k] isG[k] = 9,12 +9j,11,12 −9j. Then
F[k]G[k] = 36,−39 +177j,198,−39 −177j
24.15 Discrete convolution and correlation 817
Eitherdirectly,orusingacomputerpackage,takingtheinversed.f.t.yieldsthesequence
39,−129,78,48
which is the required circular correlation. You may like to verify this by directly calculating
f ○⋆ g.
Itfollowsfromthetheoremthatthecircularautocorrelationsequencehasthefollowing
property:
D{f ○⋆ f}=F[k]F[k]
Example24.34 Usethe d.f.t. tofind the circularautocorrelationofthe sequence f[n] = 8,3,−1,2.
Solution Itisstraightforward buttedioustoshow thatthe d.f.t.of f[n] is
F[k]=12,9−j,2,9+j
Then,the conjugate ofF[k] isF[k]=12,9+j,2,9−j,and
F[k]F[k] = 144,82,4,82
Finally,takingtheinversed.f.t.givesthesequence78,35,−4,35whichyoucanverify
isthe circular autocorrelation of f[n] = 8,3,−1,2.
Engineeringapplication24.5
Useofcorrelationinradar
Both sonar and radar operate by transmitting a signal which bounces off a distant
target and returns to a receiver. The main difference is that sonar uses sound waves
whereasradarusesradiowaves.Thetimetakenforthesignaltoreturntothereceiver
having reached the target can be used to deduce,d, the distance between the transmitter
and the target. In both radar and sonar, the received signal is typically very
much smaller than the transmitted signal and can contain a lot of unwanted noise.
High-gain electronic amplifiers may be required to make the signal large enough to
be processed and these introduce even more noise. In addition the receiver experiences
interference from other sources which has to be separated from the returning
signal. Correlation techniques are very useful for retrieving a signal from the noise
and the interference.
We will consider a radar system. The signals transmitted are usually modulated
usingahigh-frequencycarriersignal(seeEngineeringapplication24.1).Forourpurposesthe
presence ofacarrier signalisnotimportant.
However, the high-frequency component of the signal can be removed
electronically prior to the signals being sampled by a process called
demodulation.
➔
818 Chapter 24 The Fourier transform
The received and demodulated signal is represented by f[n] and the transmitted
signal prior to modulation is represented byg[n]. Consider the case whereg[n] is a
square pulse comprising 10 samples. Let the time between each sample beT. Both
signalsareshownplottedinFigure24.34.Notethatthisisanidealizedcaseinwhich
thereis no noise on the received signal.
1.0
0.8
0.6
g[n]
0.4
f [n]
0.2
20 40 60 80 100 120 140
Figure24.34
Plotoftransmitted and received digital signals foraradar system.
n
The returned signal is delayed due to the time taken for it to travel to a distant
target and back. Examining Figure 24.34 we note that the returned signal is delayed
by 65 samples when compared with the transmitted signal. This equates to a time
delayof65T.Weknowthespeedthatthetransmittedpulsetravelsisc,thespeedof
light.Using
distance =time × velocity
gives total distance travelled = 65T ×c. But the total distance travelled is twice the
distance between the transmitter and the target,thatis2d. Henced = 65Tc
2 .
We now confirm the number of samples delay by calculating the circular crosscorrelation
of f[n] andg[n]. We wish to calculatec[n] = f ○∗ gand plot this on a
graphforanalysis.Thiscanbedoneeitherdirectlybycalculatingthecircularcrosscorrelation
or by using the circular correlation theorem. Owing to the large number
ofcalculationsinvolvedtheyarenotpresentedhere.Normallysuchaprocesswould
be carriedout usingacomputer.
TheresultisasequenceofsampleswhichareplottedinFigure24.35.Thelargest
value of c[n] occurs at 65, which corresponds to the delay in samples between the
two signals. It agrees with our initial observation where the difference was quite
straightforward tosee by inspecting the two signals.
24.15 Discrete convolution and correlation 819
c[n]
2.0
1.5
1.0
0.5
20 40 60 80 100 120 140
Figure24.35
Cross-correlationfunctionc[n] forthe signals f[n] andg[n].
n
We now consider a more realistic case in which there is random noise on the returnedsignalandcarryoutthesamecalculationthatwedidpreviously.Figure24.36
shows the two signals. Although the return signal is still present it is no longer possible
to accurately determine where it actually starts and finishes by just looking at
the graph.
1.0
0.8
0.6
g[n]
0.4
f [n]
0.2
20 40 60 80 100 120 140
Figure24.36
Plotoftransmittedandreceiveddigitalsignalsforaradarsysteminwhichnoiseispresenton
the returnedsignal.
n
➔
820 Chapter 24 The Fourier transform
Carryingoutthecircularcross-correlationonthesesignalsgivestheresultshown
in Figure 24.37. The highest peak still occurs at 65 time intervals, allowing the distance
of the remoteobject tobe accurately calculated.
c[n]
2.0
1.5
1.0
0.5
20 40 60 80 100 120 140 n
Figure24.37
Cross-correlationfunctionc[n] forthe signals f[n]andg[n] forthe case where f[n]contains
additive noise.
Thecross-correlationfunctionallowsthedistanceoftheremoteobjecttobecalculatedevenwhenthereisalargequantityofnoiseonthereturnedsignal.Itistherefore
a very useful signal processing operation for radar systems. Similar considerations
apply when working with sonar signals.
EXERCISES24.15.6
1 Find the circular cross-correlationof f[n] = 2,3, −1
andg[n] = 8,7,1. Verify the correlation theorem.
2 Fromthe definition,prove the correlationtheorem.
3 There are variants ofthe definition ofcorrelation.
∑
Show that if f ○⋆ gis redefined to be
N−1
m=0 f[m −n]g[m] then the corresponding
correlation theorem states D{f ○⋆g}=F[k]G[k].
Solutions
1 36,19, 9
Review exercises 24 821
REVIEWEXERCISES24
1 Find the Fourier transformsof
{
1−t 2 |t|<1
(a)f(t) =
0 otherwise
{ sint |t|<π
(b)f(t) =
0 otherwise
{ 1 0<t<τ
(c)f(t) =
0 otherwise
{ e −αt t>0
(d)f(t) =
−e αt α >0
t<0
2 Find the Fourier integral representations of
{ 3t |t|<2
(a) f(t) =
0 otherwise
⎧
⎨0 t<0
(b) f(t) = 6 0<t<2
⎩
0 t>2
3 (a) IfF(ω) = F{f(t)},showthat
F{f ′ (t)} = jωF(ω).
(b) IfF(ω) = F{f(t)},showthat
F{f (n) (t)} = (jω) n F(ω).Theseresultsenableus
to calculate the Fourier transformsofderivatives
offunctions.
4 IfF(ω) = F{f(t)},showthat
( )
F{f(at)} = 1 a F ω
.
a
5 IfF(ω)is the Fourier transform of f (t)show that
(a) F(0) = ∫ ∞
−∞ f(t)dt.
(b) f(0) = 1 ∫ ∞
F(ω)dω.
2π −∞
(c) Show that if f (t)isan even function,
F(ω) =2 ∫ ∞
0 f(t)cosωtdt.
6 Theconvolution theoremgiven in Section24.8.1
representsconvolution in the time domain.
Convolution can alsobe performedin the frequency
domain,in whichcasethe equivalent convolution
theoremis
F{f(t)g(t)} = 1
2π [F(ω)∗G(ω)]
Provethe convolution theorem in thisform.
7 (a) Given that the Fourier transform of f (t) = e −α|t|
2α
is
α 2 use thet--ω duality principleto find
+ω2 the Fourier transformof
1
α 2 +t 2.
(b) Fromatable oftransformswrite down the
Fourier transformofcosbt.
(c) Use the convolution theoremobtained in
Question 6 to find the Fourier transformof
cosbt
α 2 +t 2,forb>0.
(d) Using the result in part(c)evaluate the integral
∫ ∞ cosbt
−∞ α 2 +t 2dt
8 Find the d.f.t. ofthe sequence f[n] = {3,3,0,3}.
Verify Rayleigh’stheoremforthissequence.
9 Findthelinearconvolutionofthetwofinitesequences
f[n] = 3, −1, −7 andg[n] = 4,0, 1 2 .
10 The signum functionis defined to be
⎧
⎨ 1 t>0
sgn(t) = −1 t<0
⎩
0 t=0
This functioncan be representedasthe exponential
functione −ǫt ,ift > 0, andas −e ǫt ,ift < 0,in the
limitas ǫ → 0.
(a) Show that
∫ ∞
sgn(t)e −jωt dt = 2
−∞ jω
(b) Usethet--ω duality principle to showthat
∫ ∞
−∞
1
πt e−jωt dt = −jsgn(ω)
(c) Usethe second resultin part (b), the convolution
theorem and the integral propertiesofthe delta
function to showthat
1
∗cos(αt) = sin(αt)
πt
11 Show that f[n] ○⋆ g[n] = f[n] ○∗ g[−n] andhence
deduce thatacorrelationcanbe expressed in termsof
a convolution.
822 Chapter 24 The Fourier transform
Solutions
1 (a)
4(sinω −ωcosω)
ω 3
(b) − 2jsinωπ
1−ω 2
sinωτ j(cosωτ −1)
(c) +
ω ω
(d) −
2jω
α 2 +ω 2
∫
1 ∞ 6
2 (a)
2π −∞ ω 2 (2jωcos2ω −jsin2ω)ejωt dω
(b)
7 (a)
(c)
(d)
∫
1 ∞ 6
2π −∞ ω [sin2ω + (cos2ω −1)j]ejωt dω
π
α e−α|ω|
π
2α [e−α|ω+b| +e −α|ω−b| ]
πe −αb
α
8 F[k] =9,3,−3,3
9 12, −4, −26.5, −0.5, −3.5
25 Functionsofseveral
variables
Contents 25.1 Introduction 823
25.2 Functionsofmorethanonevariable 823
25.3 Partialderivatives 825
25.4 Higherorderderivatives 829
25.5 Partialdifferentialequations 832
25.6 TaylorpolynomialsandTaylorseriesintwovariables 835
25.7 Maximumandminimumpointsofafunctionoftwovariables 841
Reviewexercises25 846
25.1 INTRODUCTION
Inengineeringtherearemanyfunctionswhichdependuponmorethanonevariable.For
example,thevoltageonatransmissionlinedependsuponpositionalongthetransmission
lineaswellastime.Theheight ofliquidinatankdepends upon theflowratesintoand
outof the tank. We shall examine some more examples inthis chapter.
Wehavealreadydiscusseddifferentiationoffunctionsofonevariable.However,we
alsoneedtobeabletodifferentiatefunctionsoftwoormorevariables.Thisisachieved
by allowing one variable to change at a time, holding the others fixed. Differentiation
under these conditions iscalledpartial differentiation.
25.2 FUNCTIONSOFMORETHANONEVARIABLE
InChapter10wesawhowtodifferentiateafunctiony(x)withrespecttox.Manystandard
derivatives were listed and some techniques explained. Sinceyis a function ofx
wecallythedependentvariableandxtheindependentvariable.Thefunctionydepends
upon the one variablex. Consider the following example.
The area,A, ofacircle depends only upon the radius,r, and isgiven by
A(r) = πr 2
824 Chapter 25 Functions of several variables
The rate of change of area w.r.t. the radius is dA = 2πr. In practice functions often
dr
dependuponmorethanonevariable.Forexample,thevolume,V,ofacylinderdepends
upon the radius,r, and the height,h, and isgiven by
V = πr 2 h
whereV isthedependentvariable;r andhareindependentvariables.V isafunctionof
the two independent variables,r andh. We writeV =V(r,h).
Engineeringapplication25.1
Electricalpotentialinsideacathoderaytube
In recent years the cathode ray tube has declined in popularity as a screen display
device. It is now rarely used in consumer electronics as flat screen televisions have
becomemorepopular.However,itisstillsometimesusedinthespecialistelectronics
sector and instruments such as cathode ray oscilloscopes are still commonly in use
inmany laboratories.
Withinacathoderaytubetheelectricalpotential,V,willvarywithspatialposition
and time.Given Cartesiancoordinatesx,yandz, wecan write
V =V(x,y,z,t)
toshow thisdependence. Note thatV isafunctionoffourindependentvariables.
Engineeringapplication25.2
Powerdissipatedinavariableresistor
Thepower,P,dissipatedinavariableresistordependsupontheinstantaneousvoltage
acrossthe resistor, v, and the resistance,r. Itisgiven by
P = v2
r
Hence we may writeP =P(v,r) to show this dependence. The power is a function
of two independent variables.
Asanotherexampleofafunctionofmorethanonevariableconsiderathree-dimensional
surface as shown in Figure 25.1. The height,z, of the surface above thex--y plane depends
upon thexandycoordinates, that isz = z(x,y). If we are given values ofxand
y, then z(x,y) can be evaluated. This value of z is the height of the surface above the
point (x,y). We write, for example,z(3,−1) for the value ofzevaluated whenx = 3
and y = −1. The dependent variable, z, is a function of the independent variables
x andy.
Some important features are shown in Figure 25.1. The value of z at a maximum
point is greater than the values of z at nearby points. Point A is such a point. As you
25.3 Partial derivatives 825
D
C
A
z = z(x,y)
x
z
y
E
B
Figure25.1
Theheight ofthe surface
above thex--yplane isz.
move away from A the value of z decreases. A minimum point is similarly defined.
At a minimum point, the value ofzis smaller than thezvalue at nearby points. This is
illustratedbypointB.PointCillustratesasaddlepoint.Atasaddlepoint,zincreasesin
one direction, axis D on the figure, but decreases in the direction of axis E. These axes
are at right angles to each other. The term ‘saddle’ is descriptive as a horse saddle has
a similar shape. Maximum points, minimum points and saddle points are considered in
greater depth inSection 25.7.
25.3 PARTIALDERIVATIVES
Consider
z =z(x,y)
thatis,zisafunctionoftheindependent variablesxandy.Wecandifferentiatezeither
w.r.t.x,orw.r.t.y.Weneedsymbolstodistinguishbetweenthesetwocases.Whenfinding
the derivative w.r.t. x, the other independent variable, y, is held constant and only
x changes. Similarly when differentiating w.r.t. y, the variable x is held constant. We
write ∂z
∂x todenotedifferentiationofzw.r.t.xforaconstanty.Itiscalledthefirstpartial
derivative of z w.r.t. x. Similarly, the first partial derivative of z w.r.t. y is denoted ∂z
∂y .
Referring to the surfacez(x,y), ∂z gives the rate of change ofzmoving only in thex
∂x
direction, and henceyisheld fixed.
Ifz =z(x,y), thenthe firstpartialderivatives ofzare
∂z
∂x
and
∂z
∂y
If we wish to evaluate a partial derivative, say ∂z
∂x , at a particular point (x 0 ,y 0
), we
indicate thisby
∂z
∂x (x 0 ,y 0 ) or ∂z
∣
∂x
∣
(x0 ,y 0
)
justas wedid forfunctions of one variable.
826 Chapter 25 Functions of several variables
Example25.1 Givenz(x,y) =x 2 y +sinx +xcosyfind ∂z ∂z
and
∂x ∂y .
Solution To find ∂z we differentiate z w.r.t. x, treating y as a constant. Note that since y is a
∂x
constant then soiscosy.
∂z
∂x =2xy+cosx+cosy
Infinding ∂z
∂y ,xand hencex2 and sinx areheld fixed, thus
∂z
∂y =x2 −xsiny
Example25.2 Givenz(x,y) = 3e x −2e y +x 2 y 3
(a) findz(1,1)
(b) find ∂z ∂z
and
∂x ∂y whenx=y=1.
Solution (a) z(1,1) = 3e 1 −2e 1 +1 = 3.718
∂z
(b)
∂x =3ex +2xy 3
∂z
∂y =−2ey +3x 2 y 2
Whenx =y=1,then
∂z
∂x =3e+2=10.155
∂z
∂y =−2e+3=−2.437
Atthepoint(1,1,3.718)onthesurfacedefinedbyz(x,y),theheightofthesurface
abovethex--yplaneisincreasinginthexdirection,anddecreasingintheydirection.
Note thatwecould alsowrite
∂z
∂z
∂x∣ = 10.155 and
(1,1)
∂y∣ = −2.437
(1,1)
or
∂z
∂x (1,1) = 10.155 and ∂z
(1,1) = −2.437
∂y
Both notations areincommon use.
25.3 Partial derivatives 827
Engineeringapplication25.3
Eddycurrentlosses
Eddycurrentsarecirculatingcurrentsthatariseinironcoresofelectricalequipment
as a resultof ana.c. magnetic field. They lead toenergy lossesgiven by
where
P e
=k e
f 2 B 2 max
P e
= eddy currentlosses (Wper unit mass),
B max
= maximum value of the magnetic field wave (T),
f = frequency of the magnetic field wave (Hz),
k e
= a constant thatdepends upon factors such as the lamination thickness of
the ironcore.
Calculate ∂P e
∂f , and ∂P e
∂B max
.
Solution
∂P e
∂f
=2k e
fB 2 max
and
∂P e
∂B max
=2k e
f 2 B max
Example25.3 IfV(x,y) = sin(xy),find ∂V
∂x
and
∂V
∂y .
Solution To find ∂V
∂x we treat y as a constant. Recalling that d
(sinkx) = kcoskx we find
dx
∂V
∂V
∂V
=ycos(xy).Tofind wetreatxas a constant. Thus
∂x ∂y ∂y =xcos(xy).
Example25.4 Findthe first partialderivatives ofzwhere
(a) z(x,y) =yxe x
(b) z(x,y) =x 2 sin(xy)
Solution (a) To find ∂z we must treatyas a constant. However, the differentiation of the factor
∂x
xe x will require use of the product rule.We find
∂z
∂x =y ∂ ∂x (xex )
=y((1)e x +xe x )
=ye x (x+1)
828 Chapter 25 Functions of several variables
To find ∂z the variablexisheld constant and so
∂y
∂z
∂y =xex
(b) Observe the product term in the variable x which means we shall need to use the
product rule.We find
∂z
∂x = 2xsin(xy) +x2 (ycos(xy)) = 2xsin(xy) +x 2 ycos(xy)
To find ∂z wetreatxas a constant and find
∂y
∂z
∂y =x2 (xcos(xy)) =x 3 cos(xy)
EXERCISES25.3
(c) y=te t +x (d) y=3x 2 e 2t +t 3 e −x
1 Find the firstpartialderivatives of
(a) y=te x (b) y=x 2 e −t h=1.
(a) z=2x+3y (b) z=x−y+x 2 (e) y = e xt (f) y = e 2x+3t
(c) z=x 2 +y 2 +3 (d) z=xy−1 5 Findthe firstpartialderivatives ofzwhere
(e)z =x 2 y+xy 2
(a)z= √ x 2 +y 2 x
(b) z=
(f)z=x 3 +2x 2 y−4xy 2 +2y 3
2x+3y
( )
x
2 Find the firstpartialderivatives of
(c) z = sin (d) z = e 3xy
(a)z= x (b)z= √ y
xy
y
(e) z = ln(2x −3y) (f) z = ln(xy)
( )
(c)z= y2
x 3 +2 (d)z= 1 y
(g) z =xln(xy) (h) z =xln
xy
x
√
(e)z= 3x2 y
y − (f)z= √ xy−3(x+y) 6 Evaluate the first partialderivatives of f atx = 1,
x
y=2.
3 Find the firstpartialderivatives of
(a) f =3x 2 y−2xy (b) f = x +y
(a) z=2sinx+3cosy (b) z=xsiny
y
(c) z=xtany−y 2 (c) f = 2sin(3x +2y) (d) f = 2e xy
sinx (d) z=xysiny
(e) f =ylnx+ln(xy) (f) f = √ 3x+y
(e) z =sin(x +y) (f) z =4cos(4x −6y)
(g)z= siny
7 Given
x
f(r,h) =2r 2 h− √ rh
4 Find the firstpartialderivatives ofy.Note thatyisa
functionofxandt.
evaluate the firstpartial derivatives of f whenr = 2,
Solutions
∂z ∂z
1 (a) = 2,
∂x ∂y = 3
(b) 1 +2x,−1
(c) 2x,2y
(d) y,x
(e) 2xy +y 2 ,x 2 +2xy
(f) 3x 2 +4xy −4y 2 ,2x 2 −8xy +6y 2
25.4 Higher order derivatives 829
∂z
2 (a)
∂x = 1 y , ∂z
∂y = − x y 2
y
(b)
2 √ x , √ x
(c) − 3y2
x 4 , 2y
x 3
(d) − 1
x 2 y ,− 1
xy 2
(e)
(f)
√
6x y
y + x 2 , −3x2 y 2 − 1
2x √ y
1
2√ y
x −3, 1 2
√ x
y −3
∂z ∂z
3 (a) =2cosx,
∂x ∂y =−3siny
(b) siny,xcosy
(c) tany −y 2 cosx,xsec 2 y −2ysinx
(d) ysiny,xsiny+xycosy
(e) cos(x +y),cos(x +y)
(f) −16sin(4x −6y),24sin(4x −6y)
(g) − siny
x 2 , cosy
x
∂y
4 (a)
∂x =tex , ∂y
∂t = ex
(b) 2xe −t ,−x 2 e −t
(c) 1,e t (1+t)
(d) 6xe 2t −t 3 e −x ,6x 2 e 2t +3t 2 e −x
(e) te xt ,xe xt
(f) 2e 2x+3t ,3e 2x+3t
5 (a)
(b)
(c)
∂z
∂x =
x ∂z
√
x 2 +y2, ∂y =
3y
(2x +3y) 2 , − 3x
(2x +3y) 2
( ( )
1
y cos x
), − x y y 2 cos x
y
(d) 3ye 3xy ,3xe 3xy
(e)
(f)
2
2x−3y ,− 3
2x−3y
1
x , 1 y
(g) 1 +ln(xy), x y
( )
y
(h) ln −1, x x y
y
√
x 2 +y 2
∂f ∂f
6 (a) (1,2) =8,
∂x ∂y (1,2)=1
(b) 0.5, −0.25
7
(c) 4.5234, 3.0156
(d) 29.5562, 14.7781
(e) 3,0.5
(f) 0.6708, 0.2236
∂f
∂f
(2,1) = 7.6464, (2,1) = 6.5858
∂r ∂h
25.4 HIGHERORDERDERIVATIVES
Just as functions of one variable have second and higher derivatives, so do functions of
several variables. Consider
z =z(x,y)
Thefirstpartialderivativesofzare ∂z ∂z
and
∂x ∂y .Thesecondpartialderivativesarefound
bydifferentiatingthefirstderivatives.Wecandifferentiatefirstpartialderivativeseither
w.r.t.xor w.r.t.ytoobtain various second partialderivatives:
differentiating ∂z
∂x w.r.t.xproduces ∂ ( ) ∂z
∂x ∂x
differentiating ∂z
∂x w.r.t.yproduces ∂ ( ) ∂z
∂y ∂x
= ∂2 z
∂x 2
= ∂2 z
∂y∂x
830 Chapter 25 Functions of several variables
differentiating ∂z
∂y w.r.t.xproduces ∂ ( ) ∂z
∂x ∂y
differentiating ∂z
∂y w.r.t.yproduces ∂ ( ) ∂z
∂y ∂y
= ∂2 z
∂x∂y
= ∂2 z
∂y 2
For mostcommon functions, the mixed derivatives ∂2 z
∂y∂x and ∂2 z
∂x∂y areequal.
Example25.5 Given
z(x,y) = 3xy 3 −2xy +sinx
find allsecondpartialderivatives ofz.
Solution
∂z
∂x =3y3 −2y+cosx
∂ 2 z
∂x = ∂ ( ) ∂z
=−sinx
2 ∂x ∂x
∂ 2 z
∂x∂y = ∂ ( ) ∂z
=9y 2 −2
∂x ∂y
Note that
∂ 2 z
∂x∂y = ∂2 z
∂y∂x .
∂z
∂y = 9xy2 −2x
∂ 2 z
∂y = ∂ ( ) ∂z
= 18xy
2 ∂y ∂y
∂ 2 z
∂y∂x = ∂ ( ) ∂z
=9y 2 −2
∂y ∂x
Example25.6 Given
H(x,t) =3x 2 +t 2 +e xt
verify that ∂2 H
∂x∂t = ∂2 H
∂t∂x .
Solution
∂H
∂x =6x+text
∂ 2 H
∂t∂x = ∂ ∂t
∂ 2 H
∂x∂t = ∂ ∂x
∂H
∂t
=2t+xe xt
( ) ∂H
= ∂ ∂x ∂t (6x+text )= e xt +txe xt
( ) ∂H
= ∂ ∂t ∂x (2t+xext )=e xt +xte xt
Third,fourthandhigherderivativesarefoundinasimilarwaytofindingsecondderivatives.
The third derivatives are found by differentiating the second derivatives and
so on.
25.4 Higher order derivatives 831
Example25.7 Find all third derivatives of f (r,s) = sin(2r) −3r 4 s 2 .
Solution
∂f
∂r = 2cos(2r) −12r3 s 2
∂ 2 f
∂r 2 = −4sin(2r) −36r2 s 2
∂ 2 f
∂r∂s = −24r3 s
∂f
∂s = −6r4 s
∂ 2 f
∂s 2 = −6r4
The third derivatives are ∂3 f
∂r , ∂ 3 f
3 ∂r 2 ∂s , ∂ 3 f
∂r∂s and ∂3 f
, and these are found by differentiating
the second derivatives.
2 ∂s3 ∂ 3 f
∂r = ∂ ( ) ∂ 2 f
= −8cos(2r) −72rs 2
3 ∂r ∂r 2
∂ 3 f
∂r 2 ∂s = ∂ ( ∂ 2 )
f
= −72r 2 s
∂r ∂r∂s
∂ 3 f
∂r∂s = ∂ ( ∂ 2 )
f
= −24r 3
2 ∂r ∂s 2
∂ 3 f
∂s = ∂ ( ∂ 2 )
f
= 0
3 ∂s ∂s 2
Note that the mixed derivatives can be calculated inavariety ofways.
∂ 3 f
∂r 2 ∂s = ∂ ( ∂ 2 )
f
= ∂ ( ∂ 2 )
f
∂r ∂r∂s ∂s ∂r 2
∂ 3 f
∂r∂s = ∂ ( ) ∂ 2 f
= ∂ ( ) ∂ 2 f
2 ∂r ∂s 2 ∂s ∂r∂s
EXERCISES25.4
1 Calculateall second derivatives of v where
v(h,r) =r 2√ h
2 Find the second partialderivatives of f given
(a) f =x 2 y+y 3 (b) f =2x 4 y 3 −3x 3 y 5
(c) f=4 √ xy 2 (d) f = x2 +1
y
(e) f= 3x3 √ y
(f) f=4 √ xy
3 Find allsecond partialderivatives of
(a) z =xe 2y
(c) z =xcos(2x +3y)
(e) z = e x siny
(g) z = e xy
(b) z = 2sin(xy)
(d) z =ysin(4xy)
(f) z = e 3x−y
4 Find allsecond partialderivatives of
(a) z = (3x −2y) 20
(c) z = sin(x 2 +y 2 )
1
(e) z=
3x−2y
(b) z = √ 2x+5y
(d) z = ln(2x +5y)
5 Find allthirdpartialderivatives ofzwhere
z(x,y) =
x2
y +1
6 Evaluate all secondpartialderivatives of f atx = 2,
y=1.
(a) f =ye x (b) f =sin(2x −y)
( )
y
(c) f=ln
x
832 Chapter 25 Functions of several variables
Solutions
1
2 (a)
∂ 2 v
∂h 2 = − r2
4h 3/2 , ∂2 v
∂h∂r = r √
h
, ∂2 v
∂r 2 = 2√ h
∂ 2 f
∂x 2 = 2y, ∂2 f
∂x∂y = 2x, ∂2 f
∂y 2 = 6y
(b) 24x 2 y 3 −18xy 5 ,
24x 3 y 2 −45x 2 y 4 ,
12x 4 y −60x 3 y 3
(c) −x −3/2 y 2 ,4x −1/2 y,8 √ x
(d) 2 y , −2x y 2 , 2(x2 +1)
y 3
(e) 18xy −1/2 , − 9 2 x2 y −3/2 , 9 4 x3 y −5/2
(f) −x −3/2 y 1/2 ,x −1/2 y −1/2 , −x 1/2 y −3/2
∂ 2 z
3 (a)
∂x 2 = 0, ∂2 z
∂x∂y =2e2y , ∂2 z
∂y 2 = 4xe2y
(b) −2y 2 sin(xy),
4 (a)
2cos(xy) −2xysin(xy),
−2x 2 sin(xy)
(c) −4sin(2x +3y) −4xcos(2x +3y),
−3sin(2x +3y) −6xcos(2x +3y),
−9xcos(2x +3y)
(d) −16y 3 sin(4xy),
8ycos(4xy) −16xy 2 sin(4xy),
8xcos(4xy) −16x 2 ysin(4xy)
(e) e x siny,e x cosy, −e x siny
(f) 9e 3x−y ,−3e 3x−y ,e 3x−y
(g) y 2 e xy ,e xy (1 +xy),x 2 e xy
∂ 2 z
∂x 2 = 3420(3x −2y)18 ,
∂ 2 z
∂x∂y = −2280(3x −2y)18 ,
5
∂ 2 z
= 1520(3x −2y)18
∂y2 (b) −(2x +5y) −3/2 ,
− 5 2 (2x +5y)−3/2 ,
− 25 (2x +5y)−3/2
4
(c) 2cos(x 2 +y 2 ) −4x 2 sin(x 2 +y 2 ),
−4xysin(x 2 +y 2 ),
2cos(x 2 +y 2 ) −4y 2 sin(x 2 +y 2 )
4
(d) −
(2x +5y) 2,
(e)
10
−
(2x +5y) 2,
25
−
(2x +5y) 2
18
(3x −2y) 3 , − 12
(3x −2y) 3 , 8
(3x −2y) 3
∂ 3 z
∂x 3 = 0, ∂ 3 z
∂x 2 ∂y = − 2
(y +1) 2,
∂ 3 z
∂x∂y 2 = 4x ∂3 z
(y +1) 3, ∂y 3 = − 6x2
(y +1) 4
6 (a)
∂ 2 f
(2,1) = 7.3891,
∂x2 ∂ 2 f
(2,1) = 7.3891,
∂x∂y
∂ 2 f
∂y 2 (2,1)=0
(b) −0.5645,0.2822, −0.1411
(c) 0.25,0, −1
25.5 PARTIALDIFFERENTIALEQUATIONS
Partial differential equations (p.d.e.s) occur in many areas of engineering. If a variable
dependsupontwoormoreindependentvariables,thenitislikelythisdependencecanbe
describedbyap.d.e.Theindependentvariablesareoftentime,t,andspacecoordinates
x,y,z.
One example is the wave equation. The displacement,u, of the wave depends upon
time and position. Under certain assumptions, the displacement of a wave travelling in
one direction satisfies
∂ 2 u ∂2 u
∂t 2 =c2 ∂x 2
25.5 Partial differential equations 833
wherecisthespeedofthewave.Thisp.d.e.iscalledtheone-dimensionalwaveequation.
Underprescribedconditionsthesubsequentdisplacementofthewavecanbecalculated
as a function of position and time.
Example25.8 Verifythat
u(x,t) = sin(x +2t)
isasolution ofthe one-dimensional wave equation
∂ 2 u
∂t = u
2 4∂2 ∂x 2
Solution The first partialderivatives arecalculated.
∂u
∂u
= cos(x +2t)
∂x
∂t =2cos(x+2t)
The second derivatives, ∂2 u
∂x and ∂2 u
, arenow found.
2 ∂t2 ∂ 2 u
∂x = −sin(x+2t) ∂ 2 u
= −4sin(x +2t)
2 ∂t2 Now
∂ 2 u
∂t = −4sin(x +2t) = 4[−sin(x +2t)] = u
2 4∂2 ∂x 2
Henceu(x,t) = sin(x +2t) isasolution ofthe given wave equation.
Another equally important p.d.e. is Laplace’s equation. This equation is used extensivelyinelectrostatics.Undercertainconditionstheelectrostaticpotentialinaregionis
describedby a function φ(x,y) which satisfiesLaplace’s equation intwo dimensions.
∂ 2 φ
∂x + ∂2 φ
2 ∂y = 0 2
Thisequationissoimportantthatawholeareaofappliedmathematics,calledpotential
theory, isdevoted tothe studyof its solution.
Example25.9 Verifythat
φ(x,y,z) =
1
√
x2 +y 2 +z 2
satisfies the three-dimensional Laplace’s equation
∂ 2 φ
∂x + ∂2 φ
2 ∂y + ∂2 φ
2 ∂z = 0 2
Solution We begin by calculating the first partialderivative, ∂φ . We aregiven
∂x
φ = (x 2 +y 2 +z 2 ) −1/2
and so
∂φ
∂x = −1 2 (x2 +y 2 +z 2 ) −3/2 (2x) = −x(x 2 +y 2 +z 2 ) −3/2
834 Chapter 25 Functions of several variables
We use the product ruletofind ∂2 φ
∂x . 2
∂ 2 φ
∂x = 2 −1(x2 +y 2 +z 2 ) −3/2 −x ( )
− 3 2 (x 2 +y 2 +z 2 ) −5/2 2x
= (x 2 +y 2 +z 2 ) −5/2 [−(x 2 +y 2 +z 2 ) +3x 2 ]
By a similaranalysis we have
∂ 2 φ
∂y 2 = (x2 +y 2 +z 2 ) −5/2 [−(x 2 +y 2 +z 2 ) +3y 2 ]
∂ 2 φ
∂z = 2 (x2 +y 2 +z 2 ) −5/2 [−(x 2 +y 2 +z 2 ) +3z 2 ]
So
∂ 2 φ
∂x + ∂2 φ
2 ∂y + ∂2 φ
2 ∂z = 2 (x2 +y 2 +z 2 ) −5/2 [−3(x 2 +y 2 +z 2 ) +3x 2 +3y 2 +3z 2 ]
= 0
1
Hence φ = √ isasolution of the three-dimensional Laplace’s equation.
x2 +y 2 +z2 Thetransmissionequationisanotherimportantp.d.e.Thepotential,u,inatransmission
cablewith leakagesatisfies a p.d.e. ofthe form
∂ 2 u u
∂x 2 =A∂2 ∂t +B∂u 2 ∂t +Cu
whereA,BandC areconstants relating tothe physical properties of the cable.
Theanalyticalandnumericalsolutionofp.d.e.sisanimportanttopicinengineering.
Coverage isbeyond the scope ofthis book.
EXERCISES25.5
1 Verify that
u(x,y) =x 2 +xy
is asolution ofthe p.d.e.
2 Verify that
∂u
∂x −2∂u ∂y =y
φ = sin(xy)
satisfiesthe equation
3 Verify that
∂ 2 φ
∂x 2 + ∂2 φ
∂y 2 +(x2 +y 2 )φ=0
u(x,y) =x 3 y +xy 3
isasolution ofthe equation
4 Verify that
xy ∂2 u
∂x∂y +x∂u ∂x +y∂u ∂y = 7u
u(x,y) =xy + x y
isasolution of
5 Verify that
satisfies
y ∂2 u
∂y 2 +2x ∂2 u
∂x∂y = 2x
φ(x,y)=xsiny+e x cosy
∂ 2 φ
∂x 2 + ∂2 φ
∂y 2 =−xsiny
25.6 Taylor polynomials and Taylor series in two variables 835
6 Given
φ =
(a) Show
√
x 2 +y 2
∂ 2 φ
∂x 2 =y2 (x 2 +y 2 ) −3/2
(b) Verify that φ is asolution of
∂ 2 φ
∂x 2 + ∂2 φ
∂y 2 = (x2 +y 2 ) −1/2
25.6 TAYLORPOLYNOMIALSANDTAYLORSERIES
INTWOVARIABLES
In Chapter 18 we introduced Taylor polynomials and Taylor series for functions of a
single variable. We now extend this to include functions of two variables. Recall the
mainideabehindTaylorpolynomialsandseriesforafunctionofonevariable.Knowing
thevaluesofafunction, f (x),anditsderivativesatx =awecanwritedowntheTaylor
polynomial generated by f aboutx = a. This polynomial approximates to the function
f. The values of the Taylor polynomial and the function are usually in close agreement
for values of x near to x = a. To put it another way, knowing the value of f and its
derivatives atx =aallows us toestimatethe value of f near tox =a.
The same idea holds when f is a function of two variables,xandy. If we know the
valueof f anditspartialderivativesatapointx =a,y =b,thentheTaylorpolynomials
allow ustoestimate f atpoints near to (a,b).
25.6.1 First-orderTaylorpolynomialintwovariables
Suppose f is a function of two independent variables, x and y, and that the values of
f, ∂f ∂f
and areknown atthe pointx =a,y =b, that isweknow
∂x ∂y
f(a,b)
∂f
∂x (a,b)
∂f
∂y (a,b)
The first-order Taylor polynomial, p 1
(x,y), generated by f about (a,b) isgiven by
p 1
(x,y)= f(a,b)+(x−a) ∂f
∂x (a,b)+(y−b)∂f ∂y (a,b)
We note the following properties of a first-order Taylor polynomial:
(1) ThevaluesoftheTaylorpolynomialandthefunctionareidenticalatthepoint (a,b).
(2) The values of the first partial derivatives of the Taylor polynomial and the function
are identical atthe point (a,b).
(3) Thehighestderivative needed tocalculatetheTaylor polynomialisthefirstderivative.
(4) The first-order Taylor polynomial contains only linear terms; that is, there are no
powers ofxoryhigher than 1.
The first-order Taylor polynomial represents a plane which is tangent to the surface
f(x,y)at (a,b).
We can use p 1
(x,y) toestimatethe value of f near to (a,b).
836 Chapter 25 Functions of several variables
Example25.10 Afunction, f, issuch that
f(3,1)=2
∂f
∂f
(3,1) = −1
∂x
∂y (3,1)=4
(a) State the first-order Taylor polynomial generated by f about (3, 1).
(b) Estimate the values of f(3.5,1.2) and f(3.2,0.7).
(c) Verifythatthe Taylor polynomial and the function have identical values at(3, 1).
(d) VerifythatthefirstpartialderivativesoftheTaylorpolynomialandthefunctionare
identical at(3, 1).
Solution (a) Inthisexamplea = 3 andb= 1.Hence
p 1
(x,y) = f(3,1)+(x−3) ∂f
∂x (3,1)+(y−1)∂f ∂y (3,1)
=2+(x−3)(−1)+(y−1)4
=4y−x+1
The first-order Taylor polynomial generated by f about (3, 1) is
p 1
(x,y)=4y−x+1
(b) We use p 1
(x,y) toestimate f(3.5,1.2) and f(3.2,0.7).
p 1
(3.5,1.2) = 4(1.2) −3.5 +1 = 2.3
p 1
(3.2,0.7) = 4(0.7) −3.2 +1 = 0.6
Hence 2.3 isan estimateof f(3.5,1.2) and 0.6 isanestimate of f(3.2,0.7).
(c) We aregiven f (3,1) = 2.Also
p 1
(3,1)=4(1)−3+1=2
andso f(3,1) = p 1
(3,1).
(d) We aregiven ∂f
∂f
(3,1) = −1and (3,1) =4.Now
∂x ∂y
p 1
(x,y)=4y−x+1
and so
∂p 1
∂x = −1 ∂p 1
∂y = 4
Hence
∂p 1
∂f
(3,1) =
∂x ∂x (3,1) = −1 ∂p 1
∂f
(3,1) =
∂y ∂y (3,1)=4
Example25.11 Afunction, f (x,y), isdefinedby
f(x,y)=x 2 +xy−y 3
(a) State the first-order Taylor polynomialgeneratedby f about(1, 2).
(b) Verify that the Taylor polynomial in (a) and the function f have identical values at
(1, 2).
25.6 Taylor polynomials and Taylor series in two variables 837
(c) VerifythatthefirstpartialderivativesoftheTaylorpolynomialand f haveidentical
values at(1, 2).
(d) Estimate f(1.1,1.9)usingtheTaylorpolynomial.Comparethiswiththetruevalue.
Solution (a) We aregiven f =x 2 +xy −y 3 and so
∂f
∂x =2x+y
∂f
∂y =x−3y2
Evaluating these atthe point(1, 2) gives
f(1,2)=−5
∂f
∂x (1,2)=4
∂f
(1,2) = −11
∂y
Puttinga = 1 andb = 2 in the formula for p 1
(x,y) we are able to write down the
Taylor polynomial:
p 1
(x,y) = f(1,2)+(x−1) ∂f
∂x (1,2)+(y−2)∂f ∂y (1,2)
= −5 + (x −1)4 + (y −2)(−11)
=4x−11y+13
The first-order Taylor polynomial is p 1
(x,y) = 4x −11y +13.
(b) We can evaluate the Taylor polynomial and the function at(1, 2).
p 1
(1,2)=4−22+13=−5
f(1,2)=−5
Hence p 1
(1,2) = f (1,2); that is, the Taylor polynomial and the function have
identical values at(1, 2).
(c) The first partialderivatives of p 1
(x,y) are found.
∂p 1
∂x = 4 ∂p 1
∂y = −11
∂f
∂x (1,2)=4
∂f
(1,2) = −11
∂y
Hence the first partialderivatives of p 1
and f are identical at(1, 2).
(d) p 1
(1.1,1.9) = 4(1.1) −11(1.9) +13 = −3.5
f(1.1,1.9) = (1.1) 2 + (1.1)(1.9) − (1.9) 3 = −3.559
ThevaluesoftheTaylorpolynomialandthefunctionareincloseagreementnearto
the point(1, 2).
25.6.2 Second-orderTaylorpolynomialintwovariables
Given a function, f, and its first and second partial derivatives at (a,b) we can write
down the second-order Taylor polynomial, p 2
(x,y).
p 2
(x,y) = f(a,b)+(x−a) ∂f
∂x (a,b)+(y−b)∂f ∂y (a,b)
+ 1 (
)
(x−a) 2∂2 f
2! ∂x (a,b)+2(x−a)(y−b) ∂2 f f
2 ∂x∂y (a,b)+(y−b)2∂2 ∂y (a,b) 2
838 Chapter 25 Functions of several variables
We note the following properties of the second-order Taylor polynomial:
(1) The values of the Taylor polynomial and the function areidentical at (a,b).
(2) The values of the first partial derivatives of the Taylor polynomial and the function
areidentical at (a,b).
(3) The values of the second partial derivatives of the Taylor polynomial and the function
areidentical at (a,b).
(4) Thesecond-orderTaylorpolynomialcontainsquadraticterms,thatistermsinvolvingx
2 ,y 2 andxy.
Example25.12 A function, f, and its first and second partial derivatives are evaluated at (2,−1). The
values are
f = 3
∂f
∂x = 4
∂f
∂y = −3
∂ 2 f
∂x 2 = 1
∂ 2 f
∂x∂y = 2
(a) State the second-order Taylor polynomial generated by f about (2,−1).
(b) Estimate f (1.8,−0.9).
(c) Verifythatthe Taylor polynomial and f have identical values at (2,−1).
∂ 2 f
∂y 2 = −1
(d) Verify that the first partial derivatives of the Taylor polynomial and f are identical
at (2,−1).
(e) VerifythatthesecondpartialderivativesoftheTaylorpolynomialand f areidentical
at (2,−1).
Solution (a) We puta = 2 andb=−1 inthe formulafor p 2
(x,y):
p 2
(x,y)=f +(x−2) ∂f
∂x +(y+1)∂f ∂y
+ 1 (
)
(x−2) 2∂2 f
2 ∂x +2(x −2)(y +1) ∂2 f f
2 ∂x∂y +(y+1)2∂2 ∂y 2
where f and its derivatives areevaluated at (2,−1). So
p 2
(x,y) =3+(x−2)4+(y+1)(−3)+ 1 2 {(x −2)2 +2(x −2)(y +1)2
+(y +1) 2 (−1)} = x2
2 − y2
2 +2xy+4x−8y−21 2
(b) The value of p 2
(1.8,−0.9) isan estimateof f (1.8,−0.9):
p 2
(1.8,−0.9) = (1.8)2
2
= 1.875
− (−0.9)2 +2(1.8)(−0.9)+4(1.8) −8(−0.9)− 21 2
2
(c) p 2
(2,−1) = 22
2 − (−1)2 +2(2)(−1) +4(2) −8(−1) − 21 2
2 = 3
f(2,−1)=3
Hence p 2
(2,−1) = f (2,−1); that is, the Taylor polynomial and the function have
identical values at (2,−1).
(d) The first partialderivatives of p 2
are
∂p 2
∂x =x+2y+4 ∂p 2
∂y =−y+2x−8
25.6 Taylor polynomials and Taylor series in two variables 839
so
∂p 2
∂x (2,−1)=2+2(−1)+4=4 ∂p 2
∂y (2,−1)=1+4−8=−3
Hence
∂p 2
∂f
(2,−1) =
∂x ∂x (2,−1) ∂p 2
∂f
(2,−1) =
∂y ∂y (2,−1)
that is, the first partial derivatives of the Taylor polynomial and the function have
identical values at (2,−1).
(e) The second partialderivatives of p 2
arefound:
∂ 2 p 2
∂x 2 = 1
Example25.13 Afunction f isgiven by
∂ 2 p 2
∂x∂y = 2 ∂ 2 p 2
∂y 2 = −1
These values areidentical tothe second partial derivatives of f at (2,−1).
f(x,y)=x 3 +x 2 y+y 4
(a) State the second-order Taylor polynomialgeneratedby f about(1, 1).
(b) Usethepolynomialtoestimate f(1.2,0.9).Comparethisvaluewiththetruevalue.
(c) Verify that the second partial derivatives of the function and the Taylor polynomial
are identical at(1,1).
Solution (a) Herea = 1 andb= 1.We aregiven that f =x 3 +x 2 y +y 4 and so
∂f
∂x = 3x2 +2xy
∂ 2 f
∂x∂y = 2x
∂f
∂y =x2 +4y 3
∂ 2 f
∂y 2 = 12y2
Evaluation of f and itsderivatives at(1, 1) yields
f = 3
∂f
∂x = 5
∂f
∂y = 5
∂ 2 f
∂x 2 = 8
The second-order Taylor polynomial is p 2
(x,y):
∂ 2 f
∂x 2 =6x+2y
∂ 2 f
∂x∂y = 2
p 2
(x,y)=f +(x−1) ∂f
∂x +(y−1)∂f ∂y
+ 1 (
)
(x−1) 2∂2 f
2 ∂x +2(x −1)(y −1) ∂2 f f
2 ∂x∂y +(y−1)2∂2 ∂y 2
∂ 2 f
∂y 2 = 12
=3+5(x−1)+5(y−1)+ 1 2 (8(x −1)2 +4(x −1)(y −1) +12(y −1) 2 )
=4x 2 +6y 2 +2xy−5x−9y+5
(b) The value of p 2
(1.2,0.9) isanestimate of f(1.2,0.9).
p 2
(1.2,0.9) = 4(1.2) 2 +6(0.9) 2 +2(1.2)(0.9) −5(1.2) −9(0.9) +5 = 3.68
The actual value is
f(1.2,0.9) = (1.2) 3 + (1.2) 2 (0.9) + (0.9) 4 = 3.6801
840 Chapter 25 Functions of several variables
(c) The partialderivatives of p 2
(x,y) arefound.
∂p 2
∂x =8x+2y−5 ∂p 2
∂y =12y+2x−9
∂ 2 p 2
∂x 2 = 8
∂ 2 p 2
∂x∂y = 2 ∂ 2 p 2
∂y 2 = 12
Thesecondpartialderivativesofp 2
areidenticaltothesecondpartialderivativesof
f evaluated at(1, 1).
25.6.3 Taylorseriesintwovariables
The third-order Taylor polynomial involves all the third-order partial derivatives of f,
the fourth-order Taylor polynomial involves all the fourth-order partial derivatives of f
and so on. Taylor polynomials approximate more and more closely to the generating
function as more and more terms are included. As more and more terms are included,
weobtainaninfiniteseriesknownasaTaylorseriesintwovariables.Thegeneralform
ofthis series isbeyond the scope ofthis book.
EXERCISES25.6
1 Useafirst-orderTaylor polynomial to estimate
f(2.1, 3.2) given
f(2,3)=4
∂f
∂y (2,3)=3
∂f
(2,3) = −2
∂x
2 Useafirst-orderTaylor polynomial to estimate
g(−1.1,0.2) given
g(−1,0) =6
∂g
(−1,0) = −1
∂y
∂g
(−1,0) =2
∂x
3 Useafirst-orderTaylor polynomial to estimate
h(−1.2, −0.7) given
h(−1.3, −0.6) = 4
∂h
(−1.3, −0.6) = −1
∂x
∂h
(−1.3, −0.6) = 1
∂y
4 Useasecond-order Taylor polynomial to estimate
f(3.1, 4.2) given
f(3,4)=1
∂f
∂x (3,4)=0
∂f
∂y (3,4)=2
∂ 2 f
∂x∂y (3,4)=3
∂ 2 f
(3,4) = −1
∂x2 ∂ 2 f
(3,4) =0.5
∂y2 5 Useasecond-order Taylor polynomial to estimate
g(−2.9,3.1) given
g(−3,3) =1
∂g
∂
(−3,3) =4
∂y
∂ 2 g
∂
(−3,3) = −2
∂x∂y
∂g
(−3,3) = −1
∂x
2 g
(−3,3) =3
∂x2 2 g
(−3,3) =2
∂y2 6 Useasecond-order Taylor polynomial to estimate
h(0.1,0.1) given
h(0,0) =4
∂h
∂
(0,0) = −3
∂y
∂ 2 h
∂x∂y (0,0)=2
∂h
(0,0) = −1
∂x
2 h
∂x 2 (0,0)=2
∂ 2 h
(0,0) = −1
∂y2
25.7 Maximum and minimum points of a function of two variables 841
7 Afunction, f (x,y),is defined by
f(x,y) =x 3 y+xy 3
(a) Calculate the first-orderTaylor polynomial
generated by f about (1,1).
(b) Calculate the second-order Taylor polynomial
generated by f about (1,1).
(c) Estimate f(1.2, 1.2) usingthe first-orderTaylor
polynomial.
(d) Estimate f(1.2, 1.2) usingthe second-order
Taylor polynomial.
(e) Compare your answers in (c) and (d)with the
true value of f(1.2, 1.2).
8 Afunctiong(x,y) is defined by
g(x,y)=xsiny+ x y
(a) Calculate the first-orderTaylor polynomial
generated bygabout (0,1).
(b) Calculate the second-order Taylor polynomial
generated bygabout (0,1).
(c) Estimateg(0.2, 0.9) usingthe polynomial in (a).
(d) Estimateg(0.2, 0.9) usingthe polynomial in (b).
(e) Compare your estimateswith the exact value of
g(0.2, 0.9).
9 A functionh(x,y) isdefined by
h(x,y) =e x y+x 2 e y
(a) Calculate the first-orderTaylor polynomial
generated byhabout (0,0).
(b) Calculate the second-order Taylor polynomial
generated byhabout (0,0).
(c) Estimateh(0.2, 0.15)usingthe polynomial
from (a).
(d) Estimateh(0.2, 0.15)usingthe polynomial
from (b).
(e) Compare youranswers in (c)and(d)with the
exact value ofh(0.2,0.15).
10 A function, f (x,y,z),is defined by
f(x,y,z)=x 2 +xyz+yz 2
(a) Writedown the first-orderTaylor polynomial
generated by f about (0,1,2).
(b) Use the polynomial from (a)to estimate
f(0.1,1.2, 1.9).
(c) Compare youranswerin (b)with the exact value
of f(0.1,1.2, 1.9).
Solutions
1 4.4
2 5.6
3 3.8
4 1.465
5 1.305
6 3.625
7 (a) 4x+4y−6
(b) 3x 2 +3y 2 +6xy−8x−8y+6
(c) 3.6 (d) 4.08 (e) 4.1472
8 (a) 1.8415x (b) 2.3012x −0.4597xy
(c) 0.3683 (d) 0.3775
(e) 0.3789
9 (a) y (b) x 2 +xy +y (c) 0.15
(d) 0.22 (e) 0.2297
10 (a) 2x+4y+4z−8 (b) 4.6
(c) 4.57
25.7 MAXIMUMANDMINIMUMPOINTSOFAFUNCTION
OFTWOVARIABLES
We saw inChapter12 thattofind the turningpoints ofy(x) wesolve
dy
dx = 0
842 Chapter 25 Functions of several variables
The sign of the second derivative, d2 y
dx2, is then used to distinguish between a maximum
pointandaminimumpoint.Theanalysisisverysimilarforafunctionoftwovariables.
Whenconsideringafunctionoftwovariables, f (x,y),weoftenseekmaximumpoints,
minimum points and saddle points. Collectively these are known as stationary points.
Figure 25.1 illustrates a maximum point at A, a minimum point at B and a saddle point
at C. When leaving a maximum point the value of the function decreases; when leaving
a minimum point the value of the function increases. When leaving a saddle point,
the function increases in one direction, axis D on the figure, and decreases in the other
direction, axis Eon the figure.
To locate stationary points we equate both first partial derivatives to zero, that is we
solve
∂f
∂x = 0
∂f
∂y = 0
Stationarypoints are locatedby solving
∂f
∂x = 0
∂f
∂y = 0
Example25.14 Locatethe stationarypoints of
(a) f(x,y)=2x 2 −xy−7y+y 2
(b) f(x,y)=x 2 −6x+4xy+y 2
Solution (a) Thefirst partialderivatives arefound:
∂f
∂x =4x−y
∂f
∂y =−x−7+2y
The stationary points are located by solving ∂f
∂x =0and∂f ∂y = 0 simultaneously,
thatis
4x−y=0
−x+2y−7=0
Solving these equations yields x = 1, y = 4. Hence the function f (x,y) has one
stationary pointand itislocated at(1, 4).
(b) The first partialderivatives arefound:
∂f
∂x =2x−6+4y
∂f
∂y =4x+2y
The first partialderivatives areequated tozero:
2x+4y−6=0
4x+2y=0
Solving the equations simultaneously yieldsx = −1,y = 2. Thus the function has
one stationary pointlocated at (−1,2).
25.7 Maximum and minimum points of a function of two variables 843
Example25.15 Locate the stationary points of
f(x,y)= x3
3 +3x2 +xy+ y2
2 +6y
Solution The first partialderivatives arefound:
∂f
∂x =x2 +6x+y
∂f
∂y =x+y+6
These derivatives are equated tozero:
x 2 +6x+y=0 (25.1)
x+y+6=0 (25.2)
Equations (25.1) and (25.2) are solved simultaneously. From (25.2)y = −x − 6, and
substituting this into Equation (25.1) yields x 2 + 5x − 6 = 0. Solving this quadratic
equationgivesx = 1, −6.Whenx = 1,y = −7,andwhenx = −6,y = 0.Thefunction
has stationary values at (1,−7) and (−6,0).
Equating the first partial derivatives to zero locates the stationary points, but does not
identifythemasmaximumpoints,minimumpointsorsaddlepoints.To distinguishbetween
these various points a test involving second partial derivatives must be made.
Note that this is similar to locating and identifying turning points of a function of one
variable.
To identifythe stationarypoints weconsider the expression
( )
∂ 2 f ∂ 2 f ∂ 2
∂x 2 ∂y − f 2
2 ∂x∂y
If the expression isnegative atastationary point,then thatpoint isasaddle point.
Iftheexpressionispositiveandinaddition ∂2 f
∂x ispositiveatastationarypoint,then
thatpoint isaminimum point.
2
Iftheexpressionispositiveandinaddition ∂2 f
∂x isnegativeatastationarypoint,then
thatpoint isamaximum point.
2
If the expression is zero then further tests are required. These are beyond the scope
of the book.
Insummary:
∂ 2 f ∂ 2 (
f ∂ 2 )
∂x 2 ∂y − f 2
< 0 Saddle point
2 ∂x∂y
( )
∂ 2 f ∂ 2 f ∂ 2
∂x 2 ∂y − f 2
>0 2 ∂x∂y
and
∂ 2 f ∂ 2 (
f ∂ 2 )
∂x 2 ∂y − f 2
>0 2 ∂x∂y
and
∂ 2 f
∂x 2 > 0
∂ 2 f
∂x 2 < 0
Minimum point
Maximum point
844 Chapter 25 Functions of several variables
Example25.16 Locate and identify the stationary values of
f(x,y)=x 2 +xy+y
Solution The first partialderivatives arefound:
∂f
∂x =2x+y
∂f
∂y =x+1
The first partialderivatives areequated tozero:
2x+y=0
x+1=0
This yieldsx = −1,y = 2.Thus thereisone stationary point,positioned at (−1,2).
The second partialderivatives arefound:
∂ 2 f
∂x 2 = 2
∂ 2 f
∂x∂y = 1
∂ 2 f
∂y 2 = 0
Now
∂ 2 f ∂ 2 (
f ∂ 2 )
∂x 2 ∂y − f 2
=2(0) − (1) 2 = −1
2 ∂x∂y
Since the expression isnegative, we conclude that (−1,2) isasaddle point.
Example25.17 Locateand identifythe stationarypoints of
f(x,y)=xy−x 2 −y 2
Solution Thefirst partialderivatives arefound:
∂f
∂x =y−2x
∂f
∂y =x−2y
Solving ∂f
∂x
= 0,
∂f
∂y =0yieldsx=0,y=0.
The second partialderivatives arefound:
∂ 2 f
∂x 2 = −2
∂ 2 f
∂x∂y = 1
∂ 2 f
∂y 2 = −2
The second derivative testtoidentify the stationary point isused. Now
∂ 2 f ∂ 2 (
f ∂ 2 )
∂x 2 ∂y − f 2
= (−2)(−2) − (1) 2 =3
2 ∂x∂y
( )
Since ∂2 f f ∂ 2 f ∂ 2
∂x 2 <0and∂2 ∂x 2 ∂y − f 2
> 0 then (0, 0) isamaximum point.
2 ∂x∂y
25.7 Maximum and minimum points of a function of two variables 845
Example25.18 Locate and identifythe stationarypoints of
f(x,y)= x3 y2
−x+
3 2 +2y
Solution The first partialderivatives arefound:
∂f
∂x =x2 −1
∂f
∂y =y+2
The first partialderivatives areequated tozero:
x 2 −1=0
y+2=0
These equations have two solutions:x = 1,y = −2 andx = −1,y = −2.
In order to identify the nature of each stationary point, the second derivatives are
found:
∂ 2 f
∂x 2 = 2x
∂ 2 f
∂x∂y = 0
∂ 2 f
∂y 2 = 1
Each stationary point isexamined inturn.
At (1,−2)
( )
∂ 2 f ∂ 2 f ∂ 2
∂x 2 ∂y − f 2
=2x(1)−0 2 =2x=2sincex=1
2 ∂x∂y
∂ 2 f
∂x 2 =2x=2
Since both expressions are positive the point (1,−2) isaminimum point.
At (−1,−2)
Here
∂ 2 f ∂ 2 (
f ∂ 2
∂x 2 ∂y − f
2 ∂x∂y
) 2
=2x=−2sincex=−1
Since the expression isnegative then (−1,−2) isasaddle point.
Figure 25.2 illustratesaplot of the surface defined by f (x,y).
z
1
0.5
0
–0.5
–1
–1.5
–2
–2.5
–3
0
–1
–2
y
–3
–4 –2
–1
0
1
2
x
Figure25.2
Thefunction
f(x,y)= x3
3
−x+
y2
2 +2y
has a minimumat (1, −2)
and asaddle point at
(−1, −2).
846 Chapter 25 Functions of several variables
EXERCISES25.7
1 Determine the position and nature ofthe stationary
pointsofthe following functions:
(a) f(x,y)=x+y+xy
(b) f(x,y)=x 2 +y 2 −2y
(c) f(x,y)=x 2 +xy−y
(d) f(x,y)=x 2 +y 2 −xy
(e) f(x,y)=x 2 +2y 2 −3xy+x
2 Determine the position and nature ofthe stationary
pointsofthe following functions:
(a) z(x,y) =x 2 +y 2 −3xy +2x
(b) z(x,y) =x 3 +xy +y 2
(c) z(x,y) = y3 3 −x2 −y
(d) z(x,y) = 1 x + 1 y − 1 xy
(e) z(x,y) = 4x 2 y −6xy
3 Locateandidentify the stationary pointsofthe
following:
(a) f(x,y)=x 2 y−x 2 +y 2
(b) f(x,y) = x y +x+y
(c) f(x,y) =x 4 +16xy +y 4
(d) f(x,y)=y−y 2 −e x x
(e) f(x,y) = ex
y 4 −xy
Solutions
1 (a) (−1, −1),saddle point
(b) (0,1),minimum
(c) (1, −2),saddle point
(d) (0,0),minimum
(e) (4,3),saddle point
(
2 (a) 45
, 5)
6 ,saddle point
)
,minimum
(
(b) (0,0),saddle point; 16
, −12
1
(c) (0,1),saddle point; (0, −1),maximum
(d) (1,1),saddle point
(e) (0,0), (1.5,0),saddlepoints
3 (a) (0,0),saddle point
(b) (1,−1),saddle point
(c) (0,0),saddle point; (2,−2),minimum; (−2,2),
minimum
(d) (−1,0.5),maximum
(e) (−4,0.4493), saddlepoint
REVIEWEXERCISES25
1 Find all firstpartialderivatives ofthe following
functions:
(a) z(x,y) = 9x 2 +2y 2
(b) z(x,y) = 3x 3 y 6
(c) z(x,y) = 4x 3 −5y 3
(d) z(x,y) =xy +x 2 y 2
(e) z(x,y) = sin(2xy)
(f) z(x,y) = 2e 3xy
2 Evaluate the firstpartialderivatives of f atx = −1,
y=2.
(a) f(x,y) =3x 3 −y 3 +x 2 y 2
(b) f(x,y) = 4x
y
(c) f(x,y)=sinx+2cosy
(d) f(x,y) = (xy) 3 +x
(e) f(x,y) = x +y
x −y
(f) f(x,y)=2e x e y
3 Findallsecond partialderivatives ofzwhere
(a) z(x,y) = 3x 4 −9y 4 +x 3 y 3
(b) z(x,y) = (x 2 +y 2 ) 2
(c) z(x,y) = 2e 4x−3y
(d) z(x,y) = 2
x +y
(e) z(x,y) = 2 √ x +y
(f) z(x,y) = (sinx)(cosy)
Review exercises 25 847
4 Find allfirstand second partialderivatives of
f(x,y) = (ax +by) n
wherea,bandnare constants.
5 Find the firstpartial derivatives of
f(x,y) = (ax 2 +by 2 +cxy) n
wherea,b,candnare constants.
6 Verifythat
z(x,y)=3x+2y+1
isasolution of
∂z
∂x − ∂z
∂y = 1
7 Verifythat
z(x,y)=2xy−x+y
isasolution of
∂z
∂x + ∂z
∂y =2(x+y)
8 Verifythat
f(x,y)=x 2 +y 2 −2xy
isasolution of
∂f
∂x + ∂f
∂y = 0
9 Verifythat
z(x,y) =sinx +cosy
isasolution of
∂ 2 z
∂x 2 + ∂2 z
∂y 2 +z=0
10 Verify that
z(x,y) =xye x
is asolution of
∂ 2 z
∂x 2 + ∂2 z
∂y 2 −y ∂2 z
∂x∂y =yex
11 (a) Writedown the second-order Taylor polynomial
generatedby f (x,y) aboutx = 2,y = 3 given
f(x,y) =3x 3 y−x 2 y 3
(b) Estimate f(2.1,2.9) usingyourpolynomial from
(a)andcompare thiswith the exact answer.
12 Writedown the second-order Taylor polynomial
generated byz(x,y) aboutx = 1,y = 1 given
z(x,y) = x +y
x
13 Calculate the second-order Taylor polynomial
generated by
√
f(x,y)= x 2 +y 2
aboutx=1,y=0.
14 Locateandidentify all the stationary pointsofthe
following functions:
(a) f(x,y)=x 2 +y 3 −3y
(b) f(x,y) =4xy−x 2 y
(c) f(x,y) =x 3 +2y 2 −12x
(d) f(x,y)=xy−y 2 −x 3
15 Locateandidentify the stationary pointsof
f(x,y)= y x −x2 +y 2
Solutions
1 (a)
∂z ∂z
= 18x,
∂x ∂y = 4y
(b) 9x 2 y 6 ,18x 3 y 5
(c) 12x 2 , −15y 2
(d) y+2xy 2 ,x+2x 2 y
(e) 2ycos(2xy),2xcos(2xy)
(f) 6ye 3xy ,6xe 3xy
∂f ∂f
2 (a) (−1,2) =1, (−1,2) = −8
∂x ∂y
(b) 2,1 (c) 0.5403, −1.8186
3 (a)
(d) 25, −12 (e) − 4 9 ,−2 9
(f) 5.4366,5.4366
∂ 2 z
∂x 2 = 36x2 +6xy 3 ,
∂ 2 z
∂x∂y =9x2 y 2 ,
∂ 2 z
∂y 2 = −108y2 +6x 3 y
(b) 12x 2 +4y 2 ,8xy,4x 2 +12y 2
(c) 32e 4x−3y , −24e 4x−3y ,18e 4x−3y
848 Chapter 25 Functions of several variables
4
4 4 4
∂f
(d)
(x +y) 3, (x +y) 3, (x +y) 3
5
∂x = (2ax +cy)n(ax2 +by 2 +cxy) n−1 ,
(e) − 1 2 (x +y)−3/2 ,
∂f
−2 1 (x ∂y = (2by +cx)n(ax2 +by 2 +cxy) n−1
+y)−3/2 ,
−2 1 11 (a)27x
(x 2 −36y 2 −72xy +108x +276y −432
+y)−3/2
(b)p
(f) −(sinx)(cosy),
2 (2.1,2.9) = −26.97,
f (2.1,2.9) = −26.985
−(cosx)(siny),
12 x 2 −xy−2x+2y+2
−(sinx)(cosy)
∂f
∂x =an(ax +by)n−1 ,
13 y2
2 +x
∂f
14 (a) (0,1)minimum; (0, −1)saddle point
∂y =bn(ax +by)n−1 ,
(b) (0,0),(4,0)saddlepoints
∂ 2 f
(c) (2,0)minimum; (−2,0)saddle point
∂x 2 =a2 n(n −1)(ax +by) n−2 ,
( )
(d) (0,0)saddlepoint; 16
, 12
1 maximum
∂ 2 f
( ) ( )
∂x∂y =abn(n −1)(ax +by)n−2 ,
1
15 √ , − 1
2
∂ 2 2 √ , − 1 1
√ ,
2 2 2 √ ;saddlepoints
2
f
∂y 2 =b2 n(n −1)(ax +by) n−2
26 Vectorcalculus
Contents 26.1 Introduction 849
26.2 Partialdifferentiationofvectors 849
26.3 Thegradientofascalarfield 851
26.4 Thedivergenceofavectorfield 856
26.5 Thecurlofavectorfield 859
26.6 Combiningtheoperatorsgrad,divandcurl 861
26.7 Vectorcalculusandelectromagnetism 864
Reviewexercises26 865
26.1 INTRODUCTION
This chapter draws together several threads from previous chapters. It builds upon differential
and integral calculus, functions of several variables and the study of vectors.
Thesetopicstogetherformabranchofengineeringmathematicsknownasvectorcalculus.
Vector calculus is used to model a vast range of engineering phenomena including
electrostaticcharges,electromagneticfields,airflowaroundaircraft,carsandothersolid
objects,fluidflowaroundshipsandheatflowinnuclearreactors.Thechapterstartsby
explainingwhatismeantbytheoperatorsgrad,divandcurl.Theseareusedtocarryout
various differentiation operations on the fields.
26.2 PARTIALDIFFERENTIATIONOFVECTORS
Consider the vector field v = v x
i + v y
j + v z
k, where each component v x
,v y
and v z
is a
function ofx,y andz. We can partiallydifferentiate the vector w.r.t.xas follows:
∂v
∂x = ∂v x
∂x i + ∂v y
∂x j + ∂v z
∂x k
This isanew vector with a magnitude and direction different fromthose ofv.
850 Chapter 26 Vector calculus
Partial differentiation w.r.t.yandzis defined in a similar way, as are higher derivatives.
For example,
∂ 2 v
∂x 2 = ∂2 v x
∂x 2 i + ∂2 v y
∂x 2 j + ∂2 v z
∂x 2 k
Example26.1 Ifv = 3x 2 yi +2xyzj −3x 4 y 2 k, find ∂v
∂x , ∂v
∂y , ∂v
∂z . Further, find ∂2 v
∂x 2 and ∂2 v
∂x∂z .
Solution We find
∂v
∂x = 6xyi +2yzj −12x3 y 2 k
∂v
∂y = 3x2 i +2xzj −6x 4 yk
∂v
∂z = 2xyj
∂ 2 v
∂x 2 = 6yi −36x2 y 2 k
∂ 2 v
∂x∂z = 2yj
EXERCISES26.2
1 Givenv = 2xi +3yzj +5xz 2 kfind
(a)
∂v
∂x
(b)
∂v
∂y
(c)
2 Iff =2i−xyzj+3x 2 zkfind
(a)
(d)
∂f
∂x
∂ 2 f
∂x 2
(b)
∂f
∂y
(e) ∂2 f
∂y 2
(c)
(f)
∂v
∂z
∂f
∂z
∂ 2 f
∂z 2
3 GivenE = (x 2 +y)i + (1 −z)j + (x +2z)kfind
(a)
(d)
∂E
∂x
∂ 2 E
∂x 2
(b)
(e)
∂E
∂y
∂ 2 E
∂y 2
(c)
(f)
∂E
∂z
∂ 2 E
∂z 2
4 Ifv=3xyzi+(x 2 −y 2 +z 2 )j+(x+y 2 )kfind
∂v
∂x , ∂v
∂y , ∂v
∂z , ∂2 v ∂2 v ∂2 v
∂x 2, ∂y 2,and ∂z 2.
5 Ifv = sin(xyz)i +ze xy j −2xykfind
∂v
∂x , ∂v
∂y , ∂v
∂z .
6 Ifv =xi+x 2 yj−3x 3 k,and φ =xyz,find
φv, ∂ ∂φ
(φv),
∂x ∂x , ∂v .Deduce that
∂x
∂
(φv) = φ∂v
∂x ∂x + ∂φ
∂x v
7 Ifv = ln(xy)i +2xycoszj −x 4 yzk,find
∂v
∂x , ∂v
∂y , ∂v
∂z , ∂2 v
∂x 2 , ∂2 v
∂y 2 , and ∂2 v
∂z 2.
Solutions
1 (a) 2i +5z 2 k (b) 3zj
(c) 3yj +10xzk
2 (a) −yzj +6xzk (b) −xzj
(c) −xyj +3x 2 k
(d) 6zk
(e) 0 (f) 0
26.3 The gradient of a scalar field 851
3 (a) 2xi+k (b) i
4
5
(c) −j +2k
(d) 2i
(e) 0 (f) 0
∂v
∂x =3yzi+2xj+k
∂v
= 3xzi −2yj +2yk
∂y
∂v
= 3xyi +2zj
∂z
∂ 2 v
∂x 2 = 2j
∂ 2 v
∂y 2 =−2j+2k
∂ 2 v
∂z 2 = 2j
∂v
∂x =yzcos(xyz)i +yzexy j −2yk
∂v
∂y =xzcos(xyz)i +xzexy j −2xk
∂v
∂z =xycos(xyz)i +exy j
6 φv =x 2 yzi+x 3 y 2 zj−3x 4 yzk
7
∂(φv)
= 2xyzi +3x 2 y 2 zj −12x 3 yzk
∂x
∂φ
∂x =yz
∂v
∂x =i+2xyj−9x2 k
∂v
∂x = i x +2ycoszj −4x3 yzk
∂v
∂y = i y +2xcoszj−x4 zk
∂v
∂z = −2xysinzj −x4 yk
∂ 2 v
∂x 2 = − i
x 2 −12x2 yzk
∂ 2 v
∂y 2 = − i
y 2
∂ 2 v
∂z 2 = −2xycoszj
26.3 THEGRADIENTOFASCALARFIELD
Given a scalar functionofx,y,z
φ = φ(x,y,z)
we can differentiate it partially w.r.t. each of its independent variables to find ∂φ
∂x , ∂φ
∂y
and ∂φ . If wedo this, the vector
∂z
∂φ
∂x i + ∂φ
∂y j + ∂φ
∂z k
turns out to be particularly important. We call this vector the gradient of φ and denote
itby
∇φ or grad φ
An alternative form of writing ∇φ isas threecomponents
( ∂φ
∂x , ∂φ
∂y , ∂φ )
∂z
gradφ=∇φ= ∂φ
∂x i + ∂φ
∂y j + ∂φ
∂z k
The process of forming a gradient applies only to a scalar field and the result is always
a vector field.
852 Chapter 26 Vector calculus
Itisoften useful towrite ∇φ inthe form
( ∂
∂x , ∂ ∂y , ∂ ∂z)
φ
where the quantity in brackets is called a vector operator and is regarded as operating
on the scalar φ. Thus the vector operator, ∇, isgiven by
( ∂
∂x , ∂ ∂y , ∂ ∂z)
Example26.2 If φ = φ(x,y,z) = 4x 3 ysinz,find ∇φ.
Solution
φ(x,y,z) = 4x 3 ysinz
so thatby partialdifferentiation weobtain
Therefore
∂φ
∂x = 12x2 ysinz
∂φ
∂y = 4x3 sinz
∂φ
∂z =4x3 ycosz
∇φ = 12x 2 ysinzi +4x 3 sinzj +4x 3 ycoszk
We often need to evaluate ∇φ at a particular point, say, for example, at (4, 2, 3). We
write ∇φ| (4,2,3)
todenotethe value of ∇φ atthe point(4,2,3).
Example26.3 If φ =x 3 y +xy 2 +3yfind
(a) ∇φ
(b) ∇φ| (0,0,0)
(c) |∇φ|at(1, 1,1)
Solution (a) If φ =x 3 y +xy 2 +3y then
so that
∂φ
∂x =3x2 y+y 2
∂φ
∂y =x3 +2xy+3
∂φ
∂z = 0
∇φ = (3x 2 y+y 2 )i+ (x 3 +2xy+3)j+0k
(b) At (0,0,0), ∇φ =0i+3j+0k =3j.
26.3 The gradient of a scalar field 853
(c) At(1,1,1),∇φ=(3×1 2 ×1+1)i+(1 3 +2×1×1+3)j+0k=4i+6j+0k
so that |∇φ| at (1,1,1) isequal to √ 4 2 +6 2 = √ 52.
So far we have been given φ and have calculated ∇φ. Sometimes we will be given ∇φ
and will needtofind φ. ConsiderExample 26.4.
Example26.4 IfF = ∇φ find φ whenF = (3x 2 +y 2 )i + (2xy +5)j.
Solution Note that in this example F has only two components. Consequently ∇φ will have two
components,thatisF = ∇φ = ∂φ
∂x i + ∂φ j. Therefore
∂y
(3x 2 +y 2 )i + (2xy +5)j = ∂φ
∂x i + ∂φ
∂y j
Equating theicomponents we have
∂φ
∂x =3x2 +y 2 (26.1)
Equating thejcomponents we have
∂φ
∂y
=2xy+5 (26.2)
Integrating Equation (26.1) w.r.t.xand treatingyas a constant we find
φ=x 3 +xy 2 +f(y) (26.3)
where f (y) is an arbitrary function of y which plays the same role as the constant of
integration does when there is only one independent variable. Note in particular that
∂
∂φ
f (y) = 0.Check by partialdifferentiation that
∂x ∂x =3x2 +y 2 .
Integrating Equation (26.2) w.r.t.yand treatingxas a constant we find
φ=xy 2 +5y+g(x) (26.4)
∂
whereg(x) is an arbitrary function ofx. Note that g(x) = 0. Check by partial differentiationthat
∂φ = 2xy+5.Comparingbothformsfor φ giveninEquations(26.3)and
∂y
∂y
(26.4) wesee thatby choosingg(x) =x 3 and f (y) = 5y wehave
φ=x 3 +xy 2 +5y
Check that F is indeed equal to ∇φ. Also check that by adding any constant to φ the
same property holds, thatisFisstillequal to ∇φ.
854 Chapter 26 Vector calculus
26.3.1 Physicalinterpretationof ∇φ
Suppose wethinkofthescalarfield φ(x,y,z)asdescribingthetemperaturethroughout
a region. This temperature will vary from point to point. At a particular point it can be
shown that ∇φ is a vector pointing in the direction in which the rate of temperature increaseisgreatest.
|∇φ|isthemagnitudeoftherateofincreaseinthatdirection.Similarly,
the rate of temperature decrease is greatest in the direction of −∇φ. Analogous interpretationsarepossibleforotherscalarfieldssuchaspressureandelectrostaticpotential.
Engineeringapplication26.1
Electrostaticpotential
Engineers working on the design of equipment such as cathode ray tubes and electricalvalves,whicharealsocommonlyknownbytheirgenerictermvacuumtubes,
needtocalculatetheelectrostaticpotentialthatresultsfromanaccumulationofstatic
chargesatvariouspointsinaregionofspace.Complicatedexamplesrequiretheuse
of a computer. Let us consider a simple example.
The electrostatic potential,V, inaregion isgiven by
y
V =
(x 2 +y 2 +z 2 ) 3/2
Suppose a unit charge is located in the region at the point with coordinates (2, 1, 0).
Find the direction atthis point,inwhich the rate ofdecrease inpotential isgreatest.
Solution
The rate of decrease isgreatest inthe direction of −∇V.
We first calculate the first partialderivatives ofV. Writing
we find
∂V
V =y(x 2 +y 2 +z 2 ) −3/2
∂x = (− 3 2
)
y(x 2 +y 2 +z 2 ) −5/2 (2x) = −3xy(x 2 +y 2 +z 2 ) −5/2
∂V
(−
∂y = 3 )
y(x 2 +y 2 +z 2 ) −5/2 (2y) + (x 2 +y 2 +z 2 ) −3/2
2
= −3y 2 (x 2 +y 2 +z 2 ) −5/2 + (x 2 +y 2 +z 2 ) −3/2
∂V
(−
∂z = 3 )
y(x 2 +y 2 +z 2 ) −5/2 (2z) = −3yz(x 2 +y 2 +z 2 ) −5/2
2
These partialderivatives can each be evaluated atthe point(2, 1,0). Thatis,
∂V
∂V
∂x ∣ = −0.107
∂V
(2,1,0)
∂y ∣ = 0.036
(2,1,0)
∂z ∣ = 0
(2,1,0)
and so
Finally,
∇V| (2,1,0)
= −0.107i +0.036j +0k
−∇V| (2,1,0)
= 0.107i −0.036j −0k
26.3 The gradient of a scalar field 855
Atthepoint (2,1,0)thisvectorpointsinthedirectionofgreatestrateofdecreasein
potential. This is also the direction of the electrostatic force experienced by the unit
charge atthatpoint.
EXERCISES26.3
1 If φ =x 2 −y 2 −3xyz,(a)find ∇φ,(b)evaluate ∇φ at
the point (0,0,0).
2 If φ =x 2 yz 3 ,find(a) ∇φ,(b) ∇φat (1,2,1),(c) |∇φ|
at (1,2,1).
3 Ifv = ∇φ,find φwhenv = (2x−4y 2 )i−8xyj.
4 Given φ =xyzfind(a) ∇φ,(b) −∇φ,(c) ∇φ
evaluated at (3,0, −1).
5 Find ∇V when
(a)V =x 2 +y 2 +z
( )
2
(b)V =zsin −1 y
x
(c)V = e x+y+z .
6 An electrostatic potential is given byV =xe −√y .
Find ∇V and deduce the direction in which the
decrease ofpotential isgreatest atthe point with
coordinates (1,1,1).
7 Inthe theoryoffluid mechanics the scalar field φ is
known asthevelocity potential ofafluidflow. The
fluid velocityvector, v,atapoint canbe foundfrom
the equationv = ∇φ.Foraparticulartypeofflow
φ =Ux,whereU is aconstant. Show thatthe
corresponding fluidmotion is entirely parallelto thex
axis, andatany point the fluidspeed isU.Finda
velocitypotential forasimilar flow whichisentirely
parallelto theyaxis.
8 Is −∇φ the same as ∇(−φ)?Explain youranswer.
9 Given φ = 3x 2 y +xz,
(a) determine ∇φ
(b) determine ∇(7φ)
(c) determine7∇φ
(d) Is ∇(7φ)thesameas7∇φ?
10 Foranyscalarfield φ andanyconstantk,is ∇(kφ)the
sameask∇φ?
Solutions
1 (a) (2x −3yz)i − (2y +3xz)j −3xyk
(b) 0
2 (a) 2xyz 3 i +x 2 z 3 j +3x 2 yz 2 k
(b) 4i+j+6k
√
(c) 53
3 φ=x 2 −4xy 2 +c
4 (a) yzi +xzj +xyk
(b) −yzi −xzj −xyk
(c) −3j
5 (a) 2xi +2yj +2zk
−zy
(b)
x 2√ 1−y 2 /x 2i
( )
z
+
x √ y
1−y 2 /x 2j +sin−1 k
x
(c) e x+y+z (i +j+k)
6 e −√y i − xe−√ y
2 √ j,−0.368i +0.184j
y
7 φ=Uy
8 yes
9 (a) (6xy +z)i +3x 2 j +xk
(b) (42xy +7z)i +21x 2 j +7xk
(c) (42xy +7z)i +21x 2 j +7xk
(d) yes
10 yes
856 Chapter 26 Vector calculus
26.4 THEDIVERGENCEOFAVECTORFIELD
Given a vector field v = v(x,y,z) let us consider what happens when we differentiate
its individual components. If
v=v x
i+v y
j+v z
k
wecantakeeachcomponentinturnanddifferentiateitpartiallyw.r.t.x,yandz,respectively;
thatis, we can evaluate
∂v x
∂x
∂v y
∂y
∂v z
∂z
Ifweaddthecalculatedquantitiestheresultturnsouttobeaveryusefulscalarquantity
known asthedivergenceof v, thatis
divergence ofv = ∂v x
∂x + ∂v y
∂y + ∂v z
∂z
Thisisusuallyabbreviatedtodivv.Alternatively,thenotation ∇ ·visoftenused.Ifwe
use the vector operator notation introduced inthe previous section wehave
( ∂
∇·v=
∂x , ∂ ∂y ∂z)
, ∂ ·v
=
( ∂
∂x , ∂ ∂y , ∂ ∂z)
·(v x
,v y
,v z
)
Interpreting the · as a scalar product wefind
∇·v= ∂v x
∂x + ∂v y
∂y + ∂v z
∂z
( ∂
asbefore,althoughthisisnotascalarproductintheusualsensebecause
∂x , ∂ ∂y ∂z)
, ∂
is a vector operator. We note that the process of finding the divergence is always performed
on a vector field and the resultisalways a scalar field:
divv=∇·v= ∂v x
∂x + ∂v y
∂y + ∂v z
∂z
Example26.5 Ifv =x 2 zi +2y 3 z 2 j +xyz 2 kfind divv.
Solution Partially differentiating the first component ofvw.r.t.xwefind
∂v x
∂x = 2xz
Similarly,
∂v y
∂y = 6y2 z 2
and
Adding these results wefind
∂v z
∂z = 2xyz
divv=∇·v=2xz+6y 2 z 2 +2xyz
26.4.1 Physicalinterpretationof ∇ · v
26.4 The divergence of a vector field 857
If the vector field v represents a fluid velocity field, then, loosely speaking, the divergence
of v evaluated at a point represents the rate at which fluid is flowing away from
or towards that point. If fluid is flowing away from a point then either the fluid density
must be decreasing there or there must be some source providing a supply of new
fluid.
If the divergence of a flow is zero at all points then outflow from any point must be
matchedbyanequalinflowtobalancethis.Suchavectorfieldissaidtobesolenoidal.
Example26.6 Show thatthe vector field
v=xsinyi+ysinxj−z(sinx+siny)k
issolenoidal.
Solution We have
v x
=xsiny so that
Also,
v y
=ysinx so that
Finally,
v z
= −z(sinx +siny)
Therefore,
∂v x
∂x =siny
∂v y
∂y =sinx
sothat
∂v z
∂z
= −(sinx +siny)
∇·v=siny+sinx−(sinx+siny)=0
and hencevissolenoidal.
Engineeringapplication26.2
ElectricfluxandGauss’slaw
WesawinEngineeringapplication7.5thatelectricchargesproduceanelectricfield,
E, around them which can be visualized by drawing lines of force. Suppose we surroundaregioncontainingchargeswithasurfaceS.Ifasmallportionofthissurface,
δS,ischosenwecandrawthefieldlineswhichpassthroughthisportionasshownin
Figure 26.1.
The flux of E through δS is a measure of the number of lines of force passing
through δS. Gauss’s law states that the total flux out of any closed surfaceSis proportional
to the total charge enclosed. It is possible to show that this law can be
expressed mathematically as
∇·E= ρ ε 0
➔
858 Chapter 26 Vector calculus
E
dS
Figure26.1
Theflux ofEthrough δS isa
measure ofthe number oflines
offorce passingthrough δS.
where ρ is the charge density and ε 0
is a constant called the permittivity of free
space. Note that in a charge-free region, ρ = 0, and so ∇ ·E = 0. This means there
isno net flux ofE.
EXERCISES26.4
1 Avector fieldvisgiven by
Find
v =3x 2 yi+2y 3 zj+xz 3 k
(a)v x , v y , v z
(b) ∂v x
∂x , ∂v y
∂y , ∂v z
∂z
(c)∇·v
2 Avector fieldFis defined by
F=(x+y 2 )i+(y 2 −z)j+(y+z 2 )k
(a) Find ∇ ·F.
(b) Calculate ∇ ·F atthe point (3,2, −1).
3 IfA = 3yzi +2xyj +xyzkfind ∇ ·A.
4 Find the divergence ofeachofthe following vector
fields:
(a) v =x 2 i+y 2 j+z 2 k
(b) v = e xy i +2zsin(xy)j +x 3 zk
(c) v=xyi−2yzj+k
(d) v =x 2 y 2 i−y 2 j−xyzk
5 IfE=xi+z 2 j−yzkfind
(a) E·i
(b) E·j
(c) E·k
(d) ∇·E
6 Avector field is given by
F = (3x 2 −z)i + (2x +y)j
+(x +3yz)k
Find ∇ ·F atthe point (1,2,3).
7 Given the scalar field φ =x 2 +y 2 −2z 2 ,find ∇φ and
showthat ∇ ·(∇φ) =0.
8 Forany vectorfield F is ∇ · (−F)the same as
−(∇ ·F)?
9 ThevectorfieldFis given by
F =xz 2 i+2xy 2 zj+xz 3 k
(a) Find ∇ ·F.
(b) State4F.
(c) Find ∇ · (4F).
(d) Is4(∇ ·F)thesameas ∇ · (4F)?
10 Forany vectorfield F andany scalar constantkis
∇ · (kF)thesameask∇ ·F?
11 Give an example ofavectorfield
v = v x i+v y j+v z ksuchthat ∂v x
∂x ≠ 0, ∂v y
∂y ≠ 0,
∂v z
∂z ≠0,but∇·v=0.
26.5 The curl of a vector field 859
Solutions
1 (a) 3x 2 y,2y 3 z,xz 3
(b) 6xy,6y 2 z,3xz 2
(c) 6xy +6y 2 z +3xz 2
2 (a) 1+2y+2z (b) 3
3 ∇·A=2x+xy
4 (a) 2x+2y+2z
(b) ye xy +2xzcos(xy) +x 3
(c)y−2z
(d)2xy 2 −2y −xy
5 (a) x (b) z 2 (c) −yz (d) 1−y
6 13
7 ∇φ=2xi+2yj−4zk
8 yes
9 (a)z 2 +4xyz +3xz 2
(b)4xz 2 i +8xy 2 zj +4xz 3 k
(c)4z 2 +16xyz +12xz 2
(d)yes
10 yes
11 v = 2xyi +y 2 j −4yzk forexample
26.5 THECURLOFAVECTORFIELD
Athird differential operator isknown ascurl. Itisdefined rather like a vector product.
curlv=∇×v
( ∂
=
∂x , ∂ ∂y ∂z)
, ∂ × (v x
,v y
,v z
)
∣ i jk ∣∣∣∣∣∣∣∣
=
∂ ∂ ∂
∂x ∂y ∂z
∣
v x
v y
v z
This determinant is evaluated in the usual way except that we must regard ∂ ∂x , ∂ ∂y and
∂
as operators, notmultipliers. Thus, forexample,
∂z
∣ ∂ ∂ ∣∣∣∣∣
∂x ∂y
∣ v x
v y
Explicitly wehave
curlv =
means
∂v y
∂x − ∂v x
∂y
( ∂vz
∂y − ∂v ) (
y ∂vx
i +
∂z ∂z − ∂v ) ( ∂vy
z
j +
∂x ∂x − ∂v )
x
k
∂y
860 Chapter 26 Vector calculus
Example26.7 Ifv =x 2 yzi −2xyj +yzk find ∇ ×v.
i j k
Solution ∇ ×v=
∂ ∂ ∂
∂x ∂y ∂z
∣
x 2 yz −2xy yz
∣
[ ∂(yz)
= − ∂(−2xy) ] [ ] [ ]
∂(yz)
i − − ∂(x2 yz) ∂(−2xy)
j + − ∂(x2 yz)
k
∂y ∂z ∂x ∂z ∂x ∂y
=zi+x 2 yj− (2y+x 2 z)k
Notethatthecurloperationisonlyperformedonavectorfieldandtheresultisanother
vector field.
Adetaileddiscussionofthephysicalinterpretationofthecurlofavectorfieldisbeyondthescopeofthisbook.However,ifthevectorfieldvunderconsiderationrepresents
a fluid flow then it may be shown that curl v is a vector which measures the extent to
whichindividualparticlesofthefluidarespinningorrotating.Forthisreason,avector
field whosecurliszero forallvalues ofx,y andzissaidtobe irrotational.
Example26.8 Show thatthe vectorfield
F=ye xy i+xe xy j+0k
isirrotational.
i j k
Solution ∇ ×F =
∂ ∂ ∂
∂x ∂y ∂z
∣
ye xy xe xy 0
∣
( ∂
=
∂y 0 − ∂ ) ( ∂
∂z xexy i −
∂x 0 − ∂ ) ( ∂
∂z yexy j +
∂x xexy − ∂ )
∂y yexy k
=0i +0j + ((xye xy +e xy ) − (yxe xy +e xy ))k
=0 forallx,yandz
The field istherefore irrotational.
EXERCISES26.5
1 Find the curlofthe vectorfield
v =xi −3xyj +4zk.
2 Ifv =3xi−2y 2 zj+3xyzkfind ∇ ×v.
3 SupposeF =P(x,y)i +Q(x,y)jisa
two-dimensionalvector field. Show thatFis
irrotationalif ∂P
∂y = ∂Q
∂x .
4 Findthe divergence andcurlofthe vector field
E = cosxi +sinxj.
26.6 Combining the operators grad, div and curl 861
5 Find the curlofeach ofthe following vector fields:
(a) E =x 2 yi +7xyzj +3x 2 k
(b) v =y 2 xi+4xzj+y 2 xk
(c) F = sinxi +cosxj +3xyzk
6 Avector fieldFisgiven by
F=x 3 yi+2y 2 j+(x+z 2 )k
(a) Find ∇ ×F.
(b) State 3F.
(c) Find ∇ × (3F).
(d) Is3(∇ ×F)thesameas ∇ × (3F)?
7 F is avector field andkis aconstant. Is ∇ × (kF)the
sameask(∇ ×F)?
Solutions
1 −3yk
2 (3xz +2y 2 )i −3yzj
4 −sinx,cosxk
5 (a) −7xyi −6xj + (7yz −x 2 )k
(b) x(2y −4)i −y 2 j + (4z −2xy)k
(c) 3xzi −3yzj −sinxk
6 (a) −j−x 3 k
(b) 3x 3 yi +6y 2 j +3(x +z 2 )k
(c) −3j −3x 3 k
(d) yes
7 yes
26.6 COMBININGTHEOPERATORSGRAD,DIVANDCURL
We have now metthreevector operators; these aresummarized inTable 26.1.
Itisimportanttobeabletocombinethethreeoperatorsgrad,divandcurlinsensible
ways.Forinstance,becausethegradientofascalarisavectorwecanconsiderevaluating
its divergence, thatis( ) ∂φ
∇·(∇φ)=∇·
=
∂x , ∂φ
∂y , ∂φ
∂z
( ∂
∂x , ∂ ∂y ∂z)
, ∂ ·
= ∂2 φ
∂x + ∂2 φ
2 ∂y + ∂2 φ
2 ∂z 2
( ∂φ
∂x , ∂φ
∂y , ∂φ
∂z
This lastexpression isvery important and isoften abbreviated tosimply
∇ 2 φ
Table26.1
Thethree vectoroperators.
Operator Acts on Resultisa Definition
grad scalar field vector field ∇φ = grad φ = ∂φ
∂x i + ∂φ
∂y j + ∂φ
∂z k
div vector field scalar field ∇ ·v = divv = ∂v x
∂x + ∂v y
∂y + ∂v z
∂z
∣ i jk ∣∣∣∣∣∣∣
∂ ∂ ∂
curl vector field vector field ∇ ×v = curlv =
∂x ∂y ∂z
∣ v x v y v z
)
862 Chapter 26 Vector calculus
pronounced‘del-squared φ’,andoccursinLaplace’sequation ∇ 2 φ = 0andotherpartial
differential equations.
Example26.9 If φ = 2x 2 −y 2 −z 2 ,find ∇φ, ∇ · (∇φ)anddeducethat φ satisfiesLaplace’sequation.
Solution ∇φ = 4xi −2yj −2zk
∇·(∇φ)=4−2−2=0
that is,
∇ 2 φ = 0
Hence φ satisfies Laplace’s equation.
Example26.10 If φ(x,y,z)isanarbitrarydifferentiablescalarfield,showthatcurl(grad φ) = ∇×(∇φ)
isalways zero.
Solution Given φ = φ(x,y,z) we have, by definition,
Then
∇φ = ∂φ
∂x i + ∂φ
∂y j + ∂φ
∂z k
curl(grad φ) = ∇ × (∇φ)
i j k
∂ ∂ ∂
=
∂x ∂y ∂z
∂φ ∂φ ∂φ
∣
∂x ∂y ∂z
∣
( ( ∂ ∂φ
=
∂y ∂z
( ( ∂ ∂φ
+
∂x ∂y
)
withsimilarresultsfortheothermixedpartialderiva-
Now,since ∂ ∂x
tives, itfollows that
∇×(∇φ)=0
( ) ∂φ
= ∂ ∂y ∂y
( ∂φ
∂x
for any scalar field φ whatsoever.
)
− ∂ ∂z
( ∂φ
∂y
)
− ∂ ∂y
( ∂φ
∂x
)) ( ∂
i −
∂x
))
k
( ) ∂φ
− ∂ ∂z ∂z
( )) ∂φ
j
∂x
Foranarbitrarydifferentiable scalar field φ
∇×(∇φ)=0
26.6 Combining the operators grad, div and curl 863
Engineeringapplication26.3
Poisson’sequation
Recall from Engineering application 26.1 that engineers sometimes need to solve
complex electrostatic problems when designing electrical equipment. For example,
this need arises when calculating the electrical field strength in a high-voltage electrical
distribution station to ensure there is no danger of electrical discharge across
the air gap between components that have different voltages. Poisson’s equation is
an equation thatcan beusefulwhen carrying outthis work.Consider the following.
If E is an electric field andV an electrostatic potential, then the two fields are
related by
E=−∇V
FromExample26.2weknowGauss’slaw: ∇ · E = ρ ε 0
.Combiningthesetwoequations
we can write
that is
∇·(−∇V)= ρ ε 0
∇ 2 V = − ρ ε 0
ThispartialdifferentialequationisknownasPoisson’sequationandbysolvingitwe
coulddeterminetheelectrostaticpotentialinaregionoccupiedbycharges.Notethat
inacharge-freeregion, ρ = 0andPoisson’sequationreducestoLaplace’sequation
∇ 2 V = 0.
EXERCISES26.6
1 Ascalar field φ is given by
φ = 3x +y−y 2 z 2 .Show that φ satisfies
∇ 2 φ = −2(y 2 +z 2 ).
2 If φ =2x 2 y−xz 3 showthat ∇ 2 φ =4y−6xz.
3 Ifv =xyi −yzj + (y +2z)k findcurl(curl(v)).
4 If φ =xyzandv =3x 2 i+2y 3 j+xykfind ∇φ, ∇·v,
and ∇ · (φv).Show that
∇·(φv)=(∇φ)·v+φ∇·v.
5 Verifythat φ =x 2 y +y 2 z +z 2 x satisfies
∇·(∇φ)=2(x+y+z).
6 IfAis an arbitrarydifferentiablevectorfield show
thatthe divergence ofthe curlofAis always 0.
7 Expresseachofthe following in operator notation
using‘∇’, ‘∇ ·’ and‘∇×’:
(a) grad (divF)
(b) curl(grad φ)
(c) curl(curlF)
(d) div (curlF)
(e) div (grad φ)
8 Scalar fields φ 1 and φ 2 are givenby
φ 1 =2xy+y 2 z φ 2 =x 2 z
(a) Find ∇φ 1 .
(b) Find ∇φ 2 .
(c) State φ 1 φ 2 .
(d) Find ∇(φ 1 φ 2 ).
(e) Find φ 1 ∇φ 2 +φ 2 ∇φ 1 .
(f) Whatdo you conclude from(d)and(e)?
864 Chapter 26 Vector calculus
9 Ascalar field φ and avector fieldFare given by
φ=xyz 2 F=x 2 i+2j+zk
(a) Find ∇φ.
(b) Find ∇ ·F.
(c) Calculate φ(∇ ·F) +F · (∇φ).[Hint:recall the
dot product oftwo vectors.]
(d) State φF.
(e) Calculate ∇ · (φF).
(f) What do you conclude from(c)and (e)?
Solutions
3 j−k
(d) (6x 2 yz +2xy 2 z 2 )i + (2x 3 z +2yx 2 z 2 )j
+(2x 3 y +2y 2 x 2 z)k
4 ∇φ=yzi+xzj+xyk
∇·v=6x+6y 2
(e) same as(d)
∇·(φv) =9x 2 yz+8xy 3 z+x 2 y 2
(f) ∇(φ 1 φ 2 ) = φ 1 ∇φ 2 +φ 2 ∇φ 1
9 (a) yz 2 i +xz 2 j +2xyzk
7 (a) ∇(∇·F) (b) ∇ ×(∇φ)
(b) 2x+1
(c) ∇×(∇×F) (d) ∇·(∇×F)
(e) ∇·(∇φ)
(c) 3x 2 yz 2 +2xz 2 +3xyz 2
8 (a) 2yi + (2x +2yz)j +y 2 (d) x 3 yz 2 i +2xyz 2 j +xyz 3 k
k
(b) 2xzi +x 2 k
(e) same as(c)
(c) 2x 3 yz +y 2 x 2 z 2 (f) ∇·(φF)=φ(∇·F)+F·(∇φ)
26.7 VECTORCALCULUSANDELECTROMAGNETISM
Vector calculus provides a useful mechanism for expressing the fundamental laws of
electromagnetisminaconcisemanner.Theselawscanbesummarizedbymeansoffour
equations,knownasMaxwell’sequations.Muchofelectromagnetismisconcernedwith
solving Maxwell’s equations for different boundary conditions.
Equation1
divD=∇·D=ρ
where D = electric flux density, and ρ = charge density. This equation is a general
formofGauss’stheoremwhichstatesthatthetotalelectricfluxflowingoutofaclosed
surface isproportional tothe electriccharge enclosed by thatsurface.
Equation2
divB=∇·B=0
whereBisthe magnetic fluxdensity.
This equation arises from the observation that all magnetic poles occur in pairs and
therefore magnetic field lines are continuous; that is, there are no isolated magnetic
poles.Incontrast,electricfieldlinesoriginateonpositivechargesandterminateonnegative
charges and so a net positive charge in a region leads to an outflow of electric
flux.
Equation3
Review exercises 26 865
curlE=∇×E=− ∂B
∂t
where E is the electric field strength. This equation is a statement of Faraday’s law. A
time-varying magnetic field produces a space-varying electric field.
Equation4
curlH=∇×H=J+ ∂D
∂t
where H is the magnetic field strength and J is the free current density. This equation
statesthatatime-varying electric field gives risetoaspace-varying magnetic field.
The derivation of these equations is beyond the scope of this text but can be found in
manybooksonelectromagnetism.Thepoweroftheseequationsliesintheirgenerality.
Thebrevitywithwhichthemainlawsofelectromagnetismcanbeexpressedisatribute
tothe utility ofvector calculus.
REVIEWEXERCISES26
1 Find ∇φif
(a) φ = 3xyz
(b) φ =x 2 yz+xy 2 z+xyz 2
(c) φ =xy 2 z 2 +x 2 yz 2 +x 2 y 2 z
2 Ifφ=1/ √ x 2 +y 2 +z 2 ,showthat
∂ 2 φ
∂x 2 + ∂2 φ
∂y 2 + ∂2 φ
∂z 2 = 0
3 Functions satisfyingLaplace’sequationare called
harmonicfunctions.Showthat the following
functionsare harmonic:
(a) z =x 4 −6x 2 y 2 +y 4
(b) z = 4x 3 y −4xy 3
4 IfA=xi+yj+zkand
B = cosxi −sinxj,find
(a) A×B
(b) ∇·(A×B)
(c) ∇×A
(d) ∇×B
Verifythat
∇·(A×B)=B·(∇×A)−A·(∇×B).
5 Forarbitrary differentiablescalar fields φ and ψ show
that∇(φψ) = ψ∇φ +φ∇ψ.
6 Ifψ=x 2 yanda=xi+yj+zk,find∇ψ,
∇ ×a,∇ × (ψa).Showthat
∇×(ψa)=ψ∇×a+(∇ψ)×a.
7 Ascalarfield φ isafunctionofx,zandt only.Vectors
E and Hare defined by
( )
E = 1 ∂φ
ε ∂z i − ∂φ
∂x k H = − ∂φ
∂t j
where ε isaconstant.
(a) Show that ∇ ·E = 0.
(b) Show that ∇ ·H = 0.
Given that ∇ ×E = −µ ∂H ,where µ isaconstant,
∂t
show that φ satisfiesthe partial differential equation
∂ 2 φ
∂x 2 + ∂2 φ
∂z 2 = φ
µε∂2 ∂t 2
8 Ifv = (2x 2 y+3x 5 )i+e xy j+xyzkfind ∂v
∂x and ∂2 v
∂x 2.
9 An electrostatic potential isgiven byV = 5xyz.Find
(a) the associatedelectric field E,
(b) |E| atthe point (1,1,1).
10 An electrostatic field is given by
E = 3(x+y)i+2xyj.Findthedirectionofthisfieldat
(a) the point (2,2)
(b) the point (3,4).
11 Find the curlofA =yi +2xyj +3zk.
12 Find the curlofthe vector field
F = (x 2 −y)i + (xy −4y 2 )j.
13 If φ = 5e x+2y coszfind ∇φ atthepoint (0,0,0).
866 Chapter 26 Vector calculus
14 Thevector fieldsAand B are given by
A =x 2 yi+2xzj+xk
B=yzi+x 2 yj+3k
(a) Calculate the vector productA ×B.
(b) Calculate ∇ · (A ×B).
(c) CalculateB·(∇ ×A) −A·(∇ ×B).
(d) What doyou conclude from (b)and(c)?
(e) Can you prove thisresult istrueforany two
vector fields?
15 Given
F = (3x,2y,4z)
φ =x 2 yz
G = (1,−1,3)
find
(a) ∇·F
(b) ∇φ
(c) ∇×F
(d) ∇ ·(φF)
(e) ∇ × (φG)
(f) ∇(F·G)
(g) ∇×(F×G)
(h) ∇·(F+G)
(i) ∇ × (3F +4G)
Solutions
1 (a) 3yzi +3xzj +3xyk
(b) (2xyz +y 2 z +yz 2 )i
+ (x 2 z +2xyz +xz 2 )j
+ (x 2 y +xy 2 +2xyz)k
(c) (y 2 z 2 +2xyz 2 +2xy 2 z)i
+ (2xyz 2 +x 2 z 2 +2x 2 yz)j
+ (2xy 2 z +2x 2 yz +x 2 y 2 )k
4 (a) zsinxi+zcosxj+(−ycosx−xsinx)k
(b) zcosx (c) 0 (d) −cosxk
6 ∇ψ =2xyi+x 2 j
∇×a=0
∇ × (ψa) =x 2 zi −2xyzj + (2xy 2 −x 3 )k
8 (4xy +15x 4 )i +ye xy j +yzk,
(4y +60x 3 )i +y 2 e xy j
9 (a) E = −∇V = −5yzi −5xzj −5xyk
√
(b) 75
10 (a) 12i +8j
(b) 21i +24j
11 (2y −1)k
12 (y +1)k
13 5i +10j
14 (a) (6xz −x 3 y)i + (xyz −3x 2 y)j
+(x 4 y 2 −2xyz 2 )k
(b) −3x 2 (y +1) −xz(4y −1) +6z
(c) −3x 2 (y +1) −xz(4y −1) +6z
(d) ∇·(A×B)=B·(∇×A)−A·(∇×B)
15 (a) 9
(b) 2xyzi +x 2 zj +x 2 yk
(c) 0
(d) 21x 2 yz
(e) x 2 (y +3z)i + (x 2 y −6xyz)j
− (x 2 z +2xyz)k
(f) 3i −2j +12k
(g) −6i +7j −15k
(h) 9
(i) 0
27 Lineintegralsand
multipleintegrals
Contents 27.1 Introduction 867
27.2 Lineintegrals 867
27.3 Evaluationoflineintegralsintwodimensions 871
27.4 Evaluationoflineintegralsinthreedimensions 873
27.5 Conservativefieldsandpotentialfunctions 875
27.6 Doubleandtripleintegrals 880
27.7 Somesimplevolumeandsurfaceintegrals 889
27.8 ThedivergencetheoremandStokes’theorem 895
27.9 Maxwell’sequationsinintegralform 899
Reviewexercises27 901
27.1 INTRODUCTION
In this chapter a number of new sorts of integral are introduced. These are intimately
connected with the developments of the previous chapter on differential vector calculus.
The chapter starts by explaining the physical significance of line integrals and how
theseareevaluated.Thisleadsnaturallyintothetopicsofconservativevectorfieldsand
potentialfunctions.Theseareimportantinthestudyofelectrostatics.Doubleandtriple
integrals are then introduced; these generalize the earlier work on integration to integrands
which contain two and threeindependent variables.
Finallysomesimplevolumeandsurfaceintegralsareintroduced,togetherwiththedivergence
theorem and Stokes’ theorem. These enable Maxwell’s equations to be
expressed inintegral form.
27.2 LINEINTEGRALS
Consider an object of massmplaced in a gravitational field. Because the force of gravity
is a vector the gravitational field is an example of a vector field. The gravitational
force on the mass is known as its weight and is given by mg where g is a constant
868 Chapter 27 Line integrals and multiple integrals
A
s
m
M
N
ds
B
Figure27.1
An object ofmassmfallsfrom A to B.
vector called the acceleration due to gravity. Suppose we release the mass and allow
it to fall from point A in Figure 27.1. The vertical displacement measured downwards
from Aiss.
Workisbeingdonebythegravitationalforceinordertomakethemassaccelerate.We
wish to calculate the work done by the field in moving the mass from A to B. Suppose
we consider the amount of work done as the mass moves from point M to point N, a
distance δs.Elementaryphysicstellsusthattheworkdoneisequaltotheproductofthe
magnitudeoftheforceandthedistancemovedinthedirectionoftheforce.Inthiscase
themagnitudeoftheforceismg,andsothesmallamountofworkdone, δW,inmoving
from M toN,is
δW =mgδs
from which we have δW δs
=mg. As δs → 0 weobtain
δW
lim
δs→0 δs = dW ds =mg
To find the total work done as the mass falls from A to B we must add up, or integrate,
the contributions over the whole interval of interest,thatis
total work done =W =
∫ B
A
mgds
This is an elementary example of a line integral, so called because we are integrating
alongthelinefromAtoB.Inthiscaseitisstraightforwardtoevaluate.Sincebothgand
mare constants the integral becomes
W=mg
∫ B
A
ds
which equalsmg× (distancefrom AtoB).
Engineeringapplication27.1
Theworkdonebythegravitationalfield
AnobjectofmassmfallsverticallyfromAtoB.IfAisthepointwheres = 0andB
is the point wheres = 10 find the total work done by gravity as the mass falls from
AtoB.
27.2 Line integrals 869
Solution
The work done by gravity isfound by evaluating the line integral
W=mg
∫ s=10
s=0
ds =mg[s] 10
0 = 10mg
Note thatthisis alsothe potential energy lostbyminfallingadistance of 10 units.
Inthe previous example the path along which weintegrated was a straightline, butthis
need not always bethe case.Consider the following example.
Engineeringapplication27.2
Theworkdonebyanelectricfield
Figure 27.2 shows a unit charge moving along a curveC from point A to point B in
an electric fieldE.
At the particular point of interest, M, we have resolved the electric field vector
into two components. Resolving a vector into perpendicular components has been
described in Example 7.3. One component is tangential toC, namelyE t
, and one is
normaltoC,namelyE n
.AsaunitchargemovesfromMtoN,adistance δsalongthe
curve, the work done by the electric field isE t
δs. The componentE n
does no work
sincethereisnomotionperpendiculartothecurveC.Tofindthetotalworkdonewe
mustadd up all contributions, resulting inthe integral
total work done =
∫ B
A
E t
ds
Thisisasecondexampleofalineintegral,thelinebeingthecurve,C,joiningAand
B. Itisusual todenote thisby
∫
total work done = E t
ds
∫
C
where the symbol tells us tointegrate along the curveC.
C
C
B
M
E t
ds
N
E n
A
E
Figure27.2
Asacharge movesfrom Mto N,
the fieldEdoesworkE t δs.
870 Chapter 27 Line integrals and multiple integrals
If the coordinates of the end points of the curveC are known, say (x 1
,y 1
) and (x 2
,y 2
),
we often write ∫ (x 2
,y 2
)
(x 1
,y 1
)
to show this, but care must then be taken to define the intended
route from AtoB.
Wenowexplainhowtointegrateafunctionalongacurve.Consideravectorfield,F,
through which runs a curve,C, as shown inFigure 27.3.
F(x,y)
F
F n
M
u
F t
N
C
MN 5 ds
Figure27.3∫
Theintegral F · dsis equal to the work
C
done by Fasthe particle movesalongC.
Suppose we restrict ourselves to two-dimensional situations. In the general case the
vector field will vary asxandyvary, that is F = F(x,y). Consider the small element
ofC joining points M and N. Let θ be the angle between the tangent to the curve at M
andthedirectionofthefieldthere.WeshalldenotethevectorjoiningMandN(i.e. −→ MN)
by δs. Consider the quantity
F·δs
where · represents the scalar product. When F represents a gravitational force field,
F·δsrepresentsthesmallamountofworkdonebythefieldinmovingaparticleofunit
mass from M to N. The appropriate integral along the whole curve represents the total
work done. From the definition of the scalar product wenote that
F·δs = |F‖δs|cosθ
Writingthe modulus ofFas simplyF, and the modulus of δs, as δs, we have
F·δs=Fδscosθ
= (Fcosθ)δs
=F t
δs
whereF t
is the component of F tangential toC. This result is of the same form as the
expressions for work done obtained previously. We are therefore interested in integrals
of the form
∫
F(x,y)·ds
C
27.3 Evaluation of line integrals in two dimensions 871
ds
dxi
Figure27.4
Thevector dscan be
written asdxi +dyj.
dyj
Since F is a vector function of x and y it will have Cartesian components P(x,y) and
Q(x,y) and so wecan writeFinthe form
F(x,y) =P(x,y)i +Q(x,y)j
Similarly,referringtoFigure27.4, wecanwritethevectordsasds = dxi +dyj.Hence
∫ ∫
F(x,y)· ds = (P(x,y)i +Q(x,y)j)· (dxi +dyj)
C
Taking the scalar product gives
∫
P(x,y)dx +Q(x,y)dy
C
and thisisthe lineintegral inCartesian form.
C
IfF(x,y) =P(x,y)i +Q(x,y)jthen
∫ ∫
F(x,y)· ds = (P(x,y)i +Q(x,y)j)· (dxi +dyj)
C
C
∫
= P(x,y)dx +Q(x,y)dy
C
In order to proceed further we now need to explore how such integrals are evaluated
inpractice. This isthe topic of the next section.
27.3 EVALUATIONOFLINEINTEGRALSINTWODIMENSIONS
Example27.1 Evaluate
∫
F·ds
C
whereF = 5y 2 i +2xyj andC isthe straightline joiningthe originand the point (1,1).
Solution We compare the integrand with the standard form and recognize that in this case
P(x,y) = 5y 2 andQ(x,y) = 2xy. The integralbecomes
∫
5y 2 dx +2xydy
C
The curve of interest, in this case a straight line, is shown in Figure 27.5. Along this
curve it is clear that y = x at all points. We use this fact to simplify the integral by
writing everything in terms of x. We could equally well choose to write everything in
terms of y. If y = x then dy = 1, that is dy = dx. As we move along the curveC, x
dx
ranges from0to1,and the integral reduces to
∫
∫ ∫ x=1
[ 7x
5x 2 dx +2x 2 dx = 7x 2 dx = 7x 2 3
dx = = 7 3 3
C
C
x=0
] 1
0
872 Chapter 27 Line integrals and multiple integrals
y
1 (1, 1)
y
1
(1, 1)
C
C 2
C 1
C
0 1 x
Figure27.5
Thepath ofintegration joins
(0,0)and(1,1)and has equation
y=x.
0 1 x
Figure27.6
ThepathC is made upoftwo distinct
parts.Integrationis performed
separately oneach part.
Let us now see what happens when we choose to evaluate the integral of Example 27.1
along a different path joining (0,0) and (1,1).
Example27.2 Evaluate the integral ∫ C 5y2 dx +2xydy along the curve,C, consisting of thexaxis betweenx
= 0 andx = 1 and the linex = 1 asshown inFigure 27.6.
Solution WeevaluatethisintegralintwopartsbecausethecurveCismadeupoftwopieces.The
firstpieceC 1
ishorizontal,andthesecondC 2
isvertical.Therequiredintegralisthesum
of the two separate ones. Along thexaxis,y = 0 and dy = 0. This means that both the
terms5y 2 dx and 2xydy arezero,and so the integral reducesto
∫ x=1
x=0
0dx=0
andsothereiszerocontributiontothefinalanswerfromthispartofthecurveC.Along
the linex = 1, the quantity dx equals zero. Hence 5y 2 dx also equals zero, and 2xydy
equals2ydy. Becauseyrangesfrom0to1the integral becomes
∫ y=1
y=0
2ydy = [y 2 ] 1 0 = 1
Note thatthis isadifferent answerfrom thatobtained inExample 27.1.
As we have seen from Examples 27.1 and 27.2 the value of a line integral can depend
upon the pathtaken. Itisthereforeessential tospecifythispathclearly.
Example27.3 Evaluatetheintegral ∫ C F(x,y)·ds,whereF(x,y) = (3x2 +y)i+(5x−y)jandCisthe
portionofthe curvey = 2x 2 between A(2,8)and B(3, 18).
∫ ∫
Solution F(x,y)·ds = (3x 2 +y)dx + (5x −y)dy
C
C
The curveC has equationy = 2x 2 for 2 x3. Along this curve we can replaceyby
2x 2 . Note also that dy = 4x and so we can replace dy with 4xdx. This will produce an
dx
27.4 Evaluation of line integrals in three dimensions 873
integral entirely in terms of the variablexwhich is then integrated betweenx = 2 and
x = 3.Thus
∫
C
(3x 2 +y)dx+(5x−y)dy=
=
∫ 3
2
∫ 3
2
(3x 2 +2x 2 )dx + (5x −2x 2 )(4xdx)
25x 2 −8x 3 dx
[ ] 25x
3 3
= −2x 4
3
2
= 28.333
EXERCISES27.3
1 Evaluate ∫ C3ydx +2xdyalong the straightlineC
between(1,1)and(3,3).
2 Evaluate ∫ C 2yxdx +x2 dyalong the straightline
y = 4xfrom (0,0)to (3,12).
3 Evaluate ∫ C (7x +3y)dx +2ydy alongthe curve
y =x 2 between (0,0)and (2,4).
4 IfE = (x +2y)i + (x −3y)j,Aisthepoint (0,0)and
Bis the point (3,2),evaluate
∫ B
E·ds
A
(a)along the straightline joining AandB,
(b)horizontally along thexaxisfromx = 0 tox = 3
andthenverticallyfromy = 0 toy = 2.
5 IfF
∫
= (2xy −y 4 +3)i + (x 2 −4xy 3 )jevaluate
CF · dswhereC is the straightlinejoining A(1,3)
andB(2,5).
6 Evaluate the integral
∫
(3x 2 +2y)dx + (7x +y 2 )dy
C
from A(0,1)to B(2, 5)along the curveC defined by
y=2x+1.
Solutions
1 20
2 108
3 38
4 (a)
15
2
5 −1149
6
268
3
(b)
9
2
27.4 EVALUATIONOFLINEINTEGRALSINTHREEDIMENSIONS
Evaluation of line integrals along curves lying in three-dimensional space is performed
inasimilar way. It isoften helpful inthis case toexpress the independentvariablesx,y
andzintermsofasingleparameter. Consider the following example.
874 Chapter 27 Line integrals and multiple integrals
Example27.4 Acurve,C, isdefined parametrically by
x=4 y=t 3 z=5+t
and islocated within a vector fieldF =yi +x 2 j + (z +x)k.
(a) Find the coordinates of the point P on the curve where the parameter t takes the
value 1.
(b) Find the coordinates of the point Qwhere the parametert takes the value 3.
(c) By expressing the line integral ∫ F·ds entirely in terms oft find the value of the
C
lineintegral from PtoQ.Note thatds = dxi +dyj +dzk.
Solution (a) Whent = 1,x = 4,y = 1 andz = 6,and so Phas coordinates (4, 1,6).
(b) Whent = 3,x = 4,y = 27 andz = 8,and so Qhas coordinates (4, 27, 8).
(c) To express the line integral entirely in terms of t we note that if x = 4,
dx
= 0 so that dx is also zero. If y = t 3 then dy = 3t 2 so that dy = 3t 2 dt.
dt
dt
Similarlysincez = 5 +t, dz = 1 so thatdz = dt. The line integralbecomes
dt
∫ ∫
F·ds= (yi +x 2 j + (z +x)k)·(dxi +dyj +dzk)
C
C
∫
= ydx+x 2 dy+(z+x)dz
=
=
C
∫ t=3
t=1
∫ 3
1
0 +16(3t 2 )dt+(9+t)dt
48t 2 +9+tdt
[ ] 48t
3 3
=
3 +9t+t2 2
1
( 48(3 3 )
=
3
= 438
+ (9)(3) + 32
2
) ( 48
−
3 +9+ 1 2)
EXERCISES27.4
1 You are requiredto evaluate the lineintegral
∫
CF ·ds where F is the vector field
F = (2y +3x)i + (yz +x)j +3xykand
ds = dxi +dyj +dzk.The curveC is defined
parametrically byx =t 2 ,y = 3t andz = 2t for
values oft between0and1.
(a) Findthe coordinatesofthe point A, wheret = 0.
(b) Findthe coordinatesofthe point B,wheret = 1.
(c) Byexpressing the line integralentirely in terms
oft,evaluate the lineintegral from Ato B along
the curveC.
27.5 Conservative fields and potential functions 875
Solutions
1 (a) (0,0, 0)
(b) (1,3, 2)
(c) 17
27.5 CONSERVATIVEFIELDSANDPOTENTIALFUNCTIONS
We have seen that, in general, the value of a line integral depends upon the particular
pathchosen.However,inspecialcases,theintegralofagivenvectorfieldFturnsoutto
be the same on any path with the same end points; that is, it is independent of the path
chosen.InthesecasesthevectorfieldFissaidtobeconservative.Thereisasimpletest
which tells us whether or notFis conservative:
Fisaconservative vector field if
curlF = 0
everywhere
Ifwearedealingwithatwo-dimensionalvectorfieldF(x,y) =P(x,y)i+Q(x,y)jthen
thistestbecomes:Fisconservative if ∂P
∂y = ∂Q . (SeeQuestion 7 inExercises 27.5.)
∂x
Example27.5 Show thatthe vector field
F =ze x sinyi +ze x cosyj +e x sinyk
isaconservative field.
Solution We find curlF:
i j k
∇×F=
∂ ∂ ∂
∂x ∂y ∂z
∣
ze x sinyze x cosy e x siny
∣
= (e x cosy −e x cosy)i − (e x siny −e x siny)j + (ze x cosy −ze x cosy)k
= 0
Wehaveshownthat ∇ ×F = 0andsothefieldisconservative.NotefromSection26.5
thatsuch a field isalsosaidtobe irrotational.
Therearetwoimportantpropertiesofconservativevectorfieldswhichwenowdiscuss.
27.5.1 Thepotentialfunction
Itcanbeshownthatanyconservativevectorfield,F,canbeexpressedasthegradientof
some scalar field φ. That is, we can find a scalar field φ such that F = ∇φ. The scalar
field φ issaidtobeapotentialfunctionandFissaidtobederivablefromapotential.
876 Chapter 27 Line integrals and multiple integrals
Inthree dimensions, if
F =P(x,y,z)i +Q(x,y,z)j +R(x,y,z)k
isconservative, wecan write
So
F=∇φ
= ∂φ
∂x i + ∂φ
∂y j + ∂φ
∂z k
P = ∂φ
∂x
Q = ∂φ
∂y
R = ∂φ
∂z
You should refer back toExample 26.4 where thisidea was introduced.
Example27.6 Thevector field visderivable from the potential φ = 2xy +zx. Find v.
Solution If v isderivable from the potential φ, then v = ∇φ and so
v=∇φ=(2y+z)i+2xj+xk
This vector field isconservative asiseasilyverified by findingcurl v. Infact,
i jk
∇×v=
∂ ∂ ∂
∂x ∂y ∂z
∣
2y+z2x x
∣
=0i−(1−1)j+(2−2)k
= 0
Indeed, recall from Example 26.10 thatcurl (grad φ) isidentically zero forany φ.
When the vector field F is conservative there is an alternative method of evaluating the
line integral ∫ F·ds which involves the use of the potential function φ. Consider the
C
following example.
Example27.7 Thetwo-dimensional vector fieldF = i +2j isconservative.
(a) Findasuitablepotential function φ suchthat F = ∇φ = ∂φ
∂x i + ∂φ
∂y j.
(b) Evaluate ∫ (3,2)
F·ds along any convenient path.
(0,0)
(c) Find the value of φ at B(3, 2), and the value of φ at A(0, 0), and show that the
differencebetweentheseisequaltothevalueofthelineintegralobtainedinpart(b).
Solution (a) We are given that ∂φ = 1 so that φ = x + f (y), where f is an arbitrary function
∂x
ofy. We are also given that ∂φ = 2 so that φ = 2y +g(x), wheregis an arbitrary
∂y
function ofx. Itiseasy toverify that φ =x +2yisasuitablepotential function.
27.5 Conservative fields and potential functions 877
(b) Tofind thelineintegralweshallchoosethestraightlinepath,C,joining (0,0) and
(3,2). This has equationy = 2 x. The lineintegral becomes
3
∫ (3,2)
(0,0)
F·ds=
=
=
=
∫ (3,2)
(0,0)
∫ (3,2)
(0,0)
∫ (3,2)
(0,0)
∫ x=3
x=0
[ ] 7 3
=
3 x 0
= 7
(i +2j)·(dxi +dyj)
dx+2dy
dx + 4 3 dx
7
3 dx
since dy = 2 3 dxonC
(c) The value of φ at B(3, 2) is 3 + 2(2) = 7. The value of φ at A(0, 0) is 0. Clearly
the difference between these values is 7, the same as the value of the line integral
obtained inpart(b).
Theresultobtained inthe previousexample isimportant:
IfFisaconservative vectorfield suchthat F = ∇φ then, forany points Aand B,
∫ B
A
F· ds = φ(B)−φ(A)
27.5.2 Thelineintegralaroundaclosedloop
AsecondimportantpropertyofconservativefieldsariseswhenthecurveCofintegration
forms a closed loop. Suppose we evaluate a line integral from A to B firstly along the
curveC 1
and secondly alongC 2
asshown inFigure 27.7.
If the field,F, isconservative bothintegrals will yield the same answer, thatis
∫ ∫
F·ds= F·ds
C 1
C 2
But
∫ ∫
F·dsfromAtoB =−
F·dsfromBtoA
Consequently,thelineintegralaroundtheclosedcurveAtoBandbacktoAmustequal
zero.When the pathofintegration isaclosedcurve weuse the symbol
∮
878 Chapter 27 Line integrals and multiple integrals
B
y
1
B
C 2
C 1
C 2
C 1
A
Figure27.7 ∫
Theline integral
F·dscanbe
evaluated along different paths
between Aand B.
A
0 1
Figure27.8
Thepath ofintegration is the curveC 1
followed bythe lineC 2 .
x
torepresent the integral.So, foraconservative field wehave the result
∮
F·ds=0
C
for any closed curveC.
Insummarywe have the following results:
Foraconservative field, F,all the following statements areequivalent:
∇×F=0
∮
F·ds=0
∫ B
A
F·ds isindependentofthe pathbetween Aand B
Fisderivable from a scalar potential,thatisF = ∇φ
Example27.8 Evaluate
∫
F·ds
C
whereFisthe vectorfieldy 2 i +2xyj and where:
(a) C =C 1
isthe curvey =x 2 goingfrom A(0,0)toB(1, 1).
(b) C =C 2
isthe straightlinegoingfromB(1, 1)toA(0,0).
(c) Deduce that ∮ F·ds = 0 where the closed line integral is taken around the pathC 1
and thenC 2
.
Solution Thesituation isdepictedinFigure 27.8.
(a) C 1
hasequationy =x 2 andsody = 2xdx.NotethatweareintegratingfromAtoB.
Therefore
∫
F·ds=
C 1
∫
(y 2 i +2xyj)·(dxi +dyj) =
C 1
∫
y 2 dx +2xydy
C 1
Substitutingy =x 2 and dy = 2xdx we have
∫ ∫
F·ds= x 4 dx + (2x)x 2 (2xdx)
C 1
C 1
=
∫ x=1
x=0
27.5 Conservative fields and potential functions 879
5x 4 dx = [x 5 ] 1 0 = 1
(b) C 2
hasequationy =xandsody = dx.NotethatthistimeweareintegratingfromB
toA.Therefore
∫
F·ds=
C 2
∫
(y 2 i +2xyj)·(dxi +dyj) =
C 2
∫
y 2 dx +2xydy
C 2
Substitutingy =xand dy = dx we have
∫ ∫ x=0
F·ds= x 2 dx+2x 2 dx=
C 2
x=1
∫ 0
(c) Now
∮ ∫ ∫
F·ds= F·ds+ F·ds=1−1=0
C 1
C 2
1
3x 2 dx = [x 3 ] 0 1 = −1
The line integral around the closed path is zero because the field is conservative.
You should check that ∇ ×F = 0and thatasuitablepotential function is φ =y 2 x.
EXERCISES27.5
1 Verifythat the vector field
F = (4x 3 +y)i +xj is conservative.
2 Forthe vector field
F =ycosxyi +xcosxyj +2zkfind ∇ ×Fandverify
that the field isconservative.
3 Considerthe fieldF =yi +xj.
(a) ShowthatFis aconservative field.
(b) Find afunction φ such that ∂φ
∂x =yand∂φ ∂y =x.
(c) Find asuitable potential function φ for F.
(d) IfAisthe point (2,1)and Bis the point
(5,8)evaluate ∫ B
A F·ds.
(e) Find φ at Band φ atAandshow that the value of
the line integralcalculated in part (d)is equal to
thedifferencebetweenthevaluesof φ atBandA.
4 Showthat the vectorfield,
F = (2xy −y 4 +3)i + (x 2 −4xy 3 )j,ofQuestion5in
Exercises27.3, isconservative andfindasuitable
potential function φ from which Fcan be derived.
Showthat the difference between φ evaluated at
B(2, 1)andatA(1,0)is equalto the value ofthe line
integral ∫ B
A F·ds.
5 Show that
∫ (1,1)
I = 3x 2 y 2 dx +2x 3 ydy
(−1,−1)
is independentofthe pathofintegration,andevaluate
the integral.
6 The function φ = 4xy is apotential functionfor
whichF = ∇φ.
(a) Find F.
(b) Evaluate ∫ F ·dsalong the curvey =x 3 from
A(0, 0)to B(2, 8).
(c) Evaluate φ at BandatAandshowthat the
difference between these values is equalto the
value ofthe lineintegral obtainedin part (b).
7 IfF =P(x,y)i
(
+Q(x,y)j
)
+0k,showthat
∂Q
∇×F=
∂x − ∂P k.Deduce that ifFis
∂y
conservative then ∂Q
∂x = ∂P
∂y .
880 Chapter 27 Line integrals and multiple integrals
Solutions
3 (b) φ=xy+c (c) xy+c (d) 38
4 φ=x 2 y−y 4 x+3x+c,5
5 2
6 (a) F = 4yi +4xj (b) 64 (c) 64
27.6 DOUBLEANDTRIPLEINTEGRALS
27.6.1 Doubleintegrals
Expressions such as
∫ y=y2
x=x 1
y=y 1
∫ x=x2
f(x,y)dxdy
are known asdouble integrals. What ismeant by the above is
∫ (
y=y2 ∫ )
x=x2
f(x,y)dx dy
y=y 1
x=x 1
wherefirstlytheinnerintegralisperformedbyintegrating f withrespecttox,treatingy
asifitwereaconstant.Thelimitsofintegrationareinsertedasusualandthenthewhole
expression isintegrated with respect toybetween the limitsy 1
andy 2
.
It is important that you can distinguish between double integrals and line integrals.
Physically they mean quite different things, and they are evaluated in different ways.
Whilstbothtypesofintegralcontaintermsinvolvingx,y,dxanddy,inadoubleintegral
dx and dy always occur as a product, that is as dxdy. In a line integral they occur separately.
When evaluating a line integral we integrate along a curve. When evaluating a
double integral we integrate over a two-dimensional region.
Double integrals may alsobe writteninthe form
∫ ∫
f(x,y)dxdy
R
whereRiscalledtheregionofintegration.TheregionRmustbedescribedmathematically.
Inthe integral
∫ y=y2
x=x 1
y=y 1
∫ x=x2
f(x,y)dxdy
R is the rectangular region defined by x 1
x x 2
, y 1
y y 2
. Non-rectangular
regions are alsocommon.
Example27.9 Sketch the regionRdefined by 1 x4 and 0 y2.
Solution TheregionisshowninFigure27.9.Weseethatxliesbetween1and4,andyliesbetween
0 and 2.Consequently the region isrectangular.
27.6 Double and triple integrals 881
y
4
y
4
3
R
3
R
2
2
1
dy
1
dx
0
1 2
3 4 x
Figure27.9
Therectangular region over which the
integration in Example 27.10 is
performed.
0
1 2
3 4 x
Figure27.10
Theintegral could have been foundbyfirst
integrating over a vertical stripsuchasthat
shown.
Example27.10 Evaluate
∫ y=2 ∫ x=4
y=0 x=1
x+2ydxdy
over the region,R, given inExample 27.9 and defined by 1 x4 and 0 y2.
Solution The inner integral is found first by integrating with respect tox, keepingyfixed, that is
constant.
∫ x=4
[ ] x
2 x=4
x+2ydx=
2 +2xy =8+8y− 1 2 −2y=15 2 +6y
x=1
x=1
Then the outer integral isfound by integrating the resultwith respecttoy:
∫ y=2
y=0
[ ]
15 15 2
2 +6ydy= 2 y+3y2 =15+12=27
0
Now consideragain Example 27.10. Byperformingthe innerintegral first,
∫ x=4
x=1
x+2ydx
weareeffectivelysinglingoutahorizontalstrip,suchasthatshowninFigure27.9,and
integrating along it fromx = 1 tox = 4. When we then integrate the result fromy = 0
toy = 2 weare effectively adding upcontributions fromall suchhorizontalstrips.
Theintegraloverthesameregioncouldalsohavebeenperformedbyintegratingfirst
over vertical stripsfor0 y2(Figure 27.10)yielding
∫ y=2
y=0
x+2ydy =[xy+y 2 ] y=2
y=0 =2x+4
882 Chapter 27 Line integrals and multiple integrals
The result is then integrated fromx = 1 tox = 4 effectively adding up contributions
from each vertical strip:
∫ x=4
x=1
2x+4dx=[x 2 +4x] 4 1 =32−5=27
as expected. We note that
∫ y=2 ∫ x=4
y=0
x=1
x+2ydxdy=
∫ x=4 ∫ y=2
x=1
y=0
x +2ydydx (27.1)
In particular, note the order of dx and dy in Equation (27.1). The simple interchange of
limits isonly possible because inthis example the regionRisarectangle.
WhenevaluatingadoubleintegraloverarectangularregionR,theintegrationmay
becarried outinany order, thatis
∫ ∫
∫ ∫
f(x,y)dxdy= f(x,y)dydx
R R
The double integral in Example 27.10 could have the following interpretation. Consider
the surfacez = x +2y, for 0 y 2, 1 x 4, as shown in Figure 27.11. In
this example the surface is a plane but the theory applies to more general surfaces. The
variable z represents the height of the surface above the point (x,y) in the x--y plane.
For example, consider a point in thex--y plane, say (2, 3), that isx = 2,y = 3. Then at
(2,3),z =x +2y = 2 +2(3) = 8. Thus the point on the surface directly above (2,3)
is8unitsfrom (2,3).
Let us now consider dxdy. We call dxdy an element of area. It is a rectangle with
dimensionsdxanddyandareadxdy.Hencezdxdyrepresentsthevolumeofarectangular
column of base dxdy. As we integrate with respect toxfor 1 x4and integrate
with respect toyfor 0 y2weareeffectively summing all such volumes. Thus
∫ y=2 ∫ x=4
y=0
x=1
x+2ydxdy
represents the volume bounded by the surfacez = x + 2y and the regionRin thex--y
plane.
Example27.11 Sketch the regionRover which wewould evaluate the integral
∫ y=1 ∫ x=2−2y
y=0
x=0
f(x,y)dxdy
Solution Firstconsidertheouterintegral.Therestrictiononymeansthatinterestcanbeconfined
to the horizontal strip 0 y 1. Then examine the inner integral. The lower limit on
x means that we need only consider values ofxgreater than or equal to 0. The upperx
limit depends upon the value ofy. Ify = 0 this upper limit isx = 2 −2y = 2. Ify = 1
theupperlimitisx = 2 −2y = 0.Atanyotherintermediatevalueofywecancalculate
thecorrespondingupperxlimit.Thisupperlimitwilllieonthestraightlinex = 2−2y.
With this information the region of integration can be sketched. The region is shown in
Figure 27.12 and isseen tobe triangular.
27.6 Double and triple integrals 883
z
8
7
6
5
4
3
2
1
0 0
1
2
3
4
x
dx dy
Figure27.11
Thesurfacez=x+2y,0y2,1x4;the
quantityzdxdyis the volume ofthe rectangular column
with base area dxdy.
z
y
y
1
dy
2–x
x = 2 – 2y, y = ––— 2
0 2
Figure27.12
Theregion over which the integration in Example 27.12
isperformed.
x
Example27.12 Evaluate
∫ y=1 ∫ x=2−2y
y=0 x=0
4x+5dxdy
over the region described inExample 27.11.
Solution We first perform the inner integral
∫ x=2−2y
x=0
4x+5dx
integrating with respect tox. This gives
[2x 2 +5x] x=2−2y
x=0
=2(2−2y) 2 +5(2−2y)−0
=8y 2 −26y+18
Next weperform the outer integral
∫ y=1
y=0
[ 8y
8y 2 3
−26y+18dy=
3 − 26y2
2
= 8 3 − 26 2 +18
= 7.667
] 1
+18y
0
The region of integration is shown in Figure 27.12. The first integral, with respect tox,
corresponds to integrating along the horizontal strip fromx = 0 tox = 2 − 2y. Then
asyvaries from 0 to 1 in the second integral, the horizontal strips will cover the entire
region.
884 Chapter 27 Line integrals and multiple integrals
If we wish to change the order in which the integration is carried out, care must be
takenwiththelimitsofanon-rectangularregion.ConsideragainFigure27.12.Theline
x = 2 − 2y can be written asy = 2 −x . We can describe the regionRby restricting
2
attention to the vertical strip 0 x 2, and then letting y vary from 0 up to 2 −x
2 ,
that is
∫ x=2 ∫ y=(2−x)/2
x=0
y=0
4x+5dydx
You should check by evaluating this integral that the same result is obtained as in
Example 27.12.
Example27.13 Evaluate the double integral of f (x,y) = x 2 + 3xy over the regionRindicated in Figure
27.13by
(a) integrating first with respecttox, and thenwith respecttoy,
(b) integrating first with respecttoy, and thenwith respecttox.
Solution (a) If we integrate first with respect toxwe must select an arbitrary horizontal strip as
shown in Figure 27.13 and integrate in the x direction. On OB, y = x so that the
lower limit of thexintegration isx =y. On AB,x = 1 so the upper limit isx = 1.
Therefore
∫ x=1
[ ] x
x 2 3
+3xydx=
x=y 3 + 3x2 y 1
2
y
( 1
=
3 + 3y ) ( ) y
3
−
2 3 + 3y3
2
= 1 3 + 3y
2 − 11y3
6
Asyvaries from 0 to 1 the horizontal strips will cover the entire region. Hence the
limitsof integration of theyintegral are 0 and 1.So
∫ y=1
y=0
1
3 + 3y
2 − 11 6 y3 dy=
[ 1
3 y + 3 ] 1
4 y2 − 11y4
24
0
= 1 3 + 3 4 − 11
24
= 15
24
= 5 8
thatis,
∫ y=1 ∫ x=1
y=0
x=y
x 2 +3xydxdy= 5 8
27.6 Double and triple integrals 885
y
y
1
B
1
B
x = y
y = x
y = x
R
dy
x = 1
R
O 1
A
x
0 y = 0 1
A
x
Figure27.13
Theintegral with respect toxhas
limitsx =yandx = 1.
Figure27.14
We integrate along a vertical strip
fromy=0toy=x.
(b) If we choose to integrate with respect toyfirst we must select an arbitrary vertical
stripasshowninFigure27.14.Atthelowerendofthestripy = 0.Attheupperend
y=x.
To integrate along the stripweevaluate
∫ y=x
y=0
] x
x 2 +3xydy=
[x 2 y + 3xy2 =x 3 + 3x3
2
0
2 = 5x3
2
Weaddcontributionsofallsuchverticalstripsbyintegratingwithrespecttoxfrom
x=0tox=1:
∫ 1
0
5x 3
[ 5x
4
2 dx = 8
] 1
0
= 5 8
We see that the prudent selection of the order of integration can yield substantial
savings inthe effort required.
27.6.2 Tripleintegrals
The techniques we have used for evaluating double integrals can be generalized naturally
to triple integrals. Whereas double integrals are evaluated over two-dimensional
regions, tripleintegrals areevaluated over volumes.
Example27.14 Evaluate
I =
∫ x=1 ∫ y=1 ∫ z=1
x=0 y=0 z=0
x+y+zdzdydx
Solution What ismeantby thisexpression is
I =
∫ x=1
(∫ y=1
(∫ z=1
x=0
y=0
z=0
) )
x+y+zdz dy dx
886 Chapter 27 Line integrals and multiple integrals
where,asbefore,theinnerintegralisperformedfirst,integratingwithrespecttoz,with
x andybeing treated asconstants. So
∫ (
x=1 ∫ y=1
] ) 1
I =
[xz +yz+ z2
dy dx
x=0 y=0 2
0
∫ x=1
(∫ y=1
= x+y+ 1 )
2 dy dx
=
=
x=0
∫ x=1
x=0
∫ x=1
x=0
y=0
[xy + y2
[ ] x
2 1
=
2 +x 0
x + 1 2 + 1 2 dx
2 + 1 ] 1
2 y dx
0
= 3 2
Considerationofthelimitsofintegrationshowsthattheintegralisevaluatedoveraunit
cube.
27.6.3 Green’stheorem
There is an important relationship between line and double integrals expressed in
Green’s theorem inthe plane:
If the functions P(x,y) and Q(x,y) are finite and continuous in a region of the
x--y plane, R, and on its boundary, the closed curveC, provided the relevant partialderivatives
exist and arecontinuousinand onC, then
∮
∫ ∫ ( ∂Q
Pdx+Qdy=
C
R
∂x − ∂P
∂y
)
dxdy
where the direction of integration alongC is such that the region R is always to
the left.
Theconditionsgiveninthetheoremarepresentformathematicalcompleteness.Most
of the functions that engineers deal with satisfy these conditions, and so we will not
consider these further. The important thing to note is that this relationship states that a
line integral around a closed curve can be expressed in terms of a double integral over
the region,R, enclosed byC.
Example27.15 (a) Evaluate ∮ C xydx + x2 dy around the sides of the square with vertices D(0, 0),
E(1,0),F(1,1)and G(0,1).
(b) Convert the lineintegral toadouble integral and verify Green’s theorem.
y
1
0
G
D
R
27.6 Double and triple integrals 887
Solution (a) The path of integration is shown in Figure 27.15. To apply Green’s theorem the
path of integration should be followed in such a way that the region of integration
isalways toits left.We musttherefore travel anticlockwise aroundC.
1
F
E
Figure27.15
ThepathC ischosen so
that the regionRis
always to itsleft.
x
OnDE,y=0,dy=0and0x1.
OnEF,x=1,dx=0and0y1.
On FG,y = 1, dy = 0 andxdecreases from1to0.
On GD,x = 0, dx = 0 andydecreases from1to0.
The integral around the curveC can thenbe written as
∮
C
=
∫ E
D
+
∫ F
E
+
∫ G
Therefore
∮
xydx+x 2 dy=0+
C
F
∫ D
+
G
∫ 1
0
= [y] 1 0 + [ x
2
1dy+
2
] 0
1
∫ 0
1
xdx+0
= 1 − 1 2
= 1 2
(b) Applying Green’s theorem withP(x,y) = xy andQ(x,y) = x 2 we can convert the
line integral into a double integral. Note that ∂Q ∂P
= 2x and = x. Clearly the
∂x ∂y
region of integration isthe squareR. We find
∮
∫ ∫
xydx+x 2 dy= (2x −x)dxdy
R
C
=
=
=
∫ 1 ∫ 1
0
∫ 1
0
∫ 1
0
0
[ x
2
2
1
2 dy
[ ] 1 1
=
2 y 0
xdxdy
] 1
dy
0
= 1 2
We see that the same result as that in part (a) is obtained and so Green’s theorem
has been verified.
888 Chapter 27 Line integrals and multiple integrals
EXERCISES27.6
1 Evaluate
∫ 1 ∫ 3
(a) z 2 dxdz
0 0
∫ 2 ∫ 3
(c) x 2 y+1dxdy
1 2
∫ 4 ∫ 2
(b) xdydz
0 0
∫ 1 ∫ 3 y
(d)
−1 2 x dxdy
2 R isthe shaded region shown in Figure27.16.
Evaluate ∫∫ R x + √ ydxdy
(a) performing the integration with respect toxfirst,
(b) performing the integration with respect toyfirst.
y
4
2
0 1
y = x 2
2 x
Figure27.16
Theregion ofintegration forQuestion2.
3 Evaluate ∫∫ R (5x2 +2y 2 )dxdywhereRis the interior
ofthe triangularregion boundedby
A(1,1),B(2, 0)andC(2, 2).
4 (a) Sketch the region ofintegration ofthe double
integral
∫ 3 ∫ √ 4−y
ydxdy
1 1
(b) Evaluate the integral byfirst reversingthe order
ofintegration.
5 Evaluate
∫ 1 ∫ 5
(x 2 +y 2 )dxdy
−1 1
6 Evaluate
∫ ∫
(x 2 +y 2 )dxdy
R
over the triangularregionRwith vertices at
(0,0),(2,0)and(1,1).
7 Evaluate
∫
(x +2y) −1/2 dxdy
R
over the regionRgiven byx −2y 1 and
xy 2 +1.
8 Evaluate ∮ C (sinx +cosy)dx +4ex dywhereC isthe
boundary ofthe triangle formedby the points(1,0),
(3,0)and(3,2).Byconverting thisline integralinto a
doubleintegral, verifyGreen’stheorem.
9 Evaluate the line integral
∮
(x+y)dx+3xydy
C
whereC is the boundary ofthe triangle formedbythe
points(0,0),(2,0)and(0,5).Expressthelineintegral
in terms ofan appropriatedoubleintegral and
evaluate this.Verify Green’stheorem.
Solutions
1 (a) 1 (b) 8x (c) 10.5 (d) 0
2 (a)
3
33
2
20
3
(b)
20
3
7
2
.TheregionRis shown in FigureS.26.
3
y
2
x = y 2 + 1
y = –
x 1
– –
1
2 2
4 (b) 1.35435
5
6
256
3
4
3
0 1 2 3 4 5
FigureS.26
8 92.306
9 20
x
27.7 Some simple volume and surface integrals 889
27.7 SOMESIMPLEVOLUMEANDSURFACEINTEGRALS
Volume and surface integrals arise frequently in electromagnetism and fluid mechanics.
We will illustrate these concepts through some simple examples. For a thorough
treatment ofmore general cases you will need torefer toamore advanced text.
Engineeringapplication27.3
Themassofasolidobject
Consider a solid object such as that shown in Figure 27.17. The density, ρ, of this
objectmayvaryfrompointtopoint.So,atanypointPwithcoordinates (x,y,z),the
density is a function of position, that is ρ = ρ(x,y,z). Since density is a scalar, this
isan example ofascalar field, like those discussed inSection 7.4.
Supposeweselectaverysmallpieceofthisobjecthavingvolume δV andlocated
atP(x,y,z). Recall fromelementary physics that
density = mass
volume
Then the mass of thissmallpiece, δm, isgiven by
δm=ρδV
If we wish to calculate the total mass,M, of the object we must sum all such contributionsfromtheentirevolume.Thisisfoundbyintegratingthroughout
thevolume.
We write thisas ∫
total mass,M = ρdV
V
This is an example of a volume integral, so called because the integration is performed
throughout the volume. It will usually take the form of a triple integral such
as those discussed in Section 27.6.2. Technically, there are three integral signs, but
forbrevitythesehavebeenreplacedbythesingle ∫ V whereitistobeunderstoodthat
the integral is to be performed over a volume. In any specific problem care must be
takentoensurethattheentirevolumeisincludedwhentheintegrationisperformed.
For example, consider the case of a solid cube with sides of length 1 unit. Let
one corner be positioned at the origin and let the edges coincide with the positive
x, y and z axes. Suppose the density of the cube varies from point to point, and is
∫givenby ρ(x,y,z) =x+y+z.Thentheintegralwhichgivesthemassofthecubeis
(x + y + z)dV, where the volume V is the region occupied by the cube. This
V
integral has been evaluated in Example 27.16 and found to be 3 , representing the
2
mass of the cube.
volume V
volume dV
densityr(x, y,z)
P (x, y,z)
Figure27.17
Thetotal massofabodyis foundby
summing,orintegrating,throughout
the wholevolume.
890 Chapter 27 Line integrals and multiple integrals
Engineeringapplication27.4
Theelectricchargeenclosedinaregion
Suppose electric charge is distributed throughout a volume,V, and at any point has
charge density ρ(x,y,z). Charge density isascalar field.
Supposeweselectaverysmallportionhavingvolume δV andlocatedatP(x,y,z)
as shown inFigure 27.18. The charge inthisportion, δQ, isgiven by
δQ=ρδV
Ifwewishtocalculatethetotalchargeenclosedwithinthevolume,wemustsumall
such contributions from the entire volume. This is found by integrating throughout
the volume. We write thisas
∫
total charge,Q = ρdV
This isanother example of a volume integral.
volume V
V
volume dV
charge density
r(x, y,z)
P (x, y,z)
Figure27.18
The total charge enclosed is foundby
summing, orintegrating, throughout
the whole volume.
Engineeringapplication27.5
Fluidflowacrossasurface
Figure27.19(a)representsthemotionofabodyoffluidthroughoutaregion.Atany
pointP(x,y,z) fluid will be moving with a certain speed in a certain direction. So,
each small fluid element has a particular velocityv, which varies with position, that
isv = v(x,y,z). This isan example of a vector field, as discussedinSection 7.4.
surface S
v = v(x, y,z)
v = v(x, y,z)
P(x, y,z)
dS
small portion of surface dS
(a)
(b)
Figure27.19
Thevector fieldvrepresentsfluid velocity andSisan imaginary surface through whichthe
fluidflows.
27.7 Some simple volume and surface integrals 891
In Figure 27.19(b) we have placed an imaginary surface,S, within the flow and
it is reasonable to ask ‘what is the volume of fluid which crosses this surface in
any given time?’ Suppose we select a very small portion of the surface having area
δS. Note that δS is a scalar. We can also define an associated vector δS, which has
magnitude δS and whose direction is normal to this portion of the surface. The
component of fluid velocity in the direction of δS is given by the scalar product
v· ̂δS where ̂δS is a unit vector in the direction of δS. The volume of fluid crossing
this portion each second is v·δS. This is also known as the flux of v. If we
wishtocalculatethetotalfluxthroughthesurfaceSwemustsumallsuchcontributions
over the entire surface. This is found by integrating over the surface. We write
this as
∫
volume flowper second =total flux = v·dS
This is an example of a surface integral, so called because we must integrate over
a surface. It will usually take the form of a double integral such as those discussed
inSection27.6.1.Technically,therearetwointegralsigns,butforbrevitythesehave
been replaced by the single ∫ where it is to be understood that the integral is to be
S
performed over the surfaceS. In any specific problem care must be taken to ensure
that the entire surface is included when the integration is performed. The double
integralsevaluatedinSection27.6arespecialcasesofsurfaceintegrals,inwhichthe
surface isaplane region (thex--y plane).
Forexample,supposethevelocityfieldisgivenbyv = (x+2y)k.Thisrepresents
a flow in the z direction whose magnitude varies with x and y. Suppose S is the
plane surface defined by 1 x 4, 0 y 2,z = 0. This surface is shown in
Figure 27.20. Note that the normal to this surface is parallel to thezaxis and so we
can writedS = dxdyk. Then
v·dS=(x+2y)k·dxdyk
=(x+2y)dxdy
S
y
S
y= 2
dS = dxdyk
v= (x+2y)k
x
y= 0
0
x = 1
z
x = 4
Figure27.20
The vectorfieldvrepresents
fluid flowing acrossthe
surfaceS.Theintegral over
the surface ofv ·dSgives the
volume flow persecond
acrossS.
➔
892 Chapter 27 Line integrals and multiple integrals
The surfaceintegral which gives the flux ofvthroughSis
∫ y=2 ∫ x=4
y=0
x=1
(x +2y)dxdy
Note how the limits of integration are chosen so that contributions from the whole
surfaceareincluded.ThisintegralhasalreadybeenevaluatedinExample27.10and
found tobe 27. This represents the volume flow per second throughS.
Engineeringapplication27.6
Electriccurrentdensity
Electric current is the flow of electric charges. Figure 27.21 represents an electric
current flowing across a surfaceS.
If ρ(x,y,z)isthechargedensityatapoint,thatisthechargeperunitvolume,and
v(x,y,z) isthe velocity of the charges, thenthe quantityJdefined as
J=ρv
is called the current density and has units of amperes per square metre. Suppose
weselectaverysmallportionofthesurfacehavingarea δS.Notethat δS isascalar.
Supposewedefineavector, δS,whichhasmagnitude δSandwhichisnormaltothis
portion of surface. The component of current density in the direction of δS is given
by the scalar productJ· ̂δS. The current crossing this portion isJ·δS. If we wish to
calculate the total current,I, through the surfaceSwe must sum all such contributions
over the entire surface. This is found by integrating over the surface. We write
thisas
∫
current,I = J·dS
S
This isan example of asurface integral.
surface S
J = J(x, y,z)
dS
small portion of surface dS
Figure27.21
Theelectric current through asurface is
obtained usingasurface integral.
Note that this surface integral involves integrating a current density with units of
amperes per square metre over an area with units of square metres. Therefore the
result is a quantity with units of amperes and so the units balance on both sides of
the equation.
27.7 Some simple volume and surface integrals 893
Engineeringapplication27.7
ElectricfluxandGauss’slaw
Electric charges produce an electric field, E, which can be visualized by drawing
lines of force. Suppose we surround a region containing charges with a surface S.
ThefluxofEthroughSisameasureofthenumberoflinesofforcepassingthrough
it. The flux isgiven by
∫
flux = E·dS
S
Gauss’slawstatesthatthetotalfluxoutofanyclosedsurfaceisproportionaltothe
totalchargeenclosed.Soconsideraclosedsurfaceinfreespace,enclosingavolume
V. The total charge enclosed
∫
inV isgiven by the volume integral
total charge = ρdV
Gauss’s lawthen states:
∮
E·dS= 1 ∫
ρdV
ε 0
S
V
V
where ε 0
isthepermittivityoffreespace.Thenotation ∮ indicatesthatthesurfaceis
a closed surface. Note how a surface integral can be expressed as a volume integral.
This relationship isgeneralized inthe divergence theorem inthe following section.
Example27.16 Avectorfield isgiven by v =x 2 i + 1 2 y2 j+ 1 2 z2 k.
(a) Find divv.
(b) Evaluate ∫ V divvdVwhereVistheunitcube0x1,0y1,0z1.
Solution (a) divv = 2x +y+z.
(b) We seek ∫ (2x +y+z)dV whereV is the given unit cube. The small element of
V
volume dV is equal to dzdydx. With appropriate limits of integration the integral
becomes
∫ x=1 ∫ y=1 ∫ z=1
x=0
y=0
z=0
(2x+y+z)dzdydx
This isatripleintegral of the kind evaluated inSection 27.6.2:
∫ x=1 ∫ y=1 ∫ z=1
∫ x=1 ∫ y=1
] 1
(2x+y+z)dzdydx =
[2xz+yz+ z2
dydx
x=0 y=0 z=0
x=0 y=0 2
0
∫ x=1 ∫ y=1
(
= 2x+y+ 1 )
dydx
2
=
x=0
∫ x=1
x=0
y=0
[2xy + y2
2 2] + y 1
dx
0
894 Chapter 27 Line integrals and multiple integrals
=
∫ x=1
x=0
= [x 2 +x] 1 0
= 2
2x+1dx
Example27.17 Evaluate ∮ S v·dSwherev =x2 i+ 1 2 y2 j + 1 2 z2 k and whereSis the surface of the unit
cube 0 x 1, 0 y 1, 0 z 1. The vector dS should be drawn on each of the
sixfaces inan outward sense.
Solution ThecubeisshowninFigure27.22.Weevaluatethesurfaceintegralovereachofthesix
faces separately and then add the results.
OnsurfaceA,x = 1and0 y1,0 z1.dSisavectornormaltothissurface,
drawn inanoutward sense,and so wecan write itas dydzi. Then
v·dS= ( x 2 i + 1 2 y2 j+ 1 2 z2 k ) ·dydzi
=x 2 dydz
=dydz
sinceonA,x=1
The required surfaceintegral overAisthen
∫ z=1 ∫ y=1
z=0
y=0
1dydz=
= 1
∫ z=1
z=0
[y] 1 0 dz
OnsurfaceB,x = 0and0 y1,0 z1.dSisavectornormaltothissurfaceand
sowecanwriteitas −dydzi.Thenv·dSbecomes −x 2 dydz = 0sincex = 0.Overthis
surface, the integral iszero.
You should verify in a similar manner that over each ofC and E the integral is 1 2 ,
whilst integrals overDandF arezero. The total surfaceintegral isthen 2.
on E, dS = dxdy k
z
z = 1
E
dx
dy
D
B
dz
dx
on C, dS = dxdz j
x
x = 1
A
y= 1
C
dy
F
dz
on A, dS = dydz i
y
Figure27.22
Theintegral isevaluated over
allsixsurfaces.
27.8 The divergence theorem and Stokes’ theorem 895
EXERCISES27.7
1 Given F = −yi −xk evaluate ∮ SF·dSwhereSis the
surface ofaunitcube 0 x1, 0 y1,
0z1.
2 Given F = 6xy 2 z 2 k evaluate ∫ SF ·dS whereSisthe
plane surfacez = 2, 0 x2, 0 y2. Take the
direction ofthe vector element ofarea to bek.
3 Evaluate the volume integral ∫ V 6xdV whereV isthe
parallelepiped 0 z2,0 x1, 0 y3.
4 Evaluate ∫ V 1 +zdV whereV isaunitcube
0x1,0y1,0z1.
Solutions
1 0
3 18
2 128
4
3
2
27.8 THEDIVERGENCETHEOREMANDSTOKES’THEOREM
Thereareanumberoftheoremsinvectorcalculuswhichallowline,surfaceandvolume
integrals to be expressed in alternative forms. One of these, Green’s theorem, has been
describedinSection27.6.3.Thisallowsalineintegraltobewrittenintermsofadouble
integral.Now wegive details ofthe divergence theoremand Stokes’ theorem.
27.8.1 Thedivergencetheorem
The divergence theorem relates the integral over a volume,V, to an integral over the
closedsurface,S, whichsurroundsthatvolume, asillustratedinFigure 27.23.
When calculating such surface integrals vectors drawn normal to the surface should
always be drawn in an outward sense, that is away from the enclosed volume. Recall
thatwhen a surfaceisclosedthe symbolforasurfaceintegralis ∮ S .
Thedivergence theorem:
∮ ∫
v·dS= divvdV
S V
Example27.18 Verify the divergence theorem for the vector field v = x 2 i + 1 2 y2 j + 1 2 z2 k over the unit
cube0x1,0y1,0z1.
Solution Firstlyweneedtoevaluate ∮ S v·dSwhereSisthesurfaceofthecube.Thisintegralhas
been calculated inExample 27.17 and shown tobe2.
Secondlyweneedtocalculate ∫ divvdV overthevolumeofthecube.Thishasbeen
V
done inExample 27.16 and again the resultis2.
We have verified the divergence theorem that ∮ S v·dS=∫ V divvdV.
896 Chapter 27 Line integrals and multiple integrals
surface S
open surface S
dS
volume V
outwardly drawn normal
dS
Figure27.23
The divergence theoremrelates avolume integral
to asurface integral.
bounding curve C
Figure27.24
An open surfaceSwith bounding
curveC.
27.8.2 Stokes’theorem
Stokes’ theorem relates the integral over an open surface,S, to a line integral around a
closed curve,C, which bounds thatsurfacesuch as thatshown inFigure 27.24.
When a surface is open we adopt the following convention when drawing vectors
normal to the surface. When the direction of the normal vector has been specified, use
theright-handscrewruletoobtainasenseofturningaroundthenormalvector,asshown
inFigure27.24.Imaginenowmovingthecirclewhichsurroundsthenormalvectoralong
the surface until it just meets the curveC. Transfer its sense of turning to the curveC.
WhencalculatingthelineintegralaroundthecurveC,itshouldbetraversedinthesame
sense.
SpecificallyconsiderthetwoopensurfacesshowninFigure27.25.Inbothcaseswe
are considering cubes. In the first case the cube has no top face. In the second case the
cubehasnobottomface.Drawingvectordxdzjnotethatthesenseofturningrequiredby
theright-handscrewruleisthatshown.ThecurveC mustbetraversedinthedirections
shown.
Recall thatwhen a curve isclosed the symbol foralineintegral is ∮ C .
z
z
dS = dx dy k
C
dS=–dxdzj
dS=dxdzj
dS=–dxdzj
dS=dxdzj
y
y
C
x
x
dS= –dxdyk
Figure27.25
The right-handscrew rulegivesthe directionin whichC must be traversed.
27.8 The divergence theorem and Stokes’ theorem 897
Stokes’ theorem:
∮ ∫
v·dr= curlv·dS
C S
HeredrisanelementoflengthalongthecurveC.Recallthatdr = dxi +dyj +dzk.
WehaveusedthesymboldrratherthandsaswedidinSection27.2toavoidconfusion
with the element ofarea dS.
Example27.19 Acubeofside2unitsisconstructedwithfivesolidfacesandoneopenface.Itislocated
in the region defined by 0 x 2, 0 y 2, 0 z 2 and its open face is its top
face, bounded by the curveC, lying inthe planez = 2,asshown inFigure 27.26.
Throughout thisregion a vector field isgiven by
v=(x+y)i+(y+z)j+(x+z)k
(a) Evaluate ∮ C v·dr.
(b) Evaluate curlv.
(c) Evaluate ∫ curlvdS, and verifyStokes’ theorem.
S
Solution (a) The open face is highlighted in Figure 27.26. It is bounded by the curveC around
whichthelineintegral ∮ C v·drmustbeperformedinthesenseshown.Inthisplane
z = 2 and dz = 0,and hence
v·dr=(x+y)dx+(y+2)dy
We perform the lineintegral aroundC infour stages.
OnI,x = 0,dx = 0andhencev·dr = (y +2)dy.Notingthatyvariesfrom0to
2,the contribution tothe lineintegral is
∫ 2
[ ] y
2 2
y+2dy=
0 2 +2y = 6
0
z
E
z = 2 I bounding curve C
dS =
dxdzj
D
IV
III
II
B
dS= dxdzj
x
x = 2
A
F
y= 2
C
y
dS =
dxdyk
Figure27.26
Acubicalbox with open topbounded bycurveC.
898 Chapter 27 Line integrals and multiple integrals
On II,y = 2, dy = 0 and hence v·dr = (x +2)dx. Noting thatxvaries from 0
to2,the contribution tothe lineintegral is
∫ 2
[ ] x
2 2
x+2dx=
0 2 +2x = 6
0
On III,x = 2, dx = 0 and hence v·dr = (y +2)dy. Hereyvaries from 2 to 0.
This contribution tothe lineintegral is
∫ 0
[ ] y
2 0
y+2dy=
2 2 +2y = −6
2
On IV,y = 0, dy = 0 and hence v·dr = xdx. Herexvaries from 2 to 0. The
contribution tothe lineintegral is
∫ 0
[ x
2
xdx= = −2
2
2
] 0
2
Putting all these results together we find
∮
v·dr=6+6−6−2=4
C
i j k
(b) curlv =
∂ ∂ ∂
∂x ∂y ∂z
=−i−j−k
∣
x+yy+zx+z
∣
(c) Now we calculate the surface integral which must be performed over the five solid
surfaces separately. Refer to Figure 27.26. On surfaceA, the front face lying in the
planex = 2,dS = dydzi. Hence curlv·dS = −dydz. Then
∫ ∫ z=2 ∫ y=2
curlv·dS= −dydz
A
=
=
z=0
∫ z=2
z=0
∫ z=2
z=0
= [−2z] 2 0
= −4
y=0
[−y] 2 0 dz
−2dz
On B, the back face lying in the plane x = 0, dS = −dydzi. It follows that
curlv·dS = dydz. The required integral overBis
∫ z=2 ∫ y=2
z=0
y=0
dydz=4
OnC, the right-hand face, dS = dxdzj. Hence curlv·dS = −dxdz. The required
integral overC is
∫ z=2 ∫ x=2
z=0
x=0
−dxdz = −4
27.9 Maxwell’s equations in integral form 899
Similarly, onD, the left-hand face, dS = −dxdzj. Hence curlv·dS = dxdz. The
required integral overDis
∫ z=2 ∫ x=2
z=0
x=0
dxdz=4
On surfaceF, the base,z = 0 and dS = −dxdyk. Hence curlv·dS = dxdy. The
required integral overF is
∫ y=2 ∫ x=2
y=0
x=0
dxdy=4
RecallthatthetopsurfaceE isopen,andsowehavecompletedthesurfaceintegrals.
Finally, putting these results together,
∫
curlv·dS=−4+4−4+4+4=4
S
Notefrompart(a)thatthisequals ∮ v·drandsowehaveverifiedStokes’theorem.
EXERCISES27.8
1 VerifyStokes’theorem forthe field v =xyi +yzj
whereSisthe surface ofthe cube 0 x1,
0y1,0z1andthefacez=0isopen.
2 Ifv = 3x 2 i +xyzj −5zkverify Stokes’theorem
whereSisthe surface ofthe cube 0 x1,
0y1,0z1andthefacez=0isopen.
3 Supposev =x 3 i −3x 2 y 2 z 2 j + (7x +z)kandSisthe
surface ofacube ofside2units lying in the region
0x2,0y2,0z2withanopentopin
the planez = 2. Verify Stokes’theorem forthisfield.
4 Consideracube given by 0 x1, 0 y1,
1 z2,above the surfacez = 1. Suppose the
surfacez = 1 is the only open face. LetSbe the
surface ofthiscube. Verify Stokes’theorem forthe
field v =yi + (x −2xz)j −xyk. TakeC asthe square
withcorners (0,0,1), (1,0,1), (1,1,1), (0,1,1)in
the planez = 1.
5 Considerthat part ofthe positiveoctant, that iswhere
x,yandzare allpositive, bounded bythe planes
x=0,y=0,z=0andx+y+2z=2.Assumethat
the tetrahedron soformedhas three solidfaces, and
one open face onthe planex+y+2z = 2.Verify
Stokes’theorem forthe field v =x 2 i −2xzk.TakeC
asthe triangle (2,0,0), (0,2,0)and (0,0,1).
Solutions
When calculating the relevant line and surface integrals
the signofthe result dependsupon the orientation of
the curveC.
1 ∮ v·dr=± 1 2
2 ∮ v·dr=0
3 ∮ v·dr=±128
4 ±2
5 ± 2 3
27.9 MAXWELL’SEQUATIONSININTEGRALFORM
InSection26.7Maxwell’sequationswerestatedasexamplesoftheapplicationofvector
calculus.Infact,alternativeintegralformsoftheseequationsareoftenmoreusefuland
theseare given here forcompleteness.
900 Chapter 27 Line integrals and multiple integrals
Equation1
∮ ∫
D·dS=
S
V
ρdV
whereD =electricfluxdensity,and ρ iselectricchargedensity.Notethatther.h.s.isthe
totalchargeenclosedbythevolumeV.Thisequationstatesthatthetotalfluxcrossinga
closedsurfaceSwhichenclosesavolumeV isequaltothetotalchargeenclosedbythe
surface.ThisisalsotheintegralformofGauss’slaw.(SeeEngineeringapplication27.7
which isobtained by lettingD = ε 0
E.)
Equation2
∮
B·dS=0
S
where B = magnetic flux density. This law states that the net magnetic flux crossing
any closed surface is zero. So whilst charges can be thought of as sources or sinks of
electric flux,there areno equivalent sources or sinks ofmagnetic flux.
Equation3
∮
CE·dr=− ∂ ∂t
∫
S
B·dS
Note that in the theory of electrostatics, differentiating
∮
partially with respect to time
always yields zero, and so this equation reduces to E·dr = 0. This is a condition
discussed inSection 27.5 and confirms thatan electrostatic field isaconservative field.
Equation4
∮
CH·dr= ∂ ∂t
∫
S
∫
D·dS+ J·dS
S
This isthe integral formof Ampère’s circuitallaw. The closed curveC bounds an open
surfaceS. Acurrentwith density ∂D +Jflows through the surfaceS.
∂t
C
EXERCISES27.9
1 Startingwith Maxwell’sequation ∇ ·D = ρ,by
integrating both sides over anarbitraryvolumeV and
usingthe divergence theoremobtain Equation1
above.
2 Startingwith Maxwell’sequation ∇ ·B = 0, by
integrating both sides over anarbitraryvolumeV and
usingthe divergence theoremobtain Equation2
above.
3 Startingwith the equation ∇ ×E = 0forstatic
electric fields, use Stokes’theoremto showthat
∮
E·dr=0.
4 Startingfrom ∇ ×H = ∂D +J andusingStokes’
∂t
theorem obtain Equation 4.
( )
∂
5 Inmagnetostatics
∂t = 0 ,Ampère’slaw (orthe
magnetic circuit law) states:
∮
H·dr=I
C
whereI is the currentenclosed bythe closedpathC.
Obtain
∫
thislaw from Equation 4 andusing
S J·dS=I.
Review exercises 27 901
REVIEWEXERCISES27
1 Find ∫ C (2x −y)dx +xydy along the straightline
joining (−1, −1)and (1,1).
2 Find ∫ C (2x+y)dx+y3 dy
(a)along the curvey =x 3 between (1,1)and (2,8),
and
(b)along the straightline joining these points.
3 IfF = 3xyi +2e x yjfind ∫ CF ·ds whereC is the
straightliney = 2x +1 between (1,3)and (4,9).
4 ShowthatthefieldF = (2x +1)yi + (x 2 +x+1)jis
conservative and findasuitable potential function φ.
5 Find ∮ C (x2 i +4xyj) ·ds whereC is aclosed pathin
the form ofatriangle with vertices at
(0,0),(3,0)and(3,5).
6 Find ∫ y=3
y=0
7 Find ∫ y=2
y=0
∫ x=2
x=1 (3x −5y)dxdy.
∫ x=1+y
x=0 (2y +5x)dxdy.
8 Find
∫ x=2 ∫ y=1 ∫ z=3
x=0 y=0 z=0 (x2 +y 2 +z 2 )dzdydx.
9 Evaluate ∫∫ R (4y +x2 )dxdy whereRisthe interior of
the squarewith verticesat (0,0),
(4,4),(0,4)and(4,0).
10 Evaluate ∮ C ey dx +e x dywhereC isthe boundary
ofthe triangle formedbythe linesy =x,y = 5
andx = 0. Byconverting thisline integral intoa
doubleintegral verifyGreen’s theoremin the
plane.
11 TheregionRis bounded bytheyaxisand the lines
y=xandy=3−2x.
(a) Sketch the regionR.
(b) Find the volume between the surfacez =xy +1
and the regionR.
(a) Evaluate ∮ CF ·dswhereC is the curve enclosing
the regionR.
(b) Verify Green’stheorem in the plane.
(c) Thesurface,z(x,y),isgivenbyz = 1 +x+xy.
Calculatethevolumeunderthesurfaceandabove
the regionR.
13 The regionRisbounded bythexaxisand the curve
y=8+2x−x 2 .
(a) SketchR.
(b) Evaluate ∫∫ R 3x+2ydxdy
14 Evaluate
∫ 0 ∫ 4
(a) −1 3
∫ 3xydxdy
3 ∫ 2
(b) 2 0
∫ 4+x−ydydx
1 ∫ 4
(c) 0 2y x2 +y 2 dxdy
∫ −1 ∫ x 3
(d) −2 0 1dydx
15 Sketch the regionsofintegration ofthe double
integrals in Question14.
16 Ifa,b,candd are constantsshow that
∫ d ∫ b
f(x)g(y)dxdy
c a
is identical to
[∫ b
f(x)dx
] [∫ d ]
g(y)dy
a c
y
9
C
R
y = 9 – x 2
12 TheregionRis shown in Figure 27.27.
Thevector fieldFisgiven by
Solutions
F =y 2 i+3xyj
0
3
Figure27.27
The regionRforQuestion12.
x
1
2
3
2 (a) 1030.5 (b) 1031.25
3 1666.37
4 φ=x 2 y+yx+y+c
902 Chapter 27 Line integrals and multiple integrals
10 −452.239
5 50
9
640
3 14 (a) − 21 4 (b) 11 (c) 21.5
6 −9
7 31
17
11 (b) 8
12 (a) 64.8 (c) 99
8 28
13 (b) 367.2
(d) − 15 4
28 Probability
Contents 28.1 Introduction 903
28.2 Introducingprobability 904
28.3 Mutuallyexclusiveevents:theadditionlawofprobability 909
28.4 Complementaryevents 913
28.5 Conceptsfromcommunicationtheory 915
28.6 Conditionalprobability:themultiplicationlaw 919
28.7 Independentevents 925
Reviewexercises28 930
28.1 INTRODUCTION
Probability theory is applicable to several areas of engineering. One example is reliabilityengineeringwhichisconcernedwithanalysingthelikelihoodthatanengineering
system will fail. For most systems calculating the exact time of failure is not feasible
but it is often possible to obtain a good estimate of whether or not a system will fail in
a certain time interval. This is useful information to have for any engineering system,
butitisvitalifthefailureofthesystemresultsinthepossibilityofinjuryorlossoflife.
Examplesincludethefailureofhigh-voltageswitchgearsothatthecasingbecomeslive,
or electricalequipment producing a spark whilebeing usedunderground inamine.
Probability theory is also used extensively in production engineering, particularly in
the field of quality control. No manufacturing process produces components of exactly
the same quality each time. There is always some variation in quality and probability
theory allows this variation to be quantified. This enables some predictability to be introduced
into the activity of manufacturing and gives an engineer the confidence to say
components of a certainquality can be supplied toacustomer.
Afinalexampleisinthefieldofcommunicationengineering.Communicationchannelsaresubjecttonoisewhichisrandominnatureandsoismostsuccessfullymodelled
usingprobabilitytheory. We will examine some ofthese concepts inmore detailinthis
chapter.
904 Chapter 28 Probability
E
A
28.2 INTRODUCINGPROBABILITY
B = A
Figure28.1
A:acomponent meets
the specification.B =A:
acomponent fails to
meet the specification.
Consideramachinewhichmanufactureselectroniccomponents.Thesemustmeetacertain
specification. The quality control department regularly samples the components.
Suppose, on average, 92 out of 100 components meet the specification. Imagine that a
component is selected at random and letAbe the outcome that a component meets the
specification; let B be the outcome that a component does not meet the specification.
Then we say the probability of A occurring is 92 = 0.92 and the probability of B
100
8
occurring is = 0.08. The probability is thus a measure of the likelihood of the
100
occurrence of a particular outcome. We write
P(A) = probability ofAoccurring = 0.92
P(B) = probability ofBoccurring = 0.08
We note that the sum of the probabilities of all possible outcomes is 1. The process of
selecting a component is called a trial. The possible outcomes are also called events.
In this example there are only two possible events,AandB. We can depict this situation
using a Venn diagram as shown in Figure 28.1. Recall from Section 5.2 that Venn
diagrams areusedtodepictsets.InthisdiagramwearedepictingtheeventsAandBas
sets. The set of all possible outcomes is called the sample space and is represented by
the universal set E. The setArepresents the event that a component meets the specification.
The setBrepresents the event that a component fails to meet the specification.
In this instance, when a trial takes place there are only two possible outcomes, either a
component meets the specification or it does not; that is, either eventAoccurs or event
Boccurs. An alternative notation istowrite
B =A
whereAissaidtobethecomplementofA. We could alsowrite
A =B
When a bar appears over a set then wesay, for example, ‘notA’, or‘notB’.
Wenowseektodefineprobabilityinamoreformalway.LetE beanevent.Wewish
toobtaintheprobabilitythatE willoccur,thatisP(E),whenatrialtakesplace.Inorder
to do so we repeat the trial a large number of times,n. We count the number of times
that eventE occurs,denoted bym. We then conclude that
P(E)= m n
(28.1)
Thelargerthenumberoftrialsthattakeplace,themoreconfidentweareofourestimate
of the probability of E occurring. For example, consider the trial of tossing a coin. If
we wish to calculate the probability of a head occurring then measuring the results of
1000 tosses of the coin is likely to yield a more accurate estimate than measuring the
results of 10 tosses of the coin. Various consequences flow from Equation (28.1). The
numberoftimesE occursmustbenon-negativeandlessthanorequaltothenumberof
trials,thatis0 mn. So,
0P(E)1
28.2 Introducing probability 905
Ifm = 0,corresponding toeventE never occurring inntrials,then
P(E)= 0 n = 0
We conclude thatE isan impossible event.
Ifm =n, corresponding toeventE always occuring inntrials,then
P(E)= n n = 1
We deduce thatE isacertainevent.
IfP(E) > 0.5 then weconclude thatE ismore likely tooccur thannot.
Theapproachtodefiningprobabilitythatwehaveadoptedsofarisessentiallyexperimental.Wecarryoutaseriesoftrialsandmeasuretheprobabilityofaneventoccurring.
Sometimes it is possible to deduce the probability of an event purely from theoretical
considerations.Consideragainthetrialoftossingacoin.Ifthecoinisfairthendefining
H tobe the event thatthe coin lands with the head facing up we can easilydeduce that
P(H) = 0.5
because we intuitively recognize that a head has the same likelihood of occurring as a
tail. Similarly, if we roll a fair die -- die is the singular of dice -- andE is the event that
a 6 isobtained, thenP(E) = 1 6 .
Suppose there aretwo possible outcomes,E 1
andE 2
, of a trial. Then
P(E 1
)+P(E 2
)=1
If a trialhasnpossible outcomes,E 1
,E 2
,...,E n
, then
P(E 1
)+P(E 2
)+···+P(E n
)=1
The sum of the probabilities of all possible outcomes is 1, representing the total
probability.
28.2.1 Compoundevent
Consider the situation of rolling a fair die. It is possible to define a variety of events
or outcomes associated with this trial. We choose to define two events, E 1
and E 2
, as
follows:
So,
E 1
: a 1,2,3 or 4 isobtained
E 2
: an even number isobtained
E 1
= {1,2,3,4}
E 2
= {2,4,6}
Now the universal set is
E={1,2,3,4,5,6}
which embraces all possible outcomes.
Suppose we now define the eventE 3
as
E 3
:E 1
occurs andE 2
occurs
906 Chapter 28 Probability
TheeventE 3
isknownasacompoundeventandonlyoccurswhenbothE 1
andE 2
occur
atthe same time.Usingset notation we can write
E 3
=E 1
∩E 2
= {1,2,3,4} ∩ {2,4,6}
= {2,4}
E
SoE 3
occurswhena2ora4isrolled.HenceE 1
∩E 2
canoccurintwooutofsixequally
likely ways.Therefore,
E 1
E 2
1
3
2
4
5
6
P(E 3
)=P(E 1
∩E 2
)= 2 6 = 1 3
Figure 28.2 shows a Venn diagram forthis example.
We could alsodefine the compound eventE 4
as
Figure28.2
E 1 ∩E 2 corresponds to
the compound eventE 1
occursandE 2 occurs.
E 4
:E 1
occurs orE 2
occurs
The eventE 4
occurs when eitherE 1
occurs orE 2
occurs,or both. We can write
So,
E 4
=E 1
∪E 2
= {1,2,3,4} ∪ {2,4,6}
= {1,2,3,4,6}
P(E 4
)=P(E 1
∪E 2
)= 5 6
If eventsAandBboth occur thenthiscompound event isdenotedA ∩B.
If either eventAoreventBoccursthenthiscompound event isdenotedA ∪B.
Engineeringapplication28.1
Powersupplytoacomputer
Computersusedinthecontroloflife-criticalsystemsoftenhavetwoseparatepower
supplies.Ifonepowersupplyfailsthentheothertakesover.LetE 1
betheeventthat
powersupply1fails,andletE 2
betheeventthatpowersupply2fails.Forthepower
supplytothecomputertofailcompletelythecompoundeventE 1
andE 2
mustoccur,
thatisE 1
∩E 2
occurs.
A system such as this is said to have dual modular redundancy. One of the
difficultieswithsuchasystemisindeterminingwhichsystemisoperatingcorrectly
andwhichunitisswitchedtobeingtheactiveone.Insomesituationsitisnecessary
to have three systems operating in parallel and then using a voting system to decide
on the correct output. This way the decision is taken based on the majority of the
threesystems, thatis, when atleasttwo outofthe threeagree.
28.2 Introducing probability 907
Engineeringapplication28.2
Qualitycontrolonafactoryproductionline
Quality control is an important aspect of factory production. So much so that some
engineersspendtheirwholecareersinthisfieldandareknownasqualitycontrolengineers.Thefundamentalgoalistoensurethatmanufacturedgoodsareproducedto
aspecifiedstandardinspiteofvariationsinthefinishedproductthatareaninevitable
partof the manufacturing process.Consider the following problem.
Machines A and B make components, which are then placed on a conveyor belt.
OfthosemadebymachineA,93%areacceptable.OfthosemadebymachineB,95%
areacceptable.MachineAmakes60%ofthecomponentsandmachineBmakesthe
rest. Find the probability that a component selected at random from the conveyor
belt is
(a) made by machine A
(b) made by machine Aand acceptable
(c) made by machine Band acceptable
(d) made by machine Band unacceptable
Solution
(a) We are given that 60% of the components are made by machine A. Converting
thispercentage toadecimal number gives
P(component ismade by machine A) = 0.6
(b) Weknowthat60%ofthecomponentsaremadebymachineAand93%ofthese
areacceptable. Converting these percentages todecimal numbers wehave
P(component ismade by machine Aand is acceptable) = 60
100 × 93
100
= 0.60 ×0.93 = 0.558
(c) Weknowthat40%ofthecomponentsaremadebymachineBand95%ofthese
areacceptable. So,
P(component ismade by machine Band isacceptable) = 40
100 × 95
100
= 0.40 ×0.95 = 0.38
(d) We know that 40% of the components are made by machine B and 5% of these
areunacceptable. So,
P(component ismade by machine Band isunacceptable) = 40
100 × 5
100
= 0.40 ×0.05 = 0.02
An alternative way to solve this problem is by constructing a tree diagram. To
do so, consider the case when 1000 components are picked from the conveyor
belt. We know that machine A makes 60% of the components and so 600 of these
components, on average, will be made by machine A and the other 400 must
therefore be made by machine B. This isshown inFigure 28.3.
➔
908 Chapter 28 Probability
acceptable
558
1000
machine A
machine B
600
1000
machine A
machine B
600
400
not acceptable
acceptable
not acceptable
42
380
400
Figure28.3
Partial tree diagram forEngineering
application 28.2.
Figure28.4
Full tree diagram forEngineering
application 28.2.
Of the 600 components made by machine A, we know that 93% of these are acceptable,
thatis93% of 600 = 558.So 600 −558 = 42 are unacceptable.
Of the 400 components made by machine B, we know that 95% of these are acceptable,
thatis95% of 400 = 380. So 400 −380 = 20 are unacceptable.
We can now complete a full tree diagram for this problem, which is shown in
Figure 28.4.
Usingthetreediagramitisstraightforwardtocalculatetherequiredprobabilities.
(a) Theprobabilityacomponent ismadebymachineAisfoundbynotingthat600
of the 1000 components aremade by machine A,that is
P(component is made by machine A) = 600
1000 = 0.6
(b) TheprobabilityacomponentismadebymachineAandisacceptableisfoundby
notingthat558ofthe1000componentsaremadebymachineAandacceptable,
that is
P(component is made by machine Aand acceptable) = 558
1000 = 0.558
(c) P(component is made by machine Band acceptable) = 380
1000 = 0.38
(d) P(component is made by machine Band unacceptable) = 20
1000 = 0.02
20
EXERCISES28.2
1 Afair die isrolled. TheeventsE 1 , . . . ,E 5 are defined
asfollows:
E 1 :aneven number is obtained
E 2 :anodd numberis obtained
E 3 :ascore ofless than2isobtained
E 4 :a3is obtained
E 5 :ascore ofmore than3is obtained
Find
(a)P(E 1 ),P(E 2 ),P(E 3 ),P(E 4 ),P(E 5 )
(b)P(E 1 ∩E 3 )
(c)P(E 2 ∩E 5 )
(d)P(E 2 ∩E 3 )
(e)P(E 3 ∩E 5 )
2 Atrialcan have threeoutcomes,E 1 ,E 2 andE 3 .E 1
andE 2 are equally likely to occur.E 3 is threetimes
more likely to occur thanE 1 .FindP(E 1 ),P(E 2 ) and
P(E 3 ).
3 Acomponent ismadeby machines AandB.Machine
Amakes 70%ofthe components andmachine B
makesthe rest. Forboth machines, the proportion of
28.3 Mutually exclusive events: the addition law of probability 909
acceptable components is90%. Find the probability
that acomponent selected atrandom is
(a) unacceptable
(b) acceptable and ismade bymachine A
(c) unacceptable and ismade by machine B
4 MachinesA, Band Cmake components. Machine A
makes20% ofthe components, machine B makes
30% ofthe components and machine C makesthe
rest.Theprobabilitythatacomponentisfaultyis0.07
when madeby machine A, 0.06 when made by
machine B and0.05 when made bymachine C.A
component is picked atrandom. Calculate the
probability that the component is
(a) made bymachine C
(b) made bymachine A andis faulty
(c) made bymachine B and isnotfaulty
(d) made bymachine C and isfaulty
(e) made bymachine A andis notfaulty
(f) faultyand isnotmade by machine B
5 Circuit boards are made bymachines A, B,CandD.
Machine A makes15%ofthe components, machine
B makes30%, machine C makes35%and machine D
makes the remainder. Theprobability that aboard is
acceptable is0.93 when made bymachine A, 0.96
when made by machine B,0.95 when made by
machine Cand 0.93 when made by machine D. A
board ispicked atrandom. Calculate the probability
that itis
(a) made bymachine D
(b) made bymachine A andis acceptable
(c) made bymachine B and isunacceptable
(d) made bymachine C and isacceptable
(e) made bymachine D andis unacceptable
(f) unacceptable and made bymachine C
(g) Inabatch of1000 boards,how many would be
expected to be acceptable andmade by
machine D?
Solutions
1
1 (a)
2 , 1 2 , 1 6 , 1 6 , 1 2
1
(d)
6
2 0.2, 0.2, 0.6
(b) 0
(e) 0
(c)
1
6
3 (a) 0.1 (b) 0.63 (c) 0.03
4 (a) 0.5 (b) 0.014 (c) 0.282
(d) 0.025 (e) 0.186 (f) 0.039
5 (a) 0.2 (b) 0.1395 (c) 0.012
(d) 0.3325 (e) 0.014 (f) 0.0175
(g) 186
Table28.1
Theprobability ofacar
componentfallinginto
oneoffourcategories.
Category
28.3 MUTUALLYEXCLUSIVEEVENTS:THEADDITION
LAWOFPROBABILITY
Probability
topquality 0.18
standard 0.65
substandard 0.12
reject 0.05
Consideramachinewhichmanufacturescarcomponents.Supposeeachcomponentfalls
into one offourcategories:
topquality
standard
substandard
reject
After many samples have been taken and tested, it is found that under certain specific
conditions the probability that a component falls into a category is as shown in Table
28.1. The four categories cover all possibilities and so the probabilities must sum
to 1. If 100 samples are taken, then on average 18 will be top quality, 65 of standard
quality, 12 substandardand 5 will berejected.
910 Chapter 28 Probability
Example28.1 Using the data in Table 28.1 calculate the probability that a component selected at random
iseither standard or topquality.
Solution On average 18 out of 100 components are top quality and 65 out of 100 are standard
quality.So83outof100areeithertopqualityorstandardquality.Hencetheprobability
that a component is either top quality or standard quality is 0.83. The solution may be
expressed more formally as follows. LetAbe the event that a component is top quality.
LetBbe the event thatacomponent isstandard quality.
Then,
P(A) = 0.18 P(B) = 0.65
P(A ∪B) = 0.18 +0.65 = 0.83
Note thatinthisexample
P(A ∪B) =P(A) +P(B)
In Example 28.1 the eventsAandBcould not possibly occur together. A component is
either top quality or standard quality but cannot be both. We sayAandBare mutually
exclusivebecausetheoccurrenceofoneexcludestheoccurrenceoftheother.Theresult
applies more generally.
E
E i
E j
If the occurrence of either of eventsE i
orE j
excludes the occurrence of the other,
thenE i
andE j
aresaidtobemutuallyexclusive events.
IfE i
andE j
aremutuallyexclusive wedenotethis by
Figure28.5
E i andE j are mutually
exclusive eventsand so
are depicted asdisjoint
sets.
E i
∩E j
= ◦/
Weuse◦/todenotetheemptyset,thatisasetwithnoelements.Ineffectwearestating
that the compound eventE i
∩E j
is an impossible event and so will never occur. On a
Venn diagramE i
andE j
are shown as disjoint sets (see Figure 28.5). Suppose thatE 1
,
E 2
,...,E n
arenevents and that in a single trial only one of these events can occur. The
occurrenceofanyevent,E i
,excludestheoccurrenceofallotherevents.Sucheventsare
mutuallyexclusive.
Formutuallyexclusive events the addition lawofprobabilityapplies:
P(E 1
orE 2
or ... orE n
)=P(E 1
∪E 2
∪···∪E n
)
=P(E 1
)+P(E 2
)+···+P(E n
)
28.3 Mutually exclusive events: the addition law of probability 911
Engineeringapplication28.3
Electricalcomponentreliability
Electrical components fail after a certain length of time in use. Their lifespan is not
fixed but subject to variation. It is important to be able to quantify the reliability of
these components. Consider the following problem.
Thelifespansof5000electricalcomponentsaremeasuredtoassesstheirreliability.
The lifespan (L) is recorded and the results are shown in Table 28.2. Find the
probability thatarandomly selected component will last
(a) more than 3 years
(b) between 3 and 5 years
(c) lessthan 4 years
Table28.2
Thelifespansof5000electricalcomponents.
Lifespanofcomponent(years)
Number
L>5 500
4<L5 2250
3<L4 1850
L3 400
Solution
We define eventsA,B,C andD:
A: the component lasts more than 5 years
B: the component lasts between 4 and 5 years
C: the component lasts between 3 and 4 years
D: the component lasts 3 years or less
P(A) = 500 2250 1850
= 0.1, P(B) = = 0.45, P(C) =
5000 5000 5000 = 0.37,
P(D) = 400
5000 = 0.08.
The eventsA,B,C andDare clearly mutually exclusive and so the addition law
may be applied.
(a) P(component lasts more than 3 years) =P(A ∪B∪C)
=P(A) +P(B) +P(C)
=0.1 +0.45 +0.37
=0.92
There isa92% chance a component will lastformore than 3 years.
(b) P(component lasts between 3 and 5 years) =P(B ∪C)
=P(B) +P(C)
=0.45 +0.37
=0.82 ➔
912 Chapter 28 Probability
(c) P(component lasts less than 4 years) =P(C ∪D)
=P(C) +P(D)
=0.37 +0.08
=0.45
Engineeringapplication28.4
Calculatingtheprobabilityacomponentisacceptableon
afactoryproductionlinewithseveralmachines
Sometimesafactoryproductionlinemaybefedbyseveralmachinesthatmakecomponents.
It is important to be able to calculate the overall quality of the product that
emerges given knowledge of the performance of the individual machines. Consider
the following problem.
MachinesAandBmakecomponents.MachineAmakes60%ofthecomponents.
The probability that a component is acceptable is 0.93 when made by machine A
and0.95whenmadebymachineB.Acomponentispickedatrandom.Calculatethe
probability thatitis
(a) made by machine Aand isacceptable
(b) made by machine Band isacceptable
(c) acceptable
Solution
WehavealreadylookedatthisprobleminEngineeringapplication28.2.Figure28.4
shows the tree diagram for the problem.
(a) P (component ismade by machine Aand isacceptable) = 558
1000 = 0.558.
(b) P (component ismade by machine Band isacceptable) = 380
1000 = 0.38.
(c) Note that the events described in (a) and (b) are mutually exclusive and so the
addition lawcan beapplied.
P(component is acceptable) =P(component ismade by machine Aand isacceptable)
+P(component ismade by
machine Band isacceptable)
= 0.558 +0.38
= 0.938
We can now obtain this probability directly from the tree diagram. We see that
558 +380 = 938 components areacceptable and so
P(component isacceptable) = 938
1000 = 0.938
28.4 Complementary events 913
EXERCISES28.3
1 Acomponent is classifiedasone oftopquality,
standard quality orsubstandard, with respective
probabilities of0.07,0.85 and 0.08.Findthe
probability that acomponent is
(a) either topquality orstandard quality
(b) not topquality
2 Thelifespan (L)ofeach of2000 valvesis measured
andgiven in Table 28.3.
Table28.3
Thelifespans of2000valves.
Lifespan(hours)
Number
L 1000 119
800 L<1000 520
600 L<800 931
400 L<600 230
L < 400 200
Calculate the probability that the lifespanofavalve is
(a) more than800 hours
(b) less than600 hours
(c) between 400 and 800 hours
3 Chipsare manufactured bymachines A andB.
Machine Amakes65% ofthe chips and machine B
makesthe remainder. Theprobability that a chipis
faulty is0.03 when made bymachine A and 0.05
when made by machine B.Achip isselected at
random. Calculate the probability that itis
(a) faulty andmade by machine A
(b) faulty ormade bymachine A
(c) faulty ormade bymachine B
(d) faulty andmade by machine B
(e) faulty
4 Componentsare made bymachines A, B,Cand D.
Machine A makes25%ofthe components, machine
B makes16% ofthe components, machine Cmakes
21% ofthe components and machine Dmakes the
remainder. The probability that acomponent is
acceptable is0.92 when made bymachine A, 0.90
when made by machine B,0.96 when made by
machine Cand 0.93 when made by machine D. A
component ispicked at random. Calculate the
probability that itis
(a) made by machine Aormachine C
(b) made by machines A, B orD
(c) acceptable andmade by machine B
(d) acceptable andmade by machine C
(e) acceptable andmade by machine B oracceptable
and made bymachine C
(f) acceptable ormade bymachine A
(g) acceptable
Solutions
1 (a) 0.92 (b) 0.93
2 (a) 0.3195 (b) 0.215 (c) 0.5805
3 (a) 0.0195 (b) 0.6675 (c) 0.3695
(d) 0.0175 (e) 0.037
4 (a) 0.46 (b) 0.79 (c) 0.144
(d) 0.2016 (e) 0.3456 (f) 0.949
(g) 0.929
28.4 COMPLEMENTARYEVENTS
Consider an electronic circuit. Clearly, either the circuit works correctly or it does not
workcorrectly.LetAbetheeventthatthecircuitworkscorrectly,andletBbetheevent
that the circuit does not work correctly. EitherAorBmust happen when the circuit is
tested, and one excludes the other. The eventsAandBare said to be complementary.
914 Chapter 28 Probability
TheVenndiagramcorrespondingtothissituationisidenticaltothatinFigure28.1where
the setAcorresponds tothe eventA, and the eventBisrepresented byA.
Two events, A and B, are complementary if they are mutually exclusive and in a
singletrialeitherAorBmusthappen.
Hence,ifAandBarecomplementary then
and
P(A) +P(B) =1
P(A∩B)=0
Itis usualtodenotecomplementary events asAandA. For example,
A : the component istop quality
A : the component isnot topquality
B:n>6
B:n6
C : the circuithas failed tomeet the specification
C : the circuithas met the specification
Recall from Engineering application 28.3 that D was the event the component lasts
3 years or less, and thishad probabilityP(D) = 0.08. We could say
D : the component lasts 3 years orless
D : the component lasts more than3years
DandDarecomplementary events and so
P(D) +P(D)=1
P(D)=1−P(D)=1−0.08=0.92
Engineeringapplication28.5
Testingmultipleelectroniccomponents
Three transistors are tested. The probability that none of them works is 0.03. What
isthe probability thatatleastone transistor works?
Solution
We definethe eventsAandA:
A : all threetransistors fail
A : not all three transistorsfail
AandAare complementary events and so
P(A) = 0.03 P(A) =1−P(A) =0.97
28.5 Concepts from communication theory 915
To say that not all three transistors fail is the same as saying that one or more transistorswork.
So we could say
A : atleastone transistor works
Hence the probability thatatleastone transistor works is0.97.
EXERCISES28.4
1 Which pairsofevents are complementary?
A:the componentis reliable
B:there is only onecomponent
C:there are less thantwo components
D:more thantwo components are
reliable
E:the componentis unreliable
F:thereismore thanonecomponent
G:most ofthe components are
unreliable
2 EventsA,B,C andDare defined.
A:the lifespanis90 daysorless
B:the machine is reliable
C:allcomponents have been tested
D: atleastthree components from
the sample are unreliable
State the eventsA,B,C andD.
Solutions
1 AandE;CandF
2 A:the lifespanis more than 90days
B:the machine is unreliable
C:some components have not been tested
D:two orfewer components are unreliable
28.5 CONCEPTSFROMCOMMUNICATIONTHEORY
Communicationengineersfinditusefultoquantifyinformationforthepurposesofanalysis.Inordertodosoaveryrestrictedviewofinformationisused.Informationisseenin
termsofknowledgeofaneventoccurring.Ahighlyimprobableeventoccurringconstitutes
more information than an almost certain event occurring. This correlates to some
extent with human experience as people tend to be much more interested in hearing
aboutunlikely events. The information,I, associatedwith anevent, isdefined by
I=−logp
0<p1
where p = probability of an event occurring. Notice that p = 0 is excluded from the
domain of the function as the logarithm is not defined at 0. In practice this is not a
problem because an event with zero probability never occurs. Often logarithms to the
base 2 are used when calculating information as in many cases information arrives in
the form of a ‘bit stream’ consisting of a series of binary numbers. For this caseI has
unitsofbits and isgiven by
I=−log 2
p
Aformulaforevaluating logarithmstothe base2isgiven inChapter2.
916 Chapter 28 Probability
Engineeringapplication28.6
Informationcontentofabinarydatastream
Suppose that a computer generates a binary stream of data and that 1s and 0s occur
with equal probability, that is P(0) = P(1) = 0.5. Calculate the information per
binary digitgenerated.
Solution
Here, p = 0.5 whether the binary digitis0or 1,so
I = −log 2
(0.5) = −log 10 (0.5)
log 10
2
=1bit
Engineeringapplication28.7
Informationcontentofanalphabeticcharacterstream
Supposethatasystemgeneratesastreamofuppercasealphabeticcharactersandthat
the probability ofacharacteroccurring isthe same forall characters. Calculate
(a) the informationassociatedwith the character Goccurring
(b) the informationassociatedwith any singlecharacter.
Solution
(a) P(Goccurring) = 1
( ) 1
26 , I=−log 2
26
(b) Allcharacters areequally likely tooccur and so
( ) 1
I=−log 2
= −log 10 (1/26) = 4.70 bits
26 log 10
2
= −log 10 (1/26)
log 10
2
= 4.70 bits
Often a series of events may occur that do not have the same probability. For example,
ifastreamofalphabeticcharactersisbeinggeneratedthenitislikelythatsomecharacters
will occur more frequently than others and so have a higher probability associated
with them. For this situation it becomes convenient to introduce the concept of average
information. Given a sourceproducingasetofevents
E 1
,E 2
,E 3
,...,E n
with probabilities
p 1
,p 2
,p 3
,...,p n
thenforalong series ofevents theaverage informationper event isgiven by
∑i=n
H = − p i
log 2
p i
bits
i=1
H isalsotermedtheentropy.
28.5 Concepts from communication theory 917
Engineeringapplication28.8
Entropyofasignalconsistingofthreecharacters
Asourceproducesmessagesconsistingofthreecharacters,A,BandC.Theprobabilities
of each of these characters occurring isP(A) = 0.2,P(B) = 0.5,P(C) = 0.3.
Calculate the entropy of the signal.
Solution
H = −0.2log 2
(0.2) −0.5log 2
(0.5) −0.3log 2
(0.3) = 1.49 bits
Engineeringapplication28.9
Entropyofabinarydatastream
Asourcegeneratesbinarydigits0,1,withprobabilitiesP(0) = 0.3andP(1) = 0.7.
Calculate the entropy of the signal.
Solution
H = −0.3log 2
(0.3) −0.7log 2
(0.7) = 0.881 bits
Note that in Engineering application 28.9, on average, each binary digit only carries
0.881bitsofinformation.Infactthemaximumaverageamountofinformationthatcan
be carried by a binary digit occurs whenP(0) = P(1) = 0.5, as seen in Engineering
application 28.6. For this caseH = 1.
Foranydatastream,themaximumaverageamountofinformationthatcanbecarried
by a digit occurs when all digits have equal probability, that isH is maximized when
p 1
=p 2
=p 3
=···=p n
.
H is maximized when p 1
= p 2
= p 3
= ··· = p n
. The maximum value of H is
denotedH max
.
When the probabilities are not the same then one way of viewing the reduction in
H is to think of the likely event being given too much of the signalling time given its
lower information content. It is interesting to explore the two limiting cases, that is (a)
P(0) = 0,P(1) = 1;(b)P(0) = 1,P(1) = 0.InbothcasesitcanbeshownthatH = 0.
However,onexaminationthisisreasonablebecauseacontinuousstreamof1sdoesnot
relayanyusefulinformationtotherecipientandneitherdoesacontinuousstreamof0s.
The fact that some streams of symbols do not contain as much information as other
streams of the same symbols leads to the concept of redundancy. This allows the efficiency
with which information isbeing sent tobe quantified and isdefined as
redundancy =
maximum entropy -- actual entropy
maximum entropy
Alowvalue of redundancy corresponds toefficient transmission ofinformation.
918 Chapter 28 Probability
Engineeringapplication28.10
Redundancyofabinarydatastream
Consider the source of binary digits examined in Engineering application 28.6 and
28.9. The maximum entropy for a binary stream is 1 bit per binary digit. Calculate
the redundancy ineach case.
Solution
For Engineering application 28.6
redundancy = 1 −1 = 0
1
For Engineering application 28.9
redundancy = 1 −0.881
1
= 0.119
Engineeringapplication28.11
Redundancyofacharacterdatastream
A stream of data consists of four characters A, B, C, D with probabilities 0.1, 0.3,
0.2,0.4,respectively. Calculate the redundancy.
Solution
It can be shown that the maximum entropy, H max
, corresponds to the situation in
whichthe probabilityofeachsymbolisthe same, thatis0.25.
H max
= 4 × (−0.25log 2
(0.25)) = 2 bits
Theactual entropy,H act
, isgiven by
H act
= −(0.1log 2
(0.1) +0.3log 2
(0.3) +0.2log 2
(0.2) +0.4log 2
(0.4))
= 1.846bits
redundancy = 2 −1.846
2
= 0.0770
Intheexampleswehaveexaminedsofarwehaveusedthebitastheunitofinformation
because the most common form of digital signalling uses binary digits. When there are
only two possible events it is possible to represent an event by a single binary digit.
However, if there is a larger number of possible events then several binary digits are
neededtorepresentasingleevent.Whencalculatingvaluesforinformationandentropy
in these examples an assumption was made that each event was represented by binary
sequences or codes of the same length. It is only possible to do this efficiently if the
numberofeventsisapowerof2,thatis2,4,8,16, ....Inpracticethisproblemdoesnot
arise because it is more common to produce codes that have a small number of binary
digitsforlikelyeventsandalongnumberofbinarydigitsforunlikelyevents.Thisallows
28.6 Conditional probability: the multiplication law 919
the redundancy of a data stream to be reduced. The design of such codes is known as
coding theory. One complication is that most streams of data are not transmitted with
100%accuracyasaresultofthepresenceofnoisewithinthecommunicationchannel.It
isoftennecessarytobuildextraredundancyintoacodeinordertorecovertheseerrors.
EXERCISES28.5
1 Asourcegenerates sixcharacters, A, B,C,D, E, F,
with respective probabilities 0.05, 0.1, 0.25, 0.3, 0.15,
0.15.Calculate the average information percharacter
andthe redundancy.
2 Avisual displayunit has aresolutionof600 rowsby
800 columns.Ten different grey levelsare associated
with each pixel andtheir probabilitiesare 0.05,0.07,
0.09,0.10,0.11,0.13,0.12,0.12,0.11,0.10.Calculate
theaverageinformationcontentineachpictureframe.
3 Asourcegenerates five characters A, B,C,DandE
with respective probabilities of0.1, 0.15,0.2, 0.25
and0.3.
(a)Calculate the informationassociated with the
character B.
(b)Calculate the entropy.
(c)Calculate the redundancy.
4 Adata stream comprisesthe characters A, B,C,D
andEwithrespectiveprobabilitiesof0.23,0.16,0.11,
0.37 and 0.13.
(a) Calculate the informationassociated with the
character B.
(b) Calculate the informationassociated with the
character D.
(c) Calculate the entropy.
(d) Calculate the redundancy.
5 A data stream comprisesthe characters A, B,C,D
andEwithrespectiveprobabilitiesof0.12,0.21,0.07,
0.31 and 0.29.
(a) Which character carries the greatest information
content?
(b) Which character carries the leastinformation
content?
(c) Calculate the informationassociated with the
letter D.
(d) Calculate the entropy.
(e) Calculate the redundancy.
Solutions
1 2.3905,0.0752
2 3.2790
3 (a) 2.7370 (b) 2.2282 (c) 0.0404
4 (a) 2.6439 (b) 1.4344 (c) 2.1743
(d) 0.0636
5 (a) C (b) D (c) 1.6897
(d) 2.1501 (e) 0.0740
28.6 CONDITIONALPROBABILITY:THEMULTIPLICATIONLAW
Suppose two machines, M and N, both manufacture components. Of the components
made by machine M, 92% are of an acceptable standard and 8% are rejected. For machine
N, only 80% are of an acceptable standard and 20% are rejected. Consider now
the eventE:
E: a component isof an acceptable standard
If all the components are manufactured by machine M then P(E) = 0.92. However,
if all the components are manufactured by machine N then P(E) = 0.8. If half the
componentsaremanufacturedbymachineMandhalfbymachineNthenP(E) = 0.86.
Toseewhythisissoconsider1000components.OfthehalfmadebymachineM,92%×
920 Chapter 28 Probability
500 = 460 will be of an acceptable standard. Of the half made by machine N, 80% ×
500 = 400 will be acceptable. Hence 860 of the 1000 components will be acceptable
andsoP(E) = 860
1000 = 0.86.Clearly,therearedistinctprobabilitiesofthesameevent;
the probability changes as the conditions change. This is intuitive and leads to the idea
of conditional probability.
We introduce a notation forconditional probability. Define eventsAandBby
A: the component ismanufactured by machine M
B: the component ismanufactured by machine N
Thentheprobabilitythatacomponentisofanacceptablestandard,givenitismanufactured
by machine M, is written asP(E|A). We read this as the conditional probability
ofE givenA. SimilarlyP(E|B) is the probability ofE happening, givenBhas already
happened.
P(E|A) = 0.92 P(E|B) = 0.8
To be pedantic, all probabilities are conditional since the conditions surrounding any
event can change. However, for many situations there is tacitly assumed a definite set
of conditions which is always satisfied. The probability of an event calculated under
onlytheseconditionsisknownastheunconditionalprobability.Iffurtherwell-defined
conditions areattached, the probability isconditional.
Engineeringapplication28.12
Productionlineproductfedbytwomachines
MachinesMandNmanufactureacomponent.Theprobabilitythatthecomponentis
of an acceptable standard is 0.95 when manufactured by machine M and 0.83 when
manufactured by machine N. Machine M supplies 65% of components; machine N
supplies 35%. Acomponent ispicked atrandom.
(a) What isthe probability thatthe component isofanacceptable standard?
(b) Whatistheprobabilitythatacomponentisofanacceptablestandardandismade
by machineM?
(c) What is the probability that the component is of an acceptable standard given it
is made by machineM?
(d) What isthe probability thatthe component wasmade bymachine M?
(e) What is the probability that the component was made by machine M given it is
ofanacceptable standard?
(f) The component is not of an acceptable standard. What is the probability that it
wasmade by machineN?
Solution
We definethe events:
A: the component ismanufactured by machineM
B: the component ismanufactured by machineN
C: the component isofanacceptable standard
28.6 Conditional probability: the multiplication law 921
(a) Consider 1000 components. Then 650 are manufactured by machine M, 350 by
machineN.Ofthe650manufacturedbymachineM,95%willbeacceptable,that
is650× 95
100 = 617.5.Ofthe350manufacturedbymachineN,83%willbeacceptable,thatis350×
83
100 = 290.5.Soonaveragein1000components,617.5+
290.5 = 908 will be acceptable, thatisP(C) = 0.908.
(b) From part (a) we know that out of 1000 components 617.5 will be made by
machine M and be of an acceptable standard. Hence P(A ∩C) = 617.5
1000 =
0.6175.
(c) We requireP(C|A). The probability a component is acceptable given it is manufactured
by machine M is
P(C|A) = 0.95
(d) Machine M makes 65%of components, thatisP(A) = 0.65.
(e) We requireP(A|C). Consider again the 1000 components. On average 908 are
acceptable.Ofthese908acceptablecomponents,617.5aremanufacturedbymachine
M and 290.5 by machine N. We are told the component is acceptable and
sowemustrestrictattentiontothe908acceptablecomponents.So,outof908acceptablecomponents,617.5aremadebymachineM,thatisP(A|C)
= 617.5
908 =
0.68.
(f) We require P(B|C). Consider 1000 components. Machine M manufactures
650 components of which 617.5 are acceptable and hence 32.5 are unacceptable.
Machine N manufactures 350 components of which 290.5 are acceptable
and59.5areunacceptable.Thereare92unacceptablecomponentsofwhich59.5
weremadebymachineN.Theprobabilityofthecomponentbeingmadebymachine
Ngiven itisunacceptable is
P(B|C) = 59.5
92 = 0.647
that is, almost 65% of unacceptable components are manufactured by
machine N.
As an alternative method of solution, we can represent the information via a tree
diagram. This isshown inFigure 28.6.
617.5 (acceptable)
M
650
32.5 (unacceptable)
1000
N
350
290.5 (acceptable)
59.5 (unacceptable)
Figure28.6
Tree diagram forEngineeringapplication
28.12.
➔
922 Chapter 28 Probability
(a) There are617.5 +290.5 = 908 acceptable components, so
P(component isacceptable) = 0.908
(b) There are617.5 components which are acceptable and made by machine M. So
P(component isacceptable and made by machine M)
=P(C ∩A) = 0.6175
(c) Of the 650 components made by machine M, 617.5 areacceptable, and so
P(component isacceptable given itismade by machine M) =P(C|A)
= 617.5
650 = 0.95
(d) Of the 1000 components, 650 aremade by machine M, so
P(component ismade by machine M) =P(A) = 0.65
(e) Thereare908acceptablecomponents.Ofthese,617.5aremadebymachineM.
Hence
P(component ismade by machine M given itisacceptable) =P(A|C)
= 617.5
908 = 0.68
(f) There are 32.5 + 59.5 = 92 unacceptable components. Of these 92, 59.5 are
made by machine N.Thus
P(component ismade by machine Ngiven itisnot acceptable) =P(B|C)
= 59.5
92 = 0.647
28.6.1 Themultiplicationlaw
Consider events A and B for which A ∩B ≠ ◦/ as shown in Figure 28.7. Suppose we
knowthateventAhasoccurredandweseektheprobabilitythatBoccurs,thatisP(B|A).
Knowing event A has occurred we can restrict our attention to the set A. Event B will
occur ifany outcome is inA ∩B. Hence,
E
P(B|A) =
P(A ∩B)
P(A)
A
B
Themultiplication lawofprobabilitystates:
P(A ∩B) =P(A)P(B|A)
A > B
Figure28.7
AandBare not
mutually exclusive.
Since the compound event ‘Aand B’ isidentical to‘B and A’ wemay alsosay
P(A ∩B) =P(B ∩A) =P(B)P(A|B)
Consider Engineering application 28.12(e). We require the probability that the component
was manufactured by machine M, given itisacceptable. This isP(A|C). Now
P(A ∩C) =P(C ∩A) =P(C)P(A|C)
P(A|C) = P(A∩C)
P(C)
28.6 Conditional probability: the multiplication law 923
NowP(A∩C)istheprobabilitythatthecomponentismanufacturedbymachineMand
isacceptable.Thisisknowntobe0.6175,usingEngineeringapplication28.12(b).Also
fromEngineering application 28.12(a) we seeP(C) = 0.908. Hence,
P(A|C) = 0.6175
0.908 = 0.68
Engineeringapplication28.13
Reliabilityofmanufacturedcomponents
A manufacturer studies the reliability of a certain component so that suitable guarantees
can be given: 83% of components remain reliable for at least 5 years; 92%
remain reliable for at least 3 years. What is the probability that a component which
has remainedreliable for3yearswill remainreliablefor5years?
Solution
We definethe events:
A: a component remainsreliableforatleast3years
B: a component remainsreliableforatleast5years
ThenP(A) = 0.92,P(B) = 0.83.Notethattheseareunconditionalprobabilities.We
requireP(B|A), a conditionalprobability.
P(A ∩B) =P(A)P(B|A)
A ∩Bisthecompoundeventacomponentremainsreliableforatleast3yearsandit
remainsreliableforatleast5years. Clearlythis isthe same asthe eventB. So
Hence
P(A ∩B) =P(B)
P(B) =P(A)P(B|A)
P(B|A) = P(B)
P(A) = 0.83
0.92 = 0.90
thatis,90%ofthecomponentswhichremainreliablefor3yearswillremainreliable
for atleast5years.
Alternatively, a tree diagram is shown in Figure 28.8. Starting with 100 components,92remainreliable3yearslater.Ofthese92,2yearslater,83arestillreliable.
So
P(component isreliable after 5 years, given itisreliable after 3 years) = 83
92
= 0.90 ➔
924 Chapter 28 Probability
3 years 5 years
100
92 (reliable)
83 (reliable)
}
9 (unreliable)
17 (unreliable)
8 (unreliable) 8 (unreliable)
Figure28.8
Tree diagram forEngineering application 28.13.
EXERCISES28.6
1 Acomponent is manufactured bymachines 1 and2.
Machine 1 manufactures 72% oftotal production of
the component. Thepercentage ofcomponents which
are acceptable varies, depending uponwhich machine
is used.Formachine 1, 97% ofcomponents are
acceptable and formachine 2, 92% are acceptable. A
component ispicked at random.
(a) What is the probability itwasmanufactured by
machine 1?
(b) What is the probability itisnot acceptable?
(c) What is the probability that itis acceptable and
made bymachine 2?
(d) If the component isacceptable what is the
probability itwasmanufactured bymachine 2?
(e) If the component isnot acceptable what isthe
probability itwasmanufactured bymachine 1?
2 Themeasured lifespans (L)of1500components are
recorded in Table 28.4.
Table28.4
Lifespansof1500components.
Lifespan(hours)
Numberofcomponents
L 1000 210
900 L<1000 820
800 L<900 240
700 L<800 200
L<700 30
(a) Whatistheprobabilitythatacomponentwhichis
still workingafter800 hours willlast foratleast
900 hours?
(b) Whatistheprobabilitythatacomponentwhichis
still workingafter900 hours willcontinue to last
foratleast 1000 hours?
3 Thelifespan,L,of1000 components is measuredand
detailed in Table 28.5.
Table28.5
Lifespansof1000 components.
Lifespan(hours)
Numberofcomponents
L 1750 70
1500 L<1750 110
1250 L<1500 150
1000 L<1250 200
750 L<1000 317
500L<750 96
250L<500 42
L<250 15
(a) Calculate the probability that the lifespanofa
component is more than 1000hours.
(b) Calculate the probability that the lifespanofa
component is lessthan 750 hours.
(c) Calculate the probability that a component which
isstill workingafter 500 hours willcontinue to
last forat least1500hours.
4 MachinesA, Band C manufacture components.
Machine Amakes 50%ofthe components, machine
Bmakes 30%ofthe components andmachine C
makesthe rest. Theprobability that acomponent is
reliable is0.93 when made bymachine A, 0.90 when
made bymachine Band 0.95 when made by machine
C.Acomponent is picked atrandom.
(a) Calculate the probability that itis reliable.
(b) Calculate the probability that itis made by
machine B given itisunreliable.
28.7 Independent events 925
5 Componentsare made by machines A, B,Cand D.
Machine Amakes17% ofthe components, machine
Bmakes 21% ofthe components, machine Cmakes
20% ofthe components and machine D makesthe
remainder. Formachine A, 96% ofthe components
are reliable, formachine B,89% are reliable, for
machine C,92% are reliable and formachine D, 97%
are reliable. Acomponent ispicked atrandom.
Calculate the probability that itis
(a) reliable
(b) notreliable
(c) reliable, given itis madeby machine B
(d) notreliable, given itis made bymachine D
(e) made bymachine Agiven itis reliable
(f) made bymachine Cgiven itis unreliable
Solutions
1 (a) 0.72 (b) 0.044 (c) 0.2576
(d) 0.2695 (e) 0.4909
2 (a) 0.8110 (b) 0.2039
4 (a) 0.925 (b) 0.4
5 (a) 0.9415 (b) 0.0585 (c) 0.89
(d) 0.03 (e) 0.1733 (f) 0.2735
3 (a) 0.53 (b) 0.153 (c) 0.1909
28.7 INDEPENDENTEVENTS
Twoeventsareindependentiftheoccurrenceofeithereventdoesnotinfluencethe
probability of the other event occurring.
Engineeringapplication28.14
Acceptabilityofmanufacturedelectronicchips
Machine 1 manufactures an electronic chip, A, of which 90% are acceptable. Machine2manufacturesanelectronicchip,B,ofwhich83%areacceptable.Twochips
are picked at random, one of each kind. Find the probability that they are both
acceptable.
Solution
The eventsE 1
andE 2
aredefined:
E 1
: chip Aisacceptable
E 2
: chip Bis acceptable
P(E 1
) = 0.9 P(E 2
) = 0.83
A single trial consists of choosing two chips at random. We require the probability
that the compound event,E 1
∩E 2
, istrue.Usingthe multiplication lawwe have
P(E 1
∩E 2
) =P(E 1
)P(E 2
|E 1
) = 0.9P(E 2
|E 1
)
➔
926 Chapter 28 Probability
P(E 2
|E 1
) is the probability ofE 2
happening givenE 1
has happened. However, machine1andmachine2areindependent,sotheprobabilityofchipBbeingacceptable
is in no way influenced by the acceptability of chip A. The events E 1
and E 2
are
independent.
Therefore
P(E 2
|E 1
) =P(E 2
) = 0.83
P(E 1
∩E 2
) =P(E 1
)P(E 2
) = (0.9)(0.83) = 0.75
For independent eventsE 1
andE 2
:
(1)P(E 1
|E 2
) =P(E 1
), P(E 2
|E 1
) =P(E 2
)
(2)P(E 1
∩E 2
) =P(E 1
)P(E 2
)
The concept of independence may be applied to more than two events. Three or more
events are independent if every pair of events is independent. If E 1
,E 2
,...,E n
are n
independent events then
P(E i
|E j
)=P(E i
)
foranyiandj,i≠j
and
P(E i
∩E j
) =P(E i
)P(E j
)
i≠j
Acompoundeventmaycompriseseveralindependentevents.Themultiplicationlawis
extended inanobvious way:
P(E 1
∩E 2
∩E 3
) =P(E 1
)P(E 2
)P(E 3
)
P(E 1
∩E 2
∩E 3
∩E 4
) =P(E 1
)P(E 2
)P(E 3
)P(E 4
)
and soon.
Engineeringapplication28.15
Probabilityoffaultycomponents
The probability that a component is faulty is 0.04. Two components are picked at
random. Calculate the probability that
(a) both components arefaulty
(b) both components arenot faulty
(c) one ofthe components isfaulty
(d) one ofthe components isnotfaulty
(e) atleastone of the components isfaulty
(f) atleastone of the components isnot faulty
28.7 Independent events 927
Solution
We define eventsF 1
andF 2
tobe
F 1
: the first component isfaulty
F 2
: the second component isfaulty
Then eventsF 1
andF 2
are
Then
F 1
: the first component isnotfaulty
F 2
: the second component isnotfaulty
P(F 1
) =P(F 2
) = 0.04
P(F 1
)=P(F 2
)=1−0.04=0.96
(a) We requireboth components tobefaulty:
P(F 1
∩F 2
) =P(F 1
)P(F 2
)
= (0.04) 2
= 0.0016
since events areindependent
(b) We requireboth components tobenot faulty:
P(F 1
∩F 2
)=P(F 1
)P(F 2
)
= (0.96) 2
= 0.9216
(c) Consider the two components. Then either both components are faulty or one
component isfaulty or neither of the components isfaulty. So,
P(one component isfaulty) = 1 −P(both components are faulty)
−P(neither of the components isfaulty)
= 1 −0.0016 −0.9216
= 0.0768
Analternativeapproachisasfollows.Eitherthefirstcomponentisfaultyandthe
second one is not faulty, or the first component is not faulty and the second one
isfaulty. These two cases arerepresented byF 1
∩F 2
andF 1
∩F 2
.So,
P(one component isfaulty) =P(F 1
∩F 2
)+P(F 1
∩F 2
)
=P(F 1
)P(F 2
)+P(F 1
)P(F 2
)
= (0.04)(0.96) + (0.96)(0.04)
= 0.0768
(d) Werequireonecomponenttobenotfaulty.Sincetherearetwocomponentsthen
requiring one to be not faulty is equivalent to requiring one to be faulty. Hence
the calculation isthe same as thatin(c):
P(one component isnot faulty) = 0.0768
(e) At least one of the components is faulty means that one or both of the components
isfaulty:
➔
928 Chapter 28 Probability
P(atleastone component isfaulty) =P(exactly one component isfaulty)
+P(both components arefaulty)
= 0.0768 +0.0016
= 0.0784
Asanalternativewecannotethatthecomplementof‘atleastoneofthecomponentsisfaulty’is‘noneofthecomponentsisfaulty’.Theprobabilitythatneither
of the components is faulty isgiven in(b). Hence
P(atleastone component isfaulty)
= 1 −P(none of the components isfaulty)
= 1 −0.9216
= 0.0784
(f) At least one of the components is not faulty means that one or both of the components
isnotfaulty. So
P(atleastone component isnotfaulty)
=P(exactly one component isnot faulty)
+P(both components are notfaulty)
= 0.0768 +0.9216
= 0.9984
Asanalternativewenotethatthecomplementof‘atleastoneofthecomponents
is not faulty’ is ‘none of the components are not faulty’. The last statement is
equivalent to‘both the components arefaulty’.Hence
P(atleastone component isnot faulty) = 1−P(both components arefaulty)
= 1 −0.0016
= 0.9984
Engineeringapplication28.16
Probabilityofacceptablecomponents
Machines 1, 2 and 3 manufacture resistors A, B and C, respectively. The probabilities
of their respective acceptabilities are 0.9, 0.93 and 0.81. One of each resistor is
selected atrandom.
(a) Find the probability thatthey are all acceptable.
(b) Find the probability thatatleastone resistor isacceptable.
Solution
Define eventsE 1
,E 2
andE 3
by
E 1
: resistor Ais acceptable P(E 1
) = 0.9
E 2
: resistor Bisacceptable P(E 2
) = 0.93
E 3
: resistor Cisacceptable P(E 3
) = 0.81
28.7 Independent events 929
E 1
,E 2
andE 3
areindependent events.
(a) P(E 1
∩E 2
∩E 3
) =P(E 1
)P(E 2
)P(E 3
) = (0.9)(0.93)(0.81) = 0.68
(b) LetE 4
andE 5
be the events
E 4
: atleastone resistor isacceptable
E 5
: no resistorisacceptable
E 4
andE 5
arecomplementary events and so
P(E 4
)+P(E 5
)=1
E 5
may be expressed as
E 5
: resistor Aisnotacceptable and resistor Bisnot acceptable and
resistor Cisnotacceptable
thatis,E 1
∩E 2
∩E 3
P(E 5
) =P(E 1
∩E 2
∩E 3
)=P(E 1
)P(E 2
)P(E 3
)
= (1 −0.9)(1 −0.93)(1 −0.81)
= (0.1)(0.07)(0.19)
= 0.00133
P(E 4
) = 1 −P(E 5
) = 1 −0.00133 = 0.99867
EXERCISES28.7
1 AandBare two independenteventswithP(A) = 0.7
andP(B) = 0.4. The compound eventAoccurs, then
Aoccurs, thenBoccurs, is denotedAAB, andother
compound eventsare denotedin a similarway.
Calculatethe probability ofthe following compound
events:
(a) AAB
(b) BAB
(c) AAAA
2 Theprobabilityacomponentis faultyis 0.07.Two
components are picked at random.Calculate the
probabilitythat
(a) both are faulty
(b) both are not faulty
(c) the firstonepicked is faultyandthe second one
picked isnot faulty
(d) atleast oneisnot faulty
3 Theprobabilityacomponentis acceptable is 0.92.
Three components are picked atrandom. Calculate
the probability that
(a) allthree are acceptable
(b) none are acceptable
(c) exactlytwo are acceptable
(d) atleast two are acceptable
4 Componentsare madebymachines A, BandC.
Machine A makes30%ofthe components,machine
B makes25% ofthe components andmachine C
makes the rest. Two components are picked at
random. Calculate the probability that
(a) they are both madebymachine C
(b) oneismadeby machine Aandoneismadeby
machine B
(c) exactlyoneofthe components ismadeby
machine B
(d) atleast oneofthe components ismadeby
machine B
(e) both components are madebythe same machine
5 Capacitorsaremanufacturedbyfourmachines,1,2,3
and4.Theprobability a capacitor is manufactured
acceptablyvaries according to the machine.The
930 Chapter 28 Probability
probabilitiesare 0.94, 0.91,0.97 and0.94,
respectively, formachines 1, 2, 3 and 4.
(a) Acapacitor istaken from each machine. What is
the probability all fourcapacitors are acceptable?
(b) Twocapacitorsaretakenfrommachine1andtwo
from machine 2. What isthe probability all four
capacitors are acceptable?
(c) Acapacitor istaken from each machine.
Calculate the probability that atleastthree
capacitors are acceptable.
(d) Acapacitor istaken from each machine. From
thissampleoffour capacitors, one istaken at
random.
(i) What is the probability itis acceptable and
made by machine 1?
(ii) What is the probability itis acceptable and
made by machine 2?
(e) Acapacitor istaken from each machine. From
thissampleoffour capacitors, one istaken at
random. What isthe probability itis acceptable?
[Hint: use the resultsin (d).]
Solutions
1 (a) 0.196 (b) 0.112 (c) 0.2401
2 (a) 0.0049 (b) 0.8649 (c) 0.0651
(d) 0.9951
3 (a) 0.7787 (b) 5.12 ×10 −4
(c) 0.2031 (d) 0.9818
4 (a) 0.2025 (b) 0.15 (c) 0.375
(d) 0.4375 (e) 0.355
5 (a) 0.7800 (b) 0.7317 (c) 0.9808
(d) (i) 0.235 (ii) 0.2275 (e) 0.94
REVIEWEXERCISES28
1 Componentsare made bymachines A, B and C.
Machine Amakes 30% ofthe components, machine
B makes50% ofthe components and machine C
makes the rest. Theprobability that a component is
acceptable when made by machine Ais 0.96, when
made bymachine Bthe probability is0.91 andwhen
made bymachine Cthe probability is0.93.
(a) Acomponent is picked atrandom. Calculate the
probability that itis made bymachine C.
(b) Acomponent is picked atrandom. Calculate the
probability that itis made byeither machine Aor
machine C.
(c) Acomponent is picked atrandom. Calculate the
probability that itis made bymachine Band is
acceptable.
(d) Acomponent is picked atrandom. Calculate the
probability that itis notacceptable.
(e) Acomponent picked atrandom is notacceptable.
Calculate the probability that itismade by
machine A.
(f) Acomponent picked atrandom is acceptable.
Calculate the probability that itismade by either
machine Aormachine B.
(g) Two components are picked atrandom. Calculate
the probability that they are both acceptable.
(h) Two components are picked atrandom. Calculate
the probability that oneis acceptable and oneis
unacceptable.
2 Three machines, A, Band C,manufacture a
component. Machine A manufactures 35%ofthe
components, machine B manufactures 40%ofthe
components and machine C makesthe rest.A
component is eitheracceptable ornotacceptable: 7%
ofcomponents made bymachine Aare not
acceptable, 12%ofcomponents made by machine B
are not acceptable and 2% ofthosemade bymachine
Care alsonotacceptable.
(a) Findthe probability that acomponent is made by
eithermachine Aormachine B.
(b) Two components are picked atrandom. What is
the probability that they are both made by
machine B?
(c) Three components are picked atrandom. What is
the probability they are each made by adifferent
machine?
Review exercises 28 931
(d) A component is picked atrandom. What is the
probability itis notacceptable?
(e) A component is picked atrandom. Itisnot
acceptable. What isthe probability itwas made
by machine B?
(f) A component is picked atrandom and is
acceptable. What isthe probability itwas made
by eithermachine Aormachine B?
3 MachinesMandNmanufacture components. The
probability that acomponent isofan acceptable
standard is 0.93 when manufactured bymachine M
and0.86 when manufactured by machine N. Machine
Msupplies70% ofthe components andmachine N
suppliesthe rest.
(a) Calculate the probability that acomponent picked
atrandom is ofan acceptable standard.
(b) Acomponent isnot ofan acceptable standard.
Calculate the probability that itismade by
machine N.
(c) Two components are picked at random. Calculate
the probability that they are made bydifferent
machines.
4 E 1 is the event the component is reliable.E 2 is the
event the component ismade bymachine A. State
(a) E 1
(b) E 2
(c) E 1 ∩E 2
(d) E 1 ∪E 2
(e) E 2 ∩E 1
5 Thelifespans,L,of2500components were measured
andthe resultswere recorded in Table 28.6.
Table28.6
Lifespan(hours)
Numberofcomponents
0L<200 76
200 L<300 293
300 L<400 574
400 L<500 1211
500 L<600 346
(a) Calculate the probability thatacomponenthas a
lifespanofover 400 hours.
(b) Calculate the probability thatacomponenthas a
lifespanofless than300 hours.
(c) Calculate the probability thatacomponentthat
lasts for300 hours willcontinueto lastforat
least 500 hours.
6 Given the eventsE 1 ,E 2 andE 3 are
E 1 :the circuitisfunctioning
E 2 :the circuitismadeby machine A
E 3 :the circuitwilllast foratleast600 hours state
(a) E 2
(b) E 3
(c) E 1 ∪E 2
(d) E 1 ∩E 3
(e) E 2 ∩E 3
7 A data stream consistsofthe characters A, B,C,D
andE, with respective probabilities of0.16,0.23,
0.31, 0.12 and0.18.
(a) Calculate the informationassociated with the
character B.
(b) Calculate the entropy.
(c) Calculate the redundancy.
8 The probabilityacomponentis reliableis0.93.Three
components are picked atrandom. Calculatethe
probability that
(a) all components are unreliable
(b) exactly onecomponent isreliable
(c) at leastonecomponentis reliable
(d) exactly two components are reliable
9 A circuitis asshown in Figure 28.9.Thecircuitis
operational ifcurrent can flowfrom Pto Q alongany
route. Theprobability thatresistor typeAis faultyis
0.09, andforresistor typeBthe probability ofafault
is 0.14.Calculate the probability thatthe circuitis
operational.
P
Figure28.9
A
A
10 The probabilityacomponentis reliableis0.96.Five
components are picked atrandom. Calculatethe
probability that
(a) allfive components are reliable
(b) none are reliable
(c) atleast oneis reliable
B
A
B
Q
932 Chapter 28 Probability
Solutions
1 (a) 0.2 (b) 0.5 (c) 0.455
(d) 0.071 (e) 0.1690 (f) 0.7998
(g) 0.8630 (h) 0.1319
2 (a) 0.75 (b) 0.16 (c) 0.21
(d) 0.0775 (e) 0.619 (f) 0.734
3 (a) 0.909 (b) 0.4615 (c) 0.42
4 (a) Thecomponent isnot reliable.
(b) Thecomponent isnot made bymachine A.
(c) Thecomponent isreliable and made by
machine A.
(d) Thecomponent isreliable oritis made by
machine A.
(e) Thecomponent ismade by machine A anditis
notreliable.
5 (a) 0.6228 (b) 0.1476 (c) 0.1624
6 (a) Thecircuit isnot made bymachine A.
(b) Thecircuit willlast forlessthan 600 hours.
(c) Thecircuit isfunctioning oritis made by
machine A.
(d) Thecircuit isfunctioning and willlast foratleast
600 hours.
(e) Thecircuit ismade by machine Aand willlast
foratleast600 hours.
7 (a) 2.1203 (b) 2.2469 (c) 0.0323
8 (a) 3.43 ×10 −4 (b) 0.0137 (c) 0.9997
(d) 0.1816
9 0.9948
10 (a) 0.8154 (b) 1.024 ×10 −7
(c) 0.9999999
29 Statisticsandprobability
distributions
Contents 29.1 Introduction 933
29.2 Randomvariables 934
29.3 Probabilitydistributions--discretevariable 935
29.4 Probabilitydensityfunctions--continuousvariable 936
29.5 Meanvalue 938
29.6 Standarddeviation 941
29.7 Expectedvalueofarandomvariable 943
29.8 Standarddeviationofarandomvariable 946
29.9 Permutationsandcombinations 948
29.10 Thebinomialdistribution 953
29.11 ThePoissondistribution 957
29.12 Theuniformdistribution 961
29.13 Theexponentialdistribution 962
29.14 Thenormaldistribution 963
29.15 Reliabilityengineering 970
Reviewexercises29 977
29.1 INTRODUCTION
Engineersoftenneedtomeasuremanydifferentvariables,forexampletheoutputvoltage
of a system, the strength of a beam or the cost of a project. Variables are classified into
one of two types,discrete orcontinuous. These arediscussed inSection 29.2.
It is of interest to know the probability that a variable has of falling within a given
range. There is a high probability that a given variable will fall within some ranges,
and a small probability for other ranges of values. The way in which the probability is
934 Chapter 29 Statistics and probability distributions
distributedacrossvariousrangesofvaluesgivesrisetotheideaofaprobabilitydistribution.Thestudyofprobabilitydistributionsformsthemajorpartofthischapter.Themost
important distributions, the binomial, the Poisson, the uniform, the exponential and the
normal,areallincluded. Thechapter concludes withastudyofreliabilityengineering.
29.2 RANDOMVARIABLES
Engineeringquantitieswhosevariationcontainsanelementofchancearecalledrandom
variables. Some examples are listedbelow:
(1) the diameter of a motor shaft of nominal size0.2 m;
(2) the weight of a steelbox used tocontain anelectronic circuitboard;
(3) thenumberofcomponentspassingapointonafactoryproductionlinein1minute;
(4) the nominal resistance value of resistors;
(5) the length oftime a machine works withoutfailing.
All of these quantities vary. In (1), (2) and (5) the quantities vary continuously; that
is, they can assume any value in some range. For example, a motor shaft may have any
diameterbetween0.197mand0.203m;asteelboxcouldhaveanyweightbetween,say,
0.345 kg and 0.352 kg. These are examples of continuous variables. The value itself
will only be recorded to a certain accuracy, which depends on the measuring device
and the use to which the data will be put. For example, the shaft diameter may only
be measured to the nearest tenth of a millimetre. Although such a measurement is only
integerinmultiplesofatenthofamillimetre,thevariablebeingmeasurediscontinuous.
In(3)and(4)thevariablescanassumeonlyalimitednumberofvalues.Thenumberof
componentspassingapointinaminutewillbeanon-negativeinteger0,1,2,3....The
nominal resistance value of resistors has a limited number of values which is specified
by manufacturers in their catalogues. Variables such as these, which can assume only a
limitedset of values, arecalleddiscrete variables.
EXERCISES29.2
1 Isthe length oftime amachine workswithout failing
a continuous oradiscrete variable?
2 State whether the following variables are continuous
ordiscrete:
(a)the length ofabridge
(b)the number ofelectrical sockets in ahouse
(c)the length ofcable used to wire ahouse
(d)the weight ofsolderused to build acircuit board
3 Statewhether the following variablesare discreteor
continuous:
(a) the force requiredto stretch a spring bya
specifiedlength
(b) the output voltage ofasystem
(c) the height ofacolumn ofliquid
(d) the number ofresistorsin acircuit
(e) the number ofbitsofmemory ofacomputer
Solutions
1 continuous
2 (a) continuous (b)discrete
(c) continuous (d)continuous
3 (a)continuous (b)continuous (c)continuous
(d)discrete (e)discrete
29.3 Probability distributions -- discrete variable 935
29.3 PROBABILITYDISTRIBUTIONS--DISCRETEVARIABLE
The range of values that a variable can take does not give sufficient information about
the variable. We need to know which values are likely to occur often and which values
willoccuronlyinfrequently.Forexample,supposexisadiscreterandomvariablewhich
cantakevalues0,1,2,3,4,5and6.Wemayaskquestionssuchas‘Whichvalueismost
likelytooccur?’,‘Isa6morelikelytooccurthana5?’,andsoon.Weneedinformation
on the probability of each value occurring. Suppose that information is provided and is
given in Table 29.1. Ifxis sampled 100 times then on average 0 will occur 10 times, 1
will occur 10 times, 2 will occur 15 times and so on. Table 29.1 is called aprobability
distribution for the random variable x. Note that the probabilities sum to 1; the table
tells us how the total probability isdistributed among the various possible values of the
random variable. Table 29.1 may be represented ingraphical form (see Figure 29.1).
P(x)
0.3
0.2
Table29.1
The probability of a discrete
value occurring.
x 0 1 2 3 4 5 6
P(x) 0.1 0.1 0.15 0.3 0.2 0.1 0.05
0.1
0 1 2 3 4 5 6
Figure29.1
Plotted data ofTable 29.1.
x
EXERCISES29.3
1 Theprobabilitydistribution forthe randomvariable,
x,is
x 2 2.5 3.0 3.5 4.0 4.5
P(x) 0.07 0.36 0.21 0.19 0.10 0.07
(a) StateP(x = 3.5)
(b) CalculateP(x 3.0)
(c) CalculateP(x < 4.0)
(d) CalculateP(x > 3.5)
(e) CalculateP(x 3.9)
(f) The variable,x,is sampled50000 times. How
many timeswould you expectxto have a value
of2.5?
2 The probabilitydistribution ofthe randomvariable,y,
is given as
y −3 −2 −1 0 1 2 3
P(y) 0.63 0.20 0.09 0.04 0.02 0.01 0.01
Calculate
(a) P(y 0) (b) P(y 1)
(c) P(|y| 1) (d) P(y 2 > 3)
(e) P(y 2 < 6)
Solutions
1 (a) 0.19 (b) 0.57 (c) 0.83 (d) 0.17
(e) 0.83 (f) 18000
2 (a) 0.08 (b) 0.98 (c) 0.15 (d) 0.85
(e) 0.36
936 Chapter 29 Statistics and probability distributions
29.4 PROBABILITYDENSITYFUNCTIONS--CONTINUOUS
VARIABLE
Suppose x is a continuous random variable which can take any value on [0, 1]. It is
impossible to list all possible values because of the continuous nature of the variable.
There are infinitely many values on [0, 1] so the probability of any one particular value
occurring is zero. It is meaningful, however, to ask ‘What is the probability ofxfalling
inasub-interval,[a,b]?’Dividing[0,1]intosub-intervalsandattachingprobabilitiesto
each sub-interval will result in a probability distribution. Table 29.2 gives an example.
The probability thatxwill lie between 0.4 and 0.6 is 0.35, that isP(0.4 x < 0.6) =
0.35. Similarly,
P(0.2 x < 0.4) = 0.25
Figure 29.2 shows the table in a graphical form. By making the sub-intervals smaller a
more refined distributionisobtained. Table 29.3 and Figure 29.3 illustratethis.
The probability thatxlies in a particular interval is given by the sum of the heights
of the rectangles on that interval. For example, the probability thatxlies in [0.5, 0.8] is
0.2+0.1+0.1 = 0.4;thatis,thereisaprobabilityof0.4thatxliessomewherebetween
0.5 and 0.8.Note thatthe sum ofall the heights is1,representing total probability.
Considerthesub-interval[a,b].Werequiretheprobabilitythatxliesinthisinterval.
The way this is answered is by means of a probability density function (p.d.f.), f (x).
Suchap.d.f.isshowninFigure29.4,whereitistheareaunderthegraphbetweenaand
Table29.2
Probability thatxlies in agiven sub-interval.
x [0,0.2) [0.2, 0.4) [0.4,0.6) [0.6,0.8) [0.8,1.0]
P(x) 0.1 0.25 0.35 0.2 0.1
Table29.3
Refining the sub-intervals in Table 29.2.
x [0,0.1) [0.1, 0.2) [0.2,0.3) [0.3,0.4) [0.4, 0.5)
P(x) 0.03 0.07 0.1 0.15 0.15
x [0.5, 0.6) [0.6, 0.7) [0.7,0.8) [0.8,0.9) [0.9, 1.0]
P(x) 0.2 0.1 0.1 0.07 0.03
P(x)
0.4
0.3
0.2
P(x)
0.2
0.1
0.1
0
0.2 0.4 0.6 0.8 1.0
x
0
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1.0
x
Figure29.2
Plotted data ofTable 29.2.
Figure29.3
Plotted data ofTable 29.3.
29.4 Probability density functions -- continuous variable 937
f(x)
0 a b
x
Figure29.4
Shaded area representsP(a xb).
bthatgivesP(a xb).
P(a xb) = areaabove[a,b] =
The total area under a p.d.f. isalways 1.
∫ b
a
f(x)dx
Example29.1 Supposexisacontinuousrandomvariabletakinganyvalueon[1,4].Itsp.d.f., f (x),is
given by
f(x)= 1
2 √ x
1x4
(a) Check that f (x)isasuitable function forap.d.f.
(b) What isthe probability that (i)xlieson [2, 3.5],(ii)x 2,(iii)x < 3?
Solution (a) x can have any value on [1, 4]. For f (x) to be a p.d.f., then the total area under it
should equal 1,thatis
∫ 4
1
∫ 4
1
f(x)dx=1
f(x)dx=
∫ 4
1
1
2 √ x dx=[√ x] 4 1 = 1
Hence f (x) isasuitablefunction forap.d.f.
(b) (i) P(2 x 3.5) =
(ii) P(x 2) =
(iii) P(x < 3) =
∫ 4
2
∫ 3
1
∫ 3.5
2
f(x)dx=[ √ x] 3.5
2
= 0.457
f(x)dx=[ √ x] 4 2 = 0.586
f(x)dx=[ √ x] 3 1 = 0.732
Example29.2 Arandomvariable,z, has a p.d.f. f (z)where
f(z)=e −z
0z<∞
Calculate the probabilitythat
(a) 0z2
(b) z ismorethan1
(c) z islessthan0.5
938 Chapter 29 Statistics and probability distributions
Solution Note that ∫ ∞
0
e −z dz = 1 sothat f (z) = e −z issuitableas a p.d.f.
(a)P(0z2)= ∫ 2
0 e−z dz = [−e −z ] 2 0 = 0.865
(b) P(z >1) = ∫ ∞
1
e −z dz = [−e −z ] ∞ 1
= 0.368
(c) P(z < 0.5) = [−e −z ] 0.5
0
= 0.393
EXERCISES29.4
1 f(x) =kx 2 ,k constant, −1 x1. f(x)isap.d.f.
(a) What is the value ofk?
(b) Calculate the probability thatx > 0.5.
(c) IfP(x >c) = 0.6 then what isthe value ofc?
2 f (x)is ap.d.f. forthe random variablex,which can
varyfrom0to10.ItisillustratedinFigure29.5.What
is the probability thatxlies in [2,4]?
f(x)
0 10 x
Figure29.5
Probability densityfunctionforQuestion 2.
3 Ap.d.f. isgiven by
f(z)=2e −2z
0z<∞
(a) If200 measurements ofzare made, howmany,
onaverage, willbe greaterthan1?
(b) If50% ofmeasurementsare less thank,findk.
4 Ap.d.f.,h(x),is defined by
Calculate
h(x) = 3 4 (1−x2 )
(a) P(0 x 0.5)
(b) P(−0.3 x0.7)
(c) P(|x| < 0.5)
(d) P(x > 0.5)
(e) P(x 0.7)
5 (a) Verify that
f(t)=λe −λt t 0
issuitable asap.d.f.
(b) CalculateP(t 2) if λ = 3.
−1x1
Solutions
1 (a) 1.5 (b) 0.4375 (c) −0.5848
2 0.28
3 (a) 27 (b) 0.3466
4 (a) 0.3438 (b) 0.6575 (c) 0.6875
(d) 0.1563 (e) 0.9393
5 (b) 2.479 ×10 −3
29.5 MEANVALUE
If {x 1
,x 2
,x 3
,...,x n
} is a set ofnnumbers, then the mean value of these numbers, denotedbyx,is
x =
sum ofthe numbers
n
=
∑
xi
x issometimes called thearithmetic mean.
n
29.5 Mean value 939
Example29.3 Find the mean of −2.3, 0,1,0.7.
Solution
x = −2.3+0+1+0.7
4
= −0.6
4
= −0.15
The mean value is a single number which characterizes the set of numbers. It is useful
inhelpingtomake comparisons.
Engineeringapplication29.1
Comparisonoftwoproductionmachines
A component is made by two machines, A and B. The lifespans of six components
made byeachmachineare recordedinTable 29.4. Whichisthe preferred machine?
Table29.4
Thelifespans ofsixcomponents madeby machines AandB.
Lifespan(hours)
Machine A 92 86 61 70 58 65
Machine B 64 75 84 80 63 70
Solution
For componentsmanufacturedby machineA
average lifespan = 432
6 = 72
For components manufactured by machine B
average lifespan = 436
6 = 72.7
Machine B produces components with a higher average lifespan and so is the preferredmachine.
Example29.4 A variable, x, can have values 2, 3, 4, 5 and 6. Many observations of x are made and
denotedbyx i
. They have correspondingfrequencies, f i
. Theresults are asfollows:
value(x i ) 2 3 4 5 6
frequency (f i ) 6 9 3 7 4
Calculate the mean ofx.
Solution Thesumofallthemeasurementsmustbefound.Thevalue2occurssixtimes,contributing12
tothe total.Similarly, 3 occursninetimescontributing 27 tothe total.Thus
total =2(6) +3(9) +4(3) +5(7) +6(4) =110
940 Chapter 29 Statistics and probability distributions
The number of measurements made is6 +9+3+7+4 = 29. So,
mean =x= 110
29 = 3.79
Example 29.4 illustratesageneralprinciple.
If valuesx 1
,x 2
,...,x n
occur with frequencies f 1
, f 2
,..., f n
then
∑
xi f
x = ∑ i
fi
EXERCISES29.5
1 Engineersinadesigndepartmentareassessedbytheir
leader. A ‘0’is ‘Terrible’ anda‘5’is‘Outstanding’.
The29 members ofthe department are evaluated and
their scores recorded asfollows:
score 0 1 2 3 4 5
number ofstaff 2 5 6 9 4 3
Calculate the meanoutputvoltage.
4 Thecurrent, in amps,through aresistor is measured
140 times. The resultsare:
current(amps) 2.25 2.50 2.75 3.00 3.25
number of 32 27 39 22 20
measurements
Whatis the meanscore forthe wholedepartment?
2 Two samples ofnailsare taken.Thefirst sample has
12nailswith ameanlength of2.7cm, the second
sample has 20nailswith a meanlength of2.61 cm.
Whatis the meanlength ofall32nails?
3 Theoutput,in volts, from asystem is measured40
times. Theresults are recorded asfollows:
voltage(volts) 9.5 10.0 10.5 11.0 11.5
number of 6 14 8 7 5
measurements
Calculate the meancurrent through the resistor.
5 Inacommunication network, packets ofinformation
travel alonglines. The numberoflinesusedby each
packet variesaccording to the following table:
number oflinesused 1 2 3 4 5
number ofpackets 17 54 32 6 1
Calculate the meannumber oflinesusedperpacket.
Solutions
1 2.586
2 2.64
4 2.70
5 2.27
3 10.39
29.6 Standard deviation 941
29.6 STANDARDDEVIATION
Althoughthemeanindicateswherethecentreofasetofnumberslies,itgivesnomeasure
ofthespreadofthenumbers.Forexample, −1,0,1and −10,0,10bothhaveameanof
0 but clearly the numbers in the second set are much more widely dispersed than those
inthe first. Acommonly used measure of dispersion isthestandard deviation.
Let x 1
,x 2
,...,x n
be n measurements with a meanx. Then x i
−x is the amount by
whichx i
differs from the mean. The quantityx i
−x is called the deviation ofx i
from
the mean. Some of these deviations will be positive, some negative. The mean of these
deviations is always zero (see Question 3 in Exercises 29.6) and so this is not helpful
in measuring the dispersion of the numbers. To avoid positive and negative deviations
summing to zero the squared deviation is taken, (x i
−x) 2 . The variance is the mean of
the squared deviations:
variance =
∑ (xi −x) 2
n
and
standarddeviation = √ variance
Standard deviation has the same units as thex i
.
Example29.5 Calculate the standarddeviation of
(a) −1,0,1
(b) −10, 0,10
Solution (a) x 1
= −1,x 2
= 0,x 3
= 1.Clearlyx=0.
x 1
−x=−1 x 2
−x=0 x 3
−x = 1
variance = (−1)2 +0 2 +1 2
3
= 2 3
standard deviation =
√
2
3 = 0.816
(b) x 1
= −10,x 2
= 0,x 3
= 10. Againx=0andsox i
−x=x i
,fori=1,2,3.
variance = (−10)2 +0 2 +10 2
= 200
3 3
√
200
standard deviation =
3 = 8.165
As expected, the second set has a much higher standard deviation than the first.
942 Chapter 29 Statistics and probability distributions
Example29.6 Find the standard deviation of −2,7.2,6.9, −10.4, 5.3.
Solution x 1
= −2,x 2
= 7.2,x 3
= 6.9,x 4
= −10.4,x 5
= 5.3,x=1.4
x 1
−x = −3.4
x 2
−x=5.8
x 3
−x=5.5
x 4
−x = −11.8
x 5
−x=3.9
variance = (−3.4)2 + (5.8) 2 + (5.5) 2 + (−11.8) 2 + (3.9) 2
√ 5
229.9
standard deviation = = 6.78
5
= 229.9
5
Calculatingx i
−x,i = 1,2,...,n,istediousforlargenandsoamoretractableformof
the standard deviation is sought. Firstly observe that ∑ x i
=nx and ∑ x 2 =nx 2 , since
x 2 isaconstant. Now
∑
(xi −x) 2 = ∑ x 2 i
− ∑ 2x i
x + ∑ x 2
= ∑ x 2 i
−2x ∑ x i
+nx 2
= ∑ x 2 i
−2x(nx)+nx 2
= ∑ x 2 i
−nx 2
Hence,
and
variance =
∑ x
2
i
−nx 2
n
√ ∑x
2
i
−nx 2
standard deviation =
n
Using these formulae itisnot necessary tocalculatex i
−x.
Example29.7 RepeatExample 29.6 usingthe newly derived formulae.
Solution ∑ xi 2 = (−2) 2 + (7.2) 2 + (6.9) 2 + (−10.4) 2 + (5.3) 2 = 239.7
√
239.7 −5(1.4)
2
standard deviation =
= 6.78
5
29.7 Expected value of a random variable 943
EXERCISES29.6
1 Calculate the means andstandard deviationsof:
(a) 1, 2, 3, 4, 5
(b) 2.1, 2.3, 2.7, 2.6
(c) 37, 26,19, 21,19,25, 17
(d) 6, 6, 6, 6, 6, 6
(e) −1, 2, −3, 4, −5, 6
2 Asetofmeasurements {x 1 ,x 2 ,x 3 , . . . ,x n }hasamean
ofxandastandard deviation ofs.What are the mean
and standard deviation ofthe set
{kx 1 ,kx 2 ,kx 3 ,...,kx n }wherek isaconstant?
3 The meanofthe numbers {x 1 ,x 2 ,x 3 , . . . ,x n }isx.
Show that the sum ofthe deviations about the mean
is 0; that is,show ∑ n
i=1 (x i −x)=0.
Solutions
1 (a) mean = 3; st.dev. = 1.414
(b) 2.425,0.238
(c) 23.4,6.321
(d) 6,0
(e) 0.5,3.862
2 mean=kx,st.dev. =ks
29.7 EXPECTEDVALUEOFARANDOMVARIABLE
InSections 29.5and29.6 weshowedhowtocalculatethe mean andstandarddeviation
ofagivensetofnumbers.Noreferencewasmadetoprobabilitydistributionsorp.d.f.s.
Supposenowthatwehaveknowledgeoftheprobabilitydistributionofadiscreterandom
variable or the p.d.f. of a continuous random variable. The mean value of the random
variable can still be found. Under these circumstances the mean value is known as the
expectedvalueor expectation.
29.7.1 Expectedvalueofadiscreterandomvariable
Supposefordefinitenessthatxisadiscreterandomvariablewithprobabilitydistribution
asgiven inTable29.5.
Table29.5
Probabilitydistribution foradiscrete randomvariablex.
x 0 1 2 3 4
P(x) 0.1 0.2 0.4 0.15 0.15
In100trialsxwillhaveavalueofzero10timesonaverage,avalueofone20timeson
average and soon. Themean value, thatisthe expected value,istherefore
expected value =
0(10) +1(20) +2(40) +3(15) +4(15)
100
We could have arranged the calculation as follows:
expected value = 0 ( ) (
10
100 +1 20
) (
100 +2 40
100
)
+3
( 15
100
= 2.05
) (
+4
15
)
100
= 0(0.1) +1(0.2) +2(0.4) +3(0.15) +4(0.15)
944 Chapter 29 Statistics and probability distributions
Each termisof the form (value) ×(probability). Thus,
∑i=5
expected value = x i
P(x i
)
i=1
The symbol, µ, isused todenote the expected value of a random variable.
If a discrete random variable can take values
x 1
,x 2
,...,x n
with probabilitiesP(x 1
),P(x 2
),...,P(x n
), then
∑i=n
expected value ofx = µ = x i
P(x i
)
i=1
Example29.8 Arandom variable,y, has a known probability distributiongiven by
y 2 4 6 8 10
P(y) 0.17 0.23 0.2 0.3 0.1
Find the expected value ofy.
Solution We have
µ = expectedvalue = 2(0.17) +4(0.23) +6(0.2) +8(0.3) +10(0.1) = 5.86
29.7.2 Expectedvalueofacontinuousrandomvariable
Suppose a continuous random variable,x, has p.d.f. f (x),a x b. The probability
thatxliesinavery smallinterval, [x,x + δx], is
∫ x+δx
x
f(t)dt
Since the interval is very small, f will vary only slightly across the interval. Hence the
probability is approximately f (x)δx: see Figure 29.6. The contribution to the expected
value as a resultofthis interval is
(value) ×(probability)
that is,xf(x)δx. Summing all such termsyields
expected value = µ =
∫ b
a
xf(x)dx
29.7 Expected value of a random variable 945
f(x)
Area ≈ f(x).dx
x
x + dx
x
Figure29.6
The shaded area represents the probability thatxlies
in the smallinterval [x,x + δx].
Example29.9 Arandom variable has p.d.f. given by
f(x)= 1
2 √ x
1x4
Calculate the expected value ofx.
Solution µ =
∫ 4
1
[ x
3/2
=
3
x 1
2 √ x dx = ∫ 4
] 4
1
= 7 3
1
√ x
2 dx
So, if several values ofxare measured, the mean of these values will be near to 7 3 . As
more and more values are measured the mean will get nearer and nearer to 7 3 .
EXERCISES29.7
1 Calculatethe expected value ofthe discrete random
variable,h,whose probabilitydistribution is
h 1 1.5 1.7 2.1 3.2
P(h) 0.32 0.24 0.17 0.15 0.12
2 Calculatethe expected value ofthe random variable,
x,whose probabilitydistribution is
x 2 2.5 3.0 3.5 4.0 4.5
P(x) 0.07 0.36 0.21 0.19 0.10 0.07
4 A randomvariable,x,has p.d.f. f (x) given by
f(x)= 5
4x 2
1x5
(a) Calculate the expected value ofx.
(b) Ten values ofxare measured. They are
1.9, 2.9, 2.8, 2.1, 3.2, 3.4, 2.7, 2.3, 2.8, 2.7
Calculate the mean ofthe observations and
comment on yourfindings.
5 A p.d.f.h(x)is defined by
h(x) = 3 4 (1−x2 )
−1x1
Calculate the expected value ofx.
3 Arandom variable,z,has p.d.f. f (z) = e −z ,
0 z<∞. Calculatethe expected value ofz.
6 Is the expected value ofadiscrete random variable
necessarily oneofits possiblevalues?
946 Chapter 29 Statistics and probability distributions
Solutions
1 1.668
2 3.05
3 1
4 (a) 2.012 (b) 2.68
5 0
6 no
29.8 STANDARDDEVIATIONOFARANDOMVARIABLE
29.8.1 Standarddeviationofadiscreterandomvariable
RecallfromSection29.6thatthestandarddeviationofasetofnumbers, {x 1
,x 2
,...,x n
},
isgiven by
√∑ (xi −x)
standard deviation =
2
n
Nowsupposethatxisadiscreterandomvariablewhichcanhavevaluesx 1
,x 2
,x 3
,...,x n
with respective probabilities of p 1
,p 2
,p 3
,...,p n
; thatis, we have
x x 1 x 2 x 3 ... x n
P(x) p 1 p 2 p 3 ... p n
Let the expected value of x be µ. Then the square of the deviation from the expected
value has anidentical probabilitydistribution:
value (x 1 − µ) 2 (x 2 − µ) 2 ... (x n − µ) 2
probability p 1 p 2 . . . p n
Theexpectedvalueofthemeansquareddeviationisthevariance.Thesymbol σ 2 isused
todenotethe varianceofarandom variable:
variance = σ 2 =
n∑
p i
(x i
− µ) 2
1
As before the standarddeviation isthe squarerootofthe variance:
standard deviation = σ =
√ ∑pi
(x i
− µ) 2
Example29.10 Adiscrete randomvariable has probabilitydistribution
x 1 2 3 4 5
P(x) 0.12 0.15 0.23 0.3 0.2
29.8 Standard deviation of a random variable 947
Calculate
(a) the expected value
(b) the standard deviation
Solution (a) µ = ∑ x i
p i
= 1(0.12) +2(0.15) +3(0.23) +4(0.3) +5(0.2) = 3.31
(b) σ 2 = ∑ p i
(x i
− µ) 2
=0.12(1 −3.31) 2 +0.15(2 −3.31) 2 +0.23(3 −3.31) 2
+0.3(4 −3.31) 2 +0.2(5 −3.31) 2
=1.6339
standard deviation = σ = √ 1.6339 = 1.278
29.8.2 Standarddeviationofacontinuousrandomvariable
Wesimplystatetheformulaforthestandarddeviationofacontinuousrandomvariable.
It is analogous to the formula for the standard deviation of a discrete variable. Letxbe
a continuousrandom variablewith p.d.f. f (x),a xb. Then
√ ∫ b
σ =
a
(x − µ) 2 f(x)dx
Example29.11 Arandomvariable,x, has p.d.f. f (x)given by
f(x)=1
0x1
Calculate the standarddeviation ofx.
Solution Theexpected value, µ,is found:
∫ 1
[ x
2
µ = xf(x)dx= = 1 2 2
0
] 1
0
The variance can now befound:
∫ 1
(
variance = σ 2 = x − 1 2
1dx
2)
Hence
=
σ =
√
1
12 = 0.29
0
∫ 1
0
x 2 −x+ 1 4 dx
The standard deviation ofxis0.29.
[ x
3
=
3 − x2
2 4] + x 1
= 1
0
3 − 1 2 + 1 4 = 1
12
948 Chapter 29 Statistics and probability distributions
EXERCISES29.8
1 Ap.d.f.h(x)ofthe random variablexisdefined by
h(x) = 3 4 (1−x2 )
−1x1
(a) Calculate the expected value ofx.
(b) Calculate the standard deviation ofx.
2 Arandom variable,t,has p.d.f.H(t)given by
H(t)=3e −3t
t0
(a) Calculate the expected value oft.
(b) Calculate the standard deviation oft.
3 Adiscrete random variable, w,has a known
probability distribution
w −1 −0.5 0 0.5 1
P(w) 0.1 0.17 0.4 0.21 0.12
Calculate the standard deviation of w.
4 Adiscrete randomvariable,y,has a probability
distribution
y 6 7 8 9 10 11
P(y) 0.13 0.26 0.14 0.09 0.11 0.27
Calculate the standard deviation ofy.
5 Acontinuous random variablehas p.d.f.
{ x+1 −1x0
f(x)=
−x+1 0<x1
(a) Calculate the expected value ofx.
(b) Calculate the standard deviation ofx.
Solutions
1 (a) 0 (b) 0.4472
4 1.84
2 (a)
3 0.5598
1
3 (b)
1
3
5 (a) 0 (b) 0.4082
29.9 PERMUTATIONSANDCOMBINATIONS
Bothpermutationsandcombinationsareusedextensivelyinthecalculationofprobabilities.
29.9.1 Permutations
Thefollowing problem introducespermutations.
Engineeringapplication29.2
Linkingprocesscontrolcomputers
A primary and a secondary route must be chosen from available routes A, B and C,
for the linking of two process control computers in order to provide redundancy in
case one fails.Inhow many ways can the choicesbemade?
29.9 Permutations and combinations 949
Solution
The various possibilitiesare listed.
Primary route
A
B
A
C
B
C
Secondary route
B
A
C
A
C
B
There are six ways in which the choices can be made. Alternatively we could argue
as follows. Suppose the primary route is chosen first. There are three choices: any
oneofA,BorC.Thesecondaryrouteisthenchosenfromthetworemainingroutes,
givingtwopossiblechoices.Togetherthereare3×2 = 6waysofchoosingaprimary
route and a secondary route.
InEngineeringapplication29.2thechoiceABisdistinctfromthechoiceBA,thatisthe
order is important. Choosing two routes from three and arranging them in order is an
example of apermutation. More generally:
Apermutationofndistinctobjectstakenr atatimeisanarrangementofr ofthen
objects.
Informingpermutations,theorderoftheobjectsisimportant.Ifthreelettersarechosen
fromthealphabetthepermutationXYZisdistinctfromthepermutationZXY.Wepose
the question ‘How many permutations are there of n objects taken r at a time?’ The
following example will helptoestablish a formulafor the number ofpermutations.
Example29.12 Calculate the number of permutations thereare of
(a) four distinctobjects taken two atatime
(b) five distinctobjects taken threeatatime
(c) seven distinctobjects taken four atatime
Solution (a) Listing all possible permutations is not feasible when the numbers involved are
large. In choosing the first object, four choices are possible. In choosing the second
object, three choices are possible. There are thus 4 × 3 = 12 permutations of
four objects taken two atatime.Note that12 may be writtenas
12=4×3= 4!
2! = 4!
(4 −2)!
(b) There are five objects available for the first choice, four for the second choice and
threeforthethirdchoice.Hencethereare5×4×3 = 60permutations.Againnote
that
60 = 5!
2! = 5!
(5 −3)!
950 Chapter 29 Statistics and probability distributions
(c) Therearesevenobjectsavailableforthefirstchoice,sixforthesecond,fiveforthe
third and four for the fourth. The number of permutations is 7 ×6×5×4 = 840.
7!
This may bewritten as
(7 −4)! .
Theexample illustratesthe following generalrule.
Thenumberofpermutationsofndistinctobjectstakenratatime,writtenP(n,r),is
P(n,r) =
n!
(n−r)!
Example29.13 Findthe numberofpermutations of
(a) 10 distinctobjects takensixatatime
(b) 15 distinctobjects takentwo atatime
(c) sixdistinctobjects takensixatatime
Solution (a) P(10,6) = 10!
(10 −6)! = 10!
4! = 151200
15!
(b) P(15,2) =
(15 −2)! = 15!
13! = 210
6!
(c) P(6,6) =
(6 −6)! = 6! = 720 (Note that0! = 1)
0!
P(6,6) issimplythe number of ways of arranging all sixof the objects.
Note that
P(n,n) =n!
This isthe numberofways ofarrangingngiven objects.
29.9.2 Combinations
Closelyrelatedto,but nevertheless distinctfrom, permutations arecombinations.
Acombination isaselection ofrdistinctobjectsfromnobjects.
Inmakingaselectiontheorderisunimportant.Forexample,giventhelettersA,BandC,
ABandBAarethesamecombinationbutdifferentpermutations.Aswithpermutations
wedevelopanexpressionforthenumberofcombinationsofnobjectstakenratatime.
Examples29.14and 29.15helpwith thisdevelopment.
29.9 Permutations and combinations 951
Example29.14 There are three routes, A, B and C, joining two computers. In how many ways can two
routes be chosen from A,Band C?
Solution The possible combinations (selections) can be listedas
AB,BC, AC
thatis,therearethreepossiblewaysofmakingtheselection.Wecanalsouseourknowledge
of permutations to calculate the number of combinations. There areP(3,2) ways
of arranging the threeroutes taken two atatime.
P(3,2) =
3!
(3 −2)! = 6
EachcombinationoftworoutescanbearrangedinP(2,2)ways;thatis,eachcombinationgivesrisetotwopermutations.Forexample,thecombinationABcouldbearranged
as AB or BA, giving two permutations. Thus, the number of combinations is half the
number of permutations. There are 6/2 = 3 combinations.
Example29.15 Calculate the numberofcombinationsof
(a) sixdistinctobjectstaken fouratatime
(b) 10 distinctobjects takensixatatime
Solution (a) Consider one combination of four objects. These four objects can be arranged in
4! ways; that is, each combination gives rise to 4! permutations. The number of
permutations ofsixobjects takenfouratatime is
P(6,4) = 6!
2!
Hence the number of combinations is
P(6,4)
4!
= 6!
2!4! = 15
There are15 combinations of sixobjects taken four atatime.
(b) Eachcombination,comprisingsixobjects,givesriseto6!permutations.Thenumber
of permutations of 10 objects taken sixatatime is
P(10,6) = 10!
4!
Hence the number of combinations = P(10,6)
6!
= 10!
4!6! = 210.
( n
We write to denote the number of combinations ofnobjects takenrat a time. A
r)
( n
formulafor isnow developed.
r)
952 Chapter 29 Statistics and probability distributions
Each combination ofr objects gives rise tor! permutations, but
P(n,r) =
n!
(n−r)!
Thenumberofcombinationsofndistinctobjects takenratatime is
( n
=
r)
P(n,r) n!
=
r! (n −r)!r!
Example29.16 Calculate the numberofcombinationsof
(a) sixdistinctobjects takenfive atatime
(b) ninedistinctobjectstaken nineatatime
(c) 25 distinctobjects takenfive atatime
Solution (a)
( 6
5)
= 6! ( ) 9
1!5! =6 (b) = 9! ( ) 25
9 0!9! =1 (c) = 25!
5 5!20! = 53130
( n
We can generalize the resultofExample 29.16(b)and statethat = 1.
n)
Example29.17 There are k identical objects and n compartments (n k). Each compartment can
hold only one object. In how many different ways can thekobjects be placed in then
compartments?
Solution The order in which the objects are placed is unimportant since all the objects are identical.
Placing the k objects is identical to selecting k of the n compartments (see
(
Figure29.7).Butthenumberofwaysofselectingkcompartmentsfromnisprecisely
. n
(
k)
n
Hence thekobjects can beplacedinthencompartmentsin different ways.
k)
n compartments
k objects
Figure29.7
Placingkobjectsinncompartments.
29.10 The binomial distribution 953
EXERCISES29.9
1 Evaluate
(a) P(8,6)
( ) 12
(c)
9
(b) P(11,7)
( ) 15
(d)
12
2 Writeoutexplicitly
( ( n n
(a) (b)
0)
1)
(c)
(e)
( ) 15
3
( n
2)
3 Theexpansionof (a+b) n wherenisapositiveinteger
maybe written with the help ofcombination notation.
(
(a+b) n =a n n
+ a
1)
n−1 b
( n
+ a
2)
n−2 b 2 +···
( ) ( n
+ ab n−1 n
+ b
n −1 n)
n
n∑
( n
= a
k)
n−k b k
k=0
Expand
(a) (a +b) 4 (b) (1 +x) 6 (c) (p+q) 5
4 Primary and secondary routesconnecting two
computers need to be chosen. Two primaryroutesare
needed from eightwhich are suitable and three
secondary routesmust be chosen from four available.
Inhow many ways can the routesbe chosen?
5 A combination lock can be opened by diallingthree
correct lettersfollowed bythree correct digits. How
many different possibilitiesare there forarranging the
lettersand digits? Isthismore secure than alock
which has seven digits? Isthe word ‘combination’
being usedcorrectly?
6 A nuclear power station isto be builton oneof20
possiblesites.Ateamofengineersiscommissionedto
examine the sites andrank the three most favourable
in order. Inhow many ways can thisbe done?
Solutions
1 (a) 20160 (b) 1663200 (c) 220 (c)p 5 +5p 4 q+10p 3 q 2 +10p 2 q 3 +5pq 4 +q 5
(d) 455 (e) 455
4 112
n(n −1)
2 (a) 1 (b) n (c)
2 5 17576000. Yes,itis more secure. Combination is not
3 (a)a 4 +4a 3 b+6a 2 b 2 +4ab 3 +b 4
being usedcorrectly:permutation lockwould be
better.
(b)1 +6x +15x 2 +20x 3 +15x 4 +6x 5 +x 6 6 6840
29.10 THEBINOMIALDISTRIBUTION
In a single trial or experiment a particular result may or may not be obtained. For example,
in an examination, a student may pass or fail; when testing a component it may
workornotwork.Theimportantpointisthatthetwooutcomesarecomplementary.Assumingthattheprobabilityofanoutcomeisfixed,suchatrialiscalledaBernoullitrial
inhonour ofthe mathematician, J.Bernoulli.
Weaddressthefollowingproblem.InasingletrialtheoutcomeiseitherAorB,thatis
A.WerefertoAasasuccessandBasafailure.IfweknowP(A) = pthenP(B) = 1−p.
Ifnindependenttrialsareobserved,whatistheprobabilitythatAoccursktimes,andB
occursn −k times? The number of successful trials innsuch experiments is a discrete
randomvariable;suchavariableissaidtohaveabinomialdistribution.Letusconsider
a particular problem.
954 Chapter 29 Statistics and probability distributions
Engineeringapplication29.3
Samplingcomponentsmadebyamachine
Amachinemakescomponents.Theprobabilitythatacomponentisacceptableis0.9.
(a) If three components are sampled find the probability that the first is acceptable,
the second isacceptable and the thirdisnot acceptable.
(b) If three components are sampled what is the probability that exactly two are
acceptable?
Solution
Let the eventsAandBbe defined thus:
A:thecomponentis acceptable,P(A) = 0.9
B:thecomponentis notacceptable,P(B) = 0.1
(a) We denote byAAB the compound event that the first is acceptable, the second
is acceptable, the third is not acceptable. Since the three events are independent
the multiplication lawinSection 28.7 gives
P(AAB) =P(A)P(A)P(B) = (0.9)(0.9)(0.1) = (0.9) 2 (0.1) = 0.081
(b) Weareinterestedinthecompoundeventinwhichtwocomponentsareacceptable
and one is not acceptable. We denote byAAB the compound event that the first
is acceptable, the second is acceptable, the third is not acceptable. Compound
eventsABA andBAA have obvious interpretations.
If exactly two components are acceptable then either AAB or ABA or BAA
occurs.Thesecompoundeventsaremutuallyexclusiveandwecanthereforeuse
the addition law (see Section 28.3). Hence
P(exactly two acceptable components) =P(AAB) +P(ABA) +P(BAA)
From (a) P(AAB) = 0.081 and by similar reasoning P(ABA) = P(BAA) =
0.081. Hence,
P(exactly two acceptable components) = 3(0.081) = 0.243
29.10.1 Probabilityofksuccessesfromntrials
Let us now return to the general problem posed earlier. We define the compound
eventC:
C:Aoccursktimes andBoccursn −k times
( n
The k occurrences of event A can be distributed amongst the n trials in different
k)
ways (see Example 29.17). The probability of a particular distribution
(
ofkoccurrences
n
ofAandn−koccurrencesofBisp k (1−p) n−k .Sincethereare distinctdistributions
k)
possible then
P(C) =
( n
k)
p k (1−p) n−k
k=0,1,2,...,n
29.10 The binomial distribution 955
Example29.18 The probability that a component is acceptable is 0.93. Ten components are picked at
random. What is the probability that
(a) atleastnine areacceptable
(b) atmostthree areacceptable?
Solution (a) P(exactly 9 components are acceptable) =
Hence,
P(exactly 10 components areacceptable) =
( 10
9
)
(0.93) 9 (0.07) = 0.364
( ) 10
(0.93) 10 = 0.484
10
P(atleast9components areacceptable) = 0.364 +0.484 = 0.848
(b) We require the probability that none are acceptable, one is acceptable, two are acceptable
and threeare acceptable:
( ) 10
P(0are acceptable) = (0.93) 0 (0.07) 10 = 2.825 ×10 −12
0
( ) 10
P(1is acceptable) = (0.93) 1 (0.07) 9 = 3.753 ×10 −10
1
( ) 10
P(2are acceptable) = (0.93) 2 (0.07) 8 = 2.244 ×10 −8
2
( ) 10
P(3are acceptable) = (0.93) 3 (0.07) 7 = 7.949 ×10 −7
3
Hence,
P(atmost3are acceptable) = 2.825 ×10 −12 +3.753 ×10 −10
+2.244 ×10 −8 +7.949 ×10 −7
= 8.18 ×10 −7
that is, the probability that at most three components are acceptable is almost zero;
itisvirtually impossible.
29.10.2 Meanandstandarddeviationofabinomialdistribution
Let the probability of success in a single trial be p and let the number of trials be n.
The number of successes in n trials is a discrete random variable, x, with a binomial
distribution. Thenxcan have any value from {0,1,2,3,...,n}, although clearly some
valuesaremorelikelytooccurthanothers.Theexpectedvalueofxcanbeshowntobe
np. Thus,ifmany values ofxarerecorded, the mean of these will approachnp.
Expected value of the binomial distribution =np
The standard deviation of the binomial distributioncan alsobefound. This isgiven by
standard deviation of the binomial distribution = √ np(1−p)
956 Chapter 29 Statistics and probability distributions
29.10.3 Mostlikelynumberofsuccesses
When conducting a series of trials it is sometimes desirable to know the most likely
outcome. For example, what is the most likely number of acceptable components in a
sample of five tested?
Example29.19 Theprobabilityacomponentisacceptableis0.8.Fivecomponentsarepickedatrandom.
What isthe mostlikely number of acceptable components?
( 5
Solution P(no acceptable components) = (0.8)
0)
0 (0.2) 5 = 3.2 ×10 −4
( 5
P(1 acceptable component) = (0.8)
1)
1 (0.2) 4 = 6.4 ×10 −3
( 5
P(2 acceptable components) = (0.8)
2)
2 (0.2) 3 = 0.0512
( 5
P(3 acceptable components) = (0.8)
3)
3 (0.2) 2 = 0.2048
( 5
P(4 acceptable components) = (0.8)
4)
4 (0.2) 1 = 0.4096
( 5
P(5 acceptable components) = (0.8)
5)
5 (0.2) 0 = 0.3277
The mostlikely number of acceptable components isfour.
Example 29.19 illustrates an important general result. Suppose we conductnBernoulli
trialsandwishtofindthemostlikelynumberofsuccesses.Ifp =probabilityofsuccess
on a single trial,andi =mostlikely number ofsuccesses inntrials,then
p(n+1)−1<i<p(n+1)
InExample 29.19, p = 0.8,n = 5 and so
(0.8)(6) −1 <i < (0.8)(6)
3.8<i<4.8
Sinceiisan integer, theni = 4.
EXERCISES29.10
1 Theprobabilityacomponentisacceptableis0.8.Four
components are sampled. What is the probability that
(a) exactly one is acceptable
(b) exactly two are acceptable?
2 Amachine requires allseven ofitsmicro-chips to
operate correctly in order to be acceptable. The
probability amicro-chip isoperating correctly is 0.99.
(a) What isthe probability the machine is
acceptable?
(b) What isthe probability that sixofthe seven chips
are operating correctly?
(c) Themachine is redesigned sothat the original
seven chips are replaced by four new chips.The
probability a new chipoperates correctly is0.98.
29.11 The Poisson distribution 957
Isthe new designmore orlessreliable than the
original?
3 Theprobability a machine has a lifespanofmore than
5 years is0.8. Ten machines are chosen atrandom.
What isthe probability that
(a) eightmachines have a lifespanofmore than 5
years
(b) allmachines have alifespan ofmore than 5 years
(c) atleast eight machines have alifespanofmore
than5years
(d) nomore than two machines have alifespanof
lessthan 5 years?
4 Theprobability a valve remainsreliable formore than
10years is 0.75.Eightvalves are sampled. What is
the most likely number ofvalvesto remain reliable
formore than 10years?
5 The probability a chipis manufactured to an
acceptable standard is 0.87.Asampleofsixchipsis
picked at random from a large batch.
(a) Calculate the probability all sixchipsare
acceptable.
(b) Calculate the probability none ofthe chipsis
acceptable.
(c) Calculate the probability that fewer than five
chips in the sampleare acceptable.
(d) Calculate the mostlikely number ofacceptable
chips in the sample.
(e) Calculate the probability that more than two
chips are unacceptable.
Solutions
1 (a) 0.0256 (b) 0.1536
2 (a) 0.9321 (b) 0.0659 (c) 0.9224.
New designis lessreliable
3 (a) 0.3020 (b) 0.1074 (c) 0.678 (d) 0.678
4 6
5 (a) 0.4336 (b) 4.826 ×10 −6 (c) 0.1776 (d) 6
(e) 0.0324
29.11 THEPOISSONDISTRIBUTION
ThePoissondistributionmodelsthenumberofoccurrencesofaneventinagiveninterval.
Consider the number of emergency calls received by a service engineer in one day.
Wemayknowfromexperiencethatthenumberofcallsisusuallythreeorfourperday,
butoccasionallyitwillbeonlyoneortwo,orevennone,andonsomedaysitmaybesix
or seven, or even more. This example suggests a need for assigning a probability to the
numberofoccurrencesofaneventduringagiventimeperiod.ThePoissondistribution
serves this purpose.
Thenumberofoccurrencesofanevent,E,inagiventimeperiodisadiscreterandom
variable which we denote by X. We wish to find the probability that X = 0, X = 1,
X = 2, X = 3, and so on. Suppose the occurrence of E in any time interval is not
affectedbyitsoccurrenceinanyprecedingtimeinterval.Forexample,acarisnotmore,
or less, likely to pass a given spot in the next 10 seconds because a car passed (or did
notpass) the spot inthe previous 10 seconds, thatisthe occurrences areindependent.
Let λ be the expected (mean) value ofX, the number of occurrences during the time
period. IfX is measured for many time periods the average value ofX will be λ. Under
thegivenconditionsX followsaPoissondistribution.TheprobabilitythatX hasavalue
r isgiven by
P(X=r)= e−λ λ r
r!
r=0,1,2,...
958 Chapter 29 Statistics and probability distributions
The expected value and variance of the Poisson distributionareboth equal to λ.
Engineeringapplication29.4
Emergencycallstoaserviceengineer
Recordsshowthatonaveragethreeemergencycallsperdayarereceivedbyaservice
engineer. What isthe probability thaton a particular day
(a) three (b) two (c) four
calls will be received?
Solution
Thenumber ofcallsreceived follows aPoissondistribution.Theaverage number of
calls isthreeper day, thatis λ = 3.
(a) P(X =3) = e−3 3 3
= 0.224 (b) P(X = 2) = e−3 3 2
= 0.224
3!
2!
(c) P(X =4) = e−3 3 4
= 0.168
4!
The engineer will receive three calls on approximately 22 days in 100, two calls on
approximately 22 days in100 and four calls on approximately 17 days in100.
Engineeringapplication29.5
Machinebreakdownsinaworkshop
A workshop has several machines. During a typical month two machines will break
down. What arethe probabilitiesthatinamonth
(a) none (b) one (c) morethantwo
will break down?
Solution
λ = average number ofmachinesthat break down = 2
X = numberofmachines brokendown
(a) P(X =0) = e−2 2 0
= 0.135
0!
(b) P(X =1) = e−2 2 1
= 0.271
1!
(c) P(X >2)=1−P(X =0)−P(X =1)−P(X =2)
=1−e −2 −2e −2 −2e −2 =0.323
Table29.6
Theprobabilitiesforbinomial and Poissondistributions.
29.11 The Poisson distribution 959
Binomial (n,p) Poisson (λ)
P(X =r);n =15,p=0.05 P(X =r); λ =0.75
r = 0 0.46339 0.47237
r = 1 0.36576 0.35427
r = 2 0.13475 0.13285
r = 3 0.03073 0.03321
r = 4 0.00485 0.00623
r = 5 0.00056 0.00090
r = 6 0.00005 0.00007
r = 7 0.00000 0.00001
29.11.1 Poissonapproximationtothebinomial
ThePoissonandbinomialdistributionsarerelated.Consider abinomialdistribution,in
whichntrialstakeplaceandtheprobabilityofsuccessisp.Ifnincreasesandpdecreases
such that np is constant, the resulting binomial distribution can be approximated by a
Poisson distribution with λ = np. Recall thatnp is the expected value of the binomial
distribution,and λisthe expected value of the Poisson distribution.
Toillustratetheabovepoint,Table29.6liststheprobabilitiesforbinomialandPoissondistributionswithn
= 15,p= 0.05andhence λ = 15(0.05) = 0.75.Theremaining
probabilities are all almost 0.
Asnincreases and p decreases withnp remaining constant, agreement between the
two distributions becomes closer.
Engineeringapplication29.6
Workforceabsentees
A workforce comprises 250 people. The probability a person is absent on any one
day is0.02. Find the probability thaton a day
(a) three
people are absent.
(b) seven
Solution
This problem may be treated either as a sequence of Bernoulli trials or as a Poisson
process.
Bernoullitrials
The probabilities followabinomial distribution.
E:apersonis absent
n =number oftrials = 250
p = probability thatE occurs inasingletrial =0.02
X = numberof occurrencesof eventE
➔
960 Chapter 29 Statistics and probability distributions
( 250
(a) P(X =3) =
3
( 250
(b) P(X =7) =
7
Poissonprocess
)
(0.02) 3 (0.98) 247 = 0.140
)
(0.02) 7 (0.98) 243 = 0.105
Sincenislargeand pissmallthePoissondistributionwillbeagoodapproximation
tothe binomial distribution
λ=np=5
(a) P(X =3) = e−5 (5) 3
= 0.140 (b) P(X = 7) = e−5 (5) 7
= 0.104
3!
7!
EXERCISES29.11
1 Acomputer network has several hundredcomputers.
During an 8 hour period,thereare on averageseven
computers notfunctioning. Find the probabilitythat
duringan 8 hour period
(a) nine (b) five
donot function.
2 Aworkforce has on averagetwo people absent
through illnessonany given day. Find the probability
that onatypicalday
(a) two
(b) atleast three
(c) less thanfour
people are absent.
3 Amachine manufactures 300 micro-chips perhour.
Theprobability an individual chipis faultyis 0.01.
Calculate the probabilitythat
(a) two
(b) four
(c) more thanthree
faultychipsaremanufacturedinaparticularhour.Use
both the binomial andPoisson approximations and
comparethe resulting probabilities.
4 Theprobability ofadisk drive failurein any weekis
0.007. Acomputer servicecompany maintains900
disk drives.Usethe Poisson distribution to calculate
the probability of
(a) seven (b) more thanseven
disk drivefailures in a week.
5 Theprobability an employee fails to cometo work is
0.017. Alarge engineeringfirm employs650 people.
Whatisthe probability thatonaparticularday
(a) nine (b) 10
people are away from work?
6 Amachine manufactures electrical components for
the car industryatthe rate of750 per hour.The
probability acomponentis faultyis 0.013. Useboth
the binomialdistribution andthe corresponding
Poisson approximation to findthe probabilitythat in a
sampleof200 components
(a) none are faulty
(b) oneisfaulty
(c) two are faulty
(d) three are faulty
(e) more thanthree are faulty
Solutions
1 (a) 0.1014 (b) 0.1277
2 (a) 0.2707 (b) 0.3233
(c) 0.8571
3 (a) Binomial0.2244; Poisson 0.2240
(b) 0.1689,0.1680
(c) 0.353,0.353
4 (a) 0.1435 (b) 0.2983
29.12 The uniform distribution 961
5 (a) 0.1075 (b) 0.1188
6 (a) Binomial0.0730; Poisson 0.0743
(b) 0.1923,0.1931
(c) 0.2521, 0.2510
(d) 0.2191, 0.2176
(e) 0.2634, 0.2640
29.12 THEUNIFORMDISTRIBUTION
Wenowconsideracontinuousdistribution--theuniformdistribution.Supposetheprobability
of an event occurring remains constant across a given time interval. The p.d.f.,
f (t),of such a distributiontakes the form shown inFigure 29.8.
f(t)
1 –
T
0
Area = 1
T
t
Figure29.8
Theuniform p.d.f.
The area under f (t) must equal 1 and so if the interval is of lengthT, the height of
the rectangle is 1 T .
Thep.d.f. forthe uniformdistributionisgiven by
⎧
⎨ 1
0<t<T
f(t)= T
⎩
0 otherwise
EXERCISES29.12
The probability an event occurs in an interval [a,b] is ∫ b
a f(t)dt.If0abT
thisprobabilityissimply b −a
T .WeshallmakeuseofthisdistributioninSection29.15
when we deal with reliabilityengineering.
1 Arandom variable,x,has a uniform p.d.f.with
T = 10.Calculate the probability that
(a) 1x3 (b) 1.6x9.3
(c) x 2.9 (d) x < 7.2
(e) −1<x<2
(f) 9.1<x<12.3
2 A randomvariablet has a uniform p.d.f.with
T = 1.5. Calculate the probability that
(a) 0.7t1.3 (b) 1<t<2
(c) |t| <0.5 (d) |t| >1
Solutions
1 (a) 0.2 (b) 0.77 (c) 0.71 (d) 0.72
(e) 0.2 (f) 0.09
2 (a) 0.4 (b) 0.3333 (c) 0.3333
(d) 0.3333
962 Chapter 29 Statistics and probability distributions
29.13 THEEXPONENTIALDISTRIBUTION
SupposearandomvariablehasaPoissondistribution;forexample,therandomvariable
could be the number of customers arriving at a service point, the number of telephone
callsreceivedataswitchboard,thenumberofmachinesbreakingdowninaweek.Then
the time between events happening is a random variable which follows an exponential
distribution. Note that whereas the number of events is a discrete variable, the time
between events isacontinuous variable.
Lett be the time between events happening.
The exponential p.d.f. f (t)isgiven by
{
αe
−αt
t 0
f(t)=
0 otherwise
where α > 0.
The probability of an event occurring in a time interval, T, is given by ∫ T
f(t)dt.
0
Figure29.9shows f (t)forvariousvaluesof α.Theexpectedvalueofthedistributionis
given, by definition, as
expected value = µ =
For example, if
∫ ∞
0
tαe −αt dt = 1 α
f(t) =3e −3t tinseconds t 0
then the mean time between events is 1 3
happening per second.
s; that is, on average there are three events
f(t)
a increasing
t
Figure29.9
Theexponential p.d.f.forvariousvalues of α.
Engineeringapplication29.7
Timebetweenbreakdownsofamachine
Thetimebetweenbreakdownsofaparticularmachinefollowsanexponentialdistribution,withameanof17days.Calculatetheprobabilitythatamachinebreaksdown
ina15 day period.
29.14 The normal distribution 963
Solution
Themeantimebetweenbreakdowns = 17 = 1 α ,soα= 1 .Thusthep.d.f., f (t),is
17
given by
f(t)= 1 17 e−t/17 t 0
We require the probability thatthe machine breaks down ina15 day period:
∫ 15
P(0t15)= f(t)dt
0
∫ 15
1
=
0 17 e−t/17 dt
] 15
=
[−e −t/17
0
= −e −15/17 +1
= 0.5862
There isa58.62% chance thatthe machine will break down ina15 day period.
EXERCISES29.13
1 Aservice engineer receives onaveragean emergency
callevery 3 hours. If the time betweencalls follows
an exponential distribution, calculatethe probability
thatthe time from oneemergency callto the next is
(a) greaterthan3hours
(b) less than4.5hours
2 Themeantimebetweenbreakdownsforacertaintype
ofmachineis400hours.Calculatetheprobabilitythat
the time between breakdowns foraparticular
machine is
(a) greater than450 hours
(b) less than350 hours
3 The meantime taken byan engineerto repair an
electrical faultin asystem is 2.7hours.Calculate the
probability thatthe engineerwillrepair afaultin less
thanthe meantime.
Solutions
1 (a) 0.3679 (b) 0.7769
3 0.6321
2 (a) 0.3247 (b) 0.5831
29.14 THENORMALDISTRIBUTION
The normal probability density function, commonly called the normal distribution, is
one of the most important and widely used. It is used to calculate the probable values
of continuous variables, for example weight, length, density, error measurement. Probabilities
calculated using the normal distribution have been shown to reflect accurately
thosewhichwould befound usingactual data.
964 Chapter 29 Statistics and probability distributions
N(x)
N(x)
N(x) (a) (b)
(a)
m
x
(b)
m
x
m 1 m 2 x
Figure29.10
Two typical normal curves.
Figure29.11
Two normalcurveswith the same
standarddeviationbutdifferentmeans.
Letxbeacontinuous random variable with a normal distribution,N(x). Then
N(x) = 1
σ √ 2π e−(x−µ)2 /2σ 2
−∞<x<∞
where µ =expected(mean)valueofx, σ =standarddeviationofx.Figure29.10shows
two typical normal curves. All normal distributions are bell shaped and symmetrical
about µ.
In Figure 29.10(a) the values ofxare grouped very closely to the mean. Such a distribution
has a low standard deviation. Conversely, in Figure 29.10(b) the values of the
variable are spread widely about the mean and so the distribution has a high standard
deviation.
Figure29.11showstwonormaldistributions.Theyhavethesamestandarddeviation
butdifferentmeans.ThemeanofthedistributioninFigure29.11(a)is µ 1
whilethemean
of that in Figure 29.11(b) is µ 2
. Note that the domain ofN(x) is (−∞,∞); that is, the
domain is all real numbers. As for all distribution curves the total area under the curve
is1.
29.14.1 Thestandardnormal
A normal distribution is determined uniquely by specifying the mean and standard deviation.
The probability thatxliesinthe interval [a,b] is
P(axb)=
∫ b
a
N(x)dx
Themathematicalformofthenormaldistributionmakesanalyticintegrationimpossible,
so all probabilities must be computed numerically. As these numerical values would
change every time the value of µ or σ was altered some standardization is required. To
this end we introduce the standard normal. The standard normal has a mean of 0 and
a standard deviation of 1.
Considertheprobabilitythattherandomvariable,x,hasavaluelessthanz.Forconvenience
wecall thisA(z).
A(z) =P(x <z) =
∫ z
−∞
N(x)dx
Figure29.12illustratesA(z).ValuesofA(z)havebeencomputednumericallyandtabulated.TheyaregiveninTable29.7.Usingthetableandthesymmetricalpropertyofthe
distribution,probabilities can becalculated.
29.14 The normal distribution 965
N(x)
A(z)
0
z
x
Figure29.12
A(z) =P(x <z) = ∫ z
−∞ N(x)dx.
Example29.20 Thecontinuousrandomvariablexhasastandardnormaldistribution.Calculatetheprobability
that
(a) x < 1.2 (b) x > 1.2 (c) x > −1.2 (d) x < −1.2
Solution (a) From Table 29.7
P(x < 1.2) = 0.8849
This isdepicted inFigure 29.13.
(b) P(x > 1.2) = 1 −0.8849 = 0.1151
This isshown inFigure 29.14.
(c) By symmetryP(x > −1.2) isidentical toP(x < 1.2) (see Figure 29.15). So,
P(x > −1.2) = 0.8849
(d) Using part(c)we find
P(x < −1.2) = 1 −P(x > −1.2) = 0.1151
(see Figure 29.16).
N(x)
N(x)
Figure29.13
P(x < 1.2) = 0.8849.
0 1.2 x
Figure29.14
P(x > 1.2) = 0.1151.
0 1.2 x
N(x)
N(x)
–1.2 0
x
Figure29.15
P(x > −1.2) = 0.8849.
–1.2 0
x
Figure29.16
P(x < −1.2) = 0.1151.
966 Chapter 29 Statistics and probability distributions
Example29.21 Thecontinuousrandomvariable vhasastandardnormaldistribution.Calculatetheprobability
that
(a)0<v<1 (b) −1<v<1 (c) −0.5v2
Solution (a) Figure 29.17 shows the area (probability) required.
P(v < 1) = 0.8413 using Table 29.7
P(v < 0) = 0.5
using symmetry
P(0 < v < 1) = 0.8413 −0.5 = 0.3413
(b) Figure 29.18 shows the area (probability) required.
P(−1 < v <1) = 2 ×P(0 < v <1)
= 2 ×0.3413 = 0.6826
usingsymmetry
This tells us that 68.3% of the values of v are within one standard deviation of the
mean.
(c) Figure 29.19 shows the area (probability) required.
P(v 2) = 0.9772
P(v −0.5) =P(v >0.5) =1−P(v <0.5)
= 1 −0.6915 = 0.3085
P(−0.5 v 2) = 0.9772 −0.3085 = 0.6687
Note that whether or not inequalities defining v are strict is of no consequence in
calculating the probabilities.
N(v)
N(y)
N(y)
Figure29.17
P(0 < v < 1) = 0.3413.
0 1 v
–1 0 1 y
Figure29.18
P(−1 < v < 1) = 0.6826.
–0.5 0 2 y
Figure29.19
P(−0.5 v 2) = 0.6687.
29.14.2 Non-standardnormal
Table 29.7 allows us to calculate probabilities for a random variable with a standard
normal distribution. This section show us how to use the same table when the variable
has a non-standard distribution. A non-standard normal has a mean value other than 0
and/or a standard deviation other than 1. The non-standard normal is changed into a
standard normal by application of a simple rule. Suppose the non-standard distribution
hasamean µandastandarddeviation σ.Thenallnon-standardvaluesaretransformed
tostandard values using
non-standard → standard
X → X − µ
σ
29.14 The normal distribution 967
Table29.7
Cumulative normal probabilities.
z A(z) z A(z) z A(z) z A(z) z A(z) z A(z)
0.00 0.5000000 0.40 0.6554217 0.80 0.7881446 1.20 0.8849303 1.60 0.9452007 2.00 0.9772499
0.01 0.5039894 0.41 0.6590970 0.81 0.7910299 1.21 0.8868606 1.61 0.9463011 2.01 0.9777844
0.02 0.5079783 0.42 0.6627573 0.82 0.7938919 1.22 0.8887676 1.62 0.9473839 2.02 0.9783083
0.03 0.5119665 0.43 0.6664022 0.83 0.7967306 1.23 0.8906514 1.63 0.9484493 2.03 0.9788217
0.04 0.5159534 0.44 0.6700314 0.84 0.7995458 1.24 0.8925123 1.64 0.9494974 2.04 0.9793248
0.05 0.5199388 0.45 0.6736448 0.85 0.8023375 1.25 0.8943502 1.65 0.9505285 2.05 0.9798178
0.06 0.5239222 0.46 0.6772419 0.86 0.8051055 1.26 0.8961653 1.66 0.9515428 2.06 0.9803007
0.07 0.5279032 0.47 0.6808225 0.87 0.8078498 1.27 0.8979577 1.67 0.9525403 2.07 0.9807738
0.08 0.5318814 0.48 0.6843863 0.88 0.8105703 1.28 0.8997274 1.68 0.9535213 2.08 0.9812372
0.09 0.5358564 0.49 0.6879331 0.89 0.8132671 1.29 0.9014747 1.69 0.9544860 2.09 0.9816911
0.10 0.5398278 0.50 0.6914625 0.90 0.8159399 1.30 0.9031995 1.70 0.9554345 2.10 0.9821356
0.11 0.5437953 0.51 0.6949743 0.91 0.8185887 1.31 0.9049021 1.71 0.9563671 2.11 0.9825708
0.12 0.5477584 0.52 0.6984682 0.92 0.8212136 1.32 0.9065825 1.72 0.9572838 2.12 0.9829970
0.13 0.5517168 0.53 0.7019440 0.93 0.8238145 1.33 0.9082409 1.73 0.9581849 2.13 0.9834142
0.14 0.5556700 0.54 0.7054015 0.94 0.8263912 1.34 0.9098773 1.74 0.9590705 2.14 0.9838226
0.15 0.5596177 0.55 0.7088403 0.95 0.8289439 1.35 0.9114920 1.75 0.9599408 2.15 0.9842224
0.16 0.5635595 0.56 0.7122603 0.96 0.8314724 1.36 0.9130850 1.76 0.9607961 2.16 0.9846137
0.17 0.5674949 0.57 0.7156612 0.97 0.8339768 1.37 0.9146565 1.77 0.9616364 2.17 0.9849966
0.18 0.5714237 0.58 0.7190427 0.98 0.8364569 1.38 0.9162067 1.78 0.9624620 2.18 0.9853713
0.19 0.5753454 0.59 0.7224047 0.99 0.8389129 1.39 0.9177356 1.79 0.9632730 2.19 0.9857379
0.20 0.5792597 0.60 0.7257469 1.00 0.8413447 1.40 0.9192433 1.80 0.9640697 2.20 0.9860966
0.21 0.5831662 0.61 0.7290691 1.01 0.8437524 1.41 0.9207302 1.81 0.9648521 2.21 0.9864474
0.22 0.5870604 0.62 0.7323711 1.02 0.8461358 1.42 0.9221962 1.82 0.9656205 2.22 0.9867906
0.23 0.5909541 0.63 0.7356527 1.03 0.8484950 1.43 0.9236415 1.83 0.9663750 2.23 0.9871263
0.24 0.5948349 0.64 0.7389137 1.04 0.8508300 1.44 0.9250663 1.84 0.9671159 2.24 0.9874545
0.25 0.5987063 0.65 0.7421539 1.05 0.8531409 1.45 0.9264707 1.85 0.9678432 2.25 0.9877755
0.26 0.6025681 0.66 0.7453731 1.06 0.8554277 1.46 0.9278550 1.86 0.9685572 2.26 0.9880894
0.27 0.6064199 0.67 0.7485711 1.07 0.8576903 1.47 0.9292191 1.87 0.9692581 2.27 0.9883962
0.28 0.6102612 0.68 0.7517478 1.08 0.8599289 1.48 0.9305634 1.88 0.9699460 2.28 0.9886962
0.29 0.6140919 0.69 0.7549029 1.09 0.8621434 1.49 0.9318879 1.89 0.9706210 2.29 0.9889893
0.30 0.6179114 0.70 0.7580363 1.10 0.8643339 1.50 0.9331928 1.90 0.9712834 2.30 0.9892759
0.31 0.6217195 0.71 0.7611479 1.11 0.8665005 1.51 0.9344783 1.91 0.9719334 2.31 0.9895559
0.32 0.6255158 0.72 0.7642375 1.12 0.8686431 1.52 0.9357445 1.92 0.9725711 2.32 0.9898296
0.33 0.6293000 0.73 0.7673049 1.13 0.8707619 1.53 0.9369916 1.93 0.9731966 2.33 0.9900969
0.34 0.6330717 0.74 0.7703500 1.14 0.8728568 1.54 0.9382198 1.94 0.9738102 2.34 0.9903581
0.35 0.6368307 0.75 0.7733726 1.15 0.8749281 1.55 0.9394292 1.95 0.9744119 2.35 0.9906133
0.36 0.6405764 0.76 0.7763727 1.16 0.8769756 1.56 0.9406201 1.96 0.9750021 2.36 0.9908625
0.37 0.6443088 0.77 0.7793501 1.17 0.8789995 1.57 0.9417924 1.97 0.9755808 2.37 0.9911060
0.38 0.6480273 0.78 0.7823046 1.18 0.8809999 1.58 0.9429466 1.98 0.9761482 2.38 0.9913437
0.39 0.6517317 0.79 0.7852361 1.19 0.8829768 1.59 0.9440826 1.99 0.9767045 2.39 0.9915758
(continued)
968 Chapter 29 Statistics and probability distributions
Table29.7
Cumulativenormal probabilities (continued).
z A(z) z A(z) z A(z) z A(z) z A(z) z A(z)
2.40 0.9918025 2.45 0.9928572 2.50 0.9937903 2.55 0.9946139 2.60 0.9953383 3.20 0.9993129
2.41 0.9920237 2.46 0.9930531 2.51 0.9939634 2.56 0.9947664 2.70 0.9965330 3.40 0.9996631
2.42 0.9922397 2.47 0.9932443 2.52 0.9941323 2.57 0.9949151 2.80 0.9974449 3.60 0.9998409
2.43 0.9924506 2.48 0.9934309 2.53 0.9942966 2.58 0.9950600 2.90 0.9981342 3.80 0.9999277
2.44 0.9926564 2.49 0.9936128 2.54 0.9944574 2.59 0.9952012 3.00 0.9986501 4.00 0.9999683
4.50 0.9999966
5.00 0.9999997
5.50 0.9999999
This table is condensed from Table 1 of theBiometrikaTablesforStatisticians, Vol. 1 (1st ed.), edited by E. S. Pearson
andH. O. Hartley. Reproduced with the kind permissionofE. S.Pearsonand the trusteesofBiometrika.
Source:StatisticsVol.1ProbabilityInferenceandDecisionbyHays,W.L.andWinkler,R.L.(HoltRinehart&Winston,
New York, 1970).
Example29.22 A random variable,h, has a normal distribution with mean 7 and standard deviation 2.
Calculate the probability that
(a)h>9 (b)h<6 (c)5<h<8
Solution (a) Applying the transformation gives
9 → 9 −7 = 1
2
Soh > 9 has the same probability asx > 1, wherexis a random variable with a
standard normal distribution:
P(h >9) =P(x >1) =1−P(x <1) =1−0.8413 =0.1587
(b) Applying the transformation gives
6 → 6 −7 = −0.5
2
Soh < 6 has the same probability asx < −0.5:
P(x < −0.5) =P(x > 0.5) = 1 −P(x < 0.5) = 0.3085
(c) Applying the transformation to5and 8 gives
5 → 5 −7 =−1 8→ 8 −7 = 0.5
2 2
and so werequireP(−1 <x<0.5). Therefore
and then
P(x < 0.5) = 0.6915 P(x < −1) = 0.1587
P(−1 <x<0.5) = 0.6915 −0.1587 = 0.5328
29.14 The normal distribution 969
EXERCISES29.14
1 Arandom variable,x,has a standard normal
distribution. Calculate the probability thatxlies in the
followingintervals:
(a) (0.25, 0.75)
(b) (−0.3,0.1)
(c) within 1.5standard deviationsofthe mean
(d) more thantwo standard deviations from the mean
(e) (−1.7, −0.2)
2 Arandom variable,x,has a normal distribution with
mean 4 andstandard deviation 0.8. Calculate the
probability that
(a) 3.0 x4.4
(b) 2.5 <x<3.9
(c) x > 4.6
(d) x < 4.2
(e) x is within 0.6ofthe mean
3 Arandom variable,t,has anormal distribution with
mean 1 andstandard deviation 2.5. Calculate the
probability that
(a) −1t2 (b)t>0
(c) |t| 0.9 (d) |t| > 1.6
4 Thescores fromIQ tests have amean of100 anda
standard deviation of15. What should aperson score
in order to be described asin the top10% ofthe
population?
5 Amachine produces car pistons.Thediameter ofthe
pistonsfollows anormal distribution, mean 6.04 cm
with astandard deviation of0.02 cm. Thepistonis
acceptable ifitsdiameter is in the range 6.010 cm to
6.055cm. What percentage ofpistonsis
acceptable?
6 The random variable,x, has anormal distribution.
How many standard deviations above the mean must
the pointPbe placed ifthe tail-end isto represent
(a) 10% (b) 5% (c) 1%
ofthe total area? (See Figure29.20.)
N(x)
? P
Figure29.20
Graph forQuestion6.
7 ConsiderFigure29.21. Thetwo tail-endshave equal
area. How many standard deviations from the mean
mustAandBbe placed ifthe tail-ends are
(a) 10% (b) 5% (c) 1%
ofthe total area?
A
N(x)
Figure29.21
Graph forQuestion7.
B
x
x
Solutions
1 (a) 0.1747 (b) 0.1577 (c) 0.8664
(d) 0.0455 (e) 0.3762
2 (a) 0.5858 (b) 0.4199 (c) 0.2266
(d) 0.5987 (e) 0.5467
3 (a) 0.4436 (b) 0.6554 (c) 0.2604
(d) 0.5544
4 119
5 71%
6 (a)1.28 (b)1.645 (c)2.33
7 (a)1.64 (b)1.96 (c)2.57
970 Chapter 29 Statistics and probability distributions
29.15 RELIABILITYENGINEERING
Reliability engineering is an important area of study. Unreliable products lead to human
frustration, financial loss and in the case of life-critical systems can lead to death.
As the complexity of engineering systems has increased, mathematical methods of assessing
reliability have grown in importance. Probability theory forms a central part of
thedesignofhighlyreliablesystems.Formostitemsthefailureratechangeswithtime.
A common pattern is exhibited by the appropriately named ‘bath tub’ curve, illustrated
inFigure 29.22.
Carpartfailuresarequitewellmodelledbythisdistribution.Forexample,acrankshaft
may fail quite quickly as a result of a manufacturing defect. If it does not, then there is
usuallyalongperiodduringwhichthelikelihoodoffailureislow.Aftermanyyearsthe
probability offailure increases.
ConsidertheprobabilityofanitemfailingoveratotaltimeperiodT.Thisperiod,T,
isthetimeduringwhichtheitemisfunctioning.Thetimetakentorepairtheitemisnot
consideredinthiscalculationbutisconsideredalittlelateron.Supposetheprobabilityof
failureisevenlydistributedoverthisperiod;thatis,theprobabilityoffailureismodelled
bytheuniformdistribution(seeSection29.12).Itisimportanttonotethatthisisafairly
simplistic assumption. IfN = number of failures of the item over a time periodT, it is
possible todefine a mean failure rate,k, by
k = N T
For example, if an item fails 10 times in a period of 5 years we define the mean failure
ratetobek = 10/5 = 2,thatistwofailuresperyear.Becauseoftheuniformdistribution
ofthefailuresacrossthetimeperiod,thequantitykisconstant.Toillustratethisconsider
the previous example with a period of 10 years. During this time the item will fail 20
times and so
k = 20 = 2 failures per year
10
as found earlier.
Another usefultermisthemean timebetween failures(MTBF), which isgiven by
MTBF = 1 k
The termMTBF isonly used foritems thatarerepairable.
Let the interval T be divided into n small sub-intervals each of length δT, that is
nδT = T. Suppose each sub-interval is so small that only one failure can occur during
it. Note that since repair time has been neglected it is always possible to have a failure
in a sub-interval. So theN failures which occur during timeT occur inN distinct sub-
Failure rate
Time
Figure29.22
The‘bath tub’curve.
29.15 Reliability engineering 971
intervals. The probability of failure occurring in a particular sub-interval, P f
, is then
given by
P f
=
number of sub-intervals inwhich failure occurs
total number ofsub-intervals
= N n
Ineach sub-interval, δT, the probability ofan item notfailing,P nf
, is
P nf
=1−P f
=1− N n = n −N
n
Astheitemprogressesthrougheachofthesuccessivesub-intervals δT itcanbethought
ofasundergoingaseriesoftrials,theresultofwhichisfailureornon-failure.Therefore,
the probability of the item not failing as it passes through all of thensub-intervals can
be obtained by multiplying each of the sub-interval probabilities of notfailing, thatis
( ) n −N n
probability of notfailingintimeT =
n
As the sub-interval δT becomes small, that is δT → 0, the number of sub-intervals,n,
becomes large, thatisn → ∞. Hence weneed toconsider the quantity
( ) n −N n
lim
n→∞
n
This limitcan be shown tobe e −N (see Appendix IV). Hence
probability of notfailingintimeT = e −N
Finally,theprobabilityofanitemfailingoneormoretimes--itemscanberepairedand
fail again -- inthe time intervalT isgiven by
P(T)=1−e −N =1−e −kT
P(T ) is the probability of at least one failure in the time period T. As a consequence
of the constancy of k, this formula can be used to calculate the probability of an item
failingone or more times inan arbitrarytimeperiod,t, inwhich case
P(t) =1−e −kt (29.1)
Itisimportanttostressthatthisformulaonlyappliesiftheprobabilityoffailureisevenly
distributed.
Engineeringapplication29.8
Breakdownsonafactoryprocessline
A factory process line makes use of 12 controllers to maintain process variables at
their correct values; all 12 controllers need to be working in order for the process
line to be operational. Records show that each controller fails, on average, once
every six months. Calculate the probability of the process line being stopped as a
resultofacontrollerfailure within a time periodofone month.
➔
972 Chapter 29 Statistics and probability distributions
Solution
The mean failure rate of each controller is 1/6 breakdowns per month. Given that
thereare12controllerstheoverallfailurerateforcontrollersis12/6 = 2breakdowns
per month. Using Equation (29.1) withk = 2 andt = 1 gives the probability,P(t),
of the process line being stopped because of a controller failure within a period of
one month as
P(t) =1−e −kt =1−e −2 =0.865
So far we have ignored the ‘downtime’ associated with an item waiting to be repaired
after it has failed. This can be a significant factor with many engineering systems. The
simplestpossibilityisthatanitemisoutofactionforafixedperiodoftimeT r
whileitis
being repaired. If there areN failures during a periodT then the total downtime isNT r
.
The time the item isavailable,T a
, isgiven by
T a
=T−NT r
Ausefulquantityisthefractionaldeadtime,D,whichistheratioofthemeantimethe
item isinthe dead statetothe total time.Inthiscase
D = NT r
(29.2)
T
Anotherusefulquantityistheavailability,A,whichistheratioofthemeantimeinthe
working statetothe total time. For the present model
A = T−NT r
=1−D
T
Thefailureratemodeldevelopedpreviouslywasbasedonthetimetheitemwasworking
rather thanthe total time.When the repair time isincludedkbecomes
and so
k = N T a
N =kT a
=k(T −NT r
)
N+kNT r
=kT
and therefore
N =
kT
1+kT r
So using Equation (29.2)
Also
D =
kT
1+kT r
T r
T =
A=1−D=1−
kT r
1+kT r
kT r
1
=
(29.3)
1+kT r
1+kT r
For more complicated repair characteristics the equations forDandAare correspondingly
more complicated.
29.15 Reliability engineering 973
Engineeringapplication29.9
Availabilityofanelectricalsupplytoalargefactory
Theelectricalsupplytoalargefactoryhasameantimebetweenfailuresof350hours.
When the supply fails it takes 3 hours to repair the failure and restore the supply.
Calculate the average availability of the electricalsupply tothe factory.
Solution
Failure rate of the supply is k where k = 1/350 failures per hour. Using Equation
(29.3) withT r
= 3 hours gives
A =
1
= 0.992 that is,99.2%
1 +3/350
The supply isup and running for99.2% of the time.
So far we have only examined systems in which failure was caused by one or more
componentseachwiththesamefailurerateorMTBF.Amorecommonsituationisonein
whichthedifferentcomponentsofasystemhavedifferentdegreesofreliability.Itisstill
useful to be able to calculate the overall reliability of the system although the analysis
ismorecomplicated.Inordertodosoitisnecessarytodefinethetermreliability.From
Equation(29.1)weknowthatP(t)definestheprobabilityofoneormorefailuresduring
atimeperiod,t.Thereforetheprobabilityofnofailuresisgivenby1−P(t).Thequantity
1−P(t)iscalledthereliabilityofthesystemduringatimeperiodt,andisdenotedR(t),
thatis
R(t) =1−P(t) =e −kt (29.4)
R(t) can be interpreted as the probability a component works properly during a period
t. We now examine the reliabilityoftwo simple systemconfigurations.
29.15.1 Seriessystem
A series system is one in which all the components of a system must operate satisfactorily
if the system is to function correctly. Consider a system consisting of three
components, shown in Figure 29.23. The reliability of the system is the product of the
reliabilitiesofthe individual components,that is
R =R A
R B
R C
(29.5)
This formula is a direct consequence of the fact that the failure of any one of the components
is an independent event. So the probability of the system not failing, that is its
reliability,istheproductoftheprobabilitiesofeachofthecomponentsnotfailing.(See
Section28.7 forindependentevents.)
Input A B C Output Figure29.23
Seriessystem.
974 Chapter 29 Statistics and probability distributions
Engineeringapplication29.10
Reliabilityofaradiosystem
A radio system consists of a power supply, a transmitter/amplifier and an antenna.
During a 1000 hour period the reliabilityofthe various components is asfollows:
R ps
= 0.95
R ta
= 0.93
R a
= 0.99
Calculate the overall reliabilityof the radio system.
Solution
R =R ps
R ta
R a
= 0.95 ×0.93 ×0.99 = 0.87
thatis,thereisan87%chancethattheradiowillnotfailduringa1000hourperiod.
Using Equations (29.4) and (29.5) it is possible to generate a formula for the reliability
of a systemconsisting ofnseries components:
R = e −k 1 t e −k 2 t ...e −k n t = e −(k 1 +k 2 +···+k n )t
wherek 1
,k 2
,...,k n
arethe mean failure ratesof thencomponents.
Engineeringapplication29.11
Reliabilityofasatellitelink
AsatellitelinkisbeingusedtotransmittelevisionpicturesfromAmericatoEngland.
ThecomponentsofthissystemarethetelevisionstudioinNewYork,thetransmitter
groundstation,thesatelliteandthereceivergroundstationinLondon.TheMTBF of
each ofthe components isasfollows:
MTBF ts
= 1000 hours
MTBF tgs
= 2000 hours
MTBF s
= 500000 hours
MTBF rgs
= 5000 hours
Calculate the overall reliability of the system during a 28 day period and a yearly
period of 365 days.
Solution
Firstwecalculate the mean failure rate of each of the components:
k ts
= 1
1000 = 1 ×10−3 failuresper hour
k tgs
= 1
2000 = 5 ×10−4 failures per hour
29.15 Reliability engineering 975
k s
=
1
500000 = 2 ×10−6 failuresper hour
k rgs
= 1 = 2 ×10 −4 failuresper hour
5000
So the overall reliabilityof the systemis
R = e −(k ts +k tgs +k s +k rgs )t = e −(1×10−3 +5×10 −4 +2×10 −6 +2×10 −4 )t
= e −1.702×10−3 t
Fort = 28 ×24 hours,R = 0.319.
Fort = 365 ×24hours,R = 3.35 ×10 −7 .Clearlyduringayearlyperiodthesystem
isalmost certain tofail atleastonce.
29.15.2 Parallelsystem
A parallel system is one in which several components are in parallel and all of them
mustfailforthesystemtofail.ThecaseofthreecomponentsisshowninFigure29.24.
The probability of all three components failing in a time period,t, is the product of the
individual probabilities of each component failing, that is (1 −R A
)(1 −R B
)(1 −R C
).
So the overall systemreliabilityis
R=1−(1−R A
)(1−R B
)(1−R C
)
This formulacan begeneralizedtothe case ofncomponentsinparallelquite easily.
A
Input
B
Output
C
Figure29.24
Parallel system.
Engineeringapplication29.12
Reliabilityofprocesscontrolcomputerpowersupplies
A process control computer is supplied by two identical power supplies. If one fails
then the other takes over. TheMTBF of the power supplies is 1000 hours. Calculate
thereliabilityofthepowersupplytothecomputerduringa28dayperiod.Compare
this with the reliabilityifthereisno standbypower supply.
Solution
k = 1 = 0.001 failures per hour
1000
R ps
= e −kt = e −0.001×24×28 = 0.511
➔
976 Chapter 29 Statistics and probability distributions
Therefore the overall reliabilityof the power supply tothe computer is
R = 1 − (1 −0.511)(1 −0.511) = 0.761
Clearly the existence of a standby power supply does improve the reliability of the
power supply to the computer. In practice, the figure would be much higher than
this because once a power supply failed the standby would take over and maintenance
engineers would quickly repair the failed power supply. Therefore the period
of time in which the computer was relying on one power supply would be very
small.
We have only examined the simplest possible reliability models. In practice, reliability
engineering can often be very complicated. Some models can cater for maintenance
strategiesofthesortwetouchedoninthepreviousexample.Theeffectofnon-uniform
failureratescanalsobecateredfor.Also,manysystemsconsistofamixtureofseriesand
parallel subsystems of the type we have discussed as well as more complicated failure
modes than we have examined.
EXERCISES29.15
1 Acomputer contains 20circuit boards each with an
MTBF of3months. Calculate the probability that the
computer willsufferacircuit boardfailure duringa
monthly period.
2 Threepumpsarerequiredtofeedwatertoaboilerina
power station in order foritto be fullyoperational.
TheMTBF ofeach ofthe pumps is 10days.Calculate
the probability that the boiler willnotbe fully
operational during amonthly periodifthere are only
three pumps available.
3 Aprocesscontrol computer has apower supply with
anMTBF of600 hours. If the power supply fails then
ittakes 2 hoursto replace it.Calculate the average
availability ofthe computer power supply.
4 Aradar station has anMTBF of1000hours. The
average repair time is 10hours. Calculate the average
availability ofthe radar station.
5 Aprocessline to manufacture bread consistsoffour
main stages:mixing ofingredients, cooking,
separation andfinishing, packaging. Each ofthe
stageshas anMTBF of
MTBF m = 30 hours
MTBF c = 10 hours
MTBF sf = 20 hours
MTBF p = 15 hours
Calculate the probability that the processline will be
stopped during an 8 hour shift.
6 Aremote water pumpingstation hastwo pumps.Only
one pumpis needed in normal operation; the other
actsasastandby. Access to thisstation isdifficult and
soifapumpfails itis noteasy to repair it
immediately. Given that theMTBF ofthe pumps is
3000hours, calculate the overall reliability ofthe
pumping station duringa28 dayperiod in which the
maintenance engineers are unable to repair apump
which fails.Calculate the improvement in reliability
that would be obtained ifasecond standby pump was
installed in the pumping station.
7 Repeat the calculation carried out in Question6with
the following information.The pumps,which are
different from those in Question 6,can be considered
to consist oftwo components: a motor andthe pump
itself.TheMTBF ofthe motors is 4000hours and that
ofthe pumps is 6000 hours.
Review exercises 29 977
Solutions
1 0.999
2 0.9998
3 0.9967
5 0.865
6 0.960, 0.992
7 0.940, 0.985
4 0.9901
REVIEWEXERCISES29
1 Find the mean and standard deviation of
8,6.9,7.2,8.4,9.6,10.3,7.4,9.0
2 Adiscrete random variable, v,has a probability
distributiongiven by
v −2 −1.5 −1 −0.5 0 0.5 1.0
P(v) 0.23 0.17 0.06 0.03 0.17 0.31 0.03
Calculate
(a)P(v = −1) (b)P(v = −2or v =0)
(c)P(v 0) (d)P(v < −0.5)
3 Acontinuous random variable,x, has ap.d.f., f (x),
given by
f(x)=3x 2
(a) FindP(0 x0.5).
(b) FindP(x > 0.3).
(c) FindP(x < 0.6).
0x1
(d) Find the expected value ofx.
(e) Find the standard deviation ofx.
4 Adiscrete random variable,y, has aprobability
distributiongiven by
y −0.25 0.25 0.75 1.25 1.75
P(y) 0.25 0.20 0.10 0.15 0.30
(a) Find the expected value ofy.
(b) Find the standard deviation ofy.
5 Therandom variable,t,has ap.d.f.,H(t),given by
H(t)=λe −λt
t0
(a) Calculate the expected value oft.
(b) Calculate the standard deviation oft.
6 Calculate the number ofpermutations of
(a) five distinct objects,taken two atatime
(b) eight distinct objects,taken four at atime
7 Calculate the number ofcombinations of
(a) seven distinct objects,taken four at atime
(b) 200 distinct objects,taken 198 atatime
8 The probability a component is manufactured to an
acceptable standard is 0.92.Twelve components are
picked at random. Calculate the probability that
(a) sixare acceptable
(b) 10are acceptable
(c) more than10are acceptable
(d) atleast two are notacceptable
9 The number ofbreakdowns ofacomputer systemin a
month isarandom variable with aPoisson
distribution. The mean number ofbreakdowns per
month istwo. Calculate the probability that the
number ofbreakdowns ofthe system in a monthis
(a) two
(b) three
(c) one
(d) more thantwo
10 The probability ofacomputer failingin asystem in a
given week is 0.03. Inasystem thereare 150
computers.
(a) Usethe binomial distribution to calculate the
probability that in a given week fivecomputers
fail.
(b) Usethe binomial distribution to calculate the
probability that in a given week more than two
computers fail.
(c) Usethe Poissonapproximation to calculate the
probability that in a given week four computers
fail.
978 Chapter 29 Statistics and probability distributions
(d) Usethe Poissonapproximation to calculate the
probability that in a given week more than three
computersfail.
11 Arandom variable,t,has a uniform distribution, f (t),
given by
{ 14
0<t<4
f(t)=
0 otherwise
Calculate the probability that
(a) 1t3
(b) 0.2t2.7
(c) |t| >1 (d) |t| 1.5
12 Thetime,t,betweenfailures foraparticular typeof
electrical component follows an exponential
distribution with a meanvalue of24weeks. Calculate
the probabilitythat
(a)t>24 (b)t<22
13 Theresistance ofresistorsis arandom variablewith a
normal distributionwith ameanof3ohmsanda
standard deviation of0.1ohm. Calculate the
probability thataresistor has aresistance of
(a) between 2.9and3.05 ohms
(b) more than2.95 ohms
(c) less than2.83 ohms
14 Thecapacitance ofcapacitors isarandom variable
with anormal distribution, with ameanof12farads
andastandard deviation of0.6farads.Inabatchof
300 capacitors how many would you expectto have
with capacitance
(a) between 11and12.3 farads
(b) more than11.6 farads
(c) less than12.8 farads?
15 Acomputer fails,onaverage, onceevery 4 months.
An engineering systemusesninecomputers, all of
whichneedto be workingforthe system to operate.
Calculate the probability thatthe system willfail
sometimeduring
(a) a1monthperiod
(b) a2weekperiod(i.e.half amonth)
16 TheMTBF foramotor is270 hours.When the motor
fails ittakes7hours to repair. Calculate the
availabilityofthe motor.
17 Asystem comprisesfour components in series.The
MTBF foreachcomponentis 170 hours,200 hours,
250 hours and210 hours. Calculate the reliabilityof
the system over a100 hour period.
18 Asystem comprisestwo components in parallel.The
components have anMTBF of150 hours and
170 hours. Calculate the probabilitythe systemwill
failin a 200 hourperiod.
Solutions
1 mean = 8.35;st.dev. = 1.1314
2 (a) 0.06 (b) 0.4 (c) 0.51 (d) 0.46
3 (a) 0.125 (b) 0.973 (c) 0.216
(d) 0.75 (e) 0.194
4 (a) 0.775 (b) 0.799
5 (a)
1
λ
(b)
1
λ
6 (a) 20 (b) 1680
7 (a) 35 (b) 19900
8 (a) 1.4687 ×10 −4 (b) 0.1835
(c) 0.7513 (d) 0.2487
9 (a) 0.2707 (b) 0.1804 (c) 0.2707
(d) 0.3233
10 (a) 0.1736 (b) 0.8307 (c) 0.1898
(d) 0.6577
11 (a) 0.5 (b) 0.625 (c) 0.75
(d) 0.375
12 (a) 0.3679 (b) 0.6002
13 (a) 0.5328 (b) 0.6915 (c) 0.0446
14 (a) 193 (b) 225 (c) 272
15 (a) 0.8946 (b) 0.6753
16 0.9747
17 0.1402
18 0.5093
Appendices
Contents I Representingacontinuousfunctionandasequenceasa
sumofweightedimpulses 979
II TheGreekalphabet 981
III SIunitsandprefixes 982
IV Thebinomialexpansionof ( )
n−N n
n
982
AppendixI REPRESENTINGACONTINUOUSFUNCTIONANDA
SEQUENCEASASUMOFWEIGHTEDIMPULSES
Inthisappendixweshowhowacontinuousfunction, f (t),oradiscretesequence, f[k],
obtained by sampling this function, can be represented as a sum of weighted impulses.
SucharepresentationisimportantinthestudyoftheztransformandthediscreteFourier
transform.
Thedeltafunction
ThedeltafunctionisintroducedinChapter2.Foreaseofreferencewerestateitsdevelopmenthere.Thedeltafunction
δ(t),orunitimpulsefunction,isthelimitofarectangle
function bounding an area of 1 unit, and located at the origin, as its width approaches
zero,anditsheightincreasesaccordinglytoensurethatthearearemains1.Theenclosed
area is known as the strength, or the weight, of the impulse. This is illustrated in Figure
AI.1. Note that the impulse is represented by an arrow and the height of the arrow
gives the strength of the impulse. If the impulse occurs at the pointt = d, then this is
written δ(t −d).Further,bymultiplyingthedeltafunctionbyanumberA,togiveAδ(t),
we obtain the limit of a rectangle function bounding an area ofA. This is an impulse of
strengthA.
Samplingacontinuousfunction
Now consider a function f (t) defined fort 0 as shown in Figure AI.2. Suppose this
function is sampled at timest = 0,T,2T,3T,...,kT,..., to generate a sequence of
values f(0), f(T), f(2T), f(3T),..., f(kT),..., which we shall write as f[0], f[1],
f[2],f[3],...,f[k],....
Note from the discussion above that the quantity f[0]δ(t) is the limit of a rectangle
functionlocated atthe originand boundinganarea equal to f[0].
980 Appendices
d(t)
as h 0
1
1–
h
0 t
0
h
t
FigureAI.1
The limitofthe rectangle function is the delta function.
f(t)
f(3T)
f(kT)
f(0)
f(2T)
f(T)
0 t 0 T 2T 3T....... kT t
FigureAI.2
Sampling a continuous function f (t)gives the sequence f[k].
Similarly, f[1]δ(t −T )isthelimitofarectanglefunctionboundinganareaequalto
f[1]and located att =T.
In general f[k]δ(t −kT) is the limit of a rectangle function bounding an area equal
to f[k]andlocatedatt =kT.
The quantity
∞∑
f[k]δ(t −kT)
k=0
isaseries of delta functions. The area bounded by these isgiven by
∞∑
area=f[0]+f[1]+f[2]+···+f[k]+···,
thatis
f[k]
(AI.1)
k=0
Approximatingtheareaunderacurve
Now consider approximating the area under the continuous function f (t) by a series of
rectangular areas as shown in Figure AI.3. The first rectangle has widthT and height
f (0) and so its area isTf (0). Using our sampling notation this may be writtenTf[0].
Similarly, the second rectangle has widthT and height f (T) and so its area isTf (T).
Using the sampling notation this is written Tf[1]. The third rectangle has area Tf[2]
and soon. The total area isthus given by
area=Tf[0]+Tf[1]+Tf[2]+···+Tf[k]+···=T
∞∑
f[k]
k=0
(AI.2)
II The Greek alphabet 981
f(t)
T
0 t 0 T 2T kT t
FigureAI.3
Thearea under f (t)can be approximated byaseriesofrectangular areas.
Now compare Equations (AI.1) and (AI.2). If we multiply the series of delta functions
byT we see that
∞∑
T f[k]δ(t −kT)
k=0
will bound the same area as our approximation to the area under the graph of f (t). In
thissense wecan regard
∞∑
T f[k]δ(t −kT)
k=0
as anapproximation tothe function f (t).We will denote thisapproximation by ˜f(t).
In summary, a continuous function f (t) can be represented as a sum of weighted
impulses each ofstrength f[k]occurring att =kT:
∑ ∞
˜f(t)=T f[k]δ(t −kT)
k=0
Thisrepresentationcanalsobethoughtofasawayofexpressingadiscretesequenceof
values, f[k],asacontinuousfunction ˜f (t).Thisisusefulwhenstudyingtheztransform
andthediscreteFouriertransform.Sometimesitwillbeconvenienttoworkwithoutthe
factorT, inwhich casewedefine
∞∑
f ∗ (t) = f[k]δ(t −kT)
k=0
It should be remembered that when using this form, f ∗ needs to be multiplied by the
factorT inordertoapproximatethe function f (t).
AppendixII THEGREEKALPHABET
A α alpha I ι iota P ρ rho
B β beta K κ kappa σ sigma
Ŵ γ gamma λ lambda T τ tau
δ delta M µ mu Y υ upsilon
E ε epsilon N ν nu φ phi
Z ζ zeta ξ xi X χ chi
H η eta O o omicron ψ psi
θ theta π pi ω omega
982 Appendices
AppendixIII SIUNITSANDPREFIXES
ThroughoutthisbookSIunitshavebeenused.Belowisalistoftheseunitstogetherwith
their symbols.
Quantity SIunit Symbol
Prefix
Symbol
length metre m
mass kilogram kg
time second s
frequency hertz Hz
electric current ampere A
temperature kelvin K
energy joule J
force newton N
power watt W
electric charge coulomb C
potential difference volt V
resistance ohm
capacitance farad F
inductance henry H
10 15 peta P
10 12 tera T
10 9 giga G
10 6 mega M
10 3 kilo k
10 2 hecto h
10 1 deca da
10 −1 deci d
10 −2 centi c
10 −3 milli m
10 −6 micro
10 −9 nano n
10 −12 pico p
10 −15 femto f
AppendixIV THEBINOMIALEXPANSIONOF ( n−N
( ) n −N n
Consider the binomial expansion of .
( )
n
n −N
n (
= 1 − N ) n
n n
(
=1+n − N )
+
n
n(n −1)
2!
n
(
− N ) 2
n
(
n(n −1)(n −2)
+ − N ) 3
+···
3! n
( ) ( )( ) n −1 N
2 n −1 n −2 N
3
=1−N+
n 2! − n n 3! +···
(
=1−N+ 1 − 1 ) N
2
(1
n 2! − − 1 )(
1 − 2 ) N
3
n n 3! +···
Suppose now weletn → ∞. We find
( n −N
lim
n→∞
n
) n
=1−N+ N2
2! − N3
3! +···
But thisisthe power series expansion of e −N (see Section 6.5).We conclude that
( ) n −N n
= e −N
lim
n→∞
n
Inparticular, note thatifN = −1
( ) n +1 n
= e
lim
n→∞
n
) n
Index
absolute quantity88
absorption laws 179, 185, 186
acceleration 690, 868
addition:
algebraic fraction 30--1
complex numbers329
law ofprobability 909--13, 954
matrix 259--60
vectors 225--6,336--7
adjoint matrix281--2
algebraseeBoolean algebra; matrixalgebra
algebraic equations 627, 649
algebraic fractions 26--33
addition 30--1
division29--30
equivalent fractions 27--8
expressing fraction in simplestform28--9
multiplication 29--30
proper andimproper fractions 27
resistorsin parallel 32
subtraction 30--1
algebraic techniques 1--53
algebraic fractionsseealgebraic fractions
indices, laws of2--11
dividing expressions 4--5
firstlaw 2
fractional indices 8--11
multiple7
multiplying expressions 2--4
negative indices 5, 6
positiveindices 6
power density ofsignal transmitted byradio antenna 6
power dissipationin resistor3--4
radar scattering7--8
second law 4
skindepth, in radial conductor 9--10
thirdlaw, 8 7
inequalities, solution of33--8
number basesseebase, number
partial fractionsseepartial fractions
polynomial equations 20--6
current used by electric vehicle 21--3
ofhigher degree 24--5
quadratic equations 20--4
summation notation 46--50
Kirchhoff’s currentlaw 47--8
Kirchhoff’s voltage law 48--9
alphabetic character stream,informationcontent of916
alternating current:
circuits 324
waveforms 134
Ampère’s circuital law 900
amplitude 581, 724, 744, 751--3
Fourier transform766
modulation 767--8
trigonometric functions133, 134, 135, 136
ofwave 131
analog-to-digital converter 690
analogue simulation 603--5
analytical integration 457
analytical solutions534, 618--19,621--2
AND gate 184, 187, 188
angular acceleration 613
angular frequency 133, 134, 135, 136, 340
Fourier series723--4,753
fundamental 723--4
ofwave 131
antennae 169--70
approximate solution 615
arbitraryconstant 538, 589, 608
arbitraryfunction 876
arbitraryscalingconstant 304
Argand diagram332, 334, 337, 345--6, 351
argument:
ofangle 333
offunction 57--8
arithmetic mean (mean value) 938--40
arithmetic progressions204
arithmetic series,finite 209--10
armaturecurrent/voltage 612--13
arrays1
arrows(signals)661
associative laws179, 186
associative matrixaddition 260
associative matrixmultiplication 264
asymptote 76--7, 81
Bode plotoftransfer function with real poles andzeros
671--5
ofgraph75
oblique 76
983
984 Index
attenuation 87,88, 578
in step-index opticalfibre 89--90
augmented matrix 309
autocorrelation 814, 815--16, 817
automated vehicles, routing226--7
auxiliary equation 562--4,572
particular integral574
availability, in reliability 972
average information perevent 916
average value:
offunction471--5
ofperiodic waveform 477--8
back substitution287--90
backward wave 578--80
base, number 2, 11--20, 80
binarycoded decimal 11,17--20
binarysystem 11--14
decimal system 11
hexadecimal system 11,14--17
seven-segment displays18--19
‘bath tub’curve 970
Bernoulli, J.953
Bernoulli trials953, 956, 959
Besselfunctions584, 586, 587--601
firstkind oforder one591
firstkind oforder zero589
in frequency modulation 598--601
and generating function595--7
ofhigherorder 592--3
Jacobi-Anger identities 597--8
modesofcylindrical microwave cavity 593--5
second kind oforder one 591
second kind oforder zero 589
tabulated values of601
Bessel’sequations 584, 587--601
ofhigherorder 592--3
oforder one 589--91
oforder zero 587--9
power seriessolution of587--8, 590--1
biasedcircuit 379
binarycoded decimal 11, 17--20
binarydata stream:
entropy of917
informationcontent of916
redundancy of918
binarydigits (bits)11,17
binaryfull-adder circuit 190--2
binarysystem 11--14
binaryto decimal conversion 12
decimal to binary conversion 12--14
binarywords190--1
binomial distribution 953--7
mean andstandard deviation 955
most likely number ofsuccesses 956
probability ofksuccesses fromntrials954--5
binomial expansion 982
binomial theorem 214--18, 700, 717--18
Biot-Savart law 250
bit stream 915
bits(binarydigits)11,17
block 627--8, 661
basic 661
delay 688--9
diagram661, 663--4,665, 688--94
Bode plot oflinearcircuit 96--7
Bode plot oftransferfunction with real polesand zeros
671--5
Boolean algebra 175, 183, 185--97
expressions185, 187--8
lawsof185--7
logicalequivalence 187--93
variables 185
bounded sequence 205
branch current 307--10
breakpoints 673, 674
byte 11,17
capacitance 575, 637
ofcoaxial cable 448--50
mutual 102
between two parallel wires102--3
capacitive reactance 435
capacitors 577, 637
with capacitance 435
currentthrough 368--9
discharge of82, 655
energystoredin 487--8
phasors341--3
in series484--5
voltage across435
cardinal sinefunction 123--4
carrier 599
carrier signal 767
carry-out value 190
Cartesian components 232--40
line and multipleintegrals 871
Cartesian coordinate system160--1, 162, 166--7, 171
complex numbers333--4
functionsofmore than onevariable 823--5
threedimensions 157--8
two dimensions 154--7
vectors 240
Cartesian equations 169
Cartesian form 334
complex numbers342
discrete Fourier transform793
cathode raytube, electrical potential inside 824
chain rule389--91,395, 463
character data stream, redundancy of918
characteristic equation 299, 300--2
characteristic impedance 348
ofcoaxial cable 450--1
charged particle,movement in electric field 244--5
Index 985
chord 357--8
circuital law (Ampère)900
circuits:
biased 379
binary full-adder 190--2
digital electronic 175
integrated 428
inverting 437
LC 572--3
LCR560--1
linear 97
RC 535--6, 554--7
RL 544--5,553--4, 651--2
RLC 109, 573--5
thyristorfiring473--4
truthtable for187--8
circular (periodic) convolution 803--7
circular convolution theorem 807--12
circular correlation theorem816--20
circular cross-correlation815--16, 818--20
closed form(binomial theorem)216
closed loop system with negative feedback 665
CMOS(complementary metal-oxide semiconductor) logic
192--3
co-domain 180--1
coaxial cable 576--8
capacitance of448--50
characteristic impedance of450--1
codes (communication theory)918
coding theory919
coefficient:
damping 610
difference equations and the z transform685--6
differential equations, linearconstant 649--59
equating 40--1, 585, 586
Fourier 734, 739, 741--2, 748--50, 766
matrix 285, 289
polynomial 20
reflection 581--2
cofactor ofelement 279, 282
column vector 233, 258--9, 261--2
n-component 612
p-component 612
r-component 612
combinations 950--2
common derivatives 372--4
common difference 204
common ratio 204--5, 211, 213
communication engineering 903
communication theory915--19
commutative laws 179, 186, 191
commutative numbers259--60
complement:
laws 179, 186, 189--90, 191
in probability 904
in set theory 178--9
complementary events913--15
complementary function558, 561--9,572
inhomogeneous term appears in 575--83
particular integral574--5
complementary metal-oxide semiconductor (CMOS)logic
192--3
completeness andperiodic functions733
completing the square 23,37--8
complex frequency:
function 751
variable 636
complex notation andFourier series749--51
complex numbers97, 324--55
addition 329
complex conjugate pairs 327
De Moivre’stheorem 344--51
division330
in polar form 335
exponential form337--8
Fourier series751
graphical representation 332--3
imaginary part325--6, 328--30, 334
and inverse Laplace transform643--6
loci and regionsofcomplex plane 351--3
modulus333
multiplication 329--30
in polar form 335
operationswith 328--30
phasors 340--4
polar form 333--6
Poynting vector343--4
real part 326, 328--30, 334
subtraction329
vectors and 336--7
complex plane 332
complex poles 668, 669
complex roots24,564, 567
complex translation theorem 714
component laws,differential equations 572, 577
composition, offunctions61--2
compound events 905--8,954
compression, image and audio 795
computer software packages 73
Fourier transform816--17
graph plotting 723
Laplace transform675
seealso MATLAB ®
computer solutionsformatrixalgebra 319--21
concave down function402--3, 415--17
concave upfunction 402--3,415--17
conditional probability 919--25
conduction current 369
conjunction (AND gate) 184
conservative fields875--80
constant689
arbitrary538, 589, 608
arbitraryscaling 304
coefficient differential equations, linear627, 649--59
986 Index
constant(Continued)
coefficient equations 560--84
exponential 81
ofintegration 429
phase581
time 82, 545
constantpolynomial 71
continuous function 64--5,361
andsequencerepresentationassumofweightedimpulses
979--81
approximating area under acurve 980--1
delta function 979
sampling a continuous function979--80
continuous integrand 491
continuous random variable 965--6
expected value 944--5
standard deviation 947
continuous signal sampling 702--4
continuous spectra 766
continuous variable 962, 963
control engineering 319
control systems628
controllersignal 693
converged series211
convergence, radiusof218, 698
convergence criteria 212
converges sequence 205
convolution 647
circular (periodic) 803--7
linear801--3, 809--11
theorem 647--9,676, 772--8
circular 807--12
seealso discrete convolution and correlation
coordinate frame 268
coordinate system 154--74
CartesianseeCartesian coordinate system
electrode 156
polarsee polar coordinates
polar curves 163--6
corner(derivatives) 371
correlation:
theorem 780--2
circular 816--20
seealso discrete convolution and correlation
cosecant (cosec)120, 434
hyperbolic 100
cosine(cos):
complex notation 749
complex numbers333--4, 338--9
De Moivre’stheorem 344--50
definite andindefinite integrals 445
differentialequations 539
differentiation 391, 394, 420
discrete Fourier transform795--7
first-orderlinear differentialequations 547--9,555
Fourier series734--43
odd andeven functions 740--3
functions120--3
half-range series746--8
hyperbolic (cosh)100--1,102--3
improper integrals 486, 488
integration, elementary 429--33, 438--9
integrationby parts 458, 460
integrationby substitution464--5
Jacobi-Anger identities 598
Maclaurin series525--6
modellingwaves using131--44
odd and even functions726--7,730--1
orthogonalfunctions 481--3
orthogonality relations 732--1
partialdifferential equations 825, 827
polar coordinates 160, 171
polar curves164--5
second-order linear differentialequations 541, 559--60,
564, 566--7,571--3
Taylor polynomials 522
Taylor series526--7
trigonometricequations 144, 146, 147, 148
trigonometricidentities 125--30
trigonometricratios 116--20
vectors 244
waves734, 757
cosine (cos)periodic waveforms 723--4
cotangent (cot)120, 434
hyperbolic 100
coupled difference equations 694
coupled equations 606--7
coupled first-orderequations 607--8
coupled-tanks system 614--15
Cramer’s rule280, 658
critical resistance 412
critically damped response 667
cross-correlation780
circular 815--16, 818--20
linear812--15
cubic equation 543
cubic polynomial 71
cumulative normalprobabilities 967--8
current 577--8
alternating 134, 324
branch307--10
conduction 369
density892
direct510
displacement 369
eddy 827
lawsee Kirchhoff’scurrent law
leakage 576--7
mesh 254, 307, 308--10
node 47--8, 310--12, 317--19
sinusoidalsignals 136--7
through capacitor 368--9
usedbyelectric vehicle 21--3
cusp(derivatives) 371
Index 987
cycle ofsint 131
cycles oflogarithmic scale94--5
cylindrical microwave cavities, modesof593--5
cylindrical polar coordinates166--70
decaying exponential 670
fluid flow along apipe 167
helical antennae 169--70
damped sinusoidalsignal 387--8
damping coefficient 610
damping ratio 411--12
DCT-II795
DCT-III795
De Moivre’stheorem 344--51
De Morgan’s laws 179, 186, 191
decaying exponential 578
decibels (dB)87
decimal system 11
deconvolution 810
definite integrals 442--51
degree 116, 117
ofequation 20
ofpolynomial 71
delta function111--12,979, 980, 981
Dirac 480
discrete Fourier transform787
Fourier transform770--1
integral properties489--91
Laplace transform675--6
sifting property of490
demodulation 817--18
demodulator 598
denominator 27--8, 29--30, 39,75, 465
dependent variable 58,59, 366
difference equations 682, 683, 684--5, 686--7
differential equations 535--7,543, 550--2
derivable from a potential 875
derivatives 365, 370--2
common 372--4
higherseehigher order derivatives
and Laplace transform635--8
partialseepartial derivatives
secondseesecond-derivative
table 356
determinants 275--6,278--81
to evaluate vector products248--9
and vector products279--80
deviation:
from mean 941
seealsostandard deviation
diagonal matrix 271
diagonally dominant, matrix ofcoefficients 317
dielectric 82
difference, common 204
difference equations 175, 203, 681--98
block diagram representation 688--93
coefficient 685--6
coupled 694
dependent andindependent variables 682
discrete-time controllerdesign693--5
discrete-time filter689--90
homogeneous and inhomogeneous equations 684--5
linear andnon-linear equations 683
low-pass filter696--7
non-recursive 684
numerical solution 695--8
order684
recursive684
rewriting686--8
second-order 719
signal processingusingamicroprocessor684--5
solution ofdifference equation 683
andztransform718--20
differentialcalculus 357
differentialequations 534--626, 638
analogue simulation 603--5
condition 538
coupled-tanks system 614--15
Euler’smethod 616--19
first-orderseefirst-orderequations
first-orderlinearseefirst-orderlinear differential
equations
higherorder equations 606--9,625
Laplace transform635, 659--61, 663
LC circuit with sinusoidalinput 572--3
linear 536--7
linear constantcoefficient 627, 649--59
numerical methods 615--16
order536
oscillatingmass-springsystem 565--6
parallelRLC circuit573--5
RC charging circuit 535--6
RC circuit: zero-input response and zero-state response
554--8
RL circuitwith ramp input 553--4
RL circuitwith stepinput 544--5
Runge-Kutta method offourth-order623--5
second-order 538--9,605, 610
second-order linearseesecond-order linearordinary
differential equation
seriessolution of584--6
solution of537--9
standing waves on transmissionlines579--82
state-space models609--15
voltage reflection coefficient 578--80
differentiation356--85,429, 457--8
applicationsseedifferentiation applications
common derivatives 372--4
currentthrough capacitor 368--9
defined 366
derivatives, existence of370--2
dynamic resistance ofsemiconductor diode 378--9
explicit 394
from firstprinciples 366, 372
988 Index
differentiation(Continued)
fluid flow intotank376--7
graphical approach to 357--8
implicit 394--7
limitsand continuity 358--61
linearmodel forsimple fluid system 380--3
aslinearoperator 375--83
linearity430
logarithmic 397--8
parametric 393--4
rate ofchange
atgeneral point 364--9
atspecific point 362--4
techniquesseedifferentiation techniques
voltage across an inductor 368
differentiationapplications 406--27
inflexion, pointsof415--18
maximum pointsandminimum pointssee
maximum/minimum points
maximum power transfer413--14
Newton-Raphson method 406, 418--23
risetime forsecond-order electrical system411--13
seriesdiode-resistor circuit421--3
vectors 423--6
differentiationtechniques 386--405
chain rule389--91, 395
damped sinusoidalsignal 387--8
higherderivatives 400--4
logarithmic differentiation397--8
parametric differentiation 393--4
product rule 386--8,397--8
quotient rule388--9,401
differentiator691
digital computers 615
digital controller 694
digital electronic circuits 175
digital filtering692
digital signal processing200, 692
diode equation 83--4
Dirac delta function 480
directcurrent electrical network 510
directtransmissionmatrix 612
Dirichlet conditions 735, 740
discontinuous function64--5, 361
discrete,usage ofterm 175
discrete convolution and correlation 801--20
circular (periodic) convolution 803--7
circular convolution theorem 807--12
circular correlation theorem 816--20
circular cross-correlation815--16,818--20
convolution reverb 810--12
linearconvolution 801--3,809--11
linearcross-correlation812--15
radar,use ofcorrelation in 817--20
discrete cosine transform795--801
definition andits inverse 795--801
truncation ofset ofsamples 798--9
two-dimensional andimage compression799--801
discrete Fourier transform783--7,979, 981
definition 784--6
derivation 787--90
to estimateFourier transform790--1
inverse 786--7
matrixrepresentation 792--3
properties793--5
linearity793
Parseval’s theorem 794
periodicity 793
Rayleigh’s theorem 794
discrete independent variable 202
discrete mathematicsseeBoolean algebra; settheory
discrete random variable 953, 957
expected value 943--4
standard deviation 946--7
discrete spectra766
discrete-time:
controllerdesign693--5
data 688
filter689--90, 696
interval702
discrete variable 962
disjointsets 178
disjunction connective 183
disjunctive normal form188
displacement:
current369
vector226, 227, 423
dissipation, power, in resistor3--4
functionto model 59
distance travelled by aparticle 434
distance travelled by rocket498
distributive laws179, 186, 189, 191
divergence theorem 893, 895
divergent sequence 205
diverging integral 486--7
division:
algebraic fractions 29--30
complex numbers330
indices4--5
in polar form 335
domain (sets)180--1
domain, function 58
doping 83
dot product 241
double integrals 880--5,886--7,891, 895
double the input rule56--7, 61
dual modular redundancy 906
dummy variable, and integration 451
duty cycle 743--4
dynamic resistance, ofsemiconductor diode 378--9
dynamic systems535
echelon form 290--1
eddy current losses827
Index 989
eigenvalues andeigenvectors 294--307
eigenvalues 297--303, 305
eigenvectors 303--6
linear homogeneous equations 294--7
electric charge density 900
electric displacement, electric field strength and
240--1
electric field:
ofelectrostatic dipole 529--31
movement ofcharged particle in 244--5
strength, and electric displacement 240--1
work done by 869
electric flux857--8, 893, 900
electrical components:
multiple, testing 914--15
reliability 911--12
electrical networks, analysis of307--12
electrical potential inside acathode ray tube824
electrode coordinates 156
electromagnetism andvector calculus 864--6
electronic chips, manufactured, acceptability of
925--6
electronic circuits, digital 175
electronic thermometer measuring oven temperature
656--7
electrostatic dipole, electric field of529--31
electrostatic potential 241, 854--5,863
electrostatics theory900
elements 176, 259
ofarea 882
cofactor of279, 282
minor of278--9
ofsystem 661
empty set 178, 910
energy 798
storedin capacitor 487--8
used by electric motor 447
engineering functions 54--114
argument offunction 57--8
composition of61--2
continuous function 64--5
discontinuous function64--5
exponential functions80--4
graph offunction58--9
hyperbolic functionsseehyperbolic functions
inverse offunction62--4
logarithm functionsseelogarithm functions
many-to-one functions60--1
numbersand intervals 55--6
one-to-many function 60
one-to-one functions60--1
parametric definition 61
periodic function 65--6
piecewise continuous functions 64--5
polynomial functions70--4
power dissipatedin resistor59
rational functions75--9
saw-tooth waveform 66--7
squarewaveform 67
triangularwaveform 66
entropy 916--19
ofbinarydata stream 917
ofsignal consistingofthree characters 917
equal sets177
equating coefficients 40--1,585, 586
equations:
algebraic 627, 649
auxiliary 562--4,572, 574
Bessel’sseeBessel’sequations
Cartesian 169
characteristic 299, 300--2
circle, centre onthe origin 163, 164
coupled 606--7
coupled difference 694
coupled first-order607--8
cubic 543
degree of20
differenceseedifference equations
differentialseedifferentialequations
diode 83--4
exact 547--9
familyof587
first-orderseefirst-orderequations
higherorder 606--9,625
homogeneous 558--60, 684--6
inconsistent288
inhomogeneous 558--60,572--3
Laplace 833--4, 862, 863
linear 163, 164, 220, 536--7, 683, 685
non-linear 219--22, 536--7, 683, 685
non-recursive 691
output614
partialdifferential 832--5, 862
Poisson’s863
quadratic 20--4,103, 325--6
second-order 684, 685, 696
separable 541
simple 540--1
simultaneoussee simultaneous equations
state609, 613
state-space 615
third-order684, 685
transmission834
trigonometric 144--50
wave 833
zero order684
equipotential surface 245
equivalent fractions 27--8
equivalent resistance 77--8
error-correcting code 291
errorsignal 693
errorterm 521
Euler’smethod 616--19, 623
improved 620--2
990 Index
Euler’srelations339, 749
differentialequations 564, 566
discrete Fourier transform787
Fourier transform760, 771--2
even functions726--32
exact equations 547--9
exact solution 616, 617--18,620--1,624
exclusive OR gate189
expansion along the firstrow278
expected value 943--5, 946--7,955, 959, 962
experiment and binomial distribution 953
explicitdifferentiation 394
exponentseepower
exponent, defined 80
exponential constant 81
exponential decay 81
exponential distribution 962--3
exponential expressions 80
exponential form,ofcomplex number 337--8
exponential functions80--4, 728
diode equation 83--4
discharge ofcapacitor 82
seealso logarithm functions
exponential growth 81
exponential term 669, 670, 671
factorialnotation 55
factorization 20--1,28--9,39
factory production line, quality control on907--8
failurein asingle trial953
Faraday, M. 240
Faraday’s law 368, 865
feedback:
loop, negative, elimination of662--8
negative 436
path689
Fibonacci sequence 203
field:
irrotational860
line and multiple integrals 876
solenoidal 857
filters722
digital 692
discrete-time 689--90,696
high pass689
lowpass689, 696--7
final value theorem 634, 657
finite sequences 801, 809, 813
finite series209
arithmetic 209--10
geometric 211
firstcondition (differentialequations) 539
first-derivative417
Laplace transform635
Taylor polynomials 508--9
test 407, 411
firstlaw ofindices 3
first-orderequations 540--5, 573, 606--8, 685
coupled 607--8
difference equations 696
Laplace transform656
separation ofvariables 541--5
simplest540--1
first-orderlinear differential equations 547--57
exact equations 547--9
integratingfactor method 550--7
separation ofvariables 549
first-orderTaylor polynomial 509--13
directcurrent electrical network 510
gravityfeed water supply 510--11
linearity509--13
power dissipationin resistor511--12
first-orderTaylor polynomials, in two variables 835--7
firstshifttheorem 632--3,719, 768
Fourier transform762--4
ztransform711--12
fluidflow:
acrossasurface 890--2
along a pipe167
coupled-tanks system614--15
intotank376--7
flux 891
folding (signalprocessing)775
forward wave 578--80
Fourier analysis 482, 745
Fourier coefficients 734, 739, 741--2,748--50, 766
Fourier components 752, 754--5
Fourier series722--56, 723--4,728, 731--42, 746, 749--50
complex notation 749--51
frequency response oflinearsystem 751--5
half-range series745--8
low-passfilter752--5
odd and even functions726--32, 740--4
Parseval’s theorem 748--9
periodicwaveforms 723--6
spectrum ofpulse width:
modulation controlled solar charger743--4
Fourier synthesis 734
Fourier transform749, 757--822
amplitude modulation 767--8
convolution and convolution theorem 774--82
definitions 758--61
ofdelta function 770--1
discrete cosinetransform795--81
fast785
integration480
inverse 758--9,774
andLaplace transform772--4
pair758
periodicfunctions 771--2
properties761--6
firstshifttheorem 762--4
linearity761--2
second shift theorem 764--5
Index 991
spectra 766--8
t-w duality principle 768--70, 771
seealsodiscrete convolution and correlation; discrete
Fourier transform
fourth-orderTaylor polynomials 518
fractional dead time 972
fractional indices 8--11
fractions:
algebraicseealgebraic fractions
equivalent 27--8
expressing,in simplestform 28--9
improper 27,43--4
negative 35--6
partialseepartial fractions
positive 35--7
proper 27
freevariables 288--9
freevectors 225
frequency 132, 724
angularseeangular frequency
component, zero736
domain 722, 766
Fourier series723
natural 411
resonant 343, 411, 594
response ofasystem636--7
response oflinear system751--5
sinusoidalsteady-state 636--7
spectrum 744
variable, generalized 636
frequency modulation (FM),Besselfunctionsin 598--601
full-wave rectifier 106--7
function:
average value of471--5
orthogonal 480--3
rootmean square value of475--9
function handle 61
functionsofseveral variables 823--48
dependent variable 823--4
eddy current losses827
electrical potential inside acathode raytube 824
first-orderTaylor polynomial in two variables 835--7
functionsofmore thanone variable 823--5
higher order derivatives 829--32
independent variables 823--4,825
maximum and minimumpointsofafunctionoftwo
variables 841--6
partial derivatives 825--9
partial differentialequations 832--5
power dissipatedin a variable resistor824
second-order Taylor polynomial in two variables 837--40
Taylor seriesin two variables 840
gain/attenuation 87,88, 578
Gauss-Seidel method 314--17
Gaussian elimination 286--94,306, 307, 309
augmented matrix287--91, 292
back substitution 287--90
rowoperations 287--90
Gauss’stheorem 448, 857--8,864, 893, 900
general solution 537
complementary function558--9
difference equations 683
differential equation 584--5
first-orderlinear differentialequations 547
higherorder equations 607--8
particular integral578
second-order differentialequation 538, 561--4,566--8,
569--70
generalized frequency variable 636
generating function 509
Besselfunctionsand 595--7
geometric progressions204--5
geometric series:
finite 211
infinite 212--13
gradient 620
ofchord357--8
ofsolution 616
oftangent 366--7,508
graph,offunction 58--9
graphics calculators 73,723
gravitational field, work done by868--9
gravityfeed water supply 510--11
Greek alphabet 981
Green’s theorem 895
half-range cosine series746--8
half-range Fourier series745--8
half-range sineseries746
half-wave dipole, radiation patternsof172--3
half-wave rectifier 107
Halleffect, in semiconductor 251--2
harmonics723
components 724
even 744
fundamental orfirst723
nth 744
odd 736, 744
second 723
waves 725, 734
helical antennae 169--70
hexadecimal system 11,14--17
binaryto hexadecimal conversion 16--17
conversion to decimal 14--15
decimal to hexadecimal conversion15--16
high passfilter689
higherorderderivatives 400--4,829--32
fifth402
first400--2
fourth402, 503--4,831
mixed 831
second 400--2,831
third402, 831
992 Index
higherorder equations 606--9,625
homogeneous equations 558--60, 572, 684--6
linear294--7
horizontalaxis turbines 73
horizontalshift 138--44
horizontalstrips881--4
hyperbolic functions100--4
capacitance between two parallelwires102--3
cosecant (cosech) 100
cosine(cosh) 100--1
cotangent (coth) 100
identities 101
inverse 102
sine(sinh)100--1
iandjcomponents ofr233
identity
hyperbolic function 101
Jacobi-Anger 597--8
laws 179, 186, 189--90, 191
matrix271--2, 274, 284, 292--3
trigonometric 125--30
imaginaryaxis 670, 709
imaginaryaxis (yaxis) 332
imaginaryparts 578, 669
complex numbers325--6, 334, 348
implicit differentiation 394--7
impossibleevent 910
improper fraction 27, 43--4
improper integrals 480, 483--9
impulseresponse 112, 676, 810--12
impulsetrain 111--12
inclusiveOR gate 189
inconsistentequations 288
increment (rateofchange at aspecific point) 363
indefinite integrals 442--51
independent events 925--30
independent solution 568
independent variable 58,59,366, 374
difference equations 682, 683, 684--5,686, 691
differentialequations 535--7, 540--1,543, 550--2
discrete 202, 682
integration 429, 431--2
indices,laws ofseeunder algebraic techniques
inductor577
phasors341
inequalities, solution of33--8
inertia,moment of612, 613
infinite integrand 483, 486
infinite limitsofintegration 483
infinite series209, 211--12
geometric, sum of212--13
inflexion, pointsof415--18
informationcontent:
ofalphabetic character stream 916
ofbinarydata stream 916
informationperevent 915
inhomogeneous equations 294, 558--60, 569--70, 572--3
difference equations 684--6
inhomogeneous term,in complementary function
575--83
initial conditions539, 542, 556--7, 566
analogue simulation 604--5
difference equations 686, 696
differentialequations 616
Laplace transform659
particularintegral 578
zero660, 663
initial values 696
input:
matrix612
sequence 685, 689, 691, 697
signal 661, 669
to system659--61
transferfunctions663, 665
vector612
voltage 676
integers 55,345
non-negative 682
positive55,214--16
integrals 438, 460, 463, 480, 490
differentialequations 541--2
diverging 486--7
double880--5,886--7,891, 895
improper 480, 483--9
indefinite/definite 442--51
infinite limits483
inner 881--3,886
Laplace transform628, 635--8
limits443, 446, 488
lower limits443--5
odd and even functions730--1
outer 881--3
periodicfunctions 677--8
scalar 493
second-order linear ordinary differential equation
569--75
Simpson’srule502
surface 889--95, 896, 898
trapezium rule499--500
triple 885--6,889, 893
upperlimits443--5
volume 889--95
seealso line integrals and multiple integrals
integrands 458, 465, 480
continuous 491
infinite 483, 488
integrating factor 551, 552
integrating factor method550--7
integration 428--56, 480--95
analytical 457
applications ofsee integration applications
capacitance ofcoaxial cable 448--50
capacitors in series484--5
Index 993
characteristic impedance ofcoaxial cable 450--1
constant of429
delta function, integralproperties of489--91
differential equations 542
distance travelled by aparticle 434
elementary 429--41
electronic integrators 435--7
energy storedin capacitor 487--8
energy used by electric motor 447
improprerintegrals 483--9
integrals, definiteand indefinite 442--51
limits491
aslinear operator432--4
linearity 438
numericalsee numerical integration
orthogonal functions480--3
ofpiecewise continuous functions491--2
region of880
techniquessee integration techniques
oftrigonometric functions437--9
vectors 493
voltage across a capacitor 435
integration applications 471--9
average value and r.m.s.value ofperiodic waveform
477--8
average value offunction471--5
load resistor476--7
rootmean square value offunction475--9
saw-tooth waveform 472--3
thyristorfiringcircuit473--4
integration techniques 457--70
by parts457--63
by substitution463--6
usingpartial fractions 466--8
integrator 604--5
circuits 428
intersection (settheory) 178
intervals 56
closed 56
ofconvergence 218
open 56
semi-open 56
inverse:
offunction 62--4
ofmatrix 285
usingrow operations292--3
of3×3matrix281--2
trigonometric functions121--3
of2×2matrix274--7
finding 275--6
orthogonalmatrix 276
inverse square law 6
inverters 188
inverting circuit437
irrational number 55
irrotationalvector field 860
isotropic antenna 6
iteration420
ofnon-linear equations,sequences arisingfrom 219--22
simple 220
Jacobi-Anger identities 597--8
Jacobi’s method 313--14, 315--16
Kirchhoff’scurrent law 310
algebraic techniques 47--8
complex numbers342
Kirchhoff’svoltage law72, 308
algebraic techniques 48--9
complex numbers342
differential equations 535, 544, 560, 572, 577
differentiation 413
Fourier series752
integration 435, 485
Laplace transform655
Kraus,J.169
Kronecker delta sequence 202, 203, 698
Laplace equation 862, 863
three-dimensional 833--4
Laplace transform411, 627--80, 681--2,706, 709, 759, 774
asymptotic Bodeplotoftransfer functionwith real poles
and zeros671--5
ofcommon functions(look-uptable) 629--30
convolution theorem 647--9
definition 628
delta function675--6
ofderivatives and integrals 635--8
discharge ofacapacitor 655
electronic thermometer measuring oven temperature
656--7
and Fourier transform772--4
frequency response ofasystem 636--7
integration 480
inverse 627, 638--41
usingcomplex numbers 643--6
usingpartial fractions 641--3
linear constantcoefficient differentialequations 649--59
linearity 631--2
partialdifferential equations 833
periodic functions677--8
poles,zerosandsplane 668--75
rule1670
rule2670
rule3670--5
position controlsystem 665--8
properties631--5
final value theorem 634
firstshift theorem 632--3
linearity 631--2
second shifttheorem 633--4
RL circuitwith ramp input 651--2
rule1: combining two transfer functionsin series
661--2
994 Index
Laplace transform(Continued)
rule2: eliminating negative feedback loop 662--8
transferfunctions 659--68, 752
transport lag 664--6, 664--5
voltage across acapacitor 636
andztransform, relationshipbetween 704--9
LCRcircuit560--1
leakage, current 576--7
left-handlimits360
lifespan, electrical components 911
limitsequence 205
limit5sequence 206
limits463
differentiation 358--61
ofintegrals 443, 446, 486
integration 491
left-hand360
right-hand360
line integrals and multiple integrals 867--902
conservative fields875--80
line integralaround closedloop 877--9
potential function 875--7
divergence theorem 895
double integrals 880--5,886--7, 891
electric charge enclosed in a region 890
electric current density 892
electric field, work done by869
electric fluxand Gauss’slaw 893
evaluation ofline integrals in three dimensions
873--5
evaluation ofline integrals in two dimensions 871--3
fluid flow acrossasurface 890
gravitational field, work done by868--9
Green’s theorem 886--7
line integrals 867--71
massofasolid object 889
Maxwell’s equations in integral form 899--900
simple volume and surface integrals 889--95
Stokes’theorem 896--9
three-dimensional line integrals 873--5
triple integrals 885--6, 889
two-dimensional line integrals 871--3
line spectra 766
linearcircuit, Bode plotof96--7
linearcombination 237
ofsinusoids723
linearconstant coefficient:
differentialequations 627, 649--59
linearconvolution 801--3,809--11
linearcross-correlation812--15
linearequations 163, 164, 220, 536--7, 683, 685
linearfactors 39--41
equating coefficients 40--1
evaluation usingspecific value ofx40
Laplace transform651, 668
repeated 39, 41--2
linearhomogeneous equations 294--7
linear modelsforsimple fluidsystem 380--3
linear operator:
differentiationas375--83
integrationas432--4
Laplace transform638
linear polynomial 71
linear scale94
linear system:
frequency response 751--5
Taylor polynomials 511
linear time-invariant system 560--1
linearity:
differentialequations 536--7
ofdifferentiation 430
discrete Fourier transform793
first-orderTaylor polynomial 509--13
Fourier transform761--2
integration438
Laplace transform631--2
ztransformanddifference equations 710, 718--19
linearly dependent vectors 238
linearly independent functions558
linearly independent vectors 238
linesofforce 240
linking processcontrolcomputers 948--9
load:
line 422
resistor476--7,655
voltage waveform 473--4
log-linear graph92
log-linear scales 92--4
log-log graph94--6
log-log plot92--3
log-log scales 92--4
log scale92
logarithm functions85--100, 187--93
attenuation in a step-index optical fibre89--90
decibelsin radio frequency engineering 88--9
delta function 111--12
exponential functionsand 91--2
logarithms85--7
modulusfunction104--8
ramp function 108
reference levelsseereference levels
signal ratiosand decibels 87--8
unitimpulsefunction 111--12
unitstepfunction108--10
logarithmic differentiation 397--8
logarithmic frequency scale 672
logarithmic scale 94
logarithms85--7
laws86,398, 433, 672
natural 85, 89
logical equivalence:
AND gate184, 187, 188
binaryfull-adder circuit design190--2
connective 183
Index 995
exclusive OR gate 189
NAND gate 185, 192--3
NOR gate 184--5
NOT gate 184
OR gateseeOR gate
realization oflogicalgates 198--9
losslessline 578
lossycompression 799
low passfilter689, 692
analogue 752--5
digital 696--7
lower bound 56, 499, 504
lowest common denominator (l.c.d.)30
m-files 61
Maclaurin series524--31
magnetic field:
due to moving charge 250
intensity249--50
strength 249--50
magnetic flux 900
magnitude:
ofphasor 340
ofvector 225
many-to-one functions60--1, 122
maps 180
mark time 743
mass610--11
mass-springsystem, oscillating565--6
mathematical model 54
MATLAB ® 1--2,58, 320--1,615, 785, 790, 808--9
toolboxes 67
matrices 1
matrixalgebra 257--328
addition 259--60
adjoint matrix281--2
augmented matrix309
computer solutionsfor319--21
definitions 258--9
determinants 275--6, 278--81
Cramer’srule280
andvector products279--80
diagonal matrix 271
direct transmission612
eigenvalues and eigenvectorsseeeigenvalues and
eigenvectors
electrical networks,analysis of307--12
Gaussian eliminationseeGaussian elimination
identity matrix271--2, 292--3
input 612
inverse of2×2matrix 274--7
finding 275--6
orthogonalmatrix 276
inverse of3×3matrix 281--2
multiplication 261--4, 285
non-singularmatrix 276
output 612
representation 792--3
robot coordinate frames268
scalar multiplication 260--1
simultaneous equations and 283--6
simultaneous equations, iterative techniques forsolution
of312--19
singular matrix276, 282
skewsymmetric matrix272--3
squarematrix 271, 272--3, 275, 300
state612
subtraction259--60
symmetric matrix 272
translation and rotation ofvectors 267--71
transpose ofmatrix272
Vandermonde matrix 291--2
maximum/minimum points406--15, 824--5
first-derivativetest 407, 411
ofafunction oftwo variables 841--6
local maximum 406--7
local minimum406--7
second-derivative test 410, 412
turningpoints407--9, 412, 414
Maxwell’s equations 864--6
in integral form 899--900
mean 964
deviation and binomial distribution 955
time between failures (MTBF)970, 973, 975
value (arithmetic mean) 938--40
mesh current307, 308--10
vector 254
microwave cavities 593--5
minimization laws179, 185, 186
minimumpointsseemaximum/minimum points
minor ofelement 278--9
mixed partial derivatives 862
modelling systems566
modulation index 599
modulation signal 767
modulus:
ofcomplex number 333
function 104--8
ofvector 225
moment ofinertia 612, 613
moving average filter692
multipath-induced fading 142
multipleintegralsseeline integrals andmultiple integrals
multiplication:
algebraic fractions 29--30
complex numbers329--30
indices2--4
law 919--25, 926, 954
matrix261--4
in polar form335
scalar 260--1
mutual capacitance 102
mutuallyexclusive events:addition law ofprobability
909--13
996 Index
n-component column vector 612
n-typemetal-oxide semiconductor (NMOS) transistors
192--3
n-typesemiconductor 83, 251
NAND gate 185, 192--3
natural frequency 411
natural logarithms85, 89
natural numbers55
negative feedback 436
negative number 325
neper (Np)89
Newton-Raphson method221, 406, 418--23
Newton’s Second Law 610, 613
nibble 17
NMOS(n-typemetal-oxide semiconductor) transistors
192--3
node 47--8
reference 310
voltages/currents47--8, 310--12, 317--19
non-linear equations 219--22, 536--7,683, 685
non-linear system, in Taylor polynomials 511
non-negative number 333
non-periodic function757--8
non-rectangular region 880, 884
non-recursiveequations 691
non-singularmatrix276
non-standard normal 966--9
non-trivial solution 294--6,298--300
non-uniformfailurerates976
non-zeroinitial value ofstate vector 660
NOR gate 184--5
normal distribution 963--9
non-standard normal 966--9
standard normal 964--6
normalized sincfunction123, 124--5
norms254
NOTgate 184
nth-order Taylor polynomial 517--21
numbers55--6
commutative 259--60
complexseecomplex numbers
numerator 27--8, 29--30, 75,465
numerical integration 457, 496--506
distance travelled byrocket 498
energy dissipationin resistor502--3
Simpson’srule 500--5
trapezium rule496--500, 502
numerical methods,and differentialequations 615--16
numerical solution 220, 534, 617--18, 695--8
obliqueasymptote 76
odd and even functions726--32, 740--4
integral properties730--1
Ohm’s law3, 55
differentialequations 535, 544
differentiation 413
fluid equivalent 614
Fourier series752
integration436, 487
Laplace transform655
phasor form341
polynomial functions71--2
Taylor polynomials 510
one-to-many function 60
one-to-one functions60--1, 122
operating point 379--80
OR gate 183, 184--5,187, 188, 192
exclusive 189
inclusive189
truthtable 183
order258, 263, 587
difference equations and theztransform684
differentialequations 536
ofobjects 949
ofsystem 609
ordinary differentialequationsseedifferential equations
origin (coordinate systems)154
orthogonal functions480--3
orthogonal matrix276
orthogonal vectors 232, 242
oscillate sequence 205
oscillating mass-springsystem565--6
oscillatory term669
output 663, 665
equation 614
matrix612
reduction 436
sequence 685, 689, 691, 697
signal 661, 669
to system659--61
variables 614
vector612
voltage 676
overdamped response 666--7
overshooting 666
p-component columnvector 612
p-n junction 83
p-type metal-oxide semiconductor (PMOS)transistor
192--3
p-type semiconductor 83,251
padding signals with zeros 809--11
parallel system 975--7
parallel vectors 242
parallelogram 336--7
parameters 393, 587
functions61
parametric differentiation 393--4
Parseval’s theorem 748--9, 794
partial derivatives 825--9,835
first-order825, 829, 833, 835--7, 842--5
fourth-order840
line and multipleintegrals 886
mixed 862
Index 997
second-order 829--30, 833, 838--40, 842--5
third-order840
vector calculus 854
partial differentialequations 832--5,862
partial fractions 39--45
improper fractions 43--4
and integration 466--8
linear factors 39--41
quadratic factors 42--3
repeated linearfactors 39, 41--2
partial fractions:
inverse Laplace transform641--3,644--5,651, 653--4,
656
Laplace transform666
z transform716--17,720
partial sumssequence 212
particular integralfunction 558
particular solution 538, 556, 563--4,575
parts,integration by457--63
Pascal’s triangle 214--15
period 131
offunction 65
periodic convolutionsee circular (periodic) convolution
periodic extension 746--7
periodic functions65--6,131--2, 725, 733
Fourier transform766, 771--2
Laplace transform677--8
periodic waveforms 477--8, 723--6,752
periodicity and discrete Fourier
transform793
permeability, ofconductor 10
permittivity offree space 102, 858
permutations 948--50, 951--2
phase 136, 724, 751--3
angle 132, 133, 138--9, 723
constant 581
Fourier transform766
phasors 340--4
capacitor 341--3
inductor 341
magnitude of340
reference 340
resistor341
pick andplace robot 159--60
piecewise continuous functions64--5, 648, 780
integration of491--2
place sign 279
PMOS(p-typemetal-oxidesemiconductor)transistor192--3
Poissondistribution 957--61,962
Poissonprocess959
Poisson’sequation 863
polar coordinates 159--63, 333--4
cylindricalsee cylindrical polar coordinates
pick and place robot 159--60
spherical 170--3
polar curves 163--6
equation ofcircle, centre on the origin 163, 164
equation ofline 163, 164
and radiationpatterns fromantennas 164--5
polar form 333--6,338, 345--6
poles 706--9
complex 668, 669
dominant 670
offunction 76
Laplace transform668--75
atthe origin 673
positions667
real 668, 669
stablecomplex 671
system 668
polynomial:
coefficients 20
constant 71
cubic 71
ofdegree 2 300
ofdegreen--d 43--4
degree of71
equationsseeunder algebraic techniques
expression 70
functions70--4
non-ideal voltage source 72--3
Ohm’s law 71--2
wind power turbines 73
linear 71
quadratic 71
quartic 71
Taylorsee Taylor polynomials
position controlsystem (servo-system)108, 665--8
position vector423--4
positiveintegers 55
positivenumber 325
postmultiplying263
potential function875--7
potential theory833
potentiometer 604--5
power:
dissipationin resistor3--4,511--12, 824
function to model 59
gain 87
series218--19, 337--8
supply to a computer 906
power density6
power seriessolution, ofBessel’sequation 587--8,
590--1
Poyntingvector 343--4
premultiplying 263, 272
primaryroute (permutations)948--9
probability 903--32
acceptability ofmanufactured electronic chips925--6
ofacceptable components 928--9
addition law909--13,954
alphabetic character stream,information content
of916
binarydata stream,information content of916
998 Index
probability (Continued)
breakdowns onfactory processline 971--2
character data stream,redundancy of918
communication theory915--19
complementary events913--15
component, calculating 912
compound event 905--8
conditional 919--25
cumulative normal 967--8
electrical component reliability911--12
electrical supply to a large factory,availability of973
emergency calls to service engineer 958
entropy ofbinary data stream 917
entropy ofsignal consistingofthree characters 917
events 904
offaultycomponents 926--8
independent events925--30
machine breakdowns in workshop 958
mutually exclusive events909--13
power supply to acomputer 906
production line product fedbytwo machines 920--2
quality control onfactory production line 907--8
redundancy ofbinary data stream 918
reliabilityofmanufactured components 923--4
testing multiple electronic components 914--15
trial904
unconditional 920, 923
workforce absentees 959--60
seealso statisticsand probability
product rule543
differentialequation 548, 551
differentiation 386--8,397--8, 412
partialdifferential equations 827--8,833
productionengineering 903
projection(cylindrical polar coordinates) 166
properfraction 27
proportional/integral/derivative (p.i.d.) controller 694
Pythagoras’s theorem106, 141, 161, 233, 235--6, 333
quadratic approximation to diode characteristic
515--16
quadratic equations 20--4, 103, 325--6
quadratic expression 37
quadratic factors 39,42--3
quadratic polynomial 71
quality control 903, 907--8
quantity, absolute88
quartic polynomial 71
quotientrule 388--9,401, 414
r-component column vector 612
radar:
and correlation817--20
defined 8
scattering7--8
radar cross-section (RCS)8
radians116, 117
radiation patterns:
ofhalf-wave dipole 172--3
polar curvesand 164--5
radio antenna, power densityofsignal transmitted by6
radio-frequency (RF)engineering 579
decibelsin 88--9
radiusofconvergence 218, 698
ramp function108
random variables 934, 962, 968
continuous 944--5, 947, 965--6
discreteseediscrete random variable
range, sets andfunctions180
rate ofchange 357--8, 362--4
atgeneral point 364--9
negative 365
positive365
atspecific point 362--4
rational functions75--9
rational numbers55, 176, 345
Rayleigh’s theorem 794
RC charging circuit535--6
real axis(x axis) 332
real line 55--6
real numbers55,176, 181
real parts 578
complex numbers326, 334, 348
negative 669, 670
real roots24
real system670
rectangle function979--80
rectangular region 880--3
rectangular wave 743
rectifier:
defined 106
full-wave106--7
half-wave 107
recurrence relations203, 220
recursive difference equations 684
recursive filter689
recursive sequence 203
reduction formula460--1
redundancy 917--19
Reed-Solomon codes291
reference levels 88,90--7
Bode plotoflinearcircuit 96--7
exponential and logarithmfunctions, connection between
91--2
log-linearand log-loggraph paper 94--6
log-logandlog-linear scales 92--4
logarithmfunctions90--1
reference node 310
reference phasor 340
relation (setsand functions)180--1
reliability engineering 903, 970--7
parallelsystem 975--7
seriessystem 973--5
remainder ofordern522
Index 999
remainder term,Taylor’s formulaand 521--4
repeated linear factors 39,41--2
resistors577
load 476--7
in parallel 32
phasors 341
power dissipationin 3--4,511--12
functionto model 59
resolvingforce intotwo perpendicular directions
227--8
resonant frequencies 343, 411, 594
resultantoftwo forcesacting onabody 227
reverberation 810--12
reverse saturated diode 83
right-hand limits360
right-handed screwrule 896
risetime 411--13
RL circuitwith ramp input 651--2
RL circuitwith stepinput 544--5
RLC circuit 109
parallel 573--5
robot coordinate frames268
robot positions237
rootmean square(r.m.s.)340, 471
value offunction 475--9
value ofperiodic waveform 477--8
roots20,219--21, 563--4,568, 668
complex 24,564, 567
formula 21
real 24
routing, ofautomated vehicle 226--7
row258--9, 261--2
operations 287--90, 292--3
vector 233
Runge-Kutta method offourth-order623--5
s domain661, 665
s plane 668--75
mapping ofto the zplane 706--9
saddle points825, 842--5
sample space 904
sampled signal 690
sampling 202, 688
saw-tooth waveform 66--7, 472--3,733--4, 742--3
scalar 224
field 240, 851--5, 856, 875, 889
integrals 493
multiple 230--1,260--1
product 241--5,870--1, 891
secant (sech)120, 434
hyperbolic 100
second-derivative 416--17
difference equations 691
differential equations 536
Laplace transform635
numerical integration 499
test 410, 412
second law ofindices 4
second-order difference equations 719
second-order differential equations 538--9,604, 610
second-order electrical systems,risetime for411--13
second-order equations 573, 606--8,684, 685, 696
second-order linear ordinary differential equation 558--84
complementary function561--9
constant coefficient equations 560--84
integrals 569--75
property 1 558
property 2 558--60
transmissionlines576--8
second-order system 610
second-order Taylor polynomials 513--17
second shift theorem 633--4,665, 712--14, 716, 764--5
secondary route (permutations)948--9
semiconductors192--3
diode:
dynamic resistance of378--9
ideal, operating point for379--80
Hall effect in 251--2
n-type83, 251
p-type83, 251
separable equations 541
separation ofvariables 541--5,549
sequences 200--9,683, 688--9,976
arithmetic progressions204
binomial theorem 214--18
bounded sequence 205
converges sequence 205
divergent sequence 205
Fibonacci sequence 203
finite sequence 801, 809, 813
geometric progressions204--5
input sequence 685, 689, 691, 697
from iterative solution ofnon-linear equations 219--22
Kronecker delta sequence 202, 203
limitsequence 205
limit5sequence 206
oscillate sequence 205
outputsequence 685, 689, 691, 697
partialsums sequence 212
recursivesequence 203
unbounded sequence 205
unitramp sequence 699--700
unitstepsequence 202, 699
seealso series
series200, 209--14
binomial theorem 214--18
converged 211
diode-resistorcircuit 421--3
finite 209
arithmetic, sum of209--10
geometric, sum of211
inductance series576
infinite 209, 211--12
geometric, sum of212--13
1000 Index
series(Continued)
power 218--19
sequence ofpartial sums212
system series973--5
seealso sequences
servo-systems665
settheory 55,175--83
absorption laws 179, 185, 186
associative laws 179, 186
commutative laws 179, 186, 191
complement 178--9, 186, 189--90
De Morgan’slaws 179, 186, 191
disjointsets178
distributivelaws 179, 186, 189, 191
empty set178
empty sets910
equal sets177
identitylaws 179, 186, 189--90, 191
intersection 178
logic183--5
minimization laws 179, 185, 186
setsand functions179--81
subsets178
union178
universalset 177
Venn diagrams 177, 178--9
seven-segment displays18--19
shrinking intervalmethod 363, 364, 372
SIunits and prefixes982
sidebands601
siftingproperty, ofdelta function490, 787--8
sigma notation 209
signals133, 688
damped sinusoidal387--8
input 661, 669, 697
output 661, 669
processing200, 201
digital 200, 692
usingamicroprocessor 684--5
ratios, anddecibels 87--8
sampling 690, 702--4
sequence 697
sinusoidal751--2
sinusoidalcurrent 136--7
sinusoidalmodulating 599
sinusoidalvoltage 134--5
ofsystem661
telescope drive 108
simple iteration220
Simpson’srule500--5
simultaneous equations:
algebraic 658
directmethods 312
and iterativetechniques 312--19
Gauss-Seidel method 314--17
Jacobi’s method 313--14, 315--16
and matrixalgebra 283--6
sincfunction 123--5,744, 760
normalized 123, 124--5
zerocrossingpoints124--5
sine(sin)115
complex numbers333--4,338--9
DeMoivre’s theorem 344--50
definiteand indefinite integrals 444--5
differentialequations 534, 537, 539
differentiation387--9,391, 394, 412, 420
elementary integration 432--3,434, 438--9
first-orderlineardifferential equations 547--8
Fourier transform763, 766
functions120--3
hyperbolic 100--1
improper integrals 486, 488
integration, elementary 429--33
integrationby parts 458
integrationby substitution464
inverse function 122
Jacobi-Anger identities 598
Maclaurin series525--6
modellingwaves using131--44
orthogonalfunctions 481--3
partialdifferential equations 826, 827
polar coordinates 160, 171
polar curves164--5
rootmean squarevalue offunction475--6
second-order linear differentialequations 541, 559--60,
564, 566--7,571--3
series,half-range 746
Taylor polynomials 522--3
Taylor series526--7
trigonometricequations 144, 145, 148, 149
trigonometricidentities 125--30
trigonometricratios 116--20
vectors 246--7,857
waves723, 734, 757
single loop industrial controlsystem 693
single trial953, 956
singular matrix 276, 282
sinusoidalfunctions121
sinusoidalmodulating signal 599
sinusoidalsignals 751--2
current136--7
damped 387--8
voltage 134--5
sinusoidalsteady-state frequency 636--7
sinusoidalterm670--1
sinusoidalwaveform 340
sinusoids723
linearcombination 723
skew symmetric matrix272--3
skin depth, in radial conductor 9--10
slope (differentiation) 357
slotted line apparatus 582
small-angle approximations 219
smoothing the input signal 696
Index 1001
solenoidal vectorfield 857
solution:
analytical 534, 618--19,621--2
differential equations 537--9
exact 616, 617--18, 620--1,624
generalsee general solution
independent 568
non-trivial 294--6,298--300
numerical 220, 534, 617--18,695--8
particular 538, 556, 563--4, 575
specific 683
trivial 294--6, 298
true 615
solve, defined 219
sonar817, 820
specific solution 683
spectra:
continuous 766
discrete 766
Fourier transform766--8
line 766
ofpulsewidthmodulationcontrolledsolarcharger743--4
spherical polar coordinates 170--3
squarematrix 271, 272--3, 275, 300
squarethe input rule58--9, 61,64
squarewave input 753--4
squarewaveform 67
standard deviation 941--3,964
binomial distribution 955
ofacontinuous random variable 947
ofadiscrete random variable 946--7
standard formsandLaplace transform639--40
standard normal 964--6
standing waves on transmissionlines580--2
state equations 609, 613
state matrix 612
state-space equations 615
state-space models603, 609--15,660
forcoupled-tanks system 614--15
forspring-mass-dampersystem610--11
state variables 609--11, 613--14
state vector 254, 612
stationary points842--5
stationary values 407
statisticsand probability
binomial distribution 953--7
combinations 950--2
continuous random variable 944--5
continuous variable 933, 934, 936--8
discrete random variable 943--4
discrete variable 933, 934, 935
distributions933--53
exponential distribution 962--3
linking processcontrol computers 948--9
mean value (arithmetic mean) 938--40
normal distribution 963--9
permutations 948--50, 951--2
Poissondistribution 957--61
probability densityfunctions936--8
random variables 934
reliabilityengineering 970--7
standard deviation 941--3
ofacontinuous random variable 947
ofadiscrete random variable 946--7
uniform distribution 961
seealso probability
steady-stateerror 657
steady-stateresponse 556
step-index optical fibre,attenuation in 89--90
stepinput 411
stepsize615, 616--18
Stokes’theorem896--9
straightline approximation 620
sub-intervals936, 971
subscripts46
subsets178
substitution:
back 287--90
integration by463--6
subsystems,parallel 976
subtraction:
algebraic fraction 30--1
complex numbers329
matrix259--60
vectors 228--30, 337
successin asingle trial953
sum to infinity213
summationnotation 46--50
Kirchhoff’s currentlaw 47--8
Kirchhoff’s voltage law 48--9
summer 604--5,689
summing890
point 661, 662
superposition principle509
suppressedcarrier amplitude modulation 768
surface integrals 889--95, 896, 898
symmetric matrix272
t-w duality principle 768--70, 771
take-off point 661
‘takeplus orminusthe squarerootofthe input’ 60, 64
‘takethe positivesquare rootofthe input’ rule60
tangent (tan)115, 358
complex numbers333--4
differential equations 543
functions120--3
gradient of366--7
hyperbolic 100
integration 430, 434, 464
line approximationseestraightline approximation
polar coordinates 161--2
trigonometric equations 144, 147, 148
trigonometric identities 125--8
trigonometric ratios116--20
1002 Index
Taylor polynomials:
first-orderseefirst-orderTaylor polynomial
fourth-order518, 840
non-linear system511
ofnth order 517--21
second-order 513--17,837--40
third-order518--19, 840
Taylor series524--31
Taylor seriesexpansion 617, 620, 623, 751
truncation 620
Taylor seriesin two variables 840
Taylor’sformula, andremainder term 521--4
telescope drivesignal 108
thirdlawofindices 7, 8
third-orderequation 684, 685
third-orderTaylor polynomials 518--19
thyristorfiringcircuit473--4
time:
constant82, 545, 753
delayseetransport lag
dependency 340
displacement 132, 133, 723
domain 340, 628, 661, 665
interval 689
lineartime-invariant system560--1
seealso discrete-time
toolboxes67
‘top-hat’ function777
torque613
transferfunctions609, 627--8,637, 659--68, 669
Laplace transform674--6
rule1: combining two transferfunctions in series661--2
rule2: eliminating a negative feedback loop 662--8
system 663
transform,discrete 795--801
transients556, 669--71
response 659
transistors192--3
transmissionequation 834
transmissionlines576--8
transportlag 664--5
transverse wave 343
trapezium rule496--500, 502
tree diagram 907, 912, 921, 923--4
triangle law 225--6,228--30,233, 336, 351
triangularwaveform 66
trigonometricfunctions115--53,349
cosinefunction 120--3
degrees 116
equations, trigonometric144--50
integration 431
integration of437--9
inverse 121--3
radians116
ratios, trigonometric116--20
sincfunction123--5
sinefunction120--3
tangent function120--3
seealso trigonometricidentity
trigonometric identity 345, 348
Fourier series724, 732
integration481--3
trivial solution 294--6,298
true solution 615
truthtable 186--7
exclusive OR gate189
full-adder circuit191
AND gate184
NAND gate 185
NOR gate 184
NOTgate 184
ORgate 183
turbines, wind 73
turning points407--9,412, 414
two raypropagation model 140--2
unbounded sequence 205
unconditional probability 920, 923
underdamped response 667
underdamped system 411
uniform distribution 961
union (settheory) 178
unit impulsefunctionsee delta function
unit ramp sequence 699--700
unit step:
function108--10
functions712
input 669
sequence 202, 699
unit vectors 231
universal set177
upper bound 56,499--500, 503--4
upper limits484
ofintegrals 443--5
Vandermonde matrix291--2
variables 58
Boolean185
complex frequency 636
continuous 962, 963
dependentsee dependent variable
discrete 962
dummy451
free288--9
generalized frequency 636
independentsee independent variable
Laplace transform637
output614
randomseerandom variables
separation of541--5, 549
state613--14
variance 941--2,946--7, 958
vectors 1,224--55
addition225--6,336--7
Index 1003
calculus 849--66
curl859--61, 861--4
divergence 856--9, 861--4
electric fluxand Gauss’slaw 857--8
andelectromagnetism 864--6
electrostatic potential 854--5
gradient861--4
partialdifferentiation ofvectors 849--51, 852--3
Poisson’sequation 863
scalar field 861--2
scalar field gradient 851--5
vectorfield 861
Cartesian components 232--40
column 233, 258--9
and complex numbers336--7
differentiation 423--6
displacement 226, 227, 423
electric field strength andelectric displacement 240--1
electrostatic potential 241
equal 225
field 240, 244, 867
line and multipleintegrals 890, 893
free 225
Hall effect in semiconductor 251--2
input 612, 615
integration of493
linearly dependent 238
linearly independent 238
magnetic field due to moving charge 250
magnitude 225
mesh current 254
modulus225
ofndimensions 253--4
negative 225
operator 852
orthogonal 232, 242
output 612
parallel 242
partial differentiation 849--51, 852--3
position 423--4
Poynting 343--4
product 246--53
applications of249--53
determinantsand 248--9,279--80
magnetic fluxdensityand magnetic field strength
249--50
resolvingforce intotwo perpendicular directions
227--8
resultantoftwo forcesacting onabody227
robot positions237
routing ofautomated vehicle 226--7
row 233
scalar multiple230--1
scalar product 241--5
state 254, 612
subtraction 228--30, 337
translation androtation of267--71
unit231
work done by force 244
zero 236
Venn diagram 904, 906, 910, 914
Venn diagrams 177, 178--9
vertical strips881--2,884--5
very-large-scale integration (VLSI)192--3
VHDL design languages 193
virtualearth 436
visual artefacts800--1
voltage 577--8,580--2
across acapacitor 636, 655
across an inductor 368
across capacitor 435
gain 88,90
input 676
lawseeKirchhoff’s voltage law
node 47--8,310--12, 317--19
non-ideal source72--3
output676
reflection coefficient 578--80
saw-tooth waveform 66--7
sinusoidalsignals 134--5
voltage controlled oscillator(VCO) 599
wave:
amplitude of131
backward 578--80
combining 134--8
destructive interfering 142
equation 833
forward578--80
input, square 753--4
modelling, usingsineand cosine131--47
number 138--44
rectangular 743
transverse 343
two-raypropagation model 140--2
waveforms 133
alternating current 134
periodic 477--8,677--8, 723--6,752
saw-tooth 66--7, 472--3
sawtooth 733--4,742--3
sinusoidal340
square67
time dependency of340
triangular66
waveguides 593--5
wavelength 138--44
weighted impulses112, 703--4,979--81
whole numbers176
windpower turbines 73
work done byforce 244
worldcoordinate frame 268
x axis58, 157, 159, 168, 171
sequences and series220
1004 Index
x coordinate 154--5
x-yplane 166, 169, 170, 171, 233, 336, 351
yaxis58,157
matrixalgebra 270--1
ycoordinate 154--5
zaxis157, 167, 169, 171
ztransforms175, 200, 979, 981
continuous signal sampling 702--4
definition 698--702
and difference equations 718--20
and Laplace transform,relationship between 704--9
mapping thesplane to thezplane 706--9
propertiesof709--15
complex translation theorem714
directinversion 717--18
firstshifttheorem 711--12
inversion715--18
linearity710
second shift theorem 712--14
zero frequency component 736
zero initialconditions 660, 663
zero-input response 554--7,573, 575
zero orderequations 684
zero real part 670
zero-state response 554--7,573, 575
zero vector 236
zeros668, 706, 709
Laplace transform668--75
padding signalswith 809--11